<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>The Warehouse | Macro Paper Warehouse</title><link>https://macropaperwarehouse.com/</link><atom:link href="https://macropaperwarehouse.com/index.xml" rel="self" type="application/rss+xml"/><description>The Warehouse</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><item><title>A Learning Model of Financial Instability</title><link>https://macropaperwarehouse.com/papers/a-learning-model-of-financial-instability/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-learning-model-of-financial-instability/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Williams asks whether the recurrent boom-bust dynamics of Minsky&amp;rsquo;s financial instability hypothesis — &amp;ldquo;periods of stability lead to periods of instability&amp;rdquo; — can arise endogenously from a tractable rational-agent model in which investors learn about asset returns. This matters because standard rational-expectations asset-pricing models cannot generate the high, volatile price-dividend ratios, sizeable risk premia, and recurrent crashes seen in data, and because Minsky&amp;rsquo;s narrative has long lacked a clean formal mechanism. The paper&amp;rsquo;s main contribution is theoretical (a new instability/limit-cycle result for adaptive learning), with a secondary quantitative exercise.&lt;/p&gt;
&lt;p&gt;Model setup: A small-open-economy variant of the Lucas (1978) consumption-based asset-pricing model studied under learning by Adam, Marcet and Nicolini (2016). A representative agent with power utility (risk aversion gamma, discount factor beta) can borrow/lend at a fixed risk-free gross return R and holds a unit supply of stock paying an i.i.d.-growth dividend (log dividend growth = d + sigma*W, with centered binomial shocks W in {-1,1}). Adding the risk-free asset creates a portfolio problem and endogenous debt dynamics (the net asset position omega), which the closed-economy literature lacks. Agents wrongly believe log returns are i.i.d. binomial with mean m and standard deviation s, and update (m, s^2) by constant-gain recursive least squares with gain epsilon (the weight on new information). A borrowing/leverage constraint (0 &amp;lt;= v &amp;lt;= vbar on the stock portfolio share) ensures equilibrium exists. The self-confirming equilibrium (SCE) has (m,s)=(mu,sigma), v=1, omega=1, and a constant price-dividend ratio.&lt;/p&gt;
&lt;p&gt;Mechanism: The pricing function is extremely steep near v=1; the derivative at the SCE is delta&amp;rsquo;(1)=delta*(1+delta*), so with a mean P/D near 29 a 1-percentage-point fall in v (to 0.99) implies roughly a 30% drop in P/D (to ~20.3). Tranquil periods lower volatility estimates, raising v and prices; once heavily invested, the economy is fragile. Booms end via two mechanisms: binding leverage constraints (rare in the calibration, driving only one crash in the long simulation) and — the novel and dominant channel — a rapid boom raising perceived variance faster than perceived mean, causing agents to cut v and triggering a crash.&lt;/p&gt;
&lt;p&gt;Main quantitative findings (with magnitudes and scope): Theoretically, the SCE is stable only for gains below a threshold; at epsilon-bar the Jacobian of the averaged system has complex eigenvalues on the unit circle (a Neimark-Sacker / discrete Hopf bifurcation), and above it a stable limit cycle exists (Theorem 1, using Kuznetsov 1998). The threshold is approximately epsilon-bar = 8.9 x 10^-4, far below the calibrated epsilon = 0.0052 (about six times larger), so empirically plausible gains imply instability. Eigenvalues at threshold: 0.512 +/- 0.859i = e^(+/-1.0333i). Calibration uses Shiller (2024) S&amp;amp;P 500 data, 1871-2022 annual: empirical P/D mean 28.97, sd 15.53; log P/D mean 3.25, sd 0.46; 100x log return mean 6.51, sd 16.90; dividend growth 100x(d,sigma)=(1.56, 11.104). Optimizing (beta,gamma,epsilon) the baseline matches log P/D (mean 3.15 vs 3.25, sd 0.46 vs 0.46) and returns (6.44 vs 6.51; sd 16.85 vs 16.90) with beta=0.979, gamma=3.278, epsilon=0.0052, and a low risk-free rate 100xlog R=0.87. Crashes (defined as a 30% P/D drop) occur every ~38 years in the baseline vs ~25 years in data; matching the data frequency would need a larger gain near 0.025. The closed-economy and rational-expectations versions essentially cannot produce such crashes. Drawbacks: consumption growth is too volatile (sd ~16.79 vs 1.27 in data) and return predictability is far stronger than in the data.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What exactly drives the instability, and how is it established rather than merely simulated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Instability comes from the feedback between beliefs (m, s) and the net asset/debt position omega: beliefs set the portfolio share, which sets prices and returns, which feed back into beliefs. Williams formalizes this by stacking current beliefs, lagged beliefs, and the state omega into a 5-dimensional first-order system X_{t+1}=G(X_t, chi_t), then studies the deterministic averaged system Xbar_{t+1}=Gbar(Xbar_t) (averaging only over the i.i.d. dividend shocks chi, NOT over omega as the small-gain limit does). Linearizing at the SCE fixed point, Theorem 1 shows all Jacobian eigenvalues lie inside the unit circle for gains below a threshold epsilon-bar, a complex pair hits the unit circle at epsilon-bar (Neimark-Sacker bifurcation), and a unique stable closed invariant curve (limit cycle) appears for epsilon just above. He verifies the nondegeneracy and stability conditions numerically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does small-gain analysis mislead here, and what is the methodological contribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Standard learning convergence results take the gain to zero, treating state dynamics as &amp;lsquo;fast&amp;rsquo; relative to beliefs and averaging over the state. Williams shows this is valid only for extremely small gains in his model because the radius of stability is tiny (epsilon-bar ~ 8.9e-4). Averaging over omega destroys the very belief-state feedback that drives cycles. His contribution to the learning literature is applying discrete-time bifurcation theory (Kuznetsov 1998) to show a Neimark-Sacker bifurcation and stable limit cycle in an economic learning model — which he states is novel — relating it to prior cautions by Cho (2018), Chien-Cho-Ravikumar (2020), and instability examples in Evans-Honkapohja (2009) and Honkapohja-McClung (2023).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two crash mechanisms and which dominates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Binding leverage constraint: if v hits vbar during a boom, inflows stop, generating a negative return surprise that lowers the mean estimate and cuts v. This is rare in the calibration — it drives only the final crash in the long simulation. (2) Endogenous volatility: a rapid boom raises both the estimated mean and variance of returns; when the variance effect dominates, agents cut the risky share even without hitting the constraint. Because the economy is in the steeply sloped pricing region, a tiny cut produces a large crash. This is the dominant, novel mechanism and causes all other crashes, including those in the highlighted closeup. In one example the portfolio share peaks just above one (period 441), and a move from v=1.004 to 1.000 produces about a 48% P/D drop; the cascade bottoms near v=0.47 and P/D around 2, a decline of over 95% from peak.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the representative boom-bust cycle look like quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a &amp;gt;1,000-period simulation, P/D rises 30-50% within a span of years then crashes by a similar or larger amount. In the detailed cycle the P/D rises from 30 to 50 over a few periods before crashing to around 2. After a crash, volatility estimates start high and decline monotonically over roughly 50 periods; agents slowly raise v, prices rise (amplified by the omega multiplier as accumulated bonds are sold), until a rapid boom enters the fragile region and crashes again. Severe crashes of similar magnitude recur at periods 327, 442, 801, and 1067.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of stochastic shocks versus endogenous dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Conditional impulse responses (at periods 432, 438, 440 into a boom) show shocks matter most early: at t=432 a positive shock reinforces the boom while a negative shock dampens fluctuations with little belief change. By t=438 positive/negative impulses are qualitatively similar but differ in magnitude. By t=440 the endogenous dynamics dominate and shock differences are minimal — the boom continues only a couple periods before a severe crash. Shocks govern timing and magnitude, but endogenous belief changes ultimately drive the cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the open-economy assumption matter, and what is the closed-economy comparison?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline is a small open economy: international trade in bonds (fixed R) but only domestic equity trade, which permits nonzero net debt and asset flows. This debt/portfolio-adjustment channel is essential. In the closed economy (R adjusts each period to clear bonds at zero net supply, v=1), with baseline parameters the fit is much worse: P/D too high (3.70), returns lower (4.08), and far less volatile (sd P/D 0.15). Re-optimizing the closed model improves means but misses volatilities (overshoots return sd at 17.74, undershoots P/D sd at 0.36) and requires very different parameters (beta=0.903, gamma=4.736, epsilon=0.0272); crashes occur only every ~469 years (extremely rare). Intermediate cases with partial interest-rate adjustment keep the closed-economy qualitative features. The empirical justification: foreign investors held 33% of US Treasuries, 27% of corporate debt, but only 17% of US equities in 2023 (vs 46% Treasuries and 9% equities in 2006).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the speed of learning (gain) trade off against fit?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;As the gain falls toward zero, the P/D ratio converges to its SCE value log(P/D)~3.6 and its distribution concentrates there (lower volatility); higher gains raise volatility and crash frequency but lower the mean P/D because more time is spent recovering from crashes (booms are short-lived, crashes slow to recover — an asymmetry). The calibration balances mean and volatility of P/D at epsilon=0.0052, but matching the observed crash frequency would need a larger gain near 0.025. The model can match price level/volatility OR crash frequency but struggles to match the speed of market dynamics simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main empirical drawbacks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Consumption growth is far too volatile (model sd ~16.79 vs data 1.27), inherited from using volatile empirical dividend growth as the driving process; treating stocks as levered equity claims (Abel 1999) could break the consumption-dividend link. (2) Return predictability — both autocorrelation and long-term reversal — is much stronger than in the data, where it is weak at best; additional shocks or heterogeneity would dampen it. (3) The subjective excess return is essentially uncorrelated with the P/D ratio, whereas survey expected returns are positively correlated with P/D (Greenwood-Shleifer 2014; Adam-Marcet-Beutel 2017; Barberis et al. 2018); allowing different gains for the mean and variance moves the model closer to survey evidence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus Branch and Evans (2011), who also have agents learning about risk and return: their booms/crashes are rare &amp;rsquo;escape&amp;rsquo; events from equilibrium, whereas in Williams&amp;rsquo;s model they are typical outcomes driven by a fundamental instability (a stable limit cycle), not rare escapes. Versus Adam, Marcet and Nicolini (2016): Williams adds a fixed-rate risk-free asset, creating a portfolio problem and debt dynamics (omega) that are crucial for the boom-bust cycles. Versus behavioral/extrapolation and diagnostic-expectations models (Barberis et al. 2018; Bordalo-Gennaioli-Shleifer 2018; Bianchi-Ilut-Saijo 2024), Williams uses standard adaptive learning, and crucially crashes collapse valuations far below fundamentals (not mere reversion to fundamentals), with stability breeding instability as in Minsky.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A full policy analysis is outside the paper&amp;rsquo;s scope, but Williams notes a higher interest rate lowers excess stock returns and makes boom-bust cycles less frequent — yet potentially more severe (when a boom does occur, larger price/return spikes). This implies policymakers face tradeoffs more complex than simply &amp;rsquo;leaning against the wind&amp;rsquo; of bubbles. The scope conditions: the model has exogenous output growth, a representative agent, a constant risk-free rate, and a constant rational-expectations P/D, so all fluctuations are attributed to learning; relaxing these (e.g., for finance-real interactions) is left for future work.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>A Theory of Price Caps on Non-Renewable Resources</title><link>https://macropaperwarehouse.com/papers/a-theory-of-price-caps-on-non-renewable-resources/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-theory-of-price-caps-on-non-renewable-resources/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks what the optimal response of an exhaustible-resource producer is to sanctions in the form of a price cap, and how a sanctioning coalition should set the cap. The motivation is the $60-per-barrel cap on seaborne Russian crude imposed by the G7, EU and Australia in December 2022 (with $100/barrel for high-value and $45/barrel for low-value refined products), whose stated aim was to cut Russian revenue without triggering a global supply shock. The authors (two of whom were involved in designing the policy) argue that static models, frictionless Hotelling models, and truncated-supply-curve intuitions are all inadequate, and build a dynamic structural model.&lt;/p&gt;
&lt;p&gt;Model setup: A petrostate extracts an exhaustible resource (reserves normalized to 1) whose price follows a Cox-Ingersoll-Ross (Feller square-root) process, estimated on monthly real oil prices 1973-2024 (deflated WTI), yielding long-run mean p̃=$76 (2024 prices), volatility ς=2.43, and mean reversion D=0.21 annually, implying a price half-life of ln2/D = 3.6 years and a right-skewed Gamma limiting distribution. Preferences are CRRA with γ=2 (baseline); marginal extraction cost M=$19/barrel (Osintseva 2021); real discount rate 3%; and non-oil income τ=2, implying commodity sales fund between 1/3 and 1/2 of state income. A two-period model first shows that sufficiently severe financial frictions (low saving returns, high borrowing rates, fixed participation costs Φ) make the producer endogenously live hand-to-mouth (Propositions 1-2), consuming oil proceeds directly; the infinite-horizon model takes this as given.&lt;/p&gt;
&lt;p&gt;Main findings: (1) Even without physical adjustment costs, optimal supply is highly inelastic — supply falls sharply below $40/barrel and reaches zero just below $30 — matching Russia&amp;rsquo;s observed price-insensitivity. A novel decomposition attributes the shape to four forces: time-the-market, revenue-smoothing, precautionary, and non-homotheticity effects, with their balance governed by γ. (2) A perfect (universal, credible, permanent) price cap shifts the supply curve OUTWARD — the producer extracts MORE — because the cap removes price upside, making reserves less valuable (non-homotheticity) and, under market power, eliminating the point of restricting supply (a binding cap means cutting volume no longer raises price). (3) Consequently a binding perfect cap can LOWER and stabilize world prices, and the stabilizing benefit is LARGER the greater the producer&amp;rsquo;s market power (demand elasticity calibrated to 1/ϵ=0.25; short-run literature range [0.07,0.14]). (4) An imperfect (leaky and/or temporary) cap produces highly state-dependent behavior: when the market is already tight (reference price high, above ~$150/barrel in the calibration), the producer optimally &amp;lsquo;shuts in,&amp;rsquo; cutting output toward the shadow-fleet capacity κ and selling only outside the cap — DESTABILIZING the market exactly when prices are high. With κ=0.01 (about one-third of normal extraction), a leaky cap reduces the welfare damage to the producer by about two-thirds relative to a perfect cap, even though contemporaneous profits fall up to 50% when shutting in. (5) The authors introduce a &amp;lsquo;sanctions possibility frontier&amp;rsquo; trading producer harm v(p̄) against the excess probability of a price shock ϕ(p̄) (P(price&amp;gt;$120), ~12% historically). The optimal cap is HIGHER (less aggressive) the greater the leakage; preferences (weight λ) matter mainly at intermediate leakage. Policy corollary: effective enforcement is a precondition for setting a low cap.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core conceptual contribution about how a price cap operates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper argues a price cap is not a truncation of the existing supply curve but a fundamental change to the stochastic environment the producer faces. By capping prices at min{p,p̄}, it eliminates the upside of high prices, lowers the value of reserves, and reduces uncertainty. Because the environment changes, the policy rules must be recomputed rather than read off the pre-policy supply curve adjusted with a vertical segment above p̄.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does a perfect price cap make the producer extract MORE, counter to policymaker intuition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms. First, the non-homotheticity effect: with outside income τ&amp;gt;0, less valuable reserves are depleted faster, so capping the price (which lowers reserve value) raises the extraction rate. Second, for a producer with market power, a binding cap removes the incentive to restrict supply — curbing volume no longer raises the (capped) price, rendering market power ineffective. The supply curve under a binding cap closely follows the no-volatility supply curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the four forces in the supply-curve decomposition and what governs them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Time-the-market: sell more when prices are high. (2) Revenue-smoothing: with γ&amp;gt;1 the income effect dominates, so the producer extracts more when prices are low/expected to rise to smooth revenue. (3) Precautionary: price volatility induces conservation (extract less today); found quantitatively small. (4) Non-homotheticity: a permanently less valuable resource (low or capped price) is extracted faster, like greater impatience. Their balance is governed by preferences, specifically γ (inverse IES). Higher γ strengthens revenue-smoothing and weakens time-the-market; as γ→0 the model collapses to the frictionless Hotelling benchmark with infinitely elastic supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical evidence presented, and what is the identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section 2.5 tests whether financially constrained producers have more inelastic supply. Using 53 OPEC supply-news announcements 1984-2017 (from Känzig 2021) as price shocks, the authors examine production changes in 70 non-OPEC countries in the month after versus before each announcement. The dependent variable is the change in log production, sign-flipped so that producing more when prices fall (or less when prices rise) counts negatively. Regressing on the share of years a country had above-median debt-to-GDP yields a negative coefficient of -0.026 (std err 0.010), consistent with financially constrained countries having more inelastic supply. A country-risk-premium measure (Damodaran 2022) gives a similar but noisier result. Identification rests on OPEC announcements being exogenous price-news shocks to non-OPEC producers; threats include the announcements not being clean exogenous shocks and the debt-to-GDP dummy proxying other country characteristics — the paper treats this as motivating, not causal-structural, evidence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model incorporate market power and how is it endogenous?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;World demand is isoelastic: pw=δ(r+y)^(-ϵ), where r is stochastic rest-of-world residual supply, y is producer output, and 1/ϵ is demand elasticity. The effective elasticity εD=ϵ·y/(r+y) depends on the producer&amp;rsquo;s market share, so market power evolves endogenously with past extraction (Cournot intuition). Market power makes the producer more conservationist in normal times, exerting upward price pressure. 1/ϵ is set to 0.25; the process for r is estimated by simulated method of moments so the laissez-faire equilibrium price matches the estimated oil-price process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the &amp;rsquo;leaky&amp;rsquo; cap modeled and what is the shut-in strategy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A shadow-fleet parameter κ∈[0,1] is the fraction of reserves exportable outside the cap per unit time (κ=0 is a perfect cap). With market power plus leakage, when the market is tight and prices are high, the producer optimally cuts output toward κ, selling only outside the regime at elevated prices (&amp;lsquo;shut-in&amp;rsquo;). In the calibration with κ=0.01 (about a third of normal extraction), shut-in to κ is optimal when prices exceed ~$150/barrel; between $60 and $120 the cap still expands supply. So the cap stabilizes near the $76 long-run average but destabilizes when prices are already high.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare and profit impact of a leaky cap?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Shutting in is not driven by higher contemporaneous profits — those fall by up to 50% relative to a perfect cap unless prices already exceed ~$150 — but by a more spread-out production profile that raises intertemporal welfare. Producer welfare rises with κ. Quantitatively, a leaky cap with κ=0.01 reduces the welfare damage inflicted on the producer by about two-thirds relative to a perfect cap, showing leakage sharply blunts the sanction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is cap non-credibility (temporariness) modeled?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cap removal is a Poisson event with intensity λ, so duration is exponentially distributed. With a perceived 50% probability of removal within the first year, λ=0.69. Expecting the cap to be temporary makes the producer more inclined to shut in and keep barrels underground for extraction after removal, reinforcing the shadow-fleet mechanism and further weakening the cap&amp;rsquo;s stabilization effect; intertemporal welfare effects are significantly diminished.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the sanctions possibility frontier and how is the optimal cap chosen?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The policymaker minimizes v(p̄)+λ·ϕ(p̄), where v is proportional producer welfare loss from the value function and ϕ is the excess probability of an oil shock (P(pw&amp;gt;$120), baseline ~12% matching history). For each leakage level κ, the sanctions possibility frontier maps achievable (v,ϕ) combinations across cap levels. With a perfect cap the frontier is upward-sloping (no trade-off) and the optimum is the lowest cap above marginal cost. With leakage it becomes downward-sloping, creating a trade-off, and the frontier steepens as κ rises. Example: at κ=1/6, a cautious policymaker (λ=2) picks $55/barrel while an aggressive one (λ=1) picks $20; as leakage grows both converge to about $100. The optimal cap rises with leakage; preferences matter mainly at intermediate leakage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contrasts with the frictionless Hotelling (1931) model (perfectly elastic supply) and with Anderson, Kellogg &amp;amp; Salant (2018), who derive inelasticity from geological well-pressure constraints — here inelasticity comes instead from financial frictions and market power. It differs from Stiglitz (1976), who found market power irrelevant to extraction quantity, because of positive marginal costs, financial frictions, and non-oil income. It complements empirical work (Babina et al. 2023 on market fragmentation and discounts), Salant (2023) on pre-announcement, Sappington &amp;amp; Turner (2023, static Cournot), Wachtmeister et al. (2023, quantitative), and Cardoso et al. (2024, endogenous shadow fleet). No separate drilling decision is modeled, for parsimony.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are robust to: (a) excluding US/UK from the cross-country regression or using a 6-month horizon; (b) using the Damodaran country-risk-premium measure; (c) an alternative increasing, L-shaped marginal-cost curve with a 3% capacity constraint (Rystad/Wachtmeister data, M(y)=1.5+sqrt(0.25/(0.03-y))) — all conclusions hold, except predicted extraction is capped at the 3% capacity limit; and (d) HARA utility (nesting CRRA and CARA), available on request. The constant-marginal-cost main specification is chosen because it more clearly exposes the incentive to increase extraction (medium-term view).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the scope conditions and caveats on the policy conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The stabilizing-cap result requires the cap to be &amp;rsquo;not too leaky&amp;rsquo; and credible. The destabilizing shut-in only kicks in at high reference prices (above ~$150 in calibration). The financial-frictions/hand-to-mouth assumption is motivated by sanctioned petrostates specifically (frozen reserves — $300bn of Russian central-bank reserves frozen — sanctioned banks, war financing); it may apply less to unconstrained producers. The model is partial equilibrium (no general-equilibrium world economy, no strategic multi-state interaction, no endogenous shadow-fleet investment in the main analysis), and abstracts from storage and from a separate drilling margin. The policymaker objective is assumed linear in (v,ϕ).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Price cap (as a tool of statecraft)&lt;/strong&gt;: In this paper, a sanction that lets the producer sell only at or below a ceiling p̄ when using coalition-controlled services, so the price received is pr=min{p,p̄}. Crucially it is interpreted not as a truncation of the supply curve but as a fundamental change to the stochastic environment, eliminating price upside and reducing reserve value and uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous hand-to-mouth behavior&lt;/strong&gt;: The result (Propositions 1-2) that sufficiently severe financial frictions — low saving returns, high borrowing costs, and/or fixed participation costs Φ — make the producer optimally consume oil proceeds period-by-period without using financial markets, regardless of its preferences. This is taken as the operating assumption for the dynamic model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-homotheticity effect&lt;/strong&gt;: With outside (non-oil) income τ&amp;gt;0, a permanently less valuable resource — whether from a low permanent price or a binding cap — is extracted faster, because reserve depletion is a less threatening prospect. It makes the producer behave as if more impatient and is a key driver of the outward supply shift under a cap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shut-in strategy&lt;/strong&gt;: Under a leaky cap with market power, the producer sharply cuts extraction toward the shadow-fleet capacity κ when prices are already high, selling only outside the cap at elevated prices. It lowers contemporaneous profits (up to 50%) but raises intertemporal welfare via a more spread-out production profile; it destabilizes the market precisely when it is tight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shadow fleet / leakage (κ)&lt;/strong&gt;: The fraction of reserves the producer can export outside the cap regime per unit time (κ∈[0,1]); κ=0 is a perfect cap. For Russia it represents non-coalition tanker/insurance capacity; the paper notes the share of Russian oil outside the cap rose from about 20% (April 2022) to 67% (August 2024).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sanctions possibility frontier&lt;/strong&gt;: A novel menu, for each leakage level κ, of the achievable combinations of damage inflicted on the producer (v) and the probability of an oil-market shock (ϕ) across cap levels. Upward-sloping under a perfect cap (no trade-off; pick lowest cap), it becomes downward-sloping and steeper under leakage, making the optimal cap preference-dependent and increasing in leakage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reference price&lt;/strong&gt;: The hypothetical equilibrium price that would prevail if the producer did not exercise market power — a monotone transformation of the state variable rt. It measures market tightness cleaned of the sanctioned producer&amp;rsquo;s endogenous decisions, and the cap&amp;rsquo;s price-lowering effect is larger when the reference price is high.&lt;/p&gt;</description></item><item><title>A Tractable Income Process for Business Cycle Analysis</title><link>https://macropaperwarehouse.com/papers/a-tractable-income-process-for-business-cycle-analysis/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-tractable-income-process-for-business-cycle-analysis/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Guvenen, McKay, and Ryan estimate a stochastic income process for US male workers that simultaneously matches five empirical regularities from Social Security Administration administrative panel data covering 1978–2011: (i) flat and acyclical variance of income growth rates, (ii) volatile and procyclical Kelley skewness, (iii) very high kurtosis — targeted at 20 for one-year changes and 12 for five-year changes — (iv) a near-linear rise in cross-sectional log-income variance from age 25 to 55, and (v) a systematic factor structure in business cycle incidence whereby income losses during recessions are predictably related to a worker&amp;rsquo;s pre-recession income rank. All five facts are drawn from Guvenen et al. (2014) and Guvenen et al. (2021), which document them from SSA records on individual income histories.\n\nThe income process adds three key departures to the workhorse persistent-plus-transitory Gaussian specification. First, transitory &amp;ldquo;nonemployment&amp;rdquo; shocks — arriving annually with approximately 45% probability and drawn from an exponential distribution — create fat tails through their arrival (large income losses) and departure (large income gains), and leave a persistent &amp;ldquo;scarring&amp;rdquo; residue through a passthrough parameter ψ estimated at 9.4% in the baseline nonemployment model. Each year, roughly 8.6% of workers experience income declines of 50% or more from the nonemployment shock alone, and 1.8% fall to effectively zero income. The scarring mechanism makes the left tail of the income growth density fatter than the right tail, consistent with the data (left-tail log-density slope 1.4, right-tail slope –2.2). Second, innovations to the persistent AR(1) component are drawn from a time-varying three-component normal mixture — with the dominant central component realized with about 83% probability and near-zero standard deviation (~1%), flanked by left-tail and right-tail components with probabilities of ~10.9% and ~6.2% and standard deviations of ~16.4% and ~19.2% — whose means shift with contemporaneous aggregate wage income growth (xt = β·Δwt). This mean-shifting mechanism generates procyclical skewness under an acyclical variance, because it redistributes probability mass between the tails without altering mixture probabilities or component variances. Third, a piecewise-linear factor structure makes each individual&amp;rsquo;s income sensitivity to aggregate fluctuations depend on the persistent component of income (γi + zi,t), with a kink separating two slope regimes. In the Great Recession, workers at the 10th percentile of pre-recession income lost approximately 18 percentage points more than workers at the 90th percentile; both the bottom and top deciles were more exposed than the middle of the distribution, producing a V-shaped incidence pattern.\n\nEstimation uses simulated method of moments (SMM) with 360,000 simulated individuals per year, a 1947 burn-in start, and optimization via the TikTak global algorithm. Six models of increasing complexity are estimated, each requiring only one individual state variable (the persistent component z) — matching the parsimony of the standard model. The workhorse Gaussian model (Model 1) understates the variance of one-year log income changes by 60–80%; introducing nonemployment shocks (Model 2) largely resolves this, matching one-year variance exactly and narrowing the five-year shortfall to 30%. Adding the time-varying normal mixture (Model 3) generates procyclical skewness and acyclical variance. Adding the factor structure (Model 4) captures differential recession exposure. Models 5 and 6 introduce Heterogeneous Income Profiles (HIP, σκ = 0.015) and estimate AR(1) persistence freely, obtaining ρ ≈ 0.80, which better captures the right tail of the income growth distribution.\n\nThe paper recommends Model 5 as a general-purpose benchmark (without the factor structure), Model 4 when differential business cycle incidence is central, and Model 3 when maximum parsimony is needed. The richer income dynamics documented here have direct implications for quantifying the welfare cost of business cycles, the value of social insurance, the design of automatic stabilizers, the distribution of marginal propensities to consume, and asset pricing under heterogeneous agents.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the estimation procedure and what data does it use?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses simulated method of moments (SMM), targeting approximately 120+ moments derived from Social Security Administration administrative panel data on individual income histories of US male workers over 1978–2011 (from Guvenen et al. 2014 and 2021). The simulation panel contains 360,000 individuals per year, initialized in 1947 with a burn-in period. Optimization uses the TikTak global algorithm (Arnoud et al., 2019). Moments targeted include the 10th, 50th, and 90th percentiles of one-, three-, and five-year income growth averaged across 1979–2011 (nine moments); kurtosis at one-year and five-year horizons (two moments); cross-sectional variance of log income at ages 25, 35, 45, and 55 (four moments); left- and right-tail mass and log-density slopes from the 1995–1996 income growth distribution (four moments); the full time series of Kelley skewness for one-, three-, and five-year changes (93 moments); and piecewise-linear slopes of the factor structure for seven business cycle episodes — four recessions and three expansions covering 1979–2010 (14 moments). Moments are weighted approximately equally, with skewness moments down-weighted collectively.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three key departures from the workhorse Gaussian model and what feature does each address?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, transitory &amp;rsquo;nonemployment&amp;rsquo; shocks drawn from an exponential distribution, arriving with ~45% annual probability, along with a scarring parameter ψ that loads a fraction of the transitory shock onto the persistent state — this generates the high kurtosis, thick tails, and asymmetry (steeper right than left tail) of the income growth distribution. Second, a three-component time-varying normal mixture for persistent innovations — the component means shift with the aggregate wage component xt = β·Δwt — producing procyclical skewness and acyclical variance simultaneously. Third, a piecewise-linear factor structure f(γi + zi,t) mediating each individual&amp;rsquo;s exposure to aggregate fluctuations, capturing the V-shaped relationship between pre-recession income rank and recession income loss.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the scarring mechanism and how large is it empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Transitory nonemployment shocks ζi,t are assigned with probability (1 − pζ) each year and drawn from an exponential distribution with parameter λ, where ℓi,t ∈ [0,1] represents the income fraction lost. A fraction ψ of this transitory shock flows permanently into the persistent state zi,t via ˜ηi,t = ηi,t + ψζi,t. In Model 2, the annual probability of receiving a nonemployment shock is 45% (pζ ≈ 0.55), λ = 3.357 (mean income loss fraction ≈ 0.30), and ψ = 9.4%. Each year, 8.6% of workers experience income declines of 50% or more from the nonemployment shock alone, and 1.8% effectively lose all income (full-year nonemployment). The scarring makes the right tail steeper than the left tail in the income growth distribution, as re-employed workers do not return to their pre-shock income level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the time-varying normal mixture generate procyclical skewness without changing variance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The three normal mixture components for the persistent innovation η are: a central component (probability ~83%, standard deviation ~1%), a left-tail component (~10.9%, ~16.4% sd), and a right-tail component (~6.2%, ~19.2% sd). Their means shift via the latent variable xt = β·Δwt: the central and left-tail means move with xt while the right-tail mean does not. A normalization ensures xt has zero mean-income effect. In recessions (xt &amp;lt; 0, Δwt &amp;lt; 0), the left-tail component&amp;rsquo;s mean shifts down and the right-tail component&amp;rsquo;s mean shifts up relative to the central, generating more left-skewed draws without changing the probabilities or variances of the components — hence acyclical variance and procyclical skewness. Alternative designs (cyclical mixture probabilities or variances) did not generate both patterns simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the factor structure and how non-monotonic is it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In deep recessions the factor structure is broadly monotone decreasing over the bulk of the distribution (lower-income workers lose more), with the 10th percentile losing about 18 percentage points more than the 90th percentile in the Great Recession (2007–2010). However, the pattern reverses for the top 10% of the income distribution: high earners also face large losses in financial-market-driven recessions, producing a V-shape. The piecewise-linear model f(q) with a kink at q-bar and slopes α1 (below) and α2 (above) captures this. The model fits the Great Recession V-shape and the mild 1990–1992 and 2000–2002 recessions (where the pattern is flatter, consistent with smaller drops in wt), but struggles to fit the large top-income losses in 2000–2002 without an additional stock-market-correlated factor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the levels-vs-differences puzzle and how is it resolved?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The canonical persistent-plus-transitory Gaussian model (Model 1) faces a fundamental tension: it can fit the cross-sectional variance of log income levels at each age, but it then understates the variance of one-year and five-year log income changes by 60–80% (squared standard deviations from Figures 8a and 9a). This tension was documented by Heathcote, Perri, and Violante (2010). Introducing the nonemployment shocks in Model 2 largely resolves it: the one-year variance of log income changes is matched exactly, and the five-year understatement narrows to about 30%. The nonemployment shock contributes high-frequency variance in income changes without requiring a comparably large increase in the variance of the persistent state, because it is mostly transitory.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does HIP play and what tensions does it create?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Heterogeneous Income Profiles (HIP, σκ = 0.015 from Baker 1997 and Guvenen et al. 2021) allow AR(1) persistence ρ to be estimated freely rather than restricted to 1. The estimated ρ falls to 0.80 in Models 5 and 6. HIP provides a convex component to the lifecycle variance profile (from dispersion in individual growth-rate slopes κi) that offsets the concave contribution of mean-reverting persistent shocks, maintaining a near-linear age-variance profile at ρ &amp;lt; 1. Lower persistence better fits the right tail of annual income growth and the standard deviation of five-year changes. However, in Model 6 HIP worsens the fit to the factor structure, because mean reversion at ρ &amp;lt; 1 already generates faster income growth for low-income workers in expansions, reducing the work the factor structure needs to do in booms while resisting the factor structure&amp;rsquo;s ability to generate large losses for low-income workers in recessions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and alternative specifications are estimated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper estimates two supplementary models reported in Appendix B. Model 2&amp;rsquo; removes the scarring component (ψ ≡ 0) from Model 2, finding a worse fit particularly in the histogram, kurtosis, and lifecycle inequality moments. Model 3&amp;rsquo; replaces the time-varying mixture with a static normal mixture (β ≡ 0), still improving over Model 2 (objective falls from 2.44 to 2.26) via better tail fit and average skewness, but without capturing the procyclical skewness time series. Model 4&amp;rsquo; removes time variation from the innovation distribution (β ≡ 0) while retaining the factor structure, showing that the factor structure fit survives without time variation in skewness. Additionally, the paper discusses a special parsimony case: under ρ = 1, homothetic preferences, and no factor structure, z can be normalized away entirely, leaving no individual state variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work on non-Gaussian income processes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Kaplan, Moll, and Violante (2018) capture leptokurtic income growth but include no business cycle variation and no factor structure. McKay (2017), McKay and Reis (2021), and Catherine (2021) allow for procyclical skewness in income risk but do not target high kurtosis or a factor structure. Bhandari, Evans, Golosov, and Sargent (2021) allow for a factor structure but do not match higher-moment properties of income risk. Other work documenting the relevant facts includes Guvenen, Ozkan, and Song (2014) for countercyclical skewness in US SSA data; Guvenen, Karahan, Ozkan, and Song (2021) for lifecycle earnings dynamics from the same source; Harmenberg (2021) and Kramarz, Nimier-David, and Delemotte (2021) for related European evidence; and Guvenen, Schulhofer-Wohl, Song, and Yogo (2017) for factor structure evidence labeled &amp;lsquo;worker betas.&amp;rsquo; This paper is the first to jointly target and fit all four properties within a single tractable process that adds only one state variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy and structural implications highlighted by the paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Leptokurtic income risk (high kurtosis, fat tails) has quantitatively important effects on the value of social insurance and optimal redistribution (Saez, 2001; Golosov, Troshkin, and Tsyvinski, 2016) and interacts with borrowing constraints to shape the distribution of wealth and marginal propensities to consume (Kaplan, Moll, and Violante, 2018). Cyclical variation in income risk — the procyclical skewness feature — matters for the welfare cost of business cycles (Storesletten, Telmer, and Yaron, 2001; Krebs, 2003, 2007) and for the optimal design and welfare value of automatic stabilizers (McKay and Reis, 2021; Bhandari et al., 2021). The factor structure is relevant for cyclical variation in income inequality and for asset pricing under household heterogeneity (Mankiw, 1986; Constantinides and Duffie, 1996; Constantinides and Ghosh, 2016). The scope condition throughout is male US workers in the SSA administrative data; no direct results are provided for female workers, self-employed individuals, or other countries, though the modeling framework is general.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What practical guidance does the paper provide for incorporating the process into dynamic models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper provides explicit Bellman equation structure: cash on hand m and the persistent income state z are the two endogenous individual state variables (z being the single income-process state variable), with individual parameters γ and κ treated as fixed effects. Income at each node requires evaluating a closed-form expression from Equation 1. Expectations over next-period z and ζ are handled via quadrature, with the time-varying mixture of normals requiring quadrature nodes that shift with the aggregate state S and S′ — following McKay and Reis (2021). Under the special case ρ = 1, homothetic preferences, and no factor structure, all variables can be normalized by exp(z + γ), eliminating z as a state variable and reducing the problem to one with no idiosyncratic income state. The authors note that a perpetual-youth demographic structure avoids tracking age as a state variable.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Procyclical skewness&lt;/strong&gt;: In the paper&amp;rsquo;s sense: the Kelley skewness of the cross-sectional distribution of one-year and five-year income growth rates falls significantly during every NBER recession (distribution shifts left — more large negative shocks, fewer large positive ones) and rises during expansions, while the standard deviation of that distribution shows no discernible cyclical pattern. This is a feature of the income shock distribution itself, not of average income levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nonemployment shock with scarring&lt;/strong&gt;: A transitory income loss event modeled as an exponential random variable ℓi,t ∈ [0,1] (representing the fraction of income lost) arriving with probability ~45% per year. A fraction ψ of this transitory shock is loaded permanently onto the persistent income state — the &amp;lsquo;scarring&amp;rsquo; effect — so that re-employed workers do not fully return to their pre-shock income trajectory. In the paper&amp;rsquo;s model this single mechanism generates high kurtosis, thick double-Pareto tails, and asymmetric tail slopes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time-varying normal mixture for persistent innovations&lt;/strong&gt;: A three-component mixture of normals for the AR(1) innovation η in which the component means (not probabilities or variances) shift proportionally to contemporaneous aggregate wage income growth via a loading parameter β. A mean-preserving normalization ensures no effect on average income. This mean-shifting mechanism moves probability mass between the central and tail components of the innovation distribution, generating procyclical skewness while keeping income growth variance acyclical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Factor structure in business cycle incidence&lt;/strong&gt;: A systematic, pre-determined relationship between a worker&amp;rsquo;s position in the persistent income distribution and the magnitude of income change experienced during a given recession or expansion. Modeled as a piecewise-linear function f(γi + zi,t) that multiplies the aggregate income component wt, with slopes that differ below and above an estimated kink point. Empirically, the factor structure produces a V-shaped incidence pattern: income losses in deep recessions are largest at both the bottom and top of the pre-recession income distribution, and smallest in the middle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income scarring parameter (ψ)&lt;/strong&gt;: The fraction of a transitory nonemployment shock ζi,t that is permanently loaded onto the persistent income state zi,t via the equation ˜ηi,t = ηi,t + ψζi,t. Estimated at 9.4% in Model 2 and 15.1% in Model 3. Controls the degree to which transitory shocks generate long-lasting income effects and determines the relative steepness of the left versus right tails of the annual income growth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneous Income Profiles (HIP)&lt;/strong&gt;: Individual-specific linear deterministic growth-rate slopes κi distributed with standard deviation σκ = 0.015 (calibrated from Baker 1997 and Guvenen et al. 2021), representing permanent heterogeneity in the steepness of individual income trajectories over the lifecycle. Introducing HIP allows the AR(1) persistence parameter ρ to be estimated below 1 (≈0.80 in Models 5–6) while preserving the near-linear age-variance profile, because the convex variance contribution of heterogeneous slopes offsets the concavity induced by mean-reverting persistent shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kelley skewness&lt;/strong&gt;: In the paper&amp;rsquo;s use: a robust, percentile-based measure of skewness defined as [(P90 − P50) − (P50 − P10)] / (P90 − P10), which the paper prefers for income growth distributions because it is less sensitive to extreme outliers than moment-based skewness. Used as the primary target for capturing business cycle variation in the shape of the income growth distribution.&lt;/p&gt;</description></item><item><title>Adverse Selection and Small Business Finances</title><link>https://macropaperwarehouse.com/papers/adverse-selection-and-small-business-finances/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/adverse-selection-and-small-business-finances/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks why small firms hold large quantities of liquid assets — cash and cash equivalents that earn low or negative real returns — even when external credit is available. The conventional answer is a precautionary motive: liquidity buffers the risk of being shut out of credit markets. Liang proposes a second, complementary motive: a signaling motive, whereby firms hold liquid assets specifically to pledge as collateral and credibly signal their repayment ability to lenders, thereby obtaining better loan terms. The empirical backdrop is striking: about 28% of small business assets are cash and cash equivalents (Kauffman Firm Survey 2011 wave); about 7% of commercial business loans are secured by liquid collateral (SSBF 2003); and 43% of small firms sought a commercial business loan in 2020.&lt;/p&gt;
&lt;p&gt;The theoretical framework embeds directed search (Guerrieri, Shimer, and Wright 2010, hereafter GSW) and asymmetric information inside a Lagos-Wright general equilibrium monetary model. There are two types of entrepreneurs — low types (success probability δ_L) and high types (δ_H &amp;gt; δ_L) — who privately know their own type. Bankers post loan contracts specifying a down payment d, loan amount ℓ, and repayment R, and then entrepreneurs direct their search to contracts. Investment opportunities arrive stochastically. Entrepreneurs who fail to match with a banker self-finance from their liquid holdings; this endogenous outside option gives liquidity value and generates a precautionary demand for it. The opportunity cost of holding liquidity equals the policy rate i (equivalently, the inflation rate π).&lt;/p&gt;
&lt;p&gt;The main equilibrium characterization (Proposition 2) shows that as the policy rate rises, the economy passes through four regimes: (1) no participation in the credit market; (2) only high types borrow, no screening needed; (3) both types borrow, bankers screen using down payment only; (4) both types borrow, bankers screen using both down payment and loan approval rate (market tightness). The key distortion is in the extensive margin: under adverse selection with binding incentive constraints, high-type borrowers must pledge more liquid assets (dH = zH &amp;gt; z*_H) and face a tighter loan market (θ_H &amp;lt; θ*_H) than under complete information, but the loan size is undistorted (ℓ_H = ℓ*_H, Proposition 3). Low-type borrowers&amp;rsquo; allocations are never distorted by adverse selection.&lt;/p&gt;
&lt;p&gt;The interest rate pass-through from the policy rate to the real lending rate on high-type loans can be negative (Proposition, Section 4 and Figure 5). With an urn-ball matching function, γ_H (the real lending rate for high types) falls in i when screening is active, even as the aggregate lending rate rises monotonically. With a Cobb-Douglas matching function, lending rates always increase in i. Whether negative pass-through obtains therefore depends on the matching technology.&lt;/p&gt;
&lt;p&gt;Screening intensity — the degree to which high-type borrowers must hold excess liquidity and accept lower loan approval odds — is non-monotone in the low types&amp;rsquo; success probability δ_L (Proposition 4). When δ_L is very small or very close to δ_H, a small down payment suffices. Distortions are largest for intermediate values of δ_L, where the low types have large incentives to misreport but the cost of mimicry is neither trivially high nor trivially low.&lt;/p&gt;
&lt;p&gt;Without the self-finance channel — the endogenous outside option — both the precautionary and signaling motives vanish entirely, and liquid assets become redundant (Proposition 5). Bankers then use only market tightness to screen, which is less costly than using both down payment and approval rate. This result cleanly isolates why self-finance is the structural ingredient making liquidity essential.&lt;/p&gt;
&lt;p&gt;On policy, the competitive equilibrium is generically constrained inefficient when both screening tools are used, because bankers in one submarket do not internalize the externality they impose on the other submarket through the binding incentive constraint. A utilitarian social planner who faces the same information and search frictions can restore the complete information allocation by taxing high types and subsidizing low types, under a sufficient condition (Proposition 6): the high types&amp;rsquo; surplus from borrowing relative to self-finance exceeds the low types&amp;rsquo; net gain from misreporting, scaled by the population ratio and inverse success probability ratio. This condition is more likely to hold when i is large, when there are few low types (small ν_L), or when the low types&amp;rsquo; net gain from misreporting is small. Conversely (Proposition 7), the competitive equilibrium is constrained efficient — and no transfers are needed — if δ_L/δ_H + ν_H/ν_L &amp;lt; 1, which obtains when the low types are very risky (low δ_L) or very numerous (high ν_L), making subsidization costly.&lt;/p&gt;
&lt;p&gt;Empirically, Liang estimates a dynamic panel model of liquidity-to-assets ratios using the Kauffman Firm Survey (KFS), a longitudinal survey of 4,928 new U.S. firms from 2004-2011 (660 in the balanced panel after cleaning). Using a first-difference transformation with Anderson-Hsiao IV (instrumenting lagged differenced liquidity-to-assets with its second lag and differenced liquid collateral with its own lag), the preferred estimate (column 5) shows that firms holding liquid collateral to obtain loans hold on average 19.83% more liquid assets as a share of total assets before the loan application than do comparable firms that pledge illiquid or no collateral. This is treated as evidence for the signaling motive. The precautionary motive is confirmed: firms reporting credit difficulties hold an additional 9.93% of total assets in liquid form, and a one-percentage-point increase in R&amp;amp;D-to-assets (proxy for growth opportunities) is associated with 0.09% higher liquidity-to-assets. The transaction motive is confirmed: a one-percentage-point increase in total assets is associated with 0.09% lower liquidity-to-assets. The tax and agency motives are not statistically significant for small firms.&lt;/p&gt;
&lt;p&gt;A moral hazard extension (Appendix E) relaxes the assumption that banknotes can only be used to purchase capital. When entrepreneurs can divert loan proceeds to consumption (at cost), a third screening tool is added — loan size — and equilibria are more distorted and more likely to be distorted (Propositions 8-10). The threshold i above which two-tool screening kicks in falls, and loan amounts are reduced below the complete information optimum, which does not occur in the baseline.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s core identification challenge in the empirical section, and how does it address it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main challenge is that the decision to pledge liquid collateral is endogenous to unobserved firm characteristics that also affect liquidity holdings. OLS suffers from omitted variable bias (the lagged liquidity-to-assets ratio is correlated with the error). Fixed effects corrects for firm heterogeneity but introduces Nickell (1981) downward bias in the lagged dependent variable. The first-difference transformation removes fixed effects but creates a mechanical correlation between the differenced lagged liquidity variable and the differenced error. The Anderson-Hsiao IV strategy instruments the differenced lagged liquidity-to-assets with its second lag in levels (column 4) and additionally instruments differenced future liquid collateral with its own lagged difference (column 5), addressing the endogeneity of the collateral-pledging decision. The Cragg-Donald Wald F-statistic is 62.056, exceeding the Stock-Yogo weak instrument threshold of 7.03, supporting instrument relevance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the signaling mechanism in precise terms, and how does it differ from Leland-Pyle (1977)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the model, high-type entrepreneurs hold excess liquid assets (beyond what precaution alone requires) and pledge them as down payments on bank loans. Because the precautionary marginal benefit of holding liquid assets is higher for high types (they have better investment projects and thus more to gain from self-financing), the cost of holding the additional liquidity required by a high-type loan contract is lower for high types than for low types. This makes the down-payment requirement a credible separating device: low types will not mimic high types by holding the required level of liquidity because the cost of doing so outweighs the savings on repayment. The marginal benefit of liquidity thus includes both a precautionary term (gain when unmatched) and a signaling term (relaxes the incentive compatibility constraint on low types). Leland-Pyle (1977) also features signaling through self-finance, but obtains a continuum of signaling equilibria. The present model has a unique separating equilibrium because directed search imposes bilateral matching and a capacity constraint on bankers, eliminating the equilibrium multiplicity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are the four equilibrium regimes generated and what determines which one prevails?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The regime depends on the opportunity cost of holding liquidity i (equivalently, the policy rate) relative to three cutoffs i &amp;lt; i-bar &amp;lt; i-double-bar. At low i, both types prefer self-finance (high net return on liquidity, so the gain from a bank loan is small). As i rises, high types enter the credit market first because they have a larger surplus from obtaining a bank loan; low types follow at a higher cutoff. Once both types are in the market, the incentive compatibility constraint for low types (IC-LH) may or may not bind. When IC-LH is slack, only a small down payment is needed, and the allocation is undistorted (regime 3). When IC-LH binds — at yet higher i because holding large amounts of liquidity becomes even more attractive to misreporting low types as the precautionary value of liquidity falls — bankers must use both down payment and market tightness, distorting the allocation (regime 4). The policy rate thus operates on the outside option, reshaping the credit market structure endogenously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the loan size (intensive margin) undistorted even when the extensive margin (market tightness and down payment) is distorted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Once bankers successfully screen out low types using down payment and market tightness, they have no further incentive to distort the loan amount issued upon matching. The first-order condition for loan size in the high-type contract remains δ_H f&amp;rsquo;(ℓ_H) = 1 (Equation 8), which is the complete information optimum. The logic is that down payment and market tightness are the instruments that affect the incentive compatibility constraint, and once these are set at levels that prevent mimicry, the loan size can be set efficiently to maximize surplus from the match. This is a standard feature of competitive screening equilibria in the GSW framework and contrasts with the moral hazard extension, where the loan size is distorted because diversion of funds is possible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the key externality that makes the competitive equilibrium constrained inefficient, and how does the planner correct it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bankers in the high-type submarket post contracts taking the payoff of low-type entrepreneurs (in the low-type submarket) as given. But the low-type payoff enters their incentive compatibility constraint (IC-LH), which governs how much down payment and rationing they must impose. When the planner raises the low-type payoff (by subsidizing low types), the IC-LH constraint relaxes: the low types are already better off and have less incentive to mimic. This allows bankers to offer high types smaller down payments and more loan supply, increasing high-type welfare. If the benefit to high types (lower screening cost) exceeds the tax cost, a Pareto improvement is possible. The planner implements this through type-contingent transfers: taxing bankers who serve high types, subsidizing bankers who serve low types. The planner can internalize the cross-submarket externality because it controls both submarkets simultaneously, whereas competitive bankers each maximize their own submarket&amp;rsquo;s contracts taking the other as given.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the non-monotonicity of screening intensity in δ_L, and what is the intuition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 4 shows that the equilibrium high-type liquidity holding z_H and market tightness θ_H are non-monotone in δ_L (the low type success probability), with a cutoff δ-bar_L. For low δ_L: either the low types are not in the loan market at all, or they would not want to mimic the high types even if the down payment is small, because the precautionary value of holding so much liquidity outside the loan market is very low for low types with poor prospects. As δ_L rises (low types become moderately good), they want to mimic high types more aggressively (higher repayment savings) while the cost of mimicry remains moderate, so down payment and rationing must both be higher. At very high δ_L (low types nearly as good as high types), the types are similar and a small amount of screening suffices again. Distortions peak at intermediate δ_L where the benefit-cost ratio of misreporting for low types is maximized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the moral hazard extension change the results compared with the baseline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the baseline, banknotes can only purchase capital (observable investment). In the extension (Appendix E), banknotes can also buy consumption goods at unit cost C(χ), introducing dual deviation: a low-type entrepreneur who misreports can both obtain a high-type loan and divert some of the proceeds to consumption. This raises the low types&amp;rsquo; payoff from misreporting (U^mh_LH &amp;gt; U_LH), tightening the incentive constraint. As a result: (i) a third screening tool is deployed — bankers reduce the loan size below the complete information optimum (ℓ^mh_H &amp;lt; ℓ*_H); (ii) the threshold i above which multi-tool screening kicks in is lower (i-double-bar^mh ≤ i-double-bar), so distorted equilibria occur over a larger parameter space; (iii) in the distorted region, allocations are more distorted along all three margins (loan size, liquidity, market tightness). When χ ≤ δ_L/δ_H (the cost of diverting banknotes to consumption is high enough that low types prefer to invest all proceeds), the extension coincides exactly with the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to Guerrieri, Shimer, and Wright (2010) and what does it add?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;GSW show that directed search with adverse selection generates a unique separating equilibrium in which market tightness (loan approval rate) is the dominant screening device, while down payment (liquidity) is not used when the self-finance option is absent. In GSW&amp;rsquo;s setup applied to credit markets, liquid assets are redundant — without an endogenous outside option, there is no precautionary demand and no signaling demand for liquidity (Proposition 5 of this paper). Liang&amp;rsquo;s contribution is to introduce the self-finance channel as an endogenous outside option to the GSW framework. This makes liquidity valuable both outside the credit market (precautionary motive) and inside it (signaling/screening device). The result is that both down payment and market tightness are used as screening instruments in the fully distorted regime, whereas GSW uses only market tightness. This also changes the constrained efficiency analysis: Liang shows that the planner can fully undo adverse selection under certain conditions, a result that does not arise in the vanilla GSW model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness and consistency checks are run in the empirical section?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical section runs OLS (column 1), one-way fixed effects (column 2), first-difference transformation OLS (column 3), Anderson-Hsiao IV with one instrument (column 4), and Anderson-Hsiao IV with two instruments (column 5, the preferred specification). The consistency of the lagged liquidity estimator is checked against the Nickell bounds: Bond (2002) recommends the consistent estimate should lie between the OLS and FE estimates (0.4920 and -0.1833); the preferred IV estimate (0.2766) satisfies this. Instrument strength is verified with the Cragg-Donald Wald F-statistic (62.056 vs. threshold 7.03). The paper acknowledges that the liquid collateral coefficient may be biased in either direction: upward if firms that plan to pledge liquid collateral but fail to obtain loans are misclassified as non-signalers, or downward if ineligible firms (with insufficient liquid assets to pledge) are misclassified as non-signalers. The direction of bias is ambiguous, which limits the paper&amp;rsquo;s ability to bound the true signaling motive magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, the paper recommends cross-subsidization — taxing high-type borrowers and subsidizing low-type borrowers — to restore the complete information allocation when the equilibrium is distorted. This is implementable through type-contingent tax policies on bank loans. The scope condition (Proposition 6) is that the high types&amp;rsquo; net surplus from borrowing must exceed the low types&amp;rsquo; scaled gain from misreporting (Equation 11); this is more likely to hold when i is large (high policy rate), ν_L is small (few low types), or δ_L/δ_H is very small or very close to 1 (extreme types). Second, and more restrictively, if δ_L/δ_H + ν_H/ν_L &amp;lt; 1 (low types are very risky or very numerous), the competitive equilibrium is already constrained efficient and no transfers are needed. Third, on monetary policy: a rise in the policy rate can trigger a transition from an undistorted to a distorted equilibrium, causing welfare to fall. The paper interprets this as a caution against using high policy rates when credit market adverse selection is a concern. The paper also connects to loan guarantee programs (analogous to low-type subsidies), citing Chilean evidence (Cowan et al. 2015) showing that guarantees increase both guaranteed and non-guaranteed credit supply, consistent with the model&amp;rsquo;s cross-submarket externality mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main data limitations acknowledged in the empirical analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The KFS records the type of debt collateral only in the last three years of the survey (2009-2011), severely limiting the time dimension for liquid collateral analysis. This prevents the use of GMM estimators (Arellano-Bond 1991) that require different lag instruments across periods. The KFS does not record ex post loan outcomes (interest rates, default rates), so the paper cannot directly test the model&amp;rsquo;s prediction that loans with liquid collateral carry lower interest rates and lower default rates (unlike Berger et al. 2016 using Bolivian data). Loan application outcomes are also not available, preventing a sample restriction to successful applicants, which would resolve one direction of bias in the signaling motive estimator. The liquid collateral variable encompasses all debt types (business loans, credit cards, lines of credit), not only commercial bank loans, which is the model&amp;rsquo;s focus.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Signaling motive for liquidity&lt;/strong&gt;: In the paper&amp;rsquo;s sense: small firms hold liquid assets specifically to satisfy bank down payment requirements, thereby credibly signaling their investment quality (high success probability) to lenders who cannot observe borrower type. This is distinct from the textbook corporate finance definition of signaling; here the signal operates through costly liquid collateral pledged inside the credit contract, not through equity stakes or dividends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-finance channel&lt;/strong&gt;: In the paper&amp;rsquo;s sense: the outside option to bank borrowing, in which an entrepreneur uses accumulated liquid holdings to directly purchase capital and invest when she either fails to match with a banker or prefers not to. The channel is endogenous — its value depends on the entrepreneur&amp;rsquo;s liquidity holdings z and investment success probability δ_j — and is the structural ingredient that makes liquidity valuable both inside and outside the credit market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market tightness (θ) as a screening device&lt;/strong&gt;: In the paper&amp;rsquo;s sense: bankers deliberately make high-type loan contracts scarce (low θ_H, i.e., few bankers per entrepreneur in the high-type submarket), reducing the loan approval probability µ(θ_H). Because low types have a lower surplus from obtaining a high-type loan than high types do, they are disproportionately discouraged by a low approval probability. Market tightness is the extensive-margin screening instrument in the GSW framework; this paper adds down payment as a second instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Down payment (d) as inside collateral&lt;/strong&gt;: In the paper&amp;rsquo;s sense: liquid assets pledged at the time of loan application, paid from the entrepreneur&amp;rsquo;s own liquid holdings z. Called &amp;lsquo;inside collateral&amp;rsquo; because the pledged assets (liquidity) are used in financing the project, as opposed to &amp;lsquo;outside collateral&amp;rsquo; (equipment, inventory) not used in the financed project. The down payment is the intensive-margin screening instrument; high types pledge d_H = z_H, their full liquid holdings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Constrained efficiency with adverse selection&lt;/strong&gt;: In the paper&amp;rsquo;s sense: the best allocation achievable by a social planner who faces the same information asymmetry (types are private) and the same search frictions as agents, and who maximizes a welfare-weighted sum of entrepreneur payoffs subject to incentive compatibility, participation, and budget balance constraints. The paper shows the competitive equilibrium may fail constrained efficiency due to a cross-submarket externality not internalized by individual bankers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual deviation (moral hazard extension)&lt;/strong&gt;: In the paper&amp;rsquo;s sense (Appendix E): when loan proceeds (banknotes) can be used to purchase consumption goods as well as capital, a low-type entrepreneur who misreports her type faces two deviation margins — misreporting her type (adverse selection) and diverting loan proceeds to consumption rather than investment (moral hazard). Dual deviation raises the low types&amp;rsquo; payoff from mimicry and forces bankers to add loan size as a third screening tool, at the cost of an inefficiently small loan.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Opportunity cost of liquidity (i) and regime transitions&lt;/strong&gt;: In the paper&amp;rsquo;s sense: i = 1/(β(1+r_z)) − 1, the per-period cost of holding one unit of liquid assets, which equals the inflation rate π in steady state. As i increases, it simultaneously raises the self-finance outside option (liquidity becomes a better investment channel) and affects the low types&amp;rsquo; incentive to mimic high types, triggering discrete transitions between four equilibrium regimes from no credit market participation through increasingly distorted screening configurations.&lt;/p&gt;</description></item><item><title>An Analytical Model of Behavior and Policy in an Epidemic</title><link>https://macropaperwarehouse.com/papers/an-analytical-model-of-behavior-and-policy-in-an-epidemic/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-analytical-model-of-behavior-and-policy-in-an-epidemic/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper builds a tractable, fully analytical version of the workhorse macro-epidemiology (&amp;ldquo;econ-epi&amp;rdquo;) model and uses it to characterize how susceptible individuals behave during a deadly epidemic, how a social planner would have them behave, and the externality that separates the two. The motivation is that prior macro-SIR results came almost entirely from numerical simulation; a closed-form treatment can expose general insights those simulations missed and provide a transparent benchmark for any future epidemic. The model appends the standard Kermack-McKendrick SIR system (susceptible S, infected I, recovered R, deceased D, with transmission rate β, recovery rate γr, death rate γd, and γ := γr + γd) with forward-looking agents who choose an activity level λ ∈ [0,1] that scales transmission via β = βa·λ + βo. The single key modeling departure is LINEAR (rather than convex) costs of mitigation, microfounded by indivisible activity choices in the spirit of Rogerson (1988); this makes the optimal control bang-bang or singular and yields closed-form solutions. Three constants organize the analysis: the herd immunity threshold S̄ := γ/β, the basic reproduction number R0 := 1/S̄, and the infection fatality rate IFR := γd/γ. A central composite statistic is the cost-benefit ratio of mitigation κ := (uW − uL)/(βa·IFR·VSL), where VSL := uW/ρ is the value of statistical life in utility terms.\n\nMain results. (1) Decentralized equilibrium (Proposition 1): there is no mitigation at the very start and the very end of the epidemic; mitigation occurs only over an interval [t0, t1). Susceptibles begin mitigating just below full susceptibility, the infection rate peaks exactly at t0 (when precautions are greatest), and from then on the effective reproduction number sits slightly below one, producing a gently declining infection path — a pattern the author notes is broadly consistent with first-wave Covid-19 data. The equilibrium infection trajectory is approximated by the simple ray I(t) ≈ (S(t)/S̄)·κ, and the equilibrium steady-state susceptibility is S∞ ≈ S̄ − S̄·√(2κR0). A higher κ and lower S̄ both reduce mitigation and raise infections (a &amp;ldquo;fatalism effect&amp;rdquo;). (2) Socially optimal behavior (Propositions 2-3): optimal policy is bang-bang (λ* ∈ {0,1}) — no mitigation at start and end, full mitigation in a single intermediate interval. The planner &amp;ldquo;holds fire,&amp;rdquo; lets infections climb high, then imposes maximal restrictions late, driving the system quickly to herd immunity. The optimal long-run susceptibility is S∞* ≈ S̄ − S̄·2κR0/(κR0 − 1)². (3) The externality: contrary to the conventional view, susceptibles&amp;rsquo; privately optimal behavior is EXCESSIVELY cautious — the equilibrium infection rate lies below the optimal infection rate for any S above herd immunity — yet cumulative deaths are HIGHER in equilibrium than under the planner. Mitigation by susceptibles mostly substitutes infection risk intertemporally (&amp;ldquo;flattening the curve also makes it fatter&amp;rdquo;); beyond eliminating epidemic overshoot it cannot prevent the inevitable share 1 − S̄ from being infected. The planner&amp;rsquo;s late-strong-short lockdown comes close to implementing a lottery that randomly selects who gets sick.\n\nImplications. Because the externality runs in the opposite direction to standard intuition, optimal policy can call for the government to INCREASE interaction (the paper cites the UK&amp;rsquo;s 2020 &amp;ldquo;Eat Out To Help Out&amp;rdquo; subsidy as an analogue). Results are framed as technical/foundational insights, not direct prescriptions: the benchmark abstracts from reinfection, variants, vaccines/cures, healthcare capacity limits, and endogenous IFR, all of which can shift specific recommendations while leaving the underlying forces intact.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the &amp;lsquo;identification&amp;rsquo; or solution strategy, and what makes the analytical characterization possible?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a theory paper, so the relevant strategy is solving the dynamic optimization analytically rather than empirically. The enabling assumption is LINEAR costs of mitigation (instantaneous utility u = λ·uW + (1−λ)·uL), microfounded by indivisible activity choices as in Rogerson (1988), where λ is the probability of being active in a mixed-strategy equilibrium. Linearity makes the current-value Hamiltonian linear in the control λ, so the optimal control is bang-bang or singular with switching function ψ(t) := uW − uL − (ηs(t) − ηi)·βa·I(t). This permits closed-form characterization of switching points and trajectories. The main &amp;rsquo;threat&amp;rsquo; the author addresses is generality: does linearity drive the conclusions? Section VI shows numerically that convex costs (U = uL + λ^(1−α)·(uW − uL), with α the convexity degree) merely smooth out the kinks and corners without changing qualitative features — passing what the author calls the &amp;lsquo;Solow test.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the core economic mechanism behind &amp;rsquo;excessive caution,&amp;rsquo; and the two ways the paper frames the externality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In equilibrium, the singular-control optimality condition equates a constant marginal cost of mitigation (uW − uL) to a marginal benefit (ηs(t) − ηi)·βa·I(t). The shadow value of being susceptible ηs(t) rises over time (cumulative future infection risk and cumulative future mitigation effort both decline as the epidemic progresses), while ηi is constant. To keep the equation balanced, βa·I(t) must fall, so agents become more cautious over time. First framing of the externality: the planner recognizes that at least 1 − S̄ of the population must eventually be infected (and a share IFR of those die); individuals recognize this too (perfect foresight) but each wants to avoid being in the infected group, so they over-mitigate, merely delaying rather than preventing infections. Second framing: stronger mitigation today lowers near-term infections but raises later infections — &amp;lsquo;flattening the curve also makes it fatter&amp;rsquo; — so beyond removing overshoot, mitigation only substitutes infection risk intertemporally. The planner internalizes the whole time path; individuals take the aggregate infection rate as given.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the optimal lockdown &amp;rsquo;late, strong, and short&amp;rsquo; rather than gradual?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;From the planner&amp;rsquo;s law of motion, the velocity Ṡ/S is proportional to I. An interior λ would lower instantaneous costs proportionately but increase the duration of mitigation more than proportionately (since both λ and I are lower), so gradualism is dominated. This makes optimal policy bang-bang with a single interval of maximal restriction. The planner therefore holds fire, lets I climb high (where the system moves fast), then imposes λ=0 to drive the trajectory quickly to herd immunity — minimizing cumulative deaths at minimum cost rather than flattening the curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do equilibrium and optimal cumulative deaths compare, and why does the more cautious equilibrium produce MORE deaths?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cumulative deaths equal IFR·(1 − S∞). The equilibrium steady-state susceptibility S∞ ≈ S̄ − S̄·√(2κR0) lies below the planner&amp;rsquo;s S∞* ≈ S̄ − S̄·2κR0/(κR0 − 1)², meaning the equilibrium overshoots herd immunity by more, so 1 − S∞ (cumulative infections) and hence deaths are higher in equilibrium. The equilibrium&amp;rsquo;s caution lowers the infection rate at each S above herd immunity and stretches the epidemic out (raising economic cost), but does not prevent the inevitable infections and in fact allows more overshoot than the planner&amp;rsquo;s quick-to-herd-immunity strategy. Cumulative death toll is increasing in R0 and in κ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the cost-benefit ratio κ and the &amp;lsquo;fatalism effect&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;κ := (uW − uL)/(βa·IFR·VSL) combines preferences, epidemiology, and policy effectiveness: the numerator is the utility cost of mitigation; the denominator is the benefit (lower activity reduces transmission by βa, preventing deaths by IFR, each life worth VSL = uW/ρ). A higher κ lowers mitigation and raises the equilibrium infection rate, starts mitigation later (lower S(t0)), and raises cumulative deaths. The &amp;lsquo;fatalism effect&amp;rsquo; has two parts: a lower S̄ (greater lifetime chance of falling ill) dissuades mitigation today; and the high expected cumulative future mitigation effort at the epidemic&amp;rsquo;s start lowers the value of staying alive, further tempering precaution. The simple approximation I(t) ≈ (S(t)/S̄)·κ captures the first part but omits the second.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the practical &amp;lsquo;back-of-the-envelope&amp;rsquo; contribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper provides a recipe to trace the equilibrium epidemic path without solving the full dynamic model: (1) compute the thresholds S(t0) ≈ 1 − κ/(√(2κR0)·(1−S̄))·S̄(1−S̄), S(t1) ≈ S̄ − ρ/(βo + βa), and S∞ ≈ S̄ − S̄·√(2κR0); (2) plot the ray I = (S/S̄)·κ between the thresholds; (3) splice it on both sides with the no-mitigation (λ=1) trajectory I = −S + S̄·log S + C0. This rivals running the naive SIR model in simplicity but is grounded in optimizing behavior, giving a more plausible benchmark for human populations. The author intends it for forecasting any future epidemic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the results relate to and differ from prior numerical econ-epi work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The equilibrium characterization is qualitatively consistent with Farboodi et al. (2021) — little mitigation at the start, then a jump keeping the effective reproduction number just below 1 — the only difference being their path is smoother due to convex costs. Eichenbaum-Rebelo-Trabandt (2021) get a qualitatively different, still hump-shaped equilibrium infection path because in their calibration mitigation is too weak to push the effective reproduction number below 1 (so βo is not &amp;lsquo;sufficiently low&amp;rsquo;). For the planner, the paper&amp;rsquo;s late-strong-short lockdown differs from work finding early/strong responses (Farboodi et al.) or intermediate restrictions (Alvarez et al. 2021; Eichenbaum et al. 2021), for two reasons: (1) this model rules out suppression/vaccine arrival as a feasible endgame, whereas papers allowing vaccine arrival find early strong suppression optimal; (2) the planner here controls only susceptibles&amp;rsquo; behavior with linear costs, whereas broader instruments and convex costs make intermediate restrictions more attractive. The paper is, to the author&amp;rsquo;s knowledge, the first to derive equilibrium and optimal behavior fully analytically and to show the susceptibles&amp;rsquo; externality makes the infection rate too LOW socially.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the costate (shadow-value) dynamics reveal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The private value of infection ηi = (uI + (γr/ρ)·uW)/(ρ+γ) is time-invariant (payoffs while ill/recovered/dead don&amp;rsquo;t depend on timing). The social value of an infected person η&lt;em&gt;i is time-varying because the planner internalizes onward transmission via a (η&lt;/em&gt;i − η&lt;em&gt;s)(βaλ&lt;/em&gt; + βo)S* term. η&lt;em&gt;i is deeply negative at the epidemic&amp;rsquo;s start (diverging as I→0, because an infinitesimal seed inflicts unboundedly large relative damage), rises sharply and roughly tracks the private value during the bulk of the epidemic (e.g. when S ∈ [0.5, 0.9]), and settles just above zero in the long run. In the long run the social value of an additional infected person can even be negative when γd is high, because the value of that person&amp;rsquo;s life is below the welfare loss from infections they spread. The social value of a susceptible η&lt;/em&gt;s is always below the private value (except converging to uW/ρ in the long run), reflecting unpriced future contagion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness/extension checks does the paper run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section VI: (1) Convex costs (numerical, α=0.3) smooth kinks but preserve qualitative features. (2) Broader planner instruments — controlling susceptibles AND infected (without distinguishing them), or restricting everyone identically — are &amp;lsquo;double-edged&amp;rsquo;: more costly (especially late when many are recovered) but more effective because they also restrict the infected; effectiveness gains peak at intermediate restrictions (around λ=1/2) due to the quadratic contact function, which makes intermediate restrictions and earlier/longer lockdowns more attractive, moving results toward Alvarez et al. (2021). Section VII discusses healthcare/ICU capacity constraints (optimal to hold infections at the capacity level until near herd immunity; endogenous IFR brings equilibrium and optimal paths closer but doesn&amp;rsquo;t change the externality&amp;rsquo;s nature), feasible suppression (optimal policy becomes a discrete choice between herd-immunity and best suppression strategy; equilibrium behavior is largely insensitive to suppression feasibility), and temporary immunity/endemicity (strengthens the fatalism effect, raising equilibrium infections; optimal policy still rushes to steady state, now also to avoid costly multiple waves).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the calibration used for the figures, and is it meant to be quantitatively serious?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The calibration resembles Covid-19 but is explicitly illustrative, not a serious quantitative calibration. A model period is a week. Epidemiological parameters: βo = 0.7, βa = 1.24, γr = 0.77, γd = 0.0078, implying R0 = 2.5, S̄ = 0.4, IFR = 1%, and average disease duration of 9 days; under full mitigation (λ=0) R0 falls to 0.9. Annual discount rate is 4% (weekly ρ = 0.96^(−1/52) − 1). Utility is logarithmic; weekly consumption is $60,000/52 ≈ $1,250 so uW = log(1250) ≈ 7; full lockdown cuts consumption 20%, giving uL = 6.6, (uW − uL)/uL = 3.2%. With VSL = $10 million, κ = 0.002 (0.2%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key caveats and the scope of the policy implications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author stresses the model is a stripped-down BENCHMARK: no reinfection, no variants, constant IFR, no cure or vaccine (so herd immunity pins down minimum feasible deaths). Specific results are &amp;rsquo;technical contributions, not direct normative prescriptions.&amp;rsquo; The striking implication that a planner might subsidize interaction (forcing susceptibles to interact, since optimal activity sometimes exceeds equilibrium activity) faces an implementability problem — restricting activity is easier than increasing it. The herd-immunity-quick strategy ceases to be optimal once suppression is feasible (vaccine/cure expected), ICU constraints bind with endogenous IFR, or immunity is only temporary; but the underlying forces (the susceptibles&amp;rsquo; intertemporal infection-substitution externality) continue to operate in all these richer settings.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Herd immunity threshold (S̄)&lt;/strong&gt;: S̄ := γ/β, the level of susceptibility below which the infected pool shrinks; in this model, because there is no cure or vaccine, it pins down the minimum feasible deaths and is the endgame both equilibrium and planner converge toward.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost-benefit ratio of mitigation (κ)&lt;/strong&gt;: κ := (uW − uL)/(βa·IFR·VSL), a composite statistic combining preferences, epidemiology, and policy effectiveness; the numerator is the utility cost of mitigation and the denominator the benefit (transmission reduction βa times deaths averted IFR times value of statistical life). Higher κ means less mitigation and more infections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Excessive caution / susceptibles&amp;rsquo; externality&lt;/strong&gt;: The paper&amp;rsquo;s central finding that privately optimal mitigation by susceptibles is too cautious socially — the equilibrium infection rate lies below the optimal rate for any S above herd immunity — because each individual wants to avoid being in the inevitable infected share, merely substituting infection risk intertemporally rather than preventing it; the conventional one-way infected-spreader externality view is therefore incomplete.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Linear costs of mitigation / singular control&lt;/strong&gt;: The assumption (microfounded by indivisible activity choices à la Rogerson 1988) that utility is linear in activity λ, making the Hamiltonian linear in the control so the optimum is bang-bang or singular; this delivers sharp closed-form solutions whose intuitions survive under convex costs (the &amp;lsquo;Solow test&amp;rsquo;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Late-strong-short lockdown&lt;/strong&gt;: The socially optimal policy in this benchmark: hold fire while infections climb high, then impose maximal restrictions (λ=0) in a single intermediate interval that quickly drives the system to herd immunity — minimizing cumulative deaths at minimum cost rather than flattening the curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Costates (ηs, ηi)&lt;/strong&gt;: Shadow values of being in the susceptible and infected states. ηi (private) is constant since the payoffs of being ill are timing-independent; the planner&amp;rsquo;s η*i is time-varying because it internalizes onward transmission and can even be negative in the long run when the death rate is high.&lt;/p&gt;</description></item><item><title>An irrelevance theorem for risk aversion and time-varying risk</title><link>https://macropaperwarehouse.com/papers/an-irrelevance-theorem-for-risk-aversion-and-time-varying-risk/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-irrelevance-theorem-for-risk-aversion-and-time-varying-risk/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Chen and Palomino prove a general irrelevance theorem identifying when risk aversion and time-varying risk are irrelevant for key model dynamics in representative-agent macroeconomic models. The central research question is why advances in risk modeling — Epstein-Zin (EZ) recursive preferences, long-run risk, disaster risk — generate rich asset price behavior in endowment economies but fail to produce commensurate effects in standard production economies. The paper resolves this puzzle by characterizing the precise structural conditions under which risk parameters become irrelevant, and provides a taxonomy for how models can escape those conditions.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a representative-agent model with EZ preferences, which separate the elasticity of intertemporal substitution (EIS, parameter psi) from risk aversion (gamma). The remaining economic structure — production technology, resource constraints, government policy, financial sector — is assumed to exhibit an analogous separation: variables that control expected values (&amp;ldquo;first moment states,&amp;rdquo; such as capital and productivity) are separated from variables that control higher central moments (&amp;ldquo;higher moment states,&amp;rdquo; such as stochastic volatility of productivity). The paper proceeds through three settings of increasing generality: a two-period illustrative model, a dynamic stochastic growth model with capital adjustment costs (Jermann 1998) and heteroskedastic AR(1) productivity, and a fully abstract general model covering a broad class of rational-expectations equilibrium systems.&lt;/p&gt;
&lt;p&gt;The central result is Theorem 1: if (1) intertemporal and risk preferences are separated (EZ-style), (2) first and higher moment drivers of the remaining model structure are separated, and (3) constraints are approximately linear, then risk aversion gamma and higher-moment parameters theta_h are irrelevant for the elasticity of any endogenous variable — including all asset prices — with respect to first moment states and lagged endogenous variables. Formally, in the solution z_t = z + Z_z&lt;em&gt;z_{t-1} + Z_x&lt;/em&gt;x_t + Z_h*h_t, the elasticity matrices Z_z and Z_x are independent of gamma and theta_h. Risk parameters affect only model intercepts and steady states (the constant z) and the elasticity with respect to higher moment states (Z_h). Thus augmenting a stochastic growth model with shocks to volatility or risk aversion has no effect on impulse responses to productivity shocks or other first-moment disturbances.&lt;/p&gt;
&lt;p&gt;In the homoskedastic special case (constant volatility), risk aversion is irrelevant for the impulse response of every variable, including all asset prices. This clarifies the Tallarini (2000) separation: it is not a separation between macroeconomic and financial variables, but between means (average equity premium, steady-state levels) and volatilities and impulse responses. Risk aversion affects the level of the equity premium but not stock price volatility or impulse responses.&lt;/p&gt;
&lt;p&gt;Numerical verification using projection methods (Caldara et al. 2012) confirms irrelevance holds even at risk aversion of 100 and unconditional volatility of volatility of 80% of baseline. A second, richer model class — with EIS of 0.3, capital adjustment cost elasticity of 3, and left-skewed gamma-distributed productivity shocks calibrated to match Bekaert and Engstrom (2017) quarterly consumption growth moments (kurtosis 4.04, skewness -0.399, matching model kurtosis of 4 and skewness of -0.82) — produces an equity premium more than three times larger than the baseline class and a stock price elasticity with respect to productivity about three times larger, yet continues to display irrelevance: risk aversion and time-varying risk have essentially no effect on the stock price elasticity with respect to productivity.&lt;/p&gt;
&lt;p&gt;The theorem extends to smooth ambiguity preferences (Klibanoff, Marinacci, Mukerji 2005) and multiplier preferences (Hansen and Sargent 2001) as long as risk adjustments remain functions of higher-moment state variables. The paper also derives the Barro-King (1984) comovement restriction under recursive preferences (Appendix C), showing that in the neoclassical structure only productivity shocks generate positive comovement of consumption, investment, and labor. This interacts with the irrelevance theorem to explain why production-economy asset pricing models face a compounded difficulty: volatility and risk-aversion shocks cannot break irrelevance within the standard structure, and they also cannot generate the required comovement without additional mechanisms.&lt;/p&gt;
&lt;p&gt;The paper provides a unified taxonomy for generating a meaningful role for risk in production economies. One can &amp;ldquo;break&amp;rdquo; irrelevance by removing one of the three assumptions: (1) allowing risk aversion to vary with economic conditions as in Campbell-Cochrane (1999) habit formation or heterogeneous agents; (2) introducing non-separability between first and higher moments in production, as in Di Tella and Hall (2022) where entrepreneurial idiosyncratic risk makes aggregate volatility endogenous; or (3) incorporating sufficient nonlinearity via occasionally binding constraints, as in Brunnermeier-Sannikov (2014) or Gourio-Ngo (2020) near the zero lower bound. Alternatively, one can &amp;ldquo;adapt&amp;rdquo; to irrelevance by driving dynamics with higher-moment shocks — volatility shocks (Basu-Bundick 2017, combined with nominal rigidities to preserve comovement) or risk-aversion shocks (Basu et al. 2024, combined with an investment reallocation channel).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core intuition behind the irrelevance theorem?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Euler equation under EZ preferences decomposes into an Intertemporal Term (characterizing expected consumption-return tradeoffs, driven by EIS) and a Risk Term (characterizing tradeoffs across unexpected future states, driven by risk aversion). In standard models, the production technology is &amp;lsquo;a perfect foresight model with shocks tacked on&amp;rsquo;: transformation across time is separated from transformation across future states. Because constraints are approximately linear, innovations to endogenous variables with respect to first-moment shocks (productivity, capital) do not contain investment or other endogenous variables, so the Risk Term is a function only of higher-moment states. Differentiating the Euler equation with respect to a first-moment state therefore eliminates the Risk Term entirely, leaving only the Intertemporal Term and making the solution for that elasticity independent of gamma and sigma.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the Tallarini (2000) result clarified and extended?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Tallarini (2000) shows that risk aversion is irrelevant for quantity dynamics in a homoskedastic real business cycle model. This is widely interpreted as a separation between macroeconomic (quantity) and financial (price) variables. The paper shows this interpretation is incorrect. When shocks are homoskedastic, risk aversion is irrelevant not just for quantities but for all asset price dynamics, including stock price volatility. The actual separation is between means (steady states, intercepts, average equity premium — all of which depend on risk aversion) and volatilities and impulse responses (which do not). The paper extends Tallarini&amp;rsquo;s result by showing irrelevance holds for all endogenous variables including stock prices, by showing it persists under heteroskedasticity for elasticities with respect to first-moment states specifically, and by generalizing to abstract models beyond the neoclassical RBC framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three conditions required for irrelevance and what is the role of each?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The three conditions are: (1) Separation of intertemporal and risk preferences — EZ-style preferences ensure risk aversion gamma enters only the Risk Term of the Euler equation, not the Intertemporal Term. If preferences are non-separable (e.g., power utility, habit formation), gamma enters the intertemporal tradeoff and affects first-moment elasticities. (2) Separation of first and higher moment drivers in the remaining model structure — production technology and all other constraints must not link transformation of goods across time to transformation across states. If higher-moment variables appear in the production function or resource constraint (e.g., idiosyncratic risk in entrepreneurial production as in Di Tella-Hall 2022), first-moment states appear in the Risk Term and irrelevance breaks. (3) Approximate linearity of constraints — nonlinearities create interactions between current state values and forward-looking volatility. Strong enough nonlinearities (such as those introduced by occasionally binding constraints near the zero lower bound or in financial crisis models) can cause irrelevance to fail even when conditions (1) and (2) hold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the formal mathematical structure of the general model and theorem?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The general model consists of a system of expectational equilibrium conditions E[f(z_{t+1}, x_{t+1} | z_t, x_t, h_t, z_{t-1}; Theta)] = 0, where z_t are endogenous variables, x_t are first-moment exogenous states following a heteroskedastic AR(1) with shock distribution conditional on h_t, and h_t are higher-moment states with an independent AR(1) process. The equilibrium conditions split into constraints (f0, depending only on theta_0, not gamma or theta_h) and asset-pricing Euler equations (depending on the EZ SDF, hence on gamma). The proof uses a risk-adjusted affine approximation (Assumptions 1 and 2): constraints are approximated as conditionally affine in states; the CGF of shocks is conditionally affine in h_t. Conjecturing a linear solution z_t = z + Z_z&lt;em&gt;z_{t-1} + Z_x&lt;/em&gt;x_t + Z_h*h_t and applying the method of undetermined coefficients in separate layers shows that Z_z satisfies a quadratic matrix equation depending only on theta_0 (Proposition 2, Equation 171), and Z_x satisfies a Sylvester equation also depending only on theta_0 and Z_z (Equation 172). Since neither equation involves gamma or theta_h, those parameters are irrelevant for Z_z and Z_x. Z_h and z do depend on all parameters including gamma and theta_h.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the irrelevance theorem interact with the Barro-King (1984) comovement constraint?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Barro and King (1984) show that, in the neoclassical structure, shocks other than productivity shocks fail to generate the observed positive comovement of consumption, investment, and labor. The paper derives this result under recursive preferences in Appendix C, confirming it extends to the EZ case. The comovement constraint implies that, within the neoclassical structure, the magnitude of higher-moment shocks must be limited to preserve comovement — production-economy asset pricing models typically drive business cycles with productivity shocks rather than volatility or risk-aversion shocks. But the irrelevance theorem implies that productivity shock impulse responses are independent of risk. Together, these results explain why modeling asset prices in production economies is non-trivial: one must simultaneously address comovement (ruling out large higher-moment shocks as the primary business cycle driver) and irrelevance (meaning productivity shocks cannot be enriched with risk dynamics). A successful model must either break irrelevance or adapt to it with mechanisms that also solve the comovement problem.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does it mean to &amp;lsquo;break&amp;rsquo; irrelevance and what are the main examples?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Breaking irrelevance means removing one of the three conditions so that risk aversion or risk parameters enter the elasticity with respect to first-moment states. Examples: (1) Campbell-Cochrane (1999) external habit: risk aversion varies over time as consumption approaches habit, creating time-varying links between the intertemporal and risk terms of the Euler equation. Heterogeneous households (Guvenen 2009) produce similar effects. (2) Di Tella and Hall (2022): entrepreneurs face uninsurable idiosyncratic shocks, making the aggregate production function incorporate risk. Volatility is endogenous and affects how the economy responds to first-moment shocks. Colacito et al. (2014), Decker et al. (2016), and Belo (2010) similarly incorporate production risk-return tradeoffs. (3) Brunnermeier-Sannikov (2014) financial frictions and Gourio-Ngo (2020) zero lower bound: occasionally binding constraints introduce strong enough nonlinearities to break the affine approximation and generate large endogenous volatility far from the steady state. A non-separable production example is also given: if k_{t+1} = (k+i)*1{epsilon &amp;gt;= 0}, investment appears in the consumption innovation and hence in the Risk Term, causing gamma and sigma to enter the first-moment elasticity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does it mean to &amp;lsquo;adapt&amp;rsquo; to irrelevance and what are the main examples?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Adapting to irrelevance means staying within the class of models covered by the theorem but driving business cycle dynamics with shocks to higher-moment states rather than first-moment states. In this approach, risk aversion and risk parameters remain irrelevant for how the model responds to first-moment shocks (productivity, capital), but they do affect the elasticity with respect to higher-moment shocks and thus drive important dynamics. Basu and Bundick (2017) drive cycles with shocks to the volatility of time preference and maintain positive comovement of consumption, investment, and labor by incorporating nominal rigidities (New-Keynesian frictions break the Barro-King constraint). Basu et al. (2024) drive cycles with shocks to risk aversion and recover comovement via a novel investment reallocation channel between labor and capital. Dupor and Mehkari (2014) document other mechanisms that can overcome the comovement problem, including consumption-investment complementarities and externalities in leisure preferences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper extend irrelevance beyond Epstein-Zin preferences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper shows irrelevance holds for a broader family of preferences as long as the log SDF can be written as a base component m*&lt;em&gt;{t+1} plus additional risk adjustments m&lt;/em&gt;{i,t+1} = f_tilde_i(Lambda, theta_0) * A_i * z_{t+1}, where Lambda is a generalized risk parameter vector (encompassing ambiguity aversion and other attitudes), and the associated certainty equivalent condition E_{i,t}[A_i&lt;em&gt;z_{t+1}] = -H_{i,t}[f_hat_i * A_i&lt;/em&gt;z_{t+1}] holds. This formulation covers smooth ambiguity preferences (Klibanoff et al. 2005, illustrated via Ju-Miao 2012 generalized smooth ambiguity with ambiguity aversion parameter eta) and multiplier preferences (Hansen-Sargent 2001). The key property for irrelevance to hold is that the risk adjustments are solely functions of higher-moment state variables h_t. For smooth ambiguity, irrelevance holds if belief dynamics are exogenous, as in Ilut-Schneider (2014).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What numerical exercises are conducted to validate the approximate linearity assumption?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two classes of models are solved using projection methods (Caldara et al. 2012), which provide the highest accuracy among available solution methods and capture time variation in risk premiums that second-order perturbation methods cannot. Class 1 replicates Tallarini (2000): EIS = 1, elasticity of investment = 10, normally distributed shocks (gamma shape parameter = 600), calibrated to HP-filtered output volatility of about 1.5% per quarter. Class 2 introduces larger frictions: EIS = 0.3, elasticity of investment = 3, left-skewed gamma shocks with shape parameter 6 (implying kurtosis = 4, skewness = -0.82, consistent with Bekaert-Engstrom 2017 empirical moments of quarterly consumption growth: kurtosis 4.04, skewness -0.399). For both classes, risk aversion is varied up to 100 and the unconditional volatility of volatility up to 80% of the baseline volatility. In both classes, the stock price elasticity with respect to productivity shows essentially no variation with risk aversion or volatility-of-volatility (though a slight negligible median decline is noted), while the equity premium and the stock price elasticity with respect to volatility respond clearly to those risk parameters. The exercise also shows Class 2 produces an equity premium more than three times larger than Class 1 and a stock price elasticity with respect to productivity about three times larger, yet irrelevance persists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from Backus, Ferriere, and Zin (2015)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Backus, Ferriere, and Zin (2015) is the closest predecessor, providing irrelevance results for several specific models of time-varying risk and time-varying ambiguity. However, the paper argues they share the common misinterpretation of the Tallarini property as a separation between quantities and prices. The present paper extends their results into a fully abstract, general model structure with arbitrary equilibrium conditions and arbitrary shock distributions, proving irrelevance without tying it to specific model structures. This generality allows the paper to clarify that the separation is between means and volatilities, not between macro and finance variables. The paper also provides a clearer account of how models generate meaningful risk dynamics by breaking or adapting to the three theorem conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the relationship between the paper&amp;rsquo;s results and risk-adjusted affine approximations in the prior literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The proof builds directly on the risk-adjusted affine approximation methodology of Jermann (1998), Malkhozov (2014), and Lopez, Lopez-Salido, and Vazquez-Grande (2018). These approximations preserve exact equality for the nonlinear expectation and certainty equivalent equations (not linearizing them) while linearizing other constraints. Special cases of the irrelevance result appear in the second- and third-order perturbation solutions of Schmitt-Grohe and Uribe (2004) and Van Binsbergen et al. (2012), which this paper unifies and generalizes. The use of entropy (the conditional cumulant generating function operator) to summarize higher-order terms is motivated by Backus et al. (2014), who show entropy effectively summarizes asset pricing properties of pricing kernels. The conditionally affine CGF assumption (Assumption 2) generalizes the normal-shock setting where CGFs are exactly affine in h_t.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the scope conditions and limitations of the theorem?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The theorem applies under three maintained assumptions: (1) separation of preferences (EZ-style or the broader class in Section 4.4), (2) separation of first and higher moment drivers in all model constraints including government, financial sector, labor markets, and endowment processes, and (3) approximate linearity — formally, that the affine approximation (Assumptions 1 and 2) is accurate. The theorem does NOT apply when: constraints are strongly nonlinear due to occasionally binding constraints (ZLB, financial crisis regimes); production incorporates endogenous risk-return tradeoffs; risk aversion varies endogenously with the state (habit formation, wealth distribution with heterogeneous agents); or belief dynamics are endogenous in the ambiguity case. The paper cannot provide a complete characterization of when nonlinearities are &amp;lsquo;strong enough&amp;rsquo; to break irrelevance — numerical evidence suggests simply increasing risk aversion or vol-of-vol is insufficient, but occasionally binding constraints in the literature have been shown to be sufficient. The theorem also assumes the first and higher moment state shocks are independent (Equation 54), a modeling assumption that drives the separation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the results imply for how the field should model asset prices in production economies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The theorem implies that meaningful risk modeling in production economies is fundamentally more demanding than in endowment economies. In endowment economies, adding EZ preferences with high risk aversion or stochastic volatility directly affects how asset prices respond to the endowment process. In production economies, these same additions have no effect on impulse responses to productivity shocks — the primary drivers of business cycles in the neoclassical structure — because productivity is a first-moment state. Successful production-economy asset pricing models must therefore either: incorporate mechanisms that connect intertemporal and risk tradeoffs in production (endogenous volatility, incomplete markets, idiosyncratic risk); introduce sufficient structural nonlinearity; or drive business cycles with higher-moment shocks combined with additional mechanisms to preserve comovement. The paper suggests that the limited success of long-run risk and disaster risk models in production economies is not a failure of calibration but a logical consequence of the theorem&amp;rsquo;s conditions being satisfied.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;First moment states&lt;/strong&gt;: Exogenous state variables that affect expected values of the model structure (e.g., productivity level, capital stock) but not the higher central moments of the shock distributions. In the general model, x_t with shock distribution having zero mean conditional on h_t but variance and higher moments controlled entirely by h_t, not x_t itself.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Higher moment states&lt;/strong&gt;: Exogenous state variables that control the conditional higher central moments (variance, skewness, kurtosis) of the shock distributions but not their means — e.g., stochastic volatility of productivity h_t. Risk aversion and parameters governing higher moments (theta_h) are irrelevant for elasticities with respect to first-moment states but are critical for elasticities with respect to higher-moment states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Irrelevance (in this paper&amp;rsquo;s sense)&lt;/strong&gt;: The property that risk aversion gamma and higher-moment parameters theta_h do not enter the matrices Z_z and Z_x in the solution z_t = z + Z_z&lt;em&gt;z_{t-1} + Z_x&lt;/em&gt;x_t + Z_h*h_t. These parameters are irrelevant for impulse responses and dynamic elasticities with respect to first-moment states, though they do affect steady states (z), model intercepts, and elasticities with respect to higher-moment states (Z_h).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Breaking irrelevance&lt;/strong&gt;: Removing one of the three theorem conditions — separability of preferences, separability of first and higher moment drivers in constraints, or approximate linearity — so that risk aversion or risk parameters enter the first-moment elasticities. Requires economically substantive modifications such as endogenous risk-return tradeoffs in production, habit formation, or occasionally binding constraints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adapting to irrelevance&lt;/strong&gt;: Staying within the class of models covered by the theorem — accepting that risk parameters do not affect first-moment impulse responses — but driving business cycle dynamics primarily with shocks to higher-moment states (volatility, risk aversion). Requires additional mechanisms (nominal rigidities, reallocation channels) to maintain positive comovement of consumption, investment, and labor, which higher-moment shocks cannot generate in the neoclassical structure alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-adjusted affine approximation&lt;/strong&gt;: A solution method that preserves the nonlinear expectation and certainty equivalent equations exactly (not linearizing them, thereby retaining all risk effects) while log-linearizing the remaining constraints. The resulting solution is affine in the state variables, with the CGF of shocks assumed to be conditionally affine in the higher-moment states h_t. This approach captures higher-order risk terms while maintaining analytical tractability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Entropy operator&lt;/strong&gt;: The conditional matrix operator H_t[u] = log E_t[exp(u - E_t[u])], equivalent to the vectorized conditional cumulant generating function (CGF) evaluated at 1. Used to represent all higher-order terms in the equilibrium conditions compactly; the key technical tool enabling the proof to separate expectational terms (independent of risk parameters) from entropy terms (functions of higher-moment states).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Means-volatilities separation&lt;/strong&gt;: The corrected characterization of Tallarini (2000)&amp;rsquo;s result: risk aversion affects model means (intercepts, steady states, average equity premium) but not volatilities or impulse responses of any variable — including asset prices — when shocks are homoskedastic. This reinterpretation replaces the widely held but incorrect view that Tallarini establishes a separation between macroeconomic and financial variables.&lt;/p&gt;</description></item><item><title>Armed conflict exposure and trust: evidence from a natural experiment</title><link>https://macropaperwarehouse.com/papers/armed-conflict-exposure-and-trust-evidence-from-a-natural-experiment/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/armed-conflict-exposure-and-trust-evidence-from-a-natural-experiment/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks how individual-level exposure to internal armed conflict shapes social capital, specifically trust in institutions and trust in people. The question matters because trust is a core component of social capital that underpins cooperation, economic growth, financial development, political participation, and post-conflict recovery; yet the empirical literature is split between studies finding conflict erodes trust and studies finding &amp;ldquo;post-traumatic growth&amp;rdquo; that enhances pro-sociality. The authors argue prior work cannot cleanly identify causal effects because of non-random selection into exposure, attrition from migration/death, and confounding conflict-induced changes in the socio-economic environment.&lt;/p&gt;
&lt;p&gt;The empirical strategy exploits a natural experiment in Turkey: mandatory conscription assigns every male citizen, via a lottery, to a military base, and a significant share are randomly sent to bases in the eastern/south-eastern conflict zone where the state has fought the PKK since 1984. By sampling ex-recruits who live in peaceful western districts, exposure during military service is the respondents&amp;rsquo; only personal contact with the conflict, isolating individual-level effects from environmental confounds. Data come from a field survey of 5,024 randomly selected adult males in 29 western districts in summer/fall 2019 (response rate 83%); eligible men had completed service between 1984 and 2014. Only 5 respondents did not answer the military-service questions.&lt;/p&gt;
&lt;p&gt;Two exposure measures are built. ACE (Exposure to Armed Conflict Environment) is the standardized number of combatant casualties in the county and during the period of a respondent&amp;rsquo;s service, drawn from the Turkish State-PKK Conflict Event Database; its variation comes from four exogenous components (birthdate-driven timing, regulation-driven duration, clash intensity, and lottery-assigned location). TDE (Traumatic Direct Experiences) is a binary indicator equal to 1 if the respondent was wounded in armed clashes or had someone around them killed/hurt; 2% reported being wounded and 15% reported others around them killed or hurt. ACE and TDE correlate only 0.25. Two trust outcomes: Institutional Trust (average of 14 five-point items: army, judiciary, parliament, TV, newspapers, parties, clergy, universities, environmental orgs, charities, police, banks, private companies, EU) and Social Trust (trust in unfamiliar people / strangers). The army was the most trusted institution (~75% high trust vs. 43% for courts, 35% for parliament). Estimation is OLS with age, education, and minority controls, standard errors clustered at the living-block level.&lt;/p&gt;
&lt;p&gt;Main findings: the two exposure types have opposing effects. In the preferred specification including both measures, ACE raises Institutional Trust (about 0.02, significant at 5%) and Social Trust (about 0.03, significant at 5%), while TDE lowers Institutional Trust (about -0.15, 5%) and Social Trust (about -0.11, 1%). ACE is insignificant when TDE is omitted because it then pools traumatized and non-traumatized recruits, biasing it toward zero. There is no significant ACE-by-TDE interaction, so the negative trauma effect is independent of conflict intensity. Effects are similar in sign and magnitude across both trust dimensions, indicating an encompassing change rather than institution-specific distrust. Interactions with time-since-service are insignificant, implying the effects are permanent.&lt;/p&gt;
&lt;p&gt;Mechanism: the authors invoke Janoff-Bulman&amp;rsquo;s (1992) &amp;ldquo;shattered assumptions&amp;rdquo; theory. TDE is positively associated with depression and insecurity indexes, which in turn correlate negatively with both trust measures; ACE is not significantly related to depression/insecurity. There is no significant relationship between exposure and trust in the army, ruling out an accountability mechanism. Heterogeneity by in-group: TDE raises trust in family (coping mechanism) but, like strangers, friends show positive ACE and (insignificant) negative TDE effects, arguing against parochialism as the main driver. Implications: distinguish contextual from direct exposure; design psychological recovery programs for veterans; estimates are likely conservative given the limited 6-18 month exposure window.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification relies on Turkey&amp;rsquo;s conscription lottery, which randomly assigns drafted men to military bases, a significant share of which lie in the eastern/south-eastern conflict zone. Because the sample is drawn only from peaceful western districts, service is the respondents&amp;rsquo; sole exposure to the conflict, isolating individual-level effects from conflict-induced changes in the socio-economic environment. ACE&amp;rsquo;s variation comes from four exogenous components: birthdate-driven timing of service, regulation-driven service duration (18 months in the 80s, 15 in 1992, 18 in 1995, 15 in 2003, 12 in 2014), clash intensity around the base, and lottery-assigned location. Threats: (1) non-random base assignment - addressed by balance tests (Table 2) showing no systematic differences in age, ethnicity, or height by conflict-zone assignment; education differs because college graduates are slightly skewed toward western bases (40% of non-college-grads served in the east vs. 30% of college grads), but the difference vanishes when college graduates (9.3% of sample) are excluded, education is controlled in all specs, and a no-college-grad sample (Table A2) is robust; (2) self-selection into dangerous tasks/violence for TDE - addressed by the fact that task assignments are made by command at the start of service before behavior is observed, and Table 3 balance tests show wounded vs. non-wounded respondents do not differ on pre-military characteristics; an alternative TDE (observing a fellow soldier hurt/killed, immune to own risk-taking) yields similar results (Table A1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The proposed mechanism is a transformation of fundamental world assumptions (benevolence, meaning, safety of the world) per Janoff-Bulman (1992). Distinguishing tests: (1) TDE affects a broad range of trust dimensions but is NOT significantly related to trust in the army, ruling out an accountability interpretation (which would predict distrust concentrated on state security institutions) and a comradeship interpretation (which would predict effects only on social trust). (2) TDE is positively and significantly associated with depression and insecurity indexes (Tables 7-8), and these indexes are themselves negatively and significantly related to both trust measures, consistent with shattered world assumptions. (3) ACE is not significantly associated with depression/insecurity; the authors note these scales are worded to detect negative states and may miss the positive feelings ACE could elicit, and that indirect environmental exposure plausibly has weaker effects on fundamental beliefs than direct trauma.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central heterogeneity is by exposure type: contextual exposure (ACE) raises trust, direct trauma (TDE) lowers it. No significant ACE-by-TDE interaction, so trauma&amp;rsquo;s effect does not depend on conflict intensity. No significant moderation by time since service (Table 6), implying permanent effects. In-group heterogeneity (Table 9, ordered logit): TDE significantly raises trust in family (coefficient 0.26, 5%), interpreted as a coping mechanism of retreating to closest networks; trust in friends shows positive ACE (0.07, 5%) and negative but insignificant TDE, mirroring the stranger result. The similar pattern for strangers and friends argues against parochialism as the primary driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Alternative TDE defined as observing a fellow soldier hurt/killed, more immune to own risk-taking (Table A1) - results unchanged. (2) Excluding college graduates (Table A2) - results unchanged. (3) Tobit specification accounting for the censored nature of trust measures (Table A3) - similar results. (4) Including a conflict-zone dummy and base-district fixed effects (Tables A4-A5) to absorb unobserved location heterogeneity (though the authors note these likely absorb part of the ACE variation, so they are not in the baseline). (5) Separate results for each of the 14 institutional-trust dimensions (Table A6) and excluding one dimension at a time from the composite index - results stable. (6) Alternative standard-error clustering at home-district or region levels - unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the draft-lottery natural-experiment tradition (Angrist 1990 on Vietnam; Angrist-Chen 2011; Galiani et al. 2011; Grossman et al. 2015) and the conflict-and-social-capital literature (Rohner et al. 2013; Cassar et al. 2013; Bauer et al. 2016; Kijewski-Freitag 2018). It differs by: (1) cleanly identifying causal effects free of environmental confounds, since trust is measured in untouched western locations rather than in transformed post-conflict settings; (2) carefully separating contextual from direct exposure, which many studies cannot; (3) proposing a novel individual-level psychological mechanism (shattered world assumptions) rather than the economic/institutional-legacy channels (Besley-Reynal-Querol 2014; Nunn-Wantchekon 2011; Grosjean 2014) or the inter-group-competition/parochialism explanation (Bauer et al. 2016). The authors argue the heterogeneity they document can help reconcile the conflicting positive and negative findings in prior literature - prior &amp;lsquo;pro-social&amp;rsquo; effects may reflect coping-driven re-creation of safe social space (consistent with Grosjean&amp;rsquo;s (2014) &amp;lsquo;dark nature&amp;rsquo; of conflict-induced pro-sociality), not genuine restoration of trust.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two main implications: (1) researchers and policy advisers should carefully distinguish contextual from direct conflict exposure when studying behavioral outcomes; (2) the findings inform the design of psychological and social recovery programs for combat veterans and victimized post-conflict populations. Scope conditions: the study is specific to the Turkish conflict setting and limited to male ex-combatants; it remains open whether effects generalize to women, civilians, or other countries. Because exposure lasted only a pre-determined 6-18 months after which recruits returned to peaceful lives, the authors argue estimates are conservative relative to populations living in protracted conflict environments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What additional findings or caveats are noted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors report (results not shown) that individuals with traumatic experiences are more likely to participate in political organizations, and cite Kibris-Nelson (2021) that such individuals are more likely to start their own businesses (while being less successful at it), consistent with coping strategies of creating a controllable environment. They concede the mechanism evidence for the positive ACE effect is &amp;lsquo;somewhat less clear&amp;rsquo; than for TDE, and offer an alternative possibility that whether intense-environment survival raises trust may be moderated by how heroically the veteran&amp;rsquo;s social network views his service. The depression subscale is the 6-item Brief Symptoms Inventory; insecurity is an 8-item scale. Roughly 6.5 million of the 15 million men drafted since 1984 are estimated to have served in the conflict zone.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Exposure to Armed Conflict Environment (ACE)&lt;/strong&gt;: A standardized, individual-specific measure of contextual conflict exposure equal to the number of combatant casualties in the county and during the time period of a respondent&amp;rsquo;s military service. It captures immersion in the conflict environment with high geo-temporal precision and is treated as exogenous because its components (birthdate-driven timing, regulation-driven duration, clash intensity, lottery-assigned location) are outside the individual&amp;rsquo;s control.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Traumatic Direct Experiences (TDE)&lt;/strong&gt;: A binary indicator equal to 1 if a respondent was personally wounded in armed clashes or had someone around them killed or hurt during military service. It captures direct, personal experience of violence as distinct from mere presence in a conflict environment; in the sample 2% were wounded and 15% had others around them hurt/killed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional Trust&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the simple average of a respondent&amp;rsquo;s 5-point Likert trust ratings across 14 public and private organizations (army, judiciary, parliament, media, parties, clergy, universities, environmental orgs, charities, police, banks, private companies, EU) - deliberately broad so as not to over-weight state institutions directly tied to the conflict.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Trust&lt;/strong&gt;: A generalized form of trust measured by how much a respondent trusts people they are not familiar with (strangers), rather than the vaguer &amp;lsquo;most people&amp;rsquo; wording, chosen to minimize in-group/out-group and ethnic associations and isolate generalized trust in others.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shattered assumptions&lt;/strong&gt;: The paper&amp;rsquo;s operative mechanism, drawn from Janoff-Bulman (1992): people hold core assumptions that the world is benevolent, meaningful, and safe; traumatizing experiences shatter these positive assumptions, eroding deeply rooted trust - whereas surviving a dangerous environment without mishap can instead reinforce them. Trust, depression, and insecurity are treated as observable implications of these otherwise-unobservable world assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Parochialism / parochial altruism&lt;/strong&gt;: The rival hypothesis (associated with Bauer et al. 2016) that conflict exposure increases in-group favoritism while eroding out-group trust. The paper tests and largely rejects it as the primary driver because ACE raises trust in both strangers and friends and the in-group (family) pattern does not match parochial predictions.&lt;/p&gt;</description></item><item><title>Bargaining with renegotiation in models with on-the-job search</title><link>https://macropaperwarehouse.com/papers/bargaining-with-renegotiation-in-models-with-on-the-job-search/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bargaining-with-renegotiation-in-models-with-on-the-job-search/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper resolves a long-standing theoretical impasse in labor search models: how to model wage bargaining when workers search on the job (OJS) and the quit rate depends on the wage. Shimer (2006) showed that this wage-dependent turnover creates a potentially non-convex bargaining set, causing the Nash bargaining solution to break down and generating equilibrium multiplicity. Gottfries introduces renegotiation — wages are fixed under a contract that expires at a Poisson rate γ, after which a new wage is bargained — as the device that simultaneously restores uniqueness and nests the earlier models of Pissarides (1994), Mortensen (2003), and Shimer (2006) as limit cases.&lt;/p&gt;
&lt;p&gt;The model is a continuous-time, frictional labor market with risk-neutral firms and workers. Unemployed workers receive job offers at rate λu; employed workers receive outside offers at rate λe; matches dissolve exogenously at rate δ. Wages are determined through non-cooperative alternating-offers bargaining in the spirit of Rubinstein (1982) and Binmore et al. (1986), with worker bargaining power β. The key innovation is that contracted wages last until renegotiation, which arrives at a Poisson rate γ(F), where F indexes match quality (and hence wage expectations about future renegotiations). As γ → ∞ (continuous renegotiation), the model converges to Pissarides (1994): values solve the Nash bargaining solution with perfectly transferable values, and worker turnover is independent of the current contracted wage. As γ → 0 (no renegotiation), the model converges to the unique equilibrium from Shimer (2006) and Mortensen (2003), with wages playing a strong role in retaining workers. Equilibrium uniqueness follows because renegotiation makes match types payoff-relevant — wage expectations about future negotiations differ across types, so the Nash product cannot be constant on the support, pinning down the initial condition for the wage differential equation.&lt;/p&gt;
&lt;p&gt;The main mechanism is a turnover-retention channel that amplifies worker bargaining power. Because a higher wage reduces the quit rate, and marginal quits are bilaterally inefficient (the firm loses its profits when the worker leaves), agreeing on a higher wage partially recoup losses through longer match duration. This acts as an additional source of worker surplus share on top of the primitive bargaining power β. The strength of this channel is governed by θ — the expected fraction of the discounted match duration covered by a given contracted wage. Higher θ (less frequent renegotiation) means wages matter more for turnover and workers extract more surplus. Lower θ (more frequent renegotiation) attenuates the channel.&lt;/p&gt;
&lt;p&gt;Calibrated to US labor market data — a 45% monthly job-finding rate (Shimer 2012), a 3.2% monthly job-to-job transition rate (Moscarini and Thomsson 2007), a 5% unemployment rate, a 5% annual discount rate, and targeting a labor share of 2/3 and a lognormal wage-offer distribution with scale parameter σ = 0.16 (Gottfries and Teulings 2017) and a mean-to-minimum wage ratio of 1.7 (Hornstein et al. 2007) — the model implies sharply different primitive bargaining powers depending on the assumed renegotiation frequency. Under continuous renegotiation (γ = ∞), the calibrated bargaining power of workers is β = 0.46. Under never-renegotiated wages (γ = 0), β = 0.02. The implication is that the correct inference about worker bargaining power from observed wage distributions is very sensitive to the assumed renegotiation regime.&lt;/p&gt;
&lt;p&gt;For minimum wages, the paper proves that, holding firm entry and the reservation wage constant, any minimum wage increase raises the entire wage distribution in the sense of first-order stochastic dominance (Proposition 2). However, the extent of spillovers above the minimum wage depends critically on renegotiation frequency. In a high-commitment economy (low γ) versus a low-commitment economy (high γ) with identical pre-policy wage distributions, the high-commitment economy exhibits strictly larger wage spillovers throughout the support above the minimum (Proposition 3). The intuition is that a spike in the mass of workers at the minimum wage creates a strong incentive for firms to offer higher wages to reduce costly turnover — but this incentive only materializes when wages are sticky enough that turnover responds appreciably to them. With continuous renegotiation, the spillover vanishes entirely and only a mass point at the minimum wage remains. In the limit of no renegotiation, the model resembles the wage-posting model, which produces especially large spillovers by construction.&lt;/p&gt;
&lt;p&gt;An extension endogenizes the contract length. Firms optimally choose the renegotiation frequency after observing the match type. Two regimes emerge: when worker bargaining power is sufficiently high or productivity rises quickly relative to profits, firms prefer continuous renegotiation; otherwise, an interior contract length strictly above zero is optimal, and firms with all the bargaining power prefer no renegotiation. This implies that the polar assumptions of full commitment or no commitment standard in the literature arise only as boundary cases.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core theoretical problem this paper addresses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Shimer (2006) demonstrated that when a worker&amp;rsquo;s quit rate depends on the contracted wage, the bargaining set can become non-convex, violating a key condition for the Nash bargaining solution. He proposed a non-cooperative alternating-offers bargaining game but showed that it produces a continuum of equilibria. The existing literature responded either by removing bargaining (wage posting, all bargaining power to firms) or by making turnover independent of the wage (counteroffers by the incumbent firm). Gottfries provides a solution that preserves both bargaining and wage-dependent turnover by introducing renegotiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does renegotiation restore equilibrium uniqueness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Without renegotiation and with homogeneous productivities (as in Shimer 2006), the match type F is not payoff-relevant: only the current contracted wage matters, so the Nash product is constant on the wage support and any wage in that support is a potential equilibrium outcome. With renegotiation, each type F is associated with a distinct expected future wage (wage expectation), which is payoff-relevant because it governs future turnover. Different types therefore face different Nash products, and the product cannot be constant across types. This forces the Nash product to be increasing to the left of the bargaining outcome and decreasing to the right for each type, providing a unique interior maximum and a unique initial condition w(0) = max{βx(0) + (1−β)wr, wmin}. The paper also shows that alternative refinements — large-friction limits or the case where λe = 0 — yield the same unique equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model nest Pissarides (1994) and Mortensen (2003)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;As γ → ∞ (continuous renegotiation, θ → 0), the contracted wage becomes irrelevant because future wages are renegotiated almost immediately. The worker&amp;rsquo;s quit decision is then independent of the current wage, so values solve the standard Nash bargaining solution with perfectly transferable values, exactly as in Pissarides (1994). As γ → 0 (no renegotiation, θ → 1), the wage lasts the full duration of the match, turnover responds maximally to wages, and the equilibrium values correspond to Mortensen (2003, Section 4.3.4) with a unique initial condition (rather than the multiplicity in Shimer 2006). Intermediate values of γ correspond to no prior model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the mechanism by which workers receive a share of surplus exceeding their bargaining power β?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When a worker bargains for a higher wage, she reduces her quit probability. Marginal quits are bilaterally inefficient because the firm loses its profits when the worker leaves to a marginally better job (even though the transition is socially efficient once the new employer&amp;rsquo;s value is counted). The reduction in inefficient separations increases the joint match surplus. Formally, the extra surplus share comes from the term λe · [w&amp;rsquo;(F)/(δ+ρ+λe(1−F))] · [(δ+ρ+λe(1−F))/(δ+ρ+γ(F)+λe(1−F))] · Π(F,w(F)), which is the density of incoming offers per unit wage increase multiplied by the fraction of the match duration covered by the contracted wage, multiplied by the profit level lost at each marginal quit. This term is zero when γ → ∞ (continuous renegotiation) and is largest when γ = 0 (no renegotiation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is θ and what role does it play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;θ is defined for the homogeneous-productivity case as the expected fraction of the expected discounted match duration that an agreed wage remains in force. It captures the marginal relative importance of the current contracted wage versus the wage expectation (which governs future renegotiated wages). θ = 1 corresponds to no renegotiation (the contracted wage lasts the whole match), θ → 0 corresponds to continuous renegotiation. The renegotiation rate is γ(F) = [(1−θ)/θ] · (δ+ρ+λe(1−F)). A small increase in the wage by w&amp;rsquo;(F)dF decreases turnover by θ dF in the homogeneous case, so θ directly scales the turnover-retention channel and hence workers&amp;rsquo; effective surplus share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the calibration reveal about the relationship between renegotiation assumptions and inferred bargaining power?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Holding transition rates fixed (λu = 0.45, λe = 0.181, δ = 0.024 per month) and targeting a 2/3 labor share and a lognormal wage-offer distribution (σ = 0.16, mean-min ratio 1.7), the calibrated worker bargaining power β is 0.46 under continuous renegotiation (γ = ∞) and only 0.02 under no renegotiation (γ = 0). The calibrated productivity distribution also differs markedly: no-renegotiation requires a much fatter right tail in firm productivities to match the same wage distribution because the labor share falls sharply in the upper tail when bargaining power is low and wages are infrequent renegotiated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper prove about minimum wage spillovers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 2 proves that, holding firm entry constant and adjusting unemployment benefits to keep the reservation wage constant, a minimum wage increase raises the equilibrium wage distribution in the sense of first-order stochastic dominance. Proposition 3 proves that, comparing a high-commitment economy H (lower γH) and a low-commitment economy L (higher γL) that have identical pre-policy wage distributions (and therefore βH &amp;lt; βL), the high-commitment economy H exhibits strictly higher wages at every rank F after a small minimum wage increase. The mechanism is that a mass of workers at the minimum wage creates a dense region of outside options, making it worthwhile for firms to accept higher wages to reduce turnover — but only when committed wages are sticky enough to affect actual turnover.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What happens to the wage distribution spike at the minimum wage when renegotiation is frequent?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under the baseline assumption that workers move when indifferent (no mass points), the equilibrium has no spike; the mass at the minimum wage spreads continuously upward. When this assumption is relaxed and workers may stay when indifferent (following Shimer 2006), an equilibrium with a mass point at the minimum wage exists. Equation (19)/(20) show the equilibrium mass point at the minimum wage is increasing in the renegotiation rate γ (higher γ → larger spike). This occurs because with frequent renegotiation, spillovers above the minimum wage are small, so the density just above the minimum is high, which in turn supports a large mass at the minimum. The paper parameterizes this with φ = 0.04 (ratio of mass at minimum wage to density just above) and illustrates with θ = 0.02 (long contracts) and θ = 0.5 (short contracts).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does endogenizing the contract length change the predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When firms choose the renegotiation frequency after observing the match type, two regimes emerge. In the first, the firm would not benefit from raising the wage above the continuous-renegotiation Nash-bargaining level: this happens when worker bargaining power is sufficiently high or productivity increments are large relative to profits. Firms then choose continuous renegotiation (γ = ∞) for that match type. In the second regime, lower turnover makes it profitable to commit to a higher wage via a longer contract; firms pick an interior γ satisfying the envelope condition. With all bargaining power to the firm (β = 0), the optimum is no renegotiation (infinite contract length). The equilibrium in the endogenous-contract model satisfies a differential equation that coincides with the wage-posting model differential equation in the interior region, providing a microfoundation for wage-posting results even when workers have some bargaining power. The model also provides a uniqueness justification for equilibria in Coles (2001) and Coles and Mortensen (2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to Brügemann, Gautier, and Menzio (2015)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Brügemann, Gautier, and Menzio (2015) identify a similar surplus-retention mechanism in a model where a single firm bargains successively with many workers: agreeing on a high wage with one worker is &amp;lsquo;cheap&amp;rsquo; because the firm can recoup part of the cost through lower wages agreed with subsequent workers. Gottfries&amp;rsquo; mechanism is the bilateral analogue: within a single match, a higher wage is cheap because it reduces wasteful turnover and extends the profitable match duration. Both models generate workers capturing a surplus share above their primitive bargaining power, but through distinct channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What assumptions are needed for uniqueness and what relaxing them implies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two key restrictions are imposed. First, Markov strategies are required and wage functions must be weakly increasing in match type F; without this, equilibria exist in which workers accept lower-productivity jobs for a higher current wage, creating decreasing wage functions. Second, workers must move with positive probability when indifferent between offers, which eliminates mass points on the support. Shimer (2006) showed that when indifferent workers never move, multiple equilibria with mass points exist. Relaxing the second restriction opens the door to a spike at the minimum wage in the minimum wage application. Alternative refinements — large-friction limits, the limiting case as λe → 0, or as β → 0 — all single out the same unique equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main policy implication is that the spillover effects of minimum wage increases depend critically on the degree of wage commitment in the labor market. In economies where wages are rarely renegotiated (higher θ), minimum wage increases spread substantially up the wage distribution; in economies with continuous renegotiation, only a spike at the minimum results with little or no spillover. This has direct implications for empirical studies of minimum wages: the observed pattern of spillovers is informative about the prevailing renegotiation regime. The scope conditions are: (i) partial equilibrium (firm entry and reservation wage are held fixed); (ii) all matches remain profitable at the minimum wage (wmin &amp;lt; x(0)); (iii) random rather than directed search. The paper does not provide an empirical test or identification strategy for the renegotiation frequency itself.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the limits and caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model treats the renegotiation frequency as an exogenous parameter (except in Section 6). The calibration does not structurally identify the renegotiation frequency from data; it instead illustrates sensitivity. The analysis of minimum wages is partial equilibrium — firm entry and reservation wages are held fixed — and the paper notes that general equilibrium effects (entry, reservation wages) are ambiguous in sign and difficult to identify empirically. The model has no on-the-job search effort endogeneity or worker heterogeneity (workers are homogeneous ex ante). The wage-posting and counteroffers models studied in the literature require strong commitment assumptions that this model relaxes but does not fully endogenize in a dynamic contracting sense.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Renegotiation (frequency parameter γ)&lt;/strong&gt;: The Poisson rate at which a contracted wage expires and a new wage is bargained. In the paper&amp;rsquo;s own sense, γ indexes the degree of wage commitment: γ = 0 means the contracted wage lasts the entire match (perfect commitment, no renegotiation); γ → ∞ means the wage is continuously reset (no commitment). The frequency γ governs how much the contracted wage — versus future renegotiated wages — matters for the worker&amp;rsquo;s turnover decision, and hence how much of the match surplus the worker captures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bilateral inefficiency of transitions&lt;/strong&gt;: The paper defines a job-to-job transition as bilaterally inefficient when the value to the worker at the new job is less than the total surplus of the existing match. Since the firm loses its profits when the worker quits, the pair jointly would prefer the worker to stay — yet the worker moves whenever her individual value is higher elsewhere. The gap between individual and joint incentives is the source of bilateral inefficiency; it is what makes turnover-reduction through higher wages mutually beneficial and gives workers extra bargaining power beyond β.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Match type (F) and wage expectation&lt;/strong&gt;: In the model, F is a match quality drawn from the uniform distribution on [0,1] upon meeting. F determines both the productivity x(F) and the wage expectation — the anticipated outcome of future renegotiations. Critically, the wage expectation is the payoff-relevant state variable that differs across types and thereby distinguishes matches, restoring equilibrium uniqueness. Higher F is associated with higher wage expectations, lower turnover, and greater match surplus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Commitment parameter (θ)&lt;/strong&gt;: Defined for the homogeneous-productivity case as the expected fraction of the expected discounted match duration for which the currently agreed wage remains in force. θ = 1 corresponds to no renegotiation; θ → 0 to continuous renegotiation. A one-unit wage increase reduces turnover by θ in equilibrium, so θ directly scales the turnover-retention channel and the extra surplus share flowing to workers beyond their primitive bargaining power β.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minimum wage spillover&lt;/strong&gt;: The paper uses &amp;lsquo;spillover&amp;rsquo; to mean the upward shift in wages paid by firms above the minimum wage that results from a minimum wage increase. Mechanically, a minimum wage creates a mass of workers at the floor; if turnover responds to wages (i.e., commitment is high), firms above the minimum prefer to raise wages to avoid losing workers to the mass point competitors, spreading the effect. The paper proves (Proposition 3) that spillovers are strictly larger in higher-commitment (lower γ) economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markov-perfect equilibrium (MPE) of the bargaining game&lt;/strong&gt;: The equilibrium concept applied to the alternating-offers bargaining game. In an MPE, offer and acceptance rules depend only on the current match type F, not on prior bargaining history. This restriction, combined with the renegotiation structure, is what allows the paper to derive a unique differential equation for the wage function w(F) and a unique initial condition, yielding the unique equilibrium wage distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Turnover-retention channel&lt;/strong&gt;: The mechanism by which a higher contracted wage reduces the worker&amp;rsquo;s quit probability and thereby increases the joint match surplus. Because marginal quits are bilaterally inefficient, a small wage increase generates a surplus gain proportional to the density of arriving outside offers times the expected fraction of the match covered by the contracted wage times firm profits — exactly the extra term that elevates the worker&amp;rsquo;s effective surplus share above β. This channel is the paper&amp;rsquo;s central contribution to understanding why workers capture more than their bargaining power suggests.&lt;/p&gt;</description></item><item><title>Capital Flows and the Global Collateral Cycle</title><link>https://macropaperwarehouse.com/papers/capital-flows-and-the-global-collateral-cycle/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/capital-flows-and-the-global-collateral-cycle/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;The paper asks why large gross financial flows exist between similarly rich countries (especially the U.S. and Europe), why financial integration raises rather than lowers asset price volatility, and why safe-asset prices rise during crises. The authors argue that cross-country disparities in collateral technology — the capacity to securitize domestic assets into state-contingent tranches — can account for all three phenomena simultaneously, without invoking differences in preferences, endowments, production technologies, or idiosyncratic shocks.&lt;/p&gt;
&lt;p&gt;The model is a two-country (Home = U.S., Foreign = Europe) collateral general equilibrium model built on Geanakoplos (2003). Agents within each country are risk-neutral but heterogeneous in beliefs (indexed by optimism parameter i). The only asymmetry across countries is the collateral technology: Home collateral can back any state-contingent promise (tranching), while Foreign collateral can back only non-contingent debt (leverage). Both countries share common shocks. Collateral requirements are endogenously determined in equilibrium. The authors first characterize static autarky and integrated equilibria analytically, then simulate a three-period dynamic model calibrated with dUU = dDU = 1 and dDD = 0.2.&lt;/p&gt;
&lt;p&gt;In the static numerical example (dD = 0.2, uniform beliefs γ(i) = i), Foreign autarky yields an asset price of p* = 0.75 with marginal buyer i&lt;em&gt;₁ = 0.69. Home autarky yields a higher asset price of p = 0.83 (marginal buyers i₁ = 0.65, i₂ = 0.10) and a D-tranche price of πT = 0.18. In international equilibrium, the Home price rises further to p̂ = 0.86, the Foreign price falls to p̂&lt;/em&gt; = 0.73, and the D-tranche price rises to π̂T = 0.19. Financial integration moves identical-payoff asset prices further apart (Proposition 2), and the Law of One Price fails with a strictly positive collateral gap Δ̂ = p̂ − p̂* = dD(γ(î₁) − γ(î₂)) (Proposition 1).&lt;/p&gt;
&lt;p&gt;In the dynamic three-period model (dDD = 0.2), the Foreign autarky leverage cycle produces a 25% asset price fall from p&lt;em&gt;₀ = 0.96 to p&lt;/em&gt;D = 0.72 after scary bad news. The Home autarky securitization cycle produces a larger 39% fall from p₀ = 1.21 to pD = 0.74. Financial integration amplifies both: the Home price in international equilibrium starts higher at p̂₀ = 1.40 and falls 44% to p̂D = 0.79; the Foreign price falls from p̂&lt;em&gt;₀ = 0.91 to p̂&lt;/em&gt;D = 0.68 (25%), both crashes exceeding their autarky counterparts. The collateral gap is pro-cyclical, falling from Δ̂₀ = 0.49 at s=0 to Δ̂D = 0.11 at s=D. Gross flows are also pro-cyclical: Home gross inflows drop from 0.266 to 0.173 and gross outflows from 0.378 to 0.215 from the good to the bad state. The trade balance deficit collapses from TBH₀ = 0.12 to TBH_D = 0.04. Meanwhile, the Arrow D security (the negative beta, super-safe tranche) rises in price counter-cyclically from π̂⁰_D = 0.85 to π̂^D_D = 0.96 in international equilibrium, and is always priced higher in international equilibrium than in Home autarky.&lt;/p&gt;
&lt;p&gt;Four mechanisms drive the results. First, the collateral value premium: tranching splits cash flows to serve heterogeneous buyers and raises asset prices above the unsecuritized level, producing a law-of-one-price failure. Second, bidirectional gross flows: Foreign investors demand Arrow D tranches available only from Home; Home investors buy cheap Foreign bonds because the basis (price of replicating Arrow portfolio minus price of non-contingent Foreign bond) is positive. Third, a permanent trade deficit for Home: Home&amp;rsquo;s collateral-driven wealth advantage (Corollary 2) generates higher consumption purchases in every state, and the trade deficit equals eY·Δ̂/(2e_c0 + eY(p̂+p̂*)) in all states. Fourth, the Global Collateral Cycle: scary bad news curtails the feasibility of creating negative beta tranches, making Home&amp;rsquo;s effective collateral advantage procyclical even though the technology itself is fixed, driving procyclical gross flows and trade imbalances and counter-cyclical safe-asset prices through a supply channel that complements the conventional demand-side flight-to-safety.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What drives gross financial flows in both directions between two otherwise identical countries?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Foreign agents demand Arrow D securities (negative beta tranches) that only Home can produce via its superior collateral technology. This generates gross inflows to Home. Simultaneously, Home agents buy Foreign bonds because the basis is positive — the foreign non-contingent bond trades cheaper than a replicating portfolio of Arrow securities produced at Home. This generates Home gross outflows. Both directions arise purely from the collateral technology disparity, with no role for interest rate differentials, endowment differences, or idiosyncratic shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the Law of One Price failure and how is it characterized analytically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 1 establishes that in any international equilibrium, the collateral gap Δ̂ = p̂ − p̂* = dD(γ(î₁) − γ(î₂)) &amp;gt; 0. Two assets with identical payoffs trade at different prices because the Home asset can be tranched into state-contingent claims sold to different buyers, generating a collateral value premium, while the Foreign asset can only back non-contingent debt. Corollary 1 shows the basis β = π̂U + π̂D − 1 &amp;gt; 0 and Δ̂ = dD·β, linking both deviations to the degree of collateral technology advantage measured by dD.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does Home run a permanent trade deficit and how large is it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 5 proves that in the home-biased neutral international equilibrium, Home runs a trade deficit in every state (0, U, D). Because financial integration raises Home asset prices (Proposition 2), Home agents are wealthier in every state (Corollaries 2 and 3). By homotheticity, Home purchases more of every good, including foreign consumption goods. The deficit at s=0 equals eY·Δ̂ / (2e_c0 + eY(p̂+p̂*)) = eY·dD·β / (same denominator). This mechanism does not require Home to have a lower interest rate or higher saving — the collateral advantage directly raises Home&amp;rsquo;s permanent wealth. In the numerical example, TBH₀ = 0.12.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does financial integration increase asset price volatility rather than reduce it through diversification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Integration raises the collateral value of Home assets at s=0 because Foreign demand for D tranches is added to domestic demand, pushing prices to a higher starting point (p̂₀ = 1.40 vs. p₀ = 1.21 in Home autarky). After scary bad news, the same Securitization Cycle dynamic that would reduce Home prices in autarky now operates from a higher starting point and propagates to Foreign asset prices, because Foreign assets are priced relative to Home assets. Price crashes deepen: Home falls 44% in IE versus 39% in autarky; Foreign falls 25% from a lower s=0 base. The collateral gap and the volume of negative beta assets that can be created both collapse after bad news, reinforcing the price drop.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the supply channel for safe-asset price appreciation during crises, and how does it differ from the flight-to-safety demand channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The supply channel works through the endogenous collapse in the quantity of Arrow D (negative beta) securities created from Home collateral after scary bad news. Since the collateral&amp;rsquo;s worst-case payoff worsens at s=D, fewer Arrow D securities can be guaranteed per unit of collateral, even though the technology itself is unchanged. The reduced supply — combined with persistent demand from pessimistic agents — drives up the Arrow D price (from 0.85 to 0.96 in the IE numerical example). This contrasts with the conventional flight-to-safety demand channel, in which agents shift demand toward safe assets due to heightened risk aversion. Both channels operate simultaneously in the model: the wealth redistribution toward pessimists at s=D also raises aggregate effective risk aversion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does Home&amp;rsquo;s collateral technology advantage create exorbitant privilege?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The exorbitant privilege arises because only Home can create negative beta (Arrow D) securities, but both Home and Foreign agents demand them. In international equilibrium the Arrow D price is always higher than in Home autarky — Foreign demand adds to domestic demand while supply remains constrained by Home collateral. This means Home&amp;rsquo;s collateral generates a rent above the payoff value. In turn, Home is wealthier in every state and can run a permanent trade deficit, receiving more consumption goods from the world in exchange for financial claims that in aggregate pay less (because distinct buyers value distinct tranches more than the aggregate). The collateral gap measuring this privilege is larger in IE than the autarky spread, and it is pro-cyclical — largest in good times.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is &amp;lsquo;scary bad news&amp;rsquo; and why does it create amplified price crashes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Scary bad news is a shock at s=D that simultaneously (i) worsens expected payoffs and (ii) raises downside variance, so the collateral&amp;rsquo;s worst-case value from D is much lower (dDD = 0.2 versus dUU = 1). In Foreign autarky this reduces the maximum non-contingent debt that can be collateralized, sharply reducing leverage and hence the price of risky assets beyond what the direct dividend news implies — the Leverage Cycle of Geanakoplos (2003). In Home autarky the same scary news reduces the quantity of Arrow D securities that can be created, causing an even larger asset price crash — the Securitization Cycle of Fostel and Geanakoplos (2012a). In international equilibrium both cycles interact, as the higher collateral values at s=0 unwind more sharply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What refinement resolves multiplicity in the international equilibrium and what does it imply for gross flows?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because Home and Foreign consumption goods and Arrow U securities are perfect substitutes under linear utility, the international equilibrium has a continuum of solutions for individual portfolio allocations. The authors introduce a &amp;lsquo;home-biased neutral&amp;rsquo; refinement in two steps: first, &amp;rsquo;neutrality&amp;rsquo; selects the allocation where agents seeking proportional payoffs hold proportional portfolios (this is justified as the limit of small perturbations breaking perfect substitutability); second, &amp;lsquo;home bias&amp;rsquo; requires each agent to hold all domestic goods before holding foreign ones, minimizing the scale of gross flows. Even under this most conservative refinement, Propositions 3 and 4 establish that Home is a seller of Arrow D and net seller of Arrow U securities (gross inflows) and a buyer of Foreign bonds (gross outflows), and Proposition 5 establishes the permanent trade deficit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from the prior global imbalances literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The standard literature (Caballero-Farhi-Gourinchas 2008, Mendoza-Quadrini-Rios-Rull 2009, Angeletos-Panousi 2011) explains capital flows via differences in insurance capacity or financial development that affect autarkic savings rates and interest rates, generating primarily net capital flows and current account imbalances. Maggiori (2017) assumes Home financiers face weaker borrowing constraints, allowing them to absorb aggregate risk. The present paper differs: (i) all investment returns and insurance possibilities are identical across countries — only the collateral technology differs; (ii) the paper focuses on gross flows, which dwarf net flows; (iii) flows are driven by positive-supply collateral-backed cash flows, not zero-supply Arrow securities; (iv) financial integration increases rather than decreases volatility (contra Mendoza-Quadrini 2010 who find integration attenuates U.S. crisis severity); (v) the mechanism generates violations of the Law of One Price, not just interest rate differentials.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main testable implications and what data would be needed to test them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section V lists eight testable implications: (1) securitization raises collateral prices relative to identical unsecuritized foreign collateral, testable via option-adjusted spreads on mortgages versus sovereign bonds across countries; (2) larger securitization gaps predict larger gross flows in both directions, requiring data on cross-border securitization trades; (3) larger securitization gaps predict larger trade imbalances; (4) larger collateral technology gaps increase global asset price volatility in both countries; (5) changes in financial integration affect price volatility; (6) larger technology gaps increase pro-cyclicality of gross and net flows; (7) larger gaps increase counter-cyclicality of super-safe asset prices; (8) changes in financial integration affect flow cyclicality. The authors note that cross-border securitization trade data are currently scarce and call for a taxonomy of collateral structures and volumes by country as a preliminary step.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What scope conditions and extensions are discussed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model abstracts from production and investment, so results apply to the trade balance not the current account. The authors conjecture that adding production (cf. Fostel-Geanakoplos 2016) would reinforce Home&amp;rsquo;s current account deficit via collateral-driven over-investment. There are no exchange rates; the conjecture is that differentiated goods would imply a stronger Home currency, connecting to the exorbitant privilege literature (Gourinchas-Rey 2022, Jiang-Krishnamurthy-Lustig 2024). All agents are risk-neutral, which makes equilibria tractable but rules out curvature-based risk-sharing motives; the authors interpret heterogeneous optimism as a proxy for heterogeneous risk aversion or hedging mandates. Shocks are common, not idiosyncratic; idiosyncratic shocks would add further risk-sharing motives on top of the collateral channel but the authors argue their mechanism is conceptually distinct. Partial correlation of asset payoffs across countries is considered in an appendix extension and shown to reinforce the main results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper handle the relationship between the collateral technology and the quantity of safe assets in the cycle?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key insight is that while the collateral technology (the set of contracts J available) is fixed across the cycle, the amount of negative beta assets that can actually be created varies endogenously with the collateral&amp;rsquo;s payoff characteristics. At s=0, with a worst-case payoff dD = p*D = 0.72 for the dynamic problem, substantial Arrow D securities can be created. At s=D, the worst-case payoff is dDD = 0.2, drastically curtailing the feasible quantity of Arrow D securities per unit of collateral. This procyclical variation in effective securitization capacity, driven by scary bad news, is what generates the Global Collateral Cycle — the collateral technology itself is constant but the &amp;lsquo;room&amp;rsquo; to use it varies with macroeconomic conditions.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Collateral technology&lt;/strong&gt;: The legally enforceable set J of financial contracts that can be created using a domestic asset as collateral; in the paper it determines whether an asset can back state-contingent (tranching, Home) or only non-contingent (leverage, Foreign) promises, and it applies only to domestic collateral because enforcement depends on domestic courts and legal infrastructure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative beta asset (super safe asset)&lt;/strong&gt;: A financial asset whose price typically rises when aggregate conditions worsen; in the model this is the Arrow D security (a tranche promising payment only in the bad state D), whose real-world analogues include AAA securitization tranches and U.S. Treasuries. In the paper&amp;rsquo;s static model, the D-tranche price rises from 0.74 to 0.92 in Home autarky after bad news, and from 0.85 to 0.96 in international equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral gap (Δ̂)&lt;/strong&gt;: The equilibrium price difference p̂ − p̂* between identical-payoff assets in Home and Foreign arising purely from the difference in collateral technologies; always strictly positive in international equilibrium and equal to dD(γ(î₁) − γ(î₂)), measuring the collateral value premium of the Home asset. In the dynamic model it falls pro-cyclically from 0.49 at s=0 to 0.11 at s=D.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Basis (β)&lt;/strong&gt;: The premium of a replicating portfolio of Arrow securities over a non-contingent bond with the same aggregate payoff: β = π̂U + π̂D − 1; always positive in international equilibrium and equal to Δ̂/dD, reflecting that contingent claims backed by Home collateral command a higher combined price than their non-contingent Foreign equivalent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scary bad news&lt;/strong&gt;: A negative shock that simultaneously lowers expected payoffs and raises downside variance, so that the collateral&amp;rsquo;s worst-case value from the bad state is lower than from the initial state; following Geanakoplos (2003, 2010), this type of news causes endogenous collapses in leverage and securitization volume beyond what the fundamental payoff news alone would imply, generating amplified asset price crashes and the leverage/securitization cycle dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global Collateral Cycle&lt;/strong&gt;: The international financial cycle generated by the interaction of disparate collateral technologies and scary bad news: in the down phase, the feasible quantity of Home-created negative beta assets falls (supply contraction), the collateral gap shrinks, gross flows collapse, trade imbalances narrow, risky asset prices crash further than in autarky in both countries, and safe-asset prices rise above their autarky levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral value&lt;/strong&gt;: The component of a risky asset&amp;rsquo;s equilibrium price that exceeds its expected payoff value and arises from the asset&amp;rsquo;s capacity to serve as collateral backing contingent financial promises; it is positive when heterogeneous buyers are willing to pay a combined premium for distinct tranches relative to what a single buyer would pay for the undivided asset, as in the floater/inverse-floater securitization example described in the paper.&lt;/p&gt;</description></item><item><title>Codification, Technology Absorption, and the Globalization of the Industrial Revolution</title><link>https://macropaperwarehouse.com/papers/codification-technology-absorption-and-the-globalization-of-the-industrial-revolution/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/codification-technology-absorption-and-the-globalization-of-the-industrial-revolution/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Why did the First Industrial Revolution (IR) spread to Meiji Japan—and to essentially no other non-Western country—during the first wave of globalization? The paper tests Mokyr&amp;rsquo;s hypothesis that &amp;ldquo;technical literacy,&amp;rdquo; i.e., the codification of engineering, commercial, and industrial knowledge in the local vernacular, was a necessary condition for absorbing IR technologies. The motivating puzzle: after opening to trade (1858) and the Meiji Restoration (1868), 80% of Japanese exports were still primary products as late as ~1883 and real per capita GDP growth was only 0.6%/yr (1870-1883/85); then in a brief 13-year window (1883-1896) the manufacturing export share tripled and stabilized at around 60% of exports until WWII.&lt;/p&gt;
&lt;p&gt;Data and setup: The authors build several novel datasets. (1) A cross-language measure of codification: scraping national/major libraries and WorldCat for technical books (agriculture, applied sciences, commerce, industry, technology) in 33 languages, 1500-1930. (2) &amp;ldquo;British Patent Relevance&amp;rdquo; (BPR): the cosine similarity (TF-IDF, unigrams+bigrams) between the digitized synopses of all British patents 1780-1852 (from Woodcroft 1857) and a hand-curated corpus of 460 English-language 19th-century technical manuals matched to SITC industries. BPR measures the world supply of codifiable IR knowledge by industry and is deliberately not based on what Japan translated (to avoid endogeneity). (3) The first harmonized, bilateral, industry-level trade dataset for the 19th century: 37 regions, 93 industries, quinquennial 1880-1910, built from reporting countries Japan, US, Belgium, Italy. Outcomes are annualized industry export growth ({1880,1885} to {1905,1910}) and, in robustness, productivity/comparative-advantage growth following Costinot et al. (2012) and Amiti-Weinstein (2018).&lt;/p&gt;
&lt;p&gt;Main findings (with magnitudes): A Japanese industry with a one-standard-deviation higher BPR experienced annual export growth ~12 percentage points faster and annual productivity (comparative-advantage) growth ~1.2 percentage points faster (coefficients 0.121*** and 0.012***). Cross-sectionally, the BPR-growth relationship is positive and significant only for Japan and other codifying countries: for non-Japan regions the BPR coefficient is negative (-0.030***), while English-, French-, and the &amp;ldquo;top-4 codified&amp;rdquo; (English/French/German/Italian) regions show positive coefficients (0.042**, 0.032**, 0.078***), smaller than Japan&amp;rsquo;s. Low-income and Asian regions tend negative (divergence), not always significant. Time-series: regressing Japanese export growth from 1875 to varying end-years, the BPR coefficient is negative/significant in the 1875-1880 placebo window (Japan resembled the periphery), flips around 1890, and is positive and significant at 1% by 1895—coinciding with Japan&amp;rsquo;s catch-up in codification.&lt;/p&gt;
&lt;p&gt;Mechanism and the Meiji &amp;ldquo;natural experiment&amp;rdquo;: In 1870, 84% of all technical books were in four languages (English, French, German, Italian); an Arabic-only reader had access to just 71 technical books. Japan started ordinary but codified explosively: technical-book growth jumped from 1.6%/yr (1600-1860) to 8.8%/yr (1870-1900); translated technical books rose from 8 (1500-1860) to 608 by 1900; Japanese technical books in the NDL grew from 706 (1880) to 2,823 (1890). State provision solved a public-goods/coordination problem: the government built English-Japanese dictionaries (ETSJ 1862/1866, FSEJ 1871) creating standardized Japanese jargon from Chinese glyphs, and 74% of identified technical-book translators (1870-1885) were government employees. Implication: low-cost vernacular access to technical knowledge was a necessary (not sufficient) condition for IR diffusion; where regions were linguistically/geographically distant from Western Europe, codification required state provision (a Gerschenkronian role for the state).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two-pronged. (1) Cross-sectional: regress region-industry export growth on BPR interacted with region-group dummies, with exporter fixed effects, exploiting that BPR is global (not Japan-specific) and that Japan was uniquely a codifier in the periphery. If codification is the mechanism, only codifying regions should show a positive BPR-growth link. (2) Time-series: exploit the sharp timing of Japanese codification (two well-demarcated periods—pre vs. post technical literacy in the 1880s) by estimating the BPR coefficient on Japanese export growth from 1875 to rolling end-years. The 1875-1880 window serves as a placebo (Japan not yet literate). Main threat is omitted-variable bias: that BPR is correlated with distance to the technology frontier, fundamental comparative advantage, Meiji institutional reforms, or industry steam-intensity. The cross-section addresses the &amp;lsquo;BPR matters everywhere&amp;rsquo; and income/geography confounds; the timing addresses slow-moving confounds (literacy, Tokugawa culture, gradual reforms) since reforms like tax/banking/railroads were mostly in place by 1875, 15-37 years before the BPR effect appears.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are the cross-section and time-series results distinguished from confounders empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the cross-section, income terciles (High/Medium/Low) and an Asia dummy are added: no region group replicates Japan&amp;rsquo;s positive pattern; the poorest and Asian regions show negative (divergence) coefficients. The placebo (1875-1880) yields a negative significant BPR coefficient for Japan itself—identical in sign to non-codifiers—then flips positive/significant by 1895, which conventional &amp;lsquo;opening to trade&amp;rsquo; (1858) or &amp;lsquo;Meiji Restoration&amp;rsquo; (1868) stories cannot explain because the effect appears 37 and 27 years later, respectively.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Japan&amp;rsquo;s BPR coefficient is larger (though not always significantly) than that of European codifiers, consistent with Japan having more to learn from British patents as a late industrializer. Among non-codifiers, low-income and Asian regions show negative BPR-growth relationships (divergence). Within codifiers, English- and French-speaking regions individually have positive but smaller and less precisely estimated coefficients; pooling the top-4 codified languages sharpens significance (0.078***). The time-series point estimates for Japan slowly decline after 1900 (not significantly), consistent with Japan shifting to Second Industrial Revolution technologies and becoming less reliant on older IR ones.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Alternative patent corpora: results are nearly identical using British patents 1853-1879 (full text and AI-summarized) and US patents 1836-1860 and 1861-1879 (coefficients 0.121, 0.116, 0.111, 0.115), though later/US patents lower the R-squared, suggesting the 1780-1852 IR patents best explain Japanese export growth. (2) Productivity instead of exports (Costinot et al. 2012 comparative-advantage growth): qualitatively the same, 1.2 pp/yr for a 1-SD BPR increase, with deterioration in non-codifiers. (3) Confounders: controlling for British-colony status (insignificant) and industry steam-power intensity (French 1860s data) does not affect results. (4) Sample selection: dropping non-manufacturing sectors, excluding Asian destination markets, and dropping major export products (textiles, iron/metal) all leave the results intact, indicating broad-based change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on Mokyr (2011) on &amp;rsquo;technical knowledge&amp;rsquo;/&amp;lsquo;access costs&amp;rsquo; for European industrialization, extending it outside Europe with a Gerschenkronian twist (state as provider of the codification public good). It contributes to the technology-adoption-lags literature (Comin and Hobijn 2010; ~45-year average lags) by offering a friction explanation. It departs from prior Meiji studies (Sussman-Yafeh 2000; Tang; Morck-Nakamura; Bernhofen-Brown) that found banking, railroads, constitutional/monetary reforms had little measurable growth impact—offering codification as the resolution to &amp;lsquo;what drove the Meiji Miracle,&amp;rsquo; consistent with Broadberry et al. (2025) dating Japan&amp;rsquo;s convergence to ~1890 driven by manufacturing productivity. It also extends the knowledge-codification literature (Dittmar 2011; Brown 2024; Abramitzky-Sin 2014) by linking codified vernacular knowledge directly to industry growth rather than indirect outcomes like city growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Public provision of technical knowledge in the vernacular can relax a critical bottleneck to industrialization, especially for regions linguistically/geographically distant from the technology frontier where the market undersupplies this public good. Scope conditions: codification is necessary but NOT sufficient. The Meiji model required complementary investments—language/jargon standardization, mass education for absorptive capacity (literacy &amp;gt;90% for army conscripts by 1909; ~40% of elementary class time on science), tacit-knowledge acquisition (2,400 hired foreigners providing 9,506 person-years of training; study-abroad missions), and tax capacity (1873 Land Tax Reform). China&amp;rsquo;s post-1949 codification under Zhou did not yield sustained growth until Maoist policies (Great Leap, Cultural Revolution) ended—&amp;rsquo;the exception that proves the rule.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What external-validity evidence is offered beyond Japan?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Meiji codification model was studied and transplanted by Park Chung Hee in South Korea (took power 1961; KIST; researcher counts rose sharply) and Zhou Enlai in China (premier 1949; Russian-language translation drive with USSR as the &amp;lsquo;Britain&amp;rsquo;). In 1950, Japan had ~70,000 technical books, China ~1,000, Korea &amp;lt;100; China surpassed 30,000 by the early 1960s. Korea&amp;rsquo;s per capita income clearly rises after Park; China&amp;rsquo;s codification did not translate into growth until after 1976. These are explicitly presented as suggestive/non-causal, plus appendix discussions of British India and Late Imperial Russia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable caveats and measurement choices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;BPR uses British 1780-1852 patent synopses and English manuals deliberately (Britain as IR leader; Japan hired British instructors and used British textbooks; avoids endogeneity from Japanese translation choices). It excludes tacit knowledge and secrecy-protected innovation by design. English codification is likely underestimated (British Library was un-scrapable after a 2023 cyberattack; Library of Congress used instead). German patents/trade data were excluded for coverage/reliability reasons. Linguistic-distance evidence on 1870/1913 GDP is explicitly not interpreted causally. The aggregate growth correlations for Japan, Korea, and China are described as suggestive, not causal.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Codification (of technical knowledge)&lt;/strong&gt;: The creation of a means of transmitting engineering, commercial, and industrial knowledge—via language creation and written messages (manuals, textbooks, dictionaries)—that does not require direct contact between the knowledge originator and the recipient (Cowan and Foray 1997). In the paper&amp;rsquo;s sense it is a non-rival public good that the market undersupplies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technical literacy / technical knowledge&lt;/strong&gt;: Following Stevens (1995) and Mokyr, the codified engineering, commercial, and industrial practices a practitioner needs to set up and run modern factory-based manufacturing; the paper measures it as the stock of vernacular technical books (agriculture, applied sciences, commerce, industry, technology), excluding theoretical/hard-science and non-firm subjects like medicine.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;British Patent Relevance (BPR)&lt;/strong&gt;: An industry-level measure equal to the cosine similarity (TF-IDF weighted) between the vectorized text of British patent synopses (1780-1852) and the vectorized text of English technical manuals for that industry; it proxies how much codifiable IR knowledge a given industry stood to gain, and is independent of what was actually translated into Japanese.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Access costs&lt;/strong&gt;: Mokyr&amp;rsquo;s (2011) term for the cost of obtaining usable technical knowledge; the paper argues vernacular codification (dictionaries, translations) lowered these costs, and that linguistic distance from English/Latin-Greek roots and physical distance from Europe raised them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technology absorption / absorptive capacity&lt;/strong&gt;: The complementary conditions needed to use codified knowledge—prior language/jargon development, literacy and scientific training, and tacit knowledge—all of which the Meiji state invested in (dictionaries, compulsory education, &amp;rsquo;live machines&amp;rsquo;/foreign instructors, study-abroad missions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Defensive modernization (Gerschenkronian state role)&lt;/strong&gt;: The paper&amp;rsquo;s reading that an existential external threat aligned the Japanese elite behind aggressive state-led adoption of Western science, casting the state as the critical agent supplying the codification public good in late industrialization—a Gerschenkronian extension of Mokyr applied outside Europe.&lt;/p&gt;</description></item><item><title>Corrigendum to "Job Ladders by Firm Wage and Productivity" [Review of Economic Dynamics 58C (2025) 101307]</title><link>https://macropaperwarehouse.com/papers/corrigendum-to-job-ladders-by-firm-wage-and-productivity-review-of-economic-dynamics-58c-2025-101307/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/corrigendum-to-job-ladders-by-firm-wage-and-productivity-review-of-economic-dynamics-58c-2025-101307/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. On-the-job search models typically organize firms along a &amp;ldquo;job ladder&amp;rdquo; — a common ranking by workers of available jobs — but they disagree on whether the rung is best captured by a firm&amp;rsquo;s average wage or its productivity, and empirical guidance has been scarce. Bertheau and Vejlin ask: (i) Is average wage or productivity the better empirical measure of a firm&amp;rsquo;s location on the job ladder? (ii) How does job creation across these ladders vary in the cross-section and over the business cycle? (iii) Do recessions slow reallocation into better firms (a &amp;ldquo;sullying&amp;rdquo; effect) or speed it up (a &amp;ldquo;cleansing&amp;rdquo; effect)? This matters for models of aggregate labor-market fluctuations and any imperfect-labor-market model that assumes some jobs are more desirable than others.&lt;/p&gt;
&lt;p&gt;Data and strategy. The authors build matched employer-employee data from Danish administrative registers covering all employment relationships at DAILY frequency from 1992 to 2013, merged with firm financial-accounting data (sales, value added, capital stock, FTE employment, workforce composition). The sample is restricted to manufacturing, services, and trade (industries present from 1992); aggregate unemployment ranges from 3% to 10% over the period, spanning several recessions. Daily timing removes the time-aggregation bias of quarterly data (Bertheau and Vejlin 2022 show quarterly data overstate the EE transition rate by ~30%). Firms are ranked within industry-year cells by (a) residualized average hourly wage and (b) total factor productivity (TFP) estimated via the Olley-Pakes (1996) control-function approach (investment data available from 1999). Following Haltiwanger et al. (2018b), &amp;ldquo;low&amp;rdquo; firms are the bottom employment-weighted quintile and &amp;ldquo;high&amp;rdquo; firms the top two quintiles. Net employment change is decomposed into a net poaching (employer-to-employer/EE) channel and a net nonemployment channel; EE transitions are direct moves with under seven days of nonemployment. Taber and Vejlin (2020) find 80% of EE transitions are voluntary, so poaching flows reveal worker preferences. Cyclical indicators are the change in the unemployment rate (first difference) and the level (HP-filtered deviation from trend).&lt;/p&gt;
&lt;p&gt;Main findings (magnitudes). (1) Productivity is the better job-ladder measure. Residualized wage and TFP are only weakly correlated (Spearman 0.32). Cross-sectionally, the high-vs-low gap in net job creation is far larger for TFP (0.52% vs -0.39%) than for wages (0.26% vs 0.22%), and the net-poaching differential is larger for productivity (0.75%) than wages (0.61%), since workers move up the productivity ladder faster than the wage ladder. (2) Cyclicality differs by ladder. A one-percentage-point rise in the CHANGE in unemployment raises the high-low differential job-creation rate by 0.30 pp for TFP — about 32% of the average TFP differential — driven entirely by the nonemployment channel (0.38 pp), while the poaching channel pulls the opposite way (-0.08 pp). This is a cleansing effect: low-productivity firms both fire more workers to nonemployment AND stop hiring from nonemployment in recessions. For the WAGE ladder the total differential instead contracts by 0.08 pp, because high-wage firms stop poaching (-0.21 pp) — the wage ladder breaks down (a sullying effect). (3) Measurement matters. Using sales per worker instead of TFP yields 0.12 pp on the change-in-unemployment indicator (~40% smaller than TFP&amp;rsquo;s 0.30), and with the LEVEL of unemployment the sign flips: TFP gives +0.11 pp but sales per worker gives -0.08 pp — matching Haltiwanger et al. (2021) on US LEHD data, implying their result reflects sales-per-worker proxying, not a US-Denmark difference.&lt;/p&gt;
&lt;p&gt;Implications. Productivity (not the spot wage) is what workers climb toward, consistent with sequential-auction/outside-option models (Postel-Vinay and Robin 2002). Business-cycle labor models need endogenous hiring rates, since firms shut down hiring rather than only firing in recessions.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical strategy for ranking firms, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Firms are ranked within 2-digit NACE industry-year cells (68 industries) on two dimensions: (a) residualized average hourly wage (regressing firm average wage on workforce tenure, education, age, gender, plus year FE) and (b) TFP from an Olley-Pakes (1996) control-function production function using value added, capital stock, FTE employment, and workforce composition, estimated separately by industry. Quintiles are employment-weighted, so results are interpreted as effects on the average worker. To avoid reclassification bias, firms are ranked on year t-1 measures for flows in year t. Threats: (i) Olley-Pakes uses investment as the productivity proxy but investment data exist only from 1999, so coefficients are estimated post-1999 and back-applied, assuming production technology did not change materially over 1992-2013 — an explicit assumption. (ii) They cannot use Ackerberg-Caves-Frazer (2015) or Levinsohn-Petrin (2003) because detailed intermediate-input data are missing for most firms/years. (iii) AKM firm fixed effects are avoided because the large share of small firms induces limited-mobility bias; residualized average wages are used instead (Haltiwanger et al. 2021 find no difference between AKM FE and average wages). Results are robust to an unresidualized wage measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two channels and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Net job creation is decomposed as Net Job Creation = Net Poaching (EE hires minus EE separations) + Net Nonemployment (hires from minus separations to nonemployment). EE/poaching transitions are direct employer changes with fewer than seven days of nonemployment between jobs (threshold varied, results similar). Poaching flows are treated as primarily voluntary (80% per Taber and Vejlin 2020), so they reveal the job ladder; nonemployment flows capture involuntary separations and hiring from the jobless pool. The daily data are essential to cleanly separate EE moves from moves through a nonemployment spell.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across firm types and channels is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cross-section: high-wage firms grow mainly via net poaching (0.21%) plus a little net nonemployment (0.06%); low-wage firms LOSE workers to poaching (-0.40%) but GAIN strongly via nonemployment (0.62%), so they still grow (0.22%). Low-productivity firms also lose via poaching (-0.47%) but, unlike low-wage firms, grow only marginally via nonemployment (0.08%), so they shrink overall (-0.39%). Low-type firms (both rankings) have more churn (higher hires and separations) than high-type firms. Over the cycle (Table 3, change in unemployment): when unemployment rises, low-productivity firms contract more (-1.02 pp) than high (-0.71 pp), driven by the nonemployment margin (-1.05 vs -0.67 pp) and by hiring from nonemployment rather than separations (hiring is more cyclically sensitive, consistent with Shimer 2012). High-wage firms contract more than low-wage firms; for high-wage firms separations to nonemployment rise sharply (0.26 pp vs 0.04 pp for low-wage), consistent with Mueller (2017) and Zullig (2022) that high residual-wage workers are more cyclically sensitive. Low-wage firms net-gain through poaching in recessions (0.08 pp) because poaching separations fall more than poaching hires.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the cyclicality regression estimates in detail?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regressions of differential (high-minus-low) flow rates on a cyclical indicator (times 100), with seasonal dummies and a time trend, 82 quarterly observations. Change-in-unemployment, TFP: Total 0.30 pp (SE 0.10, ***), Poaching -0.08 (0.04, *), Nonemployment 0.38 (0.09, ***). Level-of-unemployment, TFP: Total 0.11 (0.05, **), Nonemployment 0.13 (0.04, ***), Poaching -0.02 (ns). Change-in-unemployment, Wage: Total -0.08 (0.06, ns), Poaching -0.21 (0.08, ***), Nonemployment 0.13 (0.06, **). Level-of-unemployment, Wage: Total -0.17 (0.03, ***), Poaching -0.15 (0.03, **&lt;em&gt;), Nonemployment -0.02 (ns). The authors note that a 2-pp rise in unemployment (typical in a recession) raises the TFP differential job-creation rate by ~66% (2&lt;/em&gt;0.30/0.91) of its mean.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How robust are the results to alternative measures and classifications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cross-sectional results are similar across TFP, value added per worker, and sales per worker, and across three high/low cutoffs (baseline top-2/bottom-1 quintiles; Haltiwanger 2021 top-2/bottom-3; Haltiwanger 2015 top-1/bottom-1). TFP consistently yields the largest net-poaching differential, so it is argued superior, though cross-sectional differences are minor. The key DIVERGENCE is in business-cycle estimates: sales per worker underestimates cyclicality (0.12 vs 0.30 pp on change-in-unemployment) and FLIPS sign on the level indicator (-0.08 vs +0.11 pp), a pattern confirmed across all three classifications. Value added per worker and an alternative OLS-based TFP measure both track baseline TFP closely and, crucially, do NOT produce the sign switch on the level indicator — isolating sales per worker as the outlier. Ranking on profits or employment growth (unreported) gives qualitatively similar results to TFP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from the closest prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Closest empirical work is Haltiwanger et al. (2018a, 2021) on US LEHD data: 2018a concludes firm wage beats firm size as a job-ladder proxy and that high-wage firms are more cyclically sensitive; 2021 finds whether recessions cleanse depends on the cyclical indicator, using sales per worker as a productivity proxy. This paper adds direct TFP (LEHD lacks it), uses daily rather than quarterly data (removing time-aggregation bias, ~30% on EE rates), and shows the wage-ranking results replicate Haltiwanger qualitatively while TFP gives different and stronger conclusions. The wage-vs-sales sign discrepancy is shown to be a measurement artifact, not a US-Denmark institutional difference. Theoretically it is closest to Audoly (2020) and Moscarini and Postel-Vinay (2013), in which better (high-type) firms are more cyclically sensitive because they poach more in expansions when the unemployed pool is small; the paper finds support for this poaching margin using TFP but, being empirical, focuses on which firm characteristic best measures the ladder. It differs from Sorkin (2018), which identifies good firms via revealed preference but does not link them to productivity, and complements Lochner and Schulz (forthcoming) on sorting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the theoretical/policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Recessions speed productivity-enhancing reallocation (cleansing via the nonemployment channel) but impede progression up the wage ladder (sullying via the poaching channel). A central modeling implication: the cleansing effect is driven only PARTLY by the classical Mortensen-Pissarides (1994) channel of firing unproductive workers; equally important, low-productivity firms STOP HIRING from nonemployment in recessions. Models with exogenous arrival rates cannot fit this (more jobs should be created from nonemployment when unemployment is high); endogenous hiring decisions are needed (e.g., Lise and Robin 2017, where low aggregate states shift the vacancy distribution toward high types). Scope conditions: estimates come from Denmark&amp;rsquo;s flexicurity labor market (low firing/hiring regulation, decentralized firm-level wage bargaining, mobility closer to the US than to France/Italy — a Dane is ~2x more likely than a French/Italian worker to make a voluntary EE move, a US worker 2.5x), 1992-2013, manufacturing/services/trade only; means-tested social assistance prevents separating active from inactive nonemployment. Magnitudes are conditional on the chosen productivity measure — using sales per worker would understate or reverse the cleansing finding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the nature of this record (corrigendum)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The DOI 10.1016/j.red.2025.101320 is a corrigendum to the original RED article 101307 (2025). The full-text file provided is the underlying working paper (IZA Discussion Paper No. 15872, January 2023), itself a heavily revised version of an earlier IZA paper, &amp;lsquo;Employment Reallocation over the Business Cycle: Evidence from Danish Data,&amp;rsquo; a chapter of Bertheau&amp;rsquo;s PhD dissertation. The summary reflects the substantive paper content; the corrigendum itself (corrections to the published version) is not detailed in the provided text.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Job ladder&lt;/strong&gt;: A common ranking by workers of available jobs from less to more desirable; the paper tests whether the rung is best indexed by a firm&amp;rsquo;s average wage or its TFP, treating the measure that best predicts voluntary (poaching) moves up as the true ladder.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Net poaching channel&lt;/strong&gt;: Net employer-to-employer (EE) flows — hires poached from other firms minus separations to other firms (direct moves with under seven days of nonemployment). Treated as primarily voluntary (80% per Taber and Vejlin 2020) and thus revealing of the job ladder.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Net nonemployment channel&lt;/strong&gt;: Net flows between a firm and the nonemployment pool — hires from nonemployment minus separations to nonemployment; not distinguished by type of nonemployment because Danish means-tested assistance prevents separating active from inactive jobseekers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cleansing effect&lt;/strong&gt;: In this paper&amp;rsquo;s sense, recessions direct/retain employment in more productive firms: the high-low productivity gap in job creation WIDENS in recessions, as low-productivity firms both separate more workers to nonemployment and stop hiring from it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sullying effect&lt;/strong&gt;: Workers are matched to better firms at a lower rate in bad times: the differential net POACHING rate between high and low firms shrinks in recessions, so the (especially wage) job ladder breaks down and workers get stuck in low-rung firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TFP (Olley-Pakes control function)&lt;/strong&gt;: Revenue-based total factor productivity estimated via the Olley-Pakes (1996) two-step method, using firm investment as a proxy for unobserved productivity; preferred over labor productivity/sales per worker because it nets out capital intensity and better predicts employment growth and net poaching.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time-aggregation bias&lt;/strong&gt;: The distortion in measured EE transitions when employment is observed only at low (e.g., quarterly) frequency, which conflates EE moves with moves through short nonemployment spells; daily Danish data avoid it (quarterly data overstate EE rates by ~30%, Bertheau and Vejlin 2022).&lt;/p&gt;</description></item><item><title>Cross-Border Spillovers: How U.S. Monetary Conditions Affect M&amp;As Around the World</title><link>https://macropaperwarehouse.com/papers/cross-border-spillovers-how-u.s.-monetary-conditions-affect-mas-around-the-world/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/cross-border-spillovers-how-u.s.-monetary-conditions-affect-mas-around-the-world/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper examines how unexpected changes in U.S. monetary policy transmit to cross-border merger and acquisition (M&amp;amp;A) activity globally, covering both the volume of deals and their quality as measured by acquirer stock price reactions. The motivation is threefold: M&amp;amp;As represent a large, discrete form of capital reallocation with measurable quality proxies (announcement returns); their financing structure makes them especially sensitive to balance-sheet conditions; and cross-border deals offer a clean lens on international spillovers from core-country monetary policy.&lt;/p&gt;
&lt;p&gt;The country-level analysis draws on SDC Platinum data covering 560,118 completed deals from over 180 economies between 2000 and 2019, representing US$41.1 trillion in combined transaction value, with cross-border deals accounting for 32.6% of the total (approximately US$13.4 trillion). The firm-level analysis uses the ORBIS M&amp;amp;A database, covering 311,485 completed deals from 164,891 acquirer firms across 177 countries. The key exogenous variable is the Iacoviello and Navarro (2019) annual U.S. monetary policy shock series, which isolates unexpected changes in the federal funds rate by stripping out systematic Taylor-rule responses to macroeconomic conditions. Foreign currency (FX) liability exposure is constructed from SDC Loans and Bonds data at the country level (flows of non-financial corporate FX bond and loan issuance, averaging 13.4% of GDP) and at the firm level by applying the country-level FX debt share to ORBIS balance-sheet totals (averaging 8.3% of assets). Identification rests on bilateral country-pair fixed effects (absorbing persistent bilateral determinants such as language, geography, and income), year fixed effects, and the interaction between firm-level FX exposure and an externally constructed, disaggregated macro shock, making reverse causality unlikely.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: (1) A 100-basis-point unexpected tightening in U.S. monetary policy is associated with a 7.3% decline in the total value of cross-border M&amp;amp;A deals and a 1.3% decline in deal count. The larger response in value than count implies that large transactions are disproportionately affected. These effects hold when U.S.-involved pairs are excluded, confirming genuine third-country spillovers. (2) The transmission is amplified by FX liabilities through a net worth channel: when U.S. policy tightens, the dollar appreciates, raising the local-currency value of foreign-currency debt and eroding acquirer net worth. A one percentage point tightening is associated with an estimated decline in cross-border M&amp;amp;A activity of approximately 0.83% for an acquirer country at the 25th percentile of FX liabilities (e.g., Brazil or Portugal), compared to more than 5.21% for a country at the 75th percentile (e.g., Belgium or Tunisia). (3) At the firm level, a one percentage point monetary tightening reduces the probability of a cross-border acquisition by approximately 1.5 percentage points for a firm at the 25th percentile of FX debt-to-assets, compared to 2.5 percentage points for a firm at the 75th percentile — a difference of about 1 percentage point attributable purely to FX exposure heterogeneity. (4) Replacing monetary policy shocks with U.S. NEER changes produces consistent results: a one-unit dollar appreciation has no significant effect at the 25th FX percentile firm but reduces the probability of cross-border M&amp;amp;A by about 5.9 percentage points at the 75th percentile. (5) Domestic M&amp;amp;A activity is not significantly affected by U.S. monetary shocks (confirming the channel operates through FX exposure), while domestic policy rates depress domestic deal value by approximately 2.7% per percentage point of tightening. (6) U.S. monetary policy shocks dominate euro-area shocks: when both are included together, U.S. monetary policy shock × acquirer FX liabilities remains negative and highly significant, while the euro-area interaction becomes small and insignificant. (7) For deal quality: tighter U.S. monetary conditions are associated with higher acquirer abnormal returns across all announcement horizons and both full-sample and cross-border subsamples. Predicted announcement returns are strongly negative when monetary policy is most accommodative and rise monotonically as policy tightens — consistent with a screening interpretation in which tight financial conditions select for value-creating deals and easy conditions enable empire-building.&lt;/p&gt;
&lt;p&gt;The dual pattern — easier U.S. conditions increase both deal volume and deal underperformance — points to capital misallocation: loose monetary spillovers generate more cross-border acquisitions, but those acquisitions on average destroy acquirer shareholder value. The policy implication is not to restrict cross-border M&amp;amp;As but to heighten macro-prudential attention to corporate leverage and asset quality when global financing conditions are accommodative. The results also provide an additional rationale for emerging market central bank exchange rate smoothing as a macro-prudential tool, insofar as limiting currency appreciation under global easing cycles may restrain unsound debt-financed acquisitions.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The country-level strategy uses bilateral country-pair fixed effects to absorb all time-invariant drivers of cross-border M&amp;amp;A (geography, language, bilateral treaties, income) and interacts the Iacoviello-Navarro U.S. monetary policy shock — constructed as Taylor-rule residuals, thus exogenous to any individual country&amp;rsquo;s conditions — with lagged country-level FX liabilities. Year fixed effects are included in some specifications. The firm-level strategy adds firm fixed effects (controlling for all time-invariant firm-level heterogeneity) and, in the most demanding specification, acquirer country-by-year fixed effects (absorbing all time-varying local macroeconomic conditions). The main threats addressed are: (1) Reverse causality — firms are too small relative to the U.S. monetary policy setting to affect the shock; (2) Endogeneity of FX liabilities — the firm-level proxy applies a country-average FX debt ratio from SDC to ORBIS balance-sheet totals, not firm-specific borrowing choices, so it reflects economy-wide currency borrowing patterns rather than individual strategic decisions; (3) Domestic monetary policy confounding — including acquirer and target short-term policy rates and their interactions with FX liabilities leaves the U.S. shock coefficient essentially unchanged; (4) Valuation effects — results hold for deal count as well as deal value; (5) Tax/regulatory arbitrage — results hold after dropping transactions involving tax-haven jurisdictions (about 2.6% of country-level and about 12,113 of firm-level observations).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the net worth channel and how is it distinguished empirically from other potential channels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The net worth channel, formalized in Diamond, Hu, and Rajan (2020), operates as follows: easier U.S. monetary conditions cause the dollar to depreciate (or non-dollar currencies to appreciate), reducing the local-currency value of foreign-currency-denominated debt and thereby increasing the net worth of firms that borrowed in dollars or other foreign currencies. Higher net worth expands borrowing capacity (financing becomes asset-based and procyclical) and enables acquisitions. The converse holds when U.S. policy tightens. The empirical distinction from a pure interest-rate-level channel is provided by the interaction between U.S. monetary shocks and firm-level FX liabilities: if the channel were simply the global cost of capital, all firms should respond equally regardless of their FX debt share. The significantly negative interaction term — consistent across country-level and firm-level specifications — specifically implicates balance-sheet exposure rather than a generic credit-conditions effect. The channel is also distinguished from domestic monetary transmission by the finding that domestic policy rates matter for domestic deals but not cross-border deals, while U.S. shocks matter for cross-border deals but not domestic ones (when interaction effects are examined). Dollar appreciation effects (using U.S. NEER) mirror the monetary shock results and directly capture the exchange-rate leg of the net worth channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across countries and firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Country-level heterogeneity: The sensitivity of cross-border M&amp;amp;A to U.S. tightening rises sharply with the level of corporate FX liabilities. A country at the 25th percentile of net FX liabilities (e.g., Brazil or Portugal) sees about 0.83% decline per pp of tightening, versus more than 5.21% for a country at the 75th percentile (e.g., Belgium or Tunisia). This pattern holds whether FX liabilities are measured with SDC, IMF, or BIS data, and for both total FX liabilities and USD-only liabilities (with the dollar-specific measure showing even more pronounced heterogeneity). Advanced economies dominate global M&amp;amp;A by value (approximately $34.9 trillion or 85%), with the U.S. alone at $17.6 trillion, but the spillover mechanism is documented beyond U.S.-involved pairs. Firm-level heterogeneity: Serial acquirers (firms with three or more deals in the sample) also show significant sensitivity to U.S. monetary conditions interacted with FX debt, indicating the effect is not limited to one-time acquirers. Firms in tradable sectors (agriculture, mining, manufacturing) show no significantly different response from firms in non-tradable sectors. U.S. acquirers show weaker sensitivity, consistent with their borrowing in domestic currency. The FX exposure effect is concentrated on acquirer-side balance sheets; target-country FX liabilities show point estimates in the same direction but are not robustly significant, suggesting the main transmission operates through acquirer finance rather than target-country conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the evidence on deal quality and how is it measured?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Deal quality is measured by market-adjusted acquirer excess returns (abnormal returns) over horizons of one to four quarters following the M&amp;amp;A announcement, benchmarked against a country-specific equity index from Global Financial Data. The stock price reaction to the announcement is used as a proxy for the expected quality of the investment at the time, based on the reasoning that acquisitions involve substantial, relatively immediate, and difficult-to-reverse financial commitments, making the announcement return a reliable contemporaneous signal. The specification regresses acquirer abnormal returns on lagged U.S. monetary policy shocks, controlling for acquirer fixed effects, country fixed effects, or no fixed effects, across the full deal sample and the cross-border subsample. Findings: coefficients on U.S. monetary policy shocks are consistently positive and statistically significant across all specifications and horizons, meaning tighter conditions predict higher acquirer excess returns. Figure 5 shows that predicted returns are strongly negative when monetary policy is most accommodative, remain negative through much of the shock distribution, and rise monotonically into positive territory as policy tightens. The interpretation offered is a screening effect: high financing costs filter out low-quality empire-building acquisitions, while easy conditions lower the bar for what gets financed. This quality degradation under easy conditions, combined with higher deal volumes under easy conditions, constitutes the capital misallocation finding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run at both country and firm levels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Country-level robustness: (1) Replication with deal count instead of deal value to rule out pure valuation effects — results are qualitatively the same. (2) Restricting to &amp;rsquo;established markets&amp;rsquo; (roughly 80 countries with at least 10 serial acquirers), which yields a larger effect magnitude (8.1% decline in value per 100bps). (3) Replacing SDC FX liabilities with IMF IIP and BIS Locational Banking Statistics measures — results remain qualitatively similar. (4) Including domestic short-term policy rates and their interactions with FX liabilities — the U.S. shock interaction coefficient is essentially unchanged. (5) Comparing U.S. versus euro-area monetary policy shocks — U.S. shock dominates; EA shock becomes insignificant when both are included. (6) Excluding tax-haven jurisdictions (about 2.6% of observations) — results consistent with baseline. (7) Lagging the monetary policy variable by one year and FX liabilities by two years — results qualitatively similar though standard errors increase. Firm-level robustness: (1) Linear probability model on the full sample of ~686,000 firm-year observations (compared to the conditional logit on ~170,000 with within-firm variation) — key findings hold. (2) Using non-current FX liabilities instead of total FX debt — results remain statistically significant. (3) Constructing firm-level FX debt from BIS data following Kalemli-Ozcan et al. (2021) — results consistent though significant only at 10% level due to smaller country coverage. (4) Adding domestic policy rates — U.S. shock remains dominant; domestic rates and their FX interactions are insignificant for cross-border deals. (5) Extending to domestic M&amp;amp;A firm-level regressions — the U.S. shock × FX liabilities interaction is significant even for domestic deals (though the direct U.S. shock effect is not), suggesting the balance-sheet channel extends to within-country activity once the interaction is isolated. (6) Testing tradable vs. non-tradable sectors — no significantly different response; results hold across sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from Erel, Liao, and Weisbach (2012) and other closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Erel et al. (2012) is the closest antecedent. It analyzes persistent bilateral determinants of cross-border M&amp;amp;A (language, geography, treaty status, relative valuation via exchange rate and stock market appreciation), finding that acquirer-country exchange rate and stock market appreciation increases cross-border acquisitions toward that country&amp;rsquo;s firms as targets. The current paper uses bilateral fixed effects to absorb those persistent determinants and focuses on the time-series variation driven by an exogenous, externally constructed U.S. monetary policy shock interacted with balance-sheet FX exposure. The mechanism differs: rather than exchange-rate-driven valuation effects per se, the paper emphasizes net worth through the FX liability channel, distinguishing it from a pure relative-price view of cross-border M&amp;amp;A flows. Relative to di Giovanni (2005), which found that domestic financial development drives M&amp;amp;A outflows in the 1990s, this paper focuses on global monetary conditions since 2000. Relative to Diamond et al. (2020), the paper takes the theoretical net worth channel to a global empirical test using actual M&amp;amp;A data and adds the misallocation angle via announcement returns. The paper also extends previous work on FDI and capital flow misallocation by documenting misallocation specifically through M&amp;amp;A quality (announcement returns), which prior literature did not analyze. Other exchange-rate papers (Pelli 2018; Fransson 2010; Georgopoulos 2008) focus on the direct exchange rate level rather than the mechanism running through FX-debt net worth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three sets of implications are discussed. First, cross-border M&amp;amp;A inflows to a country should not be interpreted as an unambiguous signal of that country&amp;rsquo;s economic strength or attractiveness; a significant portion of the time-series variation reflects monetary conditions in core countries rather than local fundamentals. Second, easy monetary conditions at the core can generate a legacy of overleveraged corporates in non-core countries: firms increase FX debt during accommodative periods to finance acquisitions that often destroy value, then face balance-sheet stress when core conditions tighten. The authors suggest this is especially concerning because the activity being financed — acquisitions — has highly uncertain productivity benefits. The regulatory implication is heightened macro-prudential attention to corporate leverage and acquisition activity during periods of global monetary ease, not an outright ban on cross-border M&amp;amp;A. Third, the results offer an additional rationale for emerging market central bank exchange rate smoothing: by dampening the appreciation of domestic currencies during easy global conditions, central banks may limit the net worth expansion that fuels excessive FX-debt-financed acquisitions, adding a macro-prudential dimension to what is often framed as a pure competitiveness or capital-flow management motive. Scope conditions: results are based on 2000–2019 data, so the sample predates major post-2019 shocks; effects are most pronounced for acquirers with above-median FX liabilities and may be less relevant for domestic-currency borrowers (including U.S. firms); the quality evidence uses announcement returns, which measure market expectations at announcement rather than realized post-merger performance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find about the U.S. dollar&amp;rsquo;s special role versus the euro&amp;rsquo;s role?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper directly tests whether the U.S. is distinctive among reserve-currency issuers by constructing euro-area (EA) monetary policy shocks using a parallel methodology (ECB shadow rate, Taylor-rule residuals, following the spirit of Iacoviello and Navarro 2019). When EA shocks alone are considered, the interaction between EA monetary policy shocks and acquirer FX liabilities is negative but only marginally significant. When both U.S. and EA shocks are included simultaneously, the U.S. shock × acquirer FX liabilities interaction is negative and highly significant while the EA equivalent becomes small and statistically insignificant. Interactions involving target-country FX liabilities are not significant for either shock. The authors interpret this as consistent with the dominant international role of the U.S. dollar: because much global corporate FX borrowing is in dollars, U.S. monetary conditions are the primary driver of net worth through the FX channel, while euro-area policy has at best weak independent effects once U.S. conditions are controlled for.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the data limitations and caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several limitations are acknowledged. First, deal value is missing for 61.4% of observations in the SDC country-level data and 65.6% in the ORBIS firm-level data, likely concentrated in smaller private transactions. The paper addresses this by treating year-zeros for country pairs that have previously reported positive deal values as genuine zeros rather than missing, but this assumption may introduce noise. Second, the firm-level FX liability measure is a proxy constructed by applying a country-level FX debt share to firm-level total liabilities from ORBIS (because ORBIS M&amp;amp;A data do not record currency denomination of debt and there are no unique identifiers to link individual firms to SDC). This introduces measurement error but arguably also reduces endogeneity from firm-specific borrowing decisions. Third, the stock return analysis is restricted to 2010–2019 because of data availability from ORBIS and GFD, a shorter window than the 2000–2019 M&amp;amp;A sample. Fourth, the paper does not track post-merger performance over time (only announcement returns), leaving open whether deals that look poor at announcement do in fact underperform over multi-year horizons. Fifth, because targets typically exit the dataset after acquisition, the authors cannot build a target-firm panel, limiting firm-level analysis to the acquirer side. The authors flag data on FX exposure of the corporate sector as an important area for improvement and note that examining acquisition-induced leveraging dynamics over time is an avenue for future research.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the take-away for the global financial cycle literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper contributes to the &amp;lsquo;global financial cycle&amp;rsquo; tradition (Rey 2013; Kalemli-Ozcan 2019) by documenting a specific and previously under-studied channel through which U.S. monetary conditions affect real investment decisions globally: corporate control reallocation via M&amp;amp;A, operating through the net worth of foreign-currency borrowers. Unlike studies focused on cross-border lending or portfolio flows, M&amp;amp;A data provide a direct proxy for investment quality (announcement returns), allowing the authors to move beyond documenting that spillovers exist to showing that they have welfare-relevant misallocation consequences. The dominance of U.S. over EA shocks in driving this channel is consistent with the dollar&amp;rsquo;s hegemonic role in global corporate borrowing (Maggiori, Neiman, and Schreger 2020). The paper also complements the macro-prudential angle in Diamond et al. (2020) and Hofmann et al. (2019) by showing that asset-based borrowing during easy monetary periods generates procyclical M&amp;amp;A activity that underperforms when measured by market expectations at announcement.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Net worth channel (of monetary policy spillovers)&lt;/strong&gt;: As used in this paper (building on Diamond, Hu, and Rajan 2020): the mechanism by which U.S. monetary easing causes the dollar to depreciate, raising the local-currency net worth of non-U.S. firms with dollar- or foreign-currency-denominated liabilities, expanding their borrowing capacity on an asset-based basis and enabling additional acquisitions. Conversely, U.S. tightening appreciates the dollar, erodes net worth, and reduces cross-border acquisition activity — especially for firms with large FX debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;FX liabilities (foreign currency liabilities)&lt;/strong&gt;: In this paper, debt obligations denominated in a currency other than the borrower&amp;rsquo;s domestic currency. Measured at the country level using SDC bond and loan issuance data (flow-based, non-financial corporates only, averaging 13.4% of GDP), and at the firm level by applying that country-level FX debt share to ORBIS balance-sheet total liabilities (averaging 8.3% of assets). The key heterogeneity variable: firms and countries with higher FX liabilities exhibit amplified sensitivity to U.S. monetary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Acquirer excess (abnormal) return&lt;/strong&gt;: Market-adjusted stock return of the acquiring firm over one-to-four quarters following the M&amp;amp;A announcement date, computed as the acquirer&amp;rsquo;s raw return minus the contemporaneous country-specific equity index return from Global Financial Data. Used as a contemporaneous market signal of expected deal quality; a negative abnormal return at announcement is interpreted as the market assessing the acquisition as value-destroying.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital misallocation (via monetary spillovers)&lt;/strong&gt;: As documented in this paper: the joint pattern in which accommodative U.S. monetary conditions generate both more cross-border M&amp;amp;A transactions and lower-quality transactions (negative acquirer announcement returns), implying that easy financing conditions direct resources toward acquisitions that destroy rather than create value. The paper does not measure misallocation in terms of productivity dispersion across firms but in terms of the gap in deal quality between loose- and tight-monetary-condition periods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monetary policy shock (Iacoviello-Navarro)&lt;/strong&gt;: An annual, exogenous measure of unexpected changes in U.S. monetary policy, constructed by Iacoviello and Navarro (2019) as the residuals from regressing the federal funds rate on a standard set of macroeconomic controls (a Taylor-rule approach). The shock captures the component of policy change that is not explained by systematic responses to inflation, output, or other macro variables, allowing the authors to treat it as exogenous to conditions in any individual non-U.S. country.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Screening effect (of tight monetary conditions)&lt;/strong&gt;: The paper&amp;rsquo;s interpretation of why tighter U.S. conditions predict higher acquirer announcement returns: when financing is expensive and difficult to obtain, firms pursue only acquisitions with clear strategic or synergistic rationale, so the average deal quality is higher. Conversely, in liquidity-abundant environments, managerial agency problems (empire-building, growth-for-growth&amp;rsquo;s-sake) face fewer financial constraints, leading to value-destroying acquisitions that pass the financing test.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cross-border M&amp;amp;A (as a distinct investment form)&lt;/strong&gt;: As framed in this paper: an acquisition in which the acquirer and target are headquartered in different countries, resulting in a change of control. Distinct from greenfield FDI (new asset creation) and from portfolio equity flows in that it involves immediate, large capital commitments, usually accompanied by significant leverage taken on by the acquirer, with a measurable contemporaneous quality signal (announcement return). The authors restrict the sample to control-transfer transactions (majority stake, excluding LBOs, spin-offs, recapitalizations, partial stakes, and privatizations).&lt;/p&gt;</description></item><item><title>Did the US Really Grow Out of Its World War II Debt?</title><link>https://macropaperwarehouse.com/papers/did-the-us-really-grow-out-of-its-world-war-ii-debt/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/did-the-us-really-grow-out-of-its-world-war-ii-debt/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. The fall in the US federal debt-held-by-the-public/GDP ratio from a postwar peak of 106% in fiscal year 1946 to a trough of 23% in 1974 is widely cited (Elmendorf-Mankiw, Krugman) as evidence that an economy &amp;ldquo;grows out of&amp;rdquo; debt because the GDP growth rate exceeds the interest rate on government debt (r &amp;lt; g). That narrative underpins the modern view (Blanchard 2019; Furman-Summers 2020) that high public debt &amp;ldquo;may have no fiscal cost.&amp;rdquo; Acalin and Ball ask how much of the postwar debt decline was genuinely due to growth exceeding undistorted real interest rates, versus three other factors: primary budget surpluses, the Fed&amp;rsquo;s 1942-1951 interest-rate peg before the Fed-Treasury Accord, and surprise inflation.&lt;/p&gt;
&lt;p&gt;Method and data. The authors simulate counterfactual debt/GDP paths from the standard debt-dynamics identity D_t = (1+i_t)D_{t-1} - P_t, starting from the actual 1946 debt level and holding nominal GDP fixed at its historical path. They build three counterfactuals: (i) &amp;ldquo;primary balance&amp;rdquo; (set primary surplus to zero each year); (ii) &amp;ldquo;adjusted interest rate&amp;rdquo; (remove distortions from both the peg and surprise inflation); and (iii) &amp;ldquo;combined&amp;rdquo; (both), whose path is driven purely by r* - g, the undistorted real rate minus growth. A key innovation is measuring the &amp;ldquo;reverse maturity structure&amp;rdquo; — the fractions of currently outstanding debt issued in each past year — using Hall-Payne-Sargent (2018) data for 1942-1960 and CRSP thereafter. They construct a term structure of inflation expectations from one-year (Livingston, SPF) and ten-year (FRB/US) survey data, and estimate undistorted peg-era real rates from ex-ante real rates on securities issued in 1952-1961. T-bills and TIPS are assumed unaffected by inflation surprises (conservative). Debt is par value, held by the public, by fiscal year.&lt;/p&gt;
&lt;p&gt;Main quantitative findings. In the combined counterfactual, debt/GDP falls only to 74% in 1974 (vs. 23% actual); the individual counterfactuals give 40% (primary balance) and 51% (adjusted rate) in 1974. Of the actual 83-point fall (106 to 23), 51 points are explained by surpluses plus rate distortions, decomposed as 17 points from surpluses alone, 28 from rate distortions alone, and 6 from their interaction; only 32 points (the fall to 74%) reflect growth net of undistorted rates. Extending to the present, the combined counterfactual ratio starts rising in 1980, dipping to 70% in 1979 before climbing to 84% in 2022 — only 22 points below the 1946 level of 106. Over the full 76 years, undistorted growth alone would have cut debt/GDP by just 22 points. The post-1979 reversal reflects a sign change in r* - g: average r* rose from 2.3% (1947-1979) to 2.8% (1980-2022) while average g fell from 3.5% to 2.6%. The estimated undistorted real-rate term structure is 1.7% (1yr), 2.2% (5yr), 2.5% (10yr), 2.7% (30yr).&lt;/p&gt;
&lt;p&gt;Mechanisms and implications. Primary surpluses averaged 1.1% of GDP over 1947-1974 (peaking at 6.3% in 1948), then turned to persistent deficits. The peg (caps of 0.375% on bills to 2.5% on 30-year bonds) combined with post-1946 inflation surges (CPI averaging 7.1% in FY1947-1951) produced deeply negative ex-post real rates; the aggregate interest-rate adjustment x_t reached 13 points in 1947 and 8 points in 1951. Policy implication: the distortions are unlikely to recur (no peg/price controls planned, Fed committed to low inflation, shorter average maturity — down from 4.4 years in 1951 to 2.2 years in 2022 — blunts inflation&amp;rsquo;s effect), so substantially reducing today&amp;rsquo;s 97% (FY2022) ratio will likely require primary surpluses, which CBO projections suggest are not forthcoming.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/counterfactual strategy and what are its main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There is no causal identification in the econometric sense; the strategy is an accounting simulation of the debt-dynamics identity under counterfactual interest rates and primary balances, holding nominal GDP (and real GDP and undistorted real rates) fixed at historical values. Threats: (1) the undistorted peg-era real rates are unobserved and must be guessed from 1952-1961 ex-ante real rates; (2) the reverse maturity structure (weights w) is held at historical levels even though higher counterfactual debt would alter issuance; (3) general-equilibrium feedback is ignored — higher counterfactual debt would raise real rates and crowd out capital, lowering GDP, both of which would push debt/GDP even higher, so the authors interpret their paths as LOWER BOUNDS; (4) pre-1943 debt is not adjusted for surprise inflation because long-term expectations data do not exist before 1943, which the authors argue biases against finding a large inflation role.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are the effects of the peg and surprise inflation distinguished, and can they be separated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The adjusted-interest-rate scenario removes both jointly. The authors state it would be difficult to separate them cleanly because that requires measures of expected inflation during the peg period (1942-1951), and there are no data on long-term inflation expectations before 1951 or short-term expectations before 1947 (start of Livingston). For post-1952 debt, the surprise-inflation adjustment is pi_t minus the expectation formed when the security was issued; for peg-era debt the adjustment is the gap between the ex-post real rate and the assumed undistorted real rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the decomposition relative to Hall and Sargent (2011)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Hall-Sargent decompose the 1946-1974 debt/GDP change into r-g and primary surpluses but do not ask how interest-rate distortions shape r-g. Replicating their approach (Table 2A), the authors attribute -48.1 points to r-g and -29.6 points to primary surpluses (the terms sum to -78 points, less than the actual -82.9 because of the debt-dynamics residual). The paper&amp;rsquo;s extension (Table 2B) splits the -48.1 r-g contribution into only -11.7 points from r*-g (undistorted) and -36.3 points from the distortion r-r*, with surpluses still -29.6. So most of the apparent &amp;lsquo;growth out of debt&amp;rsquo; was actually interest-rate distortion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why do the Table 2 surplus contributions differ from the Table 1 scenario differences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In Table 2 surpluses contribute -29.6 points, larger than the 17-point effect implied by the Table 1 difference between actual 1974 debt/GDP and the primary-balance scenario. The reason is an interaction: eliminating surpluses raises the debt path d_{t-1}, which magnifies the r-g term, so additional debt is partly eroded by r-g. The authors call the Figure 7 / Table 1 scenario paths the more precise representation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the findings reconcile with Blanchard&amp;rsquo;s (2019) claim that r &amp;lt; g since 1979?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors find r &amp;gt; g on average since 1979 (even in the primary-balance counterfactual with actual ex-post rates), so debt/GDP would rise. The difference from Blanchard is purely measurement: (1) they use the government&amp;rsquo;s interest payments on outstanding debt — the rates set at issuance — whereas Blanchard uses current market yields (a weighted average of 1- and 10-year Treasury rates), which since 1979 have been lower because rates trended down; (2) the authors use pre-tax rates while Blanchard uses after-tax rates. Figure A.11 confirms: with the authors&amp;rsquo; measure debt/GDP rises 1979-2022; with Blanchard&amp;rsquo;s pre-tax market yields it rises then falls back near its 1979 level; with his after-tax rates it falls significantly. The authors argue the rate paid by the government is the relevant one for the debt-dynamics identity, and that a natural baseline assumes debt has no net effect on tax revenue (so pre-tax rates apply).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is a notable nuance about the post-1979 period in the primary-balance counterfactual?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The post-1979 rise in debt/GDP is LARGER in the primary-balance counterfactual (19 points, from 34% to 53%) than in the combined counterfactual (14 points). This is because inflation surprises since 1979 have on average been negative (post-Volcker disinflation, actual below expected), raising ex-post real rates and thus debt/GDP. It confirms that actual r has exceeded g since 1979.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Undistorted peg-era real rates shifted by +/-0.5% and +/-1% across the whole term structure: 1974 combined debt/GDP ranges from 67% (-1%) to 81% (+1%) around the 74% baseline; 2022 ranges from 78% to 91% around 84% (Table A.2). (2) Pre-1962 interest measured by net interest times 1.1; using net interest directly gives 73% in 1974 and 83% in 2022 vs. 74% and 84% baseline. (3) The debt-dynamics residual epsilon (mainly Treasury cash balances) is held at historical values; setting it to zero gives a combined counterfactual of 78% in 1974 and 77% in 2022, showing the residual contributed -0.19% GDP/year on average over 1947-1974 and +0.25% over 1975-2022. (4) Term-structure shape assumptions and the GDP-deflator-vs-CPI expectation-error approximation are checked in the Appendix as reasonable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across the debt structure matters?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The reverse maturity structure is central: the share of debt with reverse maturities above five years peaked at 48% in 1951 (long-term WWII bonds), then fell, fluctuating between 10% and 25% from 1975-2022; average reverse maturity fell from 4.4 years in 1951 to 2.2 years in 2022. Shorter maturity means inflation surprises erode less debt — a reason later inflation surprises had smaller effects than the 1940s-1970s ones. T-bills (assumed unaffected by surprise inflation since rolled over at adjusting rates) and TIPS (post-1997, indexed) are excluded from the inflation-surprise adjustment. Non-marketable debt fell from 23% of total in 1960 to 3% in 2022; its reverse maturity structure is assumed constant after 1960.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the timing/measurement complications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Unit is fiscal year (July-June before FY1977, October-September after), creating a &amp;lsquo;Transitional Quarter&amp;rsquo; in Q3 1976 requiring special handling. Inflation is GDP-deflator growth. Pre-1970 deflator expectations are proxied from Livingston CPI forecasts assuming equal expectation errors for CPI and deflator. Ten-year expectations before 1968 are fitted from one-year expectations via a regression (1968-1997) with a negative coefficient (-1.549) on the change in smoothed one-year expectations, capturing long-term expectations lagging short-term moves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the postwar debt reduction came largely from one-off distortions (the peg with price controls, and surprise inflation) unlikely to recur — and the Fed is committed to low inflation while shorter average maturity weakens inflation&amp;rsquo;s erosive power — economic growth alone is unlikely to resolve the current ~97% (FY2022) ratio. Substantial reduction will probably require primary surpluses, which CBO projects will not occur under current policy (large primary deficits forecast for three decades). Scope conditions: results are lower bounds (GE crowding-out omitted); they depend on the assumed undistorted real-rate term structure; the 2021-2022 inflation surge is again temporarily reducing debt/GDP.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Dispersion Over the Business Cycle: Passthrough, Productivity, and Demand</title><link>https://macropaperwarehouse.com/papers/dispersion-over-the-business-cycle-passthrough-productivity-and-demand/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dispersion-over-the-business-cycle-passthrough-productivity-and-demand/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Carlsson, Clymo, and Joslin use Swedish manufacturing firm-level microdata for 1998–2013 to separately identify and characterize the cyclical behavior of physical productivity (TFPQ) shocks and demand shocks at the firm level, two forces that are observationally equivalent under the standard CES-demand benchmark. The paper&amp;rsquo;s central contribution is threefold: it documents new empirical facts about dispersion cyclicality, estimates a non-constant-elasticity (non-CES) demand curve directly from firm-level price and quantity data, and embeds those estimates into a quantitative heterogeneous-firm model to study the aggregate consequences of each type of dispersion shock.&lt;/p&gt;
&lt;p&gt;The data combine four Swedish register sources: the Företagens Ekonomi (FEK) survey for bookkeeping variables; the Industrins Varuproduktion (IVP) survey for 8-digit product-level price and quantity data used to construct firm-level price indices; the Konjunkturstatistik för Industrin (KFI) survey for quarterly capacity-utilization data; and additional investment deflators. The unbalanced panel contains 3,181 unique manufacturing firms and 15,044 firm-year observations. TFPQ is measured using a Cobb-Douglas value-added production function with factor utilization adjustment; factor elasticities are estimated via cost shares at the 2-digit sector level, yielding an average labor share of 0.735.&lt;/p&gt;
&lt;p&gt;Demand is estimated using the Gopinath-Itskhoki-Rigobon (GIR) flexible demand curve, which nests CES as the limiting case. TFPQ innovations instrument for price in a second-order approximation, following Foster, Haltiwanger, and Syverson (2008). The main-sample estimates yield theta = 2.94 (average elasticity) and eta = 4.27 (super-elasticity), both significant at the 1% level. The second-order price term is statistically significant at the 5% level in all three samples, decisively rejecting CES. These estimates imply that a 5% price increase raises the demand elasticity from 2.94 to 3.74, while a 5% price reduction reduces it to 2.42, creating a &amp;ldquo;real rigidity&amp;rdquo; in the sense of Ball and Romer (1990): raising price loses many customers while lowering it gains few.&lt;/p&gt;
&lt;p&gt;Incomplete passthrough of TFPQ shocks is a central empirical finding. OLS estimates yield beta_z = -0.124; first-difference estimates yield -0.097. Even in the subsample of firms that adjusted all product-level prices in a given year, TFPQ passthrough remains near -0.10, ruling out Calvo or menu-cost price stickiness as the sole driver. Longer-horizon (two- and three-year) first-difference regressions produce similar estimates, ruling out Rotemberg gradual adjustment as well. The non-CES demand curve alone implies a static-optimal passthrough of theta/(theta + eta) = 3/(3 + 4.3) = 41%, so real rigidity explains most of the incompleteness even before accounting for adjustment costs. Demand shocks pass through to prices at a rate of 0.209-0.235, a non-zero result rationalized in the quantitative model by input adjustment costs.&lt;/p&gt;
&lt;p&gt;On cyclicality of dispersion, both TFPQ and demand shock dispersion are countercyclical, but demand dispersion rises by more and is more robust across recession episodes. In 2009 (the Great Recession), the IQR of demand shock growth was 56% above its non-recession average, while the IQR of TFPQ shock growth rose 36%. Sales dispersion rose 58% (IQR) in 2009. A semi-structural variance decomposition shows that demand shocks account for 63% of average sales growth dispersion and approximately 80% of its increase in 2009; TFPQ dispersion contributes only marginally to sales dispersion because the TFPQ variance is shrunk by a factor of roughly 25 on its way to sales growth through the chain of low passthrough and demand elasticity. Demand accounts for about 50% of average price growth dispersion and 40% of its cyclical increase in 2009; TFPQ accounts for about 10% of price dispersion on average.&lt;/p&gt;
&lt;p&gt;The quantitative heterogeneous-firm model extends Bloom (2009) and Bloom et al. (2018) to continuous time with both TFPQ and demand shocks, non-CES demand (theta = 3, eta = 4.3 from the estimates), and non-convex input adjustment costs on a composite scale factor covering both capital and labor. The resale loss kappa = 0.3565 is taken from Bloom et al. (2018). The model is calibrated to match IQRs of 0.2 for TFPQ and demand shock log-changes in the low-uncertainty state, consistent with pre-crisis Swedish data. For the high-uncertainty state, the calibration targets the Great Recession peaks: a 30% rise in TFPQ dispersion (sigma_z(2) = 1.38 sigma_z(1)) and a 60% rise in demand dispersion (sigma_epsilon(2) = 1.90 sigma_epsilon(1)), reflecting the empirical finding that demand dispersion increases more.&lt;/p&gt;
&lt;p&gt;A simulated transition to the high-uncertainty state causes aggregate output to fall by 3.5%. Decomposing into the Bloom (2009) &amp;ldquo;volatility effect&amp;rdquo; (realized shocks drawn from the high-dispersion distribution, firms believe low) and &amp;ldquo;uncertainty effect&amp;rdquo; (firms believe high, shocks drawn from low distribution), the paper finds both effects are negative in the non-CES model, in sharp contrast to Bloom (2009) where the volatility effect is positive (the Oi-Hartman-Abel effect). Non-CES demand amplifies the total output decline by approximately 40% relative to the CES model (peak fall 2.5% vs. 1.75%), primarily by reversing the sign of the volatility effect. Increased demand dispersion drives almost all of the first-year output decline and the majority of the uncertainty effect; TFPQ dispersion is the main driver of the negative volatility effect via markup dispersion. The inaction rate among firms jumps from 50% to 95% on impact of the uncertainty shock, then recovers within one year. TFPQ uncertainty induces little wait-and-see behavior because firms optimally adjust inputs by only 23% of the TFPQ shock size (versus 200% under CES), so uncertainty about TFPQ translates mainly into markup uncertainty. Demand uncertainty triggers strong wait-and-see behavior because demand directly maps one-for-one into desired input use.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s core identification strategy for separating TFPQ and demand shocks, and what are the main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors identify TFPQ from a utilization-adjusted Cobb-Douglas value-added production function, then estimate demand using TFPQ innovations as instruments for price. TFPQ innovations are valid instruments because they shift marginal cost without directly shifting demand, tracing out the demand curve. The utilization adjustment (from the KFI managerial survey) is critical: without it, demand shocks that reduce utilization would appear as negative TFPQ shocks, biasing demand elasticity estimates upward and breaking instrument validity. The paper validates the adjustment by showing that firms reporting &amp;lsquo;insufficient demand&amp;rsquo; exhibit 15% lower utilization on average, and 23% lower during the Great Recession. A second threat is quality change in firm-level prices; the authors address this with (a) robustness using the Eslava et al. (2023) CUPI quality-adjusted price index and (b) a single-product-firm subsample. Demand and passthrough results are similar across all three price index approaches. The within-firm focus (demeaning by firm and sector-year fixed effects throughout) mitigates cross-sectional comparability issues but limits misallocation-level analyses analogous to Hsieh and Klenow (2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the non-CES demand curve identified, and what exactly does the super-elasticity parameter eta measure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The GIR demand curve is q = (1 - eta * log p)^(theta/eta). A second-order approximation around the firm&amp;rsquo;s average price yields log q = -theta * p_hat - (eta&lt;em&gt;theta/2) * p_hat^2 + fixed effects + epsilon, where p_hat is the firm&amp;rsquo;s demeaned log relative price. Regressing real sales on p_hat and p_hat^2, instrumented by demeaned TFPQ and its square, recovers theta = -b1 and eta = 2&lt;/em&gt;b2/b1. Because p_hat is demeaned at the firm level, the estimates capture within-firm nonlinearity in the price-sales relationship, not cross-sectional heterogeneity in elasticity levels. The parameter eta is the &amp;lsquo;super-elasticity&amp;rsquo;: it measures how much the demand elasticity itself changes with the price. When eta &amp;gt; 0, a firm that raises its price faces an increasingly elastic demand curve (loses customers rapidly), and one that lowers its price faces a less elastic curve (gains customers slowly). The estimated eta = 4.27 in the main sample is roughly half the value of 10 studied (but not estimated) in Klenow and Willis (2016) and larger than the approximately 2 used in Berger and Vavra (2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper distinguish the &amp;lsquo;volatility effect&amp;rsquo; from the &amp;lsquo;uncertainty effect&amp;rsquo; in the quantitative model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Bloom (2009), the paper simulates two counterfactuals. The uncertainty effect holds shocks drawn from the low-dispersion distribution (s=1) but lets firms believe that the high-uncertainty state (s=2) has arrived; this isolates the precautionary wait-and-see channel. The volatility effect draws shocks from the high-dispersion distribution (s=2) but lets firms believe they are in the low-uncertainty state; this isolates the direct effect of realizing more extreme shocks on aggregate output. In the non-CES model, both effects are negative. The uncertainty effect is dominated by demand uncertainty because demand shocks directly affect desired input use one-for-one, so uncertainty about future demand creates strong incentives to pause investment. TFPQ uncertainty induces little wait-and-see behavior because the optimal scale adjustment to a TFPQ shock is only 23% of the shock magnitude (vs. 200% under CES). The volatility effect is dominated by TFPQ dispersion because realized TFPQ shocks generate markup dispersion via incomplete passthrough, creating misallocation. Under CES, the volatility effect from TFPQ is positive (OHA effect: convex output-productivity relationship); non-CES demand makes the output-productivity relationship concave for eta large enough, flipping the sign.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What mechanism makes TFPQ passthrough so low in both the data and the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms operate. First, non-CES demand itself: when eta &amp;gt; 0, raising price increases the demand elasticity, and lowering price decreases it. This means the benefit to revenue from a price cut (following a productivity gain that reduces costs) is muted because the firm gains fewer customers than under CES. The static optimal passthrough is theta/(theta + eta) = 3/(7.3) = 41%. Second, non-convex input adjustment costs further reduce passthrough by making firms reluctant to change their scale in response to TFPQ shocks. In the model, the investment threshold is nearly flat across a wide range of TFPQ values (shown in Figure 6, left panel), reflecting that optimal scale barely responds to productivity. Together these mechanisms reproduce TFPQ passthrough of 20-30% in model-simulated data vs. 10-24% in the actual data, both far below the CES benchmark of 100%. The paper also verifies that low passthrough persists in the subsample of flexible-price firm-years, ruling out sticky prices as the primary driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does demand shock dispersion, rather than TFPQ dispersion, dominate the variance decompositions of sales and price growth?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The contribution of TFPQ dispersion to sales dispersion is (1-theta)^2 * beta_z^2 * Var(z). With beta_z = -0.097 and theta = 2.99, the TFPQ variance is shrunk by approximately (1-2.99)^2 * (0.097)^2 = 4 * 0.0094 ≈ 0.04, so only about 4% of TFPQ variance propagates to sales variance. This extremely small multiplier reflects two successive attenuation steps: low TFPQ passthrough to prices (beta_z^2 ≈ 0.01) and a small price-to-sales elasticity. Demand shocks, by contrast, affect sales directly through the demand curve without a price intermediary: the contribution is ((1-theta)*beta_epsilon + 1)^2 * Var(epsilon). With beta_epsilon = 0.209 and theta = 2.99, the multiplier is ((1-2.99)*0.209 + 1)^2 = (1 - 0.416)^2 = 0.34, about eight times larger than for TFPQ even though both shocks have similar variance. The cyclical increase is even more skewed toward demand because demand dispersion rises by 56% vs. 36% for TFPQ in 2009.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to TFPR dispersion, and what does it say about using TFPR as a sufficient statistic?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;TFPR = p * z. For arbitrary passthrough, TFPR growth = beta_epsilon * delta_epsilon + (beta_z + 1) * delta_z. Because passthrough from both shocks is incomplete, TFPR growth reflects a mixture of both underlying shocks. The paper shows via a variance decomposition of TFPR that TFPQ is the main driver of TFPR growth dispersion—accounting for roughly 60% on average—because low passthrough means prices move little, leaving TFPQ changes to dominate TFPR. However, this finding obscures the importance of demand shocks for aggregate outcomes: demand dispersion is the dominant driver of sales growth dispersion and wait-and-see behavior, yet TFPR growth dispersion mostly reflects TFPQ. A researcher relying on TFPR dispersion to infer uncertainty would correctly detect productivity uncertainty but would miss the more cyclically important demand uncertainty channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the Oi-Hartman-Abel (OHA) and wait-and-see mechanisms work differently under non-CES vs. CES demand?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under CES demand, sales of each firm are s = z^(theta-1) * exp(epsilon), and aggregate output is E[z^(theta-1)] which is convex in z, so a mean-preserving spread in TFPQ raises aggregate output (OHA effect). Under the estimated non-CES parameters (theta=3, eta=4.3), the approximate relationship yields output proportional to z^0.82, which is concave, so a mean-preserving spread in TFPQ reduces aggregate output. The mechanism is that under non-CES demand, TFPQ shocks pass through incompletely to prices and thus create markup dispersion: high-productivity firms have high markups, low-productivity firms have low markups, and the resulting misallocation reduces total output even relative to a social planner who would set p=mc. For wait-and-see: under CES, optimal input adjustment to a TFPQ shock equals (theta-1) times the shock, which is 200% for theta=3; under non-CES with eta=4.3, it is only (theta^2/(theta+eta) - 1) * shock = 0.233 * shock = 23%. This means firms adjust scale very little in response to TFPQ uncertainty, dampening the wait-and-see channel for TFPQ. TFPQ uncertainty then causes uncertainty about markups, which is costly but does not trigger large investment adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role do adjustment costs play, and how robust are the results to the structure of those costs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Non-convex adjustment costs on a composite firm-scale factor x = k^alpha * l^(1-alpha) create an inaction region: firms neither invest nor disinvest until shocks are sufficiently large. In the low-uncertainty state, the model generates a yearly inaction rate of 25.4% (consistent with pre-crisis Swedish data showing roughly 15%). When uncertainty rises, the inaction region widens, the inaction rate jumps to 95% on impact, and firms let their scale shrink via depreciation. The baseline calibration uses the resale loss kappa = 0.3565 from Bloom et al. (2018). The paper also calibrates kappa to the Swedish inaction rate (kappa = 0.1165), which delivers qualitatively identical dynamics but a smaller amplitude recession (1.7pp vs. 3.5pp output fall). The paper also solves a version with adjustment costs only on capital (as in Bachmann and Bayer, 2013): the wait-and-see effect is dampened but the qualitative results hold—demand uncertainty still dominates TFPQ uncertainty in driving wait-and-see, and non-CES demand still reverses the sign of the OHA effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the price wedge and time-varying passthrough?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The passthrough equation residual (price wedge, tau) captures price changes unexplained by TFPQ and demand shocks. It could reflect un-modeled shocks (e.g., financial constraints, as Gilchrist et al. (2017) document for Sweden), markup decisions, or measurement error. The price wedge makes a meaningful contribution to both average sales/price dispersion and to the rise in 2009. Time-varying passthrough is also documented: TFPQ passthrough is countercyclical (more negative in recessions), while demand passthrough is procyclical (falls in recessions when firms receive more extreme idiosyncratic demand shocks). Redoing the variance decomposition with year-by-year passthrough estimates makes demand&amp;rsquo;s contribution to sales dispersion in 2009 even larger, because firms adjust prices less to demand shocks during the recession, leaving more of the demand shock impact in sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across industries and firm types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Sectoral demand elasticity estimates from the pooled 22-sector sample yield an average theta of 3.89 and median of 2.73 for the linear CES model; for the non-linear model, average theta is 3.26 and average eta is 7.42, with substantial positive skew. The median non-linear eta of 5.37 is larger than the pooled estimate of 4.27, indicating the pooled estimate is pulled down by some sectors with smaller deviations from CES. Key empirical results (greater cyclicality of demand dispersion, incomplete TFPQ passthrough) hold within each major sector and across balanced panels, the single-product subsample, and the CUPI price-index sample. Time-varying passthrough is also found to be systematically higher by about 25% in the post-2008 period compared to the pre-2008 period, suggesting a structural shift in how demand shocks transmit to prices, though the paper does not investigate the source of this change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run on the demand and passthrough estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Demand estimation robustness: (1) piece-wise linear specification (elasticity of 2 below average price, 4 above average price, significant at 0.1% level); (2) balanced panel; (3) excluding the Great Recession; (4) using Statistics Sweden firm identifiers instead of authors&amp;rsquo; own; (5) CUPI price index; (6) single-product firms; (7) sector-by-sector estimation; (8) including firm and sector-year fixed effects directly in the nonlinear regression (rather than pre-demeaning). All exercises confirm statistically significant eta and broadly similar theta. Passthrough robustness: (1) OLS vs. IV (lagged shocks) vs. first-differences; (2) balanced panel; (3) single-product subsample; (4) two-period lagged instruments (beta_z = -0.294, beta_epsilon = 0.249); (5) flexible-price subsample; (6) longer-horizon (two- and three-year) first differences for TFPQ. Corroboration: TFPQ innovations are positively associated with reported process innovations in Eurostat CIS data (7% greater TFPQ growth for process innovators); negative demand shocks are correlated with managers reporting &amp;lsquo;insufficient demand&amp;rsquo; in KFI data (8% lower demand growth).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from and relate to Bloom (2009) and Bloom et al. (2018)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bloom (2009) and Bloom et al. (2018) model a single composite firm-level shock (implicitly TFPR) in a CES-demand economy, finding that uncertainty shocks reduce output through wait-and-see behavior but generate a positive volatility effect (OHA) that partly offsets the uncertainty effect. The present paper adds two departures: (1) it separates TFPQ and demand shocks and shows they have distinct empirical and aggregate implications; (2) it replaces CES demand with an estimated non-CES demand curve. Departure (2) reverses the OHA effect, amplifying the total output decline by around 40% relative to the CES model. Departure (1) shows that the uncertainty channel operates primarily through demand, while TFPQ operates primarily through the volatility channel. The quantitative model uses the same non-convex adjustment cost structure and calibration approach as Bloom et al. (2018) to ensure comparability. The paper also relates to Bachmann and Bayer (2013) and Mongey and Williams (2017), who find smaller aggregate effects with adjustment costs only on capital; the present paper notes that adjustment costs on both capital and labor are needed for large wait-and-see effects, but qualitative conclusions are unchanged with capital-only costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy and theoretical implications of the findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, policies aimed at reducing firm-level demand uncertainty (e.g., demand stabilization, aggregate demand management) have larger aggregate output effects than policies addressing productivity uncertainty, because demand uncertainty triggers wait-and-see investment behavior while TFPQ uncertainty is largely absorbed in markups without changing investment much. Second, TFPQ dispersion is still harmful but through misallocation: policies that reduce markup dispersion induced by productivity differentials can raise aggregate output without requiring reduced dispersion per se. Third, the finding that TFPR dispersion is a poor proxy for demand shock dispersion has implications for how researchers use TFPR as a measure of misallocation or uncertainty: it conflates two distinct forces with different aggregate implications. Fourth, the estimated super-elasticity provides a data-disciplined input for calibrating models with real rigidities, directly relevant for the Ball-Romer nominal non-neutrality question—higher real rigidities amplify the output effects of monetary policy shocks. The authors flag this as a natural extension. The scope conditions are: Swedish manufacturing, annual data 1998-2013, partial equilibrium model (aggregate price level exogenous), firms with matching price and utilization data (large-firm bias).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What additional findings are documented regarding the cyclicality of other firm-level variables?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Beyond TFPQ and demand dispersion, the paper documents that dispersion of sales growth, price growth, labor, intermediate goods, and capacity utilization are all countercyclical. The IQR of sales growth was 58% above the non-recession average in 2009 and 9% above in 2001; the IQR of price growth was 83% above in 2009 and 5% above in 2001. The one notable exception is investment, which displays procyclical dispersion (less dispersed during the Great Recession). The paper also documents that roughly 30% of firms report insufficient demand at all their plants in the survey data; average capacity utilization is 88% with median 91% and standard deviation of 14.1%; and about 25% of firm-year observations involve utilization at or above 100%.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Physical total factor productivity (TFPQ)&lt;/strong&gt;: Firm-level quantity productivity: output per unit of inputs, measured from a utilization-adjusted Cobb-Douglas value-added production function. Distinct from revenue TFP (TFPR = p*z) because it abstracts from demand conditions and price-setting. In this paper, TFPQ is estimated within firm over time using the cost-share approach and a capacity-utilization correction from managerial survey data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand shock (epsilon)&lt;/strong&gt;: The idiosyncratic component of a firm&amp;rsquo;s demand curve that captures its ability to sell more (or fewer) units at a given price in a given year, reflecting changes in customer base size or customers&amp;rsquo; willingness to pay. Estimated as the residual from the GIR demand curve after controlling for firm fixed effects, sector-time fixed effects, and the firm&amp;rsquo;s own price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-CES demand curve / super-elasticity (eta)&lt;/strong&gt;: A demand specification adapted from Gopinath, Itskhoki, and Rigobon (2010) in which the demand elasticity is not constant but rises with the firm&amp;rsquo;s price. The parameter eta (estimated at 4.27 in the main sample) governs how fast the elasticity rises with the price: when eta &amp;gt; 0, firms gain few customers by cutting price (elasticity falls as price falls) and lose many customers by raising price (elasticity rises as price rises). This is the source of &amp;lsquo;real rigidity&amp;rsquo; that makes incomplete TFPQ passthrough optimal.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incomplete TFPQ passthrough&lt;/strong&gt;: The empirical finding that firms reduce their prices by far less than one-for-one in response to a productivity gain (estimated beta_z = -0.097 to -0.124, far from the CES benchmark of -1). The paper attributes this primarily to non-CES demand real rigidity (which implies an optimal static passthrough of only 41% given the estimated parameters) and secondarily to adjustment costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Oi-Hartman-Abel (OHA) effect&lt;/strong&gt;: The positive &amp;lsquo;volatility effect&amp;rsquo; in standard CES-demand uncertainty models: because output is a convex function of TFPQ under CES, a mean-preserving spread in productivity raises aggregate output (lucky firms expand more than unlucky firms contract). The paper overturns this result by showing that with non-CES demand (eta sufficiently large), the output-productivity relationship becomes concave, so TFPQ dispersion reduces aggregate output via markup misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wait-and-see channel&lt;/strong&gt;: The mechanism by which uncertainty about future shocks causes firms with non-convex input adjustment costs to pause investment: firms prefer to remain inactive and let inputs depreciate rather than invest or disinvest, at the risk of having to pay an irreversibility cost if the shock turns out to have been in the opposite direction. In this paper, this channel is driven primarily by demand uncertainty because demand shocks determine how many units a firm can sell and hence its desired input level; TFPQ uncertainty does not trigger strong wait-and-see behavior because the optimal scale response to TFPQ shocks is small under non-CES demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markup dispersion / misallocation&lt;/strong&gt;: Dispersion across firms in the ratio of price to marginal cost, arising in this paper from incomplete TFPQ passthrough: firms with high productivity set high markups rather than passing through productivity gains as price cuts. The resulting wedge between prices and marginal costs means that resources are misallocated (too little output at high-productivity firms relative to the social optimum), reducing aggregate output. This is the channel through which TFPQ dispersion harms the aggregate economy in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price wedge (tau)&lt;/strong&gt;: The residual from the passthrough regression: the component of firm price changes unexplained by the estimated TFPQ and demand shocks. Interpreted as capturing un-modeled shocks (financial constraints, markup adjustments) and potentially measurement error. The price wedge makes a meaningful contribution to both average sales/price dispersion and to the Great Recession increase in dispersion.&lt;/p&gt;</description></item><item><title>Entrepreneurial Investment Dynamics and the Wealth Distribution</title><link>https://macropaperwarehouse.com/papers/entrepreneurial-investment-dynamics-and-the-wealth-distribution/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/entrepreneurial-investment-dynamics-and-the-wealth-distribution/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper investigates how the illiquidity of entrepreneurial capital shapes investment dynamics and wealth inequality. The central question is whether entrepreneurship drives wealth heterogeneity or merely attracts the already-wealthy — and, specifically, whether the investment behavior of nascent entrepreneurs can be rationalized by frictions on capital reallocation rather than financial constraints alone.&lt;/p&gt;
&lt;p&gt;The empirical foundation is the restricted Kauffman Firm Survey (KFS), a single-cohort panel of 3,140 U.S. firms founded in 2004 and tracked through 2011. The key measurement is the log average revenue product of capital (log ARPK), residualized on two-digit NAICS industry fixed effects and time dummies. Two striking facts emerge. First, the cross-sectional distribution of log ARPK is left-skewed (skewness approximately -0.33, mean -0.49, standard deviation 1.75, kurtosis 5.7). Second, the distribution shows asymmetric persistence: the autocorrelation of log ARPK in the bottom quintile (ρ₁ = 0.897) is statistically significantly larger than in the top quintile (ρ₅ = 0.443), and the diagonal entry of the estimated transition matrix for the first quintile (0.614) substantially exceeds that for the fifth (0.568). These facts are inconsistent with standard models: a frictionless dynamic investment model with time-to-build predicts i.i.d. ARPK; one with collateral constraints predicts right-skewness and right-tail persistence.&lt;/p&gt;
&lt;p&gt;The model extends Cagetti and De Nardi (2006) by distinguishing between liquid bonds and illiquid entrepreneurial capital. Capital adjustment generates four friction types: a proportional fixed cost (fs) on upward investment, a proportional transaction cost (λ) on downsizing, an additional proportional cost (ζ) on exit, and a minimum capital requirement on entry. The model is calibrated via indirect inference to identifying moments from the KFS (persistence and skewness of log ARPK, investment rate distribution, share of employer firms, entry and exit rates) plus economy-wide targets (entrepreneur fraction, interest rate of 3–4%).&lt;/p&gt;
&lt;p&gt;The FULL-sample calibration yields λ = 0.43 (43% loss on capital sold by continuing entrepreneurs) and ζ = 0.55 (additional 55% write-down upon exit), with a proportional fixed cost fs = 0.035 (3.5%). The effective net collateral constraint is approximately 44% of the real capital value. These frictions are quantitatively large: eliminating them under general equilibrium raises aggregate TFP in the entrepreneurial sector by 23.3% and average welfare by 23.1% in consumption equivalent variation terms. Decomposing the welfare losses relative to a complete-markets benchmark shows that approximately 89% of the total welfare loss (relative to full frictions) is attributable to market incompleteness and financial frictions, with the remaining 11% directly attributable to the illiquidity frictions — that is, frictions alone account for roughly 7.15 percentage points of a total 64.8% lifetime consumption welfare loss.&lt;/p&gt;
&lt;p&gt;A key finding on wealth inequality contradicts prior literature. When calibrated to KFS micro-data, the model generates a Gini coefficient of 0.65 (FULL sample) or 0.53 (NAICS54), well below the empirical U.S. Gini of approximately 0.8. The top 1% hold only 26% of wealth in the FULL calibration versus roughly 30% empirically. This contrasts with Quadrini (2000) and Cagetti and De Nardi (2006), who match the wealth distribution by calibrating to PSID or SCF household survey data. The reason for the gap is the left-skewed, illiquidity-depressed returns to entrepreneurship in the KFS: the calibrated returns to scale (ν = 0.79 FULL, 0.82 NAICS54) and the transaction costs together suppress the variance of capital income returns. Removing illiquidity frictions raises the Gini from 0.65 to 0.77 (fixed-r partial equilibrium) or 0.72 (general equilibrium), demonstrating that capital illiquidity compresses the wealth distribution by depressing average entrepreneurial returns.&lt;/p&gt;
&lt;p&gt;Three policy experiments — credit expansion (reducing borrowing spreads à la SBA 7(a) programs), a government buyer-of-last-resort for used capital (Resale I), and exit-cost reduction (Fire sale) — all raise welfare by 0.07–0.15% in consumption equivalent terms and TFP by 0.5–0.9% relative to benchmark. Resale policies are preferred by entrepreneurs; workers prefer the credit policy. All three policies benefit lower-wealth households more than wealthy ones (the richest decile suffers welfare losses due to the savings tax used to finance the programs). The paper concludes that policies addressing capital illiquidity can yield welfare gains comparable to or exceeding standard credit provision programs, and that the distinction between illiquidity risk and financial constraint risk has first-order importance for policy design.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What are the two core empirical facts from the KFS that motivate the paper, and why do standard models fail to generate them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, the cross-sectional distribution of log ARPK among KFS firms is left-skewed (skewness ≈ -0.33), not symmetric or right-skewed. Second, log ARPK shows higher persistence in the left tail (autocorrelation ρ₁ = 0.897 for bottom-quintile firms) than in the right tail (ρ₅ = 0.443). A frictionless dynamic model with time-to-build predicts i.i.d. log ARPK that inherits the distribution of TFP innovations, generating no skewness under Gaussian shocks and no persistence. Models with collateral constraints (as in Cagetti and De Nardi 2006) generate right-skewed ARPK with right-tail persistence, because constrained firms operate below optimal scale, pushing ARPK above the unconstrained optimum. Neither class of models can produce the left-skewed, left-tail-persistent pattern in the KFS.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the mechanism by which partial irreversibility generates left-skewness and left-tail persistence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Partial irreversibility creates an asymmetry between the purchase price and the resale price of capital (the resale price being 1 − λ per unit). When a bad productivity shock hits, the option value of waiting to recover is higher than the cost of holding excess capital, so entrepreneurs adopt a &amp;lsquo;wait-and-see&amp;rsquo; attitude and maintain oversized firms rather than downsizing immediately. This creates a left tail of low-ARPK, large-capital firms. Moreover, since the incentive to wait is itself persistent (the transitory bad shock must resolve before the entrepreneur will downsize), the left tail displays higher autocorrelation. The exit cost ζ amplifies this for the exit margin: entrepreneurs with poor draws stay in business longer than is efficient, further extending the left tail. The right tail is not symmetrically elongated because entrepreneurs seeking to expand face a different option value (the call option value of capital rises), leading them to invest to smaller sizes, slightly thickening the right tail — but not enough to overcome the left-tail extension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the calibration strategy, and which parameters are identified by which moments?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Eleven parameters are jointly calibrated to KFS moments via indirect inference. The key mappings are: the downsizing transaction cost λ is identified by the asymmetric left-tail persistence of log ARPK (the ratio ρ₁/ρ₅ increases monotonically in λ); the exit cost ζ is identified by the skewness of log ARPK (higher ζ monotonically increases left skewness); the collateral constraint ϕ also affects skewness but has no monotone effect on ρ₁/ρ₅, aiding separation; the returns to scale ν is identified by the coefficient from a log-revenue on log-capital regression for employer firms; the fixed investment cost fs is identified by the fraction reporting positive investment; TFP shock autocorrelation ρ_z is identified by investment rate autocorrelation; the shock standard deviation σ_z by the coefficient of variation of investment rates; and the worker signal distortion and entrepreneur signal distortion parameters control entry and exit rates respectively. The discount factor β pins down the interest rate. Two separate calibrations are run: one targeting full KFS sample moments (FULL) and one targeting the modal industry — Professional, Scientific and Technical Services (NAICS54, 24.7% of the sample) — as a robustness check.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main calibrated parameter values and how do they compare across the FULL and NAICS54 calibrations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For the FULL calibration: λ = 0.43, ζ = 0.55, ϕ = 0.92, fs = 0.035, ρ_z = 0.66, σ_z = 0.43, ν = 0.79, β = 0.9265, α_e = 0.63. For NAICS54: λ = 0.53, ζ = 0.75, ϕ = 0.035, fs = 0.23, ρ_z = 0.66, σ_z = 0.43, ν = 0.82, β = 0.94, α_e = 0.50. The illiquidity parameters (λ and ζ) are larger in NAICS54 than in FULL. The collateral constraint parameter ϕ differs substantially (0.92 FULL versus 0.035 NAICS54), though the net effective collateral constraint (accounting for λ and depreciation) converges to a similar range in both calibrations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are the illiquidity and financial friction channels distinguished both theoretically and empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Theoretically, collateral constraints (parameterized by ϕ) make the lower support of log ARPK truncated from the left (log ARPK ≥ log(r+δ) - log α), generating right-skewness and right-tail persistence. Illiquidity frictions (λ and ζ), by contrast, induce a wait-and-see option value that extends the left tail of ARPK while leaving the right tail relatively thinner, generating left-skewness and left-tail persistence. Empirically, the paper proposes using the sign and magnitude of the skewness of log ARPK (negative implies illiquidity dominates; positive implies financial frictions dominate) and the ratio of left-tail to right-tail persistence (ρ₁/ρ₅ &amp;gt; 1 indicates illiquidity frictions, &amp;lt; 1 indicates financial frictions) as discriminating statistics. Separately, the portfolio composition of entrepreneurs offers a further discriminating test: increasing illiquidity drives entrepreneurs to hold more liquid assets (flight to liquidity), while tightening collateral constraints pushes entrepreneurs toward more illiquid assets in their portfolios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the aggregate TFP and welfare findings from the counterfactual analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under general equilibrium, removing all illiquidity frictions (λ = ζ = fs = 0) raises entrepreneurial sector TFP by 23.3% and average economy-wide welfare by 23.1% in consumption equivalent variation. Under partial equilibrium (fixed interest rate), welfare gains are even larger: 24.8% (entrepreneur subgroup) and 58.3% (worker subgroup), for an economy-wide average of 16.6%. The GE result is somewhat lower because the interest rate adjusts when more capital flows into entrepreneurship. The average productivity of entrepreneurs (conditional on being an entrepreneur) is 8.8% higher in the no-friction world than in the benchmark. The TFP gains arise from both extensive-margin selection (higher-productivity entrepreneurs enter; lower-productivity ones exit) and intensive-margin reallocation (high-productivity firms operate closer to optimal scale; low-productivity firms downsize rather than persist).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper decompose total welfare losses between market incompleteness and the illiquidity distortions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Buera and Shin (2011), the paper computes welfare as a fraction of lifetime consumption relative to a complete-markets benchmark (a social planner&amp;rsquo;s problem where the planner allocates occupational choice and capital optimally). Relative to complete markets, the economy with no illiquidity frictions but with market incompleteness loses approximately 57.7% of lifetime consumption. The benchmark economy (with all frictions) loses approximately 64.8% of lifetime consumption relative to complete markets. The difference — approximately 7.15 percentage points — is attributed to the illiquidity frictions. As a share of the total frictional loss, about 89% is attributable to market incompleteness and financial frictions, and 11% to the illiquidity frictions. While 11% may seem small as a fraction, in absolute terms it is economically non-trivial.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the paper find that entrepreneurship cannot match the empirical wealth distribution when calibrated to the KFS?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model generates a Gini of 0.65 (FULL) or 0.53 (NAICS54) against a U.S. empirical Gini of approximately 0.8. The top 1% holds roughly 26% of wealth in the FULL calibration versus around 30% empirically. Two factors suppress capital income risk in the KFS-calibrated model. First, the calibrated returns to scale (ν = 0.79 FULL, 0.82 NAICS54) are lower than those used by Cagetti and De Nardi (2006) (ν ≈ 0.88), which were calibrated to PSID/SCF data on large-ish successful firms. Lower ν translates exponentially into lower variance of capital income. Second, the illiquidity frictions directly depress average returns to entrepreneurship by raising the user cost of capital and forcing entrepreneurs into suboptimal firm sizes. These two forces together prevent the model from generating the thick right tail of wealth needed to match empirical distributions. The paper argues that the KFS captures &amp;lsquo;broad&amp;rsquo; small-scale entrepreneurship, not the high-growth, high-return entrepreneurs who likely account for the top of the wealth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does capital illiquidity affect the wealth distribution conditional on holding returns to scale fixed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;More illiquid capital (higher λ or ζ) compresses the wealth distribution and lowers the Gini coefficient. The Gini rises from 0.65 (benchmark FULL calibration) to 0.77 under partial equilibrium without illiquidity frictions, and to 0.72 under general equilibrium without illiquidity frictions (while holding the net collateral constraint constant). The NAICS54 benchmark Gini is 0.53, rising to 0.76 (PE) or 0.68 (GE) without illiquidity frictions. The mechanism is that illiquid capital depresses the average return to entrepreneurial wealth, which compresses the income process and reduces the variance of wealth accumulation. Additionally, illiquid capital forces entrepreneurs to hold more bonds as a liquidity buffer, reducing the overall scale of their business investment and thus their lifetime income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three policy experiments and their comparative findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The three policies are all financed by a proportional tax on bond savings returns. (1) Credit expansion: the government subsidizes borrowing intermediation costs (analogous to SBA 7(a)/CDC 504 programs), reducing the spread between the saving and borrowing rate. Economy-wide welfare rises by about 0.147%; TFP rises by about 0.9% relative to benchmark. Workers benefit more (0.169%) than entrepreneurs (-0.006% average for all entrepreneurs, since most wealthy entrepreneurs do not borrow and pay the tax). (2) Resale policy I (Buyer of last resort for all used capital): government offers a higher resale price q ≥ 1 − λ. Economy-wide welfare rises about 0.076%; TFP rises 0.6%. Entrepreneurs gain (0.084%) while workers also gain (0.074%) indirectly through the option value of future entrepreneurship. (3) Fire-sale (exit cost reduction only, Resale II): government subsidizes exiting entrepreneurs&amp;rsquo; capital resale. Economy-wide welfare rises 0.073%; TFP rises 0.5%. Workers prefer credit; entrepreneurs prefer resale policies. Wealthiest decile suffers welfare losses under all three policies. All welfare numbers are in consumption equivalent variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to Cagetti and De Nardi (2006) and where does it diverge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper builds directly on the Cagetti and De Nardi (2006) framework of occupational choice and incomplete markets with collateral constraints, extending it by separating liquid bonds from illiquid physical capital. In Cagetti and De Nardi (2006), bonds and capital are perfect substitutes; the sole friction is a collateral constraint that limits investment. The paper shows that this one-asset framework generates right-skewed ARPK and right-tail persistence — inconsistent with KFS facts. The paper&amp;rsquo;s two-asset framework with partial irreversibility generates left-skewed ARPK and left-tail persistence. Furthermore, Cagetti and De Nardi (2006) calibrate to PSID/SCF income data and successfully match the wealth distribution; the paper shows this success partly reflects the higher returns to scale implied by those data. When calibrated directly to KFS firm-level data, the model substantially undershoots the empirical wealth inequality, because the KFS captures a representative sample of small-scale entrepreneurs with genuinely lower returns to scale and significant illiquidity frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the options value effect and the collateral constraint channel in the model, and how do they differ?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The options value effect is described as the primary distortion. When capital is illiquid (λ or ζ &amp;gt; 0), the put option value of capital falls (selling capital is costly), raising the threshold signal required for workers to enter entrepreneurship, and raising the threshold signal required for incumbents to exit. As a result, entry rates fall, exit rates fall, potential entrepreneurs delay entry, and poorly performing entrepreneurs overstay. Along the intensive margin, the asymmetric purchase/resale price leads entrepreneurs planning to downsize to wait (operating larger-than-optimal firms) and entrepreneurs planning to invest to be more cautious (operating smaller-than-optimal firms). The collateral constraint channel is a secondary effect: illiquid capital reduces the net resale value that can serve as collateral (effective constraint = (1-λ)(1-δ)(ϕ)k&amp;rsquo;), tightening the borrowing constraint even when the formal collateral parameter ϕ is moderate. Crucially, while tighter ϕ forces entrepreneurs to hold more illiquid capital (no flight to liquidity), higher λ forces entrepreneurs to hold more liquid assets (flight to liquidity) — a key empirical distinction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness exercises does the paper conduct?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper runs two separate full calibrations: one to the entire KFS sample (FULL) and one to the modal industry NAICS54 (Professional, Scientific and Technical Services, 24.7% of the sample). Both calibrations are used to assess the wealth distribution findings. The paper also examines moments at the two-digit industry level (only one industry shows statistically significant results due to small sample size, though most show economically significant signs). An additional measurement error parameter is explored in the appendix, where capital is assumed to be observed with multiplicative log-normal error; this helps improve model fit to the data. All policy experiments are computed under both partial equilibrium (fixed interest rate) and general equilibrium. The paper also analytically proves (in the appendix) the ARPK distribution properties for the four benchmark frameworks (frictionless, time-to-build only, static collateral constraints, and dynamic collateral constraints), establishing the theoretical necessity of partial irreversibility for the facts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in welfare effects is documented across the wealth distribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under all three policy experiments, welfare gains decrease with wealth. The poorest households gain the most in consumption equivalent variation terms because they receive a disproportionate share of the program&amp;rsquo;s benefits (better borrowing conditions, higher resale prices, improved option value of entrepreneurship) while paying a smaller absolute share of the savings tax used to finance the programs. The top 10% richest households — who are the primary taxpayers — experience welfare losses under all three policies. This pattern holds across credit, resale, and fire-sale policies, though the magnitude varies. Separately, entrepreneurs (who are wealthier on average, with over 50% concentrated in the top wealth decile) mostly lose from the credit policy (they fund it but don&amp;rsquo;t directly borrow) while gaining from resale policies (they benefit from higher capital resale prices regardless of wealth position). Workers (who are generally poorer) overwhelmingly gain from credit policies since the option value of switching to entrepreneurship rises substantially.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper imply for interpreting the literature on financial constraints and entrepreneurship?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper issues several cautionary findings. First, the implied formal collateral parameter is relatively loose (ϕ = 0.92), consistent with Hurst and Lusardi (2004), Nanda (2011), and Robb and Robinson (2014) — who find no evidence that average entrepreneurs face severe financial constraints. However, once illiquidity is accounted for, the effective (net) collateral constraint is only about 44% of real capital value, consistent with Evans and Jovanovic (1989) and Cagetti and De Nardi (2006). This suggests that what appears empirically as &amp;lsquo;financial constraint&amp;rsquo; is partly a manifestation of capital illiquidity: banks lend less against entrepreneurial capital because its resale value is low, not primarily because of limited commitment. Second, empirical studies using regional variation in financial conditions to identify financial constraint effects may suffer from omitted variable bias, since resale prices of capital are also highly correlated with local financial conditions. Third, aggregate statistics such as startup rates and investment levels cannot distinguish between illiquidity shocks and financial constraint shocks; portfolio composition (the ratio of liquid to illiquid assets) is a more informative diagnostic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s contribution to the misallocation literature relative to Hsieh and Klenow (2009), Asker et al. (2014), and Midrigan and Xu (2014)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Hsieh and Klenow (2009) and Asker et al. (2014) focus on the dispersion of log MRPK as a measure of misallocation, where adjustment costs (similar to fs and λ here) can generate observed dispersion without implying inefficiency. Midrigan and Xu (2014) focus on financial constraints (similar to ϕ) as the source of misallocation. The paper argues that these frameworks produce observationally equivalent outcomes in terms of log MRPK dispersion alone, making it impossible to distinguish between the two. The paper&amp;rsquo;s contribution is to show that the skewness of log ARPK and the asymmetric tail persistence are additional moments that can discriminate between the two types of frictions: negative skewness and left-tail dominance point to illiquidity frictions, while positive skewness and right-tail dominance point to financial frictions. This provides a new empirical diagnostic tool for decomposing sources of capital misallocation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Average Revenue Product of Capital (ARPK)&lt;/strong&gt;: In the paper&amp;rsquo;s usage, ARPK = Y_it / K_{i,t-1}, the ratio of a firm&amp;rsquo;s real revenue to its beginning-of-period real capital stock, used as the primary measure of capital productivity. Log ARPK is residualized on two-digit NAICS industry fixed effects and time dummies before analysis, removing industry-level heterogeneity in capital shares and aggregate shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial irreversibility&lt;/strong&gt;: The friction arising from an asymmetry between the purchase price of new capital (normalized to 1) and the resale price of used capital (1 − λ for downsizing incumbents, and (1 − ζ)(1 − λ) for exiting entrepreneurs). This is modeled as a proportional transaction cost on capital sales and is interpreted as the difficulty of recouping original investment, analogous to a low resale value of used entrepreneurial equipment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wait-and-see attitude&lt;/strong&gt;: The behavioral response of entrepreneurs facing downside productivity shocks when capital is illiquid: rather than immediately downsizing or exiting upon a bad shock, they maintain larger-than-optimal firm sizes while waiting for conditions to improve. This is optimal because the transaction cost of selling capital makes the option of waiting (and possibly recovering) more valuable than the cost of operating an oversized firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Net collateral constraint (effective collateral parameter)&lt;/strong&gt;: Denoted ϕ̃ = (1 − λ)(1 − δ)ϕ, this is the fraction of entrepreneurial capital&amp;rsquo;s real value that can actually be pledged as collateral, after accounting for the reduced resale value from illiquidity (1 − λ) and physical depreciation (1 − δ). The paper distinguishes this from the formal limited-commitment parameter ϕ to show that observed financial constraints partly reflect capital illiquidity rather than contracting failures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Options value effect&lt;/strong&gt;: The mechanism through which capital illiquidity distorts both the entry/exit decision and the intensive margin of investment. For downsizing incumbents, the put option value of capital (the option to sell it) falls when the resale price is low, inducing them to delay disinvestment. For potential entrants, the call option value of capital (the upside of entering) falls because losses upon exit are larger, raising the productivity signal threshold for entry. This is described as the primary distortion channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Span-of-control parameter (returns to scale, ν)&lt;/strong&gt;: The parameter ν ∈ (0,1) in the entrepreneurial production function y = z(k^{α_e} l^{1-α_e})^ν, capturing the extent to which managerial talent becomes diluted as firm size increases. The paper identifies ν = 0.79 (FULL) from the coefficient of a log-revenue on log-capital regression for employer firms, and shows that ν is the dominant determinant of the variance of capital income returns and hence the model&amp;rsquo;s ability to generate wealth inequality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalent variation (CEV)&lt;/strong&gt;: The welfare metric used throughout the paper. For each household i, CEV µ_i is defined as the percentage increase in reference-economy consumption (or lifetime consumption stream) that makes the household indifferent between the reference economy and the economy of interest. Positive CEV means the new economy is preferred. Aggregate welfare is the distribution-weighted average of individual CEVs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymmetric persistence&lt;/strong&gt;: The empirical fact, documented in the KFS, that log ARPK shows higher autocorrelation at the bottom quintile (ρ₁ = 0.897) than at the top quintile (ρ₅ = 0.443), confirmed by both a conditional autocorrelation regression and a quintile transition matrix. This asymmetry is a key moment used to identify and distinguish illiquidity frictions (which produce left-tail persistence) from collateral constraints (which produce right-tail persistence).&lt;/p&gt;</description></item><item><title>Entry decision, the option to delay entry, and business cycles</title><link>https://macropaperwarehouse.com/papers/entry-decision-the-option-to-delay-entry-and-business-cycles/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/entry-decision-the-option-to-delay-entry-and-business-cycles/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; US cohorts of establishments born in recessions persistently employ fewer workers at entry and over their life cycle, yet are on average more productive than expansionary cohorts; the number of entrants is procyclical and roughly four times as volatile as aggregate employment. Standard firm-dynamics models cannot reproduce this strong, persistent selection of entrants without generating excessive variation in aggregate variables, because the expected lifetime value of entry is relatively insensitive to aggregate shocks of reasonable magnitude. The paper asks what makes initial aggregate conditions matter so much for the selection of entrants, and answers: potential entrants&amp;rsquo; ability to delay entry, a margin missing from existing frameworks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model setup.&lt;/strong&gt; The author builds a discrete-time, infinite-horizon firm-dynamics model with endogenous entry and exit, building on Moreira (2015) in the style of Hopenhayn (1992). The only aggregate shock is an exogenous AR(1) aggregate demand shock z. Heterogeneous incumbents differ in idiosyncratic productivity s (AR(1)) and customer capital b (accumulated from past sales, depreciating at rate δ), operate under monopolistic competition, draw a random fixed operating cost each period, and may exit endogenously or via a random exit shock γ. A constant mass of potential entrants holds heterogeneous signals q about post-entry productivity, drawn from a time-invariant Pareto distribution W(q). The key deviation: entrants may keep their signal and delay, observing a new z next period (probability τ of retaining the signal; τ=0 nests the standard model, τ=1 is the baseline). This creates a non-negative option value of delay V^w(q,z) that rises with q and with z.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings (with magnitudes).&lt;/strong&gt; The option to delay generates a countercyclical opportunity cost of entry: for reasonable parameters, entrants postpone until the present value of entry is up to twice the fixed entry cost. The threshold signal is countercyclical, so recessionary cohorts are fewer but more productive. Expected delay duration ranges from zero to six periods (years), negatively correlated with q. Calibrated to BDS establishment data 1977-2015 (a period is a year), with ρz=0.57, σz=0.0022, and τ=1 (an alternative identification gives τ=0.965, with nearly identical dynamics). The mechanism raises the variance of the number of entrants, for a given shock process, by about seven times. Recessionary (expansionary) cohorts employ 5.7% fewer (5.0% more) workers than the average cohort, persisting beyond 15 years; shutting down delay (τ=0) collapses this to ~1%, so ~80% of cohort-employment variation comes from delayers. Average recessionary productivity is ~3% higher under τ=1 vs only 0.4% under τ=0. The full model explains more than three-fourths of the persistence and variance of aggregate employment (model autocorrelation 0.57 vs data 0.61; std 0.012 vs 0.015). Empirically, cohort-level employment differences are driven by the composition (high-productivity/high-growth share), not the number, of entrants; the persistent customer-capital process plays a minor role (&amp;lt;7%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implications.&lt;/strong&gt; Validating against the Great Recession: cohorts entering 2008-2016 account for ~45% of the depth (of an 8.9% drop in 2012) and ~85% of the slow recovery by 2016 in the data; the model reproduces ~39% of the 2012 depth and ~75% by 2016, with most of it coming from the entry margin. A standard model without delay, calibrated to the same facts, requires σz ~7x larger, yields aggregate-employment variance 1.7x the data, and predicts a Great-Recession employment drop twice as large as observed. Matching aggregate employment instead requires aggregate-demand-shock autocorrelation 1.40x and variance 25x higher. Ignoring the option to delay therefore yields misleading predictions about entrants&amp;rsquo; responses to permanent, temporary, and anticipated (news) policy shocks.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism that amplifies the effect of initial aggregate conditions on entrant selection?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The option to delay entry. Because entering today and entering tomorrow are mutually exclusive, waiting carries a non-negative option value V^w(q,z) that rises with the signal q and with aggregate demand z. With this intertemporal choice, a firm enters only if its gross value of entry exceeds the &lt;em&gt;total&lt;/em&gt; opportunity cost = fixed entry cost ce + option value of delay. This total cost is countercyclical (up to twice ce in recessions), so the threshold signal q*(z) becomes much more elastic to z. Even a small change in the relative benefit of entering today vs tomorrow shifts selection substantially, whereas without delay (τ=0) entry follows a neoclassical rule — enter if net lifetime benefits are non-negative — and the threshold barely moves with z.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does a firm ever find it optimal to delay, given it forgoes period profits?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The decision hinges on the net value of waiting, V^w(q,z) − (V^gross(q,z) − ce). The aggregate demand level at entry affects not only first-period profits but also the expected post-entry survival rate (1−γ)G(c*_f), which is procyclical: in recessions the expected long-run value is lower, raising the risk of premature post-entry failure. This procyclical &amp;lsquo;discount factor&amp;rsquo; makes entry during expansions more valuable. Medium-productivity firms wait until the expected survival rate is high enough to compensate for low early-life demand. The author stresses that without irreversible and endogenous exit, the benefits of waiting would always be negative — endogenous exit risk is essential to the mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Who delays, and who does not?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Delay has no effect on high- and low-productivity potential entrants; only medium-range-signal firms (q in [q*&lt;em&gt;{τ=0}(z), q*&lt;/em&gt;{τ=1}(z)]) find it profitable to wait for better aggregate demand. The lower the aggregate demand, the wider this range. At the business-cycle peak, nobody delays, so selection coincides with and without the option.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical identification strategy and its main threat?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using the Business Formation Statistics (BFS), based on IRS EIN/SS-4 applications matched to BDS new employer businesses, the author separates applications that form a business within the first four quarters (First 4Q) from the second four quarters (Second 4Q), 2004Q3-2016Q4. The &amp;lsquo;wait-and-see&amp;rsquo; channel is identified from the share of late start-ups = Second4Q/(First4Q+Second8Q), which is significantly countercyclical (Fact 2). The main confound (Fact 3&amp;rsquo;s threat): bad aggregate conditions could lengthen the &lt;em&gt;time required to build&lt;/em&gt; a business (e.g., harder credit access in recessions) rather than reflecting deliberate waiting. The author controls for this using the average duration of business formation within the first four quarters and the total number of formations within eight quarters; the countercyclical share of late start-ups survives (Table 2, coefficient -0.304*** on HP real-GDP cycle). A separate caveat: the author cannot evaluate the &lt;em&gt;economic&lt;/em&gt; magnitude of the channel from data, because entrants who delay AND delay applying for EINs, or who apply but never return, are unobserved — hence the quantitative role is assessed via the structural model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the testable implication that distinguishes the mechanism, and is it borne out in data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model predicts that recessionary cohorts have, on average, HIGHER long-run survival rates than expansionary cohorts (countercyclical survival), because firms wait until expected survival is high enough. Without the option (τ=0) the model produces acyclical survival rates. In BDS data 1979-2015, cohort survival rates at ages g=1..5 are persistently negatively correlated with aggregate conditions at entry (e.g., for S3, corr with HP real-GDP cycle = -0.38, p=0.02; corr with Ihp = -0.46, p=0.00), robust across HP, linear-trend, unemployment, and NBER indicators, and across firm- vs establishment-level units. Note two counteracting forces: low demand directly lowers survival (higher failure) but raises it via selection; the net countercyclicality supports the selection channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the model calibrated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;17 parameters; a period = a year, unit = establishment. β=0.96 (4% riskless rate). Demand/customer-capital/productivity parameters from Foster et al. (2008, 2016): ρs=0.814, price elasticity ρ=1.622, demand-to-customer-capital elasticity η=0.919, depreciation δ=0.188. Entrant-distribution, selection, survival, size, and growth parameters (q, ξ, ce, μf, σf, γ, b0, σ_s, σ_e, α) jointly matched to BDS cohort moments (average entry rate ~12.1%, entrant employment share, size and survival to 30 years, employment share to age 5). The aggregate demand process (ρz=0.57, σz=0.0022) is calibrated to the autocorrelation (0.25) and std (0.06) of the HP-filtered (smoothing 100) entry rate. τ set to 1; an alternative strategy using the aggregate-employment time series identifies τ=0.965.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper decompose the source of persistent cohort-employment differences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Counterfactuals (Table 6) hold the variation in the &lt;em&gt;number&lt;/em&gt; of entrants fixed while varying composition. &amp;lsquo;Adjust lowest s&amp;rsquo; (number variation from low-productivity firms) yields small, transient cohort-employment effects; &amp;lsquo;adjust highest s&amp;rsquo; yields large, persistent effects. The baseline lies between them: medium-productivity firms that delay amplify the procyclical variation in &lt;em&gt;high-productivity&lt;/em&gt; entrants, raising persistence. This matches Decker et al. (2014) and Pugsley-Sedlacek-Sterk: a small share of high-growth firms drives cohort contributions, and ex-ante entrant types explain most post-entry performance. The &amp;lsquo;only selection&amp;rsquo; counterfactual (shutting demand effects on post-entry firms) shows the customer-capital process contributes less than 7% to cohort-employment persistence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the impulse-response analysis illustrate propagation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A one-time negative demand shock sized to cut entrants by 25% (the Great-Recession magnitude): the baseline economy takes 3 years to recover half the employment decline and another 12 years to recover an additional 25%. An economy where the shock does not affect the entry margin recovers three-fourths of the decline in only 2 years, even when the shock is enlarged to match the baseline&amp;rsquo;s initial employment drop. Persistent entry-margin shocks accumulate, substantially deepening and prolonging the downturn (Table 9).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With the option to delay, entrant responses depend on the &lt;em&gt;relative&lt;/em&gt; benefit of entering today vs tomorrow, so policy effects vary with type, magnitude, timing, and duration. (1) A temporary cut in fixed entry cost raises the number of entrants more than a permanent cut during recessions, with equal effect in expansions; marginal entrants are high-productivity firms in recessions, low-productivity in expansions. Without the option, the response is invariant to policy duration. (2) News of a future entry-cost cut (after T periods) weakly &lt;em&gt;raises&lt;/em&gt; the threshold signal in all states — i.e., reduces entry today — and for small T this indirect, entry-deterring effect can dominate the eventual entry boost; standard models would only transmit such news through general-equilibrium channels. Scope: results derive from a partial-equilibrium reduced form; the author argues (Appendix A.3) that in general equilibrium the option value stays non-negative, so the entry threshold is weakly higher than in models without persistent signals, though procyclical wages partly offset the procyclical-discount-factor force.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It addresses the Samaniego (2008) result that entry/exit are insensitive to reasonable productivity shocks and the Lee-Mukoyama (2018) &amp;lsquo;puzzle&amp;rsquo; of generating strong entrant selection. Rather than imposing cyclical entry costs (Lee-Mukoyama 2018), an entry function (Sedlacek-Sterk 2019), or exogenous entry-specific shocks (Clementi-Palazzo 2016; Sedlacek-Sterk 2017), it derives amplified selection endogenously from the option to delay. It complements &amp;lsquo;missing generation&amp;rsquo; (Gourio-Messer-Siemer) and demand-side (Sedlacek-Sterk; Moreira) explanations of procyclical cohort employment, extends the real-options literature (Bernanke 1993; Dixit-Pindyck 1994; Pindyck 2009; Bloom 2009) to the entry margin, and reinforces Sedlacek-Sterk&amp;rsquo;s finding that entry-stage selection, not post-entry choices, drives cohort contributions to aggregate fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What extensions and robustness checks are provided?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) A two-stage entry phase (Appendix A.1) micro-founds the constant mass of potential entrants by adding an &amp;lsquo;aspiring start-up&amp;rsquo; free-entry stage, calibrated so only ~13% of aspiring start-ups (cq=0.022) become actual entrants, reconciling the low BFS application-to-employer-business transition rate (~14% over two years). (2) Allowing accumulation of delayed potential entrants (Appendix A.2) &lt;em&gt;amplifies&lt;/em&gt; cyclical differences across cohorts and increases procyclical entry-rate variation. (3) A general-equilibrium version (Appendix A.3) shows the model performs at least as well as standard models. Empirical results are robust to alternative cycle definitions (HP, linear trend, unemployment deviations, NBER), to firm- vs establishment-level units, to annual vs quarterly BFS data, and to ten-year pre-crisis cohort averages in the Great-Recession exercise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What caveats does the author flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model generates a countercyclical average entrant size (consistent with Lee-Mukoyama 2015 for manufacturing plants) but at odds with Sedlacek-Sterk&amp;rsquo;s finding of procyclical entrant size in BDS; the author conjectures that allowing procyclical initial customer capital would only widen cyclical cohort-employment differences. The economic magnitude of the wait-and-see channel cannot be measured directly because key delaying groups are unobserved in BFS. Other Great-Recession forces (credit crunch, structural change in entrants) are not modeled and could also explain the 2008-2016 cohort employment drop. Explaining whether delayed entrants actually return to the market is left for future research.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Option value of delay (V^w(q,z))&lt;/strong&gt;: The present value a potential entrant forgoes by entering today instead of retaining its productivity signal and entering in a future period. It is non-negative everywhere, weakly increases in the signal q and in aggregate demand z, and exists only because exit is irreversible and endogenous (otherwise waiting would never pay).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Countercyclical opportunity cost of entry&lt;/strong&gt;: The total cost of entering — fixed entry cost ce plus the option value of delay — which rises in recessions (up to twice ce). It endogenously raises the elasticity of entry to aggregate demand and creates a group of firms that stay out despite positive expected net profits.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Threshold signal q&lt;/em&gt;_τ(z)&lt;/em&gt;*: The minimum productivity signal at which a potential entrant chooses to enter at aggregate state z. It is countercyclical; under τ=1 it equals the signal at which gross entry value equals the total opportunity cost, and it is far more elastic to z than the τ=0 (no-delay) threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Signal q and probability of recalling the signal τ&lt;/strong&gt;: q is a potential entrant&amp;rsquo;s heterogeneous, time-invariant signal about its initial post-entry productivity (drawn from Pareto W(q)). τ is the probability a delaying entrant keeps that signal next period; τ=0 collapses the model to a standard framework, τ=1 is the baseline (calibrated; identified value τ=0.965).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Customer capital (b)&lt;/strong&gt;: A demand-side stock tied to a firm&amp;rsquo;s past sales, depreciating at rate δ, that shifts demand for its differentiated good. Because it accumulates from prior sales, it slows firms&amp;rsquo; demand adjustment and creates persistence in production and employment, distinct from productivity differences (per Foster et al. 2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wait-and-see channel&lt;/strong&gt;: The empirical counterpart of the option-to-delay mechanism: a bad aggregate state at entry induces some potential entrants to postpone forming a business, raising the (countercyclical) share of late start-ups in BFS data, distinct from recessions merely lengthening the time required to build a business.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Recessionary vs expansionary cohorts&lt;/strong&gt;: Cohorts of establishments that begin operating when aggregate demand is below (z&amp;lt;1) vs above (z&amp;gt;1) the stochastic steady state. Recessionary cohorts are fewer, more productive, higher-survival, and persistently smaller in employment.&lt;/p&gt;</description></item><item><title>Expecting Floods: Firm Entry, Employment, and Aggregate Implications</title><link>https://macropaperwarehouse.com/papers/expecting-floods-firm-entry-employment-and-aggregate-implications/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/expecting-floods-firm-entry-employment-and-aggregate-implications/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies how the &lt;em&gt;expectation&lt;/em&gt; of rising flood risk — distinct from realized flood events — reshapes where firms locate, where workers live and how much they work, and what this implies for U.S. aggregate output. The motivation is climate-driven: roughly 6 million Americans lived within a 100-year flood zone in 1998, rising to 13 million by 2018, and FEMA floodplains are projected to grow about 45% by century&amp;rsquo;s end. Prior work largely studied actual floods or housing-price effects; this is among the first to examine firm entry and employment responses to anticipated risk.&lt;/p&gt;
&lt;p&gt;Data and design: The authors digitize FEMA Special Flood Hazard Zone maps (historic Q3 maps tied to 1998 Flood Insurance Rate Maps, and 2018 National Flood Hazard Layer), measuring flood risk as the share of land area within flood zones at the county and ZIP-code (ZCTA) level. Average flood-zone share rose 1.5 percentage points from 1998 to 2018, with a 20-pp increase at the 90th percentile of ZIP-level changes. Firm entry/exit, employment, population and county real GDP come from Census Business Dynamics Statistics, ZIP Codes Business Patterns, and BEA; actual flood events come from the Dartmouth Flood Observatory. The baseline specification is a two-period (1998, 2018) fixed-effects regression with county (or ZCTA) fixed effects, state-by-year fixed effects, demographic/economic controls (female labor share, manufacturing share, population density, China import-penetration change), and a control for actual flooded area.&lt;/p&gt;
&lt;p&gt;Main reduced-form findings: A one-standard-deviation (7-percentage-point) increase in flood risk over 1998-2018 reduced firm entry by 1.2%, employment by 1.2%, population by 0.8% (smaller than employment, implying both relocation and labor-supply margins), and real GDP by 2.4%. Firm exits also &lt;em&gt;declined&lt;/em&gt; with higher risk (smaller magnitude), reflecting reduced business dynamism. A county at the 90th percentile of risk increase saw a 3.3% drop in firm entry. ZIP-level estimates are similar. An IV using the interaction of rest-of-state risk change with local geo-climatic conditions (rainfall, temperature, evaporation) yields comparable magnitudes (entry -1.2%, employment -1.4%, GDP -2.2%); a placebo (1990-1998 outcomes) test is insignificant. In sharp contrast, actual flood &lt;em&gt;events&lt;/em&gt; had negligible effects on entry, exit, employment and population, but a one-SD (0.4) increase in flooded-area share lowered real GDP by 0.2% in the same year, driven by current-year shocks (lagged effects negligible).&lt;/p&gt;
&lt;p&gt;Model and quantification: The authors build a spatial-equilibrium model (McFadden 1978 location choice, Krugman 1980 monopolistic competition) with M = 2,772 counties (96% of 2018 GDP), σ = 5, exit rate κ = 0.08. Flood risk operates through three channels: direct damage, an employment channel (relocation + endogenous labor supply), and a love-of-variety channel (fewer firms). Damage parameters are disciplined by reduced-form evidence (δ = 0.005, δκ = 0.003) and Barrage (2020) (η = 0.002); labor-supply elasticities φL = 1.55, φM = 0.83 are set by indirect inference targeting employment and population responses. Non-targeted moments (output, entry, exit) match the data.&lt;/p&gt;
&lt;p&gt;Counterfactuals: Eliminating 2018 flood risk shows it reduced aggregate output by 0.52% (employment -0.31%, firm entry -0.30%, welfare -0.51%). Decomposition: direct damage -0.11% (21%), labor relocation 0%, labor supply -0.33% (63%), variety -0.08% (15%) — so about 80% of the loss is expectation-driven and 20% direct damage. Effects are highly unequal: top-5% and top-1% counties (by output loss) lost 7.9% and 13.9% of output. A projected 4.5% rise in at-risk properties (2020-2050) would cut output 0.12%. Extensions (entry costs in goods, interregional trade, capital and land) yield somewhat larger losses (0.57%, 0.62%, 0.67%). Policy implication: counting only direct damages badly understates disaster costs and the social cost of carbon, because firms and workers rationally adjust to anticipated risk.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core design is a two-period (1998 and 2018) fixed-effects regression of log outcomes (firm entry, exit, employment, population, real GDP) on the share of land in FEMA flood zones, absorbing locality fixed effects (time-invariant characteristics like industry composition), state-by-year fixed effects (statewide growth/business cycles), demographic/economic controls, and a control for actual flooded area. The main threat is measurement error in FEMA risk maps: some underlying data are outdated, and political-economy incentives lead politicians and homeowners to resist map updates to avoid higher insurance premiums, so designations may reflect politics rather than true risk. A second threat is omitted local economic trends correlated with both risk and outcomes. The authors address measurement error with a Bartik-type IV (rest-of-state average risk change interacted with own geo-climatic features — satellite temperature, cumulative rainfall, evaporation), controlling for cumulative past flooded area. IV estimates are close to the fixed-effects ones (entry -1.2%, employment -1.4%, GDP -2.2%), with first-stage KP F-statistics around 63-66. A placebo/pre-trend test (regressing 1990-1998 changes on 1998-2018 risk changes, following Goldsmith-Pinkham et al. 2020) yields small, insignificant coefficients, arguing against omitted-trend confounding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms, and how are they distinguished empirically and in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three channels: (1) direct damage — realized floods lower firm productivity and firm survival; (2) employment channel — anticipated risk lowers real wages/amenities, prompting out-migration and reduced labor supply per household; (3) love-of-variety — fewer firms enter, reducing the variety component of welfare/output. Empirically, the authors distinguish &lt;em&gt;flood risk&lt;/em&gt; (long-run anticipation) from &lt;em&gt;flood events&lt;/em&gt; (short-run realization) by estimating both: risk hits entry/employment/population strongly while events do not, but events hit current-year GDP (productivity) while risk hits it more through adjustment. In the model, direct damages are calibrated from the actual-flood GDP and exit responses (δ, δκ); the employment and variety channels are separated in the counterfactual by sequentially allowing population shares, then labor supply, then variety to respond. The decomposition attributes -0.11% to direct damage, ~0% to labor relocation (offsetting in- and out-migration), -0.33% to labor supply, and -0.08% to variety.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does population fall less than employment, and why do firm exits decline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Employment falls 1.2% while population falls only 0.8% for a one-SD risk increase, implying the response is not purely relocation — remaining households also reduce labor supply. This motivates introducing a positive labor-supply elasticity φL alongside migration elasticity φM, capturing &amp;lsquo;immobile labor&amp;rsquo; (as in Autor et al. 2013) where some workers cut hours rather than move. Firm exits decline with higher risk even though floods mechanically raise closures, because higher risk deters entry so much that the stock of firms shrinks, lowering the base of firms that can exit — reflecting reduced business dynamism rather than greater firm survival.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Large regional dispersion. While national output fell 0.52%, the top-5% and top-1% counties by output loss lost 7.9% and 13.9% of output respectively (the abstract describes top-5% losses of 7-14%). The hardest-hit counties — coastal and riverine areas in southern and eastern regions (e.g., Cape May NJ, Marion County FL, Sharkey County MS) — lost population, labor supply per household, and firms (top-1% counties: -6.1% population, -4.7% labor supply per household, -10.8% firms). Conversely, mildly affected counties (some Midwestern) were &amp;lsquo;winners,&amp;rsquo; gaining in-migration, more firm entry, and higher labor supply per worker. For the 2020-2050 projection, direct damages play a &lt;em&gt;smaller&lt;/em&gt; relative role (12% vs 21% for 2018) because projected risk increases are more positively correlated with regional productivity, amplifying aggregate adjustment effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Controlling vs. not controlling for actual flooded area leaves risk estimates stable. (2) ZIP-code-level regressions exploiting finer spatial variation give similar magnitudes (establishments -0.233, employment -0.240, payroll -0.221). (3) Restricting to counties with available Q3 (1998) FEMA maps gives qualitatively similar, slightly larger estimates (Appendix Table A.2); the authors conservatively use baseline estimates for calibration. (4) IV estimation and (5) placebo pre-trend tests as above. (6) Lagged flood shocks (Appendix A.4) have negligible effects, confirming floods act through current-year productivity. (7) Model non-targeted moments (output, entry, exit) match data, and model-data correlations of regional GDP, population, emp-to-pop ratio, and firm count are near unity. (8) The implied regional-population-to-real-wage elasticity φM(1+φL) ≈ 2.1 lies within the 1.1-2.5 range from Fajgelbaum et al. (2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What model extensions are explored and how do results change?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Four extensions, all yielding somewhat larger output losses than the 0.52% baseline: (1) entry costs paid partly/fully in final goods rather than labor — with α=1 the loss is 0.57%, because final-goods prices respond more to risk than wages; (2) interregional trade with traded/nontraded sectors — requires a larger labor-supply elasticity (φL=1.72) to match data, giving a 0.62% loss; (3) capital (mobile, rented at constant global rate) and land (fixed, congestion force) in production — 0.67% loss, since risk also lowers the capital-to-labor ratio (by 0.34%) as capital becomes relatively more expensive, outweighing land congestion (small land share). The authors read the modest size of these differences as evidence the simplified baseline captures the key forces.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contributes to climate-spatial-economics work (Costinot et al. 2016, Desmet et al. 2021, Alvarez &amp;amp; Rossi-Hansberg 2021, Rudik et al. 2021). Closest are three flood-aggregate studies: Desmet et al. (2021) on coastal-flooding costs via migration and local technology investment; Balboni (2019) on infrastructure misallocation under sea-level risk; Lin et al. (2021) on coastal housing construction. Differences: prior work focuses mainly on coastal land inundation from sea-level rise, whereas this paper uses historic flood-zone designation maps capturing overall flood risk and studies production damage rather than land loss; and it reconciles structural estimates with reduced-form evidence showing firm/worker responses to &lt;em&gt;risk&lt;/em&gt; differ from responses to &lt;em&gt;actual floods&lt;/em&gt;. Relative to Kocornik-Mina et al. (2020) (satellite-nightlight evidence that floods reduce output transiently), this paper confirms the short-run finding but shows risk has larger, longer-run effects via behavioral adjustment. It relates to Hino &amp;amp; Burke (2020) (same risk data; floods cut property values 1-2%), interpreting housing-price effects as amenity changes; their estimate implies a 0.3-0.6% utility loss, comparable to the paper&amp;rsquo;s calibrated amenity loss of 0.2%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central implication is that evaluations counting only direct flood damages substantially understate true costs, since about 80% of the 0.52% 2018 output loss comes from expectation-driven adjustments (labor supply, migration, fewer firms) rather than the 20% direct damage. Direct damages (-0.11%) match FEMA&amp;rsquo;s ~$17B/year (~0.1% of GDP) estimate, validating the model&amp;rsquo;s lower bound. Policies addressing climate damage — and estimates of the social cost of carbon — should incorporate firms&amp;rsquo; and workers&amp;rsquo; long-run general-equilibrium adjustments. Scope conditions: the analysis is U.S.-specific (chosen for systematic flood-risk data), uses establishments as &amp;lsquo;firms,&amp;rsquo; abstracts from flood insurance (justified by near-actuarially-fair pricing evidence) and from explicit housing, treats unmapped areas as zero-risk, and assumes observed FEMA designations are the risk signal agents act on despite measurement error. The authors note the approach generalizes to other natural disasters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable caveats or limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;GDP data do not capture variety/welfare changes, so the love-of-variety channel matters for welfare but is invisible in GDP-based estimates. The amenity parameter η is not directly estimated but imported from Barrage (2020) (output-to-utility damage ratio ~3); the authors note η has little effect on national productivity impact because amenity mostly drives offsetting migration. Labor supply is assumed fixed before shocks (micro-founded by job-search frictions). Flood insurance and housing are not modeled explicitly. Risk is measured by flood-zone land share, which is converted to flood probabilities {rm} via a regression of 2015-2019 actual flooded shares on 2018 zone shares. The two-period long-run design limits dynamics, and counties without FEMA maps are assigned zero risk.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Flood risk vs. flood events&lt;/strong&gt;: The paper sharply separates anticipated flood risk (the share of local land in FEMA Special Flood Hazard Zones, a long-run signal firms/workers observe and act on) from realized flood events (the share of area actually flooded in a given year, from Dartmouth data). Risk drives firm-entry and employment relocation; events drive transient productivity/GDP losses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expectation effects (vs. direct damages)&lt;/strong&gt;: Output losses arising because firms and workers rationally adjust location, entry, and labor supply in anticipation of flood risk — comprising the employment and variety channels. In 2018 these accounted for about 80% (the employment channel 0.33% plus variety 0.08% of the 0.52% loss), four times the 20% from direct physical damage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employment channel&lt;/strong&gt;: In the model, the mechanism by which higher flood risk lowers real wages and amenities, inducing both out-migration (relocation, ~0% net aggregate effect due to offsetting regions) and reduced labor supply per household (the dominant -0.33% component), governed by elasticities φM (migration) and φL (labor supply).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Love-of-variety channel&lt;/strong&gt;: The output/welfare loss from fewer firms entering under higher risk, operating through the CES variety term (agglomeration force 1/(σ-1)). It reduced 2018 output by 0.08% and matters for welfare but is not captured in GDP data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Direct damage channel&lt;/strong&gt;: The component of flood losses from realized floods lowering firm productivity (parameter δ=0.005) and destroying a fraction of firms (δκ=0.003) plus amenity loss (η=0.002), calibrated from the short-run actual-flood reduced-form estimates; it caused a 0.11% output decline in 2018 (21% of the total).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect inference calibration&lt;/strong&gt;: The simulated-method-of-moments procedure (Gouriéroux &amp;amp; Monfort 1996) used to set labor-supply elasticities φL=1.55 and φM=0.83: running the same 1998-vs-2018 panel regressions on model-generated data and choosing elasticities so model employment and population responses to flood risk match the empirical coefficients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Immobile labor&lt;/strong&gt;: Following Autor et al. (2013), the model feature that some households respond to local flood risk by reducing labor supply rather than relocating, which is why employment falls more (1.2%) than population (0.8%) and motivates a positive labor-supply elasticity φL.&lt;/p&gt;</description></item><item><title>Firm dynamics, monopsony, and aggregate productivity differences</title><link>https://macropaperwarehouse.com/papers/firm-dynamics-monopsony-and-aggregate-productivity-differences/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-dynamics-monopsony-and-aggregate-productivity-differences/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Firms are larger and grow faster over the life cycle in high-income countries, while labor markets in poorer countries are less competitive (employers hold more wage-setting power). The paper asks how important employer labor market power (monopsony) is for explaining cross-country differences in firm dynamics and aggregate productivity. The novelty is that beyond the standard static misallocation-of-workers channel, monopsony also distorts &lt;em&gt;selection into entrepreneurship&lt;/em&gt; and &lt;em&gt;productivity-enhancing technology adoption&lt;/em&gt;, potentially making the losses larger than prior static estimates suggest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and setup.&lt;/strong&gt; Stylized facts come from the World Bank Enterprise Surveys (WBES), an establishment-level survey of non-agricultural, non-financial private firms with at least 5 full-time permanent employees, covering more than 90 countries from 2006 to 2021, merged with World Development Indicators GDP per capita (2017 constant USD). The estimation sample restricts to countries that ever had GDP per capita above 25,000 USD and to manufacturing firms with non-missing sales/workers/material/capital data, yielding 37,096 firm-year observations across 31 middle- and high-income countries (poorest: Kazakhstan, 19,615 USD in 2009; richest: Ireland, 91,791 USD in 2020). Local labor markets are defined as location-industry (2-digit ISIC v3.1) pairs. The model is a dynamic general-equilibrium neoclassical-monopsony model with occupational choice (entrepreneur vs. wage worker), endogenous productivity investment, and Card-et-al.-style taste-for-employer (amenity) differentiation that gives firms wage-setting power. It is calibrated to the Netherlands (GDP per capita 54,275 USD; median wage markdown 1.301, implying firm-level labor supply elasticity 3.318) via method of simulated moments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings.&lt;/strong&gt; Empirically, moving from poorer to richer countries in the sample, average firm age triples from 11 to nearly 30 years; annualized firm growth rises ~1.6 percentage points per year per doubling of GDP per capita; the share of firms doing R&amp;amp;D more than doubles (from ~15% to &amp;gt;40%); product innovation rises from 20% to 80% and process innovation from 20% to 50%; and median wage markdowns fall (from ~2.25 at 25,000 USD GDP per capita — workers paid ~55% below marginal product — to ~1.25 at 60,000 USD — paid 20-25% below). The calibrated model matches a right-skewed firm-size distribution, life-cycle growth, employer turnover, age distribution, and R&amp;amp;D share (sum of squared deviations between empirical and simulated moments = 1.7%). In counterfactuals raising the markdown from 1.2 to 3, average firm growth shrinks by more than half (from ~150% to ~50%), average firm size falls from ~60 to ~45 employees, the innovating share halves (from ~40% to ~25%), and average firm productivity is ~20% higher in competitive markets. Differences in wage markdown alone account for &lt;strong&gt;25%&lt;/strong&gt; of observed cross-country TFP variation (model TFP std dev 0.051 vs. data 0.201), and &lt;strong&gt;no less than 11%&lt;/strong&gt; across robustness checks. In a Netherlands-vs-Greece decomposition, about &lt;strong&gt;85%&lt;/strong&gt; of the model-implied TFP gap is attributable to lower technology adoption, ~9% to distorted selection into entrepreneurship, and ~6% to static employment reallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms and implications.&lt;/strong&gt; Labor market competition acts as a “skill-biased” force favoring high-productivity firms through three channels: (i) static labor reallocation toward high-productivity, low-amenity firms; (ii) improved selection into entrepreneurship (low-productivity high-amenity agents stop being able to profitably attract workers as ϵL rises); and (iii) higher returns to innovation. The policy implication is that raising labor market competition in less-developed economies could yield substantial productivity gains, and that prior static studies understate the cost of monopsony because they omit the dynamic investment/selection channels.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/calibration strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is calibrated to the Netherlands using a mix of externally set and internally estimated (method-of-simulated-moments) parameters. Externally: model period = 1 year; σν (Gumbel scale) normalized to 1; β = 0.961 (4% annual rate); δw = 0.025 (40-year working life); revenue elasticity of labor ξ = 0.333 (estimated via control function in Section 2); labor supply elasticity ϵL = 3.318 backed out from median markdown 1.301 via ϵL = 1/(µ−1). Six parameters {c_f, c_x, p_i, p_n, σ_z, σ_a} are estimated by MSM. The markdown itself is a key input and is estimated as the ratio of marginal revenue product of labor to wage, with revenue elasticity ξ from a standard control-function approach. Threats: the markdown estimate drives the whole quantitative exercise; the WBES sample is truncated at firms with ≥5 employees (biasing toward larger firms), addressed by re-estimating with imputed moments; and the cross-country counterfactual attributes all variation in ϵL to labor market power while holding all other parameters at Netherlands values, so other cross-country differences are not separately identified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three mechanisms and how are they distinguished quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Static labor allocation: lower competition raises marginal factor cost only for sufficiently high-productivity firms, reallocating employment toward less-productive, lower-paying employers. (2) Selection into entrepreneurship: when ϵL is low, amenities matter more for profits, letting low-productivity high-amenity agents profitably self-select into entrepreneurship. (3) Technology adoption: returns to innovation increase with ϵL, so weak competition lowers the share of firms investing. They are distinguished via a decomposition that sequentially fixes policy functions at benchmark levels: ~6% of the TFP loss is from employment allocation alone, ~85% from the distortion to innovation policy, and ~9% from distorted selection into entrepreneurship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across firms is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Firms differ in entrepreneurial productivity z and amenity a. Average revenue product of labor rises with productivity and falls with amenities, and this dispersion is much steeper under weak competition: the elasticity of APL with respect to productivity is 0.31 in the baseline (Netherlands) vs 0.79 in the counterfactual (Greece), and with respect to amenities -0.28 vs -0.81. High-productivity, low-amenity firms face the biggest barriers in less-competitive markets and stay inefficiently small; low-productivity, high-amenity firms are propped up. Innovation distortion is concentrated among high-productivity firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Four main checks, each reported as the share of cross-country TFP variation explained (data std dev 0.201): (1) Productivity-amenity correlation — allowing entrants to draw correlated (z,a) with σ_za = 0.296 (matching Sockin 2024’s 0.622 wage-satisfaction correlation) lowers explained variation to ~15% (model std dev 0.030), because correlation reduces scope for reallocation. (2) Costs in terms of labor instead of final goods (per Klenow and Li 2025) gives ~22% (std dev 0.044). (3) Imputed firm-level moments covering all firms (not just ≥5 employees) gives ~14% (std dev 0.028). (4) Over-identified alternative identification using size/age/R&amp;amp;D shares and annualized growth gives ~11% (std dev 0.023). The headline range is therefore 25% baseline, no less than 11% across checks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on static monopsony cost estimates: Berger et al. (2022, eliminating US labor market power raises average wage 48%, welfare +6% of lifetime consumption); Armangüé-Jubert et al. (2025, labor market power explains 15% of GDP-per-capita gap over development); Deb et al. (2022, less competition lowered US low/high-skill wages 12% and 11%); Amodio et al. (2025b, eliminating monopsony in Peru raises earnings 26%); Bachmann et al. (2022, monopsony caused a 10% aggregate productivity loss in East Germany). Its contribution is to add the entrepreneurial-selection and innovation channels, yielding larger losses than static studies, and to bridge the monopsony-cost literature with the misallocation literature (Restuccia-Rogerson, Guner et al., Hsieh-Klenow).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Raising labor market competition (higher firm-level labor supply elasticity) improves allocative efficiency, selection into entrepreneurship, and innovation, raising firm growth and aggregate productivity. Scope conditions: the quantitative results apply to middle- and high-income countries (sample restricted to those ever above 25,000 USD GDP per capita); the 25% headline depends on the assumption that initial productivity and amenities are independent (falls to ~15% under positive correlation); and the decomposition attributing 85% to innovation is specific to the Netherlands-vs-Greece comparison. The model treats labor supply elasticity differences as the sole varying parameter, so the counterfactuals isolate the labor-market-power channel rather than reproducing total cross-country income gaps.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the Netherlands-vs-Greece comparison specifically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Greece has roughly half the GDP per capita of the Netherlands (29,000 vs 54,000 USD) and much weaker competition (wage markdown 2.623 vs 1.301, labor supply elasticity 0.616 vs 3.318). In the Greece counterfactual, average firm size is 26 vs 59 employees, life-cycle growth 84.5% vs 153%, average age 22.5 vs 30 years, and R&amp;amp;D investing share 18% vs 41%. Labor market competition differences explain 29% of the firm-size gap, 27% of the firm-age gap, and 74% of the R&amp;amp;D-share gap between the two countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the model get right that was not targeted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The firm size and age distributions are not targeted yet are matched: in the data ~57.6% of firms have &amp;lt;20 employees and ~6.2% have &amp;gt;100; ~60% of firms are under 30 years old and ~10% over 60. The estimated parameters imply investing firms are 15% more likely to grow (p_i=0.649 vs p_n=0.499); innovation and operating costs equal ~43% and ~8% of average incumbent profits respectively; standard errors are small, indicating informative moments.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Firm Heterogeneity, Market Power and Macroeconomic Fragility</title><link>https://macropaperwarehouse.com/papers/firm-heterogeneity-market-power-and-macroeconomic-fragility/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-heterogeneity-market-power-and-macroeconomic-fragility/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Ferrari and Queirós ask why US recoveries have become progressively slower and argue that rising firm heterogeneity and market power — well-documented long-run trends — can substantially increase the probability that a moderate aggregate shock triggers a quasi-permanent slump rather than a transitory recession. They call this probability macroeconomic fragility.&lt;/p&gt;
&lt;p&gt;The theoretical framework is an RBC model with oligopolistic (Cournot) competition, endogenous firm entry, and elastic capital and labor supply (GHH preferences). The economy consists of many product markets; within each market, firms with heterogeneous idiosyncratic TFP compete in quantities, with the marginal firm earning zero net profit. A central complementarity drives the results: more competition raises factor shares and factor prices, which expands factor supply, which in turn allows more firms to enter, sustaining high competition. This complementarity can generate multiple stochastic steady-states — a high-competition, high-output regime and a low-competition, low-output regime.&lt;/p&gt;
&lt;p&gt;Two forces increase fragility by shrinking the basin of attraction around the high steady-state. First, a mean-preserving spread (MPS) in idiosyncratic TFP: the dominant firm expands market share, factor shares fall (market-power effect), the factor price index drops, and smaller firms approach their exit threshold — requiring only a smaller shock to trigger cascading exit. Second, rising fixed production costs: the unstable steady-state shifts toward the high steady-state, narrowing the gap and making downward transitions more likely.&lt;/p&gt;
&lt;p&gt;The model is calibrated three times — to match COMPUSTAT moments in 1975, 1990, and 2007 — varying only the log-normal standard deviation of idiosyncratic productivity (λ = 0.182, 0.213, 0.232) and the fixed cost parameter (c × 10⁻³ = 0.351, 0.691, 0.751). The fixed-to-total-cost ratio in COMPUSTAT rises from 21.9% in 1975 to 31.7% in 1990 to 36.9% in 2007; the standard deviation of log revenues rises from 1.59 to 1.91 to 2.04.&lt;/p&gt;
&lt;p&gt;The quantitative results are stark. The 1975 economy has a unimodal ergodic distribution (one stable steady-state); the 1990 and 2007 economies are bimodal (two stable steady-states). When subjected to the same TFP shock sequence (εt = −σε for four quarters), output falls 4.0% after five quarters in the 1975 economy, 5.1% in 1990, and 5.9% in 2007; after 100 quarters, the 2007 economy remains 6.3% below pre-shock output, against 3.0% for 1990 and 1.3% for 1975. For a larger shock (εt = −2σε for six quarters), only the 2007 economy transitions permanently to the low steady-state, with output 12.5% below trend after 100 quarters. The minimum shock required to trigger a downward transition is 6.84σε for the 1990 economy but only 1.62σε for the 2007 economy. In Monte Carlo simulations, the probability of a recession exceeding 10% of output over a 40-quarter window is 1.7% in 1975, 12.4% in 1990, and 19.6% in 2007. In expectation, the 2007 economy experiences such a recession every 70 years, the 1990 economy every 95 years, and the 1975 economy every 380 years.&lt;/p&gt;
&lt;p&gt;Applying the 2008–09 TFP shocks to the 2007-calibrated model generates a persistent deviation from trend: output is 12.1% below trend by 2019, investment 14.4% below, and hours 9.8% below — closely matching the data (14.2%, 14.7%, and 5.5% respectively). The same shocks applied to the 1975 and 1990 economies produce no permanent transition; by 2040 the 1975 (1990) economy is only 1.5% (4.7%) below trend.&lt;/p&gt;
&lt;p&gt;Cross-industry evidence corroborates the mechanism. Using US Census and BLS data on 791 six-digit NAICS industries, the authors find that a 1 percentage point higher pre-crisis four-firm concentration ratio (CR4) in 2007 is associated with 1.8–1.9 percentage points lower employment growth, 2–3 percentage points lower net firm entry, and a larger decline in the labor share between 2007 and 2016. These qualitative and quantitative patterns are matched by simulated cross-industry regressions from the model.&lt;/p&gt;
&lt;p&gt;On policy, an entry subsidy that eliminates fixed-cost barriers for the approximately 11.8% of markets with positive fixed costs can prevent downward transitions and yields a welfare gain of roughly 10% in consumption-equivalent terms in the 2007 economy. A revenue subsidy applied to all firms achieves welfare gains between 30% and 50% for a 20% subsidy rate, acting as a steady-state selection device by shifting probability mass from the low to the high competition regime. These gains are nonlinear: even a 5% revenue subsidy yields roughly a 20% welfare gain in the 2007 economy. The gains are in line with Edmond et al. (2023), who find welfare costs of markups up to 50%.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model&amp;rsquo;s identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is primarily theoretical and quantitative rather than identification-based in the econometric sense. The causal claim — that rising firm heterogeneity and fixed costs increase macroeconomic fragility — comes from two sources: (1) analytic comparative statics (Propositions 4–6) that formally show fragility rises with a mean-preserving spread on TFP or with fixed costs, and (2) calibration counterfactuals where the 1975, 1990, and 2007 economies face the same shock sequence but differ only in λ and c. The cross-industry regressions are reduced-form and subject to standard endogeneity concerns — pre-crisis concentration could be correlated with industry-specific demand shocks coinciding with 2008. The authors partially address this by including pre-crisis growth trends as controls and sector fixed effects, but do not use an instrumental variable for concentration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism linking firm heterogeneity to fragility, and how is it distinguished from steady-state multiplicity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanism runs through factor markets. When idiosyncratic TFP dispersion rises (MPS), the dominant firm expands market share and charges a higher markup, depressing the aggregate factor share (Proposition 4). This reduces the factor price index and real wages, contracting labor supply. Marginal firms, already earning near-zero profits, move closer to their exit threshold. A smaller aggregate shock suffices to push them out, triggering cascading exit, a further collapse in competition, a further fall in factor prices, and a self-reinforcing transition to the low steady-state. Fragility is distinct from multiplicity: the existence of two steady-states is a necessary but not sufficient condition for fragility. Fragility specifically measures the size of the basin of attraction around the high steady-state from below — how large a shock is needed to trigger a downward transition. An economy can have two steady-states but be highly resilient if the basin is wide.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What roles do the three model channels (endogenous market structure, oligopolistic markups, elastic factor supply) play quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors isolate each channel by shutting it down one at a time and comparing output volatility (Table 8). In the baseline, the standard deviation of log output is 0.063 and autocorrelation is 0.975. Fixing the number of firms (removing the endogenous market structure channel, leaving only elastic factor supply) reduces output standard deviation to 0.035, accounting for 55% of baseline volatility. Replacing oligopoly with monopolistic competition (constant markups, love-for-variety active) recovers 0.049 — approximately 78% of baseline — implying the endogenous markup channel accounts for about one-fourth of total amplification. The love-for-variety channel accounts for another approximately one-fourth. Crucially, all three alternative models exhibit unimodal ergodic distributions, confirming that all three channels are jointly required to generate steady-state multiplicity and the model&amp;rsquo;s nonlinear amplification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented and how does it motivate the model&amp;rsquo;s calibration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Rising US firm heterogeneity is documented along three dimensions: (1) standard deviation of log revenues (sales) for COMPUSTAT firms, rising from 1.59 in 1975 to 1.91 in 1990 to 2.04 in 2007; (2) the average ratio of fixed (SG&amp;amp;A) to total costs (fixed + COGS), rising from 21.9% in 1975 to 31.7% in 1990 to 36.9% in 2007; (3) sales-weighted average markups for public firms rising from 1.28 in 1975 to 1.37 in 1990 to 1.46 in 2007 (from De Loecker et al., 2020). These moments are the calibration targets for the time-varying parameters λ and c. The structural parameters (elasticities of substitution σI = 1.46 and σG = 11.50) are time-invariant and calibrated jointly to the markup levels across the three years.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper&amp;rsquo;s account of the Great Recession differ from other slow-recovery theories?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Most related theories attribute slow recovery to (1) the zero lower bound on interest rates and constrained monetary policy (Christiano et al., 2015; Eggertsson et al., 2019; Guerrieri and Lorenzoni, 2017), (2) endogenous TFP decay through R&amp;amp;D decisions (Anzoategui et al., 2019; Bianchi et al., 2019; Queralto, 2020), or (3) declining firm entry per se (Clementi and Palazzo, 2016). Ferrari and Queirós instead argue the 2008 shock was not unusually large — the same shock does not cause a permanent transition in the 1975 or 1990 economies — but rather that the US economy had become structurally more fragile over the preceding decades due to rising concentration and fixed costs. The closest related model is Schaal and Taschereau-Dumouchel (2018), who also use coordination failures among oligopolistic firms to generate multiple steady-states. The key contribution of Ferrari and Queirós relative to that work is the explicit role of cross-sectional firm heterogeneity in determining the probability of transitions, and the empirical documentation that rising heterogeneity preceded the crisis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the cross-industry empirical results in detail?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The dataset covers 791 six-digit NAICS industries from the US Census, SUSB, and BLS, with the concentration variable defined as CR4/CR50 (top-4 share scaled by top-50 share). Key results: (1) Employment: a 1 pp higher CR4/CR50 in 2007 is associated with 1.77–1.89 pp lower annualized employment growth between 2007 and 2016 (significant at 1%); robust to controlling for pre-crisis employment trends and sector fixed effects. (2) Payroll: similarly negative coefficient of approximately −0.041 on log payroll growth. (3) Net firm entry: a 1 pp higher concentration is associated with 2–3 pp lower post-crisis net entry. (4) Labor share: a negative relationship between 2007 concentration and the change in industry labor share between 2008 and 2016 (coefficient approximately −0.031, significant at 10%). All results are mirrored qualitatively and quantitatively in simulated cross-industry regressions from the model: concentrated markets in the model experience 5.4% larger drops in employment, 3.7% higher firm exit, and 1.1% larger decline in labor share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and extensions are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several extensions and checks are noted: (1) An alternative shock — fluctuations in the fraction of industries with positive fixed costs (xc) rather than TFP shocks — also replicates the medium-run behavior of the US economy, with output falling roughly 15% on impact and remaining −18% below trend in the long run; the cross-sectional implications are unchanged. (2) The 1990 recession counterfactual: applying 1990–1991 recession shocks to the 1990 economy produces no permanent transition, but the same shocks applied to the 2007 economy do, confirming that fragility rather than shock size drove the 2008 outcome. (3) Factor-price-dependent fixed costs: Ferrari and Queirós (2022) show steady-state multiplicity is preserved when fixed costs depend on factor prices. (4) Varying M: results are unchanged for M = 50 and M = 100 potential firms per market. (5) The cross-industry regressions are robust across multiple specifications including controls for the number of firms in 2007, pre-crisis growth, and sector fixed effects (Appendix B.7).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the model&amp;rsquo;s aggregate predictions for labor share, profit share, and markups post-2008, and how do they compare to data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Between 2007 and 2016, the model predicts (Table 9): a 0.4 pp decline in the aggregate labor share (data: −2.9 pp decline; the model explains approximately 14% of the total decline, or 17% accounting for the pre-crisis trend); a 0.9 pp increase in the profit share (data: +3.2 pp; model explains 30% of the trend deviation); a 3.7 point increase in sales-weighted markups for COMPUSTAT firms (data: +14.2 points; model explains 26% of the total increase and 58% of the deviation from the pre-crisis trend). The model also predicts a persistent fall in the number of firms in markets with positive fixed costs of 13.4 log points, compared to the observed 15.1 log point decline in the number of US firms with at least one employee. The model understates the magnitude of all these changes, but correctly signs and persists them, consistent with its role in providing a partial explanation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies two interventions: (1) An entry subsidy covering a fraction τf of fixed costs for markets with c &amp;gt; 0 (roughly 11.8% of all markets). A 5% entry subsidy is sufficient to eliminate the welfare costs associated with multiplicity in the 2007 economy; higher subsidies improve allocation within the high steady-state. An entry subsidy large enough to prevent downward transitions yields approximately 10% welfare gain in consumption-equivalent terms. The effect is highly targeted and quantitatively modest per-dollar because only 11.8% of markets are affected. (2) A revenue subsidy τR applied to all firms, equivalent to a fraction of revenues subsidized. Even a 5% revenue subsidy generates approximately 20% welfare gain in the 2007 economy by shifting probability mass from the low to the high competition regime. A 20% revenue subsidy yields gains between 30% and 50% in the 1990 and 2007 economies. The gains are nonlinear in the economies with multiple steady-states, and much smaller in the 1975 economy, which has only one steady-state. A revenue tax has asymmetric large welfare costs in the 1990 economy (which has large output gaps between regimes) relative to the 2007 economy (smaller gap but higher transition probability). The welfare gains come from two sources: reducing static markup distortions and reducing the dynamic cost of transitions (quasi-permanent slumps).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What caveats and limitations does the paper acknowledge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors are explicit about several limitations. First, the model lacks sunk entry costs: all entry decisions are static, which may understate hysteresis and overstate the responsiveness of exit to shocks. Introducing sunk costs with oligopolistic competition poses a computational challenge (20^10 partial equilibria for M=20 and 10 values per firm). Second, idiosyncratic productivities are time-invariant, ruling out Schumpeterian creative destruction within the model. Third, the model features only one-sided market power (product markets only); recent work on labor-market oligopsony could interact with the mechanism. Fourth, the model has no monetary policy channel; the interaction between monetary policy and endogenous market structure is left for future research. Fifth, the model explains only a fraction of the observed post-2008 declines in the labor share (14–17%), profit share (30%), and markup levels (26% of total, 58% of trend deviation), suggesting complementary mechanisms are at work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper characterize the relationship between the Great Moderation and rising fragility?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper directly addresses the apparent tension between the Great Moderation (declining aggregate output volatility from 1980 to 2007) and the model&amp;rsquo;s prediction of rising fragility over the same period. The resolution is that aggregate output volatility is the product of exogenous TFP shock volatility and endogenous amplification. If exogenous TFP shocks became less volatile over time (a plausible claim, attributed to demographic shifts and the rising share of low-volatility service industries), then aggregate volatility could have declined even as endogenous amplification increased. Fragility, as defined in the paper, is about the probability of large discrete transitions, not about the variance of the ergodic distribution around a single steady-state. An economy can exhibit lower volatility on average while being more prone to catastrophic (quasi-permanent) downturns.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Macroeconomic Fragility&lt;/strong&gt;: The probability of long slumps, formally measured as the proximity of the high stable steady-state to the preceding unstable steady-state (χ = KU/K*). A higher χ means a smaller negative shock is sufficient to trigger a permanent downward transition. Fragility is distinct from steady-state multiplicity (which is necessary but not sufficient) and distinct from stability (which measures the full basin of attraction in both directions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competition-Factor Supply Complementarity&lt;/strong&gt;: The positive feedback loop through which more competitive product markets generate higher factor shares and factor prices, inducing higher labor and capital supply, which in turn allows more firms to enter and compete. This complementarity is the structural foundation for multiple steady-states in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mean-Preserving Spread (MPS) on Idiosyncratic TFP&lt;/strong&gt;: An increase in cross-firm productivity dispersion that leaves the average unchanged. In the model&amp;rsquo;s context, an MPS raises aggregate TFP (allocative efficiency effect as output shifts to high-productivity firms) but lowers the factor share and factor price index (market power effect as concentration increases), and shrinks the stable steady-state&amp;rsquo;s capital level while raising the unstable steady-state&amp;rsquo;s capital level — thereby increasing fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Low Competition Trap&lt;/strong&gt;: The low stable steady-state in which the economy becomes trapped following a transition from the high steady-state. Characterized by fewer active firms, higher markups, lower factor shares, lower capital stock, and lower output relative to the high steady-state. In the 2007 calibration, the two steady-states are approximately 21% apart in output terms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Market Structure&lt;/strong&gt;: The model feature whereby the number of active firms in each product market is determined endogenously by a free-entry condition: the marginal firm exactly breaks even (net profits equal fixed costs). This makes the number of firms — and hence the degree of competition, markups, and factor shares — respond endogenously to aggregate shocks and capital accumulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Factor Price Index (Θ)&lt;/strong&gt;: A composite of the wage and rental rate representing the minimum cost of one unit of output for a firm with unit productivity. In the model, Θ equals the product of the aggregate factor share and aggregate TFP. It serves as a sufficient statistic for both factor prices and the competitive environment, decreasing with higher firm heterogeneity (via lower factor shares) and increasing with more firms (via higher competition).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Great Deviation&lt;/strong&gt;: The paper&amp;rsquo;s term (following Hall, 2011) for the persistent and widening gap between actual US output and its pre-2007 trend following the 2008–09 recession. In the data, real GDP per capita was 14.2% below its pre-crisis trend as of 2019Q1, a deviation far larger and more persistent than in any prior postwar recession. The paper&amp;rsquo;s model rationalizes this as a transition to the low steady-state.&lt;/p&gt;</description></item><item><title>From Population Growth to TFP Growth</title><link>https://macropaperwarehouse.com/papers/from-population-growth-to-tfp-growth/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/from-population-growth-to-tfp-growth/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks how the well-documented slowdown in labor-force growth affects aggregate total factor productivity (TFP) growth, a question that prior work on business dynamism had left unanswered. The authors build a general-equilibrium business-dynamics model that embeds two engines of productivity growth: innovation by young entrants (a step-size improvement over the leading-productivity frontier, in the spirit of Romer 1990 and Aghion-Howitt 1992) and steady productivity growth by mature leading businesses. Population (labor-force) growth determines the demographic composition of the business stock, because the number of firms must grow in proportion to the labor force along any balanced growth path (BGP). A slower labor force therefore shifts the firm distribution toward older incumbents.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central theoretical result is a &amp;ldquo;sufficient statistic&amp;rdquo; for whether slower population growth reduces TFP growth: the employment-size growth rate of surviving old businesses, which converges to the ratio gS/gX (the productivity growth of leading businesses divided by average economy-wide productivity growth). If gS/gX &amp;lt; 1 — i.e., old firms&amp;rsquo; productivity grows more slowly than the economy average — then a lower labor-force growth rate raises the share of old firms and drags down aggregate productivity growth. Both the sign and the magnitude of the effect are characterized in closed form.&lt;/p&gt;
&lt;p&gt;The model is calibrated to U.S. and Japanese establishment data (Business Dynamics Statistics; Economic Census and Establishment/Enterprise Census), targeting the life-cycle profiles of exit rates, average employment size by age, and the employment growth rate of surviving businesses, with the reference period 1980–1999. The U.S. labor-force growth rate used in calibration is 1.67 percent per year (average 1980–1999); Japan&amp;rsquo;s is 0.72 percent. A key calibrated quantity is gS: 1.060 for the U.S. and 1.030 for Japan, reflecting the faster decline in the size of surviving old establishments in Japan relative to the U.S. The benchmark model adds entry congestion (parameter ϕ = 0.55, taken from Karahan, Pugsley and Sahin 2024) and spillovers from young to old firms&amp;rsquo; productivity growth (γ = 0.342, estimated from BDS data using venture capital investment as an IV).&lt;/p&gt;
&lt;p&gt;Main quantitative findings across BGPs: In the U.S., the projected decline in labor-force growth from approximately 2.59 percent (1970–1980) to 0.26 percent (2050–2060) implies a long-run reduction in TFP growth of approximately 0.3 percentage points. In Japan, the decline from approximately 1.86 percent (1950–1960) to −0.97 percent (2050–2060) — a drop of more than 3 percentage points — implies a long-run reduction in TFP growth of approximately 0.6 percentage points. These effects are substantially attenuated when congestion and spillovers are removed: the U.S. effect falls from 0.30 to 0.19 percentage points and the Japan effect falls from 0.63 to 0.41 percentage points in the simplest model, so roughly 65 percent of the benchmark effect is attributable to the core mechanism alone.&lt;/p&gt;
&lt;p&gt;For the transition analysis, the model accounts for approximately 49.7 percent of the observed U.S. TFP growth slowdown between 1980–1999 and 2000–2019 (an observed decline of 0.184 percentage points, model-explained 0.091 percentage points). In Japan, the model explains approximately 24.2 percent of a larger observed slowdown of 0.451 percentage points (model: 0.109 pp). A critical feature of the dynamics is that TFP growth responds sluggishly to population growth changes. Two transitional counterbalancing forces explain this: (1) a &amp;ldquo;level-vs-growth&amp;rdquo; effect — on impact, a higher share of older (larger and more productive) firms temporarily raises productivity growth in levels even while it lowers the growth rate in the long run; and (2) a &amp;ldquo;labor-reallocation&amp;rdquo; effect — fewer entrants means less labor in the innovation sector and more in production, temporarily raising the production-sector labor share and boosting measured TFP growth. Both effects fade as the economy converges to the new BGP.&lt;/p&gt;
&lt;p&gt;Looking forward, the expected further decline in TFP growth from population aging is -0.05 to -0.06 percentage points for the U.S. between 2020 and 2100 (benchmark, without incorporating forecasts), and -0.14 to -0.17 percentage points for Japan over the same horizon. When BLS/CAO forecasts for labor-force growth through 2060 are incorporated, these magnitudes rise to -0.07 to -0.08 pp (U.S.) and -0.24 to -0.34 pp (Japan) between 2020 and 2100. Cross-sectional IV regressions using lagged state birth rates as instruments confirm that a 1-percentage-point change in labor-force growth maps to approximately a 0.1 to 0.2 percentage-point change in labor productivity growth across U.S. states, consistent with model predictions. Local projections using U.S. state data 1977–2019 show that the dynamic pattern in data (initial positive then negative response of productivity growth to a labor-force shock) mirrors the model&amp;rsquo;s transitional dynamics closely.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s core theoretical result, and what is the &amp;lsquo;sufficient statistic&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main result (Lemma 4) states that if the employment-size growth rate of surviving old businesses is negative — equivalently, if gS/gX &amp;lt; 1 — then an increase in the labor-force growth rate raises average productivity growth, and vice versa. The &amp;lsquo;sufficient statistic&amp;rsquo; is gS/gX, the ratio of old-firm productivity growth to economy-wide average productivity growth. This ratio asymptotically equals the employment growth rate of surviving old firms in a BGP (Lemma 3). Lemma 5 further shows that the magnitude of the effect is increasing in how fast old firms&amp;rsquo; size shrinks, i.e., larger when gS/gX is further below 1. This means the calibration of the life-cycle profile of surviving business growth is the decisive input for the quantitative results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two growth engines in the model and how do they interact?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The first engine is innovation by new entrants: innovators choose a step size g relative to the average leading-firm productivity frontier χ, paying convex research costs. The free-entry condition ties the step size to structural parameters (research cost slope and entry cost), making g* constant in equilibrium. The second engine is the exogenous (in the benchmark) or endogenous (in extensions) productivity growth of leading businesses at rate gS per period. Both engines operate simultaneously: gX is determined by a weighted average of these two sources, where the weight on the old-firm engine equals their share in the firm distribution. Population growth affects this weight by determining the number of new entrants relative to incumbents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What identification strategy is used in the empirical validation and what are the threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two empirical strategies are used. First, local projections (Jordà 2005) using U.S. state-level data 1977–2019 regress the change in labor productivity growth over horizons i = 0 to 8 years on the change in labor-force growth, controlling for seven lags of each variable and a quadratic time polynomial. This establishes that the dynamic pattern in the data mirrors the model-predicted non-monotonic response (initial positive effect, then negative and significant effects at 2–5 years). Second, cross-sectional IV regressions for U.S. states average 2004–2024 data and use the lagged state birth rate (pushed back 20 years) as an instrument for labor-force growth, with controls for initial GDP per capita and state population. The main threat is reverse causality: workers may relocate to states with higher expected productivity growth. The authors note the IV addresses this by using birth rates from 20 years prior. A further threat acknowledged is knowledge spillovers across states, which would bias the local-projection coefficient downward.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper say about the role of entry congestion and innovation spillovers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Entry congestion modifies the free-entry condition to make entry costs rise with the ratio of entrants to population (with elasticity ϕ = 0.55). This means that when population growth slows and fewer entrants arrive, entry costs fall, which discourages innovation intensity (lower g*), adding a second channel through which slower population growth lowers TFP growth. Innovation spillovers allow the productivity growth of leading businesses (gS) to respond positively to lagged aggregate productivity growth (with elasticity γ = 0.342, estimated via IV). When population growth slows and productivity growth falls, spillovers to incumbents also fall, amplifying the total effect. Together, these features explain roughly 35 percent of the benchmark effect beyond what the core mechanism delivers alone: the U.S. effect rises from 0.19 pp (no congestion, no spillovers) to 0.30 pp in the benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the robustness checks on the BGP results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Five alternative productivity processes are considered. Case 1 is a standard two-state AR(1), Case 2 allows transition probabilities to depend on age, Case 3 uses deterministic productivity growth by type (high and low) with age-dependent transitions, Case 4 is the benchmark (asymmetric absorbing high-productivity state with tenure-dependent productivity history), and Case 5 cuts the productivity jump θ in half. All five deliver similar qualitative results, with the long-run U.S. effect ranging from -0.15 to -0.22 percentage points compared to -0.19 in the benchmark. The AR(1) specification (Case 1) yields the smallest effect because it misses the growth of young and old businesses in the data. Endogenous exit is examined in a separate extension: the exit rate declines further when population growth falls (amplifying the old-firm share effect), but this is nearly exactly offset by higher innovation incentives from longer business horizons, resulting in very small net change. Endogenous innovation by leading businesses is also explored and found to amplify the result at low population growth rates (making the effect nonlinear and potentially larger in future decades), but its impact at observed historical ranges is modest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the transitional dynamics differ from the BGP comparison, and why?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The BGP comparison provides the long-run effect of a permanently different population growth rate on TFP growth. The transition shows that convergence to this new BGP is very slow — taking more than 20 years to reach the new steady-state share of young businesses after a step decline in population growth. This slowness is driven by two counterbalancing forces. The level-vs-growth effect: on impact, a lower entry rate raises the share of larger, more productive older firms, which temporarily boosts the level of productivity growth even as the long-run growth rate falls (because young firms have lower productivity levels despite faster productivity growth). The labor-reallocation effect: fewer entrants mean less labor in the innovation sector, reallocating workers to production, which temporarily raises the production-employment share and therefore measured TFP growth. As a result, the model accounts for 49.7 percent of the U.S. TFP growth slowdown between 1980–1999 and 2000–2019, not the full long-run 0.30 pp effect. The sensitivity analysis shows that lower sS, lower β, or higher gS all speed up convergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to Karahan, Pugsley and Sahin (2024) and Hopenhayn, Neira and Singhania (2022)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both prior papers show that slower labor-force growth reduces business dynamism by generating a startup deficit and shifting the firm age distribution toward older incumbents. They share the basic Hopenhayn (1992) firm-dynamics structure with this paper. The key distinction is that those papers focus on entry rates, exit rates, employment concentration, and labor market dynamics as outcomes, whereas Inokuma and Sanchez focus on TFP growth. As a validation exercise, this paper shows its model also reproduces the decline in U.S. business dynamism (entry rate, exit rate, share of young establishments) when fed the trend in labor-force growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to Peters and Walsh (2022)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Peters and Walsh (2022) also studies population growth and productivity. Their framework builds on Klette and Kortum (2004) and emphasizes scale effects, variety expansion, market concentration, and markups, abstracting from firm life-cycle dynamics. This paper instead builds on Hopenhayn (1992) and focuses on how innovation intensity varies with firm age. The two mechanisms are complementary: the life-cycle mechanism in this paper would add 56 percent to the productivity growth decline found in Peters and Walsh (Peters and Walsh find approximately 0.23 pp per 1 pp decline in population growth, almost all from varieties; Inokuma and Sanchez find 0.13 pp per 1 pp for the U.S., so the combined effect would be roughly 0.36 pp).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented in the paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The most important heterogeneity is between the U.S. and Japan. Japan&amp;rsquo;s establishments exhibit a much flatter size profile by age (the ratio of employment in establishments 29+ years to age-1 establishments is 1.5 in Japan versus 3.5 in the U.S.) and a sharper decline in the size of surviving old establishments, yielding a calibrated gS of 1.030 for Japan versus 1.060 for the U.S. This implies a larger sufficient statistic |1 - gS/gX| for Japan and therefore a larger elasticity of TFP growth to population growth: 0.6 pp effect for Japan versus 0.3 pp for the U.S. over their respective projected population growth declines. Within the model, the two types of firms (laggard and leading) have different survival rates (sS &amp;gt; sU), different productivity levels (leading firms are roughly 200 vs 10 employees on average), and different exit dynamics (laggards face much higher exit rates, especially when young).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not focus on policy prescriptions, but the implied lesson is that policies affecting the entry rate of new firms — or the productivity life-cycle of mature incumbents — are the primary levers for mitigating the TFP drag from aging populations. Because the effect operates through firm-age composition, any policy that encourages new business formation (lowering entry costs, relaxing congestion) would partially offset the demographic headwind. The scope conditions are important: the main result holds under a perfectly elastic supply of new businesses, constant entrant innovation intensity, and exogenous survival/productivity profiles. Congestion and spillovers amplify the mechanism. When exit is endogenous, competing forces nearly cancel, so the result is robust. The direction of the effect depends critically on gS &amp;lt; gX (i.e., old firms&amp;rsquo; productivity growing more slowly than average), which is empirically verified for both the U.S. and Japan. If the sufficient statistic were positive (gS &amp;gt; gX), slower population growth would raise TFP growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper say about scale effects and how they interact with the life-cycle mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a CES variety model (as in Peters and Walsh 2022), gTFP = g_tilde_X + (1/(sigma-1)) * gN, adding a direct scale effect where slower population growth reduces the number of varieties and TFP directly. Calibrating sigma = 4 (consistent with Jones 2022), this implies a 0.33 pp TFP decline per 1 pp population growth decline from the variety channel. The life-cycle mechanism in this paper adds 0.13 pp for the U.S. and 0.22 pp for Japan per 1 pp decline. Thus the two mechanisms together would imply a 0.46 to 0.55 pp decline per 1 pp of population growth slowdown — 30 to 60 percent larger than the variety channel alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the &amp;rsquo;level-vs-growth&amp;rsquo; effect and how does it arise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When population growth slows suddenly, the entry rate falls and fewer young firms enter. This means the firm pool immediately becomes more skewed toward older, larger, more productive incumbents. On impact, this raises the average level of productivity in the economy (because old firms have higher levels, even if slower growth rates). This temporarily boosts the growth rate of average productivity in the short run, even though in the long run the effect is to lower TFP growth (because old firms&amp;rsquo; productivity growth rate gS is below gX). This transient positive effect on TFP growth counterbalances and delays the long-run decline, contributing to the sluggish response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does the discount factor and household preferences play in the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The household problem involves standard intertemporal optimization with risk aversion ε = 2 and discount factor β = 0.96. These parameters enter the speed of convergence in the transition: lower β increases the speed of convergence (sensitivity analysis shows β has an elasticity of -4.212 for convergence speed). Along the BGP, household preferences determine the interest rate through the Euler equation and affect the capital share α-tilde, which varies across BGPs. The paper notes that d(alpha-tilde)/d(gM) is likely negative, meaning that lower population growth also reduces the capital share, amplifying the effect on TFP growth, though extreme parameter values could reverse this.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the data sources and what moments are targeted in calibration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For the U.S.: establishment-level data from the Business Dynamics Statistics (BDS), spanning 1978 onwards; labor force data from BLS Current Population Survey (1949–2019) and Lebergott (1966) for 1900–1948; TFP from Penn World Table 10.0; venture capital investment from PwC/CB Insights MoneyTree. For Japan: establishment data from the Establishment and Enterprise Census (1981–2006) and Economic Census (2009–2021); labor force from Statistics Bureau of Japan; TFP from PWT 10.0. Calibration targets 32 moments for the U.S. (31 life-cycle bars plus average productivity growth) and 20 for Japan. The targeted moments are the exit rate by establishment age (with equal weighting), the average employment size profile by age, and the growth rate of surviving establishments by age. Ten parameters are jointly estimated to minimize the distance between model-implied and data moments.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Sufficient statistic (gS/gX)&lt;/strong&gt;: The employment-size growth rate of surviving old businesses, which asymptotically equals the ratio of old-firm productivity growth (gS) to economy-wide average productivity growth (gX). This single ratio determines both the sign (if less than 1, slower population growth reduces TFP growth) and the magnitude (the faster gS/gX falls below 1, the larger the effect) of population growth&amp;rsquo;s impact on productivity growth along balanced growth paths.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Leading versus laggard businesses&lt;/strong&gt;: The paper&amp;rsquo;s two-type firm classification. Laggard businesses start with productivity θ·χ·g (below the frontier), grow at a flat rate, and face high exit rates; they can transition to the leading group with age-dependent probability λ_a. Leading businesses begin at or above the frontier (productivity χ·g at entry), grow at constant rate gS per period, and face lower exit rates. The share of leading versus laggard firms — and the speed at which laggards transition — determines the life-cycle productivity profile that is central to the sufficient statistic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Level-vs-growth effect&lt;/strong&gt;: A transitional counterbalancing force: when population growth slows, fewer young (small, low-productivity-level) firms enter, immediately raising the average level of productivity in the firm pool and temporarily boosting measured productivity growth, even though the long-run effect is negative. The short-run level gain outweighs the long-run growth-rate loss, delaying the TFP growth decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor-reallocation effect&lt;/strong&gt;: A second transitional counterbalancing force: lower entry rates reduce the number of workers employed in innovation (research and development) activities, reallocating them to goods production. This increase in the production-sector labor share temporarily raises measured TFP growth. Like the level-vs-growth effect, it fades as the economy converges to the new balanced growth path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Entry congestion&lt;/strong&gt;: An extension to the free-entry condition in which the per-entrant cost rises with the ratio of the entry rate to population growth (with elasticity ϕ = 0.55). When population growth slows, congestion costs fall, reducing the incentive to invest in high-step-size innovation, thus providing a second channel through which slower population growth reduces TFP growth beyond the core composition channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Innovation spillovers&lt;/strong&gt;: A mechanism by which the productivity growth of already-leading businesses (gS) responds positively to lagged aggregate productivity growth gX (with estimated elasticity γ = 0.342). This link means that when population growth slows and gX falls, mature firms also grow more slowly, amplifying the initial effect. Calibrated using OLS and IV (venture capital investment as instrument) regressions of old-establishment productivity growth on aggregate past productivity growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Balanced growth path (BGP) comparison&lt;/strong&gt;: The primary analytical exercise: comparing steady-state TFP growth rates across economies that differ only in their constant labor-force growth rate. This isolates the long-run equilibrium effect, abstracting from the transitional dynamics that counteract the decline in the short run. The BGP effect is larger than what is observed during any historical transition window because of the slow convergence.&lt;/p&gt;</description></item><item><title>General Equilibrium Effects in Space: Theory and Measurement</title><link>https://macropaperwarehouse.com/papers/general-equilibrium-effects-in-space-theory-and-measurement/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/general-equilibrium-effects-in-space-theory-and-measurement/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;How do international trade shocks propagate through spatially connected regional labor markets, and how large are the general equilibrium effects that standard shift-share specifications miss? Adão, Arkolakis, and Esposito address this question by extending shift-share empirical designs to incorporate general equilibrium (GE) effects arising from spatial links between markets. Their motivation is that the difference-in-difference logic of standard shift-share regressions recovers only the differential response of treated versus control regions, not the level response that includes indirect (spillover) effects propagating through trade, labor supply, and agglomeration links. Ignoring these indirect effects biases estimates of trade shocks&amp;rsquo; aggregate labor market consequences.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a multi-sector general equilibrium spatial model with N markets linked through three channels: (i) gravity-type trade demand, (ii) endogenous labor supply that depends on wages and price indices in all markets, and (iii) local labor productivity that depends on employment (agglomeration). The key theoretical result is that wage and employment responses to trade shocks decompose into two shift-share exposure vectors — a revenue exposure (proportional to the ADH import penetration measure, weighted by sectoral employment shares) and a consumption cost exposure (weighted by sectoral spending shares) — multiplied by bilateral reduced-form elasticity matrices (βij and φij). These elasticities are sufficient statistics for GE aggregation and can be expressed as a series expansion of the &amp;ldquo;spatial links&amp;rdquo; matrix, which is itself a function of trade demand substitution, labor supply substitution, and agglomeration elasticities. When demand substitution dominates (gross substitution property holds), indirect effects reinforce direct effects: a negative revenue shock in one CZ reduces demand for goods from other CZs, propagating wage and employment losses outward.&lt;/p&gt;
&lt;p&gt;The authors apply the framework to the China shock, using 722 U.S. Commuting Zones (CZs) over 1990–2007, following Autor, Dorn, and Hanson (2013) (ADH). The revenue exposure measure is identical to the ADH instrumental variable (employment-share-weighted Chinese export growth to non-U.S. developed countries); the consumption exposure is analogously constructed using sectoral spending shares from input-output tables. Structural parameters are estimated using a Model-implied Optimal IV (MOIV) two-step GMM estimator derived from Chamberlain (1987).&lt;/p&gt;
&lt;p&gt;Main quantitative findings: (1) In a simple extension of ADH, the indirect revenue spillover effect on neighboring CZs is roughly three times larger in magnitude than the direct effect of a CZ&amp;rsquo;s own import competition exposure — an increase of $1,000 in Chinese imports per U.S. worker in nearby CZs is associated with 1.3 log-point lower employment growth and 1.0 log-point lower wage growth in a given CZ. (2) Consumption cost shifts (cheaper imports) have no statistically significant direct or indirect effect on employment or wages, consistent with a weak price elasticity of labor supply relative to the wage elasticity. (3) Structural parameter estimates yield: labor productivity–employment elasticity ψ = 0.56 (agglomeration), labor supply–wage elasticity φw = 2.11, labor supply–price elasticity φp = −1.36, trade elasticity ε = 3.94. (4) In GE aggregation, the China shock reduced average U.S. CZ wages by approximately 4.0 log-points and employment by approximately 2.8 log-points between 1990 and 2007, with the indirect revenue channel (−4.24 log-points for wages, −4.95 log-points for employment) dominating the direct revenue effect (−0.81 and −1.94 respectively) and being partially offset by positive consumption cost effects (+0.98 wages, +3.18 employment). Average real wages rose by 0.16 log-points on net, but 39% of CZs experienced real wage declines. Standard deviations of responses were 1.30 for wages, 3.31 for employment, and 1.75 for real wages, indicating large cross-CZ heterogeneity. (5) Model fit: the baseline estimated model yields fit coefficients close to 1 (0.67 for wages, 0.90 for employment), whereas quantitative models calibrated with Ricardian/standard parameters yield fit coefficients of 3.56 to 10.42, indicating their predicted responses are too small by factors of 4–10. Simple aggregation of the ADH specification implies employment losses of only 1.5 log-points — less than half the authors&amp;rsquo; baseline estimate.&lt;/p&gt;
&lt;p&gt;The key mechanism driving the amplification is strong agglomeration (ψ ≈ 0.56), which roughly doubles typical calibrations from Krugman-type models and is absent in Ricardian frameworks. Demand-side trade links propagate revenue shocks across CZs with similar sectoral composition and trade partners. The policy implication is that analyses of trade shocks using standard shift-share regressions — which absorb common indirect effects in time fixed effects — systematically understate aggregate employment and wage losses.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on the same orthogonality condition used by ADH and Kovak (2013): observed shock exposure (revenue and consumption shift-share measures) is mean-independent of unobserved residuals. This is implied by independence between the observed Chinese export shock and unobserved trade cost shocks, given the initial trade matrix. The authors use the ADH instrument (Chinese export growth to non-U.S. developed countries) to construct exogenous sectoral shifts, exploiting cross-CZ variation in initial industry composition. The main threats are: (i) unobserved shocks correlated with pre-existing industry composition (e.g., concurrent automation), addressed by controlling for lagged population growth (following Greenland et al. 2019) and the full ADH control set; (ii) spatial correlation of residuals, addressed by clustering standard errors at the state level and by robustness using the inference procedure in Adão et al. (2019); (iii) simultaneity, since the MOIV estimator instruments the non-linear functions of shock exposure with model-implied moment functions that are functions of the observed shifts only.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two shift-share exposure measures and how do they differ?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The revenue exposure (IPW) is the standard ADH shift-share variable: the product of Chinese export growth to other developed countries and the CZ&amp;rsquo;s initial employment share in each sector, summed across sectors. It captures the shock to the demand for a CZ&amp;rsquo;s goods. The consumption cost exposure (IPC) is an analogous variable where the share is the CZ&amp;rsquo;s sectoral spending share (including intermediate inputs, constructed using national input-output tables interacted with regional employment shares) rather than employment share. It captures the shock to the CZ&amp;rsquo;s cost of living and input costs. The two measures have a spatial correlation of 0.34. Standard deviations across CZs are 2.52 for IPW and 1.22 for IPC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three spatial channels determine GE reduced-form elasticities: (1) trade demand links — markets with similar sectoral composition and trade partners are closer substitutes, so a revenue shock in one CZ propagates negatively to CZs competing for the same export destinations; (2) labor supply links — employment responses in one CZ to wage/price changes in another, captured through migration (parametrized by bilateral birth-state shares) and the local wage and price elasticities of labor supply; (3) agglomeration — local labor productivity responds positively to local employment, amplifying both direct and indirect effects. Empirically, the authors distinguish these by estimating separate parameters (ψ for agglomeration, φw for wage elasticity of labor supply, φp for price elasticity, φm for migration links, ε for trade elasticity), with identification coming from cross-CZ heterogeneity in bilateral trade shares, sector specialization, and migration shares. The weak IPC effect (statistically insignificant) points to a small φp, while the large employment and wage responses to IPW point to large φw and ψ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the estimated structural parameters and how do they compare to existing literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Panel A estimates (without migration): ψ = 0.56 (s.e. 0.07), φw = 2.11 (s.e. 0.25), φp = −1.36 (s.e. 0.24), ε = 3.94 (s.e. 0.41). Panel B (with migration): nearly identical point estimates but standard errors two to five times larger due to high collinearity of bilateral migration and trade shares; φm = −0.06 (s.e. 0.05), not statistically significant. The agglomeration elasticity ψ = 0.56 is roughly twice the Krugman (1980) implied value (~0.2) used by Monte et al. (2018) and far above zero (used in Ricardian frameworks by Galle et al. 2017, Caliendo et al. 2018, 2019). It is closer to Kline and Moretti (2014)&amp;rsquo;s estimate of ~0.4 from regional demand shocks. The labor supply elasticity φw = 2.11 is three times the median micro-estimate in Chetty et al. (2013) and is consistent with aggregate employment responses. The trade elasticity ε ≈ 4 is within standard literature ranges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in spatial effects is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There is substantial heterogeneity in both direct and indirect reduced-form elasticities across CZs. For revenue shifts, the 10th/50th/90th percentiles of direct wage elasticities are 0.44/0.67/1.67, and for employment 0.92/1.46/3.97. For indirect effects, median values are 0.002 (wages) and 0.003 (employment), but the 90th percentile is 0.021 and 0.039 respectively. The simple gravity proxy zij (inverse distance weighted by population) explains only a small fraction of variation in indirect effects; instead, the elements of the full spatial links matrix (bilateral revenue shares yij and trade demand substitutability χij) explain roughly 50% of variation in indirect effects across CZ pairs. Both manufacturing and non-manufacturing employment show significant indirect effects; wage responses are mainly driven by the non-manufacturing sector (consistent with ADH). 39% of CZs experienced real wage declines despite a small average real wage gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For the simple ADH extension (Table 1): (i) varying the distance decay parameter δ ∈ (1,8); (ii) using CZ size vs. no size weighting in zij; (iii) restricting to same-state CZs for indirect effects; (iv) weighting CZs by 1990 population; (v) using the Adão et al. (2019) inference procedure; (vi) alternative spending share constructions. For the structural estimation: (i) allowing for trade imbalances (following Dekle et al. 2007); (ii) calibrating migration links from external estimates; (iii) alternative numeraire for labor supply homogeneity (national vs. world price index). In all cases, indirect effects remain negative and significant, and reduced-form elasticities are highly correlated with baseline estimates. Counterfactual employment losses range from −0.5 to −5.4 log-points depending on the labor supply normalization and migration specification, with average wage decline remaining close to 4 log-points across specifications. The NTR gap (Pierce and Schott 2016) as the sector-level shifter also yields qualitatively similar results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper evaluate the fit of quantitative spatial models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors propose regressing actual changes in CZ employment/wages on model-predicted responses (equation 39) and checking whether the slope coefficient ρ is close to 1. A coefficient much greater than 1 means the model&amp;rsquo;s predicted responses are too small relative to actual cross-CZ variation. The baseline structural estimates yield fit coefficients of 0.67 (wages) and 0.90 (employment) — close to 1. Alternative calibrations from quantitative frameworks yield coefficients of 3.56–10.42 for wages and 6.60–10.42 for employment, indicating those models underpredict differential responses by factors of 4–10. The main driver is weak agglomeration forces: setting ψ = 0 (Ricardian) vs. ψ = 0.56 (baseline) dramatically degrades fit. Setting φw = −φp (labor supply responding to real wages only, as in Caliendo et al. 2019) makes employment fit estimates very imprecise because the consumption price channel becomes too strong relative to its empirical counterpart.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the quantitative GE impact of the China shock on average U.S. CZ wages and employment, and how does it decompose?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Over 1990–2007: average wage fell by 3.98 log-points (s.d. 1.30), average employment fell by 2.78 log-points (s.d. 3.31), average real wage rose by 0.16 log-points (s.d. 1.75). Decomposition of wage change: direct revenue effect −0.81 (s.d. 1.79), direct consumption cost effect +0.98 (s.d. 1.36), indirect revenue effect −4.24 (s.d. 1.71), indirect consumption cost effect +0.09 (s.d. 1.18). The indirect revenue channel dominates; consumption gains are not large enough to offset revenue losses. For real wages, the main components are: terms-of-trade loss from wage decline (−0.98, s.d. 2.53), productivity/efficiency gains (+3.14, approximately), and consumption cost gains. Most impact occurred in the 2000–2007 sub-period after China&amp;rsquo;s WTO accession.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do these GE estimates compare to estimates from the existing literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Simple aggregation of the ADH specification (ignoring GE indirect effects) implies average wage losses of 1.17 log-points and employment losses of 1.50 log-points — less than half the authors&amp;rsquo; GE estimates. Including intuitive distance-weighted indirect effects (ADH extension in Table 1 column 3) brings employment estimates closer (−4.51 log-points) but with correlation below 0.5 with baseline cross-CZ heterogeneity predictions. Quantitative spatial models calibrated with standard parameters (Ricardian, weak agglomeration) generate average responses near zero and are often uncorrelated with actual CZ outcomes. The key reason quantitative models underperform is that they specify agglomeration forces as too weak (ψ ≈ 0 versus the estimated 0.56) and labor supply sensitivity to import prices as too strong relative to wage sensitivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the consumption cost (IPC) channel and why does it matter less than the revenue channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The IPC captures the welfare gain from cheaper Chinese imports: as Chinese productivity rises, import prices fall, increasing real purchasing power and potentially stimulating labor supply. However, the estimated labor supply price elasticity (φp = −1.36) is substantially smaller in absolute value than the wage elasticity (φw = 2.11), so the positive employment and wage response to lower import prices is weaker than the negative response to falling demand for local output. Empirically, both the direct and indirect effects of IPC are statistically insignificant in the simple ADH extension (Table 1, columns 2 and 4), consistent with weak φp. The structural estimation exploits all channels to pin down φp precisely. Input-output linkages (CZs using inputs from sectors with stronger Chinese export growth) are incorporated in IPC and are also found to have no significant employment effect, consistent with Pierce and Schott (2016) and Acemoglu et al. (2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper connect to the shift-share and market access literatures?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper generalizes standard shift-share designs (Bartik 1991, Blanchard and Katz 1992, ADH 2013, Kovak 2013) in two ways: it adds a consumption cost shift-share (spending shares instead of employment shares) and it adds indirect exposure from other CZs&amp;rsquo; shift-share measures, weighted by model-implied bilateral reduced-form elasticities. Unlike standard designs, time fixed effects in the authors&amp;rsquo; estimating equation absorb only the mean unobserved shock, not any GE indirect effects (since the latter are heterogeneous across CZ pairs). The paper connects to the market access approach (Redding and Venables 2004; Donaldson and Hornbeck 2016) by showing that the authors&amp;rsquo; revenue and consumption exposure measures are partial-equilibrium versions of producer and consumer market access, holding wages and employment constant. The key advantage is that the authors&amp;rsquo; measures can be constructed from initial-equilibrium data without solving the full GE model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper implies that trade shock analyses ignoring GE spillovers substantially understate aggregate employment and wage losses for U.S. workers. The gross substitution condition (trade demand links dominating labor supply links) is required for indirect effects to reinforce rather than attenuate direct effects; this is consistent with the empirical evidence but could fail in settings with very mobile labor markets. The real wage calculation shows that, on average, cheaper imports provide a small net welfare gain (+0.16 log-points), but 39% of CZs experienced net real wage losses, pointing to substantial distributional consequences within the U.S. The framework&amp;rsquo;s scope is first-order (linearization around initial equilibrium), so it is a good approximation for moderate shocks; large shocks require integrating over the adjustment path. The methodology is applicable beyond the China shock to any trade policy with measurable regional exposure variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the MOIV estimator and why is it efficient?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Model-implied Optimal IV (MOIV) is a two-step feasible implementation of the Chamberlain (1987) efficient GMM estimator. The class of consistent GMM estimators for the spatial link parameters θ = (φw, φp, φm, ψ, ε) differs only in how they weight the observed exposure of different markets. The optimal weighting function H*i assigns more weight to markets whose reduced-form elasticities (βij and φij) are most sensitive to changes in the parameter being estimated — i.e., markets that provide the most information about a given parameter. In step 1, an arbitrary initial θ0 is used to obtain a consistent but non-optimal first-stage estimate. In step 2, the consistent estimate is used to compute the optimal instrument, and a second-stage GMM is run. The MOIV is asymptotically equivalent to the Chamberlain efficient estimator. The paper&amp;rsquo;s contribution is to derive the optimal moment conditions for a flexible spatial GE model with non-linear parameter-dependent elasticities.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Spatial Links Matrix&lt;/strong&gt;: The Jacobian of the excess labor demand system with respect to wages, denoted γ-bar, summarizing the combined effect of trade demand substitution (how wage changes in one market shift demand from other markets) and supply substitution (how wage changes affect labor supply across markets, amplified by agglomeration). It governs the propagation of partial equilibrium excess demand shifts to general equilibrium wage and employment responses, and determines the sign and heterogeneity of indirect effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bilateral Reduced-Form Elasticity&lt;/strong&gt;: The element βij (for wages) or φij (for employment) measuring how much market i&amp;rsquo;s outcome responds to a unit shift in market j&amp;rsquo;s excess labor demand, after all GE adjustment rounds. It is a series expansion of the spatial links matrix and is larger for market pairs with stronger bilateral or third-market spatial connections. These elasticities are sufficient statistics for aggregating regional shock exposures to compute GE impact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revenue Exposure (IPW)&lt;/strong&gt;: The shift-share variable capturing a CZ&amp;rsquo;s partial equilibrium revenue shift from a foreign productivity shock: the employment-share-weighted average of sectoral export growth shocks. Identical to the ADH instrument. Measures how much a CZ&amp;rsquo;s producer revenues (and thus labor demand) fall when Chinese costs decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption Cost Exposure (IPC)&lt;/strong&gt;: A novel shift-share variable capturing the partial equilibrium consumption cost shift: the spending-share-weighted average of sectoral export growth shocks, constructed using national input-output tables interacted with regional employment. Measures how much cheaper Chinese imports reduce the cost of living and inputs in a CZ, with a positive effect on real wages and labor supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model-Implied Optimal IV (MOIV)&lt;/strong&gt;: A two-step feasible GMM estimator that achieves the Chamberlain (1987) efficiency bound for estimating the vector of structural spatial link parameters θ. In the first step any consistent estimator is used; in the second step the first-step estimates are used to compute the optimal moment function — which places more weight on CZs whose reduced-form elasticities are most sensitive to changes in the parameter being estimated — and a second-stage GMM yields the efficient estimate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross Substitution Property&lt;/strong&gt;: A condition on the spatial links matrix (γij &amp;lt; 0 for all off-diagonal pairs) under which all bilateral reduced-form elasticities βij are positive, so indirect effects of excess demand shifts always reinforce direct effects. The condition is satisfied when trade demand substitution dominates labor supply substitution in the spatial links matrix. Empirically supported for U.S. CZs: negative revenue shocks spread negatively to other CZs rather than triggering offsetting employment inflows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Agglomeration Elasticity (ψ)&lt;/strong&gt;: The elasticity of local labor productivity to local employment in the production function, governing the feedback of employment changes on production costs and thus on excess labor demand. The authors estimate ψ = 0.56 for U.S. CZs — roughly twice the Krugman (1980) value and far above the zero assumed in Ricardian frameworks — and show it is the key parameter that amplifies both direct and indirect responses to trade shocks and determines model fit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Fixed Effect&lt;/strong&gt;: A common component of GE indirect effects that arises when spatial links are identical across markets (Corollary 2). In this special case all indirect effects collapse to a common term absorbed by time fixed effects in standard regressions, making those regressions unable to separately identify the indirect effect from aggregate time trends. In the general case with heterogeneous spatial links, indirect effects differ across CZ pairs and are not absorbed by time fixed effects.&lt;/p&gt;</description></item><item><title>How Costly Are Cartels?</title><link>https://macropaperwarehouse.com/papers/how-costly-are-cartels/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-costly-are-cartels/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Moreau and Panon ask how much cartels cost the aggregate economy — in terms of both total factor productivity and welfare — and find the losses are considerably larger than the received wisdom from Harberger (1954) would suggest. The paper&amp;rsquo;s motivation is the mounting evidence that markups are large and growing, combined with a near-total absence of macroeconomic quantification of collusion as one micro-origin of those markups.&lt;/p&gt;
&lt;p&gt;The empirical foundation is an original firm-level database for France covering the period 1994–2007, assembled by scraping all written decisions of the French Competition Authority (ADLC). The final dataset contains 174 cartels and more than 1,000 firms before matching. These cartel records are merged to administrative balance-sheet and income-statement data covering the universe of French firms (BRN and RSI regimes). Key facts documented: average cartel duration is 4.5 years (median 3 years); average cartel size is 6.3 members (median 4); cartels are prevalent across construction, manufacturing, wholesale, retail, and transportation. Crucially, cartel members are empirically shown to be dramatically larger than non-members even within narrowly defined 4-digit industries — roughly 1,900% more sales, a market share premium of 4 percentage points, 1,150% more employment, and 37% higher labor productivity. Firms within a cartel are also substantially more homogeneous in productivity than the overall within-industry distribution: the interquartile productivity ratio across cartel members is only 1.4-to-1, versus 2-to-1 across all non-cartel firms in the same industry.&lt;/p&gt;
&lt;p&gt;The theoretical framework extends the static heterogeneous-firm oligopoly model of Atkeson and Burstein (2008) by introducing collusion microfounded via the cross-ownership framework of O&amp;rsquo;Brien and Salop (1999). A single collusion-intensity parameter κ ∈ [0,1] governs how much each cartel member internalizes the profits of other members. When κ = 0 the model reduces to competitive Cournot oligopoly; when κ = 1 all cartel members jointly maximize profits. In equilibrium, markups rise with firm market share, generating endogenous markup dispersion. Adding collusion causes cartel members to face a lower effective demand elasticity — their own market share augmented by the weighted market shares of co-conspirators — and to charge supracompetitive markups (overcharges). Critically, the effect of cartels on aggregate productivity is theoretically ambiguous: the output contraction of colluding firms redirects demand toward non-colluding firms. If the cartel is composed of the largest (most productive) firms, demand shifts toward less productive non-members, reducing productivity. If the cartel is composed of the least efficient firms, demand shifts toward large non-members, potentially improving allocation.&lt;/p&gt;
&lt;p&gt;The model is calibrated to match six moments from French data in 2007 — aggregate markup, cartel overcharge, the slope of the inverse-markup-on-HHI regression, the median number of firms per sector, the median number of cartel members, and the distribution of relative sales. The key calibrated parameters are: within-sector elasticity of substitution ρ = 10.19; across-sector elasticity η = 1.86; collusion intensity κ = 0.79. The cartel overcharge target is set to 10%, consistent with the OECD benchmark used by antitrust authorities and with Laborde (2021).&lt;/p&gt;
&lt;p&gt;Main quantitative findings (baseline calibration, cartels composed of top producers):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Eliminating all cartels raises aggregate TFP by 1.1%.&lt;/li&gt;
&lt;li&gt;The productivity cost of markups with respect to the efficient allocation is 70% higher in the model with collusion (3.67%) than in the calibrated competitive oligopoly (2.16%), because collusion generates additional markup dispersion on top of the dispersion inherent in firm heterogeneity.&lt;/li&gt;
&lt;li&gt;Eliminating cartels brings the economy 30% closer to the efficient allocation.&lt;/li&gt;
&lt;li&gt;The aggregate markup falls by approximately 1.5 percentage points when cartels are eliminated.&lt;/li&gt;
&lt;li&gt;Consumption-equivalent welfare gains from eliminating cartels equal 2%.&lt;/li&gt;
&lt;li&gt;Larger cartels (market share above median) account for roughly 80% of the productivity gains; dismantling only large cartels yields a 0.88% TFP gain and 1.97% consumption-equivalent welfare gain; smaller cartels yield 0.23% TFP and 0.54% welfare.&lt;/li&gt;
&lt;li&gt;Umbrella pricing — non-cartel members raise their markups because the cartel&amp;rsquo;s higher prices provide cover — dampens aggregate gains quantitatively but only slightly: fixing non-members&amp;rsquo; markups yields 1.14% productivity gain versus 1.11% in the benchmark.&lt;/li&gt;
&lt;li&gt;Reducing collusion intensity from κ = 0.79 to κ ≈ 0.4 (roughly a 50% reduction) still generates TFP gains of 0.54% and welfare gains of 0.85%, demonstrating that tougher antitrust enforcement at the intensive margin (forcing cartels to soften, not dissolve) yields substantial gains.&lt;/li&gt;
&lt;li&gt;These estimates are one order of magnitude above Harberger&amp;rsquo;s (1954) 0.1% dead-weight loss estimate; the paper shows this discrepancy arises because Harberger uses sectoral data and near-unit demand elasticities, both of which suppress markup dispersion within sectors.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The paper&amp;rsquo;s scope conditions are explicit: results reflect the static cost of cartels; dynamic effects (entry deterrence, innovation incentives) are acknowledged but not quantified; only domestic, detected cartels are covered, so estimates likely understate the true cost; the channel through geographic markup dispersion is excluded.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s primary identification strategy, and what are its main limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not rely on a natural experiment or difference-in-differences design. Instead, it uses a structural calibration approach: a heterogeneous-firm oligopoly model with collusion is calibrated to match French data moments, and the cost of cartels is computed as the difference between the calibrated cartel equilibrium and a counterfactual competitive Nash-Cournot equilibrium. The main threats to this strategy are: (1) the sample of cartels consists only of detected cartels, which may not be representative of the latent population — discovered cartels could be either more or less severe than undiscovered ones; (2) no firm-level price data are available, so markups cannot be estimated directly; (3) the counterfactual is a calibrated competitive model rather than an empirically observed post-cartel state; (4) the model abstracts from entry and exit, which may dampen or amplify the true gains from cartel dissolution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms through which cartels affect aggregate productivity, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two channels operate simultaneously. First, the direct price effect: cartel members raise markups above the competitive level (overcharges), reducing their output. In the presence of markup dispersion, this disproportionately contracts output from high-markup (high-productivity) firms, increasing misallocation. Second, the demand reallocation effect: as cartel members contract output and raise prices, non-cartel members gain market share and increase their markups via the umbrella pricing mechanism. The net effect on productivity depends on which firms gain market share. When cartels consist of top producers, reallocation goes toward less productive non-members, reducing aggregate TFP. When cartels consist of the least efficient firms, reallocation goes toward larger non-members, potentially improving allocation. The two channels are not empirically separated in the data; rather, the model disentangles them analytically and then disciplines the net effect via calibration to observed cartel overcharges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why do the authors assume cartels are composed of the most productive firms, and what is the evidence for this?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The assumption is motivated by three pieces of evidence. First, empirical regressions on the matched administrative data show that cartel members within their 4-digit industries have roughly 1,900% more sales, 1,150% more employment, and 37% higher labor productivity than non-members. Second, firms within a cartel are much more homogeneous than the overall within-industry distribution: the interquartile productivity ratio within a cartel is 1.4-to-1, versus approximately 2-to-1 for all non-cartel firms in the same industry, and the 90-10 ratio is 1.7-to-1 within a cartel versus over 4-to-1 across the industry. Third, only the top-producer composition assumption, combined with a collusion intensity κ = 0.79, can generate a cartel overcharge of 10% consistent with the calibration target. All other composition configurations (least efficient, all-inclusive, random top-10%) yield either implausibly small overcharges or implausibly large ones.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the umbrella pricing effect and how large is it quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Umbrella pricing refers to the mechanism by which cartel members&amp;rsquo; higher prices raise the sectoral price index, allowing non-cartel members to expand output and raise their own markups without reducing their market share. Proposition 1 of the model shows that collusion increases the markups of all firms — cartel and non-cartel — with non-cartel members experiencing markup increases that are larger for larger non-members. Quantitatively, when non-cartel members are held to fixed markups (so the umbrella effect is turned off), the aggregate TFP gain from eliminating cartels rises from 1.11% to 1.14% — a difference of 0.03 percentage points, or less than 3% of the total effect. The welfare effect is similarly small: 2.01% versus 2.00%. The umbrella pricing channel thus dampens aggregate gains but is quantitatively minor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in cartel effects is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three dimensions of heterogeneity are explored. First, cartel size matters: large cartels (those with cumulated market share above the median) account for roughly 80% of the aggregate TFP gain from eliminating all cartels (0.88 percentage points out of 1.11%), while small cartels account for only 0.23 percentage points. Second, cartel composition is critical: top-producer cartels amplify misallocation, all-inclusive cartels generate very large overcharges and dramatically higher misallocation, least-efficient-firm cartels barely affect allocation, and random-top-10% cartels can slightly improve allocation. Third, collusion intensity matters monotonically: across the range κ = 0.1 to κ = 0.4, TFP gains from elimination fall from 0.99% to 0.54%, and welfare gains fall from 1.70% to 0.85%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run, and how do the results change?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper runs six main robustness experiments, all recalibrating the model: (1) Alternative overcharge target of 15% (versus 10% baseline): requires κ = 1.28, yields TFP gains of 1.63% and welfare gains of 2.77%. (2) Low aggregate markup target M = 1.1: TFP gain of 1.37%, welfare gain of 2.07%. (3) High aggregate markup target M = 1.3: TFP gain of 0.90%, welfare gain of 1.96%. (4) Bertrand rather than Cournot competition: TFP gain of 0.55%, welfare gain of 1.35% — smaller because Bertrand generates less markup dispersion, though the reduction in distance to the efficient allocation is larger (39%). (5) Heterogeneous κ across cartels drawn from a truncated normal with four variance levels: TFP gains range from 0.84% to 1.11% and welfare gains from 1.53% to 1.99%, close to the benchmark of 1.11% and 2.00%. (6) The cartel screen regression yields an estimated κ of 0.70 from data on colluding firms, close to the calibrated benchmark of 0.79.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model generate a cartel detection screen, and what does it find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model&amp;rsquo;s equilibrium first-order conditions imply a regression of a cartel member&amp;rsquo;s labor share (a proxy for the inverse markup under log-linear production) on its own market share and the total cartel market share. The ratio of the estimated coefficient on cartel market share to the sum of both coefficients recovers the collusion intensity κ. Running this regression on the sample of detected cartel firms, the authors find a coefficient on own market share of -0.53 and an intercept of 0.70, both significant at 1%. Adding the cartel joint market share, its coefficient is negative and significant at 1%; the estimated κ from this specification is 0.70, close to the benchmark of 0.79. Results are qualitatively robust to including year fixed effects, though estimates become slightly noisier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the authors explain the large discrepancy with Harberger (1954)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Harberger&amp;rsquo;s classic estimate of the deadweight loss from monopoly is approximately 0.1% of GDP. The authors show that their model can reproduce estimates close to this when (a) the model is aggregated to the sectoral level, eliminating within-sector markup dispersion — in that case, the TFP gain from eliminating cartels falls to 0.08%; or (b) demand elasticities are set close to unity as in Harberger&amp;rsquo;s sectoral data — the TFP gain falls to 0.24%. The key reason for the discrepancy is that Harberger&amp;rsquo;s framework suppresses both the within-sector dispersion of markups (which in the baseline model amplifies allocative losses) and the endogenous markup response to market share changes (which is large when ρ is substantially greater than 1). Using disaggregated firm-level data and calibrated high-within-sector elasticities restores the large estimated costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper implies that antitrust enforcement against horizontal price-fixing cartels can yield aggregate TFP gains of 1.1% and welfare gains of 2% in consumption-equivalent terms — figures the authors describe as conservative, because (i) the estimate is static (no dynamic gains from entry or innovation effects are included), (ii) only domestic detected cartels are captured and international cartels are excluded, (iii) geographic markup dispersion is abstracted from, and (iv) the calibration uses a conservative overcharge target of 10%. Importantly, the gains from targeting the intensive margin (forcing cartels to reduce overcharges rather than dissolving them entirely) are also substantial: a 50% reduction in κ still yields 0.54% TFP and 0.85% welfare gains. The results further imply that industrial policy and trade liberalization reforms that ignore competition enforcement may be partially undermined if new market power enables cartelization. The scope condition most critical to the quantitative magnitude is cartel composition: results depend on cartels being composed of top producers; the sign and magnitude of productivity effects can flip for alternative compositions. The authors also note that if cartels spur long-run innovation (through higher profits), their static welfare cost estimates would overstate the net social cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from Edmond, Midrigan, and Xu (2022) and Baqaee and Farhi (2020)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Edmond et al. (2022) and Baqaee and Farhi (2020) quantify the total welfare and productivity cost of markups relative to the efficient allocation — the gap between the current economy (with all its markup dispersion from firm heterogeneity) and the first-best. Moreau and Panon instead isolate the cost of one specific, policy-relevant source of excess markup dispersion — collusion — by computing the gap between the cartel equilibrium and the competitive (but still imperfect) Nash-Cournot equilibrium. They also show that competitive oligopoly models of the Edmond et al. type understate the total misallocation cost of markups by approximately 70% when cartels are present and composed of top producers, because competitive models are calibrated to match the same aggregate markup data but attribute all markup dispersion to firm heterogeneity rather than to collusion. The papers are thus complementary: Edmond et al. bound the full cost of all markup distortions, while Moreau and Panon bound the portion attributable to cartels and amenable to competition enforcement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What caveats and limitations do the authors acknowledge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors flag several important limitations. (1) The analysis is static: dynamic effects — including entry deterrence by cartels, barriers to exit for inefficient firms, and the innovation-competition relationship — are not modeled. The relationship between competition and innovation is hump-shaped (Aghion et al., 2005), so cartels could in principle spur or dampen innovation; the authors treat their estimates as an upper bound if cartels raise innovation. (2) Only detected French domestic cartels are in the sample; international cartels (investigated by the European Commission) and undetected cartels are excluded, likely causing understatement of total costs. (3) The selection of detected cartels is non-random: the direction of bias from using only discovered cartels is unclear — discovered cartels may be unusually large (biasing costs upward) or undiscovered large cartels may exist (biasing costs downward). (4) The model abstracts from geographic markup dispersion and from vertical arrangements across industries. (5) The model has no entry or exit of firms, which could amplify or dampen transition dynamics. (6) Firm-level prices are unavailable, so markups cannot be directly measured and must be inferred from the model or from labor shares.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Collusion intensity parameter (κ)&lt;/strong&gt;: A scalar in [0,1] that governs the weight each cartel member assigns to co-conspirators&amp;rsquo; profits when choosing output. When κ = 0, behavior is competitive Cournot; when κ = 1, members jointly maximize aggregate cartel profits. In the baseline calibration κ = 0.79, chosen to match a 10% median cartel overcharge in French data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cartel overcharge&lt;/strong&gt;: The percentage difference in cartel members&amp;rsquo; average markups between the cartel equilibrium and the competitive Nash-Cournot equilibrium. Computed as the median overcharge across cartels in the model. In the baseline calibration it is 10%, consistent with the OECD benchmark and Laborde (2021). The overcharge increases with both collusion intensity (κ) and the cartel&amp;rsquo;s total market share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Umbrella pricing&lt;/strong&gt;: The mechanism by which a cartel&amp;rsquo;s higher prices raise the sectoral price index, enabling non-cartel members to expand demand, gain market share, and charge higher markups than they would in the absence of the cartel. In the model, umbrella pricing implies that the introduction of collusion increases the markups of all firms in cartelized sectors, not just cartel members; quantitatively, the effect dampens but does not reverse the aggregate productivity gains from cartel dissolution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distance to efficient allocation&lt;/strong&gt;: The ratio of the productivity gain from eliminating cartels (Acartel → Acomp) to the total productivity gain from eliminating all markup dispersion (Acomp → Aeff or equivalently from Acartel → Aeff). In the baseline, eliminating cartels reduces this distance by 30%, meaning cartels are responsible for roughly 30% of the gap between the actual economy and the first-best efficient allocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous markups (size-related)&lt;/strong&gt;: In the Atkeson-Burstein framework embedded in this model, a firm&amp;rsquo;s equilibrium markup is a harmonic average of within- and between-sector demand elasticities weighted by the firm&amp;rsquo;s own market share. More productive firms endogenously hold larger market shares and thus face lower demand elasticities, charging higher markups. Collusion further distorts this by augmenting the effective market share with co-members&amp;rsquo; shares, yielding supracompetitive overcharges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cartel composition&lt;/strong&gt;: The identity of firms within a cartel — specifically, where they sit in the within-industry productivity distribution. The paper shows this is the single most important determinant of whether cartels amplify or dampen aggregate misallocation. Empirically, discovered French cartels are composed of the largest, most productive firms (nearly 1,900% more sales than non-members), and this is the only composition configuration that can match observed 10% overcharges in the calibrated model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive versus extensive margin of cartel policy&lt;/strong&gt;: The extensive margin refers to whether a cartel exists (zero versus positive κ); the intensive margin refers to the degree of collusion among existing cartel members (high versus low κ). The paper shows both margins are quantitatively important: breaking down all cartels (extensive margin) yields 1.11% TFP gain, while halving κ without dissolution (intensive margin) yields 0.54% TFP gain and 0.85% welfare gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cartel screen&lt;/strong&gt;: A regression of cartel members&amp;rsquo; labor shares on their own market share and the joint cartel market share, derived directly from the model&amp;rsquo;s equilibrium first-order conditions. The collusion intensity κ can be recovered as the ratio of the joint market share coefficient to the sum of both market share coefficients. Applied to French data on detected cartel firms, this screen yields κ̂ = 0.70, close to the calibrated value of 0.79.&lt;/p&gt;</description></item><item><title>Identifying Monetary Policy Shocks: A Natural Language Approach</title><link>https://macropaperwarehouse.com/papers/identifying-monetary-policy-shocks-a-natural-language-approach/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/identifying-monetary-policy-shocks-a-natural-language-approach/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: To study how monetary policy affects the economy, macroeconomists must isolate &amp;ldquo;shocks&amp;rdquo; — changes in interest rates that are not systematic responses to economic conditions. The paper proposes a new identification method that captures the Federal Reserve&amp;rsquo;s information set far more comprehensively than prior approaches, using the natural-language text of documents Fed staff prepare for FOMC meetings, not just numerical forecasts.&lt;/p&gt;
&lt;p&gt;Method and data: The approach extends Romer and Romer (2004), who regress changes in the Federal Funds Rate (FFR) target on Greenbook forecasts and take the residual as the shock. The authors instead convert the text of FOMC documents into many &amp;ldquo;aspect-based&amp;rdquo; sentiment time series and predict the FFR change with both these sentiments and an expanded forecast set. They process 772 PDF files for 276 meetings (630 files for 210 meetings before the zero lower bound), covering Greenbook 1/2, Tealbook A, Redbook, and Beigebook documents, starting October 5, 1982 (when the Fed began targeting the FFR per Thornton 2006). Most documents are released with a 5-year lag, so the latest is from end-2016. They extract the most frequently mentioned economic terms, yielding 296 single/multi-word concepts (e.g., &amp;ldquo;inflation,&amp;rdquo; &amp;ldquo;economic activity&amp;rdquo;). For each concept they build a sentiment indicator by scoring positive (+1) and negative (-1) words within a 10-word window, using an augmented Loughran-McDonald (2011) dictionary of 2,882 classified words. The empirical model (equation 3) includes 132 forecast series, 296 sentiment indicators with 4 lags, and quadratic terms — 3,226 regressors total — far exceeding the 210 FOMC-meeting observations over October 1982 to October 2008. They estimate it with a ridge regression, choosing the penalty by 10-fold cross-validation; the shock is the residual.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: (1) Fit/systematic share: the original Romer-Romer OLS specification yields R-squared of 0.50 (so 50% of FFR variation is attributed to shocks), while the preferred nonlinear ridge with forecasts and sentiments yields R-squared of 0.94 — cutting the exogenous shock share from 50% to 6%, an almost ten-fold reduction. Lags 0–4 give R-squared of 0.75, 0.81, 0.90, 0.92, 0.94. (2) Information content: text-based sentiments predict Greenbook unemployment-rate forecast errors; a one-standard-deviation increase in the sentiment first principal component is associated with an almost 0.5 percentage-point negative 1-year-ahead forecast error (R-squared up to 0.25), supporting the view that staff forecasts are modal, not mean, predictions. (3) Comparison to high-frequency surprises: correlation with Swanson (2021) FFR surprises (1991–2008) is 0.49 (vs. 0.36 for Romer-Romer); 0.77 for the top-10 shocks (vs. 0.61) and 0.51 for the top-10 surprises (vs. 0.18). The estimated shocks have lower autocorrelation (0.066 vs. 0.204 for Romer-Romer). (4) IRFs (BVAR with shock as external instrument, IRF sample 1984:02–2016:12): a tightening produces a persistent yield rise (about 20 months), a fall in real output and rise in unemployment materializing after about a year, a sluggish decline in the price level (mild initial &amp;ldquo;price puzzle,&amp;rdquo; visibly negative after about 18 months, significantly negative after 30 months), a sharp rise in the excess bond premium, and a fall in stock prices — all consistent with theory. By contrast, Romer-Romer OLS residuals imply flat output/unemployment responses, an insignificant EBP response, and positive stock-price/rate comovement, at odds with theory.&lt;/p&gt;
&lt;p&gt;Implications: Including text-based information is essential for clean identification — even for the original method to correctly recover responses (especially of unemployment). A Beigebook-only version extends the method to recent meetings, implying the 2022–2023 tightening (525 bp total) carried only about 21 bp of contractionary shock.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What exactly is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Monetary policy shocks are defined (equation 1) as the residual after orthogonalizing the FFR target change against the central bank&amp;rsquo;s information set. The authors proxy that information set with the full numerical-forecast set plus 296 text-derived sentiment indicators (with 4 lags and quadratic terms), and estimate the prediction via ridge regression with 10-fold cross-validation. The shock is the residual. Two key assumptions inherited from Romer-Romer are threats: (i) the included variables must be a good proxy for the true information set — the paper argues forecasts alone are insufficient because they are modal, not mean, predictions and assume a specific policy path (Faust-Wright 2008), which is why text is required; and (ii) the mapping from information to decisions must be well-specified — they relax linearity by adding quadratic terms. A residual concern is that even the large information set may not capture truly idiosyncratic considerations, but they argue this is exactly what should remain in the shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are text sentiments necessary beyond numerical forecasts — what is the Cochrane critique and how do they answer it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cochrane (2004) argued that to study the effect of policy on a given variable, it suffices to orthogonalize the FFR against the Fed&amp;rsquo;s forecast of that variable alone, since an efficient forecast incorporates all relevant information. This holds only if Greenbook forecasts equal the conditional mean. The authors show, via FOMC transcripts (Appendix D, spanning 1985–2016) and econometrics, that staff produce MODAL forecasts accompanied by verbal descriptions of asymmetric risks. Their sentiment indicators predict Greenbook unemployment forecast errors (Table 2): the first PC and even the single &amp;rsquo;economic activity&amp;rsquo; sentiment are significant at multiple horizons (R-squared up to 0.25; a 1-sd PC increase implies an almost 0.5 pp negative 1-year error). After orthogonalizing forecast errors on sentiment, the error distribution becomes more symmetric and centered on zero (Figure 3). Hence at least some text information is required even for the original Romer-Romer method to recover the true unemployment response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why ridge regression rather than LASSO or OLS?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;OLS is infeasible (3,226 regressors vs. 210 observations). Ridge minimizes residual sum of squares plus a penalty on squared coefficients (shrinkage toward zero), equivalent to Bayesian OLS with a normal prior centered at zero. Unlike LASSO (which produces sparse models), ridge keeps all regressors (a dense model), more akin to factor models/PCA. The authors prefer dense methods because economic data have many correlated regressors and few observations; Giannone, Lenza, and Primiceri (2022) (&amp;rsquo;the illusion of sparsity&amp;rsquo;) find sparse methods become unstable under high collinearity — clearly present across forecasts and sentiments here. The penalty lambda is chosen by 10-fold cross-validation, so the high R-squared is not purely mechanical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the authors interpret what the shocks capture, and what case studies support this?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They inspect FOMC discussions in meetings with the largest estimated shocks. November 7, 1984: largest shock in absolute value — a 75 bp FFR decline of which staff forecasts/sentiments predict 53 bp, leaving a -22 bp easing shock, driven by FOMC participants finding the staff forecast too optimistic. November 15, 1994: a 75 bp hike of which 21 bp is a contractionary shock — Greenspan argued &amp;lsquo;a mild surprise would be of significant value&amp;rsquo; for credibility, and the 75-vs-50 bp gap between his decision and the staff&amp;rsquo;s option almost exactly matches the estimated 21 bp. The interpretation: shocks are FFR decisions that are &amp;lsquo;surprises&amp;rsquo; to the Fed staff — orthogonal to the staff&amp;rsquo;s information set. They note their interpretation is narrower than Romer-Romer&amp;rsquo;s (which included target-definition changes and political pressure, both pre-1982 phenomena per Drechsel 2023). Systematic credibility concerns would be absorbed into systematic policy; only nonsystematic ones become shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three interpretations of why Romer-Romer IRFs go wrong, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Unemployment: because Greenbook unemployment forecasts are modal and text-sentiment predicts their errors, the Romer-Romer OLS cannot fully absorb asymmetric risk shifts, producing a spurious correlation (easing shocks estimated when unemployment rises) and thus a flat/incorrect unemployment IRF (Figure 6). (2) Stock prices: the Fed systematically reacts to equities (Cieslak and Vissing-Jorgensen 2020); failing to control for this leaves spurious positive rate/stock comovement. They test this by adding HF S&amp;amp;P500 surprises as a second instrument with Jarocinski-Karadi (2020) sign restrictions (negative rate/stock comovement for policy shocks): their measure already satisfies the restrictions (Panel a barely changes), whereas the Romer-Romer IRFs change drastically once imposed, &amp;lsquo;correcting&amp;rsquo; activity/price/EBP responses (Figure 7). (3) Credit spreads: Romer-Romer residuals retain endogenous credit-spread variation; the authors&amp;rsquo; sentiments include &amp;lsquo;spreads,&amp;rsquo; &amp;lsquo;credit standards,&amp;rsquo; &amp;lsquo;credit quality.&amp;rsquo; Caldara and Herbst (2019) show that ignoring the Fed&amp;rsquo;s credit-spread reaction attenuates IRFs, supporting this channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) 5-word vs. 10-word sentiment windows give nearly identical R-squared (0.95 vs. 0.94 in the top spec). (2) Sentence-based sentiment construction is highly correlated with the window-based version (0.875 for employment, 0.959 for credit; Appendix C). (3) Lag structure: 0–4 lags raise R-squared 0.75→0.94 with diminishing gains past 4 lags. (4) FOMC composition controls (governor/bank-rep attendance, voting status, appointing president, female attendance) raise R-squared by less than 0.1% — personal dynamics do not drive FFR changes. (5) Alternative nonlinear forms: cubic residuals 99% correlated with quadratic; a ~40,000-variable full-interaction spec yields residuals 96% correlated with quadratic. (6) Forecast-error predictability holds for output and inflation too (Appendix E), and using first-release vs. final-vintage data gives similar results. (7) Local projections (Jorda 2005) confirm the BVAR results, with Romer-Romer again off-theory. (8) IRFs built from only the 10 largest shocks reproduce the main pattern. (9) The extended-forecast ridge (no sentiments) already corrects the IRFs, though the authors stress theory-consistent IRFs are necessary but not sufficient for a good shock measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Beigebook-only extension work and what does it find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Tealbooks/forecasts are released with a 5-year lag, but Beigebooks are public before each meeting. Over 1982–2008, building sentiments from Beigebooks alone gives indicators strongly correlated with the baseline (e.g., &amp;rsquo;economic activity&amp;rsquo;, Figure 8), an R-squared of 0.68 (vs. 0.94 with full documents), and shocks correlated 0.92 with the baseline shocks, with qualitatively similar IRFs. As a proof of concept over December 2015–October 2023 (excluding the March 2020–December 2021 ZLB period), the R-squared is 0.98. Inflation sentiment dropped more than 6 standard deviations in late 2021/early 2022 (driven by &amp;lsquo;concern&amp;rsquo; near &amp;lsquo;inflation&amp;rsquo;). The 2022–2023 tightening of 525 bp total implies only about 21 bp of cumulative contractionary shock — i.e., mostly systematic tightening. This extension is impossible for Romer-Romer because Beigebooks contain no numerical forecasts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contributes to three literatures. (1) Monetary-shock identification: builds directly on Romer-Romer (2004) but adds NLP/ML and a much larger information set; contrasts with SVAR and high-frequency approaches (Gurkaynak et al. 2005, Gertler-Karadi 2015, Swanson 2021, Bauer-Swanson). (2) Text/ML on Fed documents: unlike Sharpe-Sinha-Hollrah (2020), who build a single sentiment index, the authors build aspect-based sentiments per concept; closest are Handlan (2020), who builds a &amp;rsquo;text shock&amp;rsquo; separating forward guidance from current assessment since 2005, and Ochs (2021), who extracts surprises from the private agents&amp;rsquo; viewpoint — the authors instead orthogonalize against the Fed&amp;rsquo;s internal information set, staying closer to Romer-Romer. (3) Greenbook-forecast literature (Romer-Romer 2000, Faust-Wright, Nakamura-Steinsson 2018): they emphasize the modal nature of forecasts and show sentiments explain forecast errors on average.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy/research implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The method delivers a cleanly identified, &amp;lsquo;all-purpose&amp;rsquo; shock series usable for any macro variable — including ones without Fed forecasts (e.g., credit spreads). It spans a longer period than HF measures (which begin in the early 1990s due to futures-data availability and the fact that the FOMC did not announce rate changes publicly before 1994). Scope conditions: the preferred (Tealbook-based) measure requires the 5-year document lag, so recent meetings need the lower-fidelity Beigebook-only version (R-squared 0.68 in-sample); the main estimation sample ends October 2008 to avoid the ZLB. The method relies on the structured, consistent wording of Fed-staff documents, making dictionary-based sentiment particularly applicable. The authors recommend using the baseline measure whenever feasible, even at the cost of dropping recent observations, and resorting to Beigebook-only only when that cost is high. They also suggest combining their measure with HF surprises as multiple external instruments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there caveats about interpreting the model&amp;rsquo;s coefficients?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The ridge is built for prediction (y-hat), not coefficient interpretation (beta-hat). With 3,226 highly collinear regressors plus lags and quadratic terms, individual coefficients cannot be cleanly interpreted — the authors invoke Mullainathan-Spiess (2017) that ML belongs in the y-hat toolbox, and a self-driving-car analogy. A potential downside of a large information set is low statistical power in the shock (since more variation becomes systematic), but they show via the BVAR IRFs that power is not a problem in practice.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Identifying the Impact of Inflation Expectations</title><link>https://macropaperwarehouse.com/papers/identifying-the-impact-of-inflation-expectations/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/identifying-the-impact-of-inflation-expectations/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Branch (2022) asks whether subjective consumer inflation expectations causally raise the inflation rate — a question whose empirical answer has been elusive despite its central role in New Keynesian theory and central bank communication. The identification problem is acute: expectations are endogenous by construction, and the standard approach of estimating a Phillips curve with aggregate data produces estimates biased sharply downward by endogeneity. OLS regressions of regional inflation on regional mean expectations, controlling for unemployment, lagged inflation, and region and time fixed effects, yield a slope of only 0.069 (Table 2 context; Figure 1b), far below the theoretical prior of near-unity pass-through.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s empirical strategy exploits a key fact: different demographic groups consume heterogeneous bundles of goods, so their inflation expectations differ systematically and reflect their own basket&amp;rsquo;s price movements. Using roughly 273,000 individual responses from the University of Michigan Survey of Consumers spanning 1978:1–2022:5, Branch classifies respondents into 160 demographic groups defined by sex, age (five categories), education (four levels), marital status, and parental status. The panel covers four U.S. Census regions, producing dimensions T = 528 months, N = 4 regions, and G = 160 groups. Regional inflation is measured from BLS CPI series for all urban consumers.&lt;/p&gt;
&lt;p&gt;The identification strategy is a shift-share (Bartik) instrument: for each region-month, the predicted regional inflation expectation is the population-weighted average of each demographic group&amp;rsquo;s national-level average inflation expectation, where the weights are the group&amp;rsquo;s share of the region&amp;rsquo;s population. Two share measures are used: (i) the January 1978 Current Population Survey (CPS78) distribution, which is time-invariant and plausibly exogenous to subsequent inflation shocks; and (ii) contemporaneous Michigan survey shares. The leave-one-out variant is the preferred construction. The instrument is relevant — first-stage F-statistic of 52.4 (significant at 0.1%) — and the Durbin-Wu-Hausman test rejects OLS consistency at the 1% level (statistic = 8.074).&lt;/p&gt;
&lt;p&gt;Main 2SLS estimates: using Michigan survey shares, a 1 percentage point increase in a region&amp;rsquo;s expected inflation raises regional inflation by 0.33 percentage points (significant at 5%; Table 2). Using CPS78 shares, the estimate rises to 0.55 percentage points (significant at 1%; Table 2). After applying the split-sample jackknife bias correction for finite-sample bias in the small-N/large-T panel, the estimates increase slightly to 0.36 and 0.60 respectively (Table 3). The paper characterizes the 60 basis point estimate as its &amp;ldquo;preferred&amp;rdquo; figure. Both are substantially above the OLS estimate of 0.069 and represent a lower bound: because time fixed effects absorb cross-regional spillovers, the aggregate pass-through is likely stronger, with the paper arguing that after accounting for spillovers the effect is plausibly in the range of 1.0–1.6, consistent with the Calvo- and Taylor-model predictions of Werning (2022), who shows pass-through should lie in [1/2, 1] or above.&lt;/p&gt;
&lt;p&gt;Sectoral decomposition reveals that the expectation effect is concentrated in non-durable goods prices (coefficient 1.74, significant at 1%; Table 7) and commodities more broadly (1.29, significant at 1%; Table 7), with no statistically meaningful effect on durables (−0.10, insignificant) and only marginal positive effects on services (0.22, marginally significant). Among services, the effect is somewhat larger when housing services are excluded.&lt;/p&gt;
&lt;p&gt;A key finding on expectations horizons: when both one-year-ahead and five-to-ten-year-ahead expectations are simultaneously instrumented using their respective Bartik shift-shares, only the short-run (one-year) expectation retains a significant positive effect on inflation. The long-horizon coefficient is small in absolute value, negative in sign, and statistically insignificant in both the joint and standalone specifications (Tables 10 and 12). After conditioning on aggregate macroeconomic factors captured by time fixed effects, long-run inflation expectations have no independent causal role in the regional inflation rate.&lt;/p&gt;
&lt;p&gt;Identification heterogeneity: using the Rotemberg weight decomposition of Goldsmith-Pinkham, Sorkin, and Swift (2020), the identifying variation derives primarily from younger, married consumers with at least a high school degree — specifically those aged 18–34 (Michigan instrument) or 25–49 (CPS78 instrument). The group-specific treatment effects (βg) for these heavily weighted groups are positive and significantly above 1. Temporally, the heaviest identification weights fall on the Great Inflation and Volcker disinflation (1978–82), the Great Recession (2007–09), and the post-pandemic inflation episode (2021–22). The impulse response function shows a significant contemporaneous positive effect of expectations on inflation that mean-reverts cyclically within approximately 12 months, though confidence bands are wide at longer horizons.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core identification strategy and what makes it plausible?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy is a differential-exposure quasi-experiment using a Bartik (shift-share) instrument. For each Census region and month, the instrument is the population-weighted average of each demographic group&amp;rsquo;s national-level mean inflation expectation, with weights equal to that group&amp;rsquo;s share of the region&amp;rsquo;s population. The key identifying assumption has two parts: (1) demographic groups have heterogeneous consumption baskets, so their inflation expectations reflect the prices in their own basket; and (2) the distribution of demographic groups across regions is exogenous to unobserved shocks driving regional inflation (as opposed to being exogenous to regional price levels, which is a weaker and separately justified claim). Plausibility is supported by the CPS78 shares having no predictive power for the other covariates of inflation over the sample, and by using a leave-one-out instrument construction to avoid mechanical correlation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main threats to identification and how does the paper address them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The principal threat is that regional demographic composition could be endogenous to regional inflation rather than merely to regional price levels. The paper argues identification requires only exogeneity to the change in prices (inflation), not to the level. The empirical check is that CPS78 beginning-of-period shares show no statistically or economically significant correlation with the other regressors that predict regional inflation. A second threat is that groups may sort into regions based on economic conditions correlated with inflation. The paper argues the channel runs through demand from heterogeneous baskets rather than supply-side sorting. A third threat is weak instruments: this is addressed by first-stage F = 52.4. Fourth, survey measurement concerns (re-interview selection bias, outliers, endogenous prompting thresholds) are addressed through a battery of alternative specifications (first-time respondents only, outlier removal, CPS vs. survey shares, lagged shares, alternative CPI measures).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are OLS estimates biased downward and by how much?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;OLS is biased because inflation expectations are endogenous — they move with the same shocks driving inflation, so OLS conflates the causal effect with reverse causation and omitted-variable bias. The OLS estimate from the panel regression with region and time fixed effects is approximately 0.069 (Figure 1b). The 2SLS estimates using the Bartik instrument range from 0.33 to 0.55, roughly five to eight times larger than OLS, confirming substantial downward bias. The Durbin-Wu-Hausman test confirms OLS inconsistency at the 1% level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across demographic groups is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Women consistently report higher inflation expectations than men, particularly outside the high-inflation 1970s episode. Older respondents (50+) receive small Rotemberg identification weights, meaning their expectations contribute little to the identifying variation. Younger groups (18–34 under Michigan shares; 25–49 under CPS78 shares), married, with at least a high school education are the groups whose expectations drive the regional cross-sectional identification. The group-specific causal effects (βg) for these heavily weighted groups are uniformly positive and significantly above 1.0, ranging roughly from 1.38 to 1.91 in the top-10 groups. College-educated groups receive higher weight under the CPS78 instrument, while the Michigan shares instrument weights high school and college groups more evenly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the sectoral decomposition of the inflation expectations effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 7 estimates separate 2SLS regressions for components of the CPI. Non-durable goods prices respond most strongly (coefficient 1.74, significant at 1%). Commodities broadly (which include non-durables and durables) also show a large effect (1.29, significant at 1%). Durable goods prices show no meaningful effect (−0.10, statistically insignificant). Services show only a marginal positive effect (0.22, marginally significant at 10%). Among services, the effect is somewhat stronger when housing services are removed. These results are consistent with prior findings that consumer grocery and non-durable prices most directly influence and reflect household inflation expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the long-run expectations results show and what is the interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Michigan survey&amp;rsquo;s PX5 question elicits 5-to-10-year ahead inflation expectations. Constructing a shift-share Bartik instrument for these long-horizon expectations and including both short- and long-run instruments simultaneously, the second-stage coefficient on long-horizon expectations is small (−0.023 to −0.037 in the joint specification, Table 10), negative, and statistically insignificant in all specifications. When long-horizon expectations alone are instrumented, the second-stage coefficient is 0.005 to 0.034 (Table 12), positive but still insignificant. The interpretation is that, after controlling for time fixed effects (which capture aggregate macroeconomic factors), long-run expectations have no independent causal role in regional inflation outcomes. Only short-run (one-year ahead) expectations matter. The first stage confirms the long-run instrument is relevant for long-run expectations but orthogonal to short-run expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are reported and what do they find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 8 reports four alternative specifications, all using Michigan survey shares: (1) &amp;lsquo;small&amp;rsquo; — removing survey responses with absolute values above 25% — gives a coefficient of 0.66 (significant at 1%), larger than baseline, though the paper does not prefer this because large expectations may have real behavioral effects; (2) &amp;lsquo;first-only&amp;rsquo; — using only first-time respondents and dropping the 40% re-interviewed — yields a coefficient of 0.58, still positive though the standard error rises and significance falls; (3) &amp;lsquo;state-CPI&amp;rsquo; — replacing the BLS regional CPI with state-level CPIs aggregated as in Hazell et al. (2022) — gives 0.33 (significant at 5%), very close to the Michigan-shares baseline; (4) &amp;rsquo;lag Michigan shares&amp;rsquo; — instrumenting with 12-month lagged survey shares — gives 0.53 (significant at 5%), bracketed between the two baseline estimates. The jackknife bias correction (Table 3) slightly raises estimates to 0.36 and 0.60 for the two instruments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the impulse response function show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using local projections (Jordà 2005) to estimate a 2SLS impulse response function, a shock to inflation expectations produces a significant positive contemporaneous effect on regional inflation. The response is cyclical and mean-reverting, returning to near zero within approximately 12 months. Confidence intervals are wide in subsequent quarters, so the analysis cannot rule out lingering effects, but the central estimates suggest the impact dissipates within about a year. The paper notes that the lack of strong persistence may reflect the specific U.S. inflation history and suggests extending the analysis to countries with more volatile or persistent inflation histories.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to the New Keynesian Phillips Curve literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The standard approach to measuring expectations&amp;rsquo; impact on inflation is to estimate a NKPC with an instrument for expectations under rational expectations. Mavroeidis, Plagborg-Moller, and Stock (2014) document that this approach faces severe identification and weak-instrument problems. Branch&amp;rsquo;s approach avoids these issues by not assuming rational expectations, not requiring an explicit model of expectations formation, and using a shift-share instrument whose validity rests on cross-sectional demographic heterogeneity rather than time-series moment conditions. The theoretical model in Section 3.1 permits non-rational expectations and nests &amp;lsquo;anticipated utility&amp;rsquo; or &amp;lsquo;steady-state learning&amp;rsquo; (Evans and Honkapohja 2001; Woodford 2013) as the simplifying assumption. The estimated regional coefficients are below but potentially consistent with Werning&amp;rsquo;s (2022) theoretical range of [1/2, 1] for Calvo and Taylor pricing models once spillovers are accounted for.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to the literature on household-level inflation heterogeneity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper builds on Hobijn and Lagakos (2005), who show households consume different bundles, and Kaplan and Schulhofer-Wohl (2017), who find two-thirds of cross-household inflation variation stems from paying different prices for the same goods. D&amp;rsquo;Acunto, Malmendier, Ospina, and Weber (2021) establish that grocery store prices directly influence household inflation expectations. Branch takes these findings as given — they motivate the identifying assumption that expectations reflect basket-specific prices — and focuses on the downstream question of whether those expectations causally raise actual inflation outcomes. Earlier work on heterogeneous expectations by Branch (2004, 2007) using Michigan survey data, finding time-varying heterogeneity across forecasting rules, is also directly referenced.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the Rotemberg weight decomposition reveal about the source of identifying variation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Bartik estimate is a weighted average of 160 just-identified group-specific estimates. Goldsmith-Pinkham, Sorkin, and Swift (2020) show the weights (αg) measure each group&amp;rsquo;s contribution to the overall estimate and sensitivity to bias from that group&amp;rsquo;s potential endogeneity. Tables 4–5 list the top-10 weighted groups: under CPS78 shares, these are predominantly 25–49-year-olds, mostly college-educated, seven of ten married with children. Under Michigan shares, the top groups are even younger (mostly 18–24), with at least a high school degree, almost all married without children. Table 6 shows men receive slightly higher aggregate weight than women (0.53–0.57 vs. 0.43–0.47), and those aged 50+ contribute less than 15% of total weight. Figure 11 shows temporal variation: the heaviest-weighted periods are the late-1970s Great Inflation and Volcker disinflation, the Great Recession (2007–09), and the post-pandemic episode (2021–22).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper provides empirical support for central bank attention to short-run consumer inflation expectations: a 1 percentage point increase in one-year-ahead regional expectations causally raises regional inflation by 0.33–0.55 basis points (lower bound, since spillovers are excluded). Accounting for cross-regional aggregate effects raises the likely total pass-through to above one, validating the central bank emphasis on anchoring short-run expectations. However, the null finding for long-run (5-to-10-year) expectations — controlling for aggregate time effects — suggests that &amp;lsquo;anchoring long-run expectations&amp;rsquo; may not independently prevent near-term inflation above and beyond its correlation with short-run beliefs. The scope conditions are important: the estimates come from U.S. Census regions over 1978–2022, so applicability to countries with persistently high or hyper-inflation is uncertain. The identifying variation is concentrated in high-volatility inflation episodes, suggesting potential nonlinearities in the expectations-to-inflation mapping. The empirical strategy also does not capture general equilibrium feedback from realized inflation back to expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the data limitations and survey design concerns the paper acknowledges?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Five limitations of the Michigan survey are acknowledged: (1) whether surveys elicit genuine expectations rather than attitudes; (2) the rotating panel structure, with roughly 40% of respondents re-interviewed after six months, creates potential selection bias if more accurate forecasters are likelier to re-participate; (3) declining telephone response rates threaten representativeness; (4) the survey prompts respondents reporting &amp;lsquo;unreasonable&amp;rsquo; expectations, with the threshold endogenously tied to recent inflation history; (5) the question wording asks about &amp;lsquo;prices going up&amp;rsquo; rather than &amp;lsquo;aggregate U.S. inflation&amp;rsquo;, making the measure closer to consumption-basket-specific expectations — which the paper treats as a feature rather than a flaw for its identifying assumption. The paper addresses concerns (1)–(4) through alternative specifications (first-time-only respondents, outlier removal, CPS vs. survey shares). The geographic dimension is limited to four Census regions because finer location identifiers are unavailable for a long panel.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shift-share (Bartik) instrument for expectations&lt;/strong&gt;: In this paper, the instrument for regional inflation expectations is constructed by interacting each demographic group&amp;rsquo;s national-level mean inflation expectation (the &amp;lsquo;shift&amp;rsquo;) with that group&amp;rsquo;s population share in the region (the &amp;lsquo;share&amp;rsquo;). The resulting weighted average predicts how much regional expectations would be elevated purely by the region&amp;rsquo;s demographic composition reacting to aggregate group-level expectation shocks, isolating variation plausibly orthogonal to region-specific inflation supply shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Differential exposure quasi-experiment&lt;/strong&gt;: The identification design exploits the fact that U.S. Census regions have different demographic compositions, giving them differential exposure to aggregate shocks in group-specific inflation expectations. Regions with a higher share of a group whose expectations are rising will see a larger predicted increase in regional expectations than regions with a lower share of that group, independent of region-specific factors — this cross-regional contrast is the source of causal identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rotemberg weights&lt;/strong&gt;: Following Goldsmith-Pinkham, Sorkin, and Swift (2020), the Bartik 2SLS estimate is decomposed as a weighted sum of 160 just-identified group-specific estimates, where the weight αg for group g measures the sensitivity of the overall estimate to potential endogeneity in group g&amp;rsquo;s share. Groups with large αg drive identification and are the groups most important to probe for exogeneity. In this paper, the heaviest-weighted groups are younger, married consumers with at least a high school degree.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Anticipated utility / steady-state learning&lt;/strong&gt;: The paper&amp;rsquo;s theoretical model allows for non-rational subjective expectations. Firms and households are modeled as &amp;lsquo;anticipated utility&amp;rsquo; maximizers (Woodford 2013) who adjust expectations over time (&amp;rsquo;learning&amp;rsquo;) but assume for current decisions that expected inflation will remain at its present rate — termed &amp;lsquo;steady-state learning&amp;rsquo; by Evans and Honkapohja (2001). This assumption implies future prices evolve along a linear trend from current expectations, yielding a tractable closed-form link between current expectations and the sector-specific price-setting equation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneous consumption baskets as identification&lt;/strong&gt;: The paper&amp;rsquo;s core identifying assumption is that different demographic groups consume different bundles of goods across sectors, so their inflation expectations reflect the price changes in their own basket rather than a common aggregate signal. This basket heterogeneity is what makes group-level expectations differ systematically and allows the shift-share instrument to generate exogenous variation in regional inflation expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lower bound interpretation of regional estimates&lt;/strong&gt;: The 2SLS estimates capture only the regional (within-country, across-region) effect of expectations on inflation, because time fixed effects absorb cross-regional spillovers — if expectations rise in one region, the increased demand for traded goods spills into other regions and raises their prices too. The paper argues the regional estimates are therefore a lower bound on the aggregate pass-through from expectations to overall U.S. inflation, consistent with the stronger aggregate correlation seen in Figure 1a.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-run expectations nullity&lt;/strong&gt;: The paper&amp;rsquo;s extension finds that 5-to-10 year inflation expectations, instrumented with their own shift-share Bartik and included alongside the one-year instrument, have no statistically or economically significant causal effect on regional inflation once time fixed effects control for aggregate factors. This result implies that, conditional on short-run expectations and macroeconomic controls, long-horizon expectations carry no independent causal information for the current inflation rate.&lt;/p&gt;</description></item><item><title>Import Liberalization as Export Destruction? Evidence from the United States</title><link>https://macropaperwarehouse.com/papers/import-liberalization-as-export-destruction-evidence-from-the-united-states/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/import-liberalization-as-export-destruction-evidence-from-the-united-states/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; How does import liberalization affect a country&amp;rsquo;s &lt;em&gt;export&lt;/em&gt; performance and welfare? Economic theory (Graham 1923, Ethier 1982, Krugman 1984) shows the answer hinges on whether production exhibits increasing returns to scale at the sector level. Krugman (1984) argued that with scale economies, import protection can be export-promoting because a protected industry expands, exploits scale economies, becomes more productive, and exports more — so conversely import liberalization is &amp;ldquo;export destroying.&amp;rdquo; The paper turns this logic into an empirical test: the sign of the import-liberalization-to-export relationship discriminates between constant-returns and increasing-returns trade models. Researchers otherwise lack tools to choose between these model classes, yet the choice matters greatly for multi-sector trade policy analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and data.&lt;/strong&gt; The authors build a multi-sector general-equilibrium gravity model generalizing Krugman (1980) to many countries/sectors with input-output linkages (as in Caliendo-Parro 2015). The model nests constant returns (Armington, σ→∞) and increasing returns. The &amp;ldquo;scale elasticity&amp;rdquo; is 1/(σ−1); the &amp;ldquo;output elasticity&amp;rdquo; of exports equals the trade elasticity (ε−1) times the scale elasticity, and is positive iff there are increasing returns. The empirical application exploits US Permanent Normal Trade Relations with China (PNTR), passed Oct 2000, which removed tariff-revocation uncertainty. Exposure is measured by Pierce-Schott&amp;rsquo;s NTR gap (log gap between non-NTR and NTR tariffs; mean 0.23, SD 0.13, range 0–0.59). Trade data are from CEPII BACI; the baseline sample covers exports from 23 OECD countries (including the US) to 141 importers across 444 NAICS goods industries, in long differences (1995–2000 pre-period vs 2000–07 post-period).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings.&lt;/strong&gt; Reduced-form: US export growth fell in higher-NTR-gap industries after PNTR. The raw Figure 1 slope is −0.51 (SE 0.057); a 10-log-point NTR-gap increase is associated with 5.0 log points lower annual export growth, and the NTR gap explains 18% of cross-industry variation. This is inconsistent with constant returns and implies increasing returns in US goods production. An offsetting &lt;em&gt;input cost effect&lt;/em&gt; (lower imported-input costs) raises exports: PNTR reduced 2007 exports by 13% more for a 75th- vs 25th-percentile NTR-gap industry, but raised them 20% more for a 75th- vs 25th-percentile input-cost-shock industry; net effects range from −18% (Cigarettes) to +56% (Automobiles). A structural IV (NTR gap instrumenting output growth) yields an output elasticity of 0.74 (SE 0.41, preferred column).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative GE results.&lt;/strong&gt; Calibrating the output elasticity to 0.821 (matching the −0.10 conditional NTR-gap effect; trade elasticity set to 5), PNTR raised aggregate US exports/GDP by 3.2%, decomposed into −1.8% real market potential (export destruction), +2.4% input cost, and +2.7% foreign demand. Aggregate export growth is 28% larger with scale economies than without, because scale economies make the input-cost effect almost five times stronger (2.4% vs 0.5%). Exports nevertheless declined in the most exposed sectors (Textiles &amp;amp; Leather, Other Manufacturing), shifting US comparative advantage away from high-NTR-gap sectors. Welfare: PNTR raised US real income 0.068% (real expenditure 0.087%); gains are ~30% smaller than under constant returns because a negative specialization effect (−0.15%) offsets a larger ACR openness gain (0.22%). Chinese gains exceed US gains tenfold.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core theoretical test and why does the sign of the import-liberalization-to-export relationship identify returns to scale?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;From the bilateral trade equation, the elasticity of exports to output equals the output elasticity (ε−1)/(σ−1), which is strictly positive iff there are increasing sector-level returns. Under constant returns (Proposition 1), conditional on foreign demand and domestic input costs, import liberalization does not affect exports (α1=0). Under increasing returns (Proposition 2), import liberalization shrinks domestic real market potential, lowers output, and — because productivity falls with output under scale economies — reduces exports to ALL destinations (α1&amp;lt;0), with the effect&amp;rsquo;s magnitude strictly increasing in the output elasticity. So estimating whether export growth falls in more-liberalized industries distinguishes the two model classes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and its main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A triple-difference: changes in US bilateral export growth by sector after PNTR relative to changes in other OECD exporters&amp;rsquo; growth, identified from the NTR gap interacted with Post and a US-exporter dummy. The estimating equation (12) uses importer-exporter-industry, importer-exporter-period, and importer-industry-period fixed effects to absorb importer demand, common-across-exporter technology shocks, and industry trends in supply capacity and trade costs. The NTR gap is plausibly exogenous because variation stems mostly from Smoot-Hawley (1930) non-NTR tariffs, unlikely related to economic conditions 70 years later; any endogeneity from NTR tariffs being higher in weak-growth industries would bias against finding a negative effect. Threat 1: unobserved US-specific technology shocks negatively correlated with the NTR gap not captured by input/skill/capital intensity controls. Addressed by re-estimating at HS 6-digit level with NAICS-industry-exporter-period fixed effects (Table 3), still finding negative effects. Threat 2: US-China competition in third markets — if PNTR shifted China&amp;rsquo;s export basket toward US-type products in high-NTR-gap industries. Tested by interacting with China&amp;rsquo;s market share (Table 4); the quadruple interaction is positive and insignificant, ruling this out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three mechanisms and how are they distinguished empirically and quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Real market potential / export destruction: import liberalization lowers the US price index, makes the domestic market more competitive, shrinks real market potential and output, and (under scale economies) cuts productivity and exports — identified by the negative α1 on the NTR gap. (2) Input cost effect: lower imported-input costs cut production costs and raise exports — identified by α2 on the input-output-weighted upstream NTR gap (CostShock), found negative and significant (lower input costs → higher exports). (3) Foreign demand effect: GE expansion of global demand and the trade-balance link between imports and exports — absorbed by fixed effects in the regression but recovered in the calibrated model&amp;rsquo;s decomposition (equation 16). In GE: −1.8% (market potential), +2.4% (input cost), +2.7% (foreign demand), netting +3.2%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Sector-level: the real market potential effect is negative in all goods sectors and stronger where the NTR gap is higher; the input cost effect is positively correlated with the NTR gap (due to heavy diagonal weight in the I-O table); the foreign demand effect is positive everywhere but uncorrelated with the NTR gap. Net exports/GDP rise in 12 of 15 goods sectors but fall in the highest-NTR-gap sectors — Textiles &amp;amp; Leather falls 22% (−32% market potential, +8.5% input cost, +4.6% foreign demand) and exports decline in 3 of the 4 highest-NTR-gap sectors. Under constant returns, by contrast, export growth is positive in all sectors and weakly POSITIVELY correlated with the NTR gap — qualitatively opposite. The correlation between sector-level export growth with vs without scale economies is insignificant (excluding Textiles &amp;amp; Leather) or significantly negative (including it).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Appendix C checks robustness to: starting the post-period in 2001 instead of 2000; alternative NTR-gap definitions; aggregating exports across destinations; varying the exporter/importer/industry samples; allowing PNTR to affect domestic expenditure; and controlling for China import growth driven by non-PNTR shocks. An event study (equation 13, Figure 2) shows no NTR-gap/export relationship before 2000 and a negative one from 2001 until the 2007–08 financial crisis, ruling out pre-trends. The first-stage (Table 5) confirms higher-NTR-gap industries had lower OUTPUT growth (paralleling Pierce-Schott&amp;rsquo;s employment result). Alternative calibrations (Appendix D.5): without I-O linkages the market potential effect weakens but total export growth is roughly unchanged; allowing services scale economies raises US gains; combining Textiles &amp;amp; Leather with Other Manufacturing preserves results; using Bartelme et al. (2019) sector-varying elasticities still yields a negative specialization effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the output elasticity calibrated and how does it compare to the structural estimate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The output elasticity for goods is calibrated to 0.821 by matching the simulated NTR-gap effect to the −0.10 conditional reduced-form estimate (Table 2, column i), with services output elasticity set to zero and trade elasticity (ε−1) set to 5 (Head-Mayer 2014). This is below the value of 1 implied by Krugman (1980) or the Pareto-Melitz model but close to the Bartelme et al. (2019) mean of 0.83. It is reassuringly close to the independent structural IV estimate of 0.74 (SE 0.41). The simulated effect is decreasing in the output elasticity (consistent with Proposition 2 part ii) and rises sharply as the elasticity approaches one; the model has a unique solution for output elasticities below 0.95.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the welfare decomposition work and why are gains smaller with scale economies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Costinot-Rodríguez-Clare (2014), real-income gains decompose into an ACR term (changes in domestic expenditure share / trade openness) and a specialization term that exists only with scale economies (welfare from sectoral reallocation of employment, weighted by adjusted Leontief forward-linkage coefficients). With scale economies the ACR effect is +0.22% (vs +0.10% without), but it is more than offset by a −0.15% specialization effect, netting +0.068% real income — about 30% below the constant-returns gain. The specialization effect is negative because PNTR shifted resources toward services (weaker scale economies; goods output −0.55%, services +0.11%) and, more importantly per Appendix D.5, toward sectors with weaker FORWARD input-output linkages; cross-sectoral heterogeneity in scale economies alone contributes negligibly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends Krugman (1984)&amp;rsquo;s partial-equilibrium oligopoly mechanism to a class of quantitative GE trade models (love-of-variety, external economies, Melitz-Pareto, or endogenous innovation — shown equivalent in Appendix A.3). Unlike prior scale-economy estimates (Antweiler-Trefler 2002, Lashkaripour-Lugovskyy 2018, Bartelme et al. 2019) and home-market-effect tests (Davis-Weinstein 2003, Costinot et al. 2019), it uses TRADE POLICY variation (not factor content, market size, or exchange rates) for identification and performs an ex-post policy analysis (echoing Goldberg-Pavcnik 2016). Relative to the PNTR/China-shock literature (Pierce-Schott 2016, Handley-Limão 2017, Autor-Dorn-Hanson 2013), it adds a new outcome — US EXPORTS and comparative advantage — and argues the &amp;lsquo;surprisingly swift&amp;rsquo; manufacturing decline would have been smaller absent scale economies. It complements Juhász (2018)&amp;rsquo;s infant-industry evidence (Napoleonic France) by quantifying the export-destruction cost while showing PNTR&amp;rsquo;s net effect on exports and welfare is positive. Dick (1994) tested the same hypothesis cross-sectionally for 1970 US data but found little support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The findings support the existence of the scale-economies channel traditionally invoked to justify protection: pre-PNTR import protection shifted US comparative advantage toward the most-protected industries, and in the calibrated model targeted import protection CAN promote sector-level exports — but not under constant returns. However, the export-destruction effect is dominated, for most sectors and in aggregate, by export-promoting channels (input cost, foreign demand); total export growth is even greater WITH scale economies; and the negative specialization effect is more than offset by traditional gains from trade, so US gains from PNTR remain positive (+0.068% real income). Scope conditions: results rest on the calibrated output elasticity (0.821) and trade elasticity (5); the model assumes constant markups and full employment, so welfare excludes pro-competitive effects (Jaravel-Sager 2020, Amiti et al. 2020) and employment effects (Autor-Dorn-Hanson 2013); it studies a single liberalization episode; and the analysis cannot distinguish among alternative SOURCES of increasing returns. The authors stress accounting for scale economies (or their absence) is a prerequisite for correctly evaluating sector-level trade flows and welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What other notable findings or caveats appear?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;PNTR is calibrated as a reduced-form openness shock (α5=0.43; equation 15), equivalent to a 13% average trade-cost reduction on US imports from China (SD 6.6% across industries) given trade elasticity 5 — matching Handley-Limão&amp;rsquo;s 13-percentage-point estimate. The calibrated economy has 12 economies and 24 sectors (15 goods). Chinese gains exceed US gains more than tenfold (because the US was much larger in 2000, so PNTR was a bigger shock to China), and China&amp;rsquo;s nominal wage rose 6.0% relative to the US, contributing to factor-price convergence. For comparison, Caliendo-Parro (2015) find NAFTA raised US welfare 0.08% and Fajgelbaum et al. (2020) find the Trump trade war cut US real income 0.04%. The model in changes is solved via exact hat algebra, holding each country&amp;rsquo;s trade deficit as a constant share of global value-added (which induces the positive import-export link in the foreign-demand term).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Information and the Formation of Inflation Expectations by Firms: Evidence from a Survey of Israeli Firms</title><link>https://macropaperwarehouse.com/papers/information-and-the-formation-of-inflation-expectations-by-firms-evidence-from-a-survey-of-israeli-firms/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/information-and-the-formation-of-inflation-expectations-by-firms-evidence-from-a-survey-of-israeli-firms/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; How do firms form and update inflation expectations during a monetary-policy regime change and a transition from high/volatile inflation to a low, stable, inflation-targeting environment? This matters because tracking and managing expectations is central to modern monetary policy (especially under forward guidance), yet high-quality firm-level expectations data—particularly across regime changes—are scarce (Bernanke 2007). A central tension in the literature is that firms and households in long-stable advanced economies are largely inattentive to inflation and monetary policy, plausibly because successful stabilization removes the incentive to monitor them. Israel offers a natural experiment: its recent history of high inflation and dollarization, followed by disinflation, de-dollarization, and the anchoring of expectations at the ~2% target midpoint around 2003.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and design.&lt;/strong&gt; The authors use the Bank of Israel Firms&amp;rsquo; Survey, a quarterly survey (quantitative inflation-expectation questions added in 1997), covering six industries (post-2009 shares: manufacturing 36%, services 36%, commerce 14%, transportation/communications 5%, hotels 5%, construction 4%). The main analysis sample is 2001Q3–2018Q3. The survey is voluntary, unbalanced, not nationally representative; late-sample participation fell to ~250–300 firms with a response rate around 30%. Identification exploits within-quarter variation in response timing: because Israel&amp;rsquo;s CPI is published monthly on the 15th and policy-rate decisions are scheduled, firms responding after a release (&amp;ldquo;treatment&amp;rdquo;) had information that firms responding earlier (&amp;ldquo;control&amp;rdquo;) did not. Surprises are defined relative to professional forecasters&amp;rsquo; mean expectations: an inflation (CPI) surprise and a monetary (policy-rate) surprise. Identification assumes response timing is random; the authors show firm characteristics generally do not predict either response period (Table 4) or the cross-section of expectations (Table 3). Estimation uses two-way (firm and quarter) fixed-effects panel regressions interacting treatment dummies with surprise size, plus a lagged dependent variable; local projections (Jordà 2005) first show output/employment respond to the shocks, motivating that beliefs should too.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings (Table 9, full sample 2001Q3–2018Q3).&lt;/strong&gt; A positive inflation surprise of one percentage point raises 1-year inflation expectations by about 0.5 pp from the second-monthly-CPI surprise (coefficient 0.467) and about 0.7 pp from the third-monthly-CPI surprise (0.700). The effect on 1-quarter expectations is weaker (≈0.12 and ≈0.29). Because the annual response exceeds the quarterly response, firms on average treat CPI surprises as persistent, not transitory. A surprise one-percentage-point hike in the policy rate lowers 1-year inflation expectations by about 0.3 pp (coefficient 0.343, negative sign) and 1-quarter expectations by roughly 0.15 pp. The mean second-month-CPI treatment dummy itself is small (-0.07 pp), so the interaction terms carry the economic content.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms and scope conditions.&lt;/strong&gt; The inflation-surprise result is robust across sub-periods, before/after 2010, firm sizes, and industries. The monetary-surprise result is NOT robust: dropping the large 2001–2002 policy shocks (sample 2002Q3–2018Q3) renders it insignificant and sign-flipped, consistent with policy shocks having little effect on beliefs in stable environments (Coibion et al. 2020; Ilek 2021 for Israeli forecasters). Implication: even after de-dollarization and prolonged low/stable inflation, Israeli firms keep monitoring macro news; (re)anchoring expectations—making them insensitive to news—may take a long time, an insight relevant for countries now facing high inflation.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy exploits variation in survey response timing within each quarter. Because Israel publishes CPI on the 15th of each month and policy-rate decisions are on scheduled dates, firms that respond after a release (treatment) have seen information that firms responding earlier (control) have not. Responses are grouped into Periods 1, 2, 3 (and Period 0 for missing/late dates), generating two CPI surprises (second- and third-monthly index) and one interest-rate surprise per quarter. The key identifying assumption is that response timing is as-good-as random. The main threat is selection—if attentive or expectation-distinctive firms systematically respond later, treatment status would be endogenous. The authors address this by regressing exposure-period indicators on observable firm characteristics (Table 4) and finding characteristics generally do not predict response period; they also confirm firm characteristics do not explain cross-sectional expectation levels (Table 3). A placebo test replacing the dependent variable with the prior quarter&amp;rsquo;s expectation (t-1) finds no effect (Appendix Table B5), supporting the timing identification. A residual threat is unobservable correlates of timing not captured by observables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms: (1) firms update inflation expectations to new CPI information, and (2) firms update to monetary-policy information. They are distinguished by using separate, independently timed surprises (CPI releases vs. policy-rate decisions) and separate interaction terms. Persistence vs. transitory perception is inferred from the horizon pattern: because the 1-year response to a CPI surprise (~0.5–0.7 pp) exceeds the 1-quarter response (~0.12–0.29 pp), firms must expect the price increase to continue over subsequent quarters, i.e., they perceive CPI shocks as persistent. For monetary policy, the smaller 1-quarter than 1-year effect is read as consistent with monetary policy operating with a lag. The output/employment local projections (Table 8) show a non-monotonic response to rate surprises (rises in quarters 0–1, declines in quarters 2–3), which the authors note could mix conventional contractionary effects with an information effect (a higher rate signaling a stronger economy).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By firm size (Table 11): all three size groups (small, medium, large) respond to CPI surprises on 1-year expectations and the differences across groups are generally not statistically significant; the interest-rate-surprise effect resembles the pooled estimate for medium and large firms but is not statistically significant for small firms. By industry (Table 12): the CPI-surprise effect on 1-year expectations is positive and statistically significant in nearly every industry, whereas the interest-rate-surprise effect on 1-year expectations (full sample) is negative and significant only in manufacturing. Over time (Table 10): the 1-year CPI-surprise effect is almost identical before and after 2010 (the year the monetary committee was established), and the 1-quarter effect is similar or if anything stronger in the later period. Cross-sectionally, firm size, industry, and region are mostly statistically and economically insignificant predictors of expectation levels (Table 3).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Shorter sample 2002Q3–2018Q3 excluding the large 2001–2002 policy shocks—CPI-surprise results essentially unchanged, monetary-surprise results become insignificant and change sign. (2) Split before/after 2010 allowing time-varying effects (Table 10). (3) Heterogeneity by size (Table 11) and industry (Table 12) as consistency checks. (4) A placebo test regressing the previous quarter&amp;rsquo;s (t-1) expectation on current-quarter news, finding no effect (Appendix Table B5). (5) Checks that firm characteristics predict neither response timing (Table 4) nor expectation levels (Table 3), supporting the random-timing assumption. (6) Local projections on output and employment (Table 8) establishing that firms&amp;rsquo; real-side behavior responds to the shocks, motivating belief responses. Standard errors are White and clustered at the firm level throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the firm-expectations literature (Coibion, Gorodnichenko, Kumar 2018; Candia, Coibion, Gorodnichenko 2023) showing firms&amp;rsquo; expectations lie between professional forecasters&amp;rsquo; and households&amp;rsquo;—confirmed here by intermediate disagreement among firms. It connects to expectation-formation work (D&amp;rsquo;Acunto et al. 2021 on shopping experience; Coibion-Gorodnichenko 2015 on exchange-rate sensitivity in Ukraine; Kumar et al. 2015 on New Zealand managers) and to studies of news effects on expectations (Beechey, Johannsen, Levin 2011). It is closest in spirit to Lamla and Vinogradov (2019), who compare household expectations before/after monetary announcements; the contribution is to study firms in an economy with a recent history of high inflation and dollarization undergoing disinflation. It also relates to regime-change classics (Sargent 1982 on ending hyperinflations; Mankiw, Reis, Wolfers 2003 on Volcker disinflation), filling the gap that little is known about firms&amp;rsquo; expectations across a policy-regime change. Its Israeli monetary-surprise null in the stable period echoes Coibion et al. (2020) and Ilek (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Central implication: even after successful de-dollarization and a prolonged low-and-stable inflation environment, Israeli firms continued to monitor and react to inflation news—so de-dollarization (firms&amp;rsquo; renewed trust in local currency) does not necessarily translate into inattention, and (re)anchoring expectations in the sense of making them insensitive to news may take a long time. For countries currently experiencing high inflation, the Israeli experience suggests firm expectations can remain news-sensitive for an extended period. Scope conditions: the firm sample is not nationally representative; results are specific to Israel&amp;rsquo;s institutional setting (monthly CPI on the 15th, scheduled rate decisions); the monetary-policy result is fragile—it is driven mainly by the unusually large 2001–2002 shocks and disappears in calmer periods, so the conclusion that monetary surprises move firm expectations holds chiefly when shocks are large.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there other significant findings or caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Descriptive facts: firms&amp;rsquo; average annual inflation expectations (2001Q3–2018Q3) averaged 2.34% (vs. 1.81% for professional forecasters, 1.57% for the capital market); in the 2011Q1–2018Q3 panel households averaged 3.02% while firms averaged 1.83%, banks 1.07%. Firms&amp;rsquo; expectations are about one percentage point below households&amp;rsquo; but 0.5–1 pp above other (forecaster/market) sources, and disagreement among firms lies between that of households and professional forecasters—consistent with prior literature. Expectations co-move strongly across sources and across industries. Raw cross-period descriptive evidence (Table 5) shows average and median expectations decline as more information becomes available (Period 1 mean 2.52 → Period 3 mean 2.26), and disagreement weakly declines. The largest interest-rate surprises (1.5–2 pp) occurred at the sample start: in December 2001 the Bank cut the rate by 2 pp to 3.8%, triggering capital outflow, depreciation, and price increases, then reversed to 9.1%. A caveat is that the survey was discontinued at end-2020 (replaced by a CBS survey), and the unbalanced, voluntary panel limits representativeness.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Labor Market Discrimination and the Racial Unemployment Gap: Can Monetary Policy Make a Difference?</title><link>https://macropaperwarehouse.com/papers/labor-market-discrimination-and-the-racial-unemployment-gap-can-monetary-policy-make-a-difference/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/labor-market-discrimination-and-the-racial-unemployment-gap-can-monetary-policy-make-a-difference/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper addresses two connected questions: why do Black workers face persistently higher and more volatile unemployment than white workers, and can the Federal Reserve&amp;rsquo;s August 2020 shift from a symmetric &amp;ldquo;Deviations&amp;rdquo; rule to a &amp;ldquo;Shortfalls&amp;rdquo; rule narrow the resulting racial unemployment gap? The authors build a New Keynesian search and matching model with endogenous separations (Mortensen-Pissarides) and add employer taste-based discrimination, calibrated to U.S. Current Population Survey microdata from January 1976 to December 2019.&lt;/p&gt;
&lt;p&gt;The empirical motivation is stark. In CPS data, the Black unemployment rate averages 12.0 percent against 5.5 percent for whites — a gap of 6.5 percentage points that is largely unexplained by observable characteristics such as age, education, marital status, and state of residence (Cajner et al. 2017). The racial gap is also strongly countercyclical: its cyclical correlation with the aggregate unemployment rate is 0.77. A Shimer (2012)-style flow decomposition shows that the separation rate margin accounts for approximately two-thirds (67 percent) of the mean gap and 60 percent of its cyclical variance, with the job-finding rate contributing 20 percent of the mean and 27 percent of variance.&lt;/p&gt;
&lt;p&gt;The model features two types of representative households that differ only in a non-productive attribute (race). Firms incur a per-period perceived cost κ₁ of employing a type-1 (Black) worker, following Becker (1971). This cost is time-invariant and not directly affected by monetary policy. Search is random (firms cannot direct search by race, consistent with anti-discrimination law). The model also incorporates Calvo price rigidities and an effective lower bound (ELB) on the nominal interest rate, solved via Dynare&amp;rsquo;s extended path method. Two aggregate shocks drive dynamics: a risk-premium (demand) shock and a productivity (supply) shock. The discriminatory parameter is calibrated to κ₁ = 0.0292 — equivalent to 3.6 percent of the steady-state average wage — to match the 6.4 percentage-point mean racial unemployment gap.&lt;/p&gt;
&lt;p&gt;The baseline model (under the symmetric Deviations rule) generates four untargeted results that match the data: (1) higher mean separation rates and lower mean job-finding rates for Black workers, with the ratio of Black-to-white separation rates at 2.3 in the model (1.9 in data); (2) higher cyclical volatility of Black unemployment, driven by higher separation-rate volatility; (3) a strongly countercyclical racial gap (near-unit correlation with aggregate unemployment in the model); and (4) positively skewed unemployment distributions for both groups — skewness that arises endogenously from the ELB constraint, which is absent when the ELB is removed. The mechanism is geometric: because Black workers face a higher reservation productivity threshold (due to κ₁ &amp;gt; 0), more Black workers cluster near that threshold. A given aggregate shock therefore moves a larger mass of Black workers across the threshold, amplifying their unemployment response relative to whites.&lt;/p&gt;
&lt;p&gt;Novel model-based discrimination measures — workers not hired or fired solely due to being Black — average 5.86 percent of the Black labor force under the Deviations rule and are strongly countercyclical (correlation with aggregate unemployment = 0.99 in the model vs. 0.64 in EEOC race-charge data). The welfare gap between white and Black households averages 2.4 percent in consumption-equivalent terms.&lt;/p&gt;
&lt;p&gt;Shifting to the Shortfalls rule — which responds to unemployment shortfalls symmetrically but only tightens policy when unemployment is above its steady-state level — strengthens expansions by keeping interest rates lower. The aggregate unemployment rate falls by 0.7 percentage point, from 6.37 percent to 5.65 percent. Because Black workers are more cyclically sensitive, they benefit disproportionately: Black unemployment falls by 1.1 percentage points and white unemployment falls by 0.7 percentage points, narrowing the racial gap by 0.5 percentage point (from 6.50 to 6.03 percent). Model-based discrimination also declines (aggregate measure from 5.86 to 5.52 percent). The downside is a 0.5 percentage-point rise in average inflation, from 1.9 percent to 2.4 percent. The negative skewness in the racial unemployment rate gap is essentially eliminated under the Shortfalls rule, so the distribution shifts toward a lower mean with fewer episodes of extreme gaps.&lt;/p&gt;
&lt;p&gt;From a welfare perspective, however, the gains are quantitatively trivial. Both households experience slightly positive welfare gains under the Shortfalls rule — consumption rises by 0.62 percent for Black households and 0.64 percent for white households — but the differences are effectively indistinct from zero in consumption-equivalent terms. Crucially, the consumption-equivalent welfare wedge between the two groups actually widens slightly, because white wages rise more than Black wages under the Shortfalls rule (average productivity of Black employed workers falls more as the lower reservation threshold admits marginal workers). The authors note their welfare analysis is a lower bound, given within-group consumption insurance, the absence of liquidity constraints, and non-expiring unemployment benefits in the model.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses a structural calibration approach rather than quasi-experimental identification. The model is calibrated to match 10 aggregate moments (1976-2019 CPS data) with all parameters common across racial groups except κ₁. The racial unemployment gap in steady state is the sole targeted moment for racial differences; all other racial outcomes are untargeted predictions. Threats include: (1) the model attributes all cross-race labor market differences to discrimination, ruling out unobserved productivity heterogeneity; (2) the representative firm with taste-based discrimination abstracts from market-selection forces that, in Becker&amp;rsquo;s classic model, would erode discrimination in the long run (the authors cite Black 1995, Rosen 1997, Sasaki 1998 for equilibrium justifications); (3) the model is solved under perfect foresight (extended path), not fully stochastic, though Dynare&amp;rsquo;s method approximates stochastic dynamics; (4) the Shortfalls rule is a reduced-form approximation of the FOMC&amp;rsquo;s 2020 framework, not a structural representation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms through which discrimination generates the observed racial unemployment patterns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core mechanism is that κ₁ &amp;gt; 0 raises the reservation productivity threshold for Black workers at both hiring (firms require higher expected productivity to justify the cost) and separations (existing matches must clear a higher bar to survive). Because idiosyncratic productivity is log-normally distributed, more Black workers cluster near their higher reservation threshold than white workers do near the lower white threshold. This concentration in the density means that any aggregate shock — moving both thresholds — shifts a proportionally larger mass of Black workers across the destruction margin, amplifying the volatility of Black unemployment and separations. The countercyclical racial gap arises because aggregate downturns raise both reservation thresholds, but since more Black workers are near their threshold, more are destroyed. The authors show that the separation-rate margin dominates: in the model it explains 92 percent of the mean gap and 81 percent of its cyclical variance, somewhat overstating the empirical 67 percent and 60 percent, because variation in the job-finding rate comes mostly from the common job-meeting probability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the two types of discrimination in the model — hiring discrimination and separation discrimination — work quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The hiring discrimination measure Df_t counts the fraction of Black job-seekers who are not hired because their idiosyncratic productivity draw falls above the white reservation threshold but below the (higher) Black threshold. The separation discrimination measure Dλ_t counts the fraction of employed Black workers who are endogenously separated for the same reason. Under the Deviations rule with ELB, the hiring margin averages 0.64 percent and the separation margin averages 5.22 percent of the Black labor force, for a total Dt of 5.86 percent. Both measures are strongly countercyclical (correlations with aggregate unemployment of 0.80 and 0.95 respectively). Under the Shortfalls rule, these fall to 0.56 and 4.95 percent (total 5.52 percent), and their skewness toward high discrimination levels is significantly reduced.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the aggregate macroeconomic effects of switching from the Deviations rule to the Shortfalls rule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Shortfalls rule keeps nominal interest rates lower during periods of below-target unemployment (its asymmetry means it does not tighten in expansions unless inflation rises). This raises average output and consumption. The aggregate unemployment rate falls by 0.7 percentage point (from 6.37 to 5.65 percent), driven by both a lower average separation rate (3.36 to 3.10 percent) and a higher average job-finding rate (50.14 to 56.99 percent). Average inflation rises by 0.5 percentage point (from 1.88 to 2.40 percent annually). The Shortfalls rule increases the volatility of all labor market variables (it has lower stabilization properties) but essentially eliminates the positive skewness in the aggregate unemployment rate. The probability of a binding ELB falls from 10.6 percent to 8.5 percent under the Shortfalls rule. The correlation between inflation and unemployment strengthens from -0.32 to -0.51.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Shortfalls rule differentially affect Black and white workers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Black workers benefit disproportionately because their unemployment is more cyclically sensitive. The unemployment rate falls by 1.1 percentage points for Black workers (from 11.89 to 10.78 percent) versus 0.7 percentage points for white workers (from 5.39 to 4.74 percent). The racial gap narrows by 0.5 percentage point (from 6.50 to 6.03 percent). Separation rates fall more for Black workers (6.53 to 6.29 vs. 2.90 to 2.65 for whites). Average wages for Black workers increase by 0.43 percent and for white workers by 0.48 percent. The slight relative wage disadvantage under the Shortfalls rule arises because the lower reservation threshold for Black workers admits workers with lower average productivity, pulling down average Black wages relative to whites.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the welfare implications of the policy change, and why are they small?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both households gain welfare under the Shortfalls rule, but the gains are quantitatively very small in consumption-equivalent terms (effectively indistinct from zero). The aggregate benefit — lower average unemployment — is partially offset by the cost of higher average inflation (price dispersion loss in the Calvo framework). Consumption rises by about 0.62 percent for Black households and 0.64 percent for white households. The consumption-equivalent welfare wedge between Black and white households (2.4 percent under the Deviations rule) actually widens slightly under the Shortfalls rule, because white wages increase more than Black wages. The authors emphasize several reasons their welfare analysis understates true racial inequality: (1) within-group consumption insurance prevents individual unemployment spells from being welfare-costly; (2) no liquidity constraints; (3) unemployment benefits do not expire; (4) the model abstracts from labor force participation margins and involuntary part-time employment. These features, if relaxed, would likely reveal larger welfare differences between the two groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does the effective lower bound (ELB) on nominal interest rates play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The ELB is essential to generating positively skewed unemployment distributions in the model. Without the ELB, the model produces essentially symmetric (near-zero skewness) distributions for both aggregate and racial unemployment outcomes. With the ELB, the baseline model matches the observed positive skewness of the unemployment rate (1.25 aggregate; 1.23 for Black workers, 1.26 for whites). The ELB also raises the mean unemployment rate by about 0.25 percentage point and slightly amplifies labor market volatilities. It introduces a deflationary bias (inflation averages 1.88 percent vs. the 2.0 percent steady-state target). Critically, the main results — the 0.5 pp narrowing of the racial gap and 0.7 pp fall in aggregate unemployment under the Shortfalls rule — are robust to removing the ELB constraint (Appendix B.2.2), confirming they are not artifacts of the nonlinearity introduced by the ELB.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are conducted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Key robustness exercises include: (1) removing the ELB constraint, which confirms the main results hold (aggregate unemployment falls 0.7 pp, racial gap narrows 0.5 pp, inflation rises 0.5 pp without the ELB; Table A.8-A.9); (2) extending the unemployment flow decomposition to a three-state system (employed, unemployed, out of labor force), which confirms that the employment-to-unemployment (EU) transition is the primary driver of the racial gap even accounting for labor force participation transitions (Appendix A.2); (3) verifying that employer-to-employer transition rates are similar across racial groups (2.20 percent for Blacks vs. 1.96 percent for whites, 2004-2019), supporting the assumption of equal exogenous separation rates; (4) confirming that inflation experiences are similar between Black and white households using the Chicago Fed IBEX data (2.80 percent for Blacks vs. 2.87 percent for whites, 1983-2013), supporting the equal-inflation assumption; (5) presenting impulse response functions under both a productivity shock and a demand shock, in models with and without monetary policy inertia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper contributes to four literatures. First, versus Cajner et al. (2017) on empirical racial labor market gaps, it provides a structural explanation rather than documenting gaps. Second, versus search-and-matching discrimination models (Bartel 1995, Bowlus-Eckstein 2002, Rosen 2003, Flabbi 2010, Borowczyk-Martins et al. 2017), the key contributions are: (a) endogenous separations (prior models used exogenous exit), which the authors view as essential since separation rates dominate the gap&amp;rsquo;s dynamics; and (b) incorporating nominal rigidities and an ELB, enabling analysis of monetary policy. Third, versus Ravenna-Walsh (2012) and Bergman et al. (2022), who embed worker heterogeneity in New Keynesian search models, this paper differs by modelling heterogeneity as discrimination rather than productivity differences, and by studying the Deviations-to-Shortfalls rule change specifically. Fourth, versus Bundick-Petrosky-Nadeau (2021) who study the same Deviations/Shortfalls comparison for the aggregate economy, this paper adds the racial dimension. Versus Lee et al. (2022), Nakajima (2023), and Ait Lahcen et al. (2023) — all of which also study monetary policy and racial inequality — the contribution is generating racial disparities endogenously from discrimination rather than taking them as given, and including endogenous separations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find about the countercyclicality of racial discrimination?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both the model and the data exhibit strongly countercyclical discrimination. In the data, EEOC race-based discrimination charges (normalized per non-white labor force member) have a contemporaneous correlation of 0.65 with the cyclical component of the aggregate unemployment rate from 1997 to 2019. In the model, the aggregate discrimination measure Dt has a correlation of 0.99 with aggregate unemployment. The countercyclical pattern arises mechanically from the higher density of Black workers near the reservation productivity threshold: during recessions, both thresholds rise, destroying proportionally more Black matches and blocking more Black hires. The model-based discrimination measure also shows positive skewness (1.13 aggregate skewness under the Deviations rule with ELB), consistent with the asymmetric incidence of recessions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the quantitative scope conditions and limitations the authors themselves identify?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors identify several scope conditions and limitations: (1) the model abstracts from labor force participation, so it misses the racial gap in participation rates and involuntary part-time employment; (2) within-group consumption insurance and no liquidity constraints imply welfare estimates are a lower bound on true racial inequality — the consumption-equivalent wedge of 2.4 percent would be larger with incomplete insurance or borrowing constraints; (3) the welfare analysis assumes equal inflation rates across racial groups, which is empirically supported but abstracts from possible differences in consumption baskets; (4) the discriminatory parameter κ₁ is time-invariant and unresponsive to monetary policy, so all channels are indirect (through business cycle dynamics); (5) the model assumes a representative firm with taste-based discrimination, abstracting from firm heterogeneity in discrimination and from customer or statistical discrimination; (6) the Shortfalls rule is a reduced-form approximation of the FOMC&amp;rsquo;s 2020 framework and may not capture all aspects of the actual policy change.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shortfalls rule&lt;/strong&gt;: A Taylor-type monetary policy rule that responds symmetrically to inflation deviations from target but responds to unemployment deviations from steady state only when unemployment is above its steady-state level — not when it is below. This captures, in reduced form, the FOMC&amp;rsquo;s August 2020 revision from &amp;lsquo;deviations&amp;rsquo; to &amp;lsquo;shortfalls&amp;rsquo; of employment from maximum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deviations rule&lt;/strong&gt;: A symmetric Taylor-type interest rate rule that responds to deviations of both inflation and unemployment from their respective steady-state values, regardless of the direction of the unemployment deviation. The baseline monetary policy in the model before the 2020 FOMC framework change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taste-based discrimination (κ₁)&lt;/strong&gt;: A per-period perceived cost κ₁ borne by employers for each period they employ a Black worker, following Becker (1971). In this model, κ₁ = 0.0292 (≈3.6 percent of the steady-state wage), is time-invariant, and is not directly altered by monetary policy — only indirectly through business cycle conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reservation productivity threshold (zRi)&lt;/strong&gt;: The minimum idiosyncratic productivity level at which it is profitable for a firm to either hire or retain a worker of type i. Because of κ₁, the Black reservation threshold exceeds the white threshold, generating higher endogenous separation rates and lower job-finding rates for Black workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model-based discrimination measures (Df_t, Dλ_t)&lt;/strong&gt;: Novel measures of the fraction of the Black labor force that is not hired (Df_t, hiring margin) or is fired (Dλ_t, separation margin) solely due to discrimination — i.e., workers whose idiosyncratic productivity exceeds the white reservation threshold but falls below the Black threshold. These are expressed as fractions of the Black labor force and compared to EEOC race-based charge data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption-equivalent welfare wedge (Ψ_t)&lt;/strong&gt;: The percentage increase in per-period consumption that must be given to Black households every period to equalize their welfare with that of white households, given the same stochastic future. Under the Deviations rule, this averages 2.4 percent. The change under the Shortfalls rule is effectively zero in quantitative terms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous separation&lt;/strong&gt;: A separation that occurs because a matched worker-firm pair draws an idiosyncratic productivity below the reservation threshold — as distinct from exogenous separations (random layoffs unrelated to productivity). The dominance of the separation margin in explaining the racial unemployment gap motivates the use of endogenous separations as a key model ingredient; prior search-and-discrimination models assumed exogenous exit.&lt;/p&gt;</description></item><item><title>Leaning Against the Global Financial Cycle</title><link>https://macropaperwarehouse.com/papers/leaning-against-the-global-financial-cycle/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/leaning-against-the-global-financial-cycle/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper investigates how institutional quality shapes (i) the domestic financial and macroeconomic impact of Global Financial Cycle (GFC) shocks on emerging market economies (EMEs) and (ii) the menu of counter-cyclical policies those countries actually deploy — and how effectively — in response. The central motivation is that EMEs face a difficult policy trade-off when global financial conditions tighten: they must balance retaining international investor confidence against stabilizing domestic demand, and policymakers have four instruments available (monetary policy, foreign exchange reserve intervention, macro-prudential policy, and capital controls) whose effectiveness may depend critically on underlying institutional strength.&lt;/p&gt;
&lt;p&gt;The empirical analysis covers 22 EMEs (including Turkey, Brazil, Chile, Mexico, South Korea, India, Poland, and others) at monthly frequency from 1995 to 2021. The baseline measure of global financial conditions is the Excess Bond Premium (EBP) of Gilchrist and Zakrajsek (2012). Institutional quality is measured by the World Bank Worldwide Governance Indicators (WGI), with rule of law as the baseline indicator; the authors also check government effectiveness, corruption control, and regulatory quality. The empirical strategy is panel local projections with country fixed effects and Driscoll-Kraay standard errors, interacting the EBP shock with institutional indicators and policy changes to isolate heterogeneous responses. The identifying assumption is that the EBP responds contemporaneously to macroeconomic information while real outcomes respond only with a lag, consistent with ordering the EBP last in a recursive VAR.&lt;/p&gt;
&lt;p&gt;The main finding on outcomes is that a tightening of global financial conditions reduces equity prices, widens sovereign spreads, depreciates the exchange rate, and contracts GDP for the average EME — with the EBP coefficient on equity returns reaching -10.0 percentage points at one month and -14.5 percentage points at six months (both significant at 1%). For a country at the 10th percentile of the rule-of-law distribution (score -1.3), a one-standard-deviation EBP shock (0.63 rise) produces an equity price fall of roughly 8%, a sovereign spread widening of approximately 50 basis points, and a GDP contraction of about 0.8%. Moving from the 10th to the 90th percentile of rule of law (score 1.1) reduces the equity and GDP contractions by roughly half and the spread widening by approximately half. The rule-of-law interaction coefficient on equity at horizon t+1 is 2.08 (significant at 1%), and the GDP interaction coefficients are 0.23 (significant at 10%) and 0.24 (significant at 5%) at horizons of 12 and 18 months, respectively. Exchange rate depreciation is not significantly moderated by institutional quality.&lt;/p&gt;
&lt;p&gt;On policy responses, the key finding is asymmetric policy space: countries with weak institutions tighten interest rates in the face of a GFC shock — to stem capital outflows and contain spread widening — while countries with strong institutions are able to lower rates. The EBP-times-rule-of-law interaction coefficient on interest rates at six months is -0.27 (significant at 5%), indicating that higher institutional quality is associated with lower interest rates after a shock. Simultaneously, weak-institution countries shed reserves significantly, whereas high-institution countries experience changes in reserves not significantly different from zero (or even modest accumulation), with the EBP-times-rule-of-law interaction on reserves at six months equal to 0.38 (significant at 10%). Capital controls show no systematic counter-cyclical use; macro-prudential policies show only a weak and transient response at short horizons. Both instruments appear deployed primarily as ex ante defenses during inflow episodes rather than ex post stabilization tools.&lt;/p&gt;
&lt;p&gt;A notable exception is the Covid-19 episode (January–August 2020). During this period, the institutional-quality interaction terms are statistically insignificant for both financial outcomes and policy reactions: all EMEs cut rates sharply (coefficient -0.34 at one month, significant at 1%) and shed reserves uniformly, with no significant differentiation by rule of law. The authors attribute this to the global, coordinated response of major central banks, which compressed the shock duration and may have overridden normal country-level differentiation.&lt;/p&gt;
&lt;p&gt;To interpret the empirical results, the authors develop a two-period small open economy model with a collateral constraint on foreign borrowing (adapted from Mendoza 2002). The key mechanism is that a higher share of foreign-currency debt (parameter η) tightens the collateral constraint in a crisis via the real exchange rate depreciation channel. Institutional reforms that allow more domestic-currency borrowing (lower η) act as an ex ante structural policy. Foreign exchange market intervention that appreciates the currency in a crisis acts as an ex post cyclical policy. The model shows these two instruments are largely substitutes: countries that have invested in institutions (lower η) benefit less from FX intervention (the intervention is more effective the higher η is), and conversely, countries for which FX intervention is highly effective face a weaker incentive to undertake costly institutional reforms ex ante.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses panel local projections (Jorda 2005) with country fixed effects, interacting the contemporaneous EBP with lagged institutional indicators and contemporaneous policy changes. The EBP is ordered last in the sense that the identifying assumption is that macroeconomic variables respond to financial shocks with a lag while the EBP can react contemporaneously to macro news — this is the same assumption used in Ben Zeev (2019) and Bhattarai, Chatterjee, and Park (2020). The authors include an extensive set of controls in the M matrix: lags of EBP, EBP interacted with rule of law, contemporaneous and lagged domestic inflation and output, contemporaneous and lagged global industrial production and oil prices, and contemporaneous and lagged U.S. inflation and GDP growth. The main endogeneity threat on the policy side is that counter-cyclical policies respond endogenously to the same shock driving outcomes; the authors address this by interacting the shock with a large set of country characteristics to &amp;lsquo;soak up&amp;rsquo; cross-sectional heterogeneity in policy reaction functions and make policy changes &amp;lsquo;as good as random.&amp;rsquo; They acknowledge but do not fully resolve this concern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is institutional quality measured and does the choice of indicator matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline measure is the World Bank Worldwide Governance Indicators (WGI) rule of law score, which captures &amp;lsquo;perceptions of the extent to which agents have confidence in and abide by the rules of society&amp;rsquo; including contract enforcement, property rights, policing, and the courts. The five WGI dimensions (rule of law, government effectiveness, corruption control, regulatory quality, and political stability) are highly correlated, so results reported in Table A1 using government effectiveness, corruption control, and regulatory quality are very similar to the baseline. The authors also test whether central bank independence (Garriga 2016) or central bank transparency (Dincer and Eichengreen 2014) matter instead — neither produces interaction coefficients significantly different from zero, indicating that CB governance is only one element of broader institutional quality and insufficient by itself to insulate EMEs from global shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What distinguishes the paper&amp;rsquo;s contribution from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is most closely related to Batini and Durand (2021), who find that capital controls and macro-prudential policies reduce the correlation between capital inflows to EMEs and the global capital flows cycle, but only during large inflow episodes. The current paper extends this by introducing institutional quality as a moderating variable across the full menu of four counter-cyclical instruments and showing that the effectiveness and actual use of each instrument depends on a country&amp;rsquo;s institutional strength. It also differs from Kalemli-Ozcan (2019), whose theoretical conjecture that low credibility leads to self-defeating macroeconomic policies the authors test and confirm empirically across the full EME panel. The paper additionally contributes a structural model that formally links the ex ante vs. ex post policy substitutability to currency composition of debt and collateral constraints, connecting empirical findings to welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in EME responses is documented beyond the mean effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The primary dimension of heterogeneity is rule of law. At the 10th percentile (score -1.3), a one-SD EBP shock causes an equity fall of ~8%, spread widening of ~50 bps, and GDP contraction of ~0.8%; at the 90th percentile (score 1.1), these effects are approximately halved. The exchange rate response is not significantly differentiated by institutional quality. The policy heterogeneity is also sharp: weak-institution countries tighten rates and deplete reserves, while strong-institution countries lower rates without suffering additional depreciation or reserve outflows. The paper also documents some heterogeneity related to per capita income (Table A2), finding that both per capita income and institutional quality independently predict milder financial tightening, with richer EMEs also experiencing less exchange rate depreciation (possibly reflecting greater fear of floating in less-advanced EMEs). However, per capita income does not displace the institutional quality finding — both coefficients remain significant when included jointly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors conduct four sets of robustness exercises. First, they replace the EBP with the VIX (Table A3) and find broadly consistent results: countries with better rule of law suffer milder GDP contractions and smaller spread widening when the VIX spikes. Second, they replace the continuous EBP shock with a dummy for selected episodes of extreme financial stress (Table A4), finding positive and significant interaction coefficients for equity and GDP (milder contraction) and negative for spreads (milder widening). Third, they add per capita income and its interaction with the EBP (Table A2), confirming that institutional quality retains significance after controlling for income. Fourth, they replace the rule of law with the four other WGI dimensions (Table A1), obtaining virtually identical results. They also show that capital controls and macro-prudential policies display little counter-cyclical activation regardless of specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the mechanism through which institutions moderate GFC transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Stronger institutions raise international investor confidence in a country&amp;rsquo;s credibility and willingness to enforce contracts and property rights. When a GFC tightening hits, investors discriminate less against high-institution EMEs, resulting in smaller capital outflows and less exchange rate pressure. This grants high-institution central banks the policy space to cut rates rather than raise them, which further stabilizes financial conditions without triggering additional capital flight. In the model, strong institutions reduce the share of debt denominated in foreign currency (lower η), which directly relaxes the collateral constraint in a crisis because the collateral value is denominated in domestic currency — less external debt means less amplification of the depreciation-collateral-borrowing spiral. This is the key pecuniary externality in the Mendoza (2002) framework that the model formalizes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do ex ante and ex post policies interact, and what are the policy implications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The theoretical model shows that structural reforms (reducing foreign-currency debt share, i.e., lowering η) and FX intervention are largely substitutes. Specifically, the welfare gain from FX intervention is larger the higher η is — meaning that FX intervention is most valuable to countries that have not undertaken institutional reforms. Countries that have invested in strong institutions need to use FX reserves less in a crisis, consistent with the empirical finding that high-rule-of-law countries experience smaller reserve depletion after a GFC shock. This creates a moral-hazard-style dilemma: if FX intervention is highly effective (because η is large), the marginal incentive to invest in costly institutional reform is reduced. The normative implication is that institutional development and counter-cyclical policies should be seen as a portfolio — countries cannot rely indefinitely on FX intervention as a substitute for governance reform if the goal is to reduce structural vulnerability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are macro-prudential policies and capital controls not found to be counter-cyclical tools?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two explanations are offered. First, macro-prudential tools require a build-up phase in which standards are tightened during good times so they can be loosened in bad times; many EMEs only began adopting these tools systematically after the 2008 Global Financial Crisis, as shown by the progressive tightening in the iMaPP aggregate index after 2008. Second, capital controls on outflows are strategically avoided in periods of stress because imposing them signals investor-hostile policy intentions precisely when foreign capital is most needed, exacerbating the perception of vulnerability (Rebucci and Ma 2019). Capital controls on inflows are used as ex ante instruments during inflow episodes (Ben Zeev 2017; Das, Gopinath, and Kalemli-Ozcan 2021), but this is an ex ante rather than ex post counter-cyclical use.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Covid-19 episode differ and what explains the deviation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;During January-August 2020, the standard pattern breaks down. All 22 EMEs cut interest rates sharply (coefficient -0.34, significant at 1%) and shed reserves (coefficient -0.45, significant at 1%) regardless of institutional quality; the EBP-times-rule-of-law interaction terms for both financial outcomes (equity coefficient 1.42, insignificant; spread coefficient 1.16, insignificant) and policy responses (rate interaction 0.053, insignificant; reserve interaction -0.16, insignificant) are not statistically different from zero. The authors attribute this to the unusually swift and coordinated global monetary policy response — led by the U.S. Fed and other major central banks — which made the shock short-lived and may have extended implicit backstops to all EMEs regardless of institutional quality. The Covid episode may also be better explained by idiosyncratic factors such as fiscal space, pandemic containment policies, and integration in global value chains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the two-period model&amp;rsquo;s structure and what does it deliver?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is a deterministic two-period small open economy endowment model with home bias in consumption (import share λ = 0.4), a binding collateral constraint in the crisis state, and debt split between domestic- and foreign-currency denomination (ratio η). The collateral constraint is (1+η)b ≤ ω·pH1·y1, so a higher η — more foreign currency debt — tightens the constraint via the exchange rate in a crisis because real exchange rate depreciation reduces domestic endowment value in foreign terms. The government can (ex ante) conduct structural reforms that lower η at a cost, or (ex post) intervene in the FX market to appreciate the currency, which relaxes the constraint. Calibrated with β = 0.96 (4% annual real rate), ω = 0.3 (maximum debt 30% of output), and normalized output and initial debt to 1, the model shows (i) higher η produces larger utility losses in the crisis state, and (ii) FX intervention reduces those losses, but more so the higher η — confirming the substitutability and the declining returns to FX intervention as institutions improve. The model does not endogenize the choice of η nor derive an optimal policy mix given costs, which the authors acknowledge as a limitation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Global Financial Cycle (GFC)&lt;/strong&gt;: The paper-specific sense follows Rey (2013) and Miranda-Agrippino and Rey (2021): the co-movement of risky asset prices across global markets driven primarily by U.S. financial conditions and global risk appetite, operationalized empirically as shocks to the Excess Bond Premium. For EMEs, the GFC represents an exogenous source of financial tightening or loosening that transmits through capital flows, exchange rates, and credit conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Excess Bond Premium (EBP)&lt;/strong&gt;: The Gilchrist and Zakrajsek (2012) measure of the component of U.S. corporate bond spreads that is not explained by observable firm-level default risk — interpreted as the compensation demanded by investors for bearing corporate credit risk above and beyond expected losses. Used in this paper as the baseline proxy for global financial conditions because its effects on EMEs are well-established and it is more specific than the VIX.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional strength / rule of law&lt;/strong&gt;: Operationalized via the World Bank Worldwide Governance Indicators. In this paper&amp;rsquo;s framework, institutional strength captures the degree to which international investors trust a country&amp;rsquo;s contract enforcement, property rights, and policy credibility. This trust is the mechanism by which high-institution EMEs face lower capital sensitivity to GFC shocks and retain monetary policy space.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex ante vs. ex post policy&lt;/strong&gt;: The paper distinguishes structural reforms (ex ante) that reduce an economy&amp;rsquo;s vulnerability to GFC shocks before they occur — by, for example, improving institutions so that debt can be issued in domestic currency — from cyclical stabilization measures (ex post) deployed after a shock arrives, such as FX reserve sales to support the exchange rate. These two classes of policy are shown to be largely substitutes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral constraint (model)&lt;/strong&gt;: In the paper&amp;rsquo;s theoretical framework (following Mendoza 2002), total borrowing is limited to a fraction ω of the domestic endowment value. When denominated in foreign currency, a real exchange rate depreciation tightens the constraint endogenously — the model&amp;rsquo;s central amplification mechanism — creating a pecuniary externality that structural policy (reducing η) or FX intervention (limiting depreciation) can partially offset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Foreign-currency debt share (η)&lt;/strong&gt;: The ratio of foreign-currency to domestic-currency denominated debt in the model. A higher η amplifies the collateral constraint tightening during a GFC shock because a given exchange rate depreciation reduces the domestic-currency value of the collateral more. Lower η — achievable through institutional reform — is the model&amp;rsquo;s representation of reduced GFC vulnerability. FX intervention is more effective (has larger welfare gains) when η is high.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy space&lt;/strong&gt;: Used in this paper to mean the ability of a central bank to cut the short-term interest rate in response to a negative GFC shock without triggering capital outflows and further depreciation. Strong institutions expand policy space because international investors maintain confidence in the country&amp;rsquo;s credibility and do not flee in response to lower yields. Weak-institution countries lack policy space and are forced to raise rates in a crisis, tightening domestic conditions further.&lt;/p&gt;</description></item><item><title>Macroeconomic Effects of 'Free' Secondary Schooling in the Developing World</title><link>https://macropaperwarehouse.com/papers/macroeconomic-effects-of-free-secondary-schooling-in-the-developing-world/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/macroeconomic-effects-of-free-secondary-schooling-in-the-developing-world/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether publicly funded (&amp;ldquo;free&amp;rdquo;) secondary schooling in developing countries raises GDP per capita. The question is policy-relevant because many low-income countries — including Ghana, Kenya, Tanzania, Uganda, and others listed in the paper&amp;rsquo;s appendix — have recently adopted or are considering such policies, motivated by the combination of low secondary enrollment (roughly one-third of secondary-school-age children enrolled in the poorest countries, versus near-universal enrollment in rich countries) and evidence that credit constraints keep talented students out of school.&lt;/p&gt;
&lt;p&gt;The analysis is built around an overlapping-generations (OLG) model with heterogeneous households and credit constraints, estimated to match experimental evidence from a randomized controlled trial (RCT) in Ghana (Duflo, Dupas, and Kremer, 2021). The RCT randomly offered full four-year scholarships covering 100 percent of tuition and fees to approximately two thousand poor but high-ability students who had passed the Basic Education Certificate Examination (BECE) but had not enrolled in Senior High School (SHS). Scholarship winners were 27 percentage points more likely to complete secondary school than the control group, scored 0.16 standard deviations (equivalent to 7.6 percent wage gains in the model) higher on math and literacy tests, and experienced a 10.6 percent decline in fertility after 12 years.&lt;/p&gt;
&lt;p&gt;The model departs from standard human capital OLG models in three ways. First, it incorporates an explicit opportunity cost of schooling: teenagers who attend SHS forgo labor income during ages 15–19, which is economically significant given that secondary-school-age individuals are near their prime working years in developing countries. Second, the model includes a merit-based entrance exam (the BECE), so that removing the exam requirement as part of free schooling causes negative selection — the new marginal students induced to attend have lower average ability than those already attending. Third, the model features education-dependent fertility: more-educated households have fewer children (estimated fertility of 2.07 per less-educated family vs 1.19 per more-educated family, in line with Ghanaian Demographic and Health Survey data). The model also incorporates imperfect substitutability between skilled and unskilled labor (elasticity of substitution set to 4, following long-run cross-country estimates), savings wedges that match low liquid asset holdings, and Ghana&amp;rsquo;s actual progressive income tax schedule.&lt;/p&gt;
&lt;p&gt;The model is estimated using the Simulated Method of Moments (SMM) targeting ten moments — five non-experimental (aggregate population growth rate of 2.2 percent per year, aggregate SHS completion rate, SHS completion in the top and bottom test-score quartiles of the control group, and variance of the permanent component of log wages) and five experimental or quasi-experimental (RCT treatment effects on human capital, fertility, overall SHS completion, the Q4 vs Q1 difference in SHS completion, and the intergenerational schooling correlation from administrative data).&lt;/p&gt;
&lt;p&gt;The central quantitative finding is that nationwide free secondary schooling — eliminating both fees and the entrance-exam requirement — raises secondary school completion by about 12 percentage points (from 30 percent to 42 percent of the population) but reduces GDP per capita by approximately 1 percent in the long run. The 95 percent confidence interval for the GDP effect excludes any positive value (lower bound -4.2 percent, upper bound -0.7 percent), so the model can statistically reject any positive GDP impact. The direct fiscal cost of the policy is 1.4 percent of GDP, implying a total cost (direct cost plus lost GDP) of approximately 2.4 percent of GDP. Taxes per capita increase by 1.4 percent. Adult earnings rise by about 1.2 percent, but this is more than offset by a 7.5 percent decline in child earnings (the opportunity cost of schooling for newly enrolled students). The skilled-to-unskilled wage ratio falls by about 10 percent, reflecting general-equilibrium wage compression from the expanded supply of secondary graduates.&lt;/p&gt;
&lt;p&gt;Three counterfactual experiments decompose the negative GDP result. (i) Eliminating the opportunity cost of schooling reverses the GDP effect from -1.0 percent to +2.9 percent, a swing of nearly 4 percentage points — the dominant channel. (ii) Holding the ability distribution of new secondary attendees to match the experimental sample (removing negative selection) moves GDP from -1.0 percent to essentially 0, accounting for about 1 percentage point of the gap. (iii) Holding fertility constant for new secondary attendees moves GDP from -1.0 percent to +1.2 percent, contributing about 2.2 percentage points. When all three channels are shut down simultaneously, GDP rises by 6.9 percent — close to the naive back-of-the-envelope projection of 6 percent based on the RCT&amp;rsquo;s test-score estimates.&lt;/p&gt;
&lt;p&gt;As a policy comparison, an economy-wide improvement in schooling quality that raises test scores by 0.1 standard deviations (a conservative estimate consistent with randomized teacher-incentive interventions in India and Kenya) raises GDP per capita by 2.7 percent and increases SHS completion by 13.8 percentage points — more than free schooling and at lower fiscal cost (the policy pays for itself in equilibrium). Improving schooling quality avoids the negative selection and opportunity-cost channels because it raises human capital for both new and inframarginal students.&lt;/p&gt;
&lt;p&gt;On welfare and distribution, the policy is predominantly redistributive. The bottom 25 percent of parents gain welfare equivalent to a 7.3 percent increase in lifetime consumption, while the top 25 percent lose 4.2 percent. For children, the bottom 25 percent gain 23 percent in consumption-equivalent welfare, while the top 75 percent lose about 5.3 percent. These distributional predictions are validated against a new nationally representative survey of 3,500 Ghanaian households (conducted by the authors in August–September 2022): households with at most a JHS education were 3.1 percentage points more likely to support the policy than average, while those with SHS education or more were 5.2 percentage points less likely — remarkably close to the model&amp;rsquo;s predicted values of 2.6 and 5.9 percentage points, respectively. The authors conclude that free secondary schooling in developing countries is primarily a redistributive policy and not an efficient path to economic growth at current levels of schooling quality.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses a two-step strategy. First, it estimates the OLG model using SMM, with the experimental moments from Duflo, Dupas, and Kremer&amp;rsquo;s (2021) RCT serving as the key identifying variation. The RCT randomly assigned scholarships to poor but high-ability students in Ghana who had passed the BECE but had not enrolled in SHS, making the treatment effect on schooling completion, test scores, and fertility credibly causal in partial equilibrium. Second, the estimated model is used to compute general-equilibrium counterfactuals for a nationwide policy. The main threats to validity are: (a) external validity of the RCT sample to the general population — the sample is explicitly &amp;lsquo;smart kids from poor families,&amp;rsquo; which the authors account for through the negative-selection counterfactual; (b) the model misses on the intergenerational schooling correlation (model: 0.32 vs data: 0.45) and on the treatment effect on SHS completion (model: 21.3 pp vs data: 27 pp), though the authors show in Appendix C that forcing the model to match these moments does not reverse the negative GDP conclusion (a 40 percent higher schooling cost parameter yields a -0.8 percent GDP result vs -1.0 percent baseline; a 15 percent higher ability-persistence parameter yields -2.0 percent); (c) abstracting from human capital externalities (Lucas 1988 type spillovers) and crime reduction effects of education — the authors note these omissions but argue the low estimated effects of the policy make them unlikely to matter quantitatively; and (d) partial equilibrium of the RCT itself — the authors assume no general-equilibrium effects of the experiment since it covered only 2,064 students.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The three channels are (i) opportunity cost — attendees ages 15–19 forgo labor income; (ii) negative selection — removing the BECE requirement means new marginal students have lower average ability than current attendees; (iii) differential fertility — newly educated households reduce fertility, shifting the long-run population distribution toward less-educated (higher-fertility) households, diluting the share of educated workers over time. The paper isolates each channel through sequential counterfactual experiments: (i) is isolated by eliminating the option for ages-15–19 children to work (forcing the choice between schooling and idleness), which raises the GDP effect from -1.0 to +2.9 percent; (ii) is isolated by artificially boosting the ability of new secondary attendees to match the experimental sample&amp;rsquo;s ability distribution, which moves GDP from -1.0 to approximately 0; (iii) is isolated by setting new attendees&amp;rsquo; fertility to the uneducated-household level, which moves GDP from -1.0 to +1.2 percent. The magnitudes reveal that the opportunity cost channel is the largest (approximately 4 pp swing), followed by the fertility channel (approximately 2.2 pp), and then the selection channel (approximately 1 pp).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several dimensions of heterogeneity are documented. In the experimental sample, the treatment effect on SHS completion is not particularly skewed toward high-ability students: the difference in treatment effects between the top and bottom test-score quartiles is only 4 percentage points in the data (and 3 in the model), implying broadly similar gains across the ability distribution within the selected sample. In the estimated model&amp;rsquo;s misallocation analysis, the attendance probability plot (Figure 3) shows that the highest-ability children are fairly likely to attend SHS even when born to low-ability parents — suggesting relatively low misallocation in the estimated model compared to the stylized high-misallocation case. On welfare, the paper documents large heterogeneity by income quartile: the bottom 25 percent of parents gain 7.3 percent in consumption-equivalent welfare while the top 25 percent lose 4.2 percent; for children the bottom 25 percent gain 23 percent while the top 75 percent lose about 5.3 percent. Welfare also differs across generations: gains for grandchildren who always exist are smaller (9 percent) than for children (12 percent), reflecting the compounding fertility effect. The survey confirms these patterns across urban/rural, male/female, and across the Volta (42.3 percent average support for free SHS) and Ashanti (78.2 percent average support) regions of Ghana.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors report three robustness checks in Appendix C. First, they increase the schooling cost parameter ΨS by 40 percent to force the model to match the (currently undershot) treatment effect on SHS completion; the free schooling policy then produces a -0.8 percent GDP result (vs -1.0 percent baseline) and a 14 percent increase in attendance (vs 12 percent baseline) — the conclusion is unchanged. Second, they increase the ability-persistence parameter ρ by 15 percent to match the intergenerational schooling correlation; the result is a -2.0 percent GDP decline and a 4 percent attendance increase — the GDP decline is larger, so if anything the baseline is too generous to free schooling. Third, they experiment with lower values of the elasticity of substitution between skilled and unskilled labor (down to 1.4 from the baseline value of 4) and report no substantive change in conclusions. The authors also use bootstrapped 95 percent confidence intervals for all aggregate predictions, which is unusual in general-equilibrium counterfactual exercises in macroeconomics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is most closely related to Abbott, Gallipoli, Meghir, and Violante (2019) and Daruich (2020), both of which study public education expansions in the United States and find largely positive effects on GDP and welfare. The authors argue the contrast with their pessimistic findings reflects lower school quality in developing countries — in a rich-country setting, opportunity costs are lower relative to the returns to schooling. Hendricks and Schoellman (2014) find similar negative selection of college students in the US as enrollment expands, lending support to the selection channel. Khanna (2023) documents substantial declines in the relative wages of skilled workers after an education expansion in India, consistent with the model&amp;rsquo;s 10 percent skilled-to-unskilled wage compression, though Khanna&amp;rsquo;s short-run effects are larger due to lower short-run elasticity of substitution. In terms of methodology, the paper follows Daruich (2020) in using RCT evidence to discipline an OLG model, and is the first paper to do so for the macroeconomic effects of education policy in the developing world. The paper also builds on the macro-development literature emphasizing school quality (Hanushek and Woessmann, 2007; Schoellman, 2012) over average years of schooling as the proximate cause of low human capital in poor countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central policy implication is that free secondary schooling in developing countries, at current low levels of schooling quality, is primarily redistributive rather than growth-enhancing. Countries considering free schooling should expect secondary enrollment to rise substantially (by around 12 percentage points in the baseline) but GDP per capita to fall or stay flat. The alternative of improving schooling quality — modeled as a 0.1 standard deviation increase in test scores, using teacher incentives or additional teachers at a cost of approximately US$5.78 per student per year (based on Mbiti et al. 2019 in Tanzania) — raises GDP by 2.7 percent and schooling enrollment by even more (13.8 percentage points), while paying for itself in equilibrium. A key scope condition: the negative GDP finding is driven by the combination of high opportunity costs of schooling (secondary-school-age workers have economically significant labor income in developing countries), negative selection from removing merit requirements, and low schooling quality that limits the human capital return per year of schooling. In rich countries where these conditions do not hold, the same policy has been found to be beneficial. The paper also shows (Table 6) that maintaining the entrance-exam requirement alongside free schooling substantially mitigates the GDP decline (-0.3 percent vs -1.0 percent), and that keeping both the test and a positive fee results in approximately zero GDP change — suggesting that the test-requirement component of the policy design is important.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find about misallocation in the estimated model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The estimated model exhibits relatively low misallocation. The misallocation concept refers to situations where high-ability children of poor parents are kept out of secondary school by borrowing constraints even though the net-present-value of additional schooling exceeds the cost. The paper shows (Figure 2) that economies can have similar aggregate secondary enrollment rates of around 30 percent but very different degrees of misallocation — one where enrollment is low because returns are low (low-misallocation case), and one where enrollment is low because high-ability children are credit-constrained (high-misallocation case). The estimated model falls closer to the low-misallocation case (Figure 3), with the highest-ability children fairly likely to attend SHS even if born to low-ability parents. This finding is consistent with the modest increase in SHS completion induced by free schooling (12 percentage points) relative to the experimental treatment effect on the selected sample (27 percentage points): most high-ability children are already attending, so there is limited room for a free schooling policy to reduce misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the welfare analysis reveal about the puzzle of large welfare gains alongside a GDP decline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper documents an apparent puzzle: the free schooling policy reduces long-run GDP per capita by 1 percent but produces large positive welfare gains for parents (average 3.9 percent in consumption-equivalent welfare) and even larger gains for children (average 12.4 percent). The resolution is that (a) welfare gains for parents come entirely from redistribution — the very poor gain 7.3 percent while the rich lose 4.2 percent, and the progressive tax schedule is the mechanism; (b) the welfare gains for the children&amp;rsquo;s generation partially reflect large gains to the small number of previously misallocated children who now attend secondary school (the bottom 25 percent of children gain 23 percent, primarily through income gains for those who previously could not afford school); and (c) these gains erode across generations — grandchildren who always exist gain less (9 percent vs 12 percent for children), because the grandchildren who would only have existed without the free schooling policy (i.e., the &amp;lsquo;unborn&amp;rsquo; due to reduced fertility among educated households) would have experienced disproportionately large gains (almost 17 percent). The composition of the population thus shifts toward those experiencing smaller gains, compounding over generations and producing the long-run GDP decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the entrance exam design in free schooling policy outcomes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper shows that how access is structured matters as much as whether schooling is free. In the main analysis, free schooling eliminates both fees and the BECE entrance requirement, consistent with Ghana&amp;rsquo;s 2017 policy. In alternative simulations (Table 6), free schooling that maintains the existing entrance requirement (a &amp;lsquo;relaxed test&amp;rsquo; policy) produces a GDP decline of only -0.3 percent instead of -1.0 percent. Free schooling that keeps the test at full stringency (so fewer new students gain access) produces essentially no change in GDP (-0.0 percent), but also a much smaller increase in secondary attendance (3.0 pp vs 11.8 pp). Eliminating only the test requirement while keeping a positive fee produces a -0.4 percent GDP decline. These results confirm that the negative selection channel is a quantitatively important driver of the adverse GDP effect and is specifically activated by the removal of the merit requirement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the model estimated and what moments does each parameter primarily identify?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is estimated by SMM minimizing the sum of squared differences between model moments and their data counterparts, using a vector of 10 parameters (fertility parameters νJ and νS; schooling efficiency ηS; goods cost of schooling ΨS; intergenerational altruism b; exam score noise σε; Gumbel taste-shock scale θ; savings wedge χ; ability persistence ρ; ability shock standard deviation συ). Six parameters are chosen directly from the literature or normalization (A, α, β, r*, λ, σζ). Ten moments are targeted: population growth rate (primarily identifies νJ, νS), aggregate SHS completion rate and quartile completion rates (identify ηS, b, ΨS, χ), variance of the permanent component of wages (identifies συ, ρ), and five experimental moments from the Duflo et al. RCT (treatment effects on human capital, fertility, SHS completion, the Q4–Q1 completion difference, and the intergenerational schooling correlation). Confidence intervals are bootstrapped by re-sampling the five experimental moments 100 times, treating the non-experimental moments as fixed. The Jacobian matrix (Appendix Table C.1) and sensitivity matrix (Appendix Table C.2) are computed following Kaboski and Townsend (2011) and Andrews, Gentzkow, and Shapiro (2017) to document identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the survey design details and how well does it validate the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors conducted a new nationally representative household survey in Ghana in August–September 2022, covering 3,500 households selected via two-stage cluster sampling from seven regions accounting for about 61 percent of the Ghanaian population. Respondents were asked whether eight categories of government expenditure should be abolished, cut substantially, cut somewhat, maintained, or expanded. For free SHS, respondents with at most a JHS education were 3.1 percentage points more likely to support the policy than average; those with SHS education or more were 5.2 percentage points less likely. These empirical patterns align closely with the model&amp;rsquo;s predicted values of 2.6 and 5.9 percentage points respectively. The pattern is robust across urban/rural subsamples, male/female subsamples, and across the Volta and Ashanti regions (which differ substantially in overall support levels — 42.3 percent vs 78.2 percent — but maintain the same qualitative pattern of lower-educated households being more supportive). The one discrepancy is that the model over-predicts the support of JHS-educated households who have children enrolled in SHS.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Opportunity cost of schooling&lt;/strong&gt;: In this paper&amp;rsquo;s model, the foregone labor income of teenagers aged 15–19 who attend secondary school rather than work. This cost persists even when the school fee is eliminated by government policy and is identified as the single largest channel explaining why free secondary schooling reduces rather than raises GDP per capita in developing countries, contributing approximately 4 percentage points to the adverse GDP effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative selection of new students&lt;/strong&gt;: The reduction in average ability of the marginal students who enter secondary school once both fees and the merit-based entrance exam are eliminated. The existing pool of secondary attendees was positively selected by the entrance exam, so broadening access induces a lower-ability pool of new entrants, reducing the average human capital gain per new graduate. The paper estimates this channel accounts for approximately 1 percentage point of the adverse GDP gap relative to the back-of-the-envelope projection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Differential fertility by education&lt;/strong&gt;: The model feature by which secondary-educated households have significantly fewer children (parameter νS = 0.19 implying 2.4 children per family) than non-secondary-educated households (νJ = 1.07 implying 4.1 children per family). When free schooling induces more households to obtain secondary education, aggregate fertility falls, and crucially the share of high-ability households in the long-run population declines because those households now have fewer children, reducing the long-run supply of educated workers and contributing approximately 2.2 percentage points to the adverse GDP gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation of talent&lt;/strong&gt;: In this paper&amp;rsquo;s sense: the situation in which high-ability children of poor parents are prevented by borrowing constraints from attending secondary school even though the net-present-value of additional schooling exceeds the combined goods and opportunity costs. The paper finds that the estimated model of Ghana corresponds more closely to a low-misallocation economy (Figure 3), meaning the highest-ability children attend SHS at fairly high rates regardless of parental income, so the scope for free schooling to reduce misallocation is limited.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Balanced growth path&lt;/strong&gt;: In this paper: a recursive competitive equilibrium in which aggregate population grows at a constant rate while the relative distribution of households across individual states (ability, education, assets) is stationary, and household policy functions are independent of the aggregate population level. All policy counterfactuals are conducted by introducing a policy into the balanced growth path and computing transition dynamics to the new balanced growth path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Schooling quality (ηS)&lt;/strong&gt;: The efficiency parameter governing how much human capital a student of given ability acquires from a year of secondary schooling, defined in the production function h(z,S) = z · ηS. In the estimated model, ηS = 5.66, implying an annual return to education of 7.9 percent for the experimental sample. The paper shows that a policy raising ηS (schooling quality) by enough to increase average test scores by 0.1 standard deviations raises GDP by 2.7 percent and expands SHS enrollment by 13.8 percentage points, outperforming free schooling on both counts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Savings wedge (χ)&lt;/strong&gt;: A wedge between the international market rate of return on capital (r*) and the return available to households in the model (r = r* - χ), calibrated to match the low savings rates observed in low-income economies. In the estimated model χ = 0.09, implying households earn approximately 2 percent per year on savings. Together with the borrowing constraint (no borrowing against children&amp;rsquo;s future income), this ensures that poor parents cannot save their way out of the constraint preventing them from sending high-ability children to school.&lt;/p&gt;</description></item><item><title>Market Opacity and Fragility: Why Liquidity Evaporates When It Is Most Needed</title><link>https://macropaperwarehouse.com/papers/market-opacity-and-fragility-why-liquidity-evaporates-when-it-is-most-needed/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/market-opacity-and-fragility-why-liquidity-evaporates-when-it-is-most-needed/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The paper asks why market liquidity sometimes behaves in a stabilizing way (an illiquidity hike curbs liquidity demand and attracts liquidity supply) but on other occasions &amp;ldquo;evaporates when it is most needed,&amp;rdquo; degenerating into a disorderly run for the exit and a flash crash, often with no fundamentals news. Motivated by flash events (the May 6, 2010 US flash crash where the Dow Jones fell about 9% intraday; the October 15, 2014 Treasury crash; the August 24/25, 2015 ETF freeze; the 1987 crash; and the COVID-19 Treasury market dislocation), Cespa and Vives argue that lack of transparency about order flow is a key ingredient that can jam the &amp;ldquo;rationing&amp;rdquo; function of the cost of trading.&lt;/p&gt;
&lt;p&gt;Model setup: It is a stylized, two-period (trading rounds) rational-expectations model with no noise traders and no asymmetric information about payoffs — only about order flow. A single risky asset (liquidation value v ~ N(0, 1/tau_v)) is traded by competitive CARA agents. There are risk-averse dealers with risk tolerance gamma: a mass mu in [0,1] of &amp;ldquo;full&amp;rdquo; D-dealers present in both periods and 1-mu &amp;ldquo;restricted&amp;rdquo; RD-dealers present only in period 1; both post price-contingent (limit) orders. Overlapping unit-mass cohorts of risk-averse hedgers (risk tolerance gamma_H) receive independent endowment shocks u_t ~ N(0, 1/tau_u) in a non-tradable, perfectly correlated security and submit MARKET orders. Second-period hedgers observe a noisy signal s_u1 = u1 + eta of the first-period order imbalance, with eta ~ N(0, 1/tau_eta); tau_eta indexes transparency (infinity = full transparency, 0 = full opacity). The authors solve for linear equilibria and introduce a novel total-illiquidity measure, the Weighted Average Price Impact (WAPI), which volume-weights the heterogeneous price impacts of u1, u2, and eta.&lt;/p&gt;
&lt;p&gt;Main findings and mechanism: Under full transparency, second-period hedgers can perfectly infer u1, face no price (execution) risk, and supply liquidity via contrarian marketable orders (speculative aggressiveness b &amp;gt; 0); the price impacts of the two cohorts&amp;rsquo; shocks (Lambda_2 and Lambda_21) are independent, liquidity demand slopes DOWN in trading cost, and the equilibrium is unique. Under opacity the signal is noisy (b = 0 under full opacity), Lambda_2 and Lambda_21 become strategic SUBSTITUTES, generating strategic complementarity in illiquidity that can produce MULTIPLE equilibria and make liquidity demand slope UP in trading cost. Multiplicity arises when 0 &amp;lt; tau_u&lt;em&gt;tau_v &amp;lt; gamma/(4&lt;/em&gt;(gamma+gamma_H)^3): three equilibria (two stable extremal, one unstable intermediate). Example with tau_u = 0.1, tau_v = 0.1, gamma = 1, gamma_H = 0.1: Lambda_2 in {8.96, 1.98, 0.12}, Lambda_21 in {0.12, 1.98, 8.96}, Lambda_1 in {0.0001-ish (10^-2), 0.43, 8.84}; with tau_u = 2 a unique equilibrium with Lambda_21 = Lambda_2 = 4.61, Lambda_1 = 2.34. Traders facing the LARGEST trading cost trade most intensely at equilibrium.&lt;/p&gt;
&lt;p&gt;Quantitative comparative statics: An unanticipated, perceived-permanent rise in endowment-shock dispersion produces a flash crash raising WAPI by 44% (from 4.62 to 6.67) and price volatility by 70% (from 4.62 to 7.87); recovery restores the original equilibrium. Halving tau_v raises WAPI by 89% and price volatility by 138%; an 11% decline in gamma raises WAPI by 20% and volatility by 14% (the latter preserving a unique equilibrium — fragility without multiplicity). With restricted dealers, an 11% cut in mu (0.9 to 0.8) when transparency is low can plunge the market to the opposite equilibrium: Lambda_2 from 1.47 to 9.6 (a 653% jump) and WAPI from 5.7 to 10.3 (+80%); a 10% cut (mu 1 to 0.9) raises WAPI from 4.55 to 6.19 (+36%) without multiplicity.&lt;/p&gt;
&lt;p&gt;Implications: When the equilibrium is unique, total welfare is increasing in transparency (tau_eta) and in the mass of always-present dealers (mu), with gains accruing to hedgers and a transfer away from dealers. This supports policies for cheaper, consolidated order-flow information (EU/UK consolidated tape; US Treasury post-trade transparency; the SEC February 2024 dealer rule), while flagging a trade-off: more transparency can erode dealer participation, particularly for riskier securities.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism that turns a benign illiquidity hike into a liquidity rout?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Order-flow opacity. When second-period hedgers cannot observe the first-period endowment shock u1, the price impacts of the first- and second-period shocks (Lambda_21 and Lambda_2) become strategic substitutes: a higher Lambda_2 makes the price more driven by u2, raising cohort-1 hedgers&amp;rsquo; execution risk and shrinking their liquidity demand (|a21| down), which lowers Lambda_21, which in turn lowers cohort-2 execution risk and boosts their demand (|a2| up), further raising Lambda_2. This self-reinforcing loop (formalized by an aggregate best-response Phi(Lambda_2) that is strictly increasing in Lambda_2) is the strategic complementarity that can yield multiple equilibria and fragility. Under transparency the loop is killed because Lambda_2 and Lambda_21 are independent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is this an &amp;lsquo;identification&amp;rsquo;/equilibrium-selection question rather than an empirical one?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a theory paper with no econometric identification. The analogue of &amp;lsquo;identification&amp;rsquo; is equilibrium selection and the formal conditions for multiplicity. The sufficient conditions for fragility are: overlapping cohorts of risk-averse hedgers suffering endowment shocks and submitting market orders; enough opacity about period-1 order flow; and risk-averse dealers. The necessary condition for multiplicity is sufficiently strong strategic complementarity, which is increasing in opacity. The closed-form multiplicity region is 0 &amp;lt; tau_u&lt;em&gt;tau_v &amp;lt; gamma/(4&lt;/em&gt;(gamma+gamma_H)^3).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model distinguish a &amp;rsquo;liquidity dry-up&amp;rsquo; from a &amp;lsquo;flash crash&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both arise when an unexpected shock (a jump in endowment-shock dispersion, i.e. a fall in tau_u, or a rise in dealer risk aversion / fall in gamma, or a fall in tau_v) pushes a market from a unique high-liquidity equilibrium into the multiplicity region and best-response dynamics attract it to a low-liquidity equilibrium. A dry-up is the transition to low liquidity; a flash crash is the same plus rapid recovery once the shock dissipates, all over a short interval. A shock to dispersion gravitates the market to the high-Lambda_2/low-Lambda_21 equilibrium; a shock to dealer risk aversion gravitates it to the low-Lambda_2/high-Lambda_21 equilibrium; in both, WAPI and price volatility rise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the WAPI measure add and why is it needed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because period-2 price reacts with DIFFERENT impacts to u1, u2, and the signal noise eta (coefficients Lambda_21, Lambda_2, Lambda_22), no single price coefficient captures total illiquidity. WAPI is a volume-weighted average of these price impacts, with weights given by the expected absolute volumes from equilibrium responses (using E|z| = sqrt(2/pi)*sigma_z for normals). It is analogous to a volume-weighted spread for an order that walks the book. WAPI is shown to be U-shaped in transparency tau_eta, even though total welfare is monotonically increasing in tau_eta.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the contrarian marketable order by second-period hedgers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With good information on u1, second-period hedgers post a contrarian market(able) order (b &amp;gt; 0) that offsets the first cohort&amp;rsquo;s selling/buying pressure, providing additional risk-sharing, enhancing the market&amp;rsquo;s risk-bearing capacity, and rationalizing first-period hedgers&amp;rsquo; decision to split their order across rounds. b is increasing in signal precision tau_eta. Under full opacity b = 0 because hedgers cannot predict the direction of the period-1 imbalance, so only dealers absorb the imbalance and risk-bearing capacity collapses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across equilibria and cohorts is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At fragile (multiple) equilibria, trading costs are heterogeneous across cohorts: Lambda_2 and Lambda_21 are negatively correlated (one high, the other low). The cohort facing the HIGHEST market impact demands MORE liquidity (hedging intensity is increasing in the cost of trading it induces). Dealers speculate (consume liquidity) more aggressively in the most illiquid equilibrium — consistent with HFTs stepping up liquidity demand during extreme moves (Brogaard et al. 2018; Bellia et al. 2022). The persistence parameter beta = Lambda_21/Lambda_2 equals 1 at unique/intermediate equilibria (random walk noise), and beta&amp;gt;1 is an indicator of multiple equilibria and fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the welfare results and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Restricted to the UNIQUE-equilibrium case (because with multiplicity hedger payoffs are complex-valued and cannot be ranked), and computed numerically with gamma = gamma_H = 1, tau_v = 1, tau_u = 2: total welfare TW(mu; tau_eta) is increasing in both transparency tau_eta and dealer mass mu. The gain is driven by higher hedger certainty equivalents (CEH_1, CEH_2); restricted dealers&amp;rsquo; CE falls with tau_eta, and D-dealers&amp;rsquo; CE falls with mu and (when tau_eta is not too small) with tau_eta. So transparency/dealer-presence policies raise welfare via a transfer from liquidity providers to consumers. A well-defined-payoffs condition is gamma_H^2&lt;em&gt;tau_u&lt;/em&gt;tau_v &amp;gt; 1 (which, when tau_eta=0 and mu=1, also implies a unique equilibrium).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the transparency-versus-dealer-participation trade-off?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;More transparency spurs second-period hedgers&amp;rsquo; speculation, eroding dealers&amp;rsquo; profits, which in a free-entry sense raises effective entry costs and induces some dealer exit (lower mu). Keeping total welfare constant against rising tau_eta requires a smaller mu cut for riskier securities (tau_v = 1) than for safer ones (tau_v = 3). Hence moderate transparency increases can reduce always-present dealer mass and may hurt welfare, especially for risky securities. With low transparency, raising mu has a NON-MONOTONIC effect on fragility (can move from multiple to unique and back), so enhancing transparency — not just dealer presence — is the key tool to eliminate fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from prior fragility literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It departs on three dimensions: (i) the disruptive strategic complementarity is on the liquidity DEMAND side, not the supply side (unlike Brunnermeier-Pedersen 2009, Gromb-Vayanos 2002 funding constraints, Cespa-Foucault 2014, Cespa-Vives 2015); (ii) fragility relies on NO irrationality, noise trading, or exogenous demand/supply (unlike crash models of Gennotte-Leland 1990, Jacklin et al. 1992, Madrigal-Scheinkman 1997); (iii) asymmetric information is about the order flow, not payoffs. It also endogenizes an AR(1) noise-trading process whose persistence beta is determined in equilibrium. It supersedes the authors&amp;rsquo; earlier working paper Cespa-Vives (2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model map to fragmentation and OTC markets?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Trading rounds 1 and 2 can be reinterpreted as separate venues; opacity then captures the limited flow of order information across venues, and mu (always-present dealers) is a reduced-form proxy for fragmentation-related dealer presence. Results should hold a fortiori in fragmented OTC markets, which are more opaque than centralized ones. Unlike Chen-Duffie (2021), Malamud-Rostek (2017), and Manzano-Vives (2021) — where fragmentation can raise welfare via traders&amp;rsquo; price impact — here traders are competitive, so those advantages do not arise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness and extension checks are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The partially-opaque case (finite tau_eta) is studied numerically: one or three equilibria can arise, with multiplicity when transparency is low; b&amp;gt;0 and increasing in tau_eta dampens complementarity. The general model with restricted dealers and partial opacity is simulated (Figure 9 partitions (mu, tau_eta) into unique vs. multiple-equilibria regions). Remark 1 allows period-specific endowment variances (tau_u1, tau_u2) and confirms the substitutes logic; as tau_u1 to infinity the transparent solution is recovered. Internet Appendices cover a partially informative signal, comparative statics for tau_v and gamma_H, the AR(1) noise process, the case where first-period hedgers observe u2, and a ranking of hedging aggressiveness across regimes (Corollary 11).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What real-world episodes does the model claim to rationalize, and how is the empirical case made?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is consistent with the May 6, 2010 flash crash, the 2015 ETF freeze (where uncertainty over ETF constituents sidelined arbitrageurs and the SPY-RSP spread reached 21 dollars at one point), and the COVID-19 US Treasury dislocation around March 12, 2020 (spreads up roughly tenfold and depth virtually disappearing, per Duffie 2023). Empirical support for non-standard liquidity provision via contrarian marketable orders is drawn from Brogaard et al., Biais et al. (2017), Anand et al. (2013, 2021). The paper itself runs calibrated simulations (normal-volatility tau_v=1,tau_u=2 giving ~30% return volatility per Yuan 2005; and a liquidity-crisis tau_v=tau_u=0.1 case) rather than original econometric estimation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;placeholder&lt;/strong&gt;: placeholder&lt;/p&gt;</description></item><item><title>Means-Tested Transfers in the US: Facts and Parametric Estimates</title><link>https://macropaperwarehouse.com/papers/means-tested-transfers-in-the-us-facts-and-parametric-estimates/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/means-tested-transfers-in-the-us-facts-and-parametric-estimates/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Guner, Rauh, and Ventura document the scope, generosity, distributional impact, and time evolution of means-tested transfers to working-age US households, and provide parametric estimates of transfer functions for use in applied macroeconomics and public finance. The paper addresses three questions: How large are these transfers? How do they affect income inequality? How have they changed over time? The contribution is descriptive and empirical rather than structural; the paper does not estimate behavioral effects but rather characterizes the effective transfer schedule that households face.&lt;/p&gt;
&lt;p&gt;The data source is the Survey of Income and Program Participation (SIPP), using five waves spanning 1998 to 2016. The benchmark analysis uses the 2014 wave (years 2013–2016). The sample is restricted to household-years in which the head is aged 25–54, is not self-employed, and does not switch marital status within the year — yielding 18,612 households and 38,375 household-year observations. Six programs are covered: TANF, SNAP, WIC, SSI, housing assistance, and Medicaid. For TANF, SNAP, WIC, and SSI, transfer values are observed directly. Medicaid values are imputed using regional HMO premium costs; housing values are imputed as the difference between Fair Market Rent and actual rent paid.&lt;/p&gt;
&lt;p&gt;In the 2013–2016 benchmark period, approximately 35% of working-age households receive some means-tested transfer in a given year, and, conditional on receipt, the average household receives about $17,000 (in 2016 dollars), exceeding one-fourth of average household income. Unconditional total transfers decline steeply with income but in a non-monotone way: households with zero non-transfer income receive $7,500 in non-medical and $13,700 in Medicaid transfers ($21,000 total, or 26% of mean household income). Transfers dip for households with small positive incomes (creating a hump shape), then rise slightly before declining again. At the bottom income decile (0–10%), households receive on average $4,125 in non-medical transfers and $14,141 total. At the median income decile (50–60%), households receive $425 non-medical and $3,006 total. In the top decile, non-medical transfers are negligible ($169) and total transfers are $1,200. The decline in unconditional transfers with income is driven primarily by reduced coverage: conditional on receipt, transfer amounts are relatively stable across income levels, remaining above 15% of mean household income throughout the distribution. The extensive margin of coverage is 82% for zero-income households, 70% for the bottom decile, 29% at the median, and still 5% (non-medical) to 11% (including Medicaid) in the top decile.&lt;/p&gt;
&lt;p&gt;Medicaid is the dominant program throughout. For zero-income households, Medicaid transfers are more than six times larger than the next-largest program (SNAP). Medicaid&amp;rsquo;s share of total transfers rises with income. As a single program, Medicaid reaches 31% of working-age households with an average conditional benefit of about $15,000 per recipient. SNAP covers 18% of households with conditional benefits of about $3,000.&lt;/p&gt;
&lt;p&gt;Transfers substantially compress inequality. The pre-transfer Gini coefficient is 0.48 and falls to 0.42 when all transfers (including Medicaid) are included, and to 0.46 with non-medical transfers only. The pre-transfer 50-10 income ratio of 10.2 drops to 3.0 with all transfers and to 5.6 with non-medical transfers only. The variance of log income falls by nearly 36% (47 log points) with all transfers and by 21% with non-medical transfers. These equalizing effects are concentrated at the bottom of the distribution; for households at 10% of average pre-transfer income, total transfers more than double disposable income.&lt;/p&gt;
&lt;p&gt;Between 1998–1999 and 2013–2016, total unconditional transfers per household quadrupled from approximately 2% to 7.3% of mean household income (from about $1,535 to $6,000). Household coverage rose from 19% to 35%. The expansion is driven almost entirely by Medicaid; non-medical transfers rose only marginally in magnitude (from about 1.3% to 1.8% of mean income), though their coverage increased from 16% to 24% of households. Notably, over this period the concentration of non-medical transfers shifted upward in the income distribution: households with zero income received a smaller relative share in 2013–2016 than in 1998–1999, while shares for households in the second, third, and fourth deciles increased. Pre-transfer income inequality rose substantially over the period, with the Gini increasing from 0.40 to 0.48; the post-transfer Gini rose more moderately, from 0.38 to 0.42, indicating that transfer growth largely offset rising market-income inequality at the bottom.&lt;/p&gt;
&lt;p&gt;For the parametric section, the paper estimates a flexible four-parameter Ricker-style function T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 for positive income I (normalized by mean income), with a separate level parameter gamma at I = 0. This captures the hump-shaped pattern at low incomes and the rapid decline thereafter. Implicit benefit reduction rates derived from these estimates are large: earning one additional dollar when starting from zero income reduces total transfers by more than $11,000, as crossing from zero into positive income sharply reduces program eligibility. A more realistic $10,000 income increase reduces total transfers by more than $5,000 — an implicit marginal tax penalty exceeding 50%. Non-medical transfer penalties are somewhat smaller: the first dollar earned reduces non-medical transfers by more than $4,500, and a $10,000 income increase reduces them by about $3,300.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is descriptive, not causal — there is no causal identification strategy in the traditional sense. The authors document reduced-form facts about transfer receipt by income level and demographic group using SIPP microdata. The main methodological choices and data limitations are: (1) Medicaid and housing assistance values are imputed rather than directly observed — Medicaid is valued at regional HMO premiums, which may not accurately reflect the value recipients place on coverage; housing benefits are valued at the difference between state Fair Market Rent and actual rent paid, which can produce negative values (2.7% of cases, set to zero). (2) SIPP is known to under-report income at the top of the distribution relative to the CPS; the paper documents that income shares of the top quintile differ by about five percentage points between SIPP and CPS, largely due to SIPP&amp;rsquo;s poor measurement of asset income. This means the effective transfer schedule at the top of the income distribution may be somewhat distorted. (3) The SIPP was overhauled after 2016, precluding analysis of more recent waves and meaning the trends analysis ends in 2013–2016. (4) Self-employed households are excluded (~7% of households) as their income measurement is noisier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper handle the non-linear hump-shaped pattern in transfers at low income levels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper documents a hump-shaped pattern: transfers are positive at zero income, fall sharply at very low positive income (around the bottom 1% of the distribution), then increase modestly before declining monotonically. This arises because crossing from zero income to any positive income can reduce eligibility for several programs simultaneously. The parametric functional form — the Ricker function from fisheries biology — is specifically chosen to capture this pattern: for I &amp;gt; 0, T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1, where the beta_0 term governs the initial decline/rise and beta_1 allows further curvature. The zero-income level gamma is estimated separately as a discontinuity. The tight confidence intervals around observed income-percentile averages confirm that the fitted function closely tracks the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity by demographic group is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper documents heterogeneity along three dimensions — marital status, number of children, and age of children — in each case reporting both unconditional and conditional transfer amounts and coverage by income decile. Key findings: (a) Marital status: Single-woman households with zero income receive 12% of mean household income in non-medical transfers and about 31% in total transfers. Married households with zero income receive 27% total, and single men receive 17.9% total. At higher income levels, married households can receive more in total transfers than single women, because Medicaid coverage is broader for families. Single-woman households show the highest coverage at very low incomes (88% receive some transfer), but married households lead in coverage at middle income levels. Single men show surprisingly high coverage even at relatively high incomes. (b) Number of children: Transfers increase substantially with children. A first-decile married household without children receives about 1.7% of average income in non-medical transfers and 9% total; with two or more children, non-medical transfers rise nearly five-fold for single-woman households in the same decile. (c) Age of children: Transfers decline as children age, but the magnitude of the age gradient is smaller than the number-of-children gradient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do conditional and unconditional transfers compare across the income distribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Unconditional transfers (averaged over all households including non-recipients) decline steeply with income, driven primarily by falling coverage rates. Conditional transfers (among recipients only) are much more stable. For zero-income households, total conditional transfers average $26,500 (32% of mean income) versus $21,000 unconditionally. In the bottom decile, conditional total transfers are about $21,000 or 26% of mean income. After the third income decile, conditional transfer levels stabilize and remain above 15% of mean income throughout most of the distribution. This means that once a household is enrolled in the transfer system, the amounts received are relatively constant regardless of where in the distribution they fall; the intensive margin differences are largely accounted for by Medicaid, which has high conditional values even at middle income levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does Medicaid play relative to non-medical programs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Medicaid dominates the transfer system for working-age households by every measure. It reaches 31% of households in the benchmark period (the next largest program, SNAP, covers 18%). For zero-income households, Medicaid transfers are more than six times larger than SNAP (the next largest non-medical program). Medicaid&amp;rsquo;s share of total transfers grows with income: for zero-income households, total transfers are less than three times non-medical transfers; for households in the 50–60th percentile, this ratio exceeds six. In terms of aggregate spending, Medicaid rose from below 1% of GDP in 1980 to more than 3% in 2022, while non-medical transfers declined from 1.6% to about 1% of GDP over the same period. Almost the entire growth in household transfers between 1998 and 2016 is attributable to Medicaid expansion. Medicaid is also the most important single contributor to measured inequality reduction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do transfers affect income inequality and how has this changed over time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the 2013–2016 benchmark, total transfers reduce the Gini coefficient by 6 points (from 0.48 to 0.42) and the variance of log income by nearly 36%. The 50-10 income ratio falls from 10.2 to 3.0. Non-medical transfers alone reduce the Gini by 2 points (to 0.46) and the 50-10 ratio to 5.6. The impact is concentrated at the bottom of the distribution: transfers more than double total income of households with pre-transfer income around 10% of the mean. Over time, pre-transfer inequality rose sharply, with the Gini going from 0.40 (1998–1999) to 0.48 (2013–2016) and the 50-10 ratio doubling from 4.19 to 10.2. Post-transfer inequality rose more mildly: the Gini increased from 0.38 to 0.42 (all transfers), and the 50-10 ratio remained stable at around 3 throughout. Excluding Medicaid, the moderating effect is weaker; the Gini rose from 0.39 to 0.46 on a post-non-medical-transfer basis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How has the concentration of transfers across income groups evolved over time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A notable distributional shift occurred between 1998–1999 and 2013–2016. For non-medical transfers, the share accruing to households with zero income declined substantially — from receiving about $9 per $100 of total transfers distributed in 1998–1999 to about $4 in 2013–2016. Similarly, the relative share for the bottom decile declined. In contrast, the share going to households in the second, third, and fourth income deciles increased. For total transfers including Medicaid, the pattern is similar but the shift is less pronounced, partly because Medicaid expansion was broad and reached middle-income working families. The authors interpret this as reflecting the design changes in the transfer system: TANF (which targeted the very bottom) declined sharply while Medicaid expansion (which reaches further up the distribution) grew.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the implicit benefit reduction rates and why do they matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper derives implicit benefit reduction rates from the estimated parametric transfer functions. At zero income, earning the first dollar of income triggers a very large decline in transfers because eligibility for several programs is lost simultaneously. Specifically, earning $1 reduces non-medical transfers by more than $4,500 and total transfers by more than $11,000. This enormous implicit marginal tax reflects the discontinuity at zero income. For more realistic income increments, earning an additional $10,000 when starting from zero income reduces total transfers by more than $5,000 (over 50% implicit tax rate) and non-medical transfers by about $3,300. These findings are directly relevant for quantitative macroeconomic models that study labor supply and welfare, since the effective marginal tax on low-income workers entering employment is substantially higher than the statutory rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper differ from prior work on parametric tax and transfer functions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The closest antecedents are Gouveia and Strauss (1994), Heathcote, Storesletten, and Violante (2017) (who use the Benabou log-linear tax function), and Guner, Kaygusuz, and Ventura (2014) (who provide effective income tax estimates). Prior work either focused on taxes only or combined taxes and transfers into a single progressivity measure. This paper is the first to estimate effective transfer functions separately from the tax system, decomposed by program, by marital status, and by number of children. Relative to Guner et al. (2023), which assumed transfers decline linearly with income, this paper estimates a more flexible non-linear function that captures the hump at very low incomes. Relative to Ferriere et al. (2023), who propose a transfer function that increases then decreases with income, the current paper provides empirical estimates rather than a theoretical prescription. The functional form (a Ricker-style function with a separate parameter at zero income) is also more flexible than prior approximations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What data limitations are noted and how do they affect comparability with other sources?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper compares SIPP income distributions with the CPS. Both surveys yield similar Gini coefficients and variance of log income, but SIPP shows higher income shares for the bottom quantiles and lower shares for the top quintile (a discrepancy of about five percentage points). This reflects SIPP&amp;rsquo;s weaker measurement of asset income, which is a larger component of total income as one moves up the distribution. The analysis excludes self-employed households (~7%) because their income is harder to measure. The SIPP was overhauled after 2016, making cross-wave comparisons infeasible for later years; this means the paper cannot characterize the effects of post-2016 Medicaid expansion, the COVID-19 pandemic transfer surge, or recent SNAP reforms. For Medicaid, the imputation using regional HMO costs does not capture the insurance value as households themselves perceive it, a standard limitation in this literature also noted by Ben-Shalom et al. (2012) and Scholz et al. (2009) whose methods the paper follows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications of the findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several implications follow with scope conditions: (1) The transfer system substantially reduces income inequality, but the lion&amp;rsquo;s share of the reduction comes from Medicaid. Policies that reduce Medicaid coverage would substantially raise measured inequality, particularly at the bottom of the distribution. (2) The implicit benefit reduction rates documented — above 50% for a $10,000 income gain at the bottom — generate large effective marginal taxes on low-income households entering employment, relevant for evaluating welfare-to-work policies and for calibrating labor supply elasticities in quantitative models. (3) Despite the large size of the system, the decline in TANF spending (from above 1% of GDP to 0.1%) means that unrestricted cash assistance to the very poorest has fallen sharply; the system has shifted toward in-kind and medical programs that provide less flexibility to recipients. (4) The shift in transfer concentration away from zero-income households toward the second through fourth deciles suggests that the system increasingly supports the working poor rather than the non-working poor — a structural change in the composition of welfare that quantitative models should incorporate. These implications pertain to households headed by working-age adults (25–54), are based on pre-2016 data, and exclude the institutionalized population and self-employed households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key features of the parametric function and how well does it fit the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The estimated function has the form T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 for I &amp;gt; 0 and T(0) = gamma, estimated by non-linear least squares on income-percentile averaged data. The function is flexible enough to capture: (a) a strictly positive level at zero income; (b) an initial increase then decrease at very low positive incomes (the hump); (c) a decay toward zero at high incomes that can be faster or slower depending on beta_1. The fit is shown to be close — Figure 7 documents tight confidence intervals around mean transfers by percentile, confirming that a smooth function well approximates the data. Parameter estimates are provided for each individual program, for non-medical aggregates, for total transfers, and separately for married and single households and by number of children (in appendix tables C10–C12). The zero-income gamma parameter is notably small for TANF (0.00) and large for Medicaid (0.24) and total transfers (0.26), consistent with the descriptive findings on coverage.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Means-tested transfer&lt;/strong&gt;: In this paper, a government transfer program for which eligibility and benefit amounts are conditioned on household income and assets, targeting the non-retired working-age population. The six programs studied are TANF, SNAP, WIC, SSI, housing assistance, and Medicaid.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive margin of coverage&lt;/strong&gt;: The fraction of months in a given calendar year during which a household receives a positive transfer amount, as distinct from the extensive margin (whether the household receives any transfer at all during the year). The paper documents both margins separately.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implicit benefit reduction rate (implicit penalty)&lt;/strong&gt;: The reduction in transfer payments associated with a marginal increase in non-transfer income, expressed as the derivative of the estimated transfer function with respect to income. In this paper the implicit penalty at zero income is very large because moving from zero to any positive income simultaneously triggers loss of eligibility in multiple programs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unconditional vs. conditional transfer&lt;/strong&gt;: Unconditional transfers are averages computed over all households at a given income level, including non-recipients. Conditional transfers are averages computed only among households that actually receive a positive amount. The paper shows that the steep decline in unconditional transfers with income is almost entirely a coverage effect; conditional amounts remain relatively stable across the distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ricker transfer function&lt;/strong&gt;: The parametric functional form T(I) = exp(alpha) * exp(beta_0 * I) * I^beta_1 adopted by the paper to fit the non-linear relationship between normalized household income and normalized transfer receipt for I &amp;gt; 0, with a separate parameter gamma for I = 0. Borrowed from the Ricker (1954) stock-recruitment model in fisheries biology and chosen for its flexibility in capturing the hump-shaped pattern at very low incomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-medical transfers&lt;/strong&gt;: The aggregate of TANF, SNAP, WIC, SSI, and housing assistance — the programs that provide cash or in-kind support excluding health insurance. The paper distinguishes these from total transfers throughout to separate the role of Medicaid, which dominates all other programs in magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Medicaid imputation&lt;/strong&gt;: The procedure used to assign a monetary value to Medicaid enrollment, following Scholz et al. (2009) and Ben-Shalom et al. (2012). Each enrolled household member is assigned the cost of a single HMO policy in their Census region (from the Kaiser Foundation Employer Health Benefits survey), with family policies or sums of individual policies used for multi-member households, and a 2.5× multiplier for elderly or disabled individuals to reflect higher medical needs.&lt;/p&gt;</description></item><item><title>Medical innovation and health disparities</title><link>https://macropaperwarehouse.com/papers/medical-innovation-and-health-disparities/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/medical-innovation-and-health-disparities/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks why medical innovation can widen health disparities even when it unambiguously improves health for everyone who takes it. The authors argue that the standard access-versus-preferences dichotomy is a false one: disadvantaged patients can rationally forgo effective medications because treatment side effects interfere with work, and the income cost of not working is particularly severe for low-education workers who hold physically demanding, inflexible jobs. Health-maximizing and welfare-maximizing behavior are therefore not the same thing, and the gap between the two is systematically larger for lower-education individuals.&lt;/p&gt;
&lt;p&gt;The empirical setting is the introduction of Highly Active Antiretroviral Therapy (HAART) for HIV in the mid-1990s. HAART was substantially more effective than prior mono- and combo-therapy at preventing AIDS progression and death, but it produced harsh physical side effects (fatigue, diarrhea, headache, fever). Data come from the Multi-Center AIDS Cohort Study (MACS), a semi-annual panel of men who have sex with men in Baltimore, Chicago, Pittsburgh, and Los Angeles, covering 1991–2003. After sample restrictions, the analysis uses 11,290 person-visit observations for 1,201 HIV-positive individuals aged 30–64, approximately 63% of whom hold a college degree or more. The study dichotomizes education into less-than-college versus college-or-more and tracks treatment choices, labor supply, immune-system health (CD4 count, with AIDS threshold at 250), physical ailments, income, insurance, and out-of-pocket medical expenditures.&lt;/p&gt;
&lt;p&gt;The structural model is a lifecycle discrete-choice dynamic programming framework in which forward-looking individuals simultaneously choose treatment (no treatment, monotherapy, combotherapy, and post-1995 HAART) and full-time work or non-work each half-year period to maximize expected lifetime utility. Health and survival evolve stochastically as functions of prior health, treatment, and age. Utility is a function of consumption (income minus out-of-pocket expenses), ailments, and labor supply, with utility parameters allowed to differ by education. The model is estimated via maximum likelihood using nested backwards induction; the quasi-experimental introduction of HAART as an unanticipated shock helps identify utility parameters.&lt;/p&gt;
&lt;p&gt;Key quantitative results: (1) HAART drastically reduced mortality for both groups—six-month mortality fell from 9% to 2% for less-educated men and from 6% to 1% for college graduates—and raised the probability of maintaining a high CD4 count from 62% to 78% (less-educated) and 68% to 83% (college+). (2) Despite equivalent access (both groups face roughly 91-95% insurance coverage and similarly low out-of-pocket costs), lower-educated men adopted HAART at a lower rate (58% of post-HAART visits versus 66% for college graduates) and approximately five months later. (3) The structural utility parameters confirm that while the direct disutility of ailments is not significantly different across education groups, the disutility of working while experiencing ailments is substantially larger in magnitude for less-educated men (estimated parameter -2.73) than for college graduates (-1.97). (4) Measured as expected lifetime utility, HAART&amp;rsquo;s introduction increased value for low-CD4 men by 236.1% (less-educated) versus 176.6% (college+), but in absolute utility units the gains were larger for college graduates—establishing that HAART increased welfare inequality. (5) Decompositions show the largest single driver of the education gap in HAART value is the differential survival process; income differences also matter but financial access variables (insurance, out-of-pocket costs) explain little. (6) A simulated six-month HAART mandate improves health—by 1.7 percentage points more for less-educated men—but reduces expected lifetime value by 2.8% for the less-educated versus 1.4% for college graduates, and reduces employment by 4.1% versus 1.6%, as mandated HAART forces men into ailment-producing treatment whose side effects they cannot manage alongside work. (7) A counterfactual $10,000-per-six-months non-labor income subsidy (similar to COVID-19 transfer policies) reduces work by 31–49% for less-educated men and by 25–39% for college graduates, while inducing an 81.2% increase in HAART take-up among less-educated men in good health who were not previously on treatment (from 5% to 9% baseline probability), and a 44.5% increase for similar college graduates (8% to 11%). For men with AIDS-level CD4 counts not on treatment, the policy raises the probability of being healthy next period by 12.6% for less-educated men and 5.3% for college graduates.&lt;/p&gt;
&lt;p&gt;The central mechanism is a wedge between health and welfare that is steeper for disadvantaged workers: occupational conditions make it harder to work while experiencing side effects, so the opportunity cost of HAART compliance is higher. This means effective medical innovation—precisely by creating more severe side effects than older regimens—can widen welfare inequality even as it compresses mortality gaps. Clinical trials that randomize assignment to treatment and measure health outcomes will register the innovation as a success while masking the distributional welfare costs. Policy interventions that reduce the cost of not working (income transfers, labor market restructuring) can simultaneously increase HAART take-up and improve health, with effects concentrated among the disadvantaged.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the main identification strategy and what are the key threats to identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is estimated by maximum likelihood using nested backwards induction over observable state variables. A key identifying variation is the quasi-experimental, unanticipated introduction of HAART in 1995, which shifts the choice set mid-panel and allows the authors to trace behavioral responses to an exogenous change in treatment efficacy and side-effect profiles. Disutility of ailments and work parameters are identified by conditional choice probabilities given state variables (health, ailment status, prior treatment) and by comparing behavior before and after HAART availability. The authors follow Magnac and Thesmar (2002) to establish that under the distributional assumptions (Type I EV shocks, fixed discount factor β=0.95) and the normalization imposed, the likelihood has a unique maximum. The main threats are: (a) the assumption that individuals were surprised by HAART (no forward-looking anticipation), which simplifies the model but is explicitly noted—Hamilton et al. (2021) show that incorporating individual expectations substantially complicates the framework; (b) the exclusion of unobserved heterogeneity in the utility function, though specifications including it produce very small probabilities of a second type (below 5%); (c) the absence of borrowing and saving, which could allow more educated individuals to smooth consumption across treatment cycles—the authors note this would bias downward the disutility of working with ailments for higher-educated individuals, meaning the estimated cross-education difference in that parameter is a lower bound; (d) the sample is restricted to white men in four cities, limiting external validity; and (e) the education dichotomy collapses heterogeneity within education groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms through which education moderates the health-welfare tradeoff, and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper identifies two nested channels. First, the estimated structural utility parameter for working while experiencing ailments is larger in magnitude for less-educated men (θ = -2.73) than for college graduates (θ = -1.97), indicating greater disutility from combining work and side effects. The paper argues this reflects occupational sorting: lower-education men are significantly more likely to hold manual occupations (occupation score 5.12 versus 4.49 for college graduates, where higher scores indicate more manual tasks per Autor et al. 2003), making physical side effects especially incompatible with job performance. Second, lower-educated men have lower incomes ($15,373 versus $22,290 per half-year for less-educated versus college-educated, pre-HAART), so the income cost of not working is larger in relative terms, creating stronger incentives to maintain employment even at the cost of forgoing treatment. The authors decompose the relative contribution of these mechanisms in the non-labor income subsidy simulation: when they give lower-educated men the income process of higher-educated men (Appendix Figure A1), the gap in behavioral response narrows but does not close; when they give lower-educated men the disutility parameters of higher-educated men (Figure A2), similarly the gap narrows but remains. Both mechanisms are jointly operative.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in HAART take-up and welfare value is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Education is the primary heterogeneity dimension examined. Post-HAART, lower-educated men used HAART in 58% of observations versus 66% for college graduates, were slower to start (5 months later on average), and less likely to ever use it (67% versus 81%). Health status interacts with education: low-CD4 men gain more in percentage terms from HAART because they are more in need of its health-improving effects (236.1% gain for less-educated low-CD4 versus 176.6% for college-educated low-CD4; 85.7% versus 76.3% for high-CD4 men, with college graduates gaining more in absolute utility units throughout). The welfare cost of a treatment mandate is higher for less-educated men (2.8% lifetime value decline versus 1.4%), and the employment reduction induced by the mandate is also larger for them (4.1% versus 1.6%). In the income subsidy simulation, low-CD4 men not on any medication show the largest health response. The paper does not examine race/ethnicity heterogeneity, having excluded non-white individuals from the analysis due to sampling methodology concerns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the value decomposition reveal about why HAART benefited more-educated men more?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table A17 sequentially replaces the processes and parameters of lower-educated agents with those of higher-educated agents. Giving lower-educated men the income process of college graduates narrows but does not close the gap—income is not the primary driver. Replacing the insurance and medical expenditure processes slightly reduces value for less-educated men relative to giving them only the income process, because more-educated individuals actually have somewhat higher out-of-pocket costs. Changing the health and ailments processes has modest positive effects. The largest single contributor to closing the education gap is the survival process: less-educated men face much higher baseline mortality, which depresses the expected present value of all future flows including the gains from HAART. This suggests that policies targeting survival differentials (e.g., access to other health services) could partially close the HAART welfare gap. Finally, replacing the utility parameters mechanically closes the remaining gap, but preferences are less amenable to direct policy intervention than the survival process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the treatment mandate simulations show, and why do they matter for evaluating clinical trials?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A six-month HAART mandate mimics randomized assignment to treatment in a clinical trial. It improves health—the probability of high CD4 rises by 1.7 percentage points more for less-educated men than baseline (reflecting a larger baseline gap in HAART use)—which would appear a policy success from a health-only perspective. However, expected lifetime utility falls by 2.8% for less-educated men and 1.4% for college graduates, because mandated HAART forces individuals into ailment-inducing treatment they would not have chosen, inhibiting labor supply. Employment falls by 4.1% for less-educated men versus 1.6% for college graduates. Appendix analyses removing the ailment-producing properties of treatment largely eliminate both the welfare cost and the employment effect, confirming that ailments are the mediating channel. This shows that clinical trials—which typically report health endpoints and do not measure welfare or distributional consequences—can mask the costs that effective but side-effect-heavy treatments impose, and that those costs fall disproportionately on less-advantaged patients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the non-labor income subsidy simulation show, and which groups respond most?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A permanent $10,000-per-six-months increase in non-employment income (approximately 50% of median income, calibrated to COVID-era transfer policies) induces labor force exit across all groups but concentrates its health-promoting effects among disadvantaged men who were not already on HAART. Among relatively healthy (high-CD4) less-educated men not using any medication, HAART take-up rises by 81.2% (from 5% to 9%); the corresponding figure for college graduates is 44.5% (from 8% to 11%). Among men with AIDS-level (low) CD4 not on treatment, the probability of being healthy next period increases by 12.6% for less-educated men and 5.3% for college graduates. Men already on HAART—who are unlikely to change treatment regardless—show little response. The policy has small but positive health externalities beyond the immediate recipients, since people on antiretrovirals have lower viral loads and lower transmission risk. Decomposition simulations (Appendix Figures A1–A2) show that both the income-level channel and the disutility-of-work-with-ailments channel independently contribute to the larger lower-education response, with neither alone sufficient to fully explain the differential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is most closely related to Papageorge (2016, Quantitative Economics), which uses the same MACS data and setting to link non-uptake of HAART to labor supply and side effects. The key difference is scope: Papageorge (2016) focuses on individual-level mechanisms; the present paper&amp;rsquo;s goal is to characterize distributional differences in the health-welfare tradeoff across education groups and to show that innovation can exacerbate existing inequality. Chan, Hamilton, and Papageorge (2016, Review of Economic Studies) also use the MACS setting to study the value of medical innovation, and Hamilton, Hincapié, Miller, and Papageorge (2021, International Economic Review) examine the diffusion of HAART. Relative to the sociological fundamental cause theory literature (Link and Phelan 1995; Phelan et al. 2010), which documents that medical innovations tend to widen health disparities, the present paper provides a structural quantification of the specific mechanisms and their relative magnitude. Relative to papers attributing health disparities primarily to access barriers (insurance, cost), the paper provides evidence that for this sample—where insurance coverage exceeds 91% even for less-educated men and HIV drugs are inexpensive—access explains little of the educational disparity in HAART use or health outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core implication is that policies reducing the cost of not working—income transfers, disability benefits, worker protections—can raise HAART adoption and improve health among disadvantaged patients, precisely the group for whom standard health-access policies have limited traction. The non-labor income subsidy simulation suggests that the health improvements are modest in absolute magnitude (a 0.2% rise in probability of being healthy next period for the best-responding group among high-CD4 non-HAART users, and 13% for low-CD4 non-HAART users), but there are unmodeled positive externalities through reduced transmission risk that would multiply the social return. Scope conditions: (1) The sample is white men who have sex with men in four U.S. cities during 1991–2003, enrolled in a prospective cohort study; generalizability to other populations (women, racial minorities, other diseases) is uncertain. (2) The income subsidy that triggers HAART take-up must be large enough to induce labor force exit; a $10,000 per-six-months transfer is needed to generate the simulated behavioral response, larger for higher-income workers. (3) The paper explicitly notes that drug costs and insurance are not binding constraints in this sample, and the policy conclusions may differ in settings with weaker drug coverage. (4) Mental health is excluded from the model; the paper shows depression variables have smaller effects on treatment choice than the physical mechanisms included, but mental health could independently affect some populations&amp;rsquo; response. The paper&amp;rsquo;s conclusions extend to other conditions where effective treatment has disabling side effects and disadvantaged patients hold inflexible physical jobs—the authors invoke COVID-19 as a contemporary analog.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are conducted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors report several robustness exercises. Treatment transition results are shown to be robust to defining the HAART introduction period as survey visit 23 or 25 rather than 24. Ailment specifications are noted to be robust to varying the type or frequency of ailments counted (citing Papageorge 2016 for this). Specifications including unobserved heterogeneity in the utility function produce very small second-type probabilities (below 5%), arguing against its inclusion. The treatment mandate simulations are run under three alternative shock-assignment methods (2 draws, 8 draws, and the preferred 2-draw approach), with results consistent across methods on the main welfare-versus-health asymmetry. Appendix Tables A19 and A20 remove ailments from all medications and from HAART only, respectively, confirming that the welfare cost of mandates is driven by treatment-induced ailments. Appendix Figures A1 and A2 mechanically decompose the education-differential response to the income subsidy by replacing income processes and disutility parameters separately, confirming that both channels are active. The model fit (Table A9) shows overall employment (66% model, 66% data) and HAART use (33% model, 36% data) closely matching, though the model slightly over-predicts medication use among low-CD4 individuals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the paper focus on white men only, and what does this imply for interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors drop 1,098 observations from 390 non-white individuals because of concerns about the sampling methodology used to recruit the refresher sample for those individuals—specifically, non-white participants entered the panel via a different selection process that could confound estimates. The paper does not investigate racial disparities in HAART take-up, which are also well-documented in the literature. This is a significant limitation because HIV/AIDS has disproportionately affected Black men in the United States, and the mechanisms the paper identifies—occupational sorting, income constraints, disutility of working with ailments—may operate differently or more intensely along racial lines. The authors acknowledge this limitation and note that the structural framework could in principle be applied to other groups if appropriate data were available.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Health-welfare tradeoff&lt;/strong&gt;: In this paper, the wedge between the action that maximizes health (taking effective medication despite side effects) and the action that maximizes lifetime utility (avoiding medication to remain employed and maintain income). The tradeoff is not a bias or error but a rational response to economic constraints, and it is wider for less-educated individuals whose occupational conditions make working with side effects especially costly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HAART (Highly Active Antiretroviral Therapy)&lt;/strong&gt;: A combination antiretroviral HIV treatment introduced in the mid-1990s, far more effective than prior mono- or combo-therapy at improving CD4 count and preventing AIDS-level immune decline and death. In this paper&amp;rsquo;s model, HAART serves as the innovation whose adoption the authors study: it is more efficacious but produces harsher side effects than earlier treatments, and its introduction is treated as an unanticipated aggregate shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Disutility of working with ailments&lt;/strong&gt;: A structural utility parameter (θ_2,f=0) capturing how much worse-off an agent feels from working while experiencing physical ailments (fatigue, diarrhea, headache, fever). Estimated at -2.73 for less-educated men and -1.97 for college graduates, this parameter is the primary driver of the differential health-welfare tradeoff across education groups and explains why side-effect-bearing treatments like HAART are disproportionately avoided by lower-education workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Treatment mandate simulation&lt;/strong&gt;: A counterfactual in which all agents are assigned to HAART for six months (eliminating choice among other treatment options), used to mimic randomized assignment in a clinical trial. The simulation is designed specifically to illustrate that health improvements observable in a clinical trial coexist with welfare reductions and employment disruptions that would not be captured in standard trial endpoints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fundamental cause theory&lt;/strong&gt;: A sociological framework (Link and Phelan 1995) arguing that socioeconomic status is a &amp;lsquo;fundamental cause&amp;rsquo; of health disparities that persists despite or is even amplified by medical innovation, because more advantaged individuals are better positioned to adopt and benefit from new treatments. The paper provides structural economic microfoundations for this theory by quantifying the mechanisms through which HAART&amp;rsquo;s introduction widened the welfare gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-labor income subsidy&lt;/strong&gt;: A counterfactual policy simulation in which non-employment income is raised by $10,000 per six months (approximately 50% of the median person&amp;rsquo;s income), modeled after COVID-19 transfer policies. In the paper&amp;rsquo;s model this policy reduces employment but increases HAART take-up and health improvements particularly for less-educated HIV-positive men who were previously forgoing treatment to maintain income from work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Source text origin&lt;/strong&gt;: Not a paper-specific concept but denoted here: the full working paper text was obtained from the NBER Working Paper (No. 28864), not from abstract-only, satisfying the GUARD requirement.&lt;/p&gt;</description></item><item><title>Monetary Policy without Commitment</title><link>https://macropaperwarehouse.com/papers/monetary-policy-without-commitment/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-without-commitment/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Post-pandemic inflation across advanced economies rose to levels not seen since the early 1980s, reviving interest in central bank credibility. The standard quantitative macro models used to interpret this episode assume exogenous central bank reaction functions and inflation targets, which limits their usefulness. This paper instead makes monetary policy endogenous: a welfare-maximizing central bank that lacks the ability to commit re-optimizes every period. The goal is to characterize how lack of commitment shapes long-run inflation and transition dynamics, questions that prior credibility work (Barro-Gordon 1983; Rogoff 1985) could not address because it used static or log-linearized settings.&lt;/p&gt;
&lt;p&gt;Model setup: The authors embed central bank lack of commitment into a standard fully non-linear New Keynesian model (not log-linearized around zero-inflation steady state). Monopolistically competitive firms set prices under Calvo rigidity: a random fraction 1-theta resets prices each period, the rest keep last period&amp;rsquo;s price. Wages are flexible; households choose consumption, labor, savings. The environment is deterministic with permanent unanticipated shocks. An exogenous proportional labor wedge tau (payroll tax capturing taxes, regulation, unionization) is assumed large enough (Assumption 1: tau &amp;gt; -1/sigma) that monopoly distortions persist. Two distortions operate: monopoly power (underproduction) and price dispersion from sticky prices (labor misallocation). The solution concept is Markov Perfect Competitive Equilibrium. Crucially, firms set prices BEFORE the central bank sets the interest rate, so the central bank takes the price distribution (hence dispersion D_t) as predetermined and optimally sets static welfare-maximizing policy: it eliminates monopoly distortions by setting the labor share to 1 (Y_t = D_t^{-1}). Equilibrium reduces to two difference equations: a forward-looking non-linear Phillips curve and a backward-looking price-dispersion law of motion, yielding a unique steady state. The analysis is conducted in a continuous-time limit for transition dynamics.&lt;/p&gt;
&lt;p&gt;Main findings (with magnitudes and scope): (1) Long-run inflation is determined by the interaction of lack of commitment and the environment; steady-state inflation and price dispersion are strictly increasing in the labor wedge tau and strictly decreasing in the elasticity of substitution sigma (the dispersion comparative static in sigma holds for tau below a threshold tau-bar(sigma); the inflation comparative static is unambiguous). (2) Transitions to a higher-inflation steady state feature inflation OVERSHOOTING: inflation jumps on impact then gradually declines, because the central bank&amp;rsquo;s incentive to stimulate is largest early when dispersion/misallocation are low. (3) Quantitative magnitudes are large. Calibration (monthly): beta=(1.02)^{-1/12}, theta=0.86 (7-month price duration, Nakamura-Steinsson 2008), sigma=7 (Coibion et al. 2012), psi=2.5 (Chetty et al. 2011), tau=-0.1427 to target 2% annual inflation. A permanent 0.5% increase in the labor wedge raises steady-state inflation from 2% to 8.76%, with inflation overshooting to 10.11% on impact; it takes 12 months to decline within 25 basis points of the new steady state. A 0.5% decrease in sigma yields similarly large effects.&lt;/p&gt;
&lt;p&gt;Implications: Welfare under inflation targeting strictly exceeds that under no-commitment in both shock scenarios; the welfare gain is about 6% in consumption-equivalent terms (targeting 0.981 vs no-commitment 0.922/0.921). The large magnitudes stem from a nearly vertical long-run Phillips curve (the labor share is insensitive to inflation when beta is near 1). Post-pandemic shocks (lower immigration raising the labor wedge; reduced globalization/supply-chain disruption lowering sigma) do not raise inflation on their own but do so through their interaction with central bank lack of commitment, and may make returning inflation to historic norms unlikely absent strict commitment to inflation targeting.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/solution strategy, and what makes the model tractable?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a theory paper, so &amp;lsquo;identification&amp;rsquo; is the equilibrium characterization rather than econometric identification. The authors solve for Markov Perfect Competitive Equilibria of a fully non-linear (not log-linearized) New Keynesian model. Tractability comes from the timing assumption: flexible-price firms set prices BEFORE the central bank chooses the interest rate. Because the equilibrium is Markov, the central bank at date t takes the price distribution (and hence future dispersion D_{t+1} and continuation value V(D_{t+1})) as predetermined; it cannot change future welfare off the equilibrium path. So it optimally maximizes STATIC welfare conditional on current dispersion, yielding the simple first-order condition Y_t = D_t^{-1} (labor share = 1). Equilibrium then reduces to two difference equations in inflation (forward-looking Phillips curve) and dispersion (backward-looking), giving a unique steady state. A key technical innovation is an auxiliary variable delta_t (the inverse of a discounted sum of future relative prices) capturing the passthrough of real wages to current inflation holding future inflation fixed, which itself has a recursive representation and is related to the slope of the Phillips curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the core economic mechanism generating higher long-run inflation under lack of commitment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Starting from a steady state, a permanent rise in tau (or fall in sigma) increases monopoly distortions and would, under commitment, lower the labor share while keeping inflation fixed. But a no-commitment central bank wants to undo the rise in monopoly distortions by cutting interest rates and stimulating output to push the labor share back to 1. Flexible-price firms rationally anticipate this future stimulus, higher future labor demand, and higher future real wages, so they raise prices today to offset expected future costs. Sequential price increases raise price dispersion. The economy converges to a new steady state once rising dispersion reduces aggregate productivity (labor misallocation) enough that the central bank&amp;rsquo;s marginal benefit from cutting rates vanishes. Hence both long-run dispersion and inflation are permanently higher.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does inflation overshoot in the transition rather than monotonically rise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Overshooting arises from the evolution of central bank incentives as dispersion rises along the transition. Early in the transition, dispersion and labor misallocation are low, so stimulating output to boost consumption is relatively beneficial; later, once dispersion/misallocation are high, the productivity cost of stimulation is high and the benefit falls. Flexible-price firms anticipate that monetary stimulus is front-loaded, so they front-load their price increases. The result is high inflation early that declines toward the new (lower but still elevated) steady-state level. In the phase diagram (dispersion-inflation plane, holding delta fixed), the dispersion-zero locus is upward sloping and the inflation-zero locus is downward sloping; the saddle path has negative slope, so along it inflation and dispersion move in opposite directions. A labor-wedge shock shifts the inflation-zero locus up (leaving the dispersion locus unchanged); inflation jumps to the new saddle path then declines as dispersion rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are the quantitative magnitudes so large?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The steady-state labor share is relatively insensitive to inflation because the positive effect of inflation on the labor share (via overhiring sticky-price firms) is largely offset by the negative effect via forward-looking flexible-price firms that raise prices to protect against future overhiring. Standard New Keynesian calibrations use high beta and low theta, so there is a large fraction (1-theta) of flexible-price firms that raise prices substantially, putting downward pressure on the labor share. Formally, the long-run Phillips curve linking labor share mu and inflation Pi (equation 33) becomes almost vertical when beta is near 1. A nearly vertical long-run Phillips curve means small changes in tau or sigma require large changes in inflation to keep mu unchanged. Implication: any change that flattens the long-run Phillips curve would shrink the magnitudes, lower the value of commitment, and imply meaningful benefits from positive long-run inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the central bank&amp;rsquo;s reaction function and how does it compare to a Taylor rule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Substituting the FOC Y_t = D_t^{-1} into the Euler equation gives 1 + i_t = (1/beta) * Pi_{t+1} * Y_{t+1} * D_t. This endogenously-derived rule resembles exogenous Taylor rules: the interest rate is increasing in expected future inflation and expected future output, and it also reacts to current price dispersion. Higher dispersion reduces labor productivity via misallocation, lowering the benefit of stimulating the economy, so the central bank raises rates. Like Atkeson, Chari, and Kehoe (2010), the central bank responds to off-equilibrium increases in inflation/dispersion by raising rates enough that an individual flexible-price firm would actually want lower price increases off the equilibrium path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the comparative static differ between the labor-wedge shock and the elasticity-of-substitution shock?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both raise long-run inflation and (generally) dispersion and produce overshooting. For inflation the comparative static is unambiguous in both cases. For dispersion, the tau result is clean (Dss strictly increasing in tau), but the sigma result requires a bound: Dss is strictly decreasing in sigma only for tau &amp;lt; tau-bar(sigma) (where tau-bar(sigma)=infinity if sigma&amp;lt;=2, else 1/(sigma^2-2sigma)), because sigma also enters the dispersion law of motion and could in principle make dispersion increase with sigma when tau is large. A second difference appears in the comparison with inflation targeting: under a tau shock, an inflation-targeting central bank keeps rates fixed, output falls permanently, and dispersion is unchanged. Under a sigma shock, sigma directly affects the dispersion-inflation relationship, so even under inflation targeting steady-state dispersion would decline (greater differentiation makes relative price differences a less important source of misallocation) and rates would adjust to facilitate the transition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare comparison and how is welfare measured?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare is expressed in consumption-equivalent terms relative to an otherwise-identical flexible-price economy: how much consumption a household would require, right after the shock, to be indifferent between the sticky-price economy (under targeting or no-commitment) and a flexible-price economy with constant consumption and implied labor. For the labor-wedge shock: welfare under targeting 0.981 vs no-commitment 0.922 (difference 0.059). For the elasticity shock: targeting 0.981 vs no-commitment 0.921 (difference 0.060). In both cases targeting strictly dominates, with gains of about 6% consumption-equivalent. The intuition: targeting reduces the misallocation cost of long-run price dispersion, while no-commitment reduces the cost of rising monopoly distortions; the dispersion costs dominate, especially because high beta makes long-run costs weigh heavily.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work on credibility and non-linear monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends the Barro-Gordon (1983) and Rogoff (1985) credibility tradition, which used static or linearized settings that cannot speak to long-run inflation or transition dynamics. It differs from Markovian linearized approaches (e.g., Halac and Yared 2022) which feature no transition dynamics and significantly OVERESTIMATE the effect of permanent shocks on long-run inflation (because linearization underestimates the welfare cost of rising dispersion). It departs from fiscal-commitment models (Alvarez-Kehoe-Neumeyer 2004; Aguiar et al. 2015) and from Davila-Schaab (2023, which uses quadratic adjustment costs and thus has no price dispersion) by emphasizing the Calvo dispersion cost and its dynamic feedback on the inflation-output tradeoff. Relative to the discretionary-multiplicity literature (Albanesi-Chari-Christiano 2003; King-Wolman 2004; Zandweghe-Wolman 2019), this model obtains a UNIQUE equilibrium and provides an analytical (not numerical) characterization of the steady state and transition. It also contributes a novel recursive representation of the non-linear Phillips curve via the auxiliary variable delta_t.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the transition dynamics of the macro variables in the calibrated exercise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following the permanent labor-wedge increase: inflation jumps up from 2% and gradually declines toward its higher steady state (overshooting). The nominal interest rate jumps up and continues rising throughout the transition (the higher steady-state nominal rate reflects the Fisherian effect present in the non-linear model). The real interest rate jumps DOWN initially (the central bank stimulates to weather the shock) then gradually returns to its original level. Output falls gradually as price dispersion and labor misallocation increase. Nominal wage inflation jumps up with price inflation but stays below it, converging from below; this gap underpins a permanent long-run decline in the real wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Permanent changes in the global economy (e.g., lower immigration shifting labor toward more regulated/higher-wedge sources; slower globalization or supply-chain disruptions raising domestic firms&amp;rsquo; market power, i.e., lower sigma) can raise long-run inflation, but only through their interaction with central bank lack of commitment, not on their own. The post-pandemic inflation spike, and its overshooting, can be partly understood as the private sector rationally anticipating accommodative policy. Scope condition: this holds as long as the central bank operates with FULL DISCRETION; a strict commitment to inflation targeting would prevent it. There can therefore be significant benefits to institutions that enhance commitment. A caveat from the model&amp;rsquo;s own logic: if structural changes flatten the long-run Phillips curve, magnitudes shrink, the value of commitment falls, and there are real benefits to positive long-run inflation (so targeting too low an inflation rate would be costly).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and directions for future research the authors flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is deterministic with permanent shocks and abstracts from monetary-fiscal interactions by assuming lump-sum taxes and Ricardian equivalence (debt is payoff-irrelevant, set to zero). It focuses on the stable steady state, setting aside equilibrium implementation and off-equilibrium inflation stability. The discretionary policy (labor share = 1) is invariant to the price-setting model, so the approach extends to menu-cost or rational-inattention models. Future work: relax Ricardian equivalence to study interactions between central bank and fiscal lack of commitment (facilitated by the framework not assuming a long-run debt level since it is not linearized), and examine off-equilibrium inflation stability.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Mortgage securitization and information frictions in general equilibrium</title><link>https://macropaperwarehouse.com/papers/mortgage-securitization-and-information-frictions-in-general-equilibrium/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/mortgage-securitization-and-information-frictions-in-general-equilibrium/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a quantitative general equilibrium model of the U.S. housing finance system that jointly determines mortgage credit and mortgage-backed security (MBS) issuance, with the aim of measuring how information frictions in the securitization market amplify aggregate credit cycles. The central motivation is the tight co-movement of mortgage credit and MBS issuance documented in HMDA data from 1990 to 2016: from 2000 to 2019, originators sold or securitized roughly 70 percent of all residential mortgages within the first year of origination, making securitization the dominant source of funding for new lending. When this source of liquidity collapsed during the Great Financial Crisis (GFC), aggregate residential mortgage credit contracted by roughly 41 percent and RMBS issuance contracted by roughly 37 percent on average from 2008 to 2013.&lt;/p&gt;
&lt;p&gt;The model is a discrete-time, infinite-horizon DSGE framework with three types of agents: an impatient representative borrower household, a unit-mass continuum of heterogeneous lenders, and a government. Borrower households consume non-durables and housing services, take on long-term fixed-rate mortgages modeled as perpetuities with geometrically declining payments, and can endogenously default when idiosyncratic housing valuation shocks erode their equity. Lenders face stochastic loan origination costs drawn i.i.d. from a continuous distribution, can privately identify the quality of loans in their portfolios, and access a securitization market modeled after the to-be-announced (TBA) forward market for agency MBS — the largest liquid MBS market in the U.S. The TBA market features anonymous, non-exclusive trades at a single pooling price, and the &amp;ldquo;cheapest-to-deliver&amp;rdquo; convention gives sellers the incentive to offload their lowest-value loans, giving rise to a classic Akerlof-style adverse selection problem. The government captures GSE credit guarantees through a state-contingent subsidy to MBS buyers, financed by a distortionary fee on originators and lump-sum taxes on households. The model is calibrated to match key cross-sectional moments of the HMDA dataset for 1990 to 2006, including the distribution of lending: the top 1 percent of originators accounted for 62 percent of lending and the top 10 percent for 89 percent. These moments of market concentration are central to quantifying the amplification channel.&lt;/p&gt;
&lt;p&gt;Two novel theoretical features distinguish this framework. First, the mortgage interest rate and the security price are jointly determined in equilibrium — a &amp;ldquo;joint price determination&amp;rdquo; property. Second, the severity of information frictions is itself an endogenous function of equilibrium prices, the household default rate, and lenders&amp;rsquo; trading decisions. When household credit risk rises, more loans become low-quality, deteriorating the average quality of the pool offered by sellers. MBS buyers, aware of sellers&amp;rsquo; incentives, demand a larger adverse selection discount; security prices fall; fewer lenders find it profitable to securitize; an endogenous liquidity shortage follows in the credit market; and tighter lending conditions further weaken household balance sheets. This feedback constitutes the adverse selection multiplier.&lt;/p&gt;
&lt;p&gt;Quantitatively, when the calibrated model is fed the sequence of income and housing-valuation shocks observed from 2006 to 2016, it replicates two-thirds of the observed 41 percent contraction in mortgage lending and the full 37 percent contraction in MBS issuance from 2008 to 2013. A shock decomposition (Table 7) shows that, on average over 2008–2013, information frictions account for 40 percent of the model&amp;rsquo;s predicted decline in mortgage lending (52 percentage points from housing valuation shocks and 5 percentage points from income shocks make up the remainder; comparable shares hold in the securitization market). There is a 1.5 adverse selection multiplier: absent information frictions, credit would have contracted by 27 percent rather than 41 percent. Housing valuation shocks account for roughly half the total dynamics; income shocks account for about 5 percent.&lt;/p&gt;
&lt;p&gt;Regarding the post-GFC structural changes, the paper evaluates the effect of GSEs expanding their market share to 100 percent (up from 69 percent in 1990–2006) and the threefold increase in the guarantee fee (from 20 to 60 basis points after 2012). These changes reduce the volatility of the mortgage spread from 6.3 to 4.7 percentage points and lower the unconditional probability of a securitization market collapse from 6.5 to near zero. However, the policy generates inefficiently high levels of liquidity, produces only small welfare gains for borrowers (0.06 percent in consumption-equivalent units), and distributes gains unequally — lenders gain approximately 1.3 percent. Households face higher interest rates (lenders pass through the guarantee fee) and higher taxes. The model corroborates other GE studies in finding that credit guarantees were underpriced before the GFC; the actuarially fair price is closer to the post-2012 fee.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the paper&amp;rsquo;s identification strategy and what is the nature of the quantitative exercise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not use a reduced-form empirical identification strategy; it is a structural DSGE model. The quantitative exercise feeds the calibrated model the observed sequences of aggregate household income shocks and housing valuation shocks from 2006 to 2016, with the model calibrated to match pre-GFC (1990–2006) moments of the U.S. mortgage market. The decomposition of information frictions is accomplished by simulating a complete-information counterfactual for the same shock sequence: the difference between the benchmark model and the complete-information economy quantifies the contribution of private information.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the securitization liquidity channel, and how does it operate mechanically in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The securitization liquidity channel is the transmission mechanism from the securitization market to mortgage credit supply. In normal times, lenders with low origination costs (sellers) securitize their loan portfolios, freeing up funds to originate new loans, while high-cost lenders purchase securities rather than originate, effectively specializing their roles through the market. A shock that increases household default risk worsens pool quality. Buyers face a larger adverse selection discount, security prices fall, and the wedge between the market price and a seller&amp;rsquo;s valuation of high-quality loans widens. Many lenders switch from selling to holding, reducing the supply of liquidity in the securitization market. Constrained by limited access to debt markets, lenders cut new mortgage origination. The resulting tightening in credit further deteriorates household balance sheets, creating an amplification loop.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three types of lenders in the model, and what determines their trading decisions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Lenders endogenously sort into three groups based on their idiosyncratic origination cost draw z relative to two equilibrium cutoffs. Sellers (low-cost lenders, z below the first cutoff) find origination sufficiently profitable to sell their inventory of loans into the securitization market and originate new ones. Buyers (high-cost lenders, z above the second cutoff) find origination too costly and instead buy securities from sellers. Holders (lenders with z between the two cutoffs) neither sell at the prevailing adverse-selection-discounted price nor buy at the effective cost grossed up by the information wedge; they retain their illiquid loan portfolios and originate fewer new loans. The information wedge — the distance between the two cutoffs — is a decreasing function of the subsidy coverage and an increasing function of the adverse selection discount.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the adverse selection discount endogenously determined, and why does it amplify shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The per-unit adverse selection discount mu_t is defined as the aggregate fraction of low-quality loans traded in the securitization market: mu_t = S_B_t / S_t, where S_B_t is the aggregate supply of low-quality loans and S_t is total loans traded. This fraction is endogenous: it depends on which lenders sort into the seller category and what quality distribution their portfolios have, which in turn depends on the household default rate and the equilibrium price. When household credit risk rises, the default rate increases, more loans become low-quality, and sellers selectively offload bad loans while retaining good ones. The endogenous deterioration in mu_t raises buyers&amp;rsquo; required discount, further reducing the security price, which causes additional holders to switch away from selling, compounding the adverse selection problem. This self-reinforcing dynamic is the multiplier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Under what conditions can the securitization market shut down entirely, and what happens to credit in that case?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 2 establishes that a sufficient condition for market shutdown in the steady state is that the market effective cost of buying securities exceeds the origination cost of the highest-cost lender in the economy. When this condition holds: (1) the securitization market does not operate; (2) every lender originates using only her own technology; and (3) the mortgage rate is higher than when the market operates. Critically, even when the securitization market collapses, the credit market continues to function, but with higher interest rates and lower intermediation volumes. The economy can transition between states with and without an active securitization market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does market concentration of mortgage originators play in the quantitative results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Market concentration is crucial for the magnitude of amplification. From 1990 to 2016, the top 1 percent of originators accounted for 62 percent of lending and the top 10 percent for 89 percent (from HMDA data). The model is calibrated to match these moments. Because large originators specialize as securitization sellers, their decision to switch from selling to holding — triggered by rising adverse selection discounts — produces very large contractions in aggregate credit supply. The calibrated lending-cost distribution shows a large discontinuity: the last marginal securitization seller originates a volume four times larger than the next marginal holder. When the most efficient, high-volume lenders exit the securitization market, the aggregate effect is disproportionately large.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the government subsidy policy interact with adverse selection, and what are its theoretical properties?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The GSE credit guarantee is modeled as a state-contingent subsidy tau_t = alpha_G * mu_t, where alpha_G in [0,1] represents the degree of insurance provided. Any positive subsidy reduces the adverse selection wedge by moving the second cutoff leftward, expanding the mass of security buyers. A full subsidy (alpha_G = 1) completely offsets buyers&amp;rsquo; losses from default risk, stabilizing security demand regardless of household credit risk and minimizing the probability of market collapse. However, Proposition 3 establishes that a full subsidy generates inefficiently high levels of liquidity compared to the complete information benchmark: it expands the volume of MBS at lower average quality relative to an economy where low-quality loans are screened out. A full subsidy also fails to replicate complete-information allocations because the guarantee fee distorts lenders&amp;rsquo; origination decisions and raises borrowers&amp;rsquo; mortgage rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the welfare implications of the post-GFC policy changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The welfare analysis (Table 9) finds small positive but unequal welfare gains. The overall post-GFC policy changes (full subsidy plus higher guarantee fee) yield borrower welfare gains of 0.06 percent and lender welfare gains of 1.3 percent in consumption-equivalent units. Decomposing the changes: the increase in the subsidy (alpha_G from 69 to 100 percent) generates borrower welfare losses of -0.16 percent (due to higher taxes and interest rates, offset partially by lower volatility) and lender gains of 3.01 percent (from improved lending efficiency). The increase in the guarantee fee reverses some of this by generating borrower gains of 0.18 percent and lender losses of -1.53 percent. The paper characterizes these as upper bounds because the full subsidy may generate moral hazard by weakening originators&amp;rsquo; incentives to screen loan quality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and extend Justiniano et al. (2015, 2019) and Landvoigt (2016)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Justiniano et al. (2015, 2019) argue that credit supply constraints — limits on the funds available to lenders — are quantitatively more important than credit demand forces in explaining mortgage credit fluctuations. This paper provides a microfoundation for those constraints by modeling securitization as the dominant source of liquidity for lenders and deriving endogenously how adverse selection limits that liquidity. Landvoigt (2016) introduces securitization in a DSGE housing model in reduced form. This paper goes further by modeling an endogenous securitization market where lenders optimally trade off liquidity benefits against information friction costs, so security prices and mortgage rates are jointly determined rather than imposed exogenously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to the Kurlat (2013) and Bigio (2015) models of adverse selection in asset markets?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The securitization design combines Kurlat (2013)&amp;rsquo;s framework of asset creation and reallocation with two additional features specific to the TBA market: (1) the cheapest-to-deliver convention, which means sellers can select the lowest-value loans in their inventory satisfying trade terms; and (2) the non-exclusive, anonymous nature of TBA trades, which ensures a pooling price. Bigio (2015) models endogenous liquidity and the business cycle through information frictions in interbank markets. This paper extends the adverse selection approach to the mortgage market specifically and provides an equilibrium linkage between the securitization market and the credit market rather than modeling them as a single market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the non-targeted moments and how well does the model fit the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three non-targeted moments are reported (Table 5). The model generates a fraction of loan sales of 73.9 percent (data: 61.8 percent from HMDA), a correlation between loan sales and new lending of 0.86 (data: 0.90), and a mortgage spread of 178 basis points (data: 330 basis points). The loan sales fraction is somewhat above data and the spread is substantially below. For targeted cross-sectional moments (Table 6), the model closely matches the distribution of lending by quartile, with Q4 market shares of 0.957 in the model versus 0.959 in the data. For the dynamic GFC episode, the model replicates two-thirds of the 41 percent contraction in mortgage lending and the full 37 percent contraction in MBS issuance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the sources of aggregate shocks and how are they calibrated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The two exogenous aggregate state variables are household income Y_t and the variance of idiosyncratic housing valuation shocks sigma_omega_t (the proxy for mortgage credit risk). They follow a first-order joint Markov process. Income is identified using the cyclical component of disposable personal income from the flow-of-funds accounts. The variance of housing shocks is calibrated to match the national delinquency rate for loans 90+ days delinquent or in foreclosure from the National Mortgage Database (FHFA). The calibrated states produce default rates of 1.8 percent in the low-risk state and 7.9 percent in the high-risk state, with an unconditional default rate of 2.6 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key limitations and caveats of the analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several limitations are noted. First, the welfare analysis of the full subsidy is characterized as an upper bound because moral hazard — the impact of guaranteed insurance on originators&amp;rsquo; incentives to screen loan quality — is not modeled. Second, the model abstracts from other consequences of default for borrowers, such as reputation concerns and long-term credit market exclusion. Third, the paper focuses on information frictions between lenders and investors (the securitization chain), not between borrowers and lenders. Fourth, the non-targeted mortgage spread (178 bps in model versus 330 bps in data) suggests some quantitative limitations in matching all features of the credit market simultaneously. Fifth, the exercise is a structural model exercise and not empirically identified through exogenous variation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Securitization liquidity channel&lt;/strong&gt;: The mechanism by which mortgage originator funding capacity depends on their ability to sell loan portfolios in the securitization market; when securitization demand falls, originators face an endogenous liquidity shortage and reduce new mortgage lending, transmitting shocks from the MBS market to the credit market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adverse selection multiplier&lt;/strong&gt;: The amplification factor arising from private information in the securitization market: as household credit risk rises, sellers&amp;rsquo; incentives to offload low-quality loans worsen pool quality, causing buyers to demand a larger discount, which causes more lenders to withdraw from selling, creating a feedback loop that magnifies the initial shock to credit supply. Quantified at 1.5 for the GFC episode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TBA (to-be-announced) forward market&lt;/strong&gt;: The dominant trading venue for agency MBS in the U.S., accounting for over 90 percent of MBS trading volume, where the specific securities to be delivered are not identified at the trade date and sellers can deliver the cheapest eligible pool (&amp;lsquo;cheapest-to-deliver&amp;rsquo;), institutionalizing adverse selection incentives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cheapest-to-deliver convention&lt;/strong&gt;: A TBA market practice by which a seller selects and delivers the lowest-value mortgage pools in its inventory that satisfy the terms of trade, giving sellers a systematic informational advantage and incentivizing selective retention of high-quality loans.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adverse selection discount (mu_t)&lt;/strong&gt;: In this paper, the per-unit discount arising from adverse selection, defined as the endogenous equilibrium fraction of low-quality loans in the aggregate supply of traded loans (S_B_t / S_t); this fraction is determined jointly with prices and lenders&amp;rsquo; trading decisions, and rises when household default risk increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mortgage credit risk (sigma_omega_t)&lt;/strong&gt;: The standard deviation of idiosyncratic housing valuation shocks to household members, which is the exogenous aggregate state variable that drives default rates; when sigma_omega_t rises, more households fall below the default threshold, increasing the aggregate default rate and degrading the quality composition of lenders&amp;rsquo; portfolios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Joint price determination&lt;/strong&gt;: A novel equilibrium property of the model in which the mortgage interest rate (in the credit market) and the price of securities (in the securitization market) are simultaneously determined; this interdependence means that adverse selection dynamics in the securitization market directly affect the cost of credit and vice versa.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GSE credit guarantee (subsidy policy)&lt;/strong&gt;: A state-contingent subsidy tau_t = alpha_G * mu_t paid to MBS buyers, representing the credit guarantees of Fannie Mae and Freddie Mac; financed by a guarantee fee (distortionary tax on originators) and lump-sum taxes on households; alleviates adverse selection by stabilizing security demand but generates inefficiently high liquidity and fails to deliver meaningful household welfare gains.&lt;/p&gt;</description></item><item><title>Non-Tariff Barriers in the U.S.-China Trade War</title><link>https://macropaperwarehouse.com/papers/non-tariff-barriers-in-the-u.s.-china-trade-war/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/non-tariff-barriers-in-the-u.s.-china-trade-war/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Chen, Hsieh, and Song study the use of unofficial non-tariff barriers (NTBs) by China during the U.S.-China trade war of 2018–2019 and in the first year of the Phase 1 purchase agreement (2020). The central motivation is that much prior analysis of the trade war focused on announced tariff hikes, yet abundant anecdotal evidence — permit requirements for U.S. pet food, pest-inspection orders on U.S. apples and lumber, changes to pig-feed formulas reducing soybean content — points to a parallel, opaque regulatory channel. The critical puzzle the paper highlights is that China&amp;rsquo;s purchases of U.S. goods rose by 156 percent between 2019 and 2020 without any reduction in tariffs, which is only explicable if NTBs were used in reverse to favour U.S. exporters during the Phase 1 period.&lt;/p&gt;
&lt;p&gt;The paper uses Chinese customs administrative data from 2015 to July 2020, covering 946 HS-6 products aggregated by state-owned versus non-state importer and by source country. Tariff data are constructed from official Customs Tariff Commission documents listing each round of retaliatory hikes beginning April 2018. The empirical strategy proceeds in three steps. First, demand (elasticity of substitution across source countries, epsilon) and supply (gamma) elasticities are estimated by regressing changes in import quantities and CIF prices on changes in tariff rates, using product-country fixed effects so identification comes from within-product, cross-country variation in tariff changes. The identifying assumption — that tariff changes across countries are orthogonal to NTB changes and foreign supply shifts — is validated empirically. The estimated demand elasticity is epsilon = 3.36 for agriculture and 2.34 for manufacturing; supply elasticities of 42 (agriculture) and 71 (manufacturing) imply near-horizontal foreign supply curves, so essentially all the incidence of Chinese trade barriers falls on Chinese consumers.&lt;/p&gt;
&lt;p&gt;Second, NTBs are inferred as a residual: the change in U.S. import quantities relative to imports from other countries of the same HS-6 product, after netting out the estimated price and tariff effect. A normalisation sets the import-weighted average NTB change on non-U.S. source countries to zero, so the residual is attributed to U.S.-specific barriers. This procedure is run separately for non-state and state importers. The tariff-equivalent of NTBs on U.S. agricultural products faced by non-state importers rose by 0.73 log points between 2017 and 2019, while NTBs on state importers were essentially unchanged (Table 4). The weighted average NTB increase for agriculture was 0.60 log points, compared to a tariff increase of 17 percentage points (from 7.5% to 24.5%). For manufactured goods, average NTBs rose by only 0.16 log points versus a tariff increase of 9 percentage points (5.6% to 14.6%). NTBs were highly concentrated: the tariff equivalent rose by 1.0 log points for oil seeds, 1.5 log points for cereals, and 1.1 log points for ores, slag and ash. The variance of tariff-adjusted import growth across HS-6 products increased 18-fold from 0.296 (2015–2017) to 5.31 (2017–2019), and controlling for state versus non-state ownership accounts for 38% of that increase.&lt;/p&gt;
&lt;p&gt;Third, welfare effects are computed using a three-nest CES model (HS-6 products, importer firms, source countries). Tariffs harm welfare via dispersion of tariff rates across source countries; NTBs harm welfare via both the mean and dispersion of NTBs across source countries, firm types, and products, and also because — unlike tariffs — NTBs generate no fiscal revenue. The total welfare loss to China in 2019 relative to 2017 is estimated at $40 billion, of which 92% is attributable to NTBs rather than tariffs (Table 7). For agricultural products alone, NTBs account for 86% of the $12.7 billion welfare loss; for manufacturing they account for 94.1% of the $27.2 billion loss. Crucially, for a given dollar reduction in U.S. imports, NTBs impose approximately six times the welfare cost of equivalent tariff hikes (the Figure 2 text says &amp;ldquo;five times&amp;rdquo;), because NTBs (i) generate no revenue and (ii) create misallocation by applying to some importers (non-state) but not others (state-owned). By 2020 China&amp;rsquo;s welfare loss relative to 2017 widened further to $48.11 billion, as NTB reversals in agriculture were partial and manufacturing NTBs were not reversed at all. The paper also documents that the Chinese government&amp;rsquo;s choice of instrument was strategic: tariff hikes were smaller in sectors with a larger pre-war state importer share, while NTB hikes on non-state importers were larger in those same sectors, consistent with a government pursuing dual objectives of punishing U.S. exporters while protecting state-firm profits.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core identification strategy and its key assumption?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The demand elasticity (epsilon) and supply elasticity (gamma) are estimated from a system of two equations: the change in log import quantity and the change in log CIF price, both regressed on the change in log tariff rates, with product-country fixed effects and year fixed effects. The identifying assumption is that tariff changes across source countries are orthogonal to NTB changes and foreign supply shifts — i.e., China&amp;rsquo;s retaliatory tariff schedule was not systematically targeted at products where NTBs were also rising or where foreign supply conditions were deteriorating. The authors validate this assumption in two ways: (1) Appendix Figure A2 shows near-zero correlation between imputed NTB changes and tariff changes across HS-6 product-country pairs (OLS coefficient 0.014); (2) Appendix Figure A3 shows near-zero correlation between pre-war import growth (2015–2017) and post-war tariff changes (OLS coefficient -0.02), arguing against correlated foreign supply trends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How exactly are NTBs measured and what normalization is required?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;NTBs are inferred as a structural residual. From the CES demand function, the change in non-state imports of a U.S. product relative to the same product from another source country equals minus epsilon times the relative change in tariff-inclusive CIF price, minus epsilon times the relative NTB. Given estimated epsilon and data on prices and tariffs, the relative NTB (U.S. vs. other countries) is identified. To convert this into the absolute NTB on U.S. goods, the paper normalizes the import-expenditure-weighted average NTB change on all non-U.S. source countries to zero. State-importer NTBs are then backed out from the ratio of state to non-state import growth for U.S. products, using equation (7), which relies on the elasticity of substitution between state and non-state firm types (eta = 3, borrowed from Khandelwal, Schott and Wei 2013).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main threats to identification and how are they addressed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three threats are discussed. (1) Quality or supply changes specific to U.S. products: if imputed NTBs reflect deteriorating U.S. product quality rather than Chinese regulatory barriers, U.S. exports to non-China markets should also fall for the same HS-6 products. Appendix Figure A1 shows no such correlation (OLS slope 0.016, SE 0.007), confirming NTBs are China-specific. (2) Endogenous targeting of tariffs toward products also receiving NTBs (violating the orthogonality assumption): Appendix Figure A2 directly shows near-zero correlation. (3) Correlated pre-trends: Appendix Figure A3 shows no correlation between 2015–2017 import growth and 2017–2019 tariff changes, so pre-existing trends do not appear to have driven the targeting of tariffs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across firm ownership is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;NTBs fell almost entirely on non-state importers of U.S. agricultural products. Non-state NTBs rose by 0.73 log points (2017–2019) while state NTBs were essentially unchanged (Table 4, column 3 vs. column 4). The state share of Chinese agricultural imports from the U.S. roughly doubled from 19.3% in 2017 to 39.8% in 2019 (Table 2), before returning to ~20% in 2020. For imports from the rest of the world, the state share remained stable at ~20% throughout. In manufacturing, state-importer NTBs declined slightly (-0.066) while non-state NTBs rose modestly (0.023). The divergence between state and non-state importers accounts for 38% of the 18-fold increase in variance of tariff-adjusted import growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What product-level heterogeneity is found in the use of NTBs vs. tariffs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;NTBs were highly product-concentrated compared to tariffs. Table 5 shows the largest NTB increases in oil seeds (+1.006 log points), cereals (+1.492), and food industry residues (+0.688), all products where the U.S. held large pre-war import shares. For manufactured goods, the largest NTB increases occurred in ores, slag and ash (+1.106) and vehicles (+0.366). By contrast, tariff hikes were distributed more broadly across products. Table 9 shows that, across HS-6 products, (a) tariff increases were significantly smaller for products with a higher pre-war state importer share (OLS coefficient -0.202) and (b) non-state importer NTB increases were significantly larger for those same products (OLS coefficient +4.431). Both patterns hold when controlling for the U.S. import share in total imports of the product.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare framework and what are its scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare is derived from a three-level CES utility function over HS-6 products (elasticity sigma), importer firms (elasticity eta), and source countries (elasticity epsilon). Tariff revenue is rebated to consumers; NTB costs are not. The welfare cost operates through three channels: (1) tariffs raise dispersion of prices across source countries, reducing welfare with elasticity epsilon; (2) NTBs affect both the mean and the dispersion of import prices, with no offsetting revenue effect; (3) differential NTBs across firm types (state vs. non-state) add a misallocation channel scaled by eta. The framework accounts for expenditure reallocation across source countries within an HS-6 product and across HS-6 products, but not between imported and domestic Chinese goods. This last restriction means welfare losses are likely understated, as the model does not capture the cost of switching from foreign to domestic substitutes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the quantitative welfare results and how do they decompose?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Total welfare loss in 2019 relative to 2017: $40 billion. Agriculture: $12.7 billion (of which tariffs account for $1.7B and average NTBs for an additional $9.3B; differential state/non-state NTBs add a further $1.7B). Manufacturing: $27.2 billion (of which tariffs account for only $1.6B; average NTBs add $23.5B and differential NTBs a further $2.1B). NTBs&amp;rsquo; share: 92% of total (86% for agriculture, 94% for manufacturing). By 2020, the overall welfare loss widened to $48.11 billion, because partial NTB reversal in agriculture was more than offset by continued welfare losses from manufacturing NTBs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are NTBs so much more costly per dollar of import reduction than tariffs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms. First, tariffs generate revenue that is assumed to be rebated to consumers, partially offsetting their welfare cost; NTBs generate no government revenue. Second, because NTBs are unofficial and opaque, they can be and were applied selectively to non-state importers but not to state importers, creating misallocation: within an HS-6 product, some importers face artificially high effective prices while others (state firms) do not, so the aggregate consumption basket becomes inefficient. The welfare elasticity with respect to import value is approximately five to six times larger for NTBs than for tariffs (Figure 2; the abstract states six times, the Figure 2 text states five times — a minor internal discrepancy).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper show about the Phase 1 purchase agreement (2020)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In 2020 China agreed to increase purchases of U.S. goods without reducing tariffs. The paper shows this was accomplished by partially reversing NTBs. The average NTB for agricultural products fell from +0.60 log points (2017–2019) to +0.14 log points over the full 2017–2020 period, implying substantial 2020 reversal. This reversal applied exclusively to non-state importer NTBs on agricultural products; state importer NTBs and manufacturing NTBs were not reversed. The U.S. share of Chinese agricultural imports rose from 13.7% in 2019 to 17.2% in 2020 despite unchanged tariffs (Table 1), directly confirming the NTB reversal interpretation. Welfare in 2020 from agricultural imports partly recovered but remained $7.3 billion below 2017 baseline; manufacturing welfare loss persisted, yielding an overall 2020 welfare loss of $48.11 billion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to prior work on the U.S.-China trade war?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper builds most directly on Fajgelbaum et al. (2019), borrowing their IV procedure to estimate demand and supply elasticities (using tariff variation across source countries as instruments) and replicating their finding of near-horizontal foreign supply curves. It differs in focusing on Chinese consumers rather than American consumers and in measuring NTBs in addition to tariffs. It also extends Khandelwal, Schott and Wei (2013), whose analysis of state-firm export quotas motivated the state/non-state ownership dimension; the current paper inverts the logic to study selective barriers on non-state importers. Benguria and Safdie (2021) similarly find product variation in U.S. exports to China correlated with state ownership, but do not impute NTBs structurally or quantify welfare. Ma, Ning and Xu (2021) and Liu (2020) use Chinese customs data to document tariff effects on imports but do not examine NTBs. Chor and Li (2021) use night-lights data to estimate aggregate tariff exposure effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are conducted and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three main robustness exercises. (1) Falsification test: for products where high NTBs are imputed, U.S. exports to non-China markets do not fall (Appendix Figure A1, slope 0.016, SE 0.007), confirming NTBs are China-specific rather than reflecting U.S.-side supply deterioration. (2) Orthogonality check: Appendix Figure A2 shows near-zero correlation between imputed NTBs and tariff changes across product-country pairs. (3) Alternative country normalization: NTBs are estimated for the four largest non-U.S. exporters to China (Brazil, Canada, Thailand, Australia), assuming barriers on the remaining countries average zero. Brazil, Canada, and Thailand show essentially zero imputed NTB changes 2017–2019, consistent with the identifying normalization. Australia shows a modest NTB increase consistent with documented retaliations after Australia&amp;rsquo;s 2018 national security law, but far smaller than the U.S. NTB increase. Additionally, Appendix Tables A1-A3 re-run all estimates with alternative parameter values: sigma = 1 (instead of 1.47/1.25) and eta = 5 (instead of 3). All qualitative results survive: NTBs exceed tariffs in magnitude, fall disproportionately on non-state importers, and impose far larger welfare costs per dollar of import reduction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main policy implication is that opaque regulatory tools are an unusually costly instrument of trade retaliation — approximately five to six times more costly per unit of import reduction than equivalent tariffs — because they neither generate revenue nor require the same importer to bear equal costs. If the Chinese government&amp;rsquo;s objective was to punish U.S. exporters, it chose a particularly self-damaging instrument. A secondary implication concerns the Phase 1 deal: the deal&amp;rsquo;s purchase commitments were met not through tariff reductions but through NTB reversals, and those reversals were partial, selective (agriculture but not manufacturing; non-state but not state), and left China&amp;rsquo;s welfare substantially below the 2017 baseline. Scope conditions: the welfare model does not account for import-to-domestic substitution, so welfare costs are likely understated. The elasticity estimates assume CES preferences and a particular nesting structure. The NTB measurement relies on the normalisation that average barriers on non-U.S. sources did not change, which is validated but not directly observable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper reveal about the strategic logic of China&amp;rsquo;s instrument choice?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section 7 shows that Chinese authorities&amp;rsquo; instrument choice is consistent with a dual-objective government: punish U.S. exporters while protecting state-firm profits. Tariffs, which apply uniformly to all importers, harm state firms importing from the U.S. as much as non-state firms. NTBs, being unofficial and selectively enforced, can exempt state importers. Regression evidence (Table 9) confirms: tariff hikes were systematically smaller for products with higher pre-war state importer shares (coefficient -0.202, SE 0.042), while NTB hikes on non-state importers were systematically larger for the same products (coefficient +4.431, SE 0.655). These patterns hold controlling for the U.S. product share in total Chinese imports.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Non-tariff barrier (NTB)&lt;/strong&gt;: In this paper, unofficial and opaque regulatory measures — health inspections, permit requirements, informal directives to importers — that function as trade barriers but are not publicly disclosed as such and are not uniformly applied to all importing firms. Measured in tariff-equivalent units as the residual change in U.S. import share after controlling for tariff and price effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tariff-equivalent of NTBs&lt;/strong&gt;: The ad-valorem tariff rate that would produce the same reduction in import demand as the estimated NTB, derived from the structural demand equation. Expressed in log points (e.g., 0.60 log points for average agricultural NTBs in 2017–2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation from selective NTBs&lt;/strong&gt;: The welfare loss that arises specifically because NTBs are applied to non-state importers but not state importers within the same HS-6 product category. This within-product dispersion of effective prices across firms generates an allocative inefficiency absent when tariffs are used, since tariffs apply uniformly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Phase 1 purchase agreement&lt;/strong&gt;: The January 2020 U.S.-China trade deal in which China committed to purchasing specified amounts of U.S. goods in 2020–2021. The paper shows that China fulfilled these commitments by reversing NTBs rather than reducing tariffs, and that the reversal was partial, concentrated in agricultural imports by non-state firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Elasticity of substitution across source countries (epsilon)&lt;/strong&gt;: The parameter governing how sensitive Chinese import demand for an HS-6 product from a given country is to that country&amp;rsquo;s relative price. Estimated at 3.36 for agriculture and 2.34 for manufacturing using tariff variation as an instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;State vs. non-state importer&lt;/strong&gt;: The ownership classification of Chinese importing firms in the customs data. State-owned importers were largely exempt from NTBs during the trade war, while non-state (private) importers bore nearly all of the NTB increases on U.S. agricultural products. This differential application is the central mechanism generating misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare channel distinction: tariffs vs. NTBs&lt;/strong&gt;: Tariffs affect welfare only through the dispersion of prices across source countries (revenue is rebated). NTBs affect welfare through both the mean and dispersion of prices across source countries, firm types, and products, with no revenue offset. This structural distinction is why the paper finds NTBs impose approximately five to six times greater welfare cost per dollar of import reduction.&lt;/p&gt;
&lt;!-- flags: Minor internal discrepancy in paper: abstract and conclusion state NTBs impose ~6x the welfare cost of equivalent tariffs per dollar of import reduction; Figure 2 text states ~5x. Both figures are in the source text; the summary uses 'approximately six times' per the abstract/conclusion. --&gt;</description></item><item><title>Oil Prices, Monetary Policy and Inflation Surges</title><link>https://macropaperwarehouse.com/papers/oil-prices-monetary-policy-and-inflation-surges/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/oil-prices-monetary-policy-and-inflation-surges/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Gagliardone and Gertler ask why the US inflation surge that began in mid-2021 was both sudden and persistent, and whether a simple structural model can account for it without targeting inflation in estimation. The paper&amp;rsquo;s central claim is that the surge was driven primarily by the combination of large oil price shocks and accommodative (&amp;ldquo;easy&amp;rdquo;) monetary policy by the Federal Reserve, with oil complementarities and real wage rigidity as the key amplification mechanisms. Secondary factors — demand shocks and labor-market tightening — matter but do not drive the surge on their own.\n\nThe model is a New Keynesian framework with three non-standard features relative to the Blanchard-Gali (2007) benchmark: (1) oil enters both household utility and firm production as a complement rather than a substitute (elasticities of substitution estimated at ψ = 0.02 for households and ε = 0.37 for firms, both well below unity); (2) a Mortensen-Pissarides search-and-matching labor market that makes unemployment endogenous and allows shocks to matching efficiency; and (3) real wage rigidity parameterized by γ, estimated at 0.697, meaning actual wages adjust only about one-third as much as Nash bargaining wages would.\n\nEstimation uses simulated method of moments, matching model impulse responses to two sets of SVAR impulse responses identified via high-frequency external instruments: oil-price surprises around OPEC announcement dates (following Känzig 2021) and monetary-policy surprises around FOMC dates (following Gertler-Karadi 2015, extended by Bauer-Swanson 2022). The SVAR sample runs 1973:01–2019:12, with 2020–2022 reserved as an out-of-sample validation window. The model is then taken to the 2010–2022 period for a historical shock decomposition, targeting unemployment, real oil price inflation, the Federal Funds rate, and labor-market tightness; headline and core PCE inflation are left entirely untargeted and used as the key test of model fit.\n\nMain quantitative findings: the estimated elasticity of substitution between oil and labor in production is ε = 0.37 (s.e. 0.16) and between oil and consumption goods for households ψ = 0.02 (s.e. 0.34), both significantly below unity and confirming strong complementarity. Real wage rigidity γ = 0.697 (s.e. 0.145): actual wages move roughly one-third as far as Nash wages. The Calvo price parameter λ = 0.945 implies an average price duration of approximately six quarters at monthly frequency, and habit persistence h = 0.914.\n\nIn the structural VAR, a monetary tightening of 15 basis points reduces GDP by about 10 basis points (peak after ~10 months) and raises unemployment by roughly 0.5 percentage points; a 6 percent increase in the real oil price reduces GDP 20–30 basis points and raises the core PCE price level about 20 basis points. Complementarities matter quantitatively: at the estimated parameters, the peak GDP drop following an oil shock is 0.13 percent versus only 0.04 percent under Cobb-Douglas (no complementarity), and the core PCE inflation response is more than double in the benchmark. The decline in the marginal product of labor accounts for more than half the increase in marginal cost during the 2021 surge.\n\nIn the historical decomposition (2010–2022), oil shocks and easy monetary policy shocks jointly account for the bulk of the 2021–22 inflation surge; labor-market matching shocks contribute little to either unemployment variation or inflation; demand shocks dominate unemployment variation but are not the primary inflation driver in the surge. The model also explains the 2014–2019 low-inflation/low-unemployment puzzle: declining oil prices and tight money shocks kept inflation down despite a tight labor market, the mirror image of 2021–22. Baseline forecasts (as of spring 2023) under a Taylor rule with coefficient 2 project headline and core PCE declining to roughly 3 percent in about one year then converging slowly to 2 percent, with unemployment rising to approximately 5 percent (its steady state) and overshooting by about half a percentage point. A more aggressive tightening (funds rate held at 4.6 percent through September 2023) reduces inflation by about half a percentage point faster but raises unemployment by an additional persistent 1 percentage point.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for the oil and monetary policy shocks, and what are the main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both shocks are identified as external instruments in an SVAR. The oil shock uses daily surprises in oil futures prices on days of OPEC meetings (Känzig 2021): the surprise is the change in the log oil futures price between the day before the meeting and the close on the announcement day. The money shock uses surprises in the first principal component of the first four quarterly Eurodollar futures in a 30-minute window around FOMC announcements and non-FOMC Fed communication dates (Gertler-Karadi 2015, extended by Bauer-Swanson 2022). The key identifying assumption is relevance and exogeneity: each surprise must be correlated with the structural shock of interest but uncorrelated with the other structural shocks. The primary threat addressed is endogeneity between oil prices and monetary policy: oil price movements prior to FOMC meetings predict the monetary policy surprise (coefficient 0.073, s.e. 0.038), plausibly because the Fed responds systematically to energy prices. The authors regress money surprises on the monthly log change in oil spot prices and use residuals as the cleaned monetary instrument. Without this purging, the SVAR counterfactually predicts a surprise tightening raises oil prices. The authors also drop the Lehman Brothers date from the sample because confounds from the financial collapse would distort the monetary impulse response. A secondary threat is the use of a daily (rather than intraday) window for oil surprises, justified by evidence that oil markets react more slowly to OPEC announcements than financial markets react to FOMC meetings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does strong complementarity between oil and labor amplify the inflation response, and how is this mechanism isolated empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With a CES production function where ε &amp;lt; 1, firms cannot easily substitute away from oil when its price rises. The marginal product of labor declines sharply because each worker needs roughly the same amount of oil to be productive, raising marginal cost of output for any given wage. The Phillips curve then transmits this cost-push increase to inflation. The authors show analytically that the sensitivity of the marginal product of labor to the ratio of oil to labor is proportional to 1/ε: as ε falls, the oil shock&amp;rsquo;s impact on marginal cost and hence inflation rises sharply. This is isolated by comparing the benchmark model against a Cobb-Douglas version (ε = 1, ψ = 1): peak GDP decline is 0.13 percent with complementarities versus 0.04 percent without; the unemployment response is large and persistent only with complementarities; and the core PCE inflation response is more than double in the benchmark. The historical decomposition further shows that the decline in the marginal product of labor accounts for more than half the increase in marginal cost during the 2021 surge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does real wage rigidity play, and what is the resulting inflation-unemployment trade-off?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Real wage rigidity introduces a cost-push term into the Phillips curve. Without rigidity (γ = 0), the Nash bargaining wage absorbs the oil shock, and the central bank can achieve both price stability and efficient employment simultaneously. With γ = 0.697, actual wages fall by only about one-third as much as Nash wages after an oil shock. The gap between Nash and actual wages enters the Phillips curve as a cost-push term Δt. If the central bank tries to stabilize prices, it must contract demand enough to push the efficient component of marginal cost negative, forcing output and unemployment well below the flexible-price equilibrium — in the model, pursuing price stability after an oil shock causes output and unemployment to deviate from the flexible-price benchmark by more than double over the first 8–10 months. This trade-off rationalizes partial monetary accommodation and is quantitatively important for matching the historical behavior of inflation in 2021–22.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the historical shock decomposition work, and what are its key identifying assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use the estimated DSGE model with the Kalman smoother to perform a historical shock decomposition over 2010–2022. They estimate persistence and standard deviations of four shocks (demand εbt, monetary policy εrt, oil εst, and matching efficiency εΦt) using Bayesian methods, targeting four observable series: unemployment, real oil price inflation, the Federal Funds rate, and labor-market tightness from JOLTS. Nominal variables — headline PCE, core PCE, nominal wage growth, real product wage growth — are entirely untargeted and serve as out-of-sample validation. One important wrinkle is that the spot oil price contains high-frequency speculative volatility that does not pass through to the prices households and firms face. The authors filter this by assuming nominal oil price inflation equals PCE energy inflation plus an i.i.d. speculation shock, so that only the persistent component enters real allocations. The posterior mean of the speculation shock standard deviation (σm = 0.239) is substantially larger than that of the persistent oil shock (σo = 0.042), confirming the filter&amp;rsquo;s importance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What sub-sample variation is documented, and what explains it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model resolves three sub-sample puzzles. First, the 2014–2019 period had low unemployment but persistently low inflation — the model attributes this to declining oil prices and tight monetary policy shocks that offset demand pressures and kept marginal cost subdued. Second, the 2010–2012 period had rising oil prices but also low inflation — attributable to a large negative demand shock from the Great Recession lingering, which depressed marginal cost sufficiently to offset the oil price effect. Third, the high labor-market tightness of 2022 is shown to be largely an endogenous response to easy monetary policy and oil shocks rather than an autonomous labor supply shock. The matching shock does not materially contribute to either unemployment variation or inflation over the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Taylor rule coefficient: calibrating ϕπ to 1.5 instead of 2 adds roughly 0.5 percentage points to PCE inflation at the peak of the 2022 surge due to money shocks but does not change qualitative conclusions. (2) Matching shock persistence: results are robust to calibrating persistence to 0.9 or 0.95 instead of the estimated 0.548, confirming that the matching shock&amp;rsquo;s minimal contribution to inflation is not an artifact of low persistence. (3) Unemployment demeaning: using 6 percent instead of 5 percent does not change results. (4) Oil price speculation filter: removing the filter has only minor quantitative effect because anomalous spike-and-reversal days are few. (5) Monetary policy shock orthogonalization: without purging oil-price predictability from the money surprise, the SVAR counterfactually predicts tightening raises oil prices, confirming the necessity of the adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from Blanchard and Gali (2007)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper descends most directly from Blanchard-Gali (2007), which also features oil in a New Keynesian model with real wage rigidity. Key differences: (i) Gagliardone-Gertler make oil a complement rather than a substitute or Cobb-Douglas input in both utility and production, which they argue is necessary to match quantitatively the observed impact of oil shocks on inflation; (ii) they incorporate a Mortensen-Pissarides search-and-matching labor market with endogenous unemployment, enabling labor-market tightness to function as a separate inflation driver; (iii) they estimate the model formally by matching SVAR impulse responses to externally identified shocks rather than calibrating; and (iv) they apply the model specifically to explaining the 2021–22 inflation surge. The real wage rigidity mechanism is retained from Blanchard-Gali as a central feature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to the broader literature on the 2021–22 inflation surge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper explicitly positions itself against work emphasizing supply chain disruptions and goods-sector reallocation (Guerrieri et al. 2021, Di Giovanni et al. 2022, Ferrante et al. 2023) as the main drivers of 2021 inflation. The authors accept that supply chains mattered in 2021 but argue they moderated by end of 2021 while inflation persisted through 2022, so their framework targets the more durable sources. Papers closer in spirit emphasize monetary policy (Ball et al. 2022, Amiti et al. 2022, Benigno-Eggertsson 2023, Pflueger 2023), but Gagliardone-Gertler differ by using a structural DSGE model estimated to identified shocks and by giving oil shocks a prominent co-equal role alongside monetary accommodation. Lorenzoni and Werning (2023) share the emphasis on production complementarities and wage rigidity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The primary policy implication is that the 2021–22 inflation surge was jointly caused by oil shocks and monetary accommodation, and unwinding it involves a short-run cost in real activity due to the inflation-unemployment trade-off generated by real wage rigidity. The baseline forecast is slow convergence to 2 percent inflation with a quasi soft landing: headline and core PCE reaching roughly 3 percent in about one year then declining slowly, and unemployment rising to 5 percent steady state and overshooting by about half a percentage point. A more aggressive tightening (funds rate at 4.6 percent through September 2023) brings inflation to 2 percent faster by about half a percentage point by June 2023 but at the cost of an additional persistent unemployment increase of about 1 percentage point. Scope conditions: (i) results depend critically on long-run inflation expectations remaining anchored at 2 percent — if expectations drift to 3 percent, the disinflation task becomes harder; (ii) the model abstracts from supply chain disruptions, downward nominal wage rigidity, and open-economy channels; (iii) the quantitative conclusions rest on estimated complementarities that carry large standard errors, especially for household oil complementarity ψ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of labor-market tightness as an inflation driver in this framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Labor-market tightness (θt = vt/ut) raises marginal cost through two channels: it increases net hiring costs (a tighter market requires more vacancies to fill a given number of positions, raising the per-hire cost) and it raises the Nash bargaining wage (because unemployment becomes less painful, improving workers&amp;rsquo; outside option). In the historical decomposition, however, the matching efficiency shock — the exogenous source of tightness variation — contributes negligibly to both unemployment variation and inflation over the 2010–2022 sample. The high tightness of 2022 is shown to be largely an endogenous response to easy monetary policy and oil shocks rather than an autonomous labor-supply disruption. This finding challenges the narrative that autonomous labor-market tightening was a primary independent cause of the inflation surge.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Oil complementarity (ε, ψ)&lt;/strong&gt;: In the paper&amp;rsquo;s CES framework, oil is a complement when the elasticity of substitution with labor in production (ε) or with consumption goods for households (ψ) is below unity. A value below unity means that when oil becomes scarce, the marginal productivity of labor (or marginal utility of other consumption) falls more than proportionally, amplifying the macroeconomic impact of oil price shocks. Estimated values of ε = 0.37 and ψ = 0.02 imply strong complementarity in both sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real wage rigidity (γ)&lt;/strong&gt;: A parameter ∈ [0,1] measuring how sticky the actual real wage is relative to the Nash bargaining wage. With γ = 0.697, the actual wage moves only about one-third as far as the Nash wage in response to a shock (wqt = (w°qt)^{1−γ}(wq)^γ). This is adopted as a reduced-form mechanism — not derived from deeper frictions — that generates realistic unemployment volatility and introduces a short-run inflation-unemployment trade-off absent from fully flexible-wage models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost-push term (Δt)&lt;/strong&gt;: The component of inflation in the Phillips curve that arises purely from the gap between actual wages and Nash bargaining wages when real wage rigidity is present. Equals −κγ times the deviation of the Nash wage from steady state. It is the mechanism through which oil supply shocks create an inflation-unemployment trade-off: even if the central bank stabilizes the efficient component of marginal cost, the cost-push term generates inflation, and offsetting it requires contracting demand below the efficient level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Impulse-response matching estimation&lt;/strong&gt;: The paper&amp;rsquo;s estimation procedure: simulated method of moments minimizes the weighted squared distance between model-implied impulse responses and SVAR-estimated impulse responses to externally identified oil and monetary shocks. Precision weights from the SVAR IRF confidence bands determine which moments receive more weight. Confidence intervals for structural parameters are obtained via the delta method. This approach ensures the model can simultaneously explain the dynamics following both supply (oil) and demand (monetary) disturbances.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Easy monetary policy shock&lt;/strong&gt;: A negative realization of the monetary policy shock εrt in the Taylor rule, representing the actual Federal Funds rate falling below what the estimated Taylor rule coefficient on inflation would prescribe. In the historical decomposition, such shocks from roughly mid-2020 onward are attributed substantial responsibility for low unemployment and upward pressure on inflation in 2021–22, distinct from endogenous policy responses to demand or oil shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Speculation shock (εmt)&lt;/strong&gt;: An i.i.d. component of nominal oil price changes that is not reflected in the PCE energy price index and therefore does not pass through to real allocations in the model. Introduced to prevent high-frequency gyrations in spot oil prices (attributed to financial-market speculation) from generating counterfactually large macroeconomic swings. Its estimated standard deviation (posterior mean 0.239) is substantially larger than that of the persistent structural oil shock (0.042).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Historical shock decomposition (untargeted nominal variables)&lt;/strong&gt;: The primary empirical test of the model: after estimating shocks from four targeted real/financial series (unemployment, real oil price inflation, Federal Funds rate, labor-market tightness), the model constructs predicted paths and shock contributions for headline PCE inflation, core PCE inflation, nominal wage growth, and real product wage growth — none of which were targeted in identification. Agreement between model predictions and data for these untargeted nominal variables is the main evidence that the model correctly identifies the sources of the inflation surge.&lt;/p&gt;</description></item><item><title>On the Effects of Monetary Policy Shocks on Income and Consumption Heterogeneity</title><link>https://macropaperwarehouse.com/papers/on-the-effects-of-monetary-policy-shocks-on-income-and-consumption-heterogeneity/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-the-effects-of-monetary-policy-shocks-on-income-and-consumption-heterogeneity/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks how conventional and informational monetary policy shocks affect the cross-sectional distributions of labor earnings, consumption, and financial income in the United States. The motivation is the growing concern, particularly in the aftermath of the global financial crisis, about distributional consequences of central bank actions. Existing studies either include scalar inequality statistics in standard VARs — losing information about the full distribution — or rely on indirect approaches that hold household portfolio compositions fixed. Chang and Schorfheide instead apply the functional VAR (fVAR) framework developed in Chang, Chen, and Schorfheide (2024, JPE forthcoming) that stacks macroeconomic aggregates alongside the full time-varying cross-sectional density, represented as a log probability density function approximated via a cubic-spline sieve. This allows simultaneous, internally-consistent IRFs for percentiles, Gini coefficients, 90-10 ratios, standard deviations, and other distributional statistics without the risk of quantile crossings.&lt;/p&gt;
&lt;p&gt;The earnings analysis uses monthly micro data from the Current Population Survey (CPS), sample period 1990:M2 to 2016:M12. The consumption and financial income analyses use quarterly Consumer Expenditure Survey (CEX) data from 1990:Q2 to 2016:Q4. Monetary policy shocks are identified via the Jarocinski-Karadi (2020) high-frequency instruments — surprises in the three-month fed funds futures and in S&amp;amp;P 500 index — used as internal instruments in the structural VAR. The instruments isolate (a) conventional monetary policy shocks (interest rate surprise, stock price opposite direction) and (b) informational shocks (interest rate and stock price surprise in the same direction). Sign restrictions set-identify the two shocks. Bayesian estimation uses a Chan (2022) Normal-Inverse Gamma prior suitable for high-dimensional VARs; model selection (sieve order K, lag length p, hyperparameters) is done by maximizing the marginal data density (MDD). The shock normalization corresponds to an unanticipated 25-basis-point cut in the three-month federal funds rate.&lt;/p&gt;
&lt;p&gt;Main quantitative findings:&lt;/p&gt;
&lt;p&gt;Earnings (conventional shock): An expansionary shock reduces earnings inequality, primarily through the employment (extensive) margin. At the posterior median, the 10th earnings percentile rises by up to 5% relative to steady state, the 20th percentile by up to 1%, while the 80th and 90th percentiles are essentially unaffected. The Gini coefficient for labor earnings falls from approximately 0.431 to 0.428 over a 36-month horizon. The 90-10 earnings ratio falls from approximately 12.27 to 11.76 after 36 months. These effects are driven almost entirely by individuals moving from unemployment into employment (the point mass at zero in the earnings distribution falls as the unemployment rate drops by approximately 0.3 percentage points at the posterior median after three years). When the unemployed point mass is excluded from the inequality computation, the inequality effect is small and short-lived, confirming that the employment channel dominates. The estimated Gini drop of 0.001–0.003 is broadly consistent with the HANK model of Ma (2021) with indivisible labor, which predicts a drop of approximately 0.001 for a comparable shock.&lt;/p&gt;
&lt;p&gt;Consumption (conventional shock): The expansionary shock generates a weakly positive (inequality-increasing) effect on consumption inequality at the posterior median, but with wide credible bands that span both positive and negative values. The cross-sectional standard deviation of consumption, the 90-10 ratio, and the Gini coefficient all peak upon impact and remain above steady state. The slight increase appears concentrated in durable goods expenditure; nondurable and service consumption inequality shows little response at the posterior median. The contrast with the earnings result reflects: (i) only labor income is captured in the earnings analysis, while wealthy households&amp;rsquo; capital income (rising with equity and bond prices) also rises; (ii) potentially higher interest-rate sensitivity of high-consumption households.&lt;/p&gt;
&lt;p&gt;Financial income (conventional shock): No statistically significant effect on financial income inequality. The cross-sectional standard deviation and Gini coefficient of financial income do not respond to the shock. An important caveat is that the CEX misses the top-10 percent of households by financial income (visible from CDF comparison with the Survey of Consumer Finances in 2012). The households most likely to benefit from equity and bond price appreciation — captured in other studies — are absent from the sample.&lt;/p&gt;
&lt;p&gt;Informational shock: A negative informational shock (unexpected simultaneous drop in interest rates and stock prices, signaling worse-than-expected output) increases earnings inequality, mainly via a rise in unemployment. The 10th earnings percentile drops by about 2% at the posterior median. Consumption inequality, by contrast, shows the opposite pattern: the 90-10 ratio and Gini coefficient for consumption decrease, and the posterior median responses are negative, though uncertainty is substantial.&lt;/p&gt;
&lt;p&gt;Policy implication: The authors conclude that earnings inequality effects of conventional monetary policy are well-proxied by the unemployment rate response, so standard macro indicators subsume the distributional information for earnings. The small and highly uncertain responses of consumption and financial income inequality provide, in their view, support for central banks continuing to focus primarily on macroeconomic aggregates.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for monetary policy shocks and what are the main threats to validity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses the Jarocinski-Karadi (2020) high-frequency instruments as internal instruments in a structural VAR. The two instruments are surprises in the three-month federal funds futures (ff4_hf) and surprises in the S&amp;amp;P 500 index (sp500_hf), measured in narrow windows around FOMC announcements. Sign restrictions separate two shocks: a conventional shock is identified by an interest rate increase combined with a stock price fall; an informational shock by both increasing. The key assumptions are instrument relevance (the instruments are correlated with the policy shocks) and instrument validity (the instrument innovations are uncorrelated with non-policy structural shocks). As a robustness check the authors also use the Nakamura-Steinsson (2018) instruments and report very similar results. The main threat to validity is the standard one for external-instrument SVARs: the instruments may capture other economic news released simultaneously with FOMC decisions, violating the exclusion restriction. The informational shock identification partially addresses this by explicitly modeling the central bank&amp;rsquo;s information revelation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the functional VAR approach and why is it preferred over simpler alternatives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The functional VAR stacks macroeconomic aggregates Yt with the time-varying cross-sectional log-density of micro outcomes. The log-density is approximated by a finite-dimensional linear sieve (cubic spline basis of order K). Sieve coefficients are estimated period-by-period by maximum likelihood from the cross-section, then treated as observations in a standard VAR. The MDD selects K, lag order p, and Minnesota-type hyperparameters jointly. Compared to simply including a few inequality statistics in a VAR, the functional approach (a) derives a single coherent model from which arbitrarily many distributional statistics can be computed without quantile crossings; (b) achieves tighter credible intervals by efficiently compressing cross-sectional information through the sieve; (c) avoids the problem of internally inconsistent forward projections of stacked quantile VARs. Compared to indirect approaches (e.g., McKay-Wolf 2023), it does not require the assumption that household income or portfolio composition is fixed in response to the shock. Compared to panel approaches, it does not require high-frequency panel data, which are unavailable for the US at relevant horizons.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the earnings distribution modeled to handle unemployment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The earnings distribution is treated as a mixture of a point mass at zero (representing unemployed individuals, whose weight equals the CPS-based unemployment rate) and a continuous part (the density of positive earnings of employed individuals, normalized to integrate to one minus the unemployment rate). The sieve density is estimated only from the positive-earnings observations, with a top-coding adjustment for right-censored values. The unemployment rate is included separately as an aggregate variable in the Yt vector. This mixture representation allows the analysis to separately identify the extensive-margin (employment) channel — changes in the probability mass at zero — from the intensive-margin channel (changes within the positive-earnings density). The key finding is that inequality effects are driven almost entirely by the extensive margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in earnings responses is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In percentage terms, the expansionary monetary policy shock has the largest impact at the 10th earnings percentile (posterior median response of 0 to 5%), capturing workers moving out of unemployment. The 20th percentile rises by 0 to 1%. The 80th and 90th percentiles show essentially zero response. Earnings above 2 times GDP per capita (roughly twice the labor share of GDP per capita) are essentially unaffected. When the point mass at zero is excluded and only the continuous part of the earnings distribution is analyzed, the effect on inequality statistics (Gini, 90-10 ratio) is small and short-lived, confirming that the heterogeneous response across the full distribution is driven almost entirely by the employment transition at the bottom.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in consumption responses is documented, and why might consumption inequality rise while earnings inequality falls?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At the posterior median, both the 10th and 20th consumption percentiles initially rise above steady state (h=1), then fall 0.9% to 1.3% below baseline from h=5 onwards. The 80th and 90th percentile responses are quantitatively similar in shape but slightly larger in magnitude, leading to a weakly positive net inequality effect. The Gini coefficient and 90-10 ratio for consumption peak upon impact and stay above steady state. The authors offer two explanations for the inequality-increasing result despite earnings inequality falling: (i) wealthy households also earn substantial capital income (equities, bonds) that rises with the expansionary shock, boosting their total resources and hence consumption, a channel not captured by earnings alone; (ii) higher-consumption households may have more interest-rate-sensitive consumption decisions (larger direct Euler-equation effect), or may be wealthy hand-to-mouth consumers with high MPCs. The component analysis shows the increase is concentrated in durable goods, while nondurable and services Gini responses are near zero at the posterior median.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the financial income analysis find and what data limitation is most important?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The financial income distribution estimated from the CEX shows no statistically significant response to either the level or inequality of financial income following a conventional monetary policy shock. The cross-sectional standard deviation and Gini coefficient of financial income are essentially flat. The most important caveat is that the CEX substantially underrepresents high-financial-income households. A CDF comparison with the Survey of Consumer Finances for 2012 shows that the CEX misses the top-10 percent of households by financial income. These are precisely the households most likely to experience capital gains from equity and bond price appreciation following an interest rate cut. The fraction of households with essentially zero financial income (the point mass κt) fluctuates between 0.65 and 0.82 over the sample, so the analysis is largely capturing the lower 65–82 percent of the financial income distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the informational shock and how do its distributional effects differ from the conventional shock?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An informational shock is defined as an unanticipated change in interest rates that conveys private central-bank information about the state of the economy — for example, a rate cut that signals the central bank expects worse output and prices than the public. It is identified by the simultaneous drop in interest rates and stock prices, the opposite pattern from the conventional shock. Aggregate effects: real GDP drops approximately 20 basis points and unemployment rises up to 0.15 percentage points after one year. Earnings distributional effects are roughly the mirror image of the conventional shock: the 10th earnings percentile drops about 2% at the posterior median, while other percentiles change little. The Gini coefficient and 90-10 ratio for earnings rise in the long run, driven by the increase in unemployment. Consumption distributional effects are different: relative consumption at the 10th and 20th percentiles rises, while the 90th percentile falls slightly, so consumption inequality (90-10 ratio, Gini) decreases. However, since aggregate consumption also falls, the rise in relative consumption at the bottom does not imply an absolute gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from Coibion, Gorodnichenko, Kueng, and Silvia (2017)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;CGKS (2017) include inequality statistics directly in a VAR and use the Romer-Romer shock measure. For earnings, they find the Gini coefficient rises by about 0.0025 per 100bp contractionary shock (i.e., falls by 0.0025 for an expansionary shock); adjusting for shock size this is slightly smaller than the Chang-Schorfheide estimate of a 0.001–0.003 Gini drop per 25bp expansionary shock (which scales to 0.004–0.012 per 100bp). For consumption, CGKS find that inequality decreases in response to an expansionary shock, the opposite sign from Chang-Schorfheide&amp;rsquo;s posterior-median result (weakly increasing). The discrepancy may reflect: (i) the functional approach&amp;rsquo;s more flexible modeling of the full distribution versus using a single Gini; (ii) differences in shock identification (Romer-Romer vs. JK instruments); (iii) sample period differences. The wide credible bands in the consumption result mean the two findings are not statistically inconsistent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are conducted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors run the following robustness exercises: (i) Nakamura-Steinsson (2018) instruments instead of Jarocinski-Karadi (2020) for the earnings VAR — results are very similar. (ii) Model selection across sieve order K ∈ {4,6,8,10} and lag length p ∈ {1,2,3,4} via MDD maximization, confirming that results are robust to the choice of approximation order. (iii) For the earnings inequality analysis, the paper explicitly separates the contribution of the employment margin from the wage distribution within employment, by recomputing inequality statistics excluding the point mass at zero — confirming that the employment channel dominates. (iv) Comparison of aggregate IRFs across all four model specifications (aggregate VAR, earnings fVAR, consumption fVAR, financial income fVAR) showing that inclusion of cross-sectional data does not substantially alter inference about aggregate variables. (v) Comparison with time-aggregated monthly-to-quarterly rescaled IRFs to validate that monthly and quarterly specifications produce consistent results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the scope conditions and limitations of the findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Key scope conditions: (a) The sample runs through 2016:Q4/M12, so the post-2016 period and the 2020 pandemic episode are excluded. (b) The paper uses repeated cross-sections rather than a panel, so it directly estimates how the cross-sectional distribution evolves but cannot separately identify cohort effects, individual trajectories, or nonlinearities in unit-level histories. (c) The CEX substantially misses high-financial-income households, making the financial income results inapplicable to the top 10% of the financial income distribution. (d) The functional VAR models the unconditional distribution; it does not identify heterogeneous responses by subgroup in the sense of comparing specific groups (e.g., mortgagors vs. owners) as pseudo-panel approaches do. (e) The approach identifies the average linear response to a 25bp shock; nonlinear or asymmetric effects (large shocks, ZLB periods) are not modeled. (f) The simultaneous drop in earnings inequality and (weakly) rising consumption inequality cannot be fully reconciled without a complete model including capital income; the paper acknowledges this limitation explicitly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the quantitative results compare to the Ma (2021) HANK model benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Ma (2021) incorporates an indivisible labor supply mechanism into a HANK model and shows that an expansionary monetary policy shock raises wages, inducing low-productivity workers to enter the labor market, raising earnings in the left tail. His calibration produces a Gini coefficient drop of approximately 0.001 for a comparable shock (scaled from his Figure 3: −0.4/(4×100) = −0.001 on a 0-to-1 scale for a 100bp shock). The Chang-Schorfheide empirical estimate is a drop of between 0.001 and 0.003 for a 25bp shock, which is broadly consistent with Ma&amp;rsquo;s model. The qualitative mechanism — earnings inequality reduction driven by low-productivity workers transitioning out of unemployment — is also consistent with the Chang-Kim (2006) heterogeneous-agent model with indivisible labor, which generates a negative correlation between idiosyncratic productivity and reservation wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications for central banks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper provides semi-structural empirical evidence relevant for central banks concerned about distributional effects. The main conclusion is that for labor earnings inequality, the distributional effect of conventional monetary policy is well-summarized by the unemployment rate response: reducing unemployment compresses earnings inequality, and a central bank that targets unemployment de facto targets earnings inequality. The small, uncertain, and sometimes-positive effects on consumption and financial income inequality suggest that tracking these additional distributional statistics adds little actionable information beyond what standard macro aggregates already convey. The authors therefore conclude that there is an empirical case for central banks to continue focusing on macroeconomic aggregates. An important qualifier is that the financial income results are constrained by CEX top-coding, so the analysis cannot speak to very-high-income households&amp;rsquo; welfare.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Functional VAR (fVAR)&lt;/strong&gt;: A vector autoregression in which macroeconomic aggregates are stacked with the full cross-sectional log-probability density function of micro outcomes. The log-density is approximated by a finite-dimensional sieve (cubic spline basis), with sieve coefficients estimated period-by-period from cross-sectional data and then entered as observations in a linear VAR. This yields coherent IRFs for the entire distribution — percentiles, Gini, 90-10 ratio, etc. — from a single model, avoiding the quantile-crossing inconsistency of stacked-quantile approaches.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employment channel (extensive margin)&lt;/strong&gt;: In this paper, the mechanism by which an expansionary monetary policy shock lowers earnings inequality: it reduces the unemployment rate, moving workers from a point mass of zero earnings into the positive-earnings distribution. The paper distinguishes this from the intensive margin (changes in wage rates conditional on employment), and finds empirically that the extensive margin dominates the inequality response of labor earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Informational shock (central bank information shock)&lt;/strong&gt;: As defined following Jarocinski-Karadi (2020): an unanticipated change in short-term interest rates that conveys the central bank&amp;rsquo;s private assessment of economic conditions. Identified by the simultaneous movement of interest rates and stock prices in the same direction, opposite to a conventional monetary policy shock. A negative informational shock (rates and equity prices both fall) signals that the central bank expects weaker output and prices than the public, and leads in this paper to rising earnings inequality via higher unemployment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Point mass at zero (earnings distribution)&lt;/strong&gt;: The concentration of probability mass at zero earnings, corresponding to the fraction of individuals in the labor force who are unemployed (the CPS-based unemployment rate). The total earnings density is modeled as a mixture of this point mass and a continuous density for positive earnings. The IRF for the point mass is the IRF for the unemployment rate; including it in inequality computations is necessary to capture the full distributional effect of employment transitions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Log probability density function (log-pdf) sieve representation&lt;/strong&gt;: The modeling device that represents each period&amp;rsquo;s cross-sectional distribution as the logarithm of a probability density, approximated by a finite linear combination of cubic spline basis functions (order K chosen by MDD). Working in log-pdf space avoids non-negativity and monotonicity constraints, enabling coherent linear propagation through the VAR law of motion; the density is recovered by exponential normalization in each period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal data density (MDD) model selection&lt;/strong&gt;: The Bayesian integrated likelihood used in this paper to jointly select the sieve approximation order K, lag length p, and Minnesota-type hyperparameters. The MDD balances in-sample fit (the log-spline likelihood) against a dimensionality penalty, thereby avoiding overfitting. A key result is that the preferred earnings fVAR uses K = 10 with a single lag, while the smoother consumption distribution is adequately captured with K = 6.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;κt (financial income point mass)&lt;/strong&gt;: The time-varying fraction of households in the CEX with financial income below a threshold x (set at the 10th percentile of pooled standardized financial income ≈ 0.0014 of the capital share of per-capita GDP). κt fluctuates between 0.65 and 0.82 over 1990–2016, meaning 65–82 percent of households have negligible financial income in a given quarter. The CEX data constraint — missing the top-10 percent of high-financial-income households — is the principal limitation on the financial income analysis.&lt;/p&gt;</description></item><item><title>On the elasticity of substitution between labor and ICT and IP capital and traditional capital</title><link>https://macropaperwarehouse.com/papers/on-the-elasticity-of-substitution-between-labor-and-ict-and-ip-capital-and-traditional-capital/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-the-elasticity-of-substitution-between-labor-and-ict-and-ip-capital-and-traditional-capital/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper estimates the elasticities of substitution between labor, information and communication technology (ICT) and intellectual property (IP) capital, and traditional capital using a nested constant elasticity of substitution (CES) production function. The motivation is twofold: standard macroeconomic models aggregate all capital into a single input and thus miss potentially distinct substitution relationships, and competing estimates of the labor-capital elasticity of substitution diverge sharply — with some finding gross substitutability (Karabarbounis and Neiman 2013) and others gross complementarity (Glover and Short 2020) — leaving unexplained the observed decline in labor income share across advanced economies.&lt;/p&gt;
&lt;p&gt;The data come from the 2023 release of the EU KLEMS database for nine Euro Area economies (Austria, Belgium, Finland, France, Germany, Italy, Netherlands, Portugal, and Spain) over 1996-2020 (with Germany ending in 2019 and Portugal starting in 2001). The nesting structure places an ICT-IP capital aggregate (itself a CES nest of ICT equipment and IP capital, which includes software, databases, patents, and R&amp;amp;D capital) together with labor in an inner nest, and that combined aggregate is then nested with traditional capital in an outer nest. The rationale for grouping ICT and IP capital is their joint and complementary use — computers and software — and the observation that roughly 25% of granted patents in the sample period are ICT-related. Estimation follows the normalized CES methodology of Grandville (1989), Klump, McAdam, and Willman (2007), and Leon-Ledesma, McAdam, and Willman (2010), which jointly estimates the logged and normalized production function together with its first-order conditions using feasible generalized nonlinear least squares, weighting by country-year employment shares and correcting for heteroscedasticity and serial correlation. This approach is preferred because normalization anchors the point elasticity at sample averages and Monte Carlo evidence shows it outperforms first-order-condition-only or translog alternatives, especially when identifying factor-augmenting technological change alongside substitution elasticities.&lt;/p&gt;
&lt;p&gt;The main results (Table 4, column 1) are as follows. The elasticity of substitution between labor and traditional capital (ε1) is estimated at 0.745 (standard error 0.009), statistically significantly below 1, implying gross complementarity. The elasticity between labor and the ICT-IP aggregate (ε2) is 1.187 (0.010), significantly above 1, implying gross substitutability. The elasticity between ICT and IP capital themselves (ε3) is 0.961 (0.003), significantly below 1, implying gross complementarity within the ICT-IP nest. The ICT capital-augmenting technological change parameter (γ_ICT) is estimated at 0.725, several orders of magnitude larger than the labor-augmenting parameter (γ_L = 0.003), consistent with rapid technological progress in ICT. The IP capital-augmenting parameter (γ_IP) is negative (−0.111), and the traditional capital-augmenting parameter (γ_TK) is negative but statistically insignificant (−0.002). For the US, ε2 is substantially larger at 1.712 (0.133), with ε1 = 0.724 (0.024) and ε3 = 0.922 (0.017).&lt;/p&gt;
&lt;p&gt;A counterfactual accounting exercise (fixing ICT and IP technological progress indexes and capital stocks at their 1996 levels) finds that absent these developments, labor income share would have slightly increased in European countries rather than declining, and would have declined by about 75% less in the US over the sample period. ICT accumulation and technological progress is the dominant driver of the fall: absent ICT changes alone, labor share would have risen significantly in Europe.&lt;/p&gt;
&lt;p&gt;The paper also derives the implied aggregate labor-capital elasticity (εL,K) using Hicks&amp;rsquo;s formula applied to the nested production function. The imputed εL,K for European countries ranges from approximately 1.36 to 1.43 over 1996-2020, rising through 1996-2008 and declining afterward. The US imputed values are substantially higher, ranging from approximately 2.14 to 2.37. By contrast, when the author directly estimates a two-input CES function combining labor with aggregate capital, the estimated elasticity is significantly below 1 (approximately 0.988 for European countries in the constant-CES specification), far below the imputed values. This divergence demonstrates that production function specification is consequential for identifying the labor-capital elasticity, and that models treating all capital as a single input can generate downward-biased estimates of this parameter.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author jointly estimates a normalized CES production function and first-order conditions (capital return equations and the wage equation) using feasible generalized nonlinear least squares with multiple starting points, selecting results by log likelihood, AIC, BIC, and R-squared. Normalization anchors the elasticity as a point elasticity at geometric sample averages, which is theoretically motivated and improves finite-sample identification. Main threats include: (1) endogeneity of factor inputs — the system of equations is estimated jointly but without instrumental variables, relying on non-arbitrage conditions to close the model; (2) negative estimates for γ_IP and γ_TK, which the author acknowledges may capture markups or capital underutilization rather than true technical change (Jiang and Leon-Ledesma 2018 show that omitting markups can bias the sign of capital-augmenting technology); (3) the US results are sensitive to initial values for the estimation algorithm, possibly because of the small sample size (24 observations); and (4) the counterfactual exercise abstracts from equilibrium effects and free-factor supply adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms distinguishing the three capital types, and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;ICT capital (computers, communication devices, peripherals) and IP capital (software, databases, patents, R&amp;amp;D capital) are grouped in an inner nest on the grounds of their complementary joint use. Traditional capital (machinery, transport, construction and structures) forms the outer nest. This nesting allows the elasticity of substitution between labor and the ICT-IP aggregate (ε2 &amp;gt; 1, gross substitute) to differ from the elasticity between labor and traditional capital (ε1 &amp;lt; 1, gross complement), which the paper argues is consistent with the automation literature&amp;rsquo;s emphasis on ICT displacing routine tasks. The elasticity of substitution within the ICT-IP nest (ε3 &amp;lt; 1) reflects gross complementarity between ICT equipment and IP assets (one needs software to use computers). The empirical distinction comes from the separate first-order conditions for each capital type, which link each capital&amp;rsquo;s income share to its stock and price, allowing the three elasticities to be separately identified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across countries or time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main estimates pool 9 European countries weighted by employment shares; the author does not report country-by-country elasticity estimates but does report country-level descriptive statistics (Table I in the Data Appendix). Time-series heterogeneity is addressed through the imputed aggregate elasticity εL,K, which rises from approximately 1.367 in 1996 to a peak around 1.388-1.426 near 2008 (varying across the sensitivity columns of Table 6) and then declines to approximately 1.369-1.411 by 2020. The US elasticities are systematically higher than the European ones (εL,K ranging approximately 2.14-2.37 for the US vs. 1.36-1.43 for Europe; ε2 = 1.712 for the US vs. 1.187 for Europe). The time-varying aggregate capital specification in Table 7 shows the estimated ε1 for European countries follows an inverted-U shape over the sample period, while the US estimate shows the contrary pattern (though the latter is imprecise due to the small sample).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper estimates two alternative CES nesting structures (equations 20 and 21, reported in columns 2 and 3 of Table 4) to assess sensitivity to the nesting assumption. In specification (20), labor and traditional capital are nested first and then combined with the ICT-IP aggregate, so the elasticity between labor and ICT-IP equals that between traditional capital and ICT-IP. In specification (21), the different capital types are nested first and then combined with labor. Both alternatives confirm that ICT and IP capital are gross substitutes for labor. The paper also estimates a two-input labor-aggregate capital function in three variants: constant CES, elasticity as a linear function of compensation shares and relative prices, and elasticity as a quadratic polynomial of time (Table 7). Results using US data from the EU KLEMS database are reported separately (column 4 of Table 4 and columns 8-9 of Table 6). The imputed εL,K is further verified using data counterparts of the compensation shares rather than model-predicted shares (column 7 of Table 6), yielding essentially identical results with higher variability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Relative to Karabarbounis and Neiman (2013), this paper agrees that labor and aggregate capital are gross substitutes (imputed εL,K &amp;gt; 1) and that capital deepening drives the labor share decline, but attributes the mechanism specifically to ICT and IP capital accumulation rather than the fall in all capital prices. It contrasts with Glover and Short (2020), whose below-1 estimates the paper reconciles by showing that treating all capital as a single input biases the aggregate elasticity downward. Relative to Eden and Gaggl (2018, 2019), who use US data and find ICT (including software) substitutes for labor in first-order-condition-only estimates, this paper adds normalization and biased technical change parameters and uses European panel data, and also separates ICT equipment from IP/software. Relative to Koh, Santaeulalia-Llopis, and Zheng (2020), who perform an accounting exercise attributing the labor share decline to IP capital capitalization, this paper provides structural estimates of substitution elasticities and corroborates the IP capital importance. Relative to Aum and Shin (2024), who use Korean firm-level data and find software substitutes for labor while ICT equipment complements it, this paper uses a different nesting (ICT and IP grouped together) and European aggregate data, and finds the combined ICT-IP aggregate is a gross substitute for labor — consistent with Aum and Shin&amp;rsquo;s software result driving the within-nest finding. The normalization approach distinguishes the paper from Antras (2004) and earlier aggregate studies that estimate only first-order conditions (which can produce upward-biased elasticity estimates when biased technical change is omitted).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find about the source of the labor share decline, and what are the scope conditions on this result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The counterfactual exercise (Section 4.2, Panel B of Table 3) finds that absent ICT and IP capital technological progress and accumulation, labor income share would have slightly increased in European countries over 1996-2020 rather than falling. Absent ICT changes alone, labor share would have risen significantly in Europe. The ICT-driven decline is the dominant contributor. By contrast, absent IP capital trends, labor share would have fallen substantially more (suggesting IP capital compensation growth, when attributed to capital rather than labor, partially offsets the ICT effect on labor&amp;rsquo;s share but its own share rise is the proximate driver of labor share decline). For the US, absent ICT and IP developments, labor share decline would have been about 75% smaller. Scope conditions: this is a static accounting exercise holding free factors at initial values and abstracting from general equilibrium effects. The results apply to total industrial value added (not individual sectors) and to the nine Euro Area countries in the sample. The exercise assumes the estimated production function parameters are the correct structural parameters, and thus inherits any limitations of the identification strategy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the implication for the measured aggregate labor-capital elasticity, and why does it differ from standard estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the paper estimates a two-input (labor, aggregate capital) CES function directly, the estimated aggregate elasticity is significantly below 1 and close to estimates from Herrendorf, Herrington, and Valentinyi (2015). When it instead imputes the aggregate elasticity from the nested-CES parameter estimates using Hicks&amp;rsquo;s formula, the imputed values exceed 1 and are much larger. The paper shows analytically that εL,K &amp;gt; ε2 when the relative capital cost of ICT compared to traditional capital (pKICT&lt;em&gt;KICT / pTK&lt;/em&gt;TK) takes sufficiently low values, which is the case in the data. This divergence arises because the single-input capital specification conflates the high substitutability of labor with ICT-IP capital and the low substitutability with traditional capital, yielding a biased estimate that depends on the capital composition. The paper concludes that production function specification is consequential for identifying the aggregate labor-capital substitution elasticity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key data features that drive the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;ICT investment prices fell at an average annual rate of -4.6% relative to value added prices over the sample, while IP and traditional capital investment prices changed by -0.3% and +0.1% per year, respectively. Real ICT capital stocks grew at 4.9% per year, versus 3.4% for IP capital and 1.6% for traditional capital. ICT and IP capital depreciate rapidly (20.1% and 24.1% per year) compared to traditional capital (3.6%). These patterns imply computed rates of return on ICT capital that were very high at the start of the sample (131% in 1996, largely reflecting the fall in ICT prices that year) and fell sharply to 24% by 2020. The average share of labor and ICT-IP compensation in value added is approximately 71%, with labor making up about 92% of that combined share. The ICT share within the ICT-IP nest is about 21%, meaning IP capital compensation is substantially larger than ICT capital compensation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Allen-Uzawa elasticity of substitution&lt;/strong&gt;: A point elasticity measuring the percentage change in the ratio of two inputs in response to a percentage change in their price ratio, holding output and other input prices constant. In this paper, it is estimated as a structural parameter of the nested CES production function, normalized at sample geometric averages; values above 1 imply gross substitutability and values below 1 imply gross complementarity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Normalized CES production function&lt;/strong&gt;: A CES specification that is indexed to sample averages of output and inputs so that the elasticity of substitution is defined as a point elasticity at those averages. This normalization, following Grandville (1989) and Leon-Ledesma et al. (2010), facilitates identification of both elasticity parameters and factor-augmenting technological change parameters, avoiding the conflation that arises in unnormalized specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross substitutes / gross complements&lt;/strong&gt;: Two inputs are gross substitutes (elasticity of substitution &amp;gt; 1) if a fall in the relative price of one leads to a rise in the share of cost devoted to it, reducing the other input&amp;rsquo;s cost share. They are gross complements (elasticity &amp;lt; 1) if a fall in relative price instead reduces cost share. In this paper, labor and ICT-IP capital are gross substitutes; labor and traditional capital and ICT with IP capital are gross complements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Traditional capital (TK)&lt;/strong&gt;: In this paper&amp;rsquo;s taxonomy, all non-ICT, non-IP capital: machinery, transport equipment, construction, and structures. It is the residual capital category and is defined as a gross complement of labor in the estimated nested CES structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intellectual property (IP) capital&lt;/strong&gt;: Capital comprising software, databases, patents (including R&amp;amp;D capital), and other forms of intellectual property as measured in the EU KLEMS database. IP capital is grouped with ICT equipment in an inner CES nest on the grounds of complementary use. Its compensation share rise is the proximate accounting factor in the labor share decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Factor-augmenting technological change&lt;/strong&gt;: Hicks-neutral or biased technical progress that enters multiplicatively with a specific factor input in the production function (e.g., γ_ICT for ICT capital), scaling the effective quantity of that input. In this paper, the ICT-augmenting parameter is estimated to be very large and positive (0.725), reflecting rapid ICT productivity growth, while IP- and traditional-capital-augmenting parameters are negative, which the author suggests may partly reflect markups or underutilization rather than pure technology.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imputed aggregate labor-capital elasticity&lt;/strong&gt;: The elasticity of substitution between labor and total capital derived analytically from the nested CES parameters using Hicks&amp;rsquo;s formula, rather than estimated directly from a two-input specification. In this paper, the imputed value exceeds 1 for Europe (~1.36-1.43) and is substantially higher for the US (~2.14-2.37), contrasting with directly estimated values that are below 1, illustrating the sensitivity of this parameter to production function specification.&lt;/p&gt;</description></item><item><title>Optimal Combination of Patent Instruments in a Cumulative-Innovation Growth Model</title><link>https://macropaperwarehouse.com/papers/optimal-combination-of-patent-instruments-in-a-cumulative-innovation-growth-model/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-combination-of-patent-instruments-in-a-cumulative-innovation-growth-model/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a tractable general equilibrium model of endogenous growth driven by cumulative innovation, and uses it to characterize optimal patent policy — both for patent breadth (via a &amp;ldquo;non-infringing inventive step&amp;rdquo; requirement) and patent length — with a focus on their welfare implications and optimal combination.&lt;/p&gt;
&lt;p&gt;The central motivation is that cumulative innovation creates positive knowledge spillovers: each new idea strictly builds on the best existing technology, and the disclosure that patenting requires diffuses knowledge to future innovators. Because private firms do not internalize these spillovers, the decentralized equilibrium features strictly lower R&amp;amp;D investment than the social optimum. The key wedge is an intertemporal spillover effect: firms discount future profits at a rate that includes the hazard of being superseded (rho + lambda&lt;em&gt;v&lt;/em&gt;L), while the social planner uses only the pure time preference rate (rho). Appropriability and business-stealing externalities exactly offset each other, so the intertemporal spillover is the sole source of under-investment.&lt;/p&gt;
&lt;p&gt;The model has a continuum of differentiated varieties, a single labor input, a Poisson idea arrival process (rate lambda per R&amp;amp;D worker), and productivity improvements drawn i.i.d. from a standardized Pareto distribution with shape parameter theta &amp;gt; 1. The Pareto structure yields the key tractability: the log of the k-th best productivity level is Gamma-distributed with mean k/theta, which allows closed-form welfare expressions. In steady state, all outcomes depend on just three deep parameters: the discount rate rho, the Pareto shape theta, and the innovative capacity lambda*L.&lt;/p&gt;
&lt;p&gt;The patent breadth instrument is formalized as a &amp;ldquo;non-infringing inventive step&amp;rdquo; (NIS) requirement B &amp;gt;= 1: a new idea must deliver a productivity at least B times the current patent-holder&amp;rsquo;s productivity to qualify for a patent. Raising B creates two opposing forces. The &amp;ldquo;profit effect&amp;rdquo; extends incumbent monopoly duration by reducing the hazard rate of supersession (from lambda&lt;em&gt;v&lt;/em&gt;L to lambda&lt;em&gt;v&lt;/em&gt;L&lt;em&gt;B^{-theta}), raising innovation incentives. The &amp;ldquo;hurdle effect&amp;rdquo; raises the bar an idea must clear to be patentable, reducing the expected return to R&amp;amp;D. These forces generate a non-monotonic (inverted-U) relationship between R&amp;amp;D effort and B (Proposition 2): there is a unique B_v that maximizes the innovation rate, with dv/dB &amp;gt; 0 for B &amp;lt; B_v and dv/dB &amp;lt; 0 for B_v &amp;lt; B &amp;lt; B_0 (the upper bound beyond which no R&amp;amp;D occurs). Explicitly, B_v = [lambda&lt;/em&gt;L / (rho*(theta-1))]^{1/theta}. Proposition 3 further establishes that in economies whose innovative capacity falls just below the threshold for positive growth at B=1, a well-chosen NIS can shift the economy from a zero-growth to a positive-growth steady state.&lt;/p&gt;
&lt;p&gt;The welfare-maximizing breadth B_w is shown to be unique, binding (B_w &amp;gt; 1), and strictly below B_v (Proposition 4 and 5). The welfare optimum trades off the dynamic gain from greater innovation against the static consumer surplus loss from higher markup power. Because the dynamic gain is still positive when B &amp;lt; B_v (R&amp;amp;D is still rising) but the static loss grows continuously in B, the welfare maximum necessarily occurs in the region where research is still increasing — i.e., B_w &amp;lt; B_v.&lt;/p&gt;
&lt;p&gt;Numerically, at baseline parameters (rho = 0.07, theta = 4, lambda&lt;em&gt;L = 1), B_w = 1.14 and the equilibrium R&amp;amp;D share is v(B_w) = 0.22, implying an asymptotic maximum real wage growth rate of 4.8%. The optimal breadth is most sensitive to theta (Pareto tail thickness) and less sensitive to rho and lambda&lt;/em&gt;L.&lt;/p&gt;
&lt;p&gt;When patent length (Omega) is added as a second instrument, the model yields a sharp result: the welfare-maximizing policy sets Omega → infinity together with B = B_w (Proposition 6). Unlike patent breadth, patent length has no hurdle effect — a longer patent duration raises R&amp;amp;D monotonically (dv/dOmega &amp;gt; 0, Lemma 2). With no diminishing returns to innovation effort in this model (the Poisson arrival rate is proportional to vL), the marginal dynamic gain from extending Omega always strictly outweighs the marginal static loss, so infinite patent length is always superior to any finite length. With Omega = 20 years (the TRIPS standard), the baseline calibration implies B_w = 1.13 and v(B_w) = 0.21 — only slightly below the infinite-length benchmark — suggesting the qualitative infinite-length result has limited quantitative bite for realistic patent durations.&lt;/p&gt;
&lt;p&gt;Proposition 7 shows that patent breadth and patent length are policy complements: when patent length is exogenously constrained to a finite value, the welfare-maximizing breadth increases in Omega (dB_w/dOmega &amp;gt; 0). Intuitively, a shorter patent duration weakens innovation incentives, so the optimal NIS compensates by providing stronger breadth protection.&lt;/p&gt;
&lt;p&gt;The paper provides a unified rationalization of several empirical puzzles: the weak or negative relationship between patent strength and innovation rates (Sakakibara-Branstetter 2001 on Japan; Bessen-Maskin 2009 on US software) is consistent with B being set above B_v, where the hurdle effect dominates; the causal evidence in Galasso-Schankerman (2014) that patents impede cumulative knowledge accumulation is consistent with the hurdle effect operating at the margin.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and is this a theoretical or empirical paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a purely theoretical paper. There is no empirical identification strategy. The core contribution is an analytically tractable general equilibrium model in which the key results (Propositions 1–7) are derived from first-order conditions, comparative statics, and the application of the intermediate value theorem. The Pareto-improvement distribution is the key parametric assumption that enables closed-form expressions for welfare and the growth rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the key model departure from Kortum (1997) and Eaton-Kortum (2001)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Kortum (1997) and Eaton-Kortum (2001) model ideas as drawn from a stationary distribution over productivity levels — new ideas may or may not surpass the existing frontier, and as ideas accumulate it becomes progressively less likely that a new draw beats the current best. This generates growth only if the workforce grows. Chor and Lai instead model productivity improvements (ratios Z_{k+1}/Z_k) as i.i.d. Pareto draws, so each new idea strictly improves on the frontier regardless of how many ideas have arrived. This cumulative structure generates endogenous growth with a constant workforce and introduces knowledge spillovers that are absent in Kortum (1997).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What exactly is the &amp;rsquo;non-infringing inventive step&amp;rsquo; (NIS) and how does it differ from other breadth concepts in the literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The NIS requirement B stipulates that a new idea must achieve a productivity at least B times the productivity of the current best patent (i.e., Z_new &amp;gt;= B * Z_current) to be patentable and non-infringing (what the paper calls &amp;rsquo;leading breadth&amp;rsquo;). The paper notes this is distinct from — though related to — patentability requirements studied by O&amp;rsquo;Donoghue (1998), which focused on the minimum improvement to qualify for a new patent but not necessarily on infringement. It also differs from the Gilbert-Shapiro (1990) and Klemperer (1990) breadth concepts, which focus on horizontal product differentiation (consumer willingness to substitute away from a patent) rather than vertical quality improvements. In the paper&amp;rsquo;s model, both patentability and non-infringement requirements are captured by a single parameter B, with the simplifying assumption that meeting the B hurdle is both necessary and sufficient for non-infringement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three externalities in the model, and which one drives the market-planner wedge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three externalities are present: (1) The intertemporal spillover effect — firms do not internalize that their innovation raises the knowledge base for future innovators. (2) The appropriability effect — firms capture only private profits, not the full consumer surplus gain from each innovation. (3) The business-stealing effect — each innovator imposes a negative externality on the incumbent patent-holder by eroding their profits. Effects (2) and (3) exactly offset each other in the Pareto specification, so only the intertemporal spillover effect remains. This is verified formally: the market equilibrium condition features a discount rate of rho + lambda&lt;em&gt;v&lt;/em&gt;L (including the creative destruction hazard), whereas the social planner&amp;rsquo;s problem involves only rho. The wedge between v_eqm and v_SP stems entirely from this higher effective discount rate in decentralized equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the welfare-maximizing patent breadth strictly less than the innovation-rate-maximizing breadth?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At B_v, research effort is at its maximum, but this is achieved by granting patent-holders maximum protection, imposing the largest static consumer surplus loss. For B between B_w and B_v, increasing B further raises the static loss but no longer raises the innovation rate significantly enough to compensate; in fact for B &amp;gt; B_v, research effort falls while the static loss remains. The welfare optimum trades off the dynamic benefit (higher innovation) against the static cost (monopoly pricing). Because welfare must also account for the static loss at each period, and this loss is already large at B_v, the welfare optimum is achieved at a lower level of protection. Formally, dU_0/dB &amp;lt; 0 for all B in [B_v, B_0), and the unique welfare maximum lies strictly in [1, B_v).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the optimal patent length infinite?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Unlike patent breadth, patent length has only a profit effect and no hurdle effect — a longer patent strictly raises R&amp;amp;D effort (Lemma 2). Moreover, the model has no diminishing returns to innovation effort: the Poisson arrival rate of ideas is simply proportional to the total number of R&amp;amp;D workers at each date (lambda&lt;em&gt;v&lt;/em&gt;L), so each additional unit of research labor generates the same expected innovation flow regardless of how much research has already been done. This means the marginal dynamic gain from raising Omega (via increased innovation) is approximately constant, while the marginal static loss (additional consumer surplus ceded per period) is also roughly constant. The dynamic gain always strictly exceeds the static loss as long as the economy can sustain positive R&amp;amp;D (Lemma 1 condition holds), so Omega → infinity is always welfare-improving. This result breaks down if one introduces diminishing returns to R&amp;amp;D (e.g., a fishing-out effect or a congestion externality in research).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are patent breadth and patent length policy substitutes or complements?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They are policy complements (Proposition 7): when patent length is shorter (e.g., exogenously constrained by TRIPS or ethical considerations), the welfare-maximizing breadth B_w is lower; conversely, a longer patent length calls for a higher optimal breadth. This is because a longer patent length increases the dynamic gain from research, which raises the marginal value of also increasing breadth (since breadth further amplifies the monopoly profit effect). Formally, d^2U^l_0/(dB d Omega) &amp;gt; 0 at B_w, implying dB_w/d Omega &amp;gt; 0 by the implicit function theorem.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the quantitative calibration, and what are the key numerical results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The calibration is illustrative rather than structural. Baseline: rho = 0.07 (matching real stock market returns as in Kortum 1997), theta = 4 (implying expected profits = 25% of per-variety expenditure, since 1/(1+theta) = 0.20 &amp;hellip; actually 1/(1+4) = 0.20, with the text stating 1/(1+theta) = 0.25 implying theta=3; the paper states theta=4 gives 1/(1+theta) = 0.20 — there is a slight inconsistency in the text&amp;rsquo;s wording, but the stated result is 25% of expenditures per variety), lambda*L = 1 (one expected new idea per variety per year). These yield: B_w = 1.14 (infinite patent length), v(B_w) = 0.22 (22% of labor in R&amp;amp;D), and an asymptotic maximum real wage growth rate of 4.8%. The optimal breadth B_w is most sensitive to theta: lowering theta (fatter tail, larger average improvements) raises B_w substantially. Under a finite patent length of Omega = 20, the results change minimally: B_w = 1.13, v(B_w) = 0.21.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model handle the possibility that economies with low innovative capacity might not innovate at all without policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When lambda&lt;em&gt;L &amp;lt; rho&lt;/em&gt;theta, the economy has no R&amp;amp;D in the decentralized equilibrium at B = 1 (v(1) &amp;lt; 0 per equation 22). However, Proposition 3 shows that if lambda&lt;em&gt;L falls in the intermediate range (rho&lt;/em&gt;(theta-1)&lt;em&gt;(theta^2/(theta^2-1))^theta &amp;lt; lambda&lt;/em&gt;L &amp;lt; rho*theta), there exists a range of binding NIS values B &amp;gt; 1 that can shift the economy from zero to positive growth. Setting B = B_v achieves this transition. This is because the profit effect of introducing a binding NIS can more than offset the hurdle effect in this regime, making it profitable for some workers to engage in R&amp;amp;D.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key welfare-improving scope conditions for the NIS policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The welfare gain from a binding NIS requires Assumption 1: lambda&lt;em&gt;L &amp;gt; rho&lt;/em&gt;theta. This ensures the economy already features positive R&amp;amp;D at B = 1, and that the innovative capacity is large enough so the dynamic gains from raising B above 1 exceed the static consumer surplus losses. Without this condition, the NIS may either fail to generate R&amp;amp;D (if lambda*L is very low) or may tip the economy into R&amp;amp;D via Proposition 3&amp;rsquo;s mechanism, but welfare-optimality of the NIS still requires the economy be in a regime where the profit effect dominates for small B. Additionally, the NIS must remain below B_v to generate any dynamic gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model relate to Japan&amp;rsquo;s narrow patent breadth policy from 1960-1993?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper cites Ordover (1991) and Maskus-McDaniel (1999) to note that Japan deliberately adopted narrow patent breadth to encourage more incremental innovation and technology catch-up. In the model&amp;rsquo;s terms, Japan was setting B close to 1 (or even at 1) to lower the hurdle for new patents, maximizing the number of patentable ideas. This is consistent with a strategy of maximizing the innovation rate (operating near B_v or even below it), potentially at the cost of some dynamic welfare optimization. The Apple v. Samsung example illustrates that the US tends toward broader patent breadth (higher B) than Japan, consistent with the model&amp;rsquo;s international variation in NIS standards.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper handle the price markup and profit structure under the NIS?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Bertrand competition with limit pricing, the incumbent with the best patentable technology sets price equal to the marginal cost of the second-best technology (the previous patent-holder). The price markup m = Z_k/Z_{k-1} is drawn from a Pareto distribution with shape theta and lower bound 1 (no NIS) or B (with NIS). Flow profits are therefore: Pi = B(1+theta)^{-theta} / [B(1+theta) - theta] &amp;hellip; more precisely from equation (19): Pi = [B(1+theta) - theta] * (B(1+theta))^{-1}. As B rises, Pi increases (higher average markups from higher minimum improvement), which is the profit effect. The expected log productivity of the k-th patentable idea is E[ln Z~_k] = k/theta + k*ln(B), confirming that higher B raises not just the probability threshold but also the expected productivity of successful innovations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the limitations and potential extensions noted by the authors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors acknowledge several limitations and propose extensions: (1) The model assumes fully cumulative innovation — each idea strictly builds on the frontier. Generalizing to partial cumulativeness (where some ideas are non-cumulative or only partially built on existing knowledge) is flagged as a natural extension. (2) The analysis is confined to a single-country setting. A multi-country extension would allow study of cross-border patent policy spillovers and optimal international IPR harmonization (e.g., under TRIPS). (3) The model does not allow directed research — firms cannot target specific varieties. Relaxing this could introduce additional policy margins. (4) The model abstracts from imitation threats, which Gallini (1992) shows can make broader patent protection optimal.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper compare to O&amp;rsquo;Donoghue (1998) and O&amp;rsquo;Donoghue-Zweimüller (2004)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;O&amp;rsquo;Donoghue (1998) shows a patentability requirement can raise social welfare in a partial equilibrium setting, and Hunt (2004) finds an inverted-U relationship between innovation rate and requirement strength — both echo Chor-Lai&amp;rsquo;s findings. O&amp;rsquo;Donoghue-Zweimüller (2004) embed patentability in a quality-ladder endogenous growth model but focus more on innovation effects than welfare. The contribution of Chor-Lai relative to these papers is: (i) a fully general equilibrium treatment with explicit welfare analysis; (ii) derivation of both the welfare-maximizing breadth and the innovation-maximizing breadth and proof that Bw &amp;lt; Bv; (iii) extension to jointly optimal patent breadth and length, showing infinite patent length is optimal; and (iv) the Pareto-Gamma tractability that yields closed-form expressions and enables clean comparative statics on three deep parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks does the paper provide?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper notes in the main text that results are robust to removing the scale effect (the feature that the innovation rate increases in L). An online appendix (referenced but not included in this draft) proves that the main qualitative results — inverted-U in innovation vs. B, unique welfare-maximizing B_w &amp;lt; B_v, and infinite optimal patent length — survive in a model variant without the scale effect. The numerical sensitivity analysis in Section 3.4 also demonstrates robustness of the qualitative findings across wide ranges of rho (0.02 to 0.12) and theta (2 to 6) and lambda*L.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Non-Infringing Inventive Step (NIS) requirement&lt;/strong&gt;: A patent policy parameter B &amp;gt;= 1 stipulating that a new idea must achieve a productivity at least B times that of the current best patent to qualify for a patent and be deemed non-infringing. In the paper&amp;rsquo;s usage, this simultaneously captures both the patentability requirement and the leading breadth (protection of incumbents against near-imitation), and is used interchangeably with &amp;lsquo;patent breadth.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cumulative innovation&lt;/strong&gt;: An innovation process in which each new idea strictly improves upon the existing technological frontier. Formally, the productivity improvement Z_{k+1}/Z_k is drawn i.i.d. from a Pareto distribution with support [1, infinity), so each arriving idea always delivers a strictly positive productivity gain over the current best technology. This contrasts with non-cumulative models (e.g., Kortum 1997) where draws are from a stationary distribution and may fall below the frontier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Profit effect (of patent breadth)&lt;/strong&gt;: The mechanism by which a higher NIS requirement B reduces the hazard rate that an incumbent patent-holder is superseded (from lambda&lt;em&gt;v&lt;/em&gt;L to lambda&lt;em&gt;v&lt;/em&gt;L*B^{-theta}), thereby extending the expected duration of monopoly power and raising the value of each patent. This increases R&amp;amp;D incentives by raising expected profits from successful innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hurdle effect (of patent breadth)&lt;/strong&gt;: The mechanism by which a higher NIS requirement B reduces the probability that any given arriving idea is patentable (probability B^{-theta}), thereby lowering the expected return to engaging in R&amp;amp;D. This discourages research effort and is the force that eventually dominates when B becomes sufficiently large, causing the innovation rate to fall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Innovative capacity&lt;/strong&gt;: The product lambda&lt;em&gt;L, where lambda is the per-worker Poisson arrival rate of ideas and L is the total labor endowment. All steady-state outcomes in the model depend on lambda and L only through this product, not their individual values. It is the key parameter determining whether positive R&amp;amp;D equilibrium exists (requires lambda&lt;/em&gt;L &amp;gt; rho*theta) and the magnitude of welfare gains from patent policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intertemporal spillover externality&lt;/strong&gt;: The sole market failure driving under-investment in R&amp;amp;D in this model&amp;rsquo;s Pareto specification. Because the knowledge embodied in each marketed innovation diffuses freely and becomes the base for subsequent cumulative improvements, private innovators do not internalize the benefit their R&amp;amp;D confers on future innovators. This causes firms to use an effective discount rate of rho + lambda&lt;em&gt;v&lt;/em&gt;L (including the creative destruction hazard) rather than rho alone, leading to strictly less R&amp;amp;D than the social optimum. Appropriability and business-stealing externalities exactly cancel in this model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy complementarity (breadth and length)&lt;/strong&gt;: The property that the welfare-maximizing patent breadth B_w is increasing in patent length Omega: dB_w/d Omega &amp;gt; 0. When the patent authority is constrained to set a shorter patent length, the optimal breadth should also be narrower, and vice versa. This arises because a longer patent length raises the marginal dynamic benefit of providing stronger breadth protection.&lt;/p&gt;</description></item><item><title>Payment data, information disclosure, and privacy</title><link>https://macropaperwarehouse.com/papers/payment-data-information-disclosure-and-privacy/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/payment-data-information-disclosure-and-privacy/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Digital payments generate vast, high-frequency, transaction-level data that several central banks (Bank of Canada, Swiss National Bank, Eurosystem members) already use for nowcasting, and regulatory initiatives (the EU&amp;rsquo;s PSD2, the UK&amp;rsquo;s Open Banking Standard, prospective CBDCs) are broadening system-wide data access. The paper asks how improved aggregate-demand forecasts enabled by payment data affect economic activity and through which channels; what the optimal communication policy for disseminating such forecasts is and how it depends on the monetary-policy stance; whether a competitive market in which private banks produce and sell forecasts is socially optimal; and how privacy concerns over individual transaction data affect optimal policy.&lt;/p&gt;
&lt;p&gt;Model setup: The authors build a Lagos-Wright / Rocheteau-Wright general-equilibrium monetary model with infinitely-lived buyers and sellers (unit measure each) and periods split into a centralized market (CM) and decentralized market (DM). Each period a stochastic fraction theta_t of buyers becomes &amp;lsquo;active&amp;rsquo; and wants the DM good; theta_t takes two values, theta_B &amp;lt; theta_G (bad/good aggregate state) with unconditional mean E[theta_t] = theta-bar. Sellers can pay an effort cost kappa to raise productivity from theta_L to theta_H. Payments use bank deposits fully backed by one-period government bonds costing g &amp;gt; beta (g is the policy variable; r = 1/g - 1). DM terms of trade follow the Kalai (1977) bargaining solution with buyer bargaining power sigma. No agent observes theta_t directly, but aggregating payment data across all banks yields a noisy binary signal s in {o,p} (optimistic/pessimistic), producing an unbiased forecast theta-tilde_t in {theta-tilde_G, theta-tilde_B} with E(theta-tilde_t) = theta-bar.&lt;/p&gt;
&lt;p&gt;Main findings (qualitative, as the paper is theoretical with an illustrative calibration): Disclosing forecasts affects welfare through two channels. (1) Demand channel: buyers hold more deposits when expecting high demand, so disclosure raises deposit-holding volatility; even though buyer utility is strictly concave, aggregate welfare w(theta) can be convex or concave. The sign hinges on the statistic T(x) = [u&amp;rsquo;&amp;rsquo;(x)]^2 / [u&amp;rsquo;&amp;rsquo;&amp;rsquo;(x)(u&amp;rsquo;(x)-1/theta)]: w(theta) is convex if T(x) &amp;lt; 1/3 and concave if T(x) &amp;gt; 1 over the relevant range (Lemma 4). (2) Investment channel: sellers underinvest because they capture only fraction (1-sigma) of DM surplus, so disclosure that encourages (discourages) investment raises (lowers) welfare (Lemma 3, thresholds kappa_1 &amp;lt; kappa_2 &amp;lt; kappa_3). Crucially the welfare effect is state-dependent in the monetary stance: with a low bond price/high deposit rate (low g) disclosure tends to reduce welfare (it mainly adds downside volatility and can weaken investment), while with high g (low deposit rate) disclosure tends to raise welfare (Proposition 1; thresholds g, g-bar). Calibrating to the U.S. economy 2016-19, disclosure improves welfare when the utility curvature parameter gamma is small and g is large; the discrete investment channel is inactive over most of the parameter space (Figure 2).&lt;/p&gt;
&lt;p&gt;Policy/theoretical implications: A central bank that controls disclosure can do better than binary reveal/withhold by sending noisy messages, a form of Bayesian persuasion (Kamenica-Gentzkow 2011): by committing to send the pessimistic message mb even when the forecast is optimistic (P^b &amp;lt; 1), it raises the posterior theta-tilde_b and induces investment, improving welfare when the investment channel is strong (Figures 4-5; numerical cases g = 1.05 and g = 1.00). A competitive market where private banks pay fixed cost C to produce and sell the forecast yields zero profits and always reveals undistorted information; provided C is below a threshold C-bar the forecast is always produced and sold, possibly causing excessive information production relative to the social optimum. Privacy: a fraction eta of buyers with high privacy costs use cash, shrinking recorded transactions and lowering forecast precision, but this need not reduce welfare; concave privacy costs can make deposit buyers&amp;rsquo; preferences less concave, turning welfare convex so disclosure helps via the demand channel, partially but not fully offsetting the privacy cost.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the modeling/identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a theoretical general-equilibrium paper, not an empirical identification exercise. The strategy is to embed payment-data-derived forecasting and central-bank communication into a Lagos-Wright/Rocheteau-Wright monetary search model. Aggregate demand theta_t is a two-state random variable realized at the start of the DM; agents make CM decisions (deposit holdings, investment) under a common prior theta-bar unless a forecast is disclosed. The &amp;rsquo;threat&amp;rsquo; analog is robustness of the comparative statics to functional-form and parameter assumptions; the authors discipline curvature via the statistic T(x) and use a CRRA-type utility u(x)=(x+gamma)^{1-sigma_u}&amp;hellip; so that conditions map cleanly into the parameter gamma. They acknowledge agents in reality observe many macro indicators, but assume the only payment-data-based information is the unbiased binary signal, to isolate the informational value of payment data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two main channels, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The demand channel works through buyers&amp;rsquo; deposit holdings: optimistic forecasts raise deposits and DM consumption x, pessimistic forecasts lower them; its welfare sign depends on the convexity/concavity of w(theta), governed by T(x) (convex if T&amp;lt;1/3, concave if T&amp;gt;1). The investment channel works through sellers&amp;rsquo; discrete investment decision: because sellers capture only (1-sigma) of surplus they underinvest, so disclosure that pushes investment up raises welfare and disclosure that pushes it down lowers welfare. They are distinguished analytically by shutting one off: Lemma 4 and Proposition 1 set theta_L = theta_H to isolate the demand channel; Lemma 3 isolates the investment channel via the cost thresholds kappa_1 &amp;lt; kappa_2 &amp;lt; kappa_3.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the welfare effect depend on monetary policy stance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The bond price g (inverse of the deposit rate, r = 1/g - 1) is the key policy variable. When g is small (high deposit rate, cheap to hold deposits), consumption x is near its upper bound x*(theta) already under theta-bar, so an optimistic forecast barely raises x while a pessimistic one sharply lowers it, making welfare locally concave and disclosure welfare-reducing; low g also makes DM surplus large so sellers already invest, and a low theta-tilde_B can discourage investment, hurting welfare. When g is large (low deposit rate, costly deposits), x is low under theta-bar so an optimistic forecast substantially raises trade volume, making welfare convex and disclosure welfare-improving (Proposition 1, thresholds g and g-bar). Hence optimal forecast communication should be designed jointly with conventional monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Bayesian persuasion / noisy-message result work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Instead of fully revealing theta-tilde_t, the central bank sends messages m in {mg,mb} under a committed, publicly known policy phi, choosing posteriors P^g = P(theta-tilde_G|mg) and P^b = P(theta-tilde_B|mb). Lemma 6 gives the policy implementing constant posteriors (requires P^b + P^g != 1). By lowering P^b below 1, the bank sometimes sends mb even when the forecast is optimistic, raising the posterior theta-tilde_b conditional on mb and encouraging sellers to invest; this can outweigh the demand-channel loss when the investment channel is strong. Lowering P^g below 1 adds beneficial noise via the demand channel when w is concave (low g). Numerical exercises with g = 1.05 (welfare locally convex, full transparency P^g=P^b=1 optimal when only demand channel active) and g = 1.00 (welfare locally concave, noisy messages welfare-improving) illustrate this (Figures 4-5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why do buyers and sellers always want to buy the forecast even when disclosure can lower welfare, and what is the market failure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Lemma 5 shows buyers&amp;rsquo; willingness to pay rho^b_t &amp;gt; 0 always and sellers&amp;rsquo; rho^s_t &amp;gt;= 0. Knowing theta-tilde_t lets buyers tailor deposit holdings (avoiding the cost of carrying a fixed level since g &amp;gt; beta) and lets sellers tailor investment, yielding strictly higher private surplus. But neither internalizes the social benefit (the increase in total DM surplus), so private willingness to pay can exceed the social value. Proposition 3 shows that for C &amp;lt;= C-bar the forecast is always produced and sold in the competitive equilibrium (banks earn zero profit), which can lead to excessive information production relative to the social optimum. The market always fully reveals; it cannot replicate the central bank&amp;rsquo;s optimal noisy (persuasion) policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the selective-disclosure result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the production cost C is neither large nor small, the break-even price may exceed only one side&amp;rsquo;s willingness to pay, so the forecast is sold only to buyers or only to sellers (Proposition 3). A buyer-only outcome can improve welfare if the forecast helps via the demand channel but hurts via the investment channel; a seller-only outcome helps if the reverse holds. Online Appendix C.3 shows both are possible, but these market outcomes generally do not coincide with the social optimum, so implementing welfare-improving selective disclosure may require the central bank to control the payment data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does forecast precision affect outcomes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Raising phi_o (precision of the optimistic signal) requires lowering phi_p, sharpening the forecast under both realizations. Through the demand channel, dE[w]/dphi_o = phi-tilde(theta_G-theta_B)[w&amp;rsquo;(theta-tilde_G)-w&amp;rsquo;(theta-tilde_B)], which is positive when w is convex and negative when concave. Through the investment channel, more precision raises theta-tilde_G but lowers theta-tilde_B, which can raise or lower investment depending on kappa. With private banks, Proposition 4 shows buyers&amp;rsquo; and sellers&amp;rsquo; willingness to pay rises with precision, making production (and possible over-production) more likely and selective disclosure less likely. Under Bayesian persuasion, higher precision weakly raises welfare (it expands the feasible policy set); but if private banks also disseminate, the central bank&amp;rsquo;s persuasion is constrained because agents&amp;rsquo; posteriors cannot contain less information than the private forecast.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are privacy and cash modeled, and what is the effect on welfare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A fraction eta in (0,1) of buyers (&amp;lsquo;cash buyers&amp;rsquo;) face sufficiently large privacy costs from deposit-based payments and use lower-return cash; the rest (&amp;lsquo;deposit buyers&amp;rsquo;) prefer deposits. Cash use shrinks the share of recorded DM transactions, lowering forecast precision (unless cash and deposit buyers&amp;rsquo; demand is perfectly correlated). By the precision results this can raise or lower welfare; with private production it makes excessive information less likely, while under central-bank noisy-message disclosure lower precision shrinks the feasible policy set and can reduce welfare. If the privacy cost is increasing and concave in DM consumption x, deposit buyers&amp;rsquo; net DM utility becomes less concave, making w more likely convex, so disclosure can improve welfare via the demand channel and the optimal policy may switch from non-disclosure to disclosure. This partially but not fully offsets the negative welfare impact of the privacy cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the equilibrium-multiplicity and underinvestment results in the benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With no data sharing, all decisions are state-independent under theta-bar. Strategic complementarity (more sellers investing raises buyers&amp;rsquo; deposits, which raises investment payoff) can generate multiple stationary equilibria (lambda=0, lambda=1, and a mixed lambda in (0,1)) when kappa and theta-bar are intermediate (Figure 1). The lambda=1 equilibrium is highest-welfare and Pareto optimal, and the authors impose a refinement selecting it. Sellers can underinvest: there exists kappa for which lambda=0 is the unique equilibrium even though lambda=1 would be socially better, because sellers receive only (1-sigma) of DM surplus. This underinvestment drives the investment-channel welfare results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from closely related work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus Andolfatto-Berentsen-Waller (2014) and Andolfatto-Martin (2013), where assets pay stochastic dividends and information is disclosed at the start of the DM so nondisclosure is always optimal (consumption smoothing), here the forecast is revealed at the start of the CM and affects deposit and investment decisions, so disclosure can be welfare-positive or -negative. Versus Choi-Liang (2023), whose non-monotonic disclosure effects arise from a money-adoption coordination margin, here non-monotonicity arises from how disclosure shapes marginal deposit holdings and investment. It extends the payment-data literature (Garratt-van Oordt 2021; Garratt-Lee 2020; Kang 2024; Amendola-Araujo-Ferraris 2025; Wang 2020, 2023; Cheng-Izumi 2025; Ahnert-Hoffmann-Monnet 2024) by focusing on the macroeconomic forecasting value of payment data and optimal disclosure, and connects to central-bank communication work (Morris-Shin; Jarocinski-Karadi 2020 information channel; Aruoba-Drechsel forthcoming).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the CBDC and privacy-protection implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;CBDC can serve as an institutional alternative source of payment data: transactions are recorded on a digital ledger, potentially letting the central bank observe flows directly, and can reduce coverage gaps from financial exclusion (the paper cites the 2021 FDIC survey: 4.5 percent of U.S. households, about 5.9 million, were unbanked). CBDC data could improve welfare via the demand and investment channels. Because privacy is a primary public concern, the authors recommend privacy-preserving architectures: adding statistical noise (differential privacy), randomizing data on the buyer&amp;rsquo;s device before transmission, keeping data decentralized with only model updates shared (federated learning), and clear governance/consent. Scope condition: incentivizing a cash-to-deposit/CBDC shift is welfare-improving only under sufficient privacy protection and only under the conditions (e.g., concave privacy cost, high g) that make disclosure beneficial; legal hurdles to central-bank access of payment data remain, which CBDC issuance could circumvent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What extensions and robustness checks are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Correlated signals: the central bank and private banks may receive correlated but non-identical signals (e.g., the bank has confidential surveys); Online Appendix B.4 shows this does not change the main results because information affects allocations only through agents&amp;rsquo; beliefs about theta_t at decision time. The model is calibrated to the U.S. 2016-19 (Online Appendix B.2) for the quantitative figures. Online Appendix C.2 provides a continuous-investment version (under which the investment channel is always active and welfare responses are smoother); the paper deliberately presents the discrete-investment case to highlight the channels. Online Appendix C.1 gives additional noisy-message numerical exercises, and C.3 shows selective-disclosure cases. An alternative to the lambda=1 refinement is a government &amp;lsquo;revenue backstop&amp;rsquo; subsidy (Online Appendix B.2).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Demand channel&lt;/strong&gt;: The mechanism by which disclosing the aggregate-demand forecast changes buyers&amp;rsquo; deposit holdings and hence DM consumption volatility; its welfare sign depends on whether aggregate welfare w(theta) is convex or concave, governed by the curvature statistic T(x), not merely by the concavity of buyer utility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Investment channel&lt;/strong&gt;: The mechanism by which disclosure changes sellers&amp;rsquo; discrete decision to invest in higher productivity; because sellers capture only fraction (1-sigma) of DM surplus they underinvest, so disclosure that encourages investment raises welfare and disclosure that discourages it lowers welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;T(x) statistic&lt;/strong&gt;: A normalized log-curvature measure, T(x) = [u&amp;rsquo;&amp;rsquo;(x)]^2 / [u&amp;rsquo;&amp;rsquo;&amp;rsquo;(x)(u&amp;rsquo;(x)-1/theta)], that disciplines the curvature of w(theta): w is convex when T(x) &amp;lt; 1/3 and concave when T(x) &amp;gt; 1 over the relevant consumption range, capturing how quickly the marginal DM surplus falls as consumption rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bayesian persuasion via noisy messages&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the central bank commits to a publicly known communication policy (choosing posteriors P^g and P^b) that deliberately garbles the forecast - e.g., sending the pessimistic message even when the forecast is optimistic - to shift agents&amp;rsquo; expectations (especially to induce socially efficient seller investment), exploiting that Bayes&amp;rsquo; rule constrains only the average posterior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Excessive information production&lt;/strong&gt;: The outcome under a competitive market for forecasts where, because banks earn zero profit and both buyers and sellers are willing to pay for the forecast even though it may lower aggregate welfare, the forecast is always produced and sold whenever the cost C is below a threshold, over-supplying information relative to the social optimum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash buyers / privacy cost&lt;/strong&gt;: Buyers facing sufficiently large privacy costs from deposit-based (recorded) payments who choose lower-return cash; their use reduces recorded transactions and forecast precision, but a privacy cost that is concave in consumption can make deposit buyers&amp;rsquo; preferences less concave, turning welfare convex so that disclosure becomes optimal and partially offsets the privacy cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate state theta_t&lt;/strong&gt;: The two-valued (theta_B bad, theta_G good) random fraction of buyers who become active and demand the DM good, equal to the level of aggregate demand; realized at the start of the DM with unbiased forecast theta-tilde_t derived from aggregated payment data.&lt;/p&gt;</description></item><item><title>Pricing-to-market in business cycle models</title><link>https://macropaperwarehouse.com/papers/pricing-to-market-in-business-cycle-models/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/pricing-to-market-in-business-cycle-models/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper evaluates five microfounded pricing-to-market (PTM) mechanisms and one reduced-form aggregator in a two-country DSGE model with volatile exchange rates driven by financial shocks (following Gabaix and Maggiori 2015) and real productivity shocks. The central question is whether existing open-economy theories can jointly achieve three empirically mandated targets — low exchange-rate pass-through to import prices, muted expenditure switching (low short-run trade elasticity), and plausible producer markups — when exchange rates are volatile and act as a major independent source of fluctuations. The paper&amp;rsquo;s main contribution is to show analytically and quantitatively that no existing microfounded PTM model fully escapes a structural tension among these three targets, which the authors call the parameterization trilemma.&lt;/p&gt;
&lt;p&gt;The models evaluated are: (i) the Kimball Aggregator (KA; reduced-form, Itskhoki-Mukhin application); (ii) the Distribution Cost model (CD; Corsetti-Dedola 2005); (iii) the Price Dispersion model (PD; Alessandria 2009); (iv) the Nested CES/Cournot model (NCES; Atkeson-Burstein 2008); (v) the Deep Habits model (DH; Ravn-Schmitt-Grohe-Uribe 2007); and (vi) the Customer Capital model (CC; Drozd-Nosal 2012). The encompassing framework uses the Backus-Kehoe-Kydland (1995) two-country structure augmented with a financial sector that generates UIP deviations via a capacity-constrained arbitrageur segment and exogenous noise-trader positions. The model is estimated/calibrated to quarterly U.S. data (1981Q1–2009Q4 for prices, 1980Q1–2004Q1 for quantities), HP-filtered with lambda = 1,600.&lt;/p&gt;
&lt;p&gt;The baseline markup target is 50%, consistent with BEA input-output tables for U.S. tradable sectors (ranging 45–50% across 2007, 2012, 2017); listed-firm SEC data imply higher values around 73–75%, which the authors treat as an upper bound. The empirical pass-through target is 0.4 (midpoint of a 0.2–0.6 range estimated by Campa-Goldberg 2005 and others; Gopinath-Itskhoki 2022 estimate 0.2–0.3). The short-run trade elasticity target is 0.7, measured using the volatility ratio of quantities to prices, which yields an upper-bound estimate. Real exchange rate volatility is targeted at 3.97 (standard deviations relative to GDP). Imports-to-GDP ratio is targeted at 12%.&lt;/p&gt;
&lt;p&gt;The central analytic finding — the parameterization trilemma — is characterized precisely for each model. For the KA model, the demand elasticity parameter gamma(1) simultaneously pins down both the markup and the trade elasticity, so matching 50% markups implies trade elasticity of approximately 1.5 (above the desired range of less than 1) and any value below TE = 1 is simply unattainable. For the CD model, pass-through of 0.4 requires a distribution cost markup wedge of 150% above the producer&amp;rsquo;s markup, which is inconsistent with the 50% markup target. For the PD model, the structural formula links PT and markups but less severely, so the trilemma is partially mitigated. For the NCES model, the trade elasticity equals the firm-level elasticity theta, which is also the main driver of pass-through, recreating a binding version of the KA trilemma on the quantity side. For the CC model, the market-expansion friction (captured by adjustment-cost parameter psi) provides an additional degree of freedom that allows trade elasticity to be set independently of pass-through and markups; at symmetric bargaining power eta = 0.5 and 50% markups, the model delivers PT = 0.33 analytically, close to the data target.&lt;/p&gt;
&lt;p&gt;Quantitative results confirm the analytic predictions. The KA model fails on quantity statistics because it implies trade elasticity far above target, generating counterfactually negative international comovement of consumption, investment, and employment. The CD model delivers only moderately incomplete pass-through (substantially above the 0.4 target), underperforming on price statistics, and implies a counterfactual correlation of net exports with the terms of trade. The PD model delivers pass-through of approximately 0.70 — better than CD but still above target — and performs well on quantities. The NCES model achieves pass-through of 0.63 (close to but above the 0.4 target) but at the cost of large, negative international comovement in general equilibrium, including a counterfactual positive correlation of net exports with output. The DH model generates more-than-complete pass-through in the presence of persistent exchange rates, failing on prices. The CC model delivers PT = 0.36, closest to the empirical target, achieves correct signs for international quantity comovement, and generates a positive terms-of-trade/net-exports correlation — but requires assumed productivity shock correlation of 0.75 to match measured TFP correlation of 0.3 due to endogenous marketing investment affecting measured TFP, and fails to deliver a positive correlation between terms of trade and the exchange rate.&lt;/p&gt;
&lt;p&gt;The paper concludes that further research is needed into frictions that simultaneously dampen the price and quantity responses to volatile exchange rates without violating markup discipline. The reduced-form KA model neither nests nor outperforms the microfounded alternatives. The CC and PD search-based models perform best overall but introduce frictions that are harder to identify and measure directly.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the parameterization trilemma and how is it characterized analytically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The trilemma is the structural impossibility of jointly satisfying three empirically necessary targets: (a) plausible steady-state producer markups (calibrated at 50%), (b) low short-run trade elasticity (targeted at 0.7 or below), and (c) low exchange-rate pass-through to import prices (targeted at 0.4). The authors derive closed-form expressions for pass-through (PT), trade elasticity (TE), and markups (mu) for each model and show that satisfying any two targets forces a violation of the third. For the KA model, the key parameter gamma(1) satisfies TE = gamma(1) and mu = (gamma(1) - 1)^{-1}, so targeting 50% markups forces TE = 3 and targeting TE = 1.5 forces markups of 200%. For the CD model, PT = 0.4 requires the distribution-cost wedge xi/(theta-1) = 1.5, implying markups more than 150% above the friction-free level, incompatible with a 50% target. For the PD model the formula is PT = 1 - mu/(1+mu), which is less restrictive. For the NCES model, TE = theta (the firm-level elasticity) and theta also drives pass-through, recreating the KA-type trilemma on the quantity side. For the CC model, the friction parameter psi in marketing capital accumulation independently controls TE, providing an extra degree of freedom that lets the model partially escape the trilemma.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for pass-through and trade elasticity, and what are its main assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The theoretical pass-through coefficient (PT) is defined as the partial equilibrium, on-impact elasticity of the import price with respect to the exchange rate, computed at the steady state while holding constant marginal costs (v, v*), the stochastic discount factor, and the domestic price of the home good. This mimics what regression-based pass-through estimates do (controlling for local costs). Trade elasticity (TE) is defined analogously as the PT-scaled elasticity of the import/domestic quantity ratio with respect to the exchange rate, under a one-time shock that reverts to the steady state next period (except for the DH model, where a permanent shock is considered). A key assumption is that importers take aggregate price indices as consistent with all importers behaving the same way (a rational-expectations fixed point). General-equilibrium co-movements between exchange rates and marginal costs are abstracted from in the analytic section, consistent with the goal of isolating each model&amp;rsquo;s intrinsic PTM mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the KA model fail on quantity statistics despite being able to match any degree of pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The KA model can match pass-through of 0.4 by freely choosing the curvature of the demand aggregator g&amp;rsquo;&amp;rsquo;(1) (independently of gamma(1)). However, the steady-state demand elasticity gamma(1) simultaneously determines both the markup (mu = (gamma(1)-1)^{-1}) and the trade elasticity (TE = gamma(1)). Matching 50% markups forces gamma(1) = 3 and therefore TE = 3, far above the target of 0.7. This excessive trade elasticity generates counterfactually large expenditure switching in response to exchange-rate shocks, leading to counterfactual negative international comovement of consumption, investment, and employment. A modified Kimball aggregator with a convex adjustment cost (equation 62) does not resolve the problem because the convex cost parameter also enters the steady-state markup formula, so targeting 50% markups still forces high effective trade elasticity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the Deep Habits model generate more-than-complete pass-through when exchange rates are persistent?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the DH model, producers internalize the law of motion for habits: by lowering prices today they accumulate more customer habits, which allows them to raise prices later. When the exchange rate appreciates persistently (from the foreign exporter&amp;rsquo;s perspective), exporters expect their foreign sales and thus foreign habit stocks to fall over time. This reduces the shadow value of habit (Delta_f), so producers let prices fall by more than the exchange rate movement, generating pass-through greater than one. The authors derive analytically that, for a permanent shock, PT &amp;gt; 1 because dlog(gh)/dlog(x) &amp;lt; 0 (habit falls upon appreciation), and this dominates the direct pricing effect. For a purely transitory shock, the sign reverses (PT &amp;lt; 1), but since exchange rates are highly persistent in the data, the first property dominates. The quantitative section confirms this: the DH model generates PT &amp;gt; 1, marked as 1.00 in Table 4, disqualifying it on prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Customer Capital (CC) model partially escape the trilemma?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The CC model introduces two key elements absent from other frameworks: (1) Nash bargaining over prices within bilateral matches, which directly ties pass-through to the sharing of exchange-rate-driven surplus rather than to demand elasticity; and (2) a convex adjustment friction on marketing capital (psi) that controls the pace of trade-share adjustment, independently setting the short-run trade elasticity. Because prices are determined by bargaining (equation 53: pf = eta*P_d + (1-eta)*v), they depend on the retail marginal value of the foreign good (P_d) and the foreign marginal cost (v), but not on quantity within the match. This decouples PT from TE. Analytically, at static steady state, PT = (1-eta)(1 + mu - (TE/gamma)(eta+mu)*omega)^{-1}; for eta = 0.5 and 50% markups and TE/gamma approaching zero, PT approaches (1-eta)/(1+mu) = 1/3. The psi parameter then tunes TE separately from markups and PT. However, a high long-run elasticity gamma (= 7.9) is required to generate sufficient retail-price responsiveness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the NCES model achieve on prices and why does it fail on quantities?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The NCES (Nested CES with Cournot competition) model generates incomplete pass-through of 0.63, the second-best performance on prices after the CC model. The mechanism is that non-atomistic (Cournot) firms internalize the impact of their pricing on the sectoral price index; when the exchange rate moves, foreign exporters&amp;rsquo; market share changes, altering the endogenous demand elasticity they face and dampening their pass-through. To calibrate the model with only one exporting firm (NX=1 out of N=5), the authors maximize the Cournot effect. However, this calibration implies TE = theta (the firm-level elasticity, set at 7.9 in calibration), far exceeding the target of 0.7. A quantity adjustment cost cannot remedy this because it would simultaneously constrain import-share movements, which are the source of the endogenous demand elasticity variation that generates incomplete pass-through. Consequently, the model implies large negative international comovement of output, consumption, employment, and investment — a worse quantity performance than most other models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper measure markups and what data sources does it use?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper equates markups with gross margins under the maintained assumptions of Cobb-Douglas production and static cost minimization (Hall 1988; De Loecker et al. 2020). Under Cobb-Douglas, marginal cost v = wl/y, so markup mu = P&lt;em&gt;y/(w&lt;/em&gt;l) - 1 = sales/(cost of goods sold) - 1. Three data sources are used, all for U.S. data 2007-2017: (1) BEA 402 Industry Input-Output Use Tables, which give gross margins of approximately 39-41% for all sectors and 45-50% for traded sectors (import share &amp;gt; 3%). (2) S&amp;amp;P 500 Compustat with BEA sector value-added adjustment, yielding approximately 73-74% for all non-FIRE/GOV/NGO firms. (3) Unadjusted Compustat, yielding 43-49%. The paper adopts 50% as the baseline calibration target, treating it as conservative given the data range, and noting that the BEA I-O measure is the broadest and likely most accurate. The paper explicitly holds that models must respect profit and margin accounting within their own structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper&amp;rsquo;s conclusion differ from Itskhoki and Mukhin (2021) regarding the Kimball Aggregator?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Itskhoki and Mukhin (2021) use indirect inference and treat producer margins/markups as a free parameter, implicitly allowing for a much higher markup value — substantially above 50%. Under their calibration approach, the KA model can reconcile low pass-through with better quantity performance. Drozd, Kolasa, and Nosal instead impose a markup discipline: models must match empirically observed gross margins of 50% (for tradable sectors from BEA I-O tables) in their steady state. Under this discipline, the KA model&amp;rsquo;s trilemma becomes binding, and the model fails on quantity statistics. The authors argue that higher markup assumptions change the effective structure of the model and should be treated as a separate research agenda rather than a free calibration choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of financial shocks in the model and how are they implemented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Financial shocks generate exchange-rate volatility that is largely decoupled from real fundamentals — mimicking the observed &amp;rsquo;exchange rate disconnect&amp;rsquo; from output and consumption. They are modeled following Gabaix and Maggiori (2015): a global financial sector with short-lived arbitrageurs and noise traders. Arbitrageurs face a capacity constraint (parameterized by Gamma) that prevents them from fully exploiting UIP violations, resulting in a distorted UIP condition where the interest rate differential includes a term proportional to the arbitrageur&amp;rsquo;s position. Noise traders take exogenous positions n(t) that follow an AR(1) process (persistence rho_n = 0.97 in calibration) with standard deviations ranging from 21.2 (CC model) to 114.9 (NCES model) across calibrations. These shocks generate real exchange rate volatility of 3.97% (standard deviations relative to GDP), matching the data target. The paper notes that the precise implementation (Gabaix-Maggiori vs. Itskhoki-Mukhin) has little impact on exchange-rate properties in a linearized setting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and extensions does the paper consider?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper considers a modified Kimball aggregator with a convex adjustment cost on the ratio of imported to domestic quantities (equation 62) as a potential fix for the KA model&amp;rsquo;s high trade elasticity. This is shown not to resolve the trilemma because the convex cost parameter also enters the steady-state markup formula, keeping the binding constraint in place. Results for this modified model are reported in the Online Appendix. The paper also notes that the DH model&amp;rsquo;s pass-through is analyzed under both permanent and transitory shocks, with the sign reversal for purely transitory shocks documented analytically. The paper abstracts from nominal rigidities throughout, justifying this by citing Gopinath-Itskhoki (2011) evidence that conditioning pass-through on price adjustments versus non-adjustments makes little difference in observed pass-through patterns, suggesting limited pass-through is largely a real phenomenon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the paper&amp;rsquo;s main implications for the DSGE modeling of open economies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper implies that the standard toolkit for generating incomplete exchange-rate pass-through and muted expenditure switching is inadequate when exchange rates are volatile and act as a major shock. All models face tension among the three targets; the best performers (CC and PD) do so by introducing search frictions that are intrinsically difficult to identify and measure directly. The paper does not claim to provide a solution; rather, it performs a clean diagnostic showing that more research is needed into real frictions that simultaneously insulate import prices and trade quantities from exchange-rate volatility. The finding that the Kimball reduced-form aggregator neither nests nor outperforms microfounded alternatives has implications for monetary-policy DSGE models that frequently use the KA for tractability, suggesting that researchers should be aware of the high implicit markup that is required for the KA to work well in open-economy settings with volatile exchange rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What moments from the data are targeted in calibration and what is the quantitative approach?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is calibrated quarterly and HP-filtered (lambda = 1,600). Common targets include: imports/GDP = 12%; 50% producer markups; 30% work hours relative to time endowment; investment volatility relative to GDP = 2.79; short-run trade elasticity (volatility ratio) = 0.7; cross-country TFP correlation = 0.3; TFP volatility = 0.8% and autocorrelation = 0.72; real exchange rate volatility = 3.97%. The pass-through target of 0.4 is used only as an additional degree of freedom for the KA model; for all others, pass-through is an outcome of the structural parameterization. The financial shock persistence is set arbitrarily at rho_n = 0.97 for lack of a target. When a model cannot satisfy all targets (as with KA and NCES on trade elasticity), that target is dropped in favor of best performance on prices. Pass-through is measured in the quantitative section by running regressions analogous to Campa-Goldberg (2005) on model-generated data, rather than using the analytic partial-equilibrium formula.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the sign of the terms-of-trade and exchange-rate correlation, and what does it imply for model evaluation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In model-generated data (without noise), the correlation of terms of trade (tot = pf/px) with the exchange rate (x) is either -1 (when PT &amp;lt; 0.5) or +1 (when PT &amp;gt; 0.5). The empirical target from U.S. data is approximately -1. This means matching PT &amp;lt; 0.5 and a negative tot-x correlation are equivalent predictions. In the quantitative results, only the KA and CC models achieve PT &amp;lt; 0.5 and thus generate the correct negative correlation; all other models (CD, PD, NCES, DH) generate PT &amp;gt; 0.5 and thus positive tot-x correlation. The authors note that the strict 0.4 target may be too aggressive for aggregate data — PT slightly above 0.5 would be consistent with a positive (near zero) correlation — pointing to Gopinath et al. (2020) who find small, statistically insignificant tot-x coefficients ranging from positive to negative.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Parameterization Trilemma&lt;/strong&gt;: The structural impossibility of jointly achieving three empirically necessary targets in standard PTM models: (1) plausible producer gross margins (~50%), (2) low short-run trade elasticity (~0.7 or below), and (3) low exchange-rate pass-through to import prices (~0.4). Each PTM model can satisfy at most two of the three targets simultaneously under quantitative discipline; the third is either infeasible or inconsistent given the model&amp;rsquo;s internal constraints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pricing-to-Market (PTM)&lt;/strong&gt;: The practice by which internationally active firms set different prices in home and foreign markets as a function of the bilateral exchange rate, rather than uniformly passing exchange-rate changes through to import prices. In this paper, PTM is measured by the degree of incomplete pass-through (PT &amp;lt; 1) and is generated by specific microfounded frictions (distribution costs, search, habits, market power, customer capital) rather than by nominal rigidities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange-Rate Pass-Through (PT)&lt;/strong&gt;: The elasticity of the import price (in the importing country&amp;rsquo;s currency) with respect to the bilateral real exchange rate, computed in partial equilibrium at the steady state, controlling for local costs. Values used in calibration: empirical short-run range 0.2–0.6; paper target 0.4. Models in which PT = 1 satisfy the law of one price; models with PT &amp;lt; 1 exhibit pricing-to-market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-Run Trade Elasticity (TE)&lt;/strong&gt;: The elasticity of import quantities relative to domestic quantities with respect to the exchange rate (equivalently, the expenditure-switching response to import price changes), measured at business-cycle frequencies. The paper measures this using the volatility ratio of trade-flow quantities to prices (an upper-bound estimate abstracting from correlations), targeting a value of 0.7. Long-run elasticity estimates based on trade liberalization episodes are much higher (typically 6 and above) and are used as the long-run elasticity parameter gamma in search-based models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Customer Capital (CC) Model&lt;/strong&gt;: A PTM model (Drozd-Nosal 2012) in which firms build market-specific customer relationships through costly, time-consuming investment in marketing capital, and within-match prices are set by Nash bargaining. The combination of a capacity constraint on quantities traded within each match and bargaining-determined prices decouples the short-run trade elasticity from pass-through, allowing the model to partially escape the parameterization trilemma via the adjustment-cost parameter psi.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kimball Aggregator (KA)&lt;/strong&gt;: A reduced-form, implicitly defined demand aggregator (Kimball 1995) that generates variable demand elasticity through the curvature of the function g(·) around the steady state. In the open-economy application of Itskhoki-Mukhin (2021), two curvature parameters (g&amp;rsquo;(1) and g&amp;rsquo;&amp;rsquo;(1)) can independently control markup and pass-through — but not trade elasticity simultaneously, which is bound to the steady-state demand elasticity gamma(1) and hence to the markup. The paper shows this model neither nests nor outperforms microfounded alternatives under markup discipline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial Shock&lt;/strong&gt;: An exogenous disturbance to the position of noise traders in the international bond market (following Gabaix-Maggiori 2015), which drives deviations from Uncovered Interest Parity via the capacity constraint on arbitrageurs. These shocks generate exchange-rate volatility that is largely disconnected from real fundamentals (productivity), calibrated with persistence rho_n = 0.97 to match U.S. real exchange rate volatility of 3.97% relative to GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross Margin / Producer Markup&lt;/strong&gt;: In this paper, defined as (price - marginal cost) / marginal cost = (sales - cost of goods sold) / cost of goods sold, where under Cobb-Douglas production and static cost minimization, the markup equals the gross margin. The paper targets 50% for U.S. tradable-sector firms based on BEA 402 Industry I-O Use Tables (which yield 45–50% for tradable sectors across 2007–2017), treating this as a hard empirical constraint that models must satisfy in the steady state.&lt;/p&gt;</description></item><item><title>Procyclical Fiscal Policy and Asset Market Incompleteness</title><link>https://macropaperwarehouse.com/papers/procyclical-fiscal-policy-and-asset-market-incompleteness/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/procyclical-fiscal-policy-and-asset-market-incompleteness/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Developing and emerging economies exhibit procyclical fiscal policy on both the spending and taxation sides: government expenditures expand in booms and contract in recessions, and tax rates fall in good times while rising in bad times. This is the mirror image of optimal countercyclical policy prescribed by standard theory and practiced in advanced economies. Understanding why developing countries pursue policies that amplify already-volatile business cycles is a long-standing puzzle in international macroeconomics.&lt;/p&gt;
&lt;p&gt;This paper develops a small open economy model with Ramsey-optimal fiscal policy to argue that standard incomplete asset markets — without sovereign default risk, limited commitment, or high risk premia — are sufficient to explain procyclical fiscal policy on both the spending and the taxation sides. The authors proceed in three stages: a static two-state model that isolates a novel theoretical result; a calibrated infinite-horizon DSGE model that replicates the result and quantifies welfare costs; and a cross-country empirical section providing reduced-form support.&lt;/p&gt;
&lt;p&gt;The paper covers 121 countries (99 developing, 22 OECD) using data on real government consumption, real GDP, and VAT rates updated from earlier studies. The average correlation between the cyclical components of real government spending and real GDP is 0.29 for developing countries versus -0.12 for OECD countries (both significant at the 1 and 5 percent levels, respectively). For tax policy, the average correlation between changes in the VAT rate and real GDP is -0.22 for developing countries (significant at the 1 percent level) versus -0.06 for industrial countries (insignificant at the 5 percent level), confirming procyclical tax behavior in non-OECD economies.&lt;/p&gt;
&lt;p&gt;The core theoretical contribution is a novel result established in a static model: under financial autarky (extreme market incompleteness), government spending is always procyclical regardless of preference parameters, but tax rates can be procyclical, acyclical, or countercyclical depending on the relative magnitudes of the intertemporal elasticities of substitution for private versus public consumption (sigma_c and sigma_g). The key is the &amp;ldquo;consumption preference channel&amp;rdquo;: when sigma_c exceeds sigma_g, private consumption rises proportionally more than public consumption in good times, expanding the tax base by more than the increase in government spending, which allows the fiscal authority to reduce tax rates. The ratio of private to public consumption comoves positively with the business cycle when sigma_c &amp;gt; sigma_g — the empirically-relevant case — generating procyclical tax policy.&lt;/p&gt;
&lt;p&gt;Under complete markets, both government spending and tax rates are acyclical regardless of preference parameters.&lt;/p&gt;
&lt;p&gt;The DSGE model introduces an infinite-horizon setting with endogenous production and labor supply and access to a non-state-contingent international bond with a debt-elastic interest rate spread. This adds a &amp;ldquo;consumption smoothing channel&amp;rdquo; that works against procyclicality: when households can borrow to smooth consumption following adverse shocks, the tax base contracts less, reducing the pressure to raise taxes. However, when the model is calibrated to non-OECD countries — using a debt-elasticity parameter of phi = 0.125 (estimated from non-OECD panel data using EMBIG spreads and public debt) and TFP persistence of rho_A = 0.95 — the consumption preference channel dominates the consumption smoothing channel. The correlation between government spending and output exceeds 0.95 across all values of sigma_g examined (from 0.5 to 1.5) and across all considered debt elasticities. The cyclicality of tax rates flips sign as sigma_g crosses sigma_c, consistent with the static result.&lt;/p&gt;
&lt;p&gt;A moment-matching exercise calibrated to non-OECD data selects sigma_g = 0.25, phi = 1, and rho_A = 0.95 as best-fit parameters. The model successfully replicates four targeted moments — standard deviations of output and private consumption, and the correlations of government spending and tax rates with output — and also matches the untargeted positive comovement of the private-to-public consumption ratio with GDP. The model accounts for only about one-tenth of observed government spending volatility and one-fifth of tax rate volatility, indicating additional non-Ramsey sources of fiscal variation exist.&lt;/p&gt;
&lt;p&gt;Welfare costs of fiscal procyclicality are computed using a Lucas (1987) approach. With no financial frictions (phi approximately 0), welfare costs are approximately 0.015 percent of lifetime consumption. Increasing phi to the calibrated non-OECD value of 0.125 nearly doubles welfare costs to approximately 0.03 percent of lifetime consumption. More persistent TFP shocks (higher rho_A) amplify procyclicality further.&lt;/p&gt;
&lt;p&gt;The empirical section provides cross-country evidence. Capital controls (measured by Fernandez et al.&amp;rsquo;s 2016 de jure indices across 32 transaction types in 10 asset classes over 1995-2015) are larger in non-OECD countries by an order of magnitude, and the null of equal completeness is statistically rejected. The estimated debt-spread elasticity for non-OECD countries using public debt is phi = 0.125 (significant at the 1 percent level), versus 0.002 for OECD countries (insignificant). GDP volatility measured by the standard deviation of HP-filtered real GDP is 3.28 for non-OECD countries versus 1.47 for OECD countries, a difference of more than twofold.&lt;/p&gt;
&lt;p&gt;The policy implication is that completing markets — through sovereign wealth funds, contingent credit lines with international financial institutions, or structural fiscal rules that force saving in good times — could reduce procyclicality and yield welfare gains estimated at up to twice the Lucas-type cost attributable to current friction levels.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the main theoretical result, and how does it advance beyond the prior literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper establishes that incomplete markets (modeled as financial autarky or an upward-sloping supply of funds) are necessary and sufficient to generate procyclical government spending, but are only necessary — not sufficient — for procyclical tax rates. The direction of tax cyclicality depends on the relative intertemporal elasticity of substitution of private consumption (sigma_c) versus public consumption (sigma_g): procyclical if sigma_c &amp;gt; sigma_g, acyclical if equal, countercyclical if sigma_c &amp;lt; sigma_g. This overturns the widespread impression from Cuadra et al. (2010) that incomplete markets cannot generate procyclical tax rates. Prior work invoked sovereign default risk or limited commitment; this paper shows those additional ingredients are unnecessary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the consumption preference channel and why is it empirically relevant?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The consumption preference channel works as follows: when households have a stronger preference for private over public consumption (sigma_c &amp;gt; sigma_g), private consumption rises proportionally more than government spending in good times. The wider tax base allows the government to reduce tax rates while still financing higher spending, generating procyclical tax policy. Empirically, the ratio of private to public consumption comoves positively with output in non-OECD countries — the model matches this as an untargeted moment — so the procyclical case (sigma_c &amp;gt; sigma_g) is the empirically relevant one. The model&amp;rsquo;s best-fit calibration selects sigma_g = 0.25 against sigma_c = 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the consumption smoothing channel and when does it dominate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the DSGE model, households can issue non-state-contingent bonds, partially smoothing consumption against shocks. A negative TFP shock therefore causes a smaller fall in consumption (the tax base), reducing the fiscal authority&amp;rsquo;s need to raise taxes procyclically. This consumption smoothing channel works against tax procyclicality. It dominates when the debt-elastic spread is low (cheap borrowing) and TFP shocks are transitory (low rho_A). For the calibrated non-OECD parameterization — phi = 0.125 and rho_A = 0.95 — the supply of funds is steep enough and shocks persistent enough that the consumption preference channel dominates, and procyclical tax policy results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does TFP persistence play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Higher TFP persistence amplifies business cycle volatility and deepens the procyclicality of fiscal policy. When a negative TFP shock is more persistent (rho_A rises from 0.42 as in Mendoza 1991 toward 1.0), consumption falls more sharply and for longer, shrinking the tax base substantially. This forces the fiscal authority to raise taxes more aggressively in recessions, increasing procyclicality. The half-life of a TFP shock with rho_A = 0.95 is close to seven quarters, versus less than a quarter at rho_A = 0.42. Aguiar and Gopinath (2007) motivate the use of high persistence as a distinguishing feature of emerging market business cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are the two types of financial frictions — market incompleteness and debt-elastic spreads — distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Asset market incompleteness refers to the dimension of available financial instruments (financial autarky: none; incomplete: risk-free bond; complete: full set of state-contingent claims). The debt-elastic spread (governed by phi_c and phi_g) captures the steepness of the supply of external funds, which can be high even when access to a bond market exists. The authors note these are not isomorphic: Fernandez and Gulan (2015) provide microfoundations for the debt elasticity in an environment with defaultable private debt and asymmetric information, holding market incompleteness constant. Both frictions independently amplify business cycles and procyclicality, but the paper treats them separately in both calibration and empirical proxies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three propositions from the static model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 1: Government spending is acyclical under complete markets and strictly procyclical under financial autarky, regardless of the values of sigma_c and sigma_g. Proposition 2: Tax rates are acyclical under complete markets. Under financial autarky, tax rates are acyclical if sigma_c = sigma_g, countercyclical (positive correlation with output) if sigma_c &amp;lt; sigma_g, and procyclical (negative correlation with output) if sigma_c &amp;gt; sigma_g. Proposition 3: Under financial autarky, the procyclicality of government spending increases with output volatility. If taxes are procyclical (sigma_c &amp;gt; sigma_g), tax procyclicality also increases with output volatility. Under complete markets, output volatility has no effect on fiscal cyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the moment-matching exercise and what does it conclude?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The exercise calibrates four parameters — TFP volatility (sigma_A), TFP persistence (rho_A), the government consumption elasticity (sigma_g), and the debt-spread elasticity (phi) — to minimize a quadratic loss function over the four targeted moments: standard deviations of income and private consumption, and correlations of taxes and government spending with real GDP, using non-OECD country data with balanced panels of more than ten consecutive annual observations. The best-fit parameters are sigma_g = 0.25, phi = 1, and rho_A = 0.95. The model matches the sign and approximate magnitude of the four targeted moments and also replicates the untargeted positive comovement of the private-to-public consumption ratio with output. It accounts for only about one-tenth of observed government spending volatility and one-fifth of tax volatility, suggesting other sources of fiscal variation beyond Ramsey dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are welfare costs calculated and what are the magnitudes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare costs are computed in the Lucas (1987) tradition: they equal the permanent share of steady-state consumption that households in a frictionless economy (no shocks) would need to forgo to achieve the same lifetime utility as households in the economy with TFP shocks and varying degrees of fiscal procyclicality induced by different values of phi. Using 100,000 simulated quarters with sigma_g = 0.5, sigma_c = 1, sigma_A = 0.0129, and rho_A = 0.95, welfare costs rise from approximately 0.015 percent of lifetime consumption when phi is near zero to approximately 0.03 percent at the calibrated non-OECD value of phi = 0.125 — nearly doubling as procyclicality increases. The paper acknowledges that higher phi also imposes other costs beyond procyclicality per se.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What empirical proxies are used and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Asset market incompleteness is proxied by four indices from Fernandez et al. (2016) covering de jure restrictions on capital inflows and outflows across 32 transaction types and 10 asset classes for 1995-2015: overall inflow restrictions (kai), outflow restrictions (kao), bond inflow restrictions, and bond outflow restrictions. Each index ranges from 0 to 1. All four indices are higher for non-OECD countries than OECD by an order of magnitude, with the null of equality statistically rejected. For debt-spread elasticity, the paper estimates the model&amp;rsquo;s functional form (spread regressed on an exponential function of debt-to-output) using panel fixed effects, with spreads proxied by EMBIG for non-OECD, T-bill spreads over German Bunds for EU-OECD, and UIP-implied spreads for other OECD. Using public debt, the elasticity for non-OECD is phi = 0.125 (significant at 1 percent) versus 0.002 for OECD (insignificant). GDP volatility (standard deviation of HP-filtered real GDP) is 3.28 for non-OECD versus 1.47 for OECD.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to Cuadra et al. (2010) and Riascos and Vegh (2003)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Riascos and Vegh (2003) showed in a calibrated model that incomplete markets can explain procyclical government spending, but their model faced government borrowing at the risk-free rate across all states, which Cuadra et al. argued prevented the model from generating negative output-tax rate correlations. Cuadra et al. (2010) incorporated both incomplete markets and sovereign default risk, showing that their combination yields procyclical fiscal policy on both spending and revenue sides. This paper argues that Cuadra et al.&amp;rsquo;s assessment left the mistaken impression that incomplete markets per se are insufficient for procyclical taxes. The current paper shows this impression is wrong: standard incomplete markets without default risk yield procyclical tax rates when the empirically-validated condition sigma_c &amp;gt; sigma_g holds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanism implies that reducing financial frictions — either by completing asset markets or by flattening the supply of external funds — would moderate fiscal procyclicality and generate Lucas-type welfare gains. Concrete instruments include: sovereign wealth funds that allow self-insurance in good times; contingent credit lines with international financial institutions that provide access to funds in bad times; and structural fiscal rules (as in Chile&amp;rsquo;s structural balance rule) that force saving in booms, effectively completing markets through institutional commitment. The scope condition is that these gains are relevant for non-OECD countries characterized by high capital controls, steep debt-elastic spreads, and volatile output — not for OECD economies where markets are already more complete and fiscal policy is acyclical or countercyclical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main limitations acknowledged by the paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is deliberately parsimonious and accounts for only about one-tenth of observed government spending volatility and one-fifth of tax rate volatility. Additional shocks beyond TFP and world interest rate variation — including political economy forces, commodity price cycles, and demand shocks — are clearly relevant. The model also only accounts for a fraction of the private consumption-output correlation, suggesting missing amplification mechanisms. The paper does not structurally identify the model from micro-data and relies on moment matching over a grid rather than formal estimation. The welfare cost calculation attributes all welfare loss to fiscal procyclicality, but higher phi also raises the cost of debt in ways unrelated to fiscal cyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of political economy explanations, and does this paper displace them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper presents the financial frictions explanation as complementary to rather than a replacement for political economy explanations (such as Tornell and Lane 1999&amp;rsquo;s voracity effect or Alesina et al. 2008&amp;rsquo;s Leviathan-starving hypothesis). The paper&amp;rsquo;s claim is narrower: from an applied theory perspective, incomplete markets alone are sufficient to generate the stylized facts, so additional ingredients such as sovereign risk or limited commitment are not required to explain the basic puzzle. Whether political economy or financial frictions are quantitatively more important in explaining the cross-country variation in fiscal cyclicality remains an open question.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Procyclical fiscal policy&lt;/strong&gt;: In this paper&amp;rsquo;s usage, government spending is procyclical when it rises in good times and falls in bad times (positive correlation with output), and tax policy is procyclical when tax rates fall in good times and rise in bad times (negative correlation between tax rates and output). The paper stresses that the ratio g/y is not an appropriate cyclicality measure because y is endogenous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption preference channel&lt;/strong&gt;: The mechanism by which households&amp;rsquo; relative preference for private over public consumption (sigma_c &amp;gt; sigma_g) causes private consumption to expand proportionally more than government spending in good times, widening the tax base relative to spending needs and allowing the fiscal authority to cut tax rates procyclically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption smoothing channel&lt;/strong&gt;: The countervailing mechanism present in the DSGE model: when households can borrow at relatively low cost to smooth consumption, adverse TFP shocks cause a smaller fall in the tax base, reducing the government&amp;rsquo;s need to raise taxes in recessions. This channel works against tax procyclicality and is weaker when the debt-elastic spread is steep.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt-elastic interest rate spread (phi)&lt;/strong&gt;: A country-specific premium on external borrowing that increases with the stock of debt, following the Schmitt-Grohe and Uribe (2003) formulation. In this paper, phi governs the slope of the supply of external funds and proxies for the severity of financial frictions distinct from the dimension of market incompleteness. Non-OECD countries are estimated to have phi = 0.125, compared to 0.002 for OECD.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial autarky&lt;/strong&gt;: The polar case in which neither households nor the government can buy or sell financial securities internationally; all financial transactions must be within the country, so the domestic interest rate adjusts endogenously to clear markets. In the model, this case delivers the strongest procyclicality, equivalent to very high phi.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ramsey optimal fiscal policy&lt;/strong&gt;: The paper solves for the fiscal policy (tax rates and government spending) that maximizes household welfare subject to the government&amp;rsquo;s budget constraint and private sector implementability conditions. This is used rather than an ad-hoc fiscal rule, so procyclicality is an optimal response to frictions rather than a policy failure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lucas-type welfare cost&lt;/strong&gt;: Measured here as the permanent fraction of steady-state consumption that a household in a shock-free economy would forgo to achieve the same lifetime utility as a household in the stochastic economy with TFP shocks and a given level of debt-elastic financial friction. The paper reports that this cost nearly doubles as phi rises from near zero to the calibrated non-OECD value of 0.125.&lt;/p&gt;</description></item><item><title>Property rights, fiscal capacity, and social capacity: The lasting impact of the Taiping Rebellion</title><link>https://macropaperwarehouse.com/papers/property-rights-fiscal-capacity-and-social-capacity-the-lasting-impact-of-the-taiping-rebellion/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/property-rights-fiscal-capacity-and-social-capacity-the-lasting-impact-of-the-taiping-rebellion/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: How do civil wars affect long-term development, and through which institutional mechanisms? The paper studies the Taiping Rebellion (1850-1864) in Qing China, one of history&amp;rsquo;s deadliest civil wars (at least ~20 million deaths, with some estimates of 70-100 million), as a critical juncture in China&amp;rsquo;s path to modernity. It matters because the rebellion generated large, persistent regional institutional variation that can help explain what the authors call the &amp;ldquo;Intra-China Divergence&amp;rdquo; — regional GDP-per-capita gaps as large as 27-to-1 (Dongguan vs. Tianshui, 2010) that rival the world&amp;rsquo;s largest inter-regional gaps.&lt;/p&gt;
&lt;p&gt;Data and design: A prefecture-level (occasionally county-level) panel covering 266 prefectures in China proper (1820 delineation). 55 prefectures fell under Taiping control (treatment) — split into 37 &amp;ldquo;Early Taiping&amp;rdquo; prefectures (occupied up to 1859, in Anhui/Jiangxi/Hubei, ambiguous land rights) and 18 &amp;ldquo;Late Taiping&amp;rdquo; prefectures (occupied from 1860, in Jiangsu/Zhejiang, stronger land rights) — and 211 control prefectures. Population is observed at seven points (1820, 1851, 1880, 1910, 1953, 1982, 2000). The core strategy is difference-in-differences (1820 reference year, prefecture and year fixed effects), supplemented by propensity-score matching (135-prefecture matched sample), a spatial autoregressive (SAR) model, and an instrumental-variable strategy using the longitude of the prefectural seat (motivated by the Taiping Navy&amp;rsquo;s eastward-along-the-Yangtze military strategy; first-stage F-statistics above 20).&lt;/p&gt;
&lt;p&gt;Main quantitative findings (with scope conditions): (1) Population: The rebellion caused large, permanent population losses. The Taiping DID coefficient is -0.45 in 1880 (a 36% lower population growth rate vs. control) and -0.51 in 1953 (40% lower) — no convergence. Crucially, in the matched sample Late Taiping areas recovered (no significant long-run population gap vs. control) while Early Taiping areas did not (an immediate ~30% drop in 1880 plus further decline). (2) Property rights: In 1915 county data, the idle-land share is 3.6 percentage points higher in Early Taiping than control counties, while Late Taiping is not significantly different from control — supporting the property-rights hypothesis. (3) Fiscal capacity (likin): Taiping areas collected ~12 times (e^2.5) as much likin per 1,000 sq km as control areas in 1869-1879, still 3.7 times as much in 1922-1925. Late Taiping areas had even higher intensity (22.2x in 1869-1879; 6.1x in 1922-1925) than Early Taiping (9.0x; 2.7x). (4) Social capacity (charities): On average the rebellion had no significant effect, but Late Taiping areas saw charity growth ~56 percentage points (44 log points) above control by 1880, rising to ~78 percentage points (58 log points) by mid-20th century. (5) Long-term development: Driven entirely by Late Taiping areas — 1982 agricultural+industrial output per capita 90% higher (64 log points), 2010 GDP per capita 87% higher (63 log points), and 2010 fiscal revenue per capita 203% higher (111 log points) than control; Early Taiping is statistically indistinguishable from control. Late Taiping counties also show higher post-1895 industrial firm entry. (6) Civic outcomes and resilience: Using CGSS 2010, Late Taiping residents show higher trust in personal networks and greater civic engagement (political attention, local participation). During the Great Famine (1959-1961), Taiping areas had 6.9% larger survivor cohorts; the effect is 28% stronger in Late Taiping (8.4%) than Early Taiping (6.5%).&lt;/p&gt;
&lt;p&gt;Implications: Violent conflict can leave lasting positive institutional imprints — through property rights, decentralized local fiscal capacity (&amp;ldquo;war made the state&amp;rdquo; at the local level), and elite-led social capacity — conditional on favorable initial conditions (strong gentry, wealthier commercial regions). The authors argue cultivating civil society and social capacity could yield large payoffs given China&amp;rsquo;s strong-state/weak-society configuration.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline is a difference-in-differences comparing Taiping vs. control prefectures over 1820-2000, with prefecture and year fixed effects and 1820 as the reference year. Identification rests on parallel pre-trends: the Taiping coefficient in 1851 (pre-rebellion) is small and insignificant, indicating no differential selection conditional on controls. The main threats are: (i) the binary Taiping measure aligning with provincial boundaries and picking up broad regional dynamics; (ii) control-group contamination because some control prefectures were temporarily conquered (but not governed) by the Taiping Army; (iii) spatial spillovers between neighbors (Tobler&amp;rsquo;s law / Kelly 2019 critique); (iv) omitted subsequent historical events; and (v) omitted variables differing systematically between treated and control areas. The authors address these with dosage measures (battles, occupation months), matching, a SAR model, an IV (longitude), explicit controls for the Taiping conquest, an adjacent-treatment indicator, leave-one-province-out checks, and controls for many other historical events.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the instrumental-variable strategy work and why might longitude be valid?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Longitude of the prefectural seat instruments for the Taiping dummy. Relevance: the Taiping leaders&amp;rsquo; July 1852 military plan was to march eastward along the Yangtze, capture Jiangning (Nanjing), and expand from there using their dominant navy — so eastern (higher-longitude) prefectures were far more likely to fall under Taiping rule (Table 1 confirms Taiping prefectures have significantly larger longitudes; first-stage F-statistics above 20, Shea&amp;rsquo;s partial R-squared above 0.1). Exclusion: prefecture fixed effects absorb time-invariant geographic advantages, and year-dummy interactions with key geography (distances to coastline, Grand Canal, Yangtze) allow flexible time-varying geographic effects; conditional on these, longitude is argued to be excludable. IV estimates are larger in magnitude than OLS but qualitatively confirm a persistent negative population effect (robust to Anderson-Rubin weak-IV inference). The authors caution that omitted determinants correlated with longitude cannot be fully ruled out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the four hypotheses and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Property-rights hypothesis: Late Taiping areas (post-1860 &amp;lsquo;direct tenant payment&amp;rsquo; system creating de facto/de jure tenant ownership) had better-defined land rights than Early Taiping areas (collapsed landlord system, lost deeds, anti-rent movements), so should have less idle land and faster population recovery — tested via the 1915 idle-land cross-section and the Early-vs-Late population DID. (2) Likin-as-fiscal-capacity hypothesis: Qing fiscal decentralization and the likin tax (introduced 1853) strengthened local fiscal capacity, persistently higher in Taiping (especially Late Taiping) areas — tested via the likin-intensity DID. (3) Social-change hypothesis: elite-led militias and reconstruction spurred charities (&amp;lsquo;benevolent halls&amp;rsquo;/shantang) as bridging social capital, especially in Late Taiping areas — tested via charity-stock DID and by adding charities as a mediator in long-term regressions. (4) Social-cohesion-and-civic-engagement hypothesis: forged social capital persists, raising modern trust/civic engagement and reducing Great Famine deaths — tested via CGSS 2010 and famine-survivor cohort ratios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central heterogeneity is Early vs. Late Taiping. Early Taiping areas (Anhui/Jiangxi/Hubei) suffered permanent population loss, higher idle land (+3.6pp), only modest likin gains, no charity growth, no long-term development advantage, and weaker famine resilience. Late Taiping areas (Jiangsu/Zhejiang) recovered population, had no excess idle land, far higher likin intensity (22x early period), large charity growth (+56 to +78pp), strong long-term development gains (90%/87%/203% in output/GDP/fiscal revenue), higher modern trust and civic engagement, and the strongest famine resilience (8.4% vs 6.5%). Industrialization heterogeneity is also temporal: no Early/Late firm-entry difference before 1895, but after the 1895 Treaty of Shimonoseki liberalized private industry, Late Taiping counties had more entry and Early Taiping fewer.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For the population results: dosage interactions (log battles, log occupation months); excluding six most-intense-fighting prefectures (Wuchang, Songjiang, Anqing, Jiangning, Suzhou, Hangzhou); controlling for newly selected jinshi (civil-service quota channel); a SAR spatial model (after Pesaran cross-sectional-dependence tests); PSM matched sample; longitude IV with Anderson-Rubin inference; controls for seven other historical events (Guangxu Drought, Hui Revolt, Nian Rebellion, early-Republic conflicts, Sino-Japanese War, Chinese Civil War, missionary activity); explicit controls for Taiping conquest vs. regime; an adjacent-treatment indicator (Butts 2021) for spillovers; and leave-one-province-out exclusion. Long-term development results add SAR, matching, historical-event controls including the Cultural Revolution, and an &amp;lsquo;intermediate-term&amp;rsquo; 1930s industrialization check. Famine results are robust to alternative famine-severity measures, SAR, matching, and historical-event controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the mediation analysis handled and what does it show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors add likin intensity (1880) and average charities (1880-1941) to cross-sectional long-term regressions, explicitly flagging these as endogenous &amp;lsquo;bad controls&amp;rsquo; (Angrist-Pischke 2009; Imai et al. 2011) to be interpreted cautiously as descriptive mediation. Findings: a one-SD increase in likin intensity is associated with +1.7pp middle-school completion, +4.8pp literacy, +5.3% schooling, and +12.2% (11.5 log points) GDP per capita in 2010. A one-SD increase in charities is associated with +15% 1982 output, +20% 2010 GDP, and +55% 2010 fiscal revenue per capita. Once charities are netted out, Late Taiping advantages in output, GDP, and fiscal revenue are attenuated by about 17%, 14%, and 22% respectively — highlighting the social-capacity channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the Great Famine resilience result connect to the rebellion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Famine severity is measured by &amp;lsquo;Famine Control&amp;rsquo; = ratio of cohort size born during the famine (1959-1961) to cohort size born pre-famine (1954-1957) from the 1990 census 1% sample (higher = less severe). Taiping areas had a 6.9% larger survivor cohort than non-Taiping; the effect is 8.4% in Late Taiping vs. 6.5% in Early Taiping. Back-of-envelope, the Late Taiping experience would have &amp;lsquo;saved&amp;rsquo; ~31,374 people in an average prefecture (17% of the 1959-1961 cohort) vs. ~24,145 (13%) for Early Taiping. Controlling for political radicalism (reverse party-member density, -1*PMD, after Yang 1996) does not change the result. The mechanism: higher social capital made local officials more sympathetic/less radical in grain procurement and citizens better able to act collectively (paralleling Cao-Xu-Zhang 2022 on clan density and Hu-Yao-You 2023 on home-county officials).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior Taiping studies examined narrower consequences: civil-service exam quotas (Li 2014), demographic and industrialization effects (Li and Ma 2016), migration and public goods (Hao and Xue 2017), and late-Qing power distribution (Bai, Jia, and Yang 2023). None addressed the rebellion&amp;rsquo;s enduring impacts on modern development, social trust, and Great Famine responses, nor the property-rights/fiscal-capacity/social-capacity mechanism triad. It complements Xue (2021) on Qing charities, generalized trust, and political participation, but extends to development outcomes. Against the European state-building literature (war strengthens central state capacity via centralization), this paper&amp;rsquo;s distinctive claim is that the Taiping Rebellion strengthened LOCAL fiscal capacity through DECENTRALIZATION, and expanded local social capacity that constrained the central state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The benefits of war-induced institutions are conditional, not universal: they appeared chiefly in Late Taiping areas with a strong gentry class and favorable initial conditions for modern sectors (the wealthier, more commercial Lower Yangtze). The likin/fiscal-capacity benefits are explicitly stated to be conditional on strong gentry and good modern-sector initial conditions. The broad implication is that, given China&amp;rsquo;s very strong state but still weak society today, cultivating civil society and strengthening social capacity could yield particularly large long-term payoffs. The authors also caution (Appendix F.1) that likin could be distortionary taxation rather than fiscal capacity, arguing the fiscal-capacity interpretation is more relevant for long-term development.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What significant caveats does the paper acknowledge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Long-term mechanisms cannot be exhaustively identified — likin and charities are endogenous outcomes, so mediation magnitudes are descriptive, not causal. History contains near-infinite interrelated events, so confounding cannot be fully eliminated (a fundamental limitation of all history-based work). The IV may have omitted correlates of longitude. Some 2SLS estimates for development outcomes were largely insignificant. The charity-stock measure assumes charities persisted once founded (no closure dates in the data). On property-rights persistence: using 2005 World Bank Enterprise Survey data they find no association between modern firms&amp;rsquo; perceived property-rights protection and Taiping regimes, suggesting the channel works through income effects rather than persistence of property rights per se.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Early vs. Late Taiping areas&lt;/strong&gt;: Early Taiping = prefectures occupied by the rebels up to 1859 (Anhui, Jiangxi, Hubei), where the old landlord system collapsed and land rights stayed ambiguous; Late Taiping = prefectures occupied from 1860 (Jiangsu, Zhejiang), where the Taiping introduced a &amp;lsquo;direct tenant payment&amp;rsquo; (作佃交粮) system and issued new deeds, granting tenants de facto/de jure ownership. This distinction is the paper&amp;rsquo;s central source of institutional variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Likin (lijin)&lt;/strong&gt;: A local tax on trade and commerce introduced in 1853 (a transit tax on travelling merchants&amp;rsquo; goods plus a business tax on resident merchants), collected in a decentralized, province-specific way. In the paper it is the operational measure of local fiscal capacity (likin revenue per 1,000 sq km), not central state capacity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social capacity&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the ability of society to act collectively, constrain the state, and empower its members — operationalized empirically by the stock of local charity organizations (&amp;lsquo;benevolent halls&amp;rsquo;/shantang) that functioned as bridging social capital across classes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Likin-as-fiscal-capacity hypothesis&lt;/strong&gt;: The claim that the rebellion-induced likin system durably raised LOCAL fiscal capacity (an instance of Tilly&amp;rsquo;s &amp;lsquo;war made the state&amp;rsquo; operating locally rather than centrally), which improved public-goods provision and long-run development — conditional on strong gentry and favorable modern-sector initial conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stationary bandit (applied to Late Taiping rulers)&lt;/strong&gt;: Borrowing Olson (1993): in Late Taiping areas the consolidated, longer-horizon Taiping regime behaved like a stationary bandit, lowering effective tax rates, encouraging land registration, and securing tenant property rights to expand the tax base and promote production, unlike the looting/confiscation of the early stage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Famine Control&lt;/strong&gt;: The paper&amp;rsquo;s local famine-severity measure: the ratio of the cohort born during the Great Famine (1959-1961) to the cohort born pre-famine (1954-1957) in the 1990 census; a higher value means less severe famine and more survivors, and it is less vulnerable to government understatement of famine deaths.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intra-China Divergence&lt;/strong&gt;: The authors&amp;rsquo; term for China&amp;rsquo;s persistent, very large regional disparities in economic performance (up to 27-to-1 in GDP per capita) despite all regions historically sharing similar Malthusian income levels — the macro puzzle the rebellion&amp;rsquo;s institutional legacy helps explain.&lt;/p&gt;</description></item><item><title>Remote Work and City Structure</title><link>https://macropaperwarehouse.com/papers/remote-work-and-city-structure/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/remote-work-and-city-structure/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Monte, Porcher, and Rossi-Hansberg ask why remote work surged abruptly and permanently after COVID-19 despite information-technology advances raising it only marginally between 1980 and 2019, why the change was so heterogeneous across cities, and what the welfare consequences are. Their answer is a coordination mechanism: working downtown (the CBD) yields productive interactions with other in-office workers but entails commuting/congestion costs, while remote work avoids those costs but forgoes agglomeration benefits. Because workers do not internalize the spillovers they confer, a worker prefers the office only if others commute too — generating, in a dynamic discrete-choice model with idiosyncratic preferences and fixed switching costs, the possibility of MULTIPLE stationary equilibria with different permanent commuter shares. A temporary shock (the pandemic) that drives commuters near zero can then select the low-commuting equilibrium permanently.&lt;/p&gt;
&lt;p&gt;The model is a dynamic monocentric city (disk-shaped, radially symmetric CBD, absentee landlords, Cobb-Douglas utility, Gumbel idiosyncratic shocks). Multiplicity arises (Proposition 4.3) when agglomeration forces are strong enough — the net strength delta + xi exceeds a threshold above theta + gamma/(2mu) — AND remote-work productivity relative to office productivity z/A lies in an intermediate &amp;ldquo;cone of multiplicity&amp;rdquo; (neither too low nor too high). The authors quantify city-specific parameters for U.S. CBSAs using pre-2019 data (Census/ACS 1980-2023, NLSY79 panel of 4,147 individuals 1998-2022, SafeGraph cell-phone mobility, Zillow ZHVI zip-code house prices). Estimation: transition elasticity s = 0.30 (elasticity of transitions into remote work = 3.09), fixed switching cost F = 1.78 (equivalent to giving up 83% of a year&amp;rsquo;s earnings); agglomeration externality delta with mean 0.067 (SD 0.022, 619 CBSAs); the amenity-vs-congestion difference xi - theta is statistically insignificant and set to zero.&lt;/p&gt;
&lt;p&gt;Stylized facts. Predicted remote-work share (controlling for composition) rose in the ACS from under 1% (1980) to 2.6% (2019), jumped to 12% (2020), peaked at 15% (2021), and fell to 11% (2023); NLSY shows a parallel path (1.4% in 1998 to 3.7% in 2018, 9.2% in 2020, 7.8% in 2022). The remote-work wage premium rose steadily but did NOT jump post-2018: ACS discount of 44.5% in 1980 became a 6.5% premium by 2022; NLSY discount fell from 18.5% (2000) to 3.1% (2022). A stable premium alongside a sudden quantity jump argues against pure productivity/preference shocks.&lt;/p&gt;
&lt;p&gt;Mobility/housing facts. All cities dropped to ~20% of pre-pandemic CBD trips in spring 2020 (about a 75% drop, unrelated to city size). Recoveries diverged: the 25 largest CBSAs (employment &amp;gt; 1.5M) stabilized at ~60% of January-2020 trips, while the 663 smallest (&amp;lt; 150K) returned fully to pre-pandemic levels by early 2021. New York and San Francisco stabilized near 40%; Madison, WI recovered fully. House-price distance gradients flattened ~0.01 everywhere by January 2021; the flattening persisted and stabilized around 0.095 by end-2024 in large cities but reversed in small ones.&lt;/p&gt;
&lt;p&gt;Results and welfare. Of 278 estimated CBSAs, 208 were inside their cone of multiplicity pre-pandemic; larger cities are systematically more likely to be inside (probit on log employment significant). The cone indicator predicts trip shortfalls (R-squared 0.144 alone, retaining significance with controls) and gradient flattening. Welfare: comparing high- vs low-commuting stationary equilibria for the 208 cone cities, the loss from switching is positive but modest — mean 2.3%, median 2.2%, range 1.2% to 4.0% (Table 3). Average wages fall sharply (15-35%) but option-value and commuting-cost savings offset most of it; net strength delta - gamma/(2mu) predicts the loss with R-squared 0.85. Cities with trips at 60% or less of pre-pandemic levels have an average welfare loss of 2.7%.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core economic mechanism, and how does it generate multiple equilibria?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Office work confers productivity spillovers and CBD amenity value that rise with the mass of in-office workers (L-tilde-c), but workers do not internalize these external benefits. So each worker prefers the office only if enough others commute. In a dynamic setting with idiosyncratic Gumbel preference shocks and fixed switching costs F, this coordination can produce multiple stationary equilibria: a high-commuting and a low-commuting one (with an unstable equilibrium E2 between them). Multiplicity requires (Prop 4.3) static agglomeration forces (delta + xi) above a threshold eta_min &amp;gt; theta + gamma/(2mu), AND relative remote productivity z/A in an intermediate interval Z — the &amp;lsquo;cone of multiplicity.&amp;rsquo; If z/A is too low, the high-commuting equilibrium is unique; if too high, only the remote equilibrium survives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/quantification strategy and its main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To avoid taking a stand on which equilibrium generated the data, the authors rely ENTIRELY on pre-2019 data (when every city was plausibly in the high-commuting equilibrium) and on model relationships that hold in any equilibrium. Four steps: (1) transition elasticity s and cost F from NLSY79 transition probabilities via a CCP/log-linear regression (eq. 21), using past wage ratios as an instrument for future ratios to address measurement error / forward-looking expectations (IV eta0 = -0.47, eta1 = 3.09); (2) agglomeration externality delta_j from commuter-wage changes instrumented by 1980 occupational composition interacted with economy-wide occupation-specific commuter-share changes (shift-share IV, eq. 26-28), with five industry groups; (3) remote/office productivity z_j, A_j from occupation-level remote-work premia (NLSY, 22 occupation groups) reweighted by city occupation shares; (4) transport-cost elasticity gamma_j from CBSA-specific housing rent-distance gradients (ACS block-group rents 2015-2019). Main threats: selection of workers into remote work on unobservables (addressed by NLSY individual fixed effects), endogeneity of commuter shares to local productivity shocks (addressed by the shift-share IV), and the assumption that all cities were in the high-commuting equilibrium in 2019; tau_j is calibrated to match each city&amp;rsquo;s 2019 Lc/L.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the authors rule out competing explanations (pure productivity/preference shocks, congestion, establishment size, occupational shift)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;National productivity/preference shocks: would be expected to leave some lasting imprint even in small cities, but small CBSAs reverted fully, and at least 34% of jobs remain teleworkable even in fully-reverting cities (Dingel-Neiman teleworkable share ranges 25-55% across CBSAs), so low telework capacity cannot explain reversion; cities with permanent 40%+ trip declines have only a modestly higher 43% teleworkable share. The wage premium shows no differential evolution across high- vs low-teleworkable occupations over the pandemic. Congestion: if congestion drove the shift, large cities should show lower CBD propensity pre-pandemic, but the opposite holds (30.6% of trips to CBD in large vs 15.6% in small CBSAs in late 2019). Establishment concentration: employment is LESS concentrated in smaller cities, so big-employer return-to-office decisions cannot explain reversion. Occupational shift: teleworkable employment share rose only ~5% post-pandemic, and rose MORE in smaller CBSAs (7.9%) than larger (5.8%) by end-2023, the wrong direction to explain the heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across cities is documented and how does it map to the theory?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Large cities (high agglomeration, high net strength delta - gamma/(2mu), which rises with size: doubling size raises net strength ~0.004 off a mean 0.049) are disproportionately inside the cone of multiplicity (208 of 278 estimated cities in-cone; probit on log employment positive and significant). These cities show permanent CBD-trip declines (stabilizing ~60% for the 25 largest) and persistent gradient flattening (~0.095 by 2024). Small cities are mostly outside the cone, with unique equilibria, and revert fully. The cone indicator is also positively associated with delta_j and z_j/A_j and negatively with gamma_j, as the theory predicts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Estimates of s and F are similar using restricted-use county-geocoded NLSY and under an alternative city-partition definition (two days/week remote). Main results are robust to lower delta_j and higher gamma_j calibrations (Appendix A.17). A CES production function in remote/in-person labor yields very large substitution elasticities, motivating the linear specification. An endogenous-housing-supply model yields a nearly identical rent gradient (because commuters were a high share of employment pre-2020). Office-trip-only versions of the mobility figures (workplace visits) show similar patterns. The cone indicator retains significance in Table 2 after adding teleworkable share, pre-pandemic CBD-trip share, industry value-added shares, and total employment; results hold for an alternative binary &amp;lsquo;returned to office&amp;rsquo; indicator 1back(5,20). Multiple DYNAMIC equilibria were not found in numerical exercises (Appendix B.6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Unlike Davis, Ghent &amp;amp; Gregory (2024) (remote productivity via adoption externalities), Parkhomenko &amp;amp; Delventhal (2024) (amenity value of remote work), and Duranton &amp;amp; Handbury (2023) (exogenous changes in who may work remotely), this paper does NOT rely on exogenous productivity or amenity/preference shocks to explain the large persistent jump. Instead a temporary commuter shock SELECTS among pre-existing multiple equilibria. Liu &amp;amp; Su (2023) document a falling urban wage premium for remote-amenable occupations (consistent with weaker agglomeration). The paper&amp;rsquo;s documented divergence of residential rent-distance gradients between large and small cities is, to the authors&amp;rsquo; knowledge, a new fact, interpreted structurally. Owens, Rossi-Hansberg &amp;amp; Sarte (2020) similarly use coordination/residential externalities (Detroit neighborhoods).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the coordination failure operates partly OUTSIDE firm boundaries, individual firms&amp;rsquo; return-to-office mandates may be insufficient to restore the high-commuting equilibrium. City-level interventions — taxing remote work or subsidizing commuting — could in principle move a city back, since the only active externality in the quantification is a positive agglomeration externality (implying too little commuting relative to the efficient benchmark in all equilibria). However, the authors stress these welfare effects and the effectiveness of policy remain open questions; their welfare numbers depend on estimation details and the abstraction from a system-of-cities with migration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and abstractions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model treats each city as a CLOSED economy: no inter-city migration, trade, or investment links, though the authors note large cities show a small differential population drop (Appendix A.9), attributed to low migration elasticities. Remote work is &amp;lsquo;partial&amp;rsquo; with a FIXED fraction mu = 3/5 of days at home, not chosen. Occupational heterogeneity is abstracted from (justified by rare occupation transitions). The amenity (xi) vs congestion (theta) externalities are not separately identified and set to zero (difference insignificant). Spillovers are not internalized by firms in the model. The welfare ranking (high-commuting preferred) is intuited from the single positive externality rather than formally proven.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is there a discrepancy between the abstract&amp;rsquo;s welfare figures and per-city numbers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The abstract and revised Table 3 report a mean welfare loss of 2.3% (median 2.2%, range 1.2%-4.0%) across the 208 cone cities, and state cities with permanently low commuting (60% or less of pre-pandemic trips) experience average losses of 2.3% (2.7% in the text). The introduction additionally quotes specific city losses (about 3.7% for Los Angeles and San Jose, 3.2% for New York, 2.8% for San Francisco, 2% for Phoenix); these are the largest cities and lie within or near the upper part of the distribution, consistent with welfare loss rising in net agglomeration strength (R-squared 0.85 of loss on delta - gamma/(2mu)).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;!-- flags: Welfare magnitudes: the final/revised headline figures are mean 2.3%, median 2.2%, range 1.2-4.0% (Table 3, 208 cities). The Introduction also cites larger per-city losses (3.7% LA/San Jose, 3.2% NYC, 2.8% SF, 2% Phoenix); these are consistent with the distribution (loss rises with net agglomeration strength) but appear to be from a specific large-city calibration table, not the summary distribution. Reported both, flagged for reviewer., Paper is a Nov 2025 revision of NBER WP 31494 (orig. July 2023); some figures span data through end-2024/Nov-2024, later than the original draft. --&gt;</description></item><item><title>Resource Misallocation in European Firms: The Role of Constraints, Firm Characteristics and Managerial Decisions</title><link>https://macropaperwarehouse.com/papers/resource-misallocation-in-european-firms-the-role-of-constraints-firm-characteristics-and-managerial-decisions/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/resource-misallocation-in-european-firms-the-role-of-constraints-firm-characteristics-and-managerial-decisions/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper investigates why firms in the European Union exhibit wide dispersion in marginal revenue products (MRP) of capital and labor — a direct indicator of resource misallocation — and asks how much aggregate productivity the EU forfeits as a result. The research question is motivated by the persistent productivity gap between the EU and the United States, by evidence that within-country MRP dispersion in Europe has been trending upward since the mid-1990s, and by an institutional context in which the EU single market (launched in 1993) has not eliminated cross-country factor market frictions even three decades later.&lt;/p&gt;
&lt;p&gt;The primary data source is the EIB Investment Survey (EIBIS), a stratified random survey of non-financial enterprises conducted annually since 2016 across all 28 EU member states, covering manufacturing, services, utilities, and construction (NACE categories C–J). The analysis uses three waves (2016–2018), with approximately 12,500 firms per wave and a panel component of roughly 2,000 firms appearing in all three waves. Survey responses are matched to Orbis administrative data; the correlation between log employment in EIBIS and Orbis is 0.91, confirming data quality. MRP of capital (MRPK) is measured as the capital cost share times revenue divided by fixed assets; MRP of labor (MRPL) is the labor cost share times revenue divided by employment. Cost shares are calibrated from OECD STAN and Eurostat national accounts at the country–year–industry level.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a dynamic model of a profit-maximizing firm with Cobb-Douglas production, isoelastic demand, and quadratic adjustment costs. Under the assumption that pure economic profits are small and that the labor output distortion is negligible (following Hsieh-Klenow 2009), the model implies that log MRPK and log MRPL can be approximated by observable average revenue products. The empirical strategy is a Mincerian regression of log MRPK (and log MRPL) on a rich vector of firm-level characteristics — firm demographics, input quality, capacity utilization, investment constraints, dynamic adjustment variables, and financing sources — plus country, industry, and year fixed effects (and their interactions). Because regressors are endogenous, the R² from OLS is interpreted as an upper bound on the share of MRP variance attributable to each factor (formally shown to dominate the IV R²). Marginal R² increments when a variable block is added identify the contribution of that block to the variance in MRP, which is then mapped into productivity gains via the Hsieh-Klenow formula.&lt;/p&gt;
&lt;p&gt;The main quantitative findings are as follows. Raw dispersion is large: the standard deviation of log MRPK is 1.43 and of log MRPL is 1.19 (and 1.63 for log MRPL minus log MRPK), all substantially exceeding comparable US figures (0.98 for capital and 0.58 for labor from Asker et al. 2014 and Bartelsman et al. 2013). The R² in the full regression is 0.14 (without fixed effects) and 0.49 (with country × industry × year fixed effects) for MRPK, and 0.29 and 0.74 respectively for MRPL. Among firm-characteristic blocks, the &amp;ldquo;adjustment&amp;rdquo; (dynamic investment and employment growth) and &amp;ldquo;demographics&amp;rdquo; (firm size, age, subsidiary and exporter status) blocks carry the largest marginal R² contributions; the &amp;ldquo;obstacles to investment&amp;rdquo; block (direct reports of constraints) contributes modestly by comparison. Country fixed effects alone explain R² = 0.052 for MRPK and R² = 0.445 for MRPL, while industry fixed effects alone explain R² = 0.239 for MRPK and R² = 0.268 for MRPL. The combined country–industry–year fixed-effects R² reaches 0.275 for MRPK and 0.611 for MRPL; adding the full interaction yields 0.492 and 0.736 respectively.&lt;/p&gt;
&lt;p&gt;Treating the &amp;ldquo;distortions&amp;rdquo; block of variables as genuine frictions, removing them would raise EU aggregate productivity by more than 40 percent (computed as 1.5 × 1.42 × 0.186 + 0.13 × 2.66 × 0.134 = 0.442). If all variables in X are treated as distortions, the implied gain is approximately 72 percent (0.715 in log points). Removing cross-country inequality in average MRPs (equalizing country fixed effects) would imply a 102 percentage log-point gain in productivity under the Hsieh-Klenow formula; removing barriers between industries and countries could raise productivity by at least 143 percentage log points.&lt;/p&gt;
&lt;p&gt;A Machado-Mata distributional decomposition comparing Germany (σ(log MRPK) = 0.92, σ(log MRPL) = 0.61) and Greece (σ(log MRPK) = 1.64, σ(log MRPL) = 0.91) reveals that the primary driver of Greece&amp;rsquo;s higher dispersion is the &amp;ldquo;prices&amp;rdquo; (regression coefficients reflecting institutional and policy environment), not the &amp;ldquo;endowments&amp;rdquo; (firm characteristics). Giving Greece German institutional &amp;ldquo;prices&amp;rdquo; reduces the counterfactual standard deviation of Greek MRPK from 1.66 to 0.94. This pattern generalizes across EU countries: German b (coefficients) tends to reduce MRPK dispersion for most countries, while German X (firm characteristics) tends to increase it, because Germany has more heterogeneous firms but an environment that prices those characteristics in a way that equalizes returns. This finding constitutes large-scale microeconomic evidence that institutions matter — cross-country differences in MRP dispersion reflect how business, institutional, and policy environments translate firm heterogeneity into outcomes, more than they reflect differences in firm characteristics per se.&lt;/p&gt;
&lt;p&gt;The policy implication is that deep institutional reform — not merely changes in firm composition — is required to narrow EU resource misallocation. The scope condition is that these estimates are upper bounds, and some observed MRP dispersion likely reflects compensating differentials (e.g., higher-quality capital commanding a higher MRPK) rather than pure distortions.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not attempt causal identification. Instead, it uses OLS to estimate equilibrium (Mincerian-type) regressions of log MRPK and log MRPL on firm characteristics plus fixed effects. The key insight is that OLS R² provides an upper bound on the share of MRP variance causally attributable to each regressor, because simultaneity or omitted variables can only inflate OLS R² above the true IV R². The main threats are: (1) endogeneity of regressors — a growing firm facing red tape will have high MRPK and a binding constraint simultaneously, inflating the R² attributed to constraints; (2) classical measurement error in survey responses, which attenuates R² toward zero (so OLS actually understates causal effects in this direction); (3) omitted variable bias via unobserved firm quality (managerial talent, etc.); (4) use of same variables (employment, fixed assets) on both left and right sides, addressed by cross-checking with Orbis data as instruments. The authors argue these threats are mostly conservative — they overstate, not understate, the upper bound.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the theoretical justification for using average revenue products to measure marginal revenue products?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under the assumption that the share of pure economic profits is small (following Basu and Fernald 1997), the optimality conditions of the dynamic model imply that MRPK ≈ (capital cost share) × (revenue / capital) and MRPL ≈ (labor cost share) × (revenue / employment). These are average revenue products scaled by factor cost shares, matching Hsieh and Klenow (2009). The distortion framework further implies that the variance of log MRPK and log MRPL, when distortions are log-normally distributed and uncorrelated, maps directly into the Hsieh-Klenow productivity-loss formula, linking the regression R² to quantitative welfare calculations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of compensating differentials versus true distortions in interpreting the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper emphasizes that not all dispersion in MRPs reflects inefficient distortions. Some dispersion — particularly from &amp;lsquo;quality of capital,&amp;rsquo; &amp;lsquo;capacity utilization,&amp;rsquo; and &amp;lsquo;dynamic adjustment&amp;rsquo; — may reflect compensating differentials: firms that invest in higher-quality capital rationally face higher costs, demanding a higher MRPK in equilibrium, analogous to how more educated workers earn higher wages in a Mincerian framework. If these variables reflect compensating differentials rather than frictions, using &amp;lsquo;raw&amp;rsquo; MRP dispersion overstates misallocation. Conversely, if all variables proxy for distortions, the productivity gains from reform are even larger (72 percent versus 40 percent). The paper presents both interpretations explicitly, making the framework &amp;lsquo;highly portable&amp;rsquo; for different views of what drives observed dispersion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in MRP dispersion is documented across EU countries and industries?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Dispersion is notably lower in Germany (σ(log MRPK) = 0.92, σ(log MRPL) = 0.61) than in Greece (1.64 and 0.91) or smaller countries such as Malta, Luxembourg, and Cyprus. Country fixed effects explain R² = 0.445 of MRPL variation but only R² = 0.052 of MRPK variation, meaning labor is more segmented across countries than capital. Industry fixed effects explain R² = 0.239 for MRPK versus R² = 0.268 for MRPL, indicating capital is more segmented across industries than across countries. Core EU countries (France, Denmark) are relatively insensitive to counterfactual substitution of German coefficients, while periphery countries (Portugal, Ireland) show large movements. Romania, which resembles Slovenia in raw MRPK dispersion, looks much more like the Netherlands after controlling for firm characteristics — illustrating that observed dispersion rankings can be misleading without adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the Machado-Mata decomposition reveal, and how is it implemented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Machado-Mata (2005) decomposition separates the distribution of MRP into an &amp;rsquo;endowments&amp;rsquo; component (due to the values of firm characteristics X) and a &amp;lsquo;prices&amp;rsquo; component (due to the regression coefficients b, which capture how the institutional and policy environment translates X into outcomes). The decomposition draws B = 10,000 bootstrap samples from the empirical distribution of X for each country, combines them with quantile regression coefficients estimated separately for each country, and constructs counterfactual distributions. Applying Greek X with German b reduces Greece&amp;rsquo;s counterfactual σ(log MRPK) from 1.66 to 0.94 — close to Germany&amp;rsquo;s actual 0.92 — while applying German X with Greek b increases dispersion. The main finding is that differences in &amp;lsquo;prices&amp;rsquo; (institutional environment) dominate differences in &amp;rsquo;endowments&amp;rsquo; (firm characteristics) in explaining cross-country variation in within-country MRP dispersion. This pattern holds generally across EU countries: gains from &amp;lsquo;importing&amp;rsquo; German institutions are correlated with poor World Bank Governance Indicators and International Country Risk Guide scores.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the paper&amp;rsquo;s estimates of EU misallocation compare to US benchmarks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The EU standard deviations of log MRPK (1.43) and log MRPL (1.19) substantially exceed comparable US figures of 0.98 for capital (Asker et al. 2014) and 0.58 for labor (Bartelsman et al. 2013). The paper discusses three caveats for this comparison: (1) EIBIS uses revenue rather than value added, which affects dispersion (approximately +0.16 log points for MRPL, -0.21 for MRPK) — insufficient to explain the full gap; (2) survey measurement error is present but small — averaging over multiple waves reduces the standard deviation of log MRPK by only 8–12 percent; (3) EIBIS measures firms (not plants), and since about two-thirds of within-firm MRPK variance occurs across plants within firms (Kehrig and Vincent 2017), the EU–US comparison likely understates the true difference. Qualitatively, the greater EU dispersion is consistent with lower EU aggregate TFP relative to the US.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What specific regression results are reported for individual variable blocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The full R² (without / with country × industry × year fixed effects) is 0.14 / 0.49 for MRPK and 0.29 / 0.74 for MRPL. Among variable blocks, the &amp;lsquo;adjustment&amp;rsquo; (investment, employment growth, past and planned investment) and &amp;lsquo;demographics&amp;rsquo; (size, age, subsidiary, exporter) blocks have the largest marginal R². The &amp;lsquo;obstacles to investment&amp;rsquo; (direct constraint reports) block contributes modestly, with some coefficients not statistically significant. Within regression coefficients (from Table A.4): older, exporting, high-utilization firms have higher MRPK and MRPL; investment is strongly negatively associated with MRPK (movement down the MRPK curve as capital rises) and positively with MRPL (labor becomes relatively scarcer); employment growth is positively associated with MRPK and negatively with MRPL (symmetric logic); credit-constrained status is negatively correlated with both MRPK and MRPL.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper reports: (1) &amp;lsquo;between&amp;rsquo; regressions on multi-year firm averages to reduce transitory variation and measurement error — results are qualitatively similar with slightly larger productivity gains; (2) restricting the sample to firms appearing in all three survey waves (Appendix Table A.5) — qualitatively similar results; (3) estimating equation (4) for each wave separately — similar results; (4) using Orbis employment and investment as regressors instead of EIBIS responses to address mechanical measurement-error correlation — nearly identical results (Appendix Table A.17); (5) replacing log(1+investment) with an indicator for positive investment (Appendix Table A.7) — similar results; (6) using industry-specific rather than country–year–industry cost shares — similar results; (7) confirming that measurement error can account for only a portion of the EU–US dispersion difference (8–12 percent reduction in standard deviation when averaging over waves). The paper also reports separate coefficient estimates for three blocs of EU countries (North/West, South, Center/East) in Appendix Tables A.10–A.16.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from Hsieh and Klenow (2009) and related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper extends Hsieh and Klenow (2009) in several directions. First, while Hsieh-Klenow use administrative census-type data for India and China restricted to manufacturing, this paper uses a consistent cross-country survey covering all sectors in 28 EU countries, enabling direct cross-country comparison. Second, Hsieh-Klenow implicitly assume all MRP dispersion reflects distortions; this paper explicitly distinguishes distortions from compensating differentials and shows the distinction matters quantitatively. Third, this paper develops the Mincerian regression approach to apportion the variance in MRPs across observable factors — analogous to labor economists decomposing wage dispersion — and shows OLS R² provides a valid upper bound without requiring exogenous variation. Fourth, unlike country-level distortion measures (Gamberoni et al. 2016), tight theoretical restrictions (David and Venkateswaran 2017), or specific reforms (Rotemberg 2019), this paper draws on firm-level survey data with minimal restrictions and maintains high external validity. Fifth, the Machado-Mata distributional decomposition adds a new dimension absent from Hsieh-Klenow: decomposing cross-country differences into endowments vs. institutional &amp;lsquo;prices.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The primary policy implication is that EU productivity could rise by more than 40 percent if distortions to resource allocation were removed — and up to 72 percent if all observed MRP variation is attributed to distortions. A more modest goal of equalizing within-industry MRP dispersion across countries (i.e., making Germany and Greece similar within industries) implies gains of approximately 31–53 percent depending on interpretation. The decomposition evidence implies that institutional reform (changing how environments price firm characteristics) is more important than directly changing firm composition. The scope conditions are: (1) these are upper bounds derived from OLS; (2) some dispersion reflects compensating differentials that should not be counted as losses; (3) the EIBIS covers firms with at least 5 employees, so very small firms are excluded; (4) the framework assumes log-normal, uncorrelated distortions and constant returns to scale — relaxing these can increase estimated losses further (Jones 2011); (5) the estimates do not account for firm-level markup heterogeneity, which could overstate or understate other channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper contribute to the literature on measurement error in MRP studies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper shows formally (Appendix D) that classical measurement error in regressors attenuates OLS R² toward zero, so OLS provides a conservative upper bound from this direction. It also shows that averaging across multiple survey waves reduces measurement error while also attenuating transitory adjustment-cost variation, so multi-year averages likely overstate the role of measurement error. Crucially, the paper validates EIBIS against Orbis administrative data, finding a 0.91 correlation for log employment, similar standard deviations of log MRPK (1.44 in Orbis vs. 1.37 in EIBIS) and log MRPL (1.07 in Orbis vs. 1.30 in EIBIS) for matched firms, and a mean absolute log difference in standard deviations of approximately 2 percent across countries. This contributes to the debate initiated by Bils et al. (2017) on whether measured MRP dispersion reflects mismeasurement, and corroborates that surveys can be reliable substitutes for census-type administrative data in cross-country analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find about the role of credit constraints specifically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Credit constraint status (defined as loan rejection, discouragement from applying, or receiving a loan that was too small or too expensive) is negatively correlated with both MRPK and MRPL in the full regression. This is consistent with credit-constrained firms being unable to invest to the point where MRPK is equalized with the cost of capital, but the negative sign also raises the interpretive caveat noted by the authors: cross-sectional equilibrium relationships can have signs inconsistent with causal priors because constraints may be more binding for firms that are already performing poorly. The &amp;lsquo;source of funds&amp;rsquo; block (share of investment from internal vs. external sources, and credit constraint) is grouped with &amp;lsquo;distortions&amp;rsquo; in the paper&amp;rsquo;s preferred decomposition.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Marginal Revenue Product (MRPK/MRPL)&lt;/strong&gt;: In this paper, the marginal revenue product of capital (MRPK) and labor (MRPL) are measured as observable average revenue products — the capital or labor cost share times revenue divided by the stock of capital or employment. Under the paper&amp;rsquo;s model assumptions, these approximate the shadow cost of inputs and serve as the primary measure of firm-level resource allocation efficiency. A firm with a high MRPK relative to its cost of capital is under-capitalized; dispersion of MRPK across firms signals misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Compensating differentials (in the MRP context)&lt;/strong&gt;: The paper adapts the Mincerian concept of compensating differentials from labor markets to the firm side: some observed dispersion in MRPK and MRPL may reflect optimal responses to heterogeneity in input quality, capital utilization, or adjustment dynamics — not inefficient distortions. For example, a firm with state-of-the-art machinery may face a higher MRPK reflecting the quality premium, not a barrier to investment. Because such dispersion is rational, it should be subtracted from productivity-loss calculations rather than counted as welfare-reducing misallocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Machado-Mata decomposition&lt;/strong&gt;: A distributional decomposition technique (Machado and Mata 2005) applied here to attribute cross-country differences in the dispersion of MRPK and MRPL to two components: &amp;rsquo;endowments&amp;rsquo; (the empirical distribution of firm characteristics X in a given country) and &amp;lsquo;prices&amp;rsquo; (the regression coefficients b, which capture how the country&amp;rsquo;s business, institutional, and policy environment translates those characteristics into marginal revenue products). The decomposition constructs counterfactual MRP distributions by combining one country&amp;rsquo;s X with another country&amp;rsquo;s b.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mincerian productivity regression&lt;/strong&gt;: The paper&amp;rsquo;s core empirical framework, modeled explicitly on Mincer&amp;rsquo;s (1958) wage regression: just as wages are regressed on worker characteristics (education, experience) to decompose earnings dispersion, log MRPK and log MRPL are regressed on firm characteristics (demographics, quality, utilization, adjustment, constraints, financing) to decompose MRP dispersion. OLS R² in this regression is an upper bound on the share of MRP variance attributable to each regressor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;EIB Investment Survey (EIBIS)&lt;/strong&gt;: An annual firm-level survey administered by Ipsos MORI on behalf of the European Investment Bank since 2016, covering all 28 EU member states with a stratified random sample of approximately 12,500 non-financial enterprises per wave (minimum 5 employees, NACE C–J). Unique features include consistent cross-country design, merger with Orbis administrative data, and questions on investment plans, capital quality, capacity utilization, perceived obstacles, and financing sources — all directly informative about sources of MRP variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional &amp;lsquo;prices&amp;rsquo; on firm characteristics&lt;/strong&gt;: In the Machado-Mata framework as applied here, &amp;lsquo;prices&amp;rsquo; refer to the country-specific regression coefficients b in the MRP regression — how steeply a country&amp;rsquo;s environment (regulations, institutions, policies) translates a given unit of firm heterogeneity in X into a difference in marginal revenue products. Countries with smaller b magnitudes (like Germany) achieve more equalization of MRPs across heterogeneous firms, reflecting an efficient institutional environment; countries with large b (like Greece) amplify firm-level heterogeneity into large MRP dispersion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Upper-bound R² approach to productivity gains&lt;/strong&gt;: The paper&amp;rsquo;s portable method for quantifying productivity gains from removing a friction: the marginal R² increment in an OLS regression of log MRPK (or log MRPL) when a friction variable is added is an upper bound on the share of MRP variance attributable to that friction. This bound, multiplied by the variance of log MRP and the Hsieh-Klenow productivity-loss formula parameters, gives an upper-bound estimate of the aggregate TFP gain from eliminating that friction. The method does not require exogenous variation or tight structural assumptions.&lt;/p&gt;</description></item><item><title>Returns to experience and the elasticity of labor supply</title><link>https://macropaperwarehouse.com/papers/returns-to-experience-and-the-elasticity-of-labor-supply/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/returns-to-experience-and-the-elasticity-of-labor-supply/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: A large empirical literature uses micro data to estimate the intertemporal elasticity of substitution (IES) of labor supply, a parameter crucial for understanding business-cycle fluctuations in hours and labor-supply responses to tax policy. Standard micro studies, which regress log hours on log wages, typically obtain small estimates (in the range of 0-0.4), leading much of the profession to conclude labor-supply elasticities are small. These studies assume wages evolve exogenously. The authors argue that when wages rise with work experience (learning-by-doing, LBD), the marginal return to an hour of work exceeds the wage because it also includes the discounted increase in all future earnings from added experience. Because the wage is only one component of total remuneration, a given percentage wage increase raises the total marginal return by a smaller percentage, so regressing hours on wages produces a downward-biased estimate of the IES. Critically, the omitted variable (the ratio of total remuneration to the wage) is mechanically related to the wage, so the bias cannot be corrected by instrumental variables or natural experiments.&lt;/p&gt;
&lt;p&gt;Model and strategy: The authors extend a MaCurdy (1981) life-cycle model of consumption and labor supply to include LBD, where the wage equals marginal return to human capital times a human-capital stock that grows with experience. They derive a log-linear labor-supply equation with an extra term capturing future returns to work, which is negatively correlated with the wage. Their key insight: for individuals whose future returns to experience are negligible (the term F approaches zero, e.g., at end of working life or at very high human-capital stocks), the standard regression yields an unbiased IES estimate, allowing them to remain agnostic about the human-capital accumulation process.&lt;/p&gt;
&lt;p&gt;Data: They use daily labor-supply records of Florida spiny lobster trap fishermen from the Florida Fish and Wildlife Conservation Commission, covering the 1986 through 2007 seasons (a 22-year panel), restricted to the first 70 days of each season. Analysis samples are drawn from fishermen active 2001-2005. Wage variation is exogenous and partly predictable because lobster catch rates rise around the new moon (and with rough weather). The moon phase is the key instrument. The preferred sample of &amp;ldquo;retiring fishermen&amp;rdquo; (at least 60 years old, at least 15 years of experience, exiting at season&amp;rsquo;s end) has 50 individuals. A &amp;ldquo;naive&amp;rdquo; full sample has 639 fishermen; an &amp;ldquo;entering fishermen&amp;rdquo; sample (new entrants remaining at least two more seasons) has 29 individuals.&lt;/p&gt;
&lt;p&gt;Main findings: Estimating intensive (hours) and extensive (daily participation) margins via a type-2 Tobit and summing them, the preferred total IES for retiring fishermen is 2.65 (hours elasticity 0.249, participation elasticity 2.401). Across retiring-fishermen specifications, the total IES ranges roughly 2.3 to 3.1, and the headline estimate stated in the abstract and discussion is 2.7. The naive full-sample estimate is 1.27 (about 1.3), implying that accounting for LBD bias more than doubles the IES (relative bias factor about 2.1). For entering fishermen, the IES is approximately zero (-0.068). Earnings per hour are about 40% higher during a new moon than a full moon. Returns to experience are positive, significant, and plateau around 15 years.&lt;/p&gt;
&lt;p&gt;Implications: Results support using relatively large labor-supply elasticities in representative-agent macro models and provide model-free evidence that LBD matters. Because LBD breaks the equivalence of IES, Frisch, Hicks, and Marshall elasticities, a Frisch estimate no longer bounds welfare effects of tax changes, and permanent tax changes can have larger short-run labor-supply effects than transitory ones, undermining transitory tax cuts as stimulus.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core theoretical mechanism generating the bias?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a life-cycle model with learning-by-doing, the wage equals the marginal return to human capital times the human-capital stock (w = w-tilde times k), and human capital grows with hours worked. The intra-temporal first-order condition shows total remuneration for an hour of work is w + F, where F is the discounted marginal increase in all future earnings from one additional hour of experience. The log-linear labor-supply equation thus contains an extra term, omega times ln(1 + F/w). Since F is non-negative and negatively correlated with the wage, omitting it (the standard model, where gh=0 so F=0) produces omitted-variable bias that pushes the estimated IES downward. The Frisch elasticity equals omega times w/(w+F), which is weakly less than omega.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on (1) selecting fishermen for whom future returns to experience are negligible (F approximately 0), so the standard regression is unbiased, and (2) using the lunar cycle as an instrument for the wage, since catch rates and hence hourly earnings vary predictably with the moon phase but the moon plausibly does not affect tastes for or opportunity costs of work (fishermen fish in daylight, are not affected by tides, and other relevant fisheries are closed during the studied window). A type-2 Tobit (Amemiya 1984) corrects for selection because earnings and hours are observed only when fishermen participate; exclusion restrictions for the selection equation include weekend indicators, their interactions with age and age-squared, and a hurricane-preparation indicator. The main threat: that something other than returns to experience makes the samples respond differently to wage variation. Because the omitted variable is mechanical, IV cannot fix the bias in the biased samples, but it is not needed in the retiring sample where F is approximately 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do they validate the key exclusion restrictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For weekend indicators, prices and landings must not vary with the day of week; they regress daily lobster prices on Saturday/Sunday indicators with season and dealer fixed effects and find the coefficients extremely small and insignificant. Landings are argued independent of day-of-week because trap catch does not depend on aggregate participation. For the hurricane-preparation indicator, they regress daily prices on hurricane indicators with season and dealer fixed effects and find the hurricane-preparation coefficient very small and insignificant. Lobsters being storable/transportable and Florida supplying only 4-7% of the global annual spiny lobster catch supports price exogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the evidence that returns to experience matter in this industry?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They estimate two restrictive wage specifications: one with years of experience, its square, and an indicator for having one or more years of experience; another with eighteen indicators for each experience level. Both (Figure 1) show returns to experience are positive and statistically significant, with cumulative returns plateauing around 15 years (consistent with the model&amp;rsquo;s assumption that gh approaches 0 at high human capital and with the 15-year experience criterion for retiring fishermen) and a sizable drop in marginal returns between zero and some experience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the headline elasticity magnitudes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Preferred retiring sample (15+ seasons): hours elasticity 0.249 (SE 0.062), participation elasticity 2.401 (SE 0.548), total IES 2.650. The 10+ seasons retiring sample gives total IES 2.309 (smaller because returns to experience may not yet be negligible below 15 years). Across specifications retiring estimates span about 2.3 to 3.1, with 2.7 as the headline. Full (naive) sample: hours 0.046, participation 1.226, total 1.272 (about 1.3). Entering fishermen (preferred): total -0.068, i.e., approximately zero; expanded entering sample also small and insignificant. New moon earnings about 40% above full moon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do they rule out that sample differences other than experience drive the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They re-estimate using a placebo sample of fishermen who meet the retiring-sample criteria (at least 60 years old, at least 15 years experience) but are at least two years from retirement, so they share age and career history but still have non-negligible returns to experience. Estimates for these older, experienced, non-retiring fishermen (Table 3) are very similar to the full sample and notably smaller than for retiring fishermen, indicating the elasticity difference is driven by returns to experience, not age or career history. They also note (footnote 27) that a flat cumulative return after 15 years is consistent with significant human-capital depreciation, so marginal returns can remain non-negligible until the final pre-retirement season.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks address the wage-prediction (instrument) being estimated separately per sample?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because estimating equation (11) separately per sample lets the moon-phase coefficient vary across samples, they run two pooled alternatives. Alternative #1 predicts earnings from the full sample of fishermen; the preferred retiring IES falls slightly (to about 2.06) because the moon coefficient is larger in absolute value, but entering-fishermen estimates stay small and insignificant. Alternative #2 pools entering and retiring fishermen in estimating (11), interacting all variables with an entering-fisherman indicator to limit selection-bias contamination; this raises retiring IES somewhat. Both confirm the cross-sample differences come from different responses to wage variation, not from different wage predictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from prior structural and reduced-form work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Beginning with Imai and Keane (2004), a literature jointly estimates labor supply and human-capital accumulation in fully structural models (Imai and Keane 2004 IES 3.8; Wallenius 2011 IES 1.1; Keane and Wasi 2016 IES 2). Structural models control for wage endogeneity and allow counterfactuals but require fully specifying the wage and choice environment, are complex, and it can be unclear which moments identify the IES. This paper&amp;rsquo;s complementary, largely model-free approach exploits negligible end-of-career returns to experience, remaining agnostic about human-capital accumulation. Their estimates lie within (at the high end of) the structural range. Their relative bias (2.1) nearly matches Wallenius (2011) and is below Imai and Keane&amp;rsquo;s 8-12 (whose sample of 20-36 year-old males has high returns to experience; bias falls to 3.2 for a 20-64 simulated sample with outliers removed). The closest prior approach is Rogerson and Wallenius (2013), who infer an IES lower bound from rationalizing retirement; both approaches are robust to LBD but use very different identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What alternative explanations do they consider and reject?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two. (1) Borrowing/credit constraints (Domeij and Floden 2006) also bias the IES downward and could differ across samples if retiring fishermen are less constrained; but the authors study daily decisions, and fishermen own a collateralizable vessel and almost certainly have credit or liquid assets for day-to-day purchases, so daily credit constraints are implausible. (2) Reference dependence with daily income targets and loss aversion (Camerer et al. 1997; tested by Farber 2015 on NYC taxi drivers, who also finds elasticities rising with experience): reference-dependent behavior should appear only when realized wages deviate from expected wages, but here identification comes from the perfectly predictable lunar cycle, so it cannot drive the results. The much larger participation elasticity for retiring fishermen (a decision based on anticipated wages) further argues against it; moreover Farber (2015) and Haggag, McManus and Paci (2017) find LBD in NYC taxis, so the experience-elasticity correlation there may itself reflect LBD.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results support relatively large labor-supply elasticities in calibrated representative-agent macro models (their IES falls within aggregate hours elasticities of 1.9 to 4 reported by Chetty et al. 2011). But extrapolation to macro requires care: the IES-to-labor-supply-elasticity link is broken under LBD, and aggregate elasticities depend on long-run labor-force participation and aggregation across life-cycle stages, not the daily participation margin estimated here; a fully structural model is still needed for life-cycle and aggregate predictions. On taxes, because LBD breaks the standard ordering (IES = Frisch, Frisch &amp;gt; Hicks &amp;gt; Marshall), a Frisch estimate no longer bounds welfare effects of tax changes. Permanent tax changes can have larger short-run labor-supply effects than transitory ones (which only affect the current wage), undermining transitory tax cuts as ideal short-term stimulus; permanent changes also have amplified long-run effects because reduced current labor lowers future wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What modeling choices and caveats accompany the estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They model a daily period, so omega is the IES over hours within a working day; the total elasticity comparable to annual data is the sum of the hours elasticity (delta from the intensive-margin equation) and the daily participation elasticity (from the probit). For retiring fishermen, individual fixed effects equal individual-by-season fixed effects (each appears one season), flexibly controlling for the human-capital stock. They do not correct standard errors for the generated regressor (predicted log wage) but, citing Miles (1997) and Benito (2006), judge it unlikely to render estimates insignificant; standard errors are clustered by calendar date. A potential dynamic concern (lobsters accumulating in traps) is dismissed because catch per trap stops rising after a few days of soak time (and average soak times of 7-15 days exceed that), so daily catch depends on environmental conditions, not past fishing. The exit-date inference rule drops less than 3% of observations with virtually identical results.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Self-Fulfilling Fluctuations in HANK Economies</title><link>https://macropaperwarehouse.com/papers/self-fulfilling-fluctuations-in-hank-economies/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/self-fulfilling-fluctuations-in-hank-economies/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: A central tenet of monetary policy is that aggressively raising nominal rates more than one-for-one with inflation (the Taylor principle) nips self-fulfilling inflationary beliefs in the bud. That logic is built on Representative-Agent New Keynesian (RANK) models that abstract from inequality and incomplete markets. Acharya and Benhabib ask whether this central tenet survives in Heterogeneous-Agent New Keynesian (HANK) economies where idiosyncratic income risk is countercyclical, and they answer in the negative: no matter how aggressively monetary policy responds to inflation, such economies remain susceptible to self-fulfilling fluctuations (&amp;ldquo;endogenous demand shocks&amp;rdquo;).&lt;/p&gt;
&lt;p&gt;Model setup: The paper builds an analytically tractable continuous-time HANK model. Tractability comes from quasi-linear preferences (linear in labor), which makes the economy block-recursive — aggregate output and inflation dynamics can be characterized independently of the wealth distribution. Households face a 2-state Poisson idiosyncratic productivity process (high ξh / low ξl, treating ξl loosely as &amp;ldquo;unemployment&amp;rdquo;), with the transition rate into the low state given by λl,t = λl·y^(−Θ); Θ &amp;gt; 0 makes risk countercyclical (Θ = 0 is acyclical). Firms are monopolistically competitive with a forward-looking (Rotemberg-type) Phillips curve. The baseline monetary rule is a simple inflation-targeting Taylor rule it = r + φπ·πt with φπ &amp;gt; 1, and crucially the model imposes NO effective lower bound, to distinguish the mechanism from liquidity-trap multiplicity (Benhabib-Schmitt-Grohé-Uribe 2001).&lt;/p&gt;
&lt;p&gt;Key mechanism: With countercyclical risk, the &amp;ldquo;natural rate&amp;rdquo; r*(y) = ρ − σ·y^(−Θ) (defined Keynes-style as the real rate consistent with constant output, not the flexible-price rate) is endogenous and co-moves with output: dr*/dy = σΘy^(−(1+Θ)) &amp;gt; 0. A belief that output will fall raises perceived future risk, raises desired precautionary saving, and lowers the natural rate; if policy does not cut rates enough, real rate exceeds natural rate, spending falls, and the pessimistic belief is self-fulfilling.&lt;/p&gt;
&lt;p&gt;Main results (with magnitudes/scope): (1) Local determinacy requires a cyclical-risk-augmented Taylor principle φπ &amp;gt; φ(Θ) = 1 + ρσγΘ/κ, valid only if risk is not too countercyclical, Θ &amp;lt; Θ* ≡ ρ/(σγ); if Θ &amp;gt; Θ* the targeted equilibrium is locally indeterminate for any finite φπ. (2) GLOBAL indeterminacy holds for ANY Θ &amp;gt; 0 and any finite φπ (Proposition 3): an untargeted steady state always coexists with the target, and depending on cyclicality, fluctuations take the form of a saddle connection (mildly countercyclical, Θ &amp;lt; Θ⋄), a stable limit cycle around the target (moderately countercyclical, Θ⋄ &amp;lt; Θ &amp;lt; Θ*), or local indeterminacy (highly countercyclical, Θ &amp;gt; Θ*). (3) Calibration (real rate 4%, γ⁻¹ = 2, λl = 0.013, ch/cl = 1.1 implying ξh/ξl = 1.23, φπ = 1.5) yields Θ⋄ ≈ 15.8 and Θ* = 31.08; empirical estimates from Bilbiie-Primiceri-Tambalotti (2023) put Θ in [21.98, 29.9] with mode 28.1 — comfortably in the moderately countercyclical region. At Θ = 28.1 the untargeted steady state has output about 6.5% below target, and the stable cycle has output-gap amplitude of roughly ±2.5% — magnitudes comparable to U.S./Euro-area post-Great-Recession gaps and U.S. business cycle fluctuations. (4) Policy fixes: a monetary rule that responds to the endogenous natural rate, it = r + φπ·πt + φr·(r*(xt) − r) with φπ &amp;gt; 1 and φr ≥ 1 (a &amp;ldquo;Taylor principle for natural rates&amp;rdquo;), delivers global determinacy (Proposition 4). Alternatively, a passive-monetary/active-fiscal regime (φπ &amp;lt; 1, φb ∈ [0,1)) eliminates all manifestations of indeterminacy via the Fiscal Theory of the Price Level (Proposition 5). Rules responding only to output, inertial rules, or escape clauses that merely remove the untargeted steady state (e.g., switching to strict inflation targeting if output falls below x̃ = −0.1) fail because the stable cycle survives.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the central claim and how does it overturn the RANK benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In RANK (or HANK with acyclical risk), the Taylor principle φπ &amp;gt; 1 delivers both local AND global determinacy because the IS curve has no higher-order terms. In HANK with countercyclical risk, the natural rate r*(y) = ρ − σy^(−Θ) co-moves with output. This adds a stabilizing first-order term (−σγΘx) to the IS curve requiring a stronger response for local determinacy (φπ &amp;gt; φ(Θ)), and adds stabilizing higher-order terms that no finite φπ can overwhelm — producing global indeterminacy for any Θ &amp;gt; 0. So aggressive inflation-fighting alone cannot anchor the economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the &amp;rsquo;natural rate&amp;rsquo; defined here, and how does it differ from standard usage?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors follow Keynes (1936): r*(y) is the real interest rate consistent with output remaining constant at level y. This differs from the standard New Keynesian definition (the flexible-price real rate r = ρ − σ). The two coincide in RANK, in HANK with acyclical risk, and at the steady state y = 1 (r = r*(1)), but DIVERGE when risk is countercyclical: there are many natural rates r*(y) — one per output level — while there is a single flexible-price rate r = ρ − σ. The flexible-price rate never depends on endogenous output; r*(y) does.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What distinguishes this source of multiplicity from prior determinacy literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three distinctions. (1) Versus Benhabib-Schmitt-Grohé-Uribe (2001b) liquidity-trap multiplicity: the paper purposely imposes NO effective lower bound, so the ELB is not the driver — countercyclical risk is. (2) Versus the local-determinacy HANK literature (Acharya-Dogra 2020, Bilbiie 2024, Auclert et al. 2023, Ravn-Sterk 2021): those papers show a stronger &amp;lsquo;cyclical-risk-augmented Taylor principle&amp;rsquo; restores LOCAL determinacy; this paper shows that same condition cannot rule out GLOBAL indeterminacy. (3) Versus Benhabib-Eusepi (2005) / older RANK global-indeterminacy work that relied on money-in-utility, money-in-production, or capital: this model is cashless and capital is not a factor of production, so the mechanism is genuinely the countercyclical risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to Ravn and Sterk (2021), the only other HANK global-indeterminacy paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Ravn-Sterk (2021) study a HANK economy with search frictions and find an additional &amp;lsquo;unemployment trap&amp;rsquo; steady state (100% unemployment) alongside the target. This paper&amp;rsquo;s characterization (two steady states) is complementary, but goes further by providing a COMPLETE analytical characterization of the dynamics through which countercyclical risk generates indeterminacy, and by analyzing which policy designs eliminate it. A key novel point: indeterminacy manifests not only as a second steady state but also as a stable cycle around the target, so policies that only kill the untargeted steady state can fail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why isn&amp;rsquo;t eliminating the untargeted steady state sufficient for global determinacy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because under moderately countercyclical risk a stable limit cycle surrounds the targeted steady state independently of the untargeted steady state. The paper shows an escape-clause rule that switches to strict inflation targeting (π = 0) when output falls below x̃ = −0.1 (i.e., more than 5% below target) does eliminate the untargeted steady state, yet trajectories near the target still diverge locally and then converge to the surviving stable cycle, remaining bounded. Hence only policies that neutralize ALL non-fundamental equilibria — not just the untargeted steady state — guarantee global determinacy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the proposed monetary-policy fix and its scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A rule it = r + φπ·πt + φr·(r*(xt) − r) with φπ &amp;gt; 1 and φr ≥ 1 (Proposition 4) delivers global determinacy for any Θ &amp;gt; 0. The intuition is a &amp;lsquo;Taylor principle for natural rates&amp;rsquo;: by committing off-equilibrium to move the nominal rate at least one-for-one with endogenous natural-rate fluctuations, policy undoes the precautionary-saving impulse so pessimistic/optimistic beliefs cannot be confirmed. Setting φr = 1 makes the nominal rate perfectly track r*(xt), analogous to the optimal RANK response to exogenous demand shocks. It is also related to Holden&amp;rsquo;s (2024) robust real-interest-rate rule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the fiscal-policy alternative and the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A passive-monetary/active-fiscal regime (φπ &amp;lt; 1, φb ∈ [0,1), Proposition 5) eliminates the untargeted steady state and the stable cycle for any Θ &amp;gt; 0, yielding a unique globally determinate equilibrium converging to x = π = 0, b = b*. Mechanism is the Fiscal Theory of the Price Level: with active fiscal policy, taxes do not rise enough to stabilize debt, so the price level must adjust to keep the real value of debt equal to the present value of future primary surpluses. A permanent-recession (deflationary) belief would raise real debt and eventually violate the government budget constraint, so such beliefs cannot be self-fulfilling. Importantly, the paper assumes b* &amp;gt; 0 (positive steady-state primary surplus), distinguishing it from Kaplan et al. (2023), where multiplicity arises under persistent deficits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Do other standard monetary rules rescue determinacy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. Appendices E.1 and E.2 show that adding an output-gap response (it = φπ·πt + φx·xt) or making the rule inertial/backward-looking can make LOCAL determinacy easier but cannot eliminate global indeterminacy: for any finite (φπ, φx) however large, or any degree of backward-lookingness (any α), the equilibrium remains globally indeterminate as long as risk is countercyclical. The reason is that none of these rules respond to the endogenous natural-rate fluctuations directly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How robust are the results to the functional form of countercyclical risk?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Robust. Appendix E.4 generalizes λl,t = λl·Λ(γxt) for any non-negative, weakly decreasing analytic Λ. The untargeted steady state exists whenever risk is countercyclical locally (−Λ&amp;rsquo;(0) = Θ &amp;gt; 0), even if Λ is linear. The stable cycle exists if Λ is sufficiently convex locally (Λ&amp;rsquo;&amp;rsquo;(0) sufficiently positive). Crucially the conditions depend only on local behavior at x = 0, which is reassuring given the thin empirical evidence on how risk varies far from steady state. The authors argue convexity is plausible: the inflow rate into unemployment rises sharply in recessions but does not fall as sharply in expansions (Crump et al. 2019), and labor-flow asymmetries exceed GDP asymmetries (McKay-Reis 2008).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Does the multiplicity survive introducing predetermined variables?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes, with a caveat about jumps. The baseline has no predetermined variables, so the economy can instantaneously jump between steady states/onto the cycle. Appendix E.5 lets the fraction of ξl households vary (a predetermined state), Appendix E.2 uses a backward-looking rule (lagged inflation predetermined), and Section 4.2/Appendix D.1 add government debt. In all cases instantaneous jumps are ruled out, but global indeterminacy persists: transitions to the untargeted steady state or the stable cycle become GRADUAL (e.g., a slow rise in the ξl fraction alongside falling output and inflation) rather than instantaneous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the headline calibrated magnitudes and how credible are they?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Calibration: real rate 4%, relative risk aversion γ⁻¹ = 2, transition rate λl = 0.013 (from Bilbiie-Primiceri-Tambalotti 2023), consumption drop at job loss ch/cl = 1.1 implying ξh/ξl = 1.23, and φπ = 1.5. This gives regime boundaries Θ⋄ ≈ 15.8 and Θ* = 31.08. The empirically estimated Θ lies in [21.98, 29.9] (mode 28.1), squarely in the moderately countercyclical region. At Θ = 28.1, the untargeted steady state has output ~6.5% below target (comparable to post-Great-Recession U.S./Euro-area gaps) and the stable cycle has output-gap amplitude ~±2.5% (comparable to U.S. business cycle fluctuations). The 10% consumption drop is within empirical estimates (Cochrane 1991: 24–27% lower growth; Ganong-Noel 2019: ~11%; Gruber 1997: 6.8% for food).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Central banks should monitor and react to private-sector beliefs about REAL activity (consumer confidence, perceived job-loss probability) as vigilantly as they monitor inflation expectations — ignoring real-activity beliefs can leave even inflation expectations unanchored. Because multiplicity does not stem from the ELB, it can afflict the economy even during a tightening cycle, and large rate hikes against inflation do NOT by themselves guarantee anchored expectations. Caveat/scope: the prescriptions hold in this stylized cashless, quasi-linear, no-aggregate-risk model; the precise cycle magnitude/periodicity and depth of the untargeted steady state depend on the full shape of Λ away from steady state, even though their existence depends only on local behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the broader methodological lesson?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Local stability/determinacy analysis can be misleading: even when the targeted equilibrium is locally determinate, multiple bounded global equilibria can exist. Researchers using HANK models should check global, not just local, determinacy. Because linear models have no higher-order terms, local determinacy implies global determinacy there; but HANK with countercyclical risk is genuinely nonlinear, so the implication breaks.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Natural rate of interest r&lt;/em&gt;(y)&lt;/em&gt;*: Defined Keynes-style (1936) as the real interest rate consistent with output remaining constant at level y; given by r*(y) = ρ − σy^(−Θ). Distinct from the flexible-price real rate. With countercyclical risk it is endogenous and rises with output (dr*/dy &amp;gt; 0), and there is one natural rate per output level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Neutral rate of interest&lt;/strong&gt;: The single flexible-price real interest rate r = ρ − σ in the model — the natural rate consistent with full-employment output y = 1, i.e., r = r*(1). It depends only on exogenous parameters, never on endogenous output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Countercyclical risk (parameter Θ)&lt;/strong&gt;: Idiosyncratic income risk that rises when output falls, modeled via transition rate λl,t = λl·y^(−Θ). Θ &amp;gt; 0 means a ξh household is more likely to fall to the low-productivity (loosely &amp;lsquo;unemployment&amp;rsquo;) state when output is low; Θ = 0 is acyclical. Θ governs the strength of this cyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous demand shock&lt;/strong&gt;: A self-fulfilling, non-fundamental fluctuation arising because a belief about future activity shifts desired precautionary saving, moves the endogenous natural rate, and — if policy does not offset it — confirms the original belief. Functions like an exogenous demand shock but is generated internally by countercyclical risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global vs local determinacy&lt;/strong&gt;: Local determinacy: the targeted steady state is the only bounded equilibrium in a small neighborhood (governed by first-order/eigenvalue terms). Global determinacy: it is the only bounded equilibrium starting from ANY point (governed also by higher-order terms). In this nonlinear HANK model local determinacy does NOT imply global determinacy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taylor principle for natural rates&lt;/strong&gt;: The proposed fix: monetary policy must move the nominal rate at least one-for-one (φr ≥ 1) with endogenous fluctuations in the natural rate r*(x), in addition to responding to inflation (φπ &amp;gt; 1). This off-equilibrium commitment prevents beliefs about real activity from becoming self-fulfilling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-cyclicality regimes (mild / moderate / high)&lt;/strong&gt;: Mildly countercyclical (Θ ∈ (0, Θ⋄)): indeterminacy via a saddle connection to the untargeted steady state. Moderately countercyclical (Θ⋄ &amp;lt; Θ &amp;lt; Θ*): a stable limit cycle surrounds the target. Highly countercyclical (Θ &amp;gt; Θ* = ρ/(σγ)): the target is locally indeterminate for any finite φπ. Calibrated thresholds Θ⋄ ≈ 15.8, Θ* = 31.08.&lt;/p&gt;</description></item><item><title>Serial Entrepreneurship in China</title><link>https://macropaperwarehouse.com/papers/serial-entrepreneurship-in-china/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/serial-entrepreneurship-in-china/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies entrepreneurship and new firm creation in China through the lens of serial entrepreneurs (SEs) — individuals who establish more than one firm — contrasting them with non-serial entrepreneurs (Non-SEs). The central question is whether serial entrepreneurs are selected on persistent productive skill or on non-skill advantages such as preferential access to finance, because the two mechanisms have opposite implications for resource allocation: skill-driven serial entrepreneurship raises aggregate productivity, while favoritism-driven serial entrepreneurship generates misallocation.\n\nThe empirical foundation is two administrative datasets for Chinese firms: the Business Registry of China (SAIC), covering the universe of all firms since 1949 with a 2015 snapshot, used for the period 1995–2015; and the Inspection Database (SAIC), providing firm-level income-statement and balance-sheet data, used for 2008–2012 due to data quality constraints. The sample focuses on individually-owned firms (with the largest shareholder being a natural person), covering roughly 17 million entrepreneurs and 20 million firms by 2015. SE firms constitute approximately one-third of all individual-owned firms throughout the period and hold nearly half of all registered capital — making serial entrepreneurship quantitatively central to the Chinese private sector. SE firms have on average about twice the registered capital of Non-SE firms (e.g., 3.22 million yuan vs. 1.91 million yuan in 1995).\n\nTo organize empirical findings the authors develop a two-period Hopenhayn (1992)-style model with collateral-constrained borrowing (k ≤ λe, where k is capital and e is equity). The model generates two competing predictions. If TFP draws across firms started by the same entrepreneur are persistent (AR(1) with autocorrelation ρ), SEs outperform Non-SEs on TFP and the second firm outperforms the first. If instead some entrepreneurs are &amp;ldquo;favored&amp;rdquo; with a less binding collateral constraint (higher λ) and persistence is low, favored entrepreneurs enter more readily, pushing SE TFP below Non-SE TFP while installing more capital conditional on TFP.\n\nEmpirically, the average evidence favors persistent skills: 1st-SE firms are 9% more productive than Non-SE firms (within 2-digit industry, province, and year) and 2nd-SE firms are 18% more productive, both significant at the 1% level. In terms of assets, 1st-SE firms are 40% larger and 2nd-SE firms are 66% larger than Non-SE firms.\n\nThis average premium, however, conceals critical heterogeneity driven by industry-switching behavior. Two-thirds of SEs (67%) start the second firm in a different 2-digit input-output industry (switchers); one-third stay in the same industry (stayers). Stayers&amp;rsquo; 1st-SE and 2nd-SE firms are respectively 49% and 70% more productive than Non-SE firms — accounting for the entire average SE premium. Switchers&amp;rsquo; 1st-SE and 2nd-SE firms are respectively 9% and 11% less productive than Non-SE firms. Despite their TFP deficit, switchers hold at least 7% more capital in both firm generations than stayers. TFP persistence (autocorrelation of log TFP across 1st- and 2nd-SE firms) is twice as high for stayers (0.29) as for switchers (0.14), confirming the model&amp;rsquo;s key identifying assumption that within-industry persistence exceeds cross-industry persistence. The model interprets switchers&amp;rsquo; low-TFP/high-capital profile as the empirical signature of favored entrepreneurs.\n\nThe model further predicts that equity-constrained entrepreneurs should close the first firm when the second is substantially more productive (opportunity cost of capital). Consistently, 1st-SE firms that are shut when the 2nd starts have 32% lower TFP and 13% lower equity than those run concurrently; 2nd-SE firms operated non-concurrently have 8% higher TFP and 22% lower equity than those run alongside the first.\n\nBeyond learning, the paper documents two additional industry-choice motives for switchers. First, a diversification motive: a one-standard-deviation increase in the covariance of returns between the 1st- and 2nd-SE firm industries raises 2nd-SE TFP by 20%, consistent with entrepreneurs demanding a risk premium to enter correlated industries. Second, an input-output complementarity motive: serial entrepreneurs are significantly more likely to choose industries that are upstream-integrated (coefficient 0.46), downstream-integrated (0.47), or complementary (0.41) with the first industry (all significant at 1%), consistent with transaction-cost motives for co-owning trading partners.\n\nThe policy implication is that China&amp;rsquo;s private sector harbors both dynamism — embodied in highly productive stayer SEs driven by persistent skills — and distortion — embodied in low-productivity switcher SEs who enter and accumulate capital through preferential credit access. Since SE firms account for roughly one-third of all firms and nearly half of all capital, the aggregate productivity costs of favoritism-driven serial entrepreneurship are likely significant. Results apply to individually-owned private firms in China over 1995–2015 and may not extend to settings with more uniform financial markets or state-owned firm dynamics.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not use a natural experiment or instrumental variables for the main TFP comparisons. It relies on a structural model to interpret conditional correlations, with TFP measured relative to province-industry-year cell averages (2-digit industry, province, and year fixed effects). The theoretical identification comes from the fact that two distinct mechanisms — persistent skills and favoritism — generate opposite predictions on the joint TFP/capital relationship: skill dominance predicts higher TFP for SEs while favoritism predicts lower TFP combined with higher capital. The paper shows both signatures in data for distinct subgroups (stayers and switchers respectively), lending internal consistency. The concurrent/non-concurrent distinction provides an additional layer: the model predicts concurrency depends on equity and the TFP gap between firms, and the data confirm these predictions precisely (Table 7). The main threat is selection on unobservables: entrepreneurs who choose to start second firms may differ from non-SEs along dimensions not captured by the model, such as risk preferences, managerial talent, or social connections, and these could confound the TFP comparisons even within industry-province-year cells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms are posited. (1) Persistent skills (ρ &amp;gt; 0 in an AR(1) for TFP across an entrepreneur&amp;rsquo;s firms): positive selection makes SEs more productive and the 2nd-SE more productive than the 1st-SE. (2) Favoritism/credit access heterogeneity (heterogeneous collateral multiplier λ): favored entrepreneurs enter at lower TFP thresholds, so they are over-represented among SEs but have lower TFP and more capital conditional on TFP. The mechanisms are empirically distinguished by using industry switching as a proxy for favoritism. The learning model predicts low-first-period-TFP entrepreneurs switch industry (they do better by searching elsewhere), so favored individuals, who also have low TFP, should be concentrated among switchers. The data show switchers have both lower TFP than Non-SEs and more capital — a pattern only rationalized by favoritism. Stayers exhibit high TFP consistent with persistent skills. TFP persistence (autocorrelation) is twice as high within-industry (stayers, 0.29) as across-industry (switchers, 0.14), confirming the structural assumption separating the two mechanisms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across SE types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, stayer vs. switcher heterogeneity is the dominant finding: stayers&amp;rsquo; 1st-SE TFP is 49% above Non-SE and 2nd-SE TFP is 70% above Non-SE; switchers&amp;rsquo; 1st-SE TFP is 9% below Non-SE and 2nd-SE TFP is 11% below Non-SE. Switchers have more assets, equity, and registered capital than stayers despite lower TFP (at least 7% more capital). Second, concurrent vs. non-concurrent heterogeneity: 47.5% of SE firms in the 2008–2012 sample are operated concurrently. Non-concurrent 1st-SE firms have 32% lower TFP and 13% lower equity; non-concurrent 2nd-SE firms have 8% higher TFP and 22% lower equity, consistent with equity-constrained optimal capital reallocation. Third, generational heterogeneity: 2nd-SE firms are consistently larger and more productive than 1st-SE firms across all measures (TFP +18% vs. +9%; assets +66% vs. +40%), consistent with high ρ and positive selection into the second firm. Fourth, geographic stability: 72.3% of SEs locate the 2nd firm in the same prefecture as the first, suggesting local knowledge and networks matter for firm creation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and data restrictions are applied?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper trims the top and bottom 1% of assets and TFP before computing relative TFP. It excludes the 2007–2008 period from return-to-capital calculations (financial crisis concern). It excludes post-2014 registry data because of a registry reform that inflated new registrations and depressed measured exit. It confirms the covariance-TFP diversification result holds when including SE firms not run concurrently. It excludes entrepreneurs who established more than 20 firms (542 individuals, 188,266 firms) to avoid chain-store effects. The paper does not report instrumental-variable estimates, placebo tests, or alternative TFP measures as formal robustness exercises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior work on serial entrepreneurship (Holmes and Schmitz 1990, 1995; Lafontaine and Shaw 2016 for US; Rocha et al. 2015 for Portugal; Shaw and Sørensen 2019, 2022 for Denmark; Felix et al. 2021) uniformly finds SEs are more productive or larger than Non-SEs and attributes this to ability or learning. This paper confirms the average finding but is the first to demonstrate that the premium fully disappears and reverses for industry switchers, and to link this reversal to capital market distortions and favoritism rather than skill. The use of a comprehensive universe of firms (not manufacturing-only or survey-based samples) distinguishes it empirically. The misallocation literature (Hsieh and Klenow 2009; Buera, Kaboski, Shin 2011; Midrigan and Xu 2014; Moll 2014) analyzes distortions across all firms but does not analyze serial entrepreneurship. Song, Storesletten and Zilibotti (2011) and Hsieh and Song (2015) focus on state vs. private sector differences; this paper shows distortions exist within the private sector among individual-owned firms. Contemporaneous work by Shaw and Sørensen (2022) on Denmark documents similar properties of SE firms to the Chinese average findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the model&amp;rsquo;s key structural propositions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 1: entrepreneurs enter iff TFP z ≥ z*(e), where the entry threshold is decreasing in equity e. Proposition 2: without financial frictions and with ρ &amp;gt; 0, 1st-SE and 2nd-SE firms have higher expected TFP than Non-SE, and 2nd-SE &amp;gt; 1st-SE for sufficiently large ρ. Proposition 3: with frictions, the 2nd-period entry threshold Z(z1, e) is increasing in z1 (opportunity cost of first firm&amp;rsquo;s capital) and decreasing in e. Proposition 4: with frictions and Assumption 1 (equity monotone in TFP) and sufficiently large ρ, SE firms are more productive than Non-SE. Proposition 5: with ρ = 0 and heterogeneous λ, favored entrepreneurs are over-represented among SEs, which then have lower average TFP but more capital conditional on TFP. Proposition 6: concurrent operation is increasing in equity and decreasing in |z2 − z1|. Proposition 7: entrepreneurs stay in the same industry iff 1st-firm TFP exceeds the unconditional mean; stayers have higher TFP than switchers for both SE firms. Proposition 8: with a risk diversification motive, the probability of choosing industry s&amp;rsquo; for the 2nd firm is decreasing in Cov(δs&amp;rsquo;, δs); conditional on choosing s&amp;rsquo;, 2nd-SE TFP is increasing in Cov(δs&amp;rsquo;, δs).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the diversification and input-output linkage findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For diversification, the authors construct an industry-level return-on-assets covariance matrix using 2010–2012 Inspection Data (excluding the financial crisis year). A one-standard-deviation increase in the covariance of returns between 1st and 2nd SE firm industries increases 2nd-SE TFP by 20% (significant at 1%), meaning entrepreneurs require a TFP risk premium to enter a correlated industry. In the excess-probability regression for industry choice, the covariance has a coefficient of -0.11 (significant at 1%), confirming switchers prefer industries negatively correlated with their first industry. For linkages, using 2007 Chinese Input-Output tables and Fan-Lang (2000) methodology, the authors find excess probability of industry choice is significantly higher for downstream-integrated industries (0.47), upstream-integrated industries (0.46), and complementary industries (0.41), all at the 1% level in a joint regression. These results hold controlling for 1st-SE industry fixed effects and year of establishment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper implies that China&amp;rsquo;s private sector suffers from a specific type of misallocation: entrepreneurs with preferential credit access (favored individuals, proxied by industry switchers) establish and expand firms despite lower productivity, crowding out more productive entrepreneurs. Reducing distortions in credit access — leveling the collateral constraint across entrepreneurs — would shift resources toward skill-driven serial entrepreneurs (stayers) and raise aggregate productivity. The scale of the problem is meaningful: SE firms hold roughly half of all capital in the individual-owner sector. Scope conditions: these findings apply to individually-owned private firms in China during 1995–2015, a period characterized by rapid private-sector growth, underdeveloped financial markets, and significant political-economic favoritism. The results abstract from cross-regional and cross-industry variation in financial frictions; if such variation matters (as Brandt, Kambourov and Storesletten 2023 suggest), the aggregate distortion estimates could differ. The paper does not quantify the aggregate TFP losses from misallocation in a counterfactual exercise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What data limitations and caveats apply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Inspection Data lack employment information, so the authors impute labor input from the labor first-order condition under competitive wages within province-industry-year cells — a valid proxy only if factor market prices are equalized within cells. Revenue is used as a proxy for value added, valid only if intermediate input shares are constant within industry-province-year cells. The registry snapshot is from end-2015, so ownership history must be inferred; the authors note that for over 80% of individual-owned firms the founding owner coincides with the exit-period or current owner. Post-2014 data are excluded due to registry reform contamination. The analysis excludes entrepreneurs who established more than 20 firms (542 individuals, 188,266 firms) to avoid chain-store effects. The analysis excludes SEs who start a 2nd firm through an enterprise they control (expanding the definition would add 300,400 such cases). Concurrent/non-concurrent classification uses the Inspection Data&amp;rsquo;s 2008–2012 window, which may misclassify some firms. The TFP measure is relative within province-industry-year cells, so cross-cell TFP comparisons are not made.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Serial entrepreneur (SE)&lt;/strong&gt;: In this paper, an individual investor who is or has been the largest shareholder in at least two separate firms over the observation period, not necessarily concurrently; 1st-SE refers to the entrepreneur&amp;rsquo;s first firm and 2nd-SE to all subsequent firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-serial entrepreneur (Non-SE)&lt;/strong&gt;: An individual investor who is or was the largest shareholder in exactly one firm over the entire observation window; the benchmark category for TFP and size comparisons.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stayer&lt;/strong&gt;: A serial entrepreneur whose 2nd-SE firm is in the same 2-digit input-output industry as the 1st-SE firm; interpreted in the model as evidence of high industry-specific comparative advantage and high TFP persistence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Switcher&lt;/strong&gt;: A serial entrepreneur whose 2nd-SE firm is in a different 2-digit input-output industry from the 1st-SE firm; interpreted as evidence of either low first-period TFP (learning/Jovanovic motive) or preferential credit access (favoritism motive); empirically identified by lower TFP than Non-SEs combined with more capital.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Favored entrepreneur&lt;/strong&gt;: In the model, an entrepreneur with a less binding collateral constraint (higher λ), representing individuals with preferential access to bank credit or other non-skill advantages; they enter at lower TFP thresholds, are over-represented among SEs, and display the signature pattern of lower TFP combined with more capital conditional on TFP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral constraint&lt;/strong&gt;: A borrowing limit of the form k ≤ λe, where k is installed capital, e is equity, and λ ≥ 1 is the collateral multiplier; the central financial friction in the model, generating the observed co-movement between TFP, assets, and debt-equity ratios in the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Concurrent vs. non-concurrent SE operation&lt;/strong&gt;: Whether the entrepreneur&amp;rsquo;s 1st and 2nd firms are both operating simultaneously (concurrent) or the 1st firm is closed before or when the 2nd begins (non-concurrent); the model predicts non-concurrent operation is optimal when equity is scarce and the TFP gap between firms is large, rationalizing the observed pattern that non-concurrent 2nd-SE firms have higher TFP and lower equity.&lt;/p&gt;</description></item><item><title>Sources of rising student debt in the U.S.: College costs, wage inequality, and delinquency</title><link>https://macropaperwarehouse.com/papers/sources-of-rising-student-debt-in-the-u.s.-college-costs-wage-inequality-and-delinquency/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/sources-of-rising-student-debt-in-the-u.s.-college-costs-wage-inequality-and-delinquency/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;U.S. outstanding student debt rose roughly 20-fold, from about $50 billion in 1985 to nearly $1 trillion in 2014 (about 7% of GDP), making it the second-largest form of household debt after mortgages. Kim and Kim ask how much of this growth in &lt;em&gt;undergraduate&lt;/em&gt; loans can be explained by three forces: rising college costs, rising wage inequality, and the option to become delinquent. They build a partial-equilibrium incomplete-markets overlapping-generations (OLG) model with a three-stage life cycle (college, work, retirement, ages 18-85, annual periods). Individuals are endowed with heterogeneous ability (decile distribution of demeaned log AFQT80) and correlated parental transfers, and choose college attendance, government student-loan borrowing, and whether to repay or become delinquent (90+ days past due, carrying a skill-specific utility cost). College lasts 4 years; lower-ability students face a dropout probability at year 2 (aggregate enrollment-to-non-completion is ~54%). Loans follow a fixed 10-year repayment schedule (nT=10), accrue interest at rb=6.1% (risk-free r=3%), with a cumulative borrowing limit of $23,000 (raised to $31,000 from 2008) and a cap of 70% of tuition.&lt;/p&gt;
&lt;p&gt;The model is calibrated to the 1985 steady state, mainly with NLSY79 (plus NLSY97 for transfers/costs and PSID for the experience premium and wage-shock process). Transitional dynamics 1985-2014 feed in three time-varying inputs: rising college costs (net cost rises from $5,859 in 1985 to $12,000 in 2014), rising wage inequality (persistent-shock variance rises from 0.015 to 0.03 and transitory from 0.05 to 0.08; college wage premium from 1.2 to 1.37; skilled ability premium from 0.89 to 1.33; shock persistence ρ=0.9791), and a growing preference for college (a declining psychic cost calibrated to reproduce rising attainment).&lt;/p&gt;
&lt;p&gt;Main results: the benchmark economy raises aggregate undergraduate debt from $37 billion (1985) to $351 billion (2014), a $314 billion increase that explains about 64% of the observed U.S. rise — without being calibrated to the debt increase. Rising college costs are the primary driver of higher borrowing; rising income risk and declining average student ability drive higher delinquency (the aggregate delinquency rate more than triples 1985-2014; 16% of borrowers delinquent in 2014). In a decomposition (Table 3), fixing college costs cuts the debt rise to +$33B; fixing ability premia leaves it roughly unchanged (+$317B); fixing the college wage premium lowers it by $49B (to +$265B); and fixing wage-shock variances &lt;em&gt;raises&lt;/em&gt; it to +$418B (less risk means less delinquency but more borrowing). Removing the delinquency option entirely cuts the debt rise to $178 billion, so delinquency accounts for about 43% of the transitional increase. Delinquency works through a mechanical channel (missed payments plus accrued interest) and an incentive channel (delinquency as insurance encourages borrowing, the Domar-Musgrave effect); roughly one-third of the benchmark/no-delinquency gap is mechanical and two-thirds incentive. Finally, an income-driven repayment (IDR) plan (10% of discretionary income) cuts delinquency from 5.0% to 2.2% and slows debt growth to a $169 billion rise over the transition, because IDR substitutes for delinquency as insurance.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model and the identification/quantification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is a partial-equilibrium incomplete-markets OLG model solved as two steady states (1985 and 2014) with a transition path. Identification of the aggregate-debt contribution is not econometric but quantitative: the model is calibrated to 1985 cross-sectional moments (and a few transition-path moments) WITHOUT targeting the aggregate debt increase, then exogenous time-varying inputs (college costs, wage inequality, college preference) are fed in and the resulting debt path is compared to data, explaining ~64% of the rise. The main threats are: (i) the model is partial equilibrium, taking costs/inequality/preferences as exogenous (general-equilibrium feedback, e.g. tuition responding to inequality per Cai-Heathcote 2022, is abstracted from); (ii) the residual 36% is unexplained and could reflect omitted forces such as private loans, for-profit institutions, or graduate-school spillovers; (iii) the &amp;lsquo;preference for college&amp;rsquo; is a reduced-form declining psychic cost that absorbs many unmodeled drivers (job amenities, over-optimism about graduation) rather than being separately identified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two channels through which delinquency raises debt, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanical channel: missed scheduled payments plus accrued interest are added directly to the outstanding balance. The incentive channel: the option to delay payment acts as insurance against adverse post-college income shocks, encouraging students to borrow more ex ante (the Domar-Musgrave effect). They are separated with a &amp;lsquo;mechanical effect counterfactual&amp;rsquo; that removes delinquency but holds borrowing fixed at benchmark levels: the gap between benchmark and this counterfactual is the mechanical effect, and the gap between the mechanical counterfactual and the full no-delinquency economy is the incentive effect. The incentive effect dominates — roughly two-thirds of the benchmark/no-delinquency gap — because the mechanical effect operates only through the small share of delinquent borrowers (16% in 2014), while the incentive effect shapes all college students&amp;rsquo; borrowing. The incentive channel grows over time as income risk rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Borrowing increases with ability and (weakly) with parental transfers, driven by consumption smoothing: high-ability individuals anticipate higher lifetime earnings and borrow more against future income. Notably, in the 1985 simulation, average earnings during college exceed college costs across all ability groups, so most students could self-finance but still borrow. Dropout probability declines sharply with ability (so ~54% of enrollees do not complete). Delinquency rates differ by skill: 7% for college graduates vs 25% for college dropouts in 2010 (calibration targets). The stronger college preference draws more low-ability students into college over time, lowering average student ability and raising delinquency. Under IDR, the rise in borrowing participation (34%-&amp;gt;40%) is driven primarily by low-ability students.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness/validation checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Validation (not targeted): the model reproduces the rising trend in average annual borrowing 1993-2014 (NPSAS), the cross-sectional borrowing distribution by ability tercile and parental-transfer quartile in 1997 (NLSY97), the more-than-tripling of the aggregate 90+ day delinquency rate (FRBNY), and ~8% of borrowers behind on payments 10 years after graduation (Table D1). It also replicates the untargeted population distribution across ability/transfer cells. Robustness: results are stable with 10 or more ability grid points; the implied ~12% decline in average student ability between the 1960s and 1990s cohorts is consistent with Hendricks-Schoellman (2014). An alternative delinquency definition using 270-day default plus wage garnishment (Appendix C) yields similar aggregate effects, with delinquency explaining about 33% of the debt increase (vs 43% in the 90-day benchmark). A weakness flagged by the authors: the model generates flat college costs across parental-transfer quartiles and so misses the non-monotonic (U-shaped) cost pattern in the data, because ability and transfers are positively correlated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds directly on Abbott, Gallipoli, Meghir, Violante (2019), whose framework of government grants/loans and college attainment it extends by adding an endogenous delinquency choice on student debt to capture debt amplification. It differs from Ionescu (2008, 2009), which evaluate specific loan-policy reforms (lock-in interest, flexible repayment, eligibility) for enrollment/default, by focusing on the &lt;em&gt;dynamics of the aggregate debt stock&lt;/em&gt; rather than direct policy evaluation. It connects to the credit-constraints/family-income literature (Belley-Lochner 2007, Lochner-Monge-Naranjo 2011, Carneiro-Heckman 2002, Keane-Wolpin 2001) by jointly modeling parental transfers and borrowing, and to the repayment/default-determinants literature (Looney-Yannelis 2015, Lochner-Monge-Naranjo 2015, Deming-Goldin-Katz 2012). It remains agnostic about private loans (only 6-7% of outstanding debt and structurally different, per Ionescu-Simpson 2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;IDR is identified as an effective instrument for managing student-loan burdens: capping payments at 10% of discretionary income reduces delinquency sharply (5.0%-&amp;gt;2.2% in steady state) and slows the transitional debt rise from $314B to $169B, because formal repayment flexibility substitutes for informal insurance via delinquency. Scope conditions: IDR also &lt;em&gt;increases&lt;/em&gt; loan participation (34%-&amp;gt;40%), so the slowdown in debt comes from the delinquency-reduction effect dominating the borrowing-increase effect; in steady state total debt falls only $3 billion, the larger effect being on the transition. The result holds in partial equilibrium with no model re-calibration and assumes borrowers choose labor supply anticipating 10%-of-income repayment; general-equilibrium and fiscal-cost (loan-forgiveness) implications are not modeled. Take-up was low over 1985-2014 (11% of undergraduate borrowers in 2010, 24% by 2017), so IDR is treated as a forward-looking policy extension rather than a driver of the historical debt rise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What other significant findings or caveats appear?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fixing wage-shock variances counterintuitively raises debt (+$418B vs +$314B) because lower income risk reduces delinquency but encourages more borrowing — illustrating that inequality&amp;rsquo;s net effect on debt runs partly through the insurance/incentive channel rather than just borrowing need. The annual flow of newly delinquent debt rose from about $200 million (1985) to $5.5 billion (2015) in the benchmark (Figure D9). The number of borrowers and average debt per borrower both rose (borrowers from 8% of population in 2004 to 14% in 2014; average debt per borrower from $15,106 to $21,677). The model abstracts from endogenous dropout during college (no idiosyncratic risk in college) and from graduate loans, focusing on undergraduate debt as the largest component.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Sovereign Debt Restructuring and Reduction in Debt-to-GDP Ratio</title><link>https://macropaperwarehouse.com/papers/sovereign-debt-restructuring-and-reduction-in-debt-to-gdp-ratio/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/sovereign-debt-restructuring-and-reduction-in-debt-to-gdp-ratio/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Sovereign debt restructuring is a central tool for countries in debt distress, yet surprisingly little evidence exists on whether it actually reduces the debt-to-GDP ratio — the metric used in virtually every debt sustainability analysis. This paper fills that gap. The debt-to-GDP ratio is not a simple pass-through from restructuring: the numerator (debt stock) only falls at the completion of a restructuring episode, while the denominator (GDP) can be depressed from the start of the crisis. Cash flow relief and face value reductions affect the numerator along different timelines, and fiscal consolidation — or its absence — can erode or reinforce whatever gains restructuring provides. These complexities make the net effect on the ratio genuinely non-obvious.&lt;/p&gt;
&lt;p&gt;The authors compile a novel, highly comprehensive dataset covering 709 restructuring events across 115 emerging market and developing economies from 1950 to 2021, encompassing private external creditors, Paris Club bilateral creditors, China, and domestic creditors — broader coverage than any prior study. Country-level macroeconomic data (GDP, general government debt, primary balances, inflation, exchange rates) come from the IMF World Economic Outlook October 2022 vintage. The sample excludes advanced economies, which almost never restructure (the three AE episodes — Slovenia 1992–96, Greece 2011–12, Cyprus 2013 — are dropped because the structural features of AE debt differ markedly from EMEs and LICs).&lt;/p&gt;
&lt;p&gt;Identification addresses the core problem that restructuring is endogenous to macroeconomic conditions: countries restructure precisely when growth is weak and fiscal positions are deteriorating. Following Jorda and Taylor (2016), the authors employ an Augmented Inverse Probability Weighted (AIPW) estimator. A first-stage saturated probit model estimates each country-year&amp;rsquo;s propensity score using lagged GDP growth, debt-to-GDP levels (interacted with country dummies to allow heterogeneous thresholds), primary and current account balances, US short and long interest rates, effective interest rates, and prior restructuring history. The predicted propensity scores feed a second-stage local projection of debt-to-GDP changes on the restructuring dummy and covariates across horizons 0–5 years. The AIPW is doubly robust: consistency requires only that the first stage or the second stage (not necessarily both) be correctly specified. The propensity model achieves an AUROC above 0.85.&lt;/p&gt;
&lt;p&gt;The main finding is that a typical sovereign debt restructuring event reduces the debt-to-GDP ratio by 3.8 percentage points in the first year (statistically significant), rising to a cumulative 7.2 percentage points after five years. The effect is negative and significant at every horizon from year 0 through year 5, and extends beyond five years (robustness checks to 10-year horizon show consistently negative effects, though standard errors widen with smaller samples). An important robustness check using debt level (percent change in debt stock) as the outcome shows the restructuring reduces debt by about 7 percent on impact and over 35 percent after five years — establishing that the ratio result is not mechanically driven by GDP movements alone.&lt;/p&gt;
&lt;p&gt;Heterogeneity across restructuring types and accompanying policies is substantial. When restructuring coincides with fiscal consolidation (positive average cyclically adjusted primary balance during the episode), the debt-to-GDP decline ranges from 4.7 percentage points in year 1 to 11.9 percentage points in year 5 — roughly double the average effect in the long run. Restructurings that include a face value reduction show an immediate impact of 8.9 percentage points in year 1 (versus 3.8 for the average), but the long-run effect after five years converges toward 5.0 percentage points — smaller than the fiscal consolidation pathway. Large-scale creditor coordination under the HIPC/MDRI initiatives produces ATEs of 5.4 percentage points in year 1 and 6.4 percentage points in year 5. These results collectively indicate that the long-run depth of the debt reduction is most reliably achieved when restructuring is paired with sustained fiscal effort, whereas face value reduction and creditor coordination are particularly potent in the short run.&lt;/p&gt;
&lt;p&gt;A novel finding concerns cash flow relief only (maturity extension and/or coupon rate reduction, without face value reduction): normalizing by the size of treatment (the average present-value reduction in the debt ratio, estimated at 2.8 percentage points of GDP for private external restructurings, compared to 6.0 percentage points for face value reduction events), the ATE per unit of treatment for cash flow relief converges to roughly the same magnitude as for face value reduction after four to five years. This suggests that, conditional on treatment depth, the form of restructuring does not determine long-run effectiveness — what matters is that the intervention provides sufficient fiscal space for subsequent adjustment.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses an Augmented Inverse Probability Weighted (AIPW) estimator following Jorda and Taylor (2016). The first stage is a saturated probit model predicting the propensity score for restructuring entry using: two lags of the treatment dummy, GDP growth, and change in debt-to-GDP; one lag of exchange rate change, inflation, global output gap, US short and long rates, effective interest rate, primary balance, and current account balance; and the level of debt-to-GDP interacted with country dummies (to allow heterogeneous restructuring thresholds). The second stage is a local projection of the change in debt-to-GDP regressed on the treatment dummy, its interaction with covariates, and country plus year fixed effects, across horizons 0–5. The AIPW ATE formula re-weights observed outcomes by propensity scores and adds augmentation terms from the outcome model, yielding double robustness. The main identification threat is selection-on-unobservables: countries that restructure may have systematically different unobserved growth prospects that simultaneously affect the debt ratio. The authors address one specific form of this concern — that countries and creditors time resolution to coincide with favorable growth — by including 1- and 2-year ahead IMF GDP forecasts as controls in a robustness check, finding similar results. Observations with propensity scores outside [10^-4, 1−10^-4] are excluded to avoid extreme weight instability. Significant overlap between treatment and control propensity score distributions (both approaching full support in [0,1]) is verified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the timing of restructuring start (vs. end) relevant for the debt ratio?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior papers (Reinhart and Trebesch 2016; Cheng et al. 2019) measure the impact from the end of the restructuring episode or the resolution of the debt crisis. This paper instead measures from the start of the restructuring event (the onset of debt crisis). The distinction matters because: (i) the debt stock is only formally reduced at the completion of restructuring (once a deal is struck and recorded), so the numerator of the debt ratio moves discontinuously at the end of the episode; (ii) GDP, however, can be negatively affected from the outset of the crisis, compressing the denominator before any debt relief is delivered. About one-third of restructuring episodes last two or more years, so the distinction is empirically non-trivial. Measuring from the start captures the full dynamic path — including the initial GDP drag and the later debt relief — without conditioning on crisis resolution, which could itself be endogenous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the dataset cover and how does it differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The dataset covers 709 restructuring events in 115 emerging market and developing countries from 1950 to 2021. It includes four creditor classes: private external creditors (sourced from Asonuma and Trebesch 2016), official bilateral external creditors under the Paris Club (from Paris Club database and Horn et al. 2022), official bilateral creditors outside the Paris Club including China (from Horn et al. 2022), and domestic creditors (from IMF 2021). The paper also covers restructurings that occur outside sovereign defaults, including preemptive restructurings where payments are not missed. Prior literature focused primarily on post-default restructurings with external private or Paris Club creditors. The 310 EM restructuring events break down as 85.8% cash flow relief only and 14.2% face value reduction; 58.4% are preemptive, 21.6% post-default, and 20% both or unidentified. For LICs, 396 events are recorded, with 73.5% cash flow relief only and 26.5% face value reduction. Macroeconomic controls come from the IMF WEO October 2022 vintage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the propensity model&amp;rsquo;s predictive performance, and what does it reveal about the determinants of restructuring?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The first-stage probit achieves an AUROC above 0.85 and a pseudo R-squared of 0.295 on 1,233 observations. Key findings: the lagged treatment dummy is negative and significant (countries that recently restructured are less likely to do so again soon, possibly because creditors resist multiple sequential restructurings); lagged changes in debt-to-GDP are negative in the two years preceding restructuring (reflecting that countries often pursue fiscal consolidation before resorting to restructuring as a last resort); global output gap and GDP growth have the expected signs (restructurings more likely when global conditions are favorable and domestic growth is low), though p-values are near 0.10; US interest rate coefficients have opposite signs for short vs. long rates and are statistically insignificant. The propensity score distributions show significant overlap between treatment and control groups, supporting the common support assumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the ATE per unit of treatment analysis reveal about cash flow relief vs. face value reduction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The ATE per unit of treatment is constructed by dividing the estimated ATE by the average size of treatment. For face value reduction events, the size is the average annual face-value-reduction-to-GDP ratio, approximately 6.0 percentage points. For cash flow relief only events (restricted to private external restructurings where present-value data are available from Asonuma et al. 2023), the size is estimated using a back-of-envelope calculation scaling the FVR size by the ratio of present-value debt reduction for cash flow relief (5 percent) to that for FVR (10.6 percent), yielding 2.8 percentage points. Table 4 shows: for FVR, the ATE in year 0 is -10.6 pp (per unit: -1.77), falling to -5.0 pp in year 5 (per unit: -0.83) — a frontloaded and then diminishing profile. For cash flow relief, the ATE is +3.6 pp in year 0 (per unit: +1.29), moving to -5.7 pp in year 5 (per unit: -2.04) — a monotonically increasing profile. The per-unit effects converge by around year 4, supporting the conclusion that treatment depth rather than treatment type is what determines long-run effectiveness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the interaction between restructuring and fiscal consolidation defined and what does the heterogeneity analysis show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fiscal consolidation is defined as a positive average cyclically adjusted primary balance during the duration of the restructuring episode. The AIPW model is re-estimated using only the subset of restructuring events meeting this criterion as the treatment group, while keeping all non-restructuring observations as the control group. The estimated ATE ranges from 4.7 percentage points in year 1 to 11.9 percentage points in year 5 — substantially exceeding the 3.8 and 7.2 pp average effects. The long-run amplification relative to the average is larger than the short-run amplification, underscoring that sustained fiscal effort is the dominant factor in durable debt ratio reduction. A robustness check using a weaker definition of fiscal consolidation (positive year-on-year change in the cyclically adjusted primary balance, which can still leave the primary balance negative) shows a larger initial impact but a declining cumulative effect after a few years, consistent with the interpretation that only episodes maintaining a positive (not just improving) fiscal stance sustain the gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the heterogeneity analysis show for creditor coordination (HIPC/MDRI) versus the average?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Restricting the treatment group to restructuring events under the Heavily Indebted Poor Country Initiative and the Multilateral Debt Relief Initiative, the paper finds ATEs of 5.4 percentage points in year 1 and 6.4 percentage points in year 5. Both exceed the average effects (3.8 and 7.2 pp, respectively) in year 1, though the five-year effect is slightly smaller than the average (6.4 vs. 7.2 pp). The authors contrast this with Easterly (2002), who argued that HIPC countries remained heavily indebted even after two decades of debt relief and concessional financing (1980–1997). The paper&amp;rsquo;s result suggests that more comprehensive HIPC/MDRI programs produce meaningful and durable reductions in the debt ratio, at least within the five-year window studied.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the analysis imply about GDP dynamics during restructuring?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper establishes that debt levels fall more in percentage terms than the debt ratio does. In the baseline, the average debt-to-GDP ratio falls 3.8 pp in year 1 while the debt level falls about 7 percent in year 1. A back-of-the-envelope calculation (holding the average debt ratio at roughly 1, so the ratio change approximately equals the percent change in debt minus the percent change in GDP) implies that GDP falls by roughly 3.8 percent after one year of restructuring relative to the year prior, after controlling for selection. Over five years, the debt level falls over 35 percent while the debt ratio falls 7.2 pp, implying cumulative GDP losses that moderate the ratio improvement. The authors confirm this via a robustness check using GDP forecasts as additional controls, finding similar results to the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are performed and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Six main robustness checks are reported: (1) Extending the horizon from 5 to 10 years — effects remain negative throughout, though standard errors widen due to smaller samples. (2) Using the change in debt level (percent) as the outcome instead of the change in the debt ratio — the restructuring reduces debt by about 7 percent on impact and over 35 percent after 5 years, confirming the ratio result is not purely a GDP-denominator artifact. (3) Including 1- and 2-year ahead IMF GDP forecasts as additional controls — results are similar to baseline. (4) Removing interaction terms between the treatment dummy and covariates from equation (1) — results are similar to baseline. (5) Comparing AIPW ATE to a plain OLS local projection (setting the ATE equal to the coefficient on the treatment dummy, without AIPW weighting) — the AIPW attenuates the estimated impact compared to OLS, as expected given upward selection bias: countries in worse shape are more likely to restructure, so naive estimates understate the baseline counterfactual. (6) Alternative probit subsetting for FVR events: removing top/bottom 10% of FVR-to-GDP from the treatment group (to address outliers) produces robust results; alternatively, using the predicted probability of FVR occurrence (based on pre-restructuring information only) to define treatment group membership yields similar findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work on debt restructuring and debt ratios?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The closest prior papers are Reinhart and Trebesch (2016) and Cheng et al. (2019). Reinhart and Trebesch compare simple pre/post means across 18 AEs (1920–1939) and 35 EMs (1978–2010) — limited by small samples, no causal identification, focus on private external creditors, and measurement from the end of the restructuring episode. Cheng et al. study 93 EMs and LICs (1956–2015) using local projections but cover only Paris Club official creditors and focus on the end of the crisis. The present paper adds: coverage of 115 countries over 1950–2021; a broader set of creditors (private, Paris Club, China, domestic); timing from the start rather than the end of the episode; causal identification via AIPW; and heterogeneity analysis across fiscal consolidation, face value reduction, creditor coordination, and treatment size. The finding that cash flow relief per unit of treatment converges to face value reduction in the long run is novel; prior literature mostly emphasized nominal haircuts. The positive result for HIPC/MDRI also directly contradicts Easterly (2002).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key policy implication is that debt restructuring is an effective tool for reducing debt ratios in EMEs and LICs — this is not automatic or mechanical, as GDP effects partially offset the debt stock relief, yet the net effect on the ratio is statistically significant and long-lasting. Scope conditions: (i) The results apply to emerging market economies and low-income countries; advanced economies rarely restructure and the three AE episodes in the sample are excluded as structurally different. (ii) The effectiveness is substantially amplified when restructuring is accompanied by sustained fiscal consolidation (positive average cyclically adjusted primary balance), implying that restructuring alone, without accompanying fiscal effort, provides a smaller and less durable reduction. (iii) Face value reduction is more potent in the short run but converges to cash flow relief in the long run (per unit of treatment), suggesting that deep rescheduling without nominal haircuts can be comparably effective as long as it provides sufficient fiscal space. (iv) The HIPC/MDRI creditor coordination framework is associated with larger-than-average impacts. (v) Preemptive restructurings (without outright default) are included and common, suggesting the results are not limited to post-default episodes. The paper informs current IMF and policymaker discussions on how to manage the post-COVID sovereign debt overhang.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What stylized facts characterize the types of restructuring in the dataset?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Based on Table 2: among EMs, 85.8% of restructurings involve cash flow relief only (no face value reduction) and 14.2% involve face value reduction; 58.4% are preemptive, 21.6% post-default. The most common creditor type in EMs is private external (54.8%), followed by Paris Club (48.1%). Among LICs, 73.5% involve cash flow relief only and 26.5% face value reduction; 54.3% are preemptive and 31.1% post-default; Paris Club is dominant (73.5%). Domestic debt restructurings are rare across both groups; when they occur, they tend to involve smaller face value reductions than external restructurings. The paper also notes that 60% of restructuring events are preceded by an increase in the primary-balance-to-GDP ratio, indicating fiscal effort before crisis resolution is common.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Augmented Inverse Probability Weighted (AIPW) Estimator&lt;/strong&gt;: A two-stage causal estimator that first models the propensity score (probability of treatment) and then uses it to re-weight observed outcomes in a local projection, with an augmentation term from the predicted outcome model. It is doubly robust: the average treatment effect is consistently estimated if either the propensity model or the outcome model is correctly specified, but not necessarily both.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Face Value Reduction (FVR)&lt;/strong&gt;: A cut in the nominal (principal) amount of the outstanding debt instruments, also called a nominal haircut. In the paper, the average FVR-to-GDP ratio during restructuring events with FVR is approximately 6 percent per year. FVR events constitute 14.2% of EM restructurings and 26.5% of LIC restructurings in the dataset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash Flow Relief&lt;/strong&gt;: Debt rescheduling without reduction in face value — encompassing maturity extension and/or coupon rate reduction — that alters the stream of future payments without changing the nominal amount owed. This is the predominant form of restructuring (85.8% of EM events). The present-value size of treatment for cash flow relief is estimated at 2.8 pp of GDP for private external restructurings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average Treatment Effect (ATE) per Unit of Treatment&lt;/strong&gt;: The estimated ATE divided by the average size of the treatment (e.g., face-value-reduction-to-GDP for FVR events, or estimated present-value reduction for cash flow relief events). Used to compare the effectiveness of different restructuring modalities on a common scale, revealing that FVR has a larger per-unit impact in the short run but converges to cash flow relief by year 4–5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Preemptive Restructuring&lt;/strong&gt;: A restructuring implemented before any missed payments occur (no legal default), or with only briefly missed payments over a short window after negotiations begin, without a unilateral default. Distinguished from post-default restructurings, which involve unilateral cessation of payments prior to any creditor agreement. Preemptive restructurings account for 58.4% of EM events and 54.3% of LIC events in the dataset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Doubly Robust Estimator&lt;/strong&gt;: In the paper&amp;rsquo;s context, an estimator (the AIPW) whose consistency holds as long as at least one of its two component models — the propensity score model (first stage) or the outcome model (second stage) — is correctly specified. This provides a safeguard against misspecification in one stage, unlike single-model approaches such as simple IPW or plain OLS local projections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HIPC/MDRI Creditor Coordination&lt;/strong&gt;: The Heavily Indebted Poor Country Initiative and the Multilateral Debt Relief Initiative, which provide structured large-scale debt relief programs with coordinated participation by multiple official creditors. In the paper, restructuring events under HIPC/MDRI constitute a treatment subgroup showing ATEs of 5.4 pp (year 1) and 6.4 pp (year 5), exceeding the average year-1 effect but roughly in line with the average year-5 effect.&lt;/p&gt;</description></item><item><title>Taxing Top Wealth: Migration Responses and their Aggregate Economic Implications</title><link>https://macropaperwarehouse.com/papers/taxing-top-wealth-migration-responses-and-their-aggregate-economic-implications/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/taxing-top-wealth-migration-responses-and-their-aggregate-economic-implications/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Proposals to tax top wealth (e.g., Saez and Zucman, 2019) face a recurring objection in public debate: that the wealthy will emigrate en masse and, because many are entrepreneurs, their departure will inflict large negative spillovers (&amp;ldquo;trickle-down&amp;rdquo;) on the broader economy, making wealth taxes self-defeating. Credible evidence on international migration responses to wealth taxes has been scarce due to data limitations and a lack of clean identifying variation. This paper provides such evidence and quantifies the aggregate economic implications.&lt;/p&gt;
&lt;p&gt;Data and setting: The authors use exhaustive administrative data from Sweden (wealth tax register Förmögenhetsregistret 1993-2007, LISA, matched employer-employee RAMS, K10 closely-held-business filings, and the Serrano ownership-network data that maps indirect ownership) and Denmark (used for out-of-sample validation). A key strength is observing all wealth components without top-coding and linking individuals to firms they control directly and indirectly. They exploit three large reforms: the unexpected 2007 repeal of the Swedish wealth tax (statutory top marginal rate fell from 1.5% to 0%; effective average rate on the top 2% was ~0.5%), and Danish reforms of 1989 (rate cut from 2.2% to 1%) and 1996/1997 (abolition). Business assets were exempt in Sweden but fully taxed in Denmark.&lt;/p&gt;
&lt;p&gt;Empirical strategy: A two-step procedure. Step 1 estimates migration elasticities using difference-in-differences around the reforms (treated = top 2% of net wealth; baseline control = top 20% to top 10%), with treatment assigned on predicted wealth to avoid endogeneity post-2007. Step 2 estimates the effect of migration on individual-, firm-, and market-level outcomes via event studies (never-movers with placebo dates as controls), independent of the tax reforms. The two are combined, weighted by the wealthy&amp;rsquo;s share of aggregate activity (decomposition in equation 1).&lt;/p&gt;
&lt;p&gt;Main quantitative findings: A 1pp increase in the top wealth tax rate raises the out-migration rate by 0.17pp and reduces in-migration by 0.05pp; the 2007 repeal cut wealthy out-migration propensity by ~30% (about one-third of top-2% expatriations were tax-induced). Danish elasticities are statistically indistinguishable. Net flow semi-elasticity is -0.22pp per 1pp. Flow effects cumulate to a modest stock elasticity: the elasticity of the wealthy population w.r.t. the net-of-tax rate is 1.77 (s.e. 0.47) — a 1% rise in the net-of-tax rate raises the stock by under 2%. The implied income-net-of-tax migration elasticity is ~0.05, comparable to top-income cross-border elasticities. Firms controlled by the top 2% account for ~9% of Swedish employment, 15% of value added, 12% of investment, 19% of tax payments (and ~10% employment / 15% value added per the intro). When a top-2% owner out-migrates, directly-controlled firms see employment fall ~33%, gross investment ~22%, value added ~34%, and tax payments ~51%, driven almost entirely by the extensive margin of firm disappearance (effects near zero conditional on survival). But 45% of &amp;ldquo;closed&amp;rdquo; firms are absorbed via mergers/acquisitions; displaced workers lose only 4.3% in earnings and face a 0.6pp higher unemployment probability; market-level spillovers are small and insignificant even for granular firms.&lt;/p&gt;
&lt;p&gt;Aggregate and policy implications: Combining steps, a 1pp rise in the top wealth tax rate reduces aggregate employment by 0.022%, investment by 0.065%, and value added by 0.103% in the long run — modest despite the wealthy&amp;rsquo;s large economic footprint, because migration flows are small. Fiscally, each $1 raised loses only $0.22 to migration responses vs. $0.54 to intensive-margin responses (savings/avoidance/evasion, using Jakobsen et al. 2020), so $0.76 total. Migration responses are far from the Laffer bound but, because the MCPF is highly nonlinear, they nearly double it from ~2.2 to ~4.2. Migration threats, while salient in debate, matter less for welfare and policy than intensive-margin responses.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for the migration elasticity and what are the main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A difference-in-differences design around the 2007 Swedish wealth tax repeal, comparing out-migration of the treated top-2% group to a control group in the top 20% to top 10%. The non-contiguous control avoids contamination bias (households near the threshold anticipating future liability; less than 1% of controls reach the top 2% by 2006). The main threat is the parallel-trends assumption given a control group lower in the distribution; the authors show no differential pre-trends in out-migration and that effective capital-income and labor-income tax rates evolved similarly across groups (only wealth-inclusive tax rates diverged). The 2007 inheritance tax abolition is ruled out as a confounder because inheritance tax had little bite and strict residency rules made it hard to avoid by migrating (10-year non-residence required at death). Treatment is assigned on predicted wealth (from pre-reform variables) to avoid endogenous post-2007 wealth measurement. 2SLS specification (4) instruments the log net-of-tax rate with the treatment-by-post interaction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the aggregate effect identified separately from the migration channel, and why not use the reform directly?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;National wealth tax reforms cannot identify general-equilibrium/aggregate effects because treatment and control groups share the same aggregate economy, the exclusion restriction fails (wealth taxes also affect savings, capital accumulation, avoidance/evasion), and they are underpowered (small stock changes are hard to detect). The two-step procedure circumvents this: event studies of migration events (specification 7, with randomly-assigned placebo dates for never-movers, no matching) give the effect of migration on outcomes independent of the tax reform, and these are combined with the reform-based migration elasticity, weighted by the wealthy&amp;rsquo;s share of each aggregate outcome (equation 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the LATE / marginal-mover correction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The two-step procedure requires the population whose migration impact is measured (event studies) to match the population whose migration responds to the tax (compliers). Using methods from the insurance-selection literature (Hendren et al., 2021) and the fact that 30% of pre-reform wealthy migrants were tax compliers, they recover the characteristics and treatment effects of marginal movers. Tax-induced movers (compliers) are slightly younger, slightly more likely entrepreneurs, slightly wealthier, around the 65th-70th skill percentile, but their firms are not selected. Event-study estimates pre vs post reform are similar (not statistically different), so treatment-effect heterogeneity is limited; column (5) double-difference LATE estimates for compliers are the preferred inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the firm-level evidence and how is reallocation distinguished from genuine destruction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Owner out-migration causes a ~30pp drop in firm survival (firm-identifier disappearance) and large declines in employment (~33%), value added (~34%), investment (~22%), turnover, and tax payments (~51%), almost entirely extensive-margin. The authors distinguish destruction from reallocation using Bolagsverket merger/closure-reason data: 45% of closures are linked to mergers (the firm is absorbed), 55% are liquidations/bankruptcies. Accounting for buy-outs cuts the firm-existence and employment effects by ~40%. Worker-level event studies show displaced employees lose only 4.3% in earnings and 0.6pp higher unemployment, indicating workers reallocate. Including indirectly-held firms, five-year effects are employment -19%, value added -33%, turnover -28%, investment -19%, tax payments -45%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Migration semi-elasticities do not vary much by age or education; entrepreneurs&amp;rsquo; out-migration semi-elasticity is larger but less precisely estimated (their effective tax rate dropped less because business assets were exempt; their out-migration fell ~0.14pp, roughly 50%, within a year). Firm-level migration effects show limited heterogeneity by owner age or children; effects are smaller for larger firms and especially for the top-10 largest moves (multi-billion-SEK businesses), where effects are considerably below average. In-migration effects mirror out-migration with opposite sign but are smaller for value added, turnover, investment, and tax payments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Estimates are robust to alternative control groups closer to the treatment group; to assumptions on the regeneration/replacement rate of the wealthy population and to dynastic effects (detectable but small); and to tax evasion — using Alstadsæter et al. (2019) and Boas et al. (2024) bounds, the stock elasticity ranges 1.85 (lower) to 1.92 (upper) vs. 1.77 baseline. Firm outcomes are robust to winsorization choices (Appendix Table IV.3); with no winsorization, value added/investment/tax effects turn positive-insignificant due to one outlier firm. Market-level spillovers are insignificant across alternative market definitions. Alternative aggregate calibrations (including accounting for buy-outs) imply smaller effects, so the baseline is a conservative upper bound.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the wealth-tax behavioral-response literature (Seim 2017; Jakobsen et al. 2020; Brülhart et al. 2022) which is largely silent on international migration, and on the tax-migration literature (Kleven et al. 2013/2014/2020; Akcigit et al. 2016) which focuses on income taxes and within-country mobility. It is the first systematic evidence on international migration responses to wealth taxes and their trickle-down. Versus the CEO/owner death-and-retirement literature (Smith et al. 2019: -26pp firm survival, -82% profits per worker, -45% even conditional on survival; Jäger and Heining 2022), migration effects are much smaller and nearly zero conditional on survival, because owners often retain control or restructure rather than shut down. Findings echo Bach et al. (2023) for France.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Migration-driven fiscal externality is $0.22 per $1 raised, vs. $0.54 for intensive-margin responses, $0.76 combined — below the Laffer bound. Because the MCPF is nonlinear, migration roughly doubles it from ~2.2 to ~4.2; wealth taxation would be welfare-improving if revenue funds projects with MVPF above 4.2 (e.g., programs for low-income children, often above 5 per Hendren and Sprung-Keyser 2020). Scope conditions: estimates come from reforms that only cut rates, so asymmetric responses to increases cannot be ruled out; the elasticity depends on destination-country taxes (Swedish movers went to low-tax UK non-dom, Switzerland, Austria), so responses could be more muted if all neighbors taxed wealth heavily; results are for small open economies with low wealth inequality and weaker agglomeration than the US, suggesting the estimates are upper bounds; computations reflect 1990s-2000s Scandinavia where offshoring/evasion mattered, and depend on tax base, enforcement, and exit-tax design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the stock elasticity derived from flow elasticities?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using a simple OLG framework, the population stock elasticity ≈ net-flow semi-elasticity times (T+1)/2, where T is the average &amp;rsquo;lifespan&amp;rsquo; of wealthy individuals (the inverse of the regeneration/birth rate into the wealthy population). Longer lifespan means slower regeneration, so lost migrants are harder to replace and the stock effect is larger. This yields a stock elasticity of 1.77 (s.e. 0.47); the effect stays modest because top-of-distribution migration flow rates are very small.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the magnitudes of migration flows and tax-payment effects, and any caveats on persistence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Top-decile out-migration is ~0.2% per year in Sweden (vs. ~0.65% in the bottom half) and ~0.1% in Denmark, rising in the extreme tail; taxable wealth of wealth-tax-liable out-migrants is only 0.09% of total taxable wealth; net migration is small and slightly positive. One year after out-migration, total tax payments fall ~66% (wealth tax -59%, income tax -68%; income taxes are ~90% of the wealthy&amp;rsquo;s payments, implying large fiscal externalities on income tax). Effects attenuate over time: ~40% reduction at five years because ~40% of out-migrants return within five years (migration is persistent but return migration is common). Taxable wealth in Sweden falls 94% one year out; real estate is typically sold, and financial wealth falls at extensive (-21%) and intensive (-15%) margins, confirming real rather than purely fiscal-residence responses.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Technology Sophistication Across Establishments</title><link>https://macropaperwarehouse.com/papers/technology-sophistication-across-establishments/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/technology-sophistication-across-establishments/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: How sophisticated are the technologies establishments actually use, and how close are they to the world frontier? Traditional measures (since Ryan-Gross 1943 and Griliches 1957) characterize technology by the presence of one or a few advanced technologies, which (i) cover too few technologies and unrepresentative tasks, (ii) say nothing about how non-adopters produce or how far they are from the frontier, and (iii) ignore the intensity with which a technology is used. The authors argue intensity of use matters for explaining income divergence (Comin-Mestieri 2018), so they build a direct, comprehensive measure of technology sophistication.&lt;/p&gt;
&lt;p&gt;Data and design: The authors construct &amp;ldquo;the grid,&amp;rdquo; a two-dimensional structure with business functions (BF) on the horizontal axis and technologies ranked by sophistication (simplest to world frontier) on the vertical axis. The grid spans 63 business functions (7 general business functions [GBF] relevant to all sectors plus 56 sector-specific business functions [SSBF] across 12 sectors) and a total of 305 technologies. More than 50 industry experts built and ranked the grid before survey administration. The grid is implemented in the Firm Adoption of Technology (FAT) survey, fielded 2019-2023 to 21,055 randomly selected establishments forming nationally representative samples (for establishments with 5+ workers) in 15 countries spanning all income levels (Korea, Poland, Croatia, Chile, Brazil-Ceara, Georgia, Vietnam, four Indian states, Ghana, Bangladesh, Kenya, Cambodia, Senegal, Ethiopia, Burkina Faso), representing a universe of about 2.1 million establishments. The median establishment has 9 workers (mean 34); 20% of workers hold a college degree, 17% are exporters, 18% are multinational-affiliated. FAT records, per BF, which grid technologies are used and which one is &amp;ldquo;most widely used.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Two measures are built at the BF-establishment level on a [1,5] affine scale: MAX (sophistication of the most advanced technology used, reflecting adoption) and MOST (sophistication of the most widely used technology, reflecting both adoption and intensity/diffusion within the firm). Establishment-level measures are simple averages across in-house BFs. Cardinalization is validated three ways: linearity of the sophistication-productivity relationship; correlation above 0.98 with a z-score cardinalization (Bloom-Van Reenen 2007); and median correlation 0.95 with an independent productivity-based (&amp;ldquo;Q&amp;rdquo;) cardinalization for 18 BFs.&lt;/p&gt;
&lt;p&gt;Main findings with magnitudes: (1) Establishments underutilize their most sophisticated adopted technology. In 63% of BFs where multiple technologies are used, MOST is not the most sophisticated available; the MAX-MOST gap appears in 62% of multi-technology BFs. (2) MAX and MOST are distinct upgrading processes: a one-unit rise in the number of technologies (NUM) raises MAX by 0.84 but MOST by only 0.25; MAX explains just 34% of within-establishment MOST variance. (3) Gaps are persistent, not transitory: only weakly related to age (cross-decile correlation -0.29; individual -0.01) and unrelated to time since adoption. (4) Gap frequency falls with income (country-level 51% in Korea to 83% in Burkina Faso; correlation -0.55 with per-capita income) and rises with input scarcity (low human capital, loan denial) and managerial mistakes (perception bias, family ownership, non-exporting). (5) Within-country dispersion in gaps (0.28) is about three times the between-country dispersion (0.09). (6) Establishment-level MAX and MOST average 2.6 and 2.0; both correlate with income (0.78 for MAX, 0.94 for MOST) and with size, human capital, management, exporter and multinational status. (7) Both productivity and profitability rise with sophistication, more strongly for MOST and for agriculture; the association is not smaller in low-income countries, contradicting the &amp;ldquo;appropriate technology&amp;rdquo; hypothesis.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What are MAX and MOST, and why are they conceptually distinct?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;MAX_{f,j} is the sophistication of the most advanced grid technology establishment j uses in business function f; MOST_{f,j} is the sophistication of the most widely used technology in that function. Both lie in [1,5] with MAX &amp;gt;= MOST by construction, and both measure closeness to the world frontier. They are conceptually different: increases in MAX reflect adoption of a new (to the function) more sophisticated technology, whereas increases in MOST can reflect adoption OR the extension/intensification of an already-adopted technology — closer to Mansfield&amp;rsquo;s (1963) concept of intra-firm technology diffusion. The paper&amp;rsquo;s central empirical claim is that these are driven by distinct upgrading processes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what does the paper NOT claim?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a descriptive/correlational paper, not a causal one. The authors explicitly state their data do not permit causal inference; the productivity, profitability, and characteristic associations are partial correlations from cross-sectional regressions with country and 2-digit sector fixed effects. The BF-level analyses (MAX-NUM, MOST-NUM, MAX-MOST) use establishment and function fixed effects to absorb establishment- and function-specific levels. The main &amp;lsquo;identification&amp;rsquo; work is measurement validity, not causal identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are MAX and MOST shown to be distinct upgrading processes empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three pieces of evidence. First, regressing MAX on NUM (number of technologies) with establishment and function FE yields a coefficient of 0.84 (s.e. 0.01) — near one-to-one — while regressing MOST on NUM yields only 0.25 (s.e. 0.01). Second, regressing MOST on MAX (with FE) shows MAX explains only 34% of within-establishment MOST variance, so MAX is not a sufficient statistic for MOST. Third, MAX and MOST have different distributions (MOST more skewed), different lifecycle profiles, different correlates, and different associations with productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Is the MAX-MOST gap transitory or persistent, and how is this tested?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Persistent. Three exercises: (i) across age deciles the gap correlates only -0.29 with age (-0.01 at the individual level), with no clear lifecycle pattern by income or size except a decline only among large establishments aged 16+; (ii) the distribution of years since adopting a top-tier technology is similar for BFs with and without a gap, so time does not close it; (iii) splitting top-tier adopters into early vs. recent adopters yields similar MOST distributions. Together these confirm gaps persist long after adoption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two hypothesized drivers of MAX-MOST gaps, and what evidence supports each?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Input constraints — scarcity of skilled labor or finance pushes firms to rely on simpler technologies operable by less-educated workers or needing less capital. Supported by the negative coefficient on human capital (college share) and the positive coefficient on the loan-denied dummy. (2) Managerial mistakes — poor management or biased self-perception of one&amp;rsquo;s own sophistication causes suboptimal underuse. Supported by positive correlations with perception bias and family ownership, and a negative correlation with exporter status (competitive pressure narrows the gap); the management z-score association is weak. Across subsamples, input scarcity is more prominent in low-income countries while managerial-mistake proxies are more salient among large establishments (likely from the complexity of managing scale).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in technology sophistication is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By income: country averages span 1.53 (MAX) and 1.01 (MOST); within-country dispersion (p80-p20) rises with income, more steeply for MOST (0.95 vs 0.33). By sector: agriculture shows greater cross-establishment dispersion in both MAX and MOST than manufacturing or services. Lifecycle: MAX rises gradually with age in all income/size groups, but MOST flattens beyond ~10 years in low-income countries and among small establishments. Size effects on MOST are stronger in high-income countries; on MAX they are similar across income levels. The performance-sophistication link is strongest in agriculture and weakest in services, and is not weaker in low- than high-income countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How much of the variation is across vs. within sectors, and why does that matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Syverson (2011), sector dummies explain only 14% (2-digit), 20% (3-digit), and 23% (4-digit ISIC) of cross-establishment variance in sophistication — comparable to their explanatory power for productivity (sales per worker). This implies sophistication variation reflects differences in the technologies used to perform similar tasks, not differences in what tasks/goods establishments produce.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness and validation checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cardinalization: linear approximation of the sophistication-productivity relation; correlation &amp;gt;0.98 with z-score cardinalization; median 0.95 (p25-p75: 0.90-0.98) with a productivity-based Q-cardinalization across 18 BFs; establishment-level baseline-vs-Q correlations of 0.90 (MAX) and 0.91 (MOST). Ranking validity: three-stage expert validation (functionality/integration/automation; novelty and cost; ChatGPT replication) on 14 BFs plus an independent relative-productivity exercise on 18 BFs. Data quality: response rates 15-86% (high for establishment surveys); no significant non-response differences in employment, sophistication, wages, or skill; a Kenya back-check pilot showing 80.6% consistency for technology-use reports; external validation against Korea (KED) and Brazil (RAIS) with cross-establishment correlations above 0.93 for sales/employment and 0.73 for labor productivity; ERP adoption in Korean manufacturing of 32% vs. 40% in Chung-Kim (2021). Establishment-level results are robust to controlling for the in-house fraction of functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It generalizes the intra-firm diffusion literature (Mansfield 1963; Battisti-Stoneman 2003), which studied a handful of technologies in a few countries, by showing MAX-MOST gaps are widespread and persistent across 63 functions and 15 countries. It parallels Bloom-Van Reenen (2007) on management practices in method (expert rankings, survey scoring, z-scores) and finds supporting evidence for the Bloom-Sadun-Van Reenen (2012) technology-management complementarity. It differs from the US Advanced Business Survey / Acemoglu et al. (2022), which covered five frontier technologies, by being comprehensive and frontier-relative. It contributes new evidence to the agricultural productivity gap (Caselli 2005; Gollin-Lagakos-Waugh 2014) and to the appropriate-technology debate (Basu-Weil 1998; Acemoglu-Zilibotti 2001).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the sophistication-performance association is not smaller in low-income than high-income countries, advanced technologies appear &amp;lsquo;appropriate&amp;rsquo; across income levels — challenging the appropriate-technology hypothesis that poor countries gain little from sophisticated technology. Policy should target not only adoption (MAX) but also the extension of use/intensity (MOST), since MOST is more strongly tied to productivity and profitability. Scope conditions: associations are correlational, not causal; samples are representative only for establishments with 5+ workers; coverage is the 12 surveyed sectors; and the cross-section cannot trace dynamics (the authors plan a longitudinal extension).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the descriptive technology-use patterns show about adoption behavior?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Establishments use about two technologies per function on average; 62.6% of functions use more than one and 28.3% use at least three. Leapfrogging/skipping is rare: among single-technology functions (37.4% of cases), 52.8% use the least sophisticated grid technology, so only about 18% of functions have fully skipped or abandoned simpler technologies. In 70.4% of multi-technology functions one technology used is the least sophisticated available, and sophistication gaps (non-contiguous use) occur in only 25% of functions (27% GBF, 17% SSBF; most common in payments 48%, business administration 34%, sales 28%). Firms thus typically retain dominated technologies rather than abandon them, which is why MAX proxies the full adoption history well. Only 16% of establishments use an ERP system (the most sophisticated business-administration technology).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Any notable caveats about the measures themselves?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;MAX-MOST gaps are ordinal (cardinalization-free), but establishment-level MAX and MOST are cardinal and could be sensitive to the chosen cardinalization — addressed by the validation exercises. Establishment-level measures use only in-house functions (87% of relevant SSBFs and an overwhelming majority of GBFs are in-house; only 3.9% of GBFs not in-house), and results are robust to controlling for the in-house share. The survey deliberately avoided the words &amp;rsquo;technology&amp;rsquo; and &amp;lsquo;sophistication&amp;rsquo; (using &amp;lsquo;methods&amp;rsquo;/&amp;lsquo;processes&amp;rsquo;) to limit social-desirability bias.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;The grid&lt;/strong&gt;: A two-dimensional structure mapping each key business function (horizontal axis, task-based) to the range of technologies that can perform it (vertical axis, ranked by sophistication from simplest to the world frontier). Spans 63 business functions (7 general + 56 sector-specific across 12 sectors) and 305 technologies, built and ranked by 50+ industry experts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MAX&lt;/strong&gt;: The sophistication (on a [1,5] affine scale) of the most advanced technology an establishment uses in a given business function. Increases in MAX reflect adoption of a technology new to that function; near one-to-one with the number of technologies used (coefficient 0.84).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MOST&lt;/strong&gt;: The sophistication (on a [1,5] scale) of the most widely used technology in a business function. Changes in MOST reflect both adoption and the intensification/extension of already-adopted technologies — closer to Mansfield&amp;rsquo;s (1963) intra-firm diffusion than to adoption per se; only weakly tied to the number of technologies (coefficient 0.25).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MAX-MOST gap&lt;/strong&gt;: A binary indicator equal to 1 when MAX &amp;gt; MOST in a function with multiple technologies in use — i.e., the most widely used technology is not the most sophisticated one adopted. Present in 62-63% of multi-technology functions, persistent over time, and associated with input scarcity, managerial mistakes, and lower productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;FAT survey&lt;/strong&gt;: The Firm Adoption of Technology survey: a cross-section of 21,055 establishments forming nationally representative samples (5+ workers) in 15 countries (2019-2023), implementing the grid plus modules on financials, employment, management practices, and adoption barriers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Appropriate technology hypothesis&lt;/strong&gt;: In this paper&amp;rsquo;s usage, the claim (Basu-Weil 1998; Acemoglu-Zilibotti 2001) that establishments in poor countries underutilize sophisticated technologies because scarce human and physical capital limits the productivity gains those technologies embody. The paper&amp;rsquo;s finding that the sophistication-performance association is not smaller in low-income countries runs counter to this hypothesis.&lt;/p&gt;</description></item><item><title>The Aggregate Costs of Uninsurable Business Risk</title><link>https://macropaperwarehouse.com/papers/the-aggregate-costs-of-uninsurable-business-risk/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-aggregate-costs-of-uninsurable-business-risk/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; A large literature argues that credit constraints are the dominant financial friction holding private businesses below their optimal scale, so that easing credit access would yield large aggregate efficiency gains. This paper challenges that view. Private businesses are also poorly diversified — their owners bear undiversifiable business-income risk — and the authors argue the macroeconomic costs of this lack of diversification are far larger than those of credit constraints. The crux is that entrepreneurs can limit risk exposure by operating at a smaller scale, so productive-but-poor entrepreneurs choose an inefficiently low scale and are unwilling to borrow to expand. Firm size is thus limited by risk, not by credit availability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and setup.&lt;/strong&gt; The empirical analysis uses the historical Orbis dataset (Moody&amp;rsquo;s Bureau van Dijk), 1995–2019, focusing on Spain (best coverage; results extend to Italy, France, Norway, Portugal, Slovakia in the appendix). Output is value added; the sample is partnerships and private limited companies, excluding FIRE, public administration, defense, education. The final sample is 622,883 firms (6,298,358 firm-year observations), observed on average 10 years; the mean (median) firm has 12 (5) workers and 486 (151) thousand EUR value added. The Spanish Survey of Household Finances (EFF, 2008–2020) provides entrepreneur wealth/prevalence and consumption data. The model is a small-open-economy model of entrepreneurial dynamics (à la Quadrini 2000; Cagetti–De Nardi 2006) with two frictions: each firm is owned by a single (undiversified) entrepreneur, and a collateral constraint k&amp;rsquo; ≤ a&amp;rsquo;/(1−ξ). Key modeling choices: capital AND labor are chosen before productivity is observed (time-to-build), and productivity has persistent and transitory shocks drawn from fat-tailed mixtures of normals. Parameters are estimated by simulated method of moments (9 parameters, 16 moments; objective 0.013, ~1.3% average deviation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings.&lt;/strong&gt; Profit shares fluctuate sharply: 5% of firms have losses exceeding 20% of output, against an average profit share of 0.13; the 5th percentile of profit-share deviations is −0.33 and the 95th is +0.47. Output growth is fat-tailed (s.d. 0.48, IQR/s.d. ratio 0.65 vs 1.35 Gaussian; excess kurtosis 10.7). Inputs do not track output: regressing wage-bill growth on output growth gives 0.40 (capital 0.16); restricting to |Δlog y|&amp;lt;0.5 gives 0.58 and 0.31. A change in profit share on output growth has slope 1.56 (0.46 in the restricted sample). The headline result: eliminating both frictions would raise output by 15.8%; eliminating the risk wedge alone raises output by 15.4%, while eliminating the credit wedge alone raises output by only 0.4%. Misallocation losses are 10.8% (11.0% due to risk, 0.2% due to credit). Aggregate wedges are equivalent to a 12.8% tax on labor and 14.9% on capital. Wage losses are 27.8% (26.4% risk, 0.4% credit).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms and implications.&lt;/strong&gt; Two wedges distort choices: a risk wedge (from the covariance of consumption and productivity) that distorts both labor and capital, and a credit wedge (from the binding collateral constraint) that distorts only capital. The credit wedge falls quickly with wealth (vanishing once unconstrained), but the risk wedge declines only gradually and persists even for wealthy entrepreneurs. Aggregate losses are governed by the distribution of wedges weighted by efficient firm size (Hopenhayn 2014): risk wedges are large precisely for high-ability entrepreneurs who would be large under efficiency, whereas credit-constrained firms are mostly unproductive with small efficient size. Policy implication: improving credit access has limited impact unless it also improves risk sharing. The findings also imply firm profits largely reflect compensation for risk (75% of the aggregate profit share), and dispersion in returns to business wealth largely reflects risk compensation.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for the model, and how are parameters pinned down?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Parameters ϑ=(β,α,η,ρ,σu,σε,s,p,ϕ) are estimated by simulated method of moments, minimizing a weighted distance between 16 empirical and model moments scaled by 1+empirical moment (objective = 0.013, ~1.3% average deviation). Intuitively: β is pinned by the entrepreneur wealth-to-income ratio (12.5 in data and model); α and η by the capital-output ratio (1.22 vs 1.21), labor share (0.72 vs 0.71) and profit share (0.13 vs 0.14); ρ, σu, σε by output autocorrelations at horizons 1–3, the cross-sectional s.d. of output, and the s.d. of output growth at horizons 1–3; the tail parameters s and p by the IQR of output growth relative to its s.d.; and ϕ by the entrepreneurship rate. Three assigned parameters: δ=0.10, r=0.02, θ=2, with ξ=0.408 set to match the aggregate debt-to-capital ratio of 0.408. Standard errors (bootstrapped) are small because the firm sample is very large.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the main mechanism, and how are the risk wedge and credit wedge distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because labor and capital are chosen before productivity is realized and risk is undiversified, the entrepreneur weights future states by their own stochastic discount factor. The risk wedge τ (&amp;gt;1) arises from the negative covariance between marginal utility of consumption and productivity and distorts both labor and capital equally. The credit wedge ω (&amp;gt;1 when the collateral constraint binds) distorts only capital. As wealth rises, the credit wedge falls rapidly and vanishes once the firm is unconstrained, but the risk wedge declines only gradually and never disappears. The two are isolated quantitatively by setting ω=1 (to get the role of risk) or τ=1 (to get the role of credit) in the productivity-loss mapping (eq. 13).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does risk dominate credit in the aggregate even though most firms are credit-constrained?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Aggregate outcomes depend on the distribution of wedges weighted by efficient firm size n_it (Hopenhayn 2014). Weighted by efficient size, the risk wedge ranges from 1.27 (10th pct) to 1.61 (90th pct), while the credit wedge is essentially 1 except at the very top (1.02 at the 90th pct). Unweighted, the risk wedge is only 1.12 at the 90th pct and the credit wedge is positive for more than half of firms — but those constrained firms are unproductive with small efficient size. Risk wedges are large precisely for high-ability entrepreneurs who would be large under the efficient allocation, so they drive the aggregate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the result robust to the form of the collateral constraint?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors consider two extremes: no borrowing at all (ξ=0) and unlimited borrowing (ξ=1, no credit limit). With no borrowing, misallocation losses rise only from 10.8% to 11.7%, still mostly risk-driven (8.3% risk vs 1.4% credit). With no credit limit, risk wedges remain nearly as large as baseline and removing credit frictions has negligible effects. Intuitively, risk leads entrepreneurs to operate small and accumulate precautionary wealth, so they self-finance most desired capital and credit wedges stay small even without credit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Which three ingredients are essential to the risk-dominates result, and what happens without each?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Fat-tailed productivity shocks, (2) transitory productivity shocks, and (3) labor chosen before productivity is realized. Removing each in isolation (with re-estimation) reverses the conclusion so that credit becomes the primary driver: without fat tails, misallocation losses fall to 2.1% (credit 1.5%, risk 0.3%); without transitory shocks, losses are 12.1% (credit 10.9%, risk 0.4%); with flexible labor, losses fall to 3.3% (credit 2.4%, risk 0.1%). The flexible-labor case matters because risk then distorts only capital, whose share is smaller than labor&amp;rsquo;s, reducing income volatility and pushing firms to expand and hit the credit constraint. In all three counterfactuals, the 1st percentile of profit-share deviations ranges −0.21 to −0.43, far smaller in magnitude than the data (−1.66) or baseline model (−1.92).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Is the result driven by high risk aversion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The baseline uses relative risk aversion θ=2. Re-estimating with θ=0.5 (low end of usual values) still yields sizable, risk-dominated losses: productivity losses 6.4%, output losses 9.2%, wage losses 16.7% — roughly three-fifths of the baseline — and again primarily driven by risk rather than credit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What untargeted moments does the model match (model validation)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model reproduces the distribution of profit-share deviations (10th pct −0.17 data vs −0.16 model; 1st pct −1.66 data vs −1.92 model), the full distribution of output growth rates, the low wage-bill/output comovement (0.58 data vs 0.55 model in the restricted sample), the profit-share/output comovement (0.46 vs 0.42; falling to 0.10 vs 0.06 when holding the labor share constant), and the persistence/volatility of capital and labor (e.g., wage-bill growth s.d. 0.36 vs 0.32). Critically, it matches the low comovement of entrepreneur consumption with profits: regressing Δc on Δπ gives a slope of 0.02 in both data and model (data based on 799 EFF observations, three-year changes).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity and external validity does the paper document?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The motivating facts hold for Italy, France, Norway, Portugal and Slovakia, and for Spanish public firms; for young (age≤5) and old firms; for small and large firms (top decile of value added vs rest); and across the five largest sectors (manufacturing, construction, wholesale/retail, accommodation/food, professional activities). Output-growth kurtosis ranges roughly 11–18 across countries. On diversification: 12% of households are entrepreneurs; 93% of entrepreneurs own exactly one business; multi-business owners hold 71% of their business wealth in their main business; the average ownership share is 83%, and 71% own 100% of their main business.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the extensive-margin and unconstrained-firm results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Extensive margin: when the planner can also choose who becomes an entrepreneur, it cuts the entrepreneurship rate from 13.2% to 1.2%, but because marginal entrepreneurs are low-ability the gains are small — productivity, output and wage losses relative to the unconstrained planner are 10.8%, 16% and 27.8%, very close to the intensive-margin numbers. Unconstrained firms: adding a frictionless sector calibrated to match the 58.7% output share of public firms in Orbis leaves misallocation losses at 10.5% (vs 10.8% baseline), still mostly risk-driven (risk 10.1%, credit 0.1%); wage losses fall to about three-fifths of baseline because the unconstrained sector reduces the aggregate labor wedge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the implications for profits and returns to wealth?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Decomposing the profit share into span-of-control, risk and credit components: risk accounts for 75% of the aggregate profit share (0.11/0.146), with the rest from span of control; credit contributes little. Risk also drives most of the profit-share dispersion (s.d. 5.5%, essentially all from risk; credit contributes only 1%). For excess returns to wealth, the mean of 2.2% is almost entirely accounted for by risk, and risk drives most of the dispersion (s.d. 5.5%). This implies dispersion in returns to private business wealth — a driver of wealth inequality — largely reflects compensation for risk rather than credit constraints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the working-capital robustness check?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Adding a working-capital constraint where a fraction ϑ=0.25 of the wage bill is paid in advance (à la Mendoza 2010), evaluated at baseline parameters, gives misallocation losses of 11.1% (vs 10.8% baseline), with risk still accounting for the bulk (9.4%) and credit less important (1.3%); risk accounts for 13.4% of the 16.3% total output losses. So even when credit frictions can also distort labor, risk remains dominant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central implication is that policies expanding firms&amp;rsquo; access to credit will have limited aggregate impact unless they also improve risk sharing. This holds within the scope of the model — undiversified private businesses with single owners, where risk exposure is endogenously chosen via scale and can be partly self-insured through wealth, labor income, and occupational switching. The authors note their framework assumes (rather than micro-founds) the lack of diversification, and suggest future work should model the moral-hazard or informational frictions preventing diversification, and broaden redistributive tax analysis to incorporate uninsurable-risk distortions (as in Di Tella et al. 2024).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contributes to the misallocation literature (Hsieh-Klenow 2009; Buera et al. 2011; Moll 2014; Midrigan-Xu 2014; Gopinath et al. 2017). Prior work on risk and investment (Tan 2018; Robinson 2021; David et al. 2022a) studies how risk distorts investment; this paper instead emphasizes how risk distorts LABOR choices, relating it to Arellano et al. (2019) and David et al. (2022b). It differs from the credit-constraint-centric tradition by showing credit matters little once undiversified risk and the three key ingredients are present. Di Tella et al. (2024), partly motivated by these findings, study optimal policy under uninsurable risk and show it is the opposite of optimal policy when misallocation stems from markups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Risk wedge (τ)&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the gap between the expected marginal product of an input and its price arising from undiversifiable business risk. It equals [1 + COV(c^{-θ}, zε)/(E c^{-θ} · E zε)]^{-1}, generally &amp;gt;1 because of the negative covariance between the entrepreneur&amp;rsquo;s marginal utility of consumption and productivity. It distorts both labor and capital, declines only gradually with wealth, and persists even for wealthy entrepreneurs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Credit wedge (ω)&lt;/strong&gt;: The distortion from a binding collateral constraint, ω=1+(1−ξ)μ/R, where μ is the multiplier on the constraint k&amp;rsquo;≤a&amp;rsquo;/(1−ξ). It exceeds one only when the constraint binds, distorts only capital, falls rapidly with wealth, and vanishes once the entrepreneur is unconstrained.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Profit share&lt;/strong&gt;: In this paper, the ratio of profits to output (value added), π_it/y_it, where profit is output net of the wage bill and the user cost of capital. Its average is 0.13; the paper studies its large transitory firm-level fluctuations as the empirical signature of uninsurable risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time-to-build (inputs chosen before productivity)&lt;/strong&gt;: The assumption that both capital and labor are chosen before the firm observes its productivity shock. This parsimoniously generates the imperfect high-frequency comovement between inputs and output and makes wealth affect employment as well as investment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficient-size-weighted wedge distribution&lt;/strong&gt;: The paper&amp;rsquo;s organizing device (following Hopenhayn 2014): aggregate productivity losses depend on the distribution of risk and credit wedges weighted by each firm&amp;rsquo;s efficient size n_it. Because high-ability firms have large efficient size and large risk wedges, risk dominates the aggregate even though most firms are credit-constrained.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-financing&lt;/strong&gt;: The mechanism by which entrepreneurs, operating at small scale and saving for precautionary reasons because of risk, accumulate enough wealth to finance most of their desired capital — so credit wedges stay small even in an economy with no credit, rendering the borrowing limit nearly irrelevant for aggregates.&lt;/p&gt;</description></item><item><title>The Efficiency-Equity Tradeoff of the Corporate Income Tax: Evidence from the Tax Cuts and Jobs Act</title><link>https://macropaperwarehouse.com/papers/the-efficiency-equity-tradeoff-of-the-corporate-income-tax-evidence-from-the-tax-cuts-and-jobs-act/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-efficiency-equity-tradeoff-of-the-corporate-income-tax-evidence-from-the-tax-cuts-and-jobs-act/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper estimates the firm- and worker-level effects of the corporate income tax cuts in the 2017 Tax Cuts and Jobs Act (TCJA) — the largest corporate tax cut in U.S. history — to inform the long-running efficiency-versus-equity debate over corporate taxation. The question matters because federal corporate tax reforms are rare, prior credible evidence comes mostly from subnational or small-economy variation (where factors are more mobile and the tax base smaller), and theory predicts alternate instruments behave differently, so existing estimates may not extrapolate to a major reform in a large advanced economy.&lt;/p&gt;
&lt;p&gt;Identification exploits that TCJA cut the top C-corporation rate from 35% to 21% (a 40% reduction) while cutting the implied top rate for S corporations far less — from 39.6% to 37%, and to 29.6% for many via the new 20% Qualified Business Income deduction (a cumulative ~25% reduction). The authors use employer-employee matched federal tax records (corporate SOI files merged with W-2 and individual returns), tax years 2013-2019, on a balanced panel of large firms (&amp;gt;=50 employees and &amp;gt;=$1M sales each pre-period year): 15,490 firms and 108,430 firm-year observations. The main design is an event study / 2SLS comparing similarly sized C and S corps in the same industry-size bin, with firm and industry-size-year fixed effects and standard errors clustered by firm; entity-switchers are dropped. The identifying assumption is parallel trends absent the tax change (as in Yagan 2015), not random C/S assignment.&lt;/p&gt;
&lt;p&gt;First stage: C corps&amp;rsquo; marginal tax rate fell ~5.0 pp (s.e.=0.2) relative to S corps, raising the log net-of-tax rate ~6.6% (s.e.=0.2); C corps paid ~$2,100 (s.e.=341) less tax per worker. Real effects: C-corp sales rose 3.9 pp (s.e.=1.2) relative to S corps; pre-tax profits +3.0 pp (s.e.=0.7); after-tax profits +4.0 pp (s.e.=0.7); total payouts +21.9% intensive (s.e.=2.9) and +3.0 pp extensive (s.e.=0.5); employment +2.3% (s.e.=0.8); payrolls +3.4% (s.e.=0.8); net investment +2.9% (s.e.=0.4). The benchmark corporate elasticity of taxable income (pre-tax profits) is 0.46 (s.e.=0.11); after-tax-profit elasticity 0.61 (s.e.=0.11); investment elasticity 0.45 (s.e.=0.07). Worker earnings are flat for the bottom 90% (median wp50 coefficient -0.001, s.e.=0.004) but rise for the top 10%: +1.3% at the 95th percentile (s.e.=0.4), +4.8% at the 99th, and +4.8% for executives (top-5 paid; s.e.=0.7, earnings elasticity 0.73). Executive-pay gains barely shrink when controlling for firm performance (4.8% to 4.5%) and are concentrated among incumbents, consistent with rent-sharing rather than productivity.&lt;/p&gt;
&lt;p&gt;Responses concentrate in capital-intensive industries and are not larger for cash-constrained firms, pointing to a cost-of-capital channel rather than liquidity. Via a stylized model, a $1 marginal cut in corporate tax revenue generates $0.44 in additional output; revenue falls $0.85 per $1 mechanical loss (total -$86 billion, 0.40% of GDP). Factor incidence: 51% of gains to firm owners, 10% to executives, 38% to high-paid workers, 0% to low-paid workers. Across the income distribution, 80% of gains accrue to the top 10% and 20% to the bottom 90%, with gains concentrated in the Northeast/West and large high-income cities. The corporate tax is ~twice as inefficient as the personal income tax but similarly progressive, suggesting margin-of-efficiency gains from shifting toward personal income taxation. Results are short-run and abstract from public-goods provision and deficit financing.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy is a difference-in-differences/event study (and 2SLS) comparing C corporations to S corporations in the same industry-size bin before and after TCJA, instrumenting the change in the log net-of-tax rate with pre-existing C/S entity status, with firm and industry-size-year fixed effects and firm-clustered standard errors. The identifying assumption is parallel trends in outcomes absent the tax change (not random C/S assignment), supported by (a) flat pre-trends in the event studies, (b) Yagan (2015) showing C and S trends were statistically indistinguishable 1996-2008, (c) the unexpected nature of TCJA before the 2016 elections limiting anticipation, and (d) industry-size-year fixed effects matching firms in similar product markets. Main threats: anticipatory/intertemporal tax shifting (some rate decline already in 2017; executive pay also trends up in 2017); other concurrent TCJA provisions (bonus depreciation, DPAD repeal, NOL/interest limitation, international); endogenous entity switching; differential industry-size composition; and general-equilibrium/SUTVA violations where C-corp gains could be S-corp mirror-image losses or where common wage effects are absorbed by time fixed effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors argue the dominant mechanism is a reduction in the cost of capital from the permanent rate cut, not liquidity relief and not primarily bonus depreciation. Evidence: (1) responses are larger in capital-intensive industries (profits and investment), consistent with the cost-of-capital first-order condition; (2) high-cash firms are if anything more responsive than low-cash firms, ruling out liquidity constraints (and thus income effects); (3) bonus depreciation is downweighted because many eligible firms do not claim it, much capital (intangibles, structures) is never fully expensed, C and S corps had near-identical expensing exposure (so the design differences them out), and the investment response is driven almost entirely by short-lived assets rather than the long-lived assets where accelerated depreciation is most valuable. A complementary dynamic-adjustment-cost model (Auerbach-Hassett 1992 with Foertsch 2018 cost-of-capital inputs) yields elasticities very similar to the benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By capital intensity: C corps in capital-intensive industries show significantly larger profit and investment responses (supporting the cost-of-capital channel). By liquidity: high-cash firms are no less (if anything more) responsive than low-cash firms, contrasting with Zwick and Mahon (2017). By firm size: no clear pattern in profits, median earnings, or investment, with only suggestive evidence that high-income-worker gains are larger in smaller firms. By worker position: earnings gains are concentrated entirely in the top 10% of the within-firm distribution and especially in executives, with zero gains below the 90th percentile. By worker tenure: gains are driven by incumbents, not new hires (consistent with rent-sharing). Geographically: gains concentrate in the Northeast and West and in large high-income commuting zones (e.g., ~3x the median CZ gain in New York City, ~5x in the San Francisco Bay Area).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Alternate specifications (Table 7): cohort(age)-by-year FE, state-by-year FE, firm-specific pretrend controls, 6-digit NAICS industries, reweighting S to match the C industry-size distribution, inverse-propensity-weighting, log-transformed outcomes, winsorizing at 5th/95th percentiles, and 2016-sales/payroll weighting — elasticities are stable. Alternate samples (Table 8): excluding firms with &amp;gt;$1B sales or &amp;gt;10,000 employees, excluding mismatched industries (C share &amp;gt;80% or &amp;lt;20%), excluding manufacturing (trade-war exposure), unbalanced panel, excluding public firms, excluding industries most exposed to DPAD/NOL/interest-limitation/bonus-depreciation provisions, excluding multinationals, dropping tax years 2017-2018 (anticipation/shifting), and dropping single-owner S corps (wage/profit reclassification). Entity switching rose only from ~0.1% to ~0.3% (profit-weighted) and is negligible. Most estimates stay within the benchmark confidence intervals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the C-vs-S comparison design of Yagan (2015) but studies marginal corporate rate cuts rather than the 2003 dividend tax cut. It obtains an investment elasticity (0.45) very close to Chodorow-Reich et al. (2023)&amp;rsquo;s 0.52 despite a different identification strategy and sample. Its corporate ETI (0.46) is below state/local estimates (Giroud-Rauh ~0.50; Suarez Serrato-Zidar ~0.9; Bachas-Soto 3.0-5.0 in Costa Rica) but above typical personal-income ETIs (Saez et al. central 0.25), consistent with distortions scaling with factor mobility. Its incidence finding — that the corporate tax falls on capital and high-income workers — differs from Fuest et al. (2018), who find German municipal corporate tax hikes fall on low-skilled/marginally-attached workers (the authors note possible asymmetry between hikes and cuts and small-firm effects), and aligns with Risch (2024). It uses directly observed owner returns and the full earnings distribution, requiring weaker assumptions than Suarez Serrato-Zidar (2016, who infer owner returns structurally) and Fuest et al. (who assume negligible rental-rate changes).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;On efficiency: a $1 cut in corporate tax revenue yields $0.44 of additional output, and current U.S. top corporate rates appear below the revenue-maximizing rate (revenue falls only $0.85 per $1 mechanical loss). The corporate tax is ~twice as inefficient as the personal income tax but similarly progressive, and 3-4x more progressive than the payroll tax while being 2-3x as inefficient — implying that shifting the federal revenue mix toward personal income taxes could raise efficiency without much loss of progressivity. On equity: the cuts are regressive in the short run, with 80% of gains to the top 10% (24% to the top 1%, 56% to the 90-99th percentiles), 0% to low-paid workers, and 17% flowing to foreign equity holders. Scope conditions: estimates are short-run (through 2019, pre-COVID); they hold welfare equal to output (ignoring utility curvature); they assume a representative consumer (no consumer-price channel) and equal redistribution of revenue; they abstract from deficit financing, public-goods provision, and long-run productivity/wage effects; and the very largest C corps have no S-corp analogue, so their responses are not well identified.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What other significant findings, extensions, or caveats appear?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Employment increases reflect predominantly reallocation of workers across sectors rather than net new hiring, which the authors account for in the aggregate analysis (and is why incidence focuses on wages, not employment). New investment gains are in short-life assets (e.g., computers), with no change in long-life machinery or structures. Firms returned excess profits via dividends and buybacks but did not increase equity or debt issuance, and shareholder-payout results are robust to excluding multinationals (so the repatriation holiday is not the driver). Executive pay shifted forward into 2017 (bonuses) to be deducted at the higher pre-cut rate. Caveats flagged by the authors: rent-sharing tests are suggestive not dispositive (conditioning on post-treatment outcomes; unobserved hours/effort; short two-year horizon); private-income components are precisely estimated but the welfare confidence interval includes zero (up to ~0.4% of GDP); and long-run channels (productivity, lower prices, real wages) and offsetting cuts to public services/transfers are outside the analysis.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;C corporation vs. S corporation&lt;/strong&gt;: The two legal entity types whose divergent TCJA tax treatment provides identification. C corps pay corporate income tax directly (rate cut 35% to 21%) and their dividends are taxed at the shareholder level; S corps pass income through to up to 100 individual U.S. shareholders who pay ordinary income tax (top rate cut 39.6% to 37%, or 29.6% with QBI), with no corporate-level or dividend tax.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implied marginal tax rate (for S corps)&lt;/strong&gt;: Because S corps pay no entity-level tax, their firm marginal rate is constructed as the ownership-share-weighted average of the individual marginal income tax rates of the firm&amp;rsquo;s owners, computed from linked personal returns (e.g., two equal owners at 25% and 35% imply 30%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Corporate elasticity of taxable income (ETI)&lt;/strong&gt;: The percent change in the corporate tax base (pre-tax profits) per percent change in the net-of-tax rate; the paper&amp;rsquo;s benchmark is 0.46. Following Feldstein (1999), it summarizes the deadweight loss / efficiency cost of the tax under negligible income shifting and income effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Net-of-tax rate&lt;/strong&gt;: One minus the marginal tax rate, ln(1-tau); the object firms optimize against, used to scale reduced-form effects into elasticities. TCJA raised C corps&amp;rsquo; log net-of-tax rate by ~6.6% relative to S corps.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cost-of-capital channel&lt;/strong&gt;: The mechanism by which a lower tax rate (or higher expensing parameter theta) reduces the user cost of capital phi = r(1-theta*tau)/(1-tau), raising capital demand, labor demand, and firm scale — the paper&amp;rsquo;s preferred interpretation, distinguished from liquidity effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal excess burden&lt;/strong&gt;: dW/dT, the change in welfare (output, defined as private income plus tax revenue) per dollar of corporate tax revenue; estimated so that $1 of foregone corporate revenue generates $0.44 of additional output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incidence across the income distribution&lt;/strong&gt;: An extension of factor incidence that assigns owners&amp;rsquo; capital gains back to workers using the Distributional Financial Accounts (since many workers hold equity and many owners work), yielding the result that 80% of tax-cut gains accrue to the top 10% of earners.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rent-sharing&lt;/strong&gt;: The channel whereby earnings gains accrue to incumbent high-paid workers and executives rather than to new hires (the marginal unit of labor), with executive pay only weakly tied to firm performance — interpreted as workers/executives capturing a share of excess after-tax profits.&lt;/p&gt;</description></item><item><title>The Transmission of Monetary Policy to Corporate Investment: the Role of Loan Renegotiation</title><link>https://macropaperwarehouse.com/papers/the-transmission-of-monetary-policy-to-corporate-investment-the-role-of-loan-renegotiation/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-transmission-of-monetary-policy-to-corporate-investment-the-role-of-loan-renegotiation/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; This paper asks how monetary policy transmits to corporate investment through bank credit, and specifically whether the relevant credit margin is the origination of &lt;em&gt;new&lt;/em&gt; loans (the channel emphasized by the traditional credit/bank-lending channel literature, e.g., Kashyap, Stein and Wilcox, 1993) or the &lt;em&gt;renegotiation&lt;/em&gt; of existing loans. The motivation is institutional: in the U.S., almost 70% of corporate loan contracts are renegotiated prior to maturity, with firms renegotiating existing loans about twice as often as issuing new ones, and renegotiations typically alter loan amounts, spreads and maturities by 30%–40% of initial values. Prior work measured only new lending, disregarding these revisions. The author claims this is the first study to distinguish new loans from revisions of existing loan terms in the transmission channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and empirical strategy.&lt;/strong&gt; The author builds a novel loan-level panel by combining automated textual analysis with manual review of SEC EDGAR credit-agreement filings (2005–2015, spanning conventional and unconventional/ZLB policy). Each loan path is traced from origination through renegotiations to maturity/early termination. After standard restrictions the loan-level sample has 9,565 loan paths from 2,685 firms, totaling 129,733 loan-quarter observations; ~53% of observations are private firms. Dataset accuracy exceeds 94% versus Roberts (2015)&amp;rsquo;s hand-collected data (~90% of ~300 matched observations agree completely). Loan data are merged with Compustat, Call Report, DealScan, FISD/SDC. The impulse is the Bu, Rogers and Wu (2021) monetary policy shock series (covers conventional + unconventional policy, purged of information effects), aggregated to quarterly. Identification uses local projections (Jordà, 2005): a linear probability model at the bank-firm-quarter level for the extensive margin of credit (origination vs renegotiation indicator), an intensive-margin variant using cumulative standardized within-bank-firm demeaned loan amount/spread, and a firm-quarter investment-response regression. Shocks are normalized so positive = expansionary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings.&lt;/strong&gt; A 25bps expansionary shock raises the renegotiation probability by about 1.7–2.1 percentage points in the same quarter (economically large vs the ~10%, specifically 10.2%, average quarterly renegotiation rate), persisting for about three quarters. The effect on new-loan origination is positive but weaker and varies across specifications (~0.3–1.5 pp). On the intensive margin, renegotiation expands loan amount by ~0.2 standard deviations vs average renegotiations, with no significant spread increase; new-loan volume shows limited/weak evidence of increase (origination amount coefficient -0.184*, spread insignificant). Effects are asymmetric: expansionary shocks matter more than contractionary ones on the extensive margin (Wald test rejects symmetry for renegotiation p=0.000 and origination p=0.013), but not the intensive margin. For investment: firms that renegotiate raise investment relatively more than non-renegotiators, with the relative effect notable from 3 quarters and peaking at 10 quarters—faster than the average response, which peaks at 18 quarters (where a 25bps expansionary shock raises the investment rate up to ~0.2%). Heterogeneity: highly leveraged &amp;amp; bank-dependent firms have ~3–4 pp higher origination/renegotiation propensity after the shock, and renegotiation amplifies their investment response. New-loan issuance, by contrast, is driven by &lt;em&gt;prior&lt;/em&gt; investment growth (firms with prior investment/assets one SD above average are ~0.7 pp more likely to originate). Contribution to the aggregate: renegotiating firms account for ~47.4% [43.6, 51.4] of the average investment response, originating firms ~11.9% [8.5, 15.2], and either activity ~55.1% [51.3, 58.8].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implications.&lt;/strong&gt; Renegotiation, not new origination, is the dominant bank-credit channel transmitting monetary policy to investment, it acts faster than origination, and it amplifies responses for financially constrained firms—implying monetary policy eases their constraints via improved credit access through renegotiation. Policymakers should monitor renegotiation dynamics, not just total loan balances, and coordinate prudential and monetary policy since prudential regulation affects renegotiation conditions.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author uses local projections (Jordà, 2005) with the Bu, Rogers and Wu (2021) monetary policy shock series as the exogenous impulse. That shock is constructed to be exogenous (heteroskedasticity-based partial least squares isolating monetary from non-monetary news), purged of central-bank information effects, and largely unpredictable from Blue Chip forecasts/news/sentiment, addressing the standard confounding of policy actions with the central bank&amp;rsquo;s economic outlook. For the credit-margin regressions, bank and firm fixed effects (and in saturated specs, bank-by-firm fixed effects) absorb persistent supply- and demand-side and relationship heterogeneity; in the heterogeneity regressions bank-by-time fixed effects absorb credit-supply variation so the interaction identifies demand-side variation. Standard errors are two-way clustered. Threats: generated-regressor inference (the shock is estimated), which the author notes Pagan (1984) shows yields consistent SEs under the null and which holds when using shocks as instruments for interest rates; and demand-supply confounding, addressed via fixed effects. A subtler concern is reverse selection in investment regressions—firms renegotiating because investment is already trending up—which the paper addresses head-on in the decomposition (Section 3.2.3).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core distinction is renegotiation vs new origination. Renegotiation responds strongly and immediately to expansionary shocks (1.7–2.1 pp), expands borrowing (~0.2 SD) without raising spreads, and is independent of prior investment growth. Origination responds weakly, and its likelihood is instead predicted by the firm&amp;rsquo;s prior investment growth (~0.7 pp per SD), so it follows rather than drives investment. The decomposition (Table 8) separates total discounted investment growth (t-1 to t+18) into &amp;rsquo;lead&amp;rsquo; (t to t+18) and &amp;rsquo;lagged&amp;rsquo; (t-1 to t) components: for renegotiating firms the total response (0.537**) is driven by the lead component (0.707***) not the lagged (-0.178, insignificant), confirming renegotiation predicts &lt;em&gt;subsequent&lt;/em&gt; investment; for originating firms none of total/lead/lagged is significant. The paper also reasons that renegotiation is cheaper (fee ~0.1–0.3% of loan vs origination fee ~0.5–5% plus search/matching costs) and yields a larger borrower surplus, explaining why firms prefer it after accommodative shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) By financial constraint: highly leveraged &amp;amp; bank-dependent firms (15.8% of firm-quarter obs) show ~3–4 pp higher semi-elasticity of both origination and renegotiation propensity after a 25bps expansionary shock, and renegotiation significantly magnifies their investment response (triple-interaction, Figure 5). (2) By prior investment: firms with high ex-ante investment growth are more likely to originate (not renegotiate). (3) By age: younger firms rely more on new-loan issuance than renegotiation. (4) Alternative constraint proxies (size, leverage, distance to default, younger-and-non-dividend) in appendix figures confirm constrained/closer-to-default firms have higher credit-adjustment likelihood. (5) By renegotiation subtype: amount, spread and covenant adjustments produce greater relative investment responses, but maturity changes do not. Notably the intensive-margin loan-amount response shows NO significant heterogeneity by constraint or prior investment (Table 6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Controlling for lender-specific bank capital ratio (Table B.1.1); estimating at the more granular loan-quarter level (Table B.1.2); an alternative construction of zeros for the origination indicator covering all ever-matched bank-firm pairs (Table B.1.3, which shows no immediate origination effect but lagged effects—widening the renegotiation/origination gap); using central-bank information shocks of Jarociński and Karadi (2020), which have the opposite sign on credit propensity, consistent with the information-effect interpretation (Table B.1.4); using the shock as an instrument for interest-rate changes (results unchanged); alternative shock series (Nakamura-Steinsson; Jarociński-Karadi); a nonlinear (logit/probit) procedure; and an alternative unweighted quarterly shock aggregation. The micro data also reproduce macro investment dynamics (~0.9 correlation with BEA private nonresidential fixed investment), validating external relevance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends the bank-lending and firm-balance-sheet credit-channel literature (Kashyap-Stein-Wilcox 1993; Jiménez et al. 2012; Abuka et al. 2019) which measured only new lending, by separating renegotiation. It extends Ippolito, Ozdagli and Perez-Orive (2018)&amp;rsquo;s floating-rate channel by showing renegotiation alters loan terms in ways that can dominate the mechanical floating-rate/policy-rate link. It vastly expands the renegotiation data of Roberts (2015) (114 firms) and Roberts and Sufi (2009) via text mining, and is more comprehensive than supervisory SNC/Y-14 data (which miss major renegotiation types). On heterogeneity it complements Caglio, Darst and Kalemli-Özcan (2021), Jeenas (2019), Ottonello and Winberry (2020), and Cloyne et al. (2023). On asymmetry it aligns with Kandil (1995) and extends Abuka et al. (2019) (asymmetry on extensive but not intensive margin). It links to Lummer and McConnell (1989) on the informational distinctness of renegotiated vs new loans, and to Mian and Santos (2018) on renegotiation and capex over the credit cycle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because monetary policy transmits to investment with a lag while renegotiation responds immediately, renegotiation can serve as an early predictor of effective transmission, so policymakers should monitor renegotiation dynamics—not just total loan balances. Renegotiation is described as potentially &amp;rsquo;the sole lifeline&amp;rsquo; for financially constrained firms, magnifying their investment response. The paper highlights coordination between micro/macroprudential policy and monetary policy, since prudential regulation affects renegotiation lending conditions (Thakor and Furlong Wilson, 1995); depending on objectives, regulators might relax or tighten renegotiation conditions. Scope conditions: estimates apply to U.S. firms 2005–2015 spanning conventional and unconventional/ZLB regimes; effects are stronger for expansionary than contractionary shocks (asymmetry); and the author flags that the renegotiation channel&amp;rsquo;s role may differ between conventional and unconventional periods as a topic for future research.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What significant caveats or measurement details apply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Renegotiations bundle amendments, amended-and-restated agreements and replacements, recorded together because the economic distinction is minor (following Roberts, 2015). Pre-specified contractual changes (rating-triggered spread increments, Evergreen auto-extensions) are NOT counted as renegotiations. Loans are assumed matured absent contrary SEC evidence. Intensive-margin samples are much smaller (conditional on the event and on non-missing spreads). The firm-quarter investment sample requires firms observed at least 6 years (24 quarters). Observations with negative bank capital (&amp;lt;0.4%, mostly during the GFC) are excluded. Balance-sheet variables are winsorized at 1% (0.5% for some). The investment-rate mean is ~0.2 (capxq*4/lagged ppentq); average bank capital ratio is 12.2% (SD 4.8%).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Uncertainty and Change: Survey Evidence of Firms' Subjective Beliefs</title><link>https://macropaperwarehouse.com/papers/uncertainty-and-change-survey-evidence-of-firms-subjective-beliefs/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/uncertainty-and-change-survey-evidence-of-firms-subjective-beliefs/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: A large literature shows that firms perceiving more uncertainty make more cautious intertemporal decisions (investment, hiring, price setting), but it is far less clear what makes firms uncertain in the first place. Macro models typically impose rational expectations and treat uncertainty as exogenous shocks to the conditional volatility of fundamentals. The paper asks how subjective uncertainty arises and evolves, and whether it is the same object as conditional volatility.&lt;/p&gt;
&lt;p&gt;Data and design: The authors build a new panel from a quantitative module they added in 2012 to the ifo Business Survey of German manufacturing firms. At the start of each quarter, top managers report (i) last quarter&amp;rsquo;s realized sales (&amp;ldquo;Umsatz&amp;rdquo;) growth, (ii) a one-quarter-ahead point forecast, and (iii) best- and worst-case scenarios. The &amp;ldquo;span&amp;rdquo; between best and worst case is their quantitative measure of subjective uncertainty; the forecast error is realized growth minus the point forecast. The baseline sample is 1,005 firms and 8,889 firm-quarter observations over 27 waves, 2013:Q2–2019:Q4 — a calm period with no German recession. A simple scenario-analysis model (Proposition 1) shows that under a quadratic loss and a location-scale shock family, span is proportional to subjective standard deviation, justifying span as an index of subjective conditional volatility. An organizing framework contrasts rational expectations (Example R: subjective uncertainty equals conditional volatility, forecasts unbiased) with learning about signal quality (Example L: managers are unsure of signal precision, so unfamiliar signals raise perceived uncertainty even when true volatility is constant, and generate forecast bias).&lt;/p&gt;
&lt;p&gt;Main findings with magnitudes: (1) Subjective uncertainty reflects experienced change, in both cross section and time series, following an asymmetric V-shape in growth (steeper negative branch, flatter positive branch, minimum near zero). Mean span is 12.4 pp, larger than mean absolute forecast error of 9.0 pp; cross-firm SD of time-averaged span is 7.4 pp and within-firm time-series SD of span is 6.3 pp. Cross-sectional V: a 1 pp lower (more negative) average growth goes with about 0.6 pp higher span; a 1 pp higher positive average growth with about 0.2 pp higher span. Time-series V (firm fixed effects removed): a 1 pp lower negative quarterly growth is followed by 0.2 pp higher span next quarter; a 1 pp higher positive growth by 0.1 pp (0.118 positive, -0.204 negative branch coefficients in Table 4). (2) Uncertainty is more than conditional volatility. Volatility explains about a quarter of cross-sectional variation in uncertainty; turbulence quartile dummies alone explain 30%, with span rising from 7 pp (lowest) to 18 pp (highest quartile). But controlling for turbulence, shrinking firms remain more uncertain (bottom-trend dummy ~2 pp) and make systematically too-conservative (toward-zero) forecasts, while large firms (&amp;gt;250 employees) report ~5 pp lower span holding trend/turbulence fixed (9 pp unconditionally). In the time series, after positive growth uncertainty rises but absolute forecast errors do not — inconsistent with rational expectations (Proposition R2), consistent with learning (Example L). Within-firm forecast-error/forecast correlation is -0.27 (overreaction); larger in magnitude (-0.31 vs -0.24) for low-excess-span firms. (3) Uncertainty is mostly idiosyncratic (time/industry fixed effects give R-squared ~1%, rising to ~5-7% with time-industry effects) yet matters for plans: a one-SD rise in span raises the probability of planned employment decrease by 2.4 pp (vs 4.2 pp for a one-SD forecast decline; baseline ~11%), raises planned price decreases by 0.9 pp and lowers planned price increases by 0.8 pp. Because employment (a quantity) and prices move the same direction, uncertainty acts like a negative demand shifter / &amp;ldquo;pessimism,&amp;rdquo; not a freezer of actions.&lt;/p&gt;
&lt;p&gt;Implications: Understanding subjective uncertainty requires going beyond rational-expectations models where uncertainty equals conditional volatility; learning is a promising alternative even for mature firms (median age 45 years). Decoupling of uncertainty from volatility matters for welfare and policy evaluation (misallocation, optimal policy under idiosyncratic risk).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core measurement strategy, and why is span a valid index of subjective uncertainty?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The ifo module elicits best- and worst-case sales-growth scenarios; span (best minus worst) is the uncertainty measure, and the separate point forecast (answer 2b) is the subjective conditional mean. The authors model managers who think through a finite number n of scenarios to minimize expected quadratic loss based on distance from the closest scenario. Proposition 1 shows that if growth g = mu + sigma*epsilon belongs to a location-scale family, optimal span is linear in sigma (independent of mu), so span is proportional to subjective conditional standard deviation. Quadratic cost is a second-order approximation to general loss, making the link broad. Span is also robust/low-cognitive-load: it depends only on adjacent scenarios&amp;rsquo; first-order conditions, so it is insensitive to interior reshaping or tail-shape changes managers cannot confidently distinguish.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for distinguishing uncertainty from conditional volatility, and what are the threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on contrasting two observable implications. Under rational expectations (Example R), a cross-sectional uncertainty V must be accompanied by a cross-sectional volatility V in mean absolute forecast errors (Proposition R1), and a time-series uncertainty V must coincide with a &amp;lsquo;conditional-volatility V&amp;rsquo; in absolute forecast errors (Proposition R2). Under learning (Example L), uncertainty can move with growth while debiased forecast-error volatility does not (Proposition L2). The authors test these by comparing span responses to forecast-error responses. The main threat is that span is only an index of subjective volatility (level not identified), so for the negative branch — where both uncertainty and volatility rise — they cannot fully rule out that higher uncertainty merely reflects higher conditional volatility. They argue against this because the implied span-to-volatility ratio (up to 4 in Table 4) would far exceed the roughly one-for-one cross-sectional relationship for most firms. For positive growth, the absence of any forecast-error response makes the rational-expectations explanation clean to reject.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two competing mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mechanism 1 (Example R, rational expectations): subjective uncertainty equals true conditional volatility, driven by heteroskedastic fundamentals; forecasts are unbiased. Mechanism 2 (Example L, learning about signal precision): growth is homoskedastic but managers observe a noisy signal of unknown information content gamma; using a Normal-Gamma prior with confidence parameter nu, an unfamiliar signal (far from prior mean, either sign) leads managers to infer lower precision and remain more uncertain, and generates forecast bias toward zero. Distinguishing tests: (a) cross section — shrinking firms are more uncertain AND biased holding volatility fixed (supports learning, Proposition L1b); large firms are less uncertain but unbiased (supports a confidence/nu channel, L1c); (b) time series — after positive growth, uncertainty rises but absolute forecast errors do not (rejects R2, supports L2); (c) the within-firm negative correlation between forecast and forecast error (-0.27) indicates overreaction from overprecision (Proposition L3). The preferred reading is a hybrid: a known volatility component generating the negative branch (R) plus a symmetric learning V (L).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three dimensions. Turbulence (time-series SD of growth): strongly raises uncertainty — top vs bottom quartile span 18 vs 7 pp, ~1.5 cross-sectional SDs, dummies explain 30%. Trend growth: asymmetric V — both fast-growing and fast-shrinking firms are more uncertain, but after controlling for turbulence only the bottom (shrinking) trend quartile retains a significant ~2 pp effect, and shrinking firms also have biased (too-conservative) forecasts, whereas fast-growing firms lose significance once volatility is controlled. Size: larger firms perceive less uncertainty — large (&amp;gt;250 employees) firms ~9 pp lower span unconditionally, ~5 pp lower controlling for trend and turbulence, but show no significant difference in average forecast errors (so the size effect is a confidence/nu channel, not bias). Time-series heteroskedasticity of span also rises with turbulence and trend and is larger for smaller firms, consistent with smaller firms having lower nu. Employment effects of uncertainty are similar across size classes (if anything slightly stronger for large firms).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Industry dummies (14 sectors) added to the cross-sectional span regression leave the turbulence/trend/size coefficients essentially unchanged and raise R-squared by only 2 pp, showing the effects are within-industry. Time and time-industry fixed effects confirm variation is overwhelmingly idiosyncratic (R-squared ~1% rising to ~5-7%). The within-firm uncertainty results are robust to requiring at least 5 span observations per firm (Table I4), as are the employment/price-plan results (Tables I6). Deseasonalization is corroborated at macro and micro level (Appendix B). Forecast-error analyses use a debiased absolute forecast error (residual from regressing forecast error on past growth and firm fixed effects) to separate volatility from bias, and a &amp;lsquo;statistical forecast error&amp;rsquo; (deviation of growth from firm mean) as an econometrician benchmark, both giving the same V/no-V patterns. Data quality is documented: ~73-86% of respondents are top management, the responder is the same person in ~98% of firms, ~80% of firms use in-house quantitative planning, and a majority rely on scenario analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on survey-based &amp;lsquo;micro uncertainty&amp;rsquo; work (Guiso and Parigi 1999; Bontempi et al. 2010; Bachmann, Elstner and Sims 2013). Several papers found V-shapes between subjective uncertainty and lagged sales growth (Altig et al. 2022 Atlanta Fed SBU; Bloom et al. 2020 MOPS; Kumar, Gorodnichenko and Coibion 2023 New Zealand), but those use single cross sections or short pooled samples and cannot separate cross-sectional from time-series Vs. The contribution is decomposing the V into between- and within-firm components and constructing volatility Vs to contrast against the uncertainty Vs, showing uncertainty is more than volatility. It also connects to the behavioral/miscalibration literature (Ben-David, Graham and Harvey 2013; Barrero 2022) by linking forecast bias to the gap between subjective uncertainty and conditional volatility via endogenous perceived precision. Uniquely, it studies subjective idiosyncratic uncertainty jointly with both a quantity (employment) and prices in normal (non-recession) times; Kumar et al. (2023) found &amp;lsquo;uncertainty as pessimism&amp;rsquo; but for a macro variable (GDP).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy and modeling implications, and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The decoupling of uncertainty from volatility matters for welfare and policy because the standard approach (regress absolute forecast errors on conditioning information and use the fitted value as uncertainty) measures &amp;rsquo;too little&amp;rsquo; uncertainty — it ignores uncertainty about features the econometrician sees only with hindsight. Heterogeneous-firm models of misallocation and optimal policy under idiosyncratic risk (e.g., Boar et al. 2025; Di Tella et al. 2025) should incorporate uncertainty distinct from volatility. Models of firm dynamics need either heteroskedastic innovations or sufficient nonlinearity, plus feedback from past growth to uncertainty (learning), and should treat idiosyncratic demand uncertainty as a driver of employment churn and price dispersion even in steady state. Scope conditions: the evidence is German manufacturing, 2013-2019, a calm idiosyncratic-shock-dominated period (so results speak to idiosyncratic, not aggregate, uncertainty); span identifies relative not absolute uncertainty; for idiosyncratic uncertainty to affect actions, firm decisions must depend on it (manager career concerns, closely-held ownership, or ambiguity/Knightian uncertainty defeating diversification). The authors note the decoupling principle extends to policy uncertainty (e.g., tariffs) even when realized paths are not volatile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the &amp;lsquo;uncertainty as a negative demand shifter&amp;rsquo; result tell us about the type of shocks managers fear?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because higher span lowers BOTH planned employment (a quantity) and planned prices in the same direction, the comovement indicates that managers primarily worry about demand shortfalls rather than cost shocks. A firm fearing a demand shortfall scales down production (sheds workers) and lowers prices; a firm fearing input-cost increases would still cut employment but RAISE prices. The observed pattern therefore points to idiosyncratic, subjective demand uncertainty as the relevant primitive, and (with financial frictions or risk/ambiguity-averse decision-makers placing more weight on low-payoff states) explains why uncertainty &amp;lsquo;acts like pessimism&amp;rsquo; rather than freezing actions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key caveats and limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Span is an index of subjective volatility, so levels and the exact span-to-volatility ratio are not point-identified, leaving residual ambiguity on the negative branch where uncertainty and volatility both rise. The sample is non-recessionary German manufacturing, so results characterize idiosyncratic (not aggregate) uncertainty; the authors explicitly note variation is essentially all idiosyncratic. The learning examples abstract from explicit dynamics (the prior is held fixed each period), serving as stark illustrations rather than a fully dynamic structural model; the data are interpreted through a hybrid of R and L. The plan outcomes are qualitative (up/down/same) and ifo does not elicit realized outcomes suitable for the authors&amp;rsquo; purposes, so the link to realized employment/prices relies on external evidence that ifo indicators forecast those variables.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Unconventional Monetary Policies and Inequality</title><link>https://macropaperwarehouse.com/papers/unconventional-monetary-policies-and-inequality/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/unconventional-monetary-policies-and-inequality/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether the Federal Reserve&amp;rsquo;s unconventional monetary policies (UMP) — specifically quantitative easing (QE) and forward guidance — exacerbated income and welfare inequality in the United States during the effective lower bound (ELB) episode following the Great Recession (2009–2015). The question is empirically and theoretically contested: QE raises profits and equity prices, benefiting wealthy households who hold most equity, while simultaneously reducing unemployment, which benefits poorer households who rely almost entirely on labor income. Resolving the net effect requires a unified framework that captures both channels simultaneously, with empirically realistic responses of profits, wages, and unemployment to monetary policy.&lt;/p&gt;
&lt;p&gt;The paper builds a medium-scale Heterogeneous Agent New Keynesian (HANK) model that incorporates: (i) a two-asset structure (liquid deposits and illiquid equity) with portfolio adjustment costs; (ii) three working statuses — employed, unemployed, and business owner — with endogenous job-finding rates determined by a search-and-matching labor market; (iii) a banking sector modeled after Gertler and Karadi (2011), with a moral-hazard leverage constraint; (iv) a substantial fixed cost in production that, combined with wage rigidity, generates procyclical profit responses to monetary policy shocks — a feature absent from standard New Keynesian models and critical for capturing benefits to wealthy households; and (v) an occasionally binding ELB constraint with QE modeled as central bank asset purchases and forward guidance modeled as exogenous expected ELB durations following Jones (2017). The model is calibrated to match the 2007 Survey of Consumer Finances (SCF), targeting the top decile&amp;rsquo;s share of wealth (~70%), income composition across wealth groups, and standard labor market and financial sector moments. Remaining parameters are estimated using Bayesian methods on U.S. quarterly data from 1992 Q1 to 2018 Q4, using ten observables (output, consumption, investment, inflation, nominal interest rate, real wage, unemployment, lump-sum transfers, profits, and Federal Reserve assets), with the ELB regime handled via an inversion filter and the Kulish-Jones method for exogenous ELB durations.&lt;/p&gt;
&lt;p&gt;At the posterior mode, the model attributes the Great Recession primarily to a series of large negative risk premium shocks around 2008–2009, causing investment to fall by more than 20% relative to the pre-crisis level. The central counterfactual compares the actual ELB episode (with UMP) against a scenario where the central bank held its balance sheet constant and allowed ELB durations to be determined endogenously by fundamentals. Between 2009 and 2015, UMP on average produced: a 3.3% increase in profits, a 0.9% increase in equity prices, a 1.5 percentage-point reduction in the unemployment rate, and only a 0.1% increase in real wages (reflecting high estimated wage rigidity). Output and investment were higher by approximately 1% and 3% respectively on average, with profits rising as much as 8% during the ELB episode.&lt;/p&gt;
&lt;p&gt;These aggregate effects translated into non-linear distributional outcomes. For the Gini index, lower unemployment reduced the income Gini by up to 0.6 percentage points, but this was offset by about 80% by the increase in profits and equity prices — leaving only a marginal net Gini reduction of 0.04 percentage points on average. When computed for the bottom 90% alone, the Gini reduction was more pronounced because that group relies overwhelmingly on labor income. However, the income share of the top 10% rose by an average of 0.17 percentage points, driven mainly by higher profits and equity prices. Thus the answer to whether UMP raised inequality is measure-dependent: UMP reduced within-bottom-90% inequality while widening the top-decile income gap.&lt;/p&gt;
&lt;p&gt;Welfare gains (consumption equivalents over the ELB episode) were U-shaped across the wealth distribution: the average gain was 0.27% of lifetime consumption, but households at both extremes gained more than the middle. The bottom 10% benefited from higher job-finding rates (gaining ~0.3%), the top 10% from profits and equity prices (also ~0.3%), and the top 1% gained ~0.33%. The middle 60% gained only ~0.26%. By working status, business owners gained the most (0.82%), followed by the unemployed (0.35%) and the employed (0.27%).&lt;/p&gt;
&lt;p&gt;Decomposing UMP into QE and forward guidance, the paper finds that forward guidance accounted for approximately 55% of total UMP stimulus. Forward guidance amplified both the aggregate and distributional effects of asset purchases: QE alone raised the top 10% income share by about 0.1 percentage point, and forward guidance added a further 0.09 percentage point increase. Forward guidance lowered the overall Gini by about 0.05 percentage points more than QE alone around 2013, and reduced the bottom-90% Gini by an additional 0.2 percentage points during the same period. The interaction intensified what the paper calls a &amp;ldquo;hollowing out&amp;rdquo; of the middle class: forward guidance further reduced middle-60% income shares while leaving bottom-10% shares nearly unchanged, because the additional stimulus disproportionately raised profits and equity prices (by about 2% and 1%, respectively, between 2011 and 2014).&lt;/p&gt;
&lt;p&gt;Comparing QE with a hypothetical conventional monetary policy (CMP) that would have allowed the nominal rate to drop to approximately -1%, the paper finds that CMP would have produced larger aggregate stimulus than QE but more adverse distributional effects. Under CMP, lower financing costs disproportionately boosted bank net worth, indirectly raising profits and benefiting wealthy households even more than QE did. Under QE, central bank asset purchases crowded out private bank investment by reducing expected equity returns even as they raised equity prices, partially dampening the profitability gains to the financial sector. Consequently, CMP would have delivered above-average welfare gains only to the bottom 1% (debtors benefiting from lower real rates) and the top 10% (through larger bank profit effects), while the broad middle class would have fared no better and in some dimensions worse.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s key methodological contribution is the first Bayesian estimation of a HANK model with an occasionally binding ELB constraint. Its key substantive finding is that standard NK models, which generate countercyclical profits, systematically understate the benefits that expansionary monetary policy delivers to wealthy households, producing a misleading or incomplete picture of the distributional effects of monetary policy.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model&amp;rsquo;s identification strategy and how is the ELB period handled in estimation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is estimated with Bayesian methods using an inversion filter (following Guerrieri and Iacoviello 2017 and Cuba-Borda et al. 2019) on ten quarterly observables from 1992 Q1 to 2018 Q4. The key identification challenge is the occasionally binding ELB constraint. The paper follows Kulish et al. (2014) and Jones (2017), treating the ELB as a temporary alternative regime with exogenous expected durations. These expected durations are themselves estimated as latent variables, with priors informed by the New York Fed&amp;rsquo;s primary dealer survey. The Metropolis-Hastings algorithm is used for structural parameters (treating ELB durations as fixed in each draw), while ELB durations are drawn separately using a discrete uniform proposal density. To make estimation computationally feasible given the large idiosyncratic state space, the paper follows Bayer and Luetticke (2020) and updates only the subset of the model Jacobian corresponding to &amp;lsquo;aggregate&amp;rsquo; and &amp;lsquo;summary&amp;rsquo; equations during each iteration, leaving the &amp;lsquo;idiosyncratic&amp;rsquo; blocks fixed across estimated parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms by which UMP affects inequality and how does the model distinguish them empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper identifies four main channels: (1) Profit and equity price channel — QE raises equity prices and reduces financing costs, increasing profits and the dividend rate on illiquid assets. Because the top decile holds ~70% of total wealth overwhelmingly in the form of equity, with capital and business income accounting for ~50% of their income, this channel benefits the wealthy disproportionately. (2) Unemployment channel — lower interest rates stimulate demand and raise the job-finding rate. Because households at the bottom of the wealth distribution are more likely to be unemployed at the onset of the ELB episode (8.75% of the bottom decile vs. 6.54% in the middle quintile in 2009 Q1), this channel is progressive. (3) Wage channel — nominal and real wage rigidity (only one-fifth of the real wage adjusts to labor productivity changes) means that the wage channel is very weak; average real wages rose by only 0.1% due to UMP. (4) Inflation/redistribution channel — forward guidance generates inflationary expectations that compress real rates, redistributing from savers to debtors. The empirical decomposition is performed by first isolating QE alone (endogenizing ELB durations) and then comparing to the full UMP scenario (exogenous ELB durations), attributing the residual effect to forward guidance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the key modeling innovation regarding profits, and why does it matter for inequality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Standard New Keynesian models generate countercyclical profit responses to monetary policy shocks: when demand rises, price rigidity keeps prices sticky while factor prices (wages) adjust upward, squeezing markups and reducing profits. This contradicts empirical evidence from structural VARs, which show procyclical profits. The paper introduces three interacting features that resolve this: (a) a substantial fixed cost of production calibrated to roughly 20% of steady-state output, so that average production cost falls even as marginal cost rises, boosting net profits; (b) wage rigidity with search-and-matching frictions, so that real wages respond very weakly to monetary shocks; and (c) a banking sector with a financial accelerator, so that rising equity prices boost banks&amp;rsquo; net worth and their investment demand, further amplifying profits. Without procyclical profits, the model would understate the benefits wealthy households (whose income depends heavily on profits and equity returns) gain from expansionary monetary policy, producing an incomplete picture of distributional effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in households&amp;rsquo; balance sheets and income composition is documented, and how does it shape distributional results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using the 2007 SCF, the paper documents stark composition differences. The bottom 80% of the wealth distribution derives ~80% of income from labor, with transfer income making up most of the rest. The top 10% derives about 50% from labor and 50% from capital (equity and business income). For the top 0.1%, labor income is only 16% and capital/business income is about 83–85%. In the model, the top 10% hold about 70% of total wealth, overwhelmingly in illiquid equity. These composition differences mean that any policy raising profits and equity prices is strongly progressive at the top and neutral-to-mild at the bottom, while any policy reducing unemployment is strongly progressive at the bottom. The interplay of these two forces explains why UMP simultaneously reduces bottom-90% inequality (through the unemployment channel) and widens the top-vs.-rest gap (through the profit and equity channel), and why welfare gains are U-shaped rather than monotone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare accounting methodology and what are the key welfare findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare gains are measured as consumption equivalents — the fraction of lifetime consumption that a household in the counterfactual (no UMP) scenario would be willing to forgo to enjoy the UMP outcome. Households are sorted into wealth groups based on their 2009 Q1 wealth position (so group composition is not affected by UMP), and the same households are followed throughout the episode. Beyond the sample end (2018 Q4), no further shocks are assumed. The average welfare gain at the posterior mode is 0.27% of lifetime consumption. Bottom 10%: ~0.3% (driven by higher job-finding rates). Top 10%: ~0.3% (driven by profits and equity gains). Top 1%: ~0.33%. Middle 60%: ~0.26%. Business owners: 0.82%. The unemployed: 0.35%. The employed: 0.27%. Critically, the welfare gaps between extremes and middle are smaller than the income gaps, because anticipated tapering after the sample implies lower future profits and equity prices for wealthy households, narrowing their long-term advantage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the contributions of QE and forward guidance compare in aggregate and distributional terms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Forward guidance accounted for approximately 55% of the total UMP stimulus at the posterior mode. Exogenous expected ELB durations exceeded endogenous (fundamentals-based) durations by 1–2 quarters on average, and sometimes by up to 8 quarters, with the divergence widening from 2011 onward. In distributional terms, QE alone initially reduced the bottom-90% Gini and raised the top 10% income share by about 0.1 percentage point. Forward guidance amplified both effects: it lowered the overall Gini by an additional ~0.05 pp and the bottom-90% Gini by an additional 0.2 pp around 2013, but also added a further ~0.09 pp to the top 10% income share between 2011 and 2014. The amplification occurred because forward guidance raised profits and equity prices by about 2% and 1% respectively during that window, intensifying the income concentration at the top while also stimulating job creation at the bottom. The middle class saw its income share further compressed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does QE compare with conventional monetary policy in terms of aggregate and distributional effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the counterfactual CMP scenario, the nominal policy rate drops to approximately -1% and remains negative for an extended period. CMP produces larger aggregate stimulus than QE: the stimulus effects of QE were partly crowded out by general equilibrium effects, specifically QE reduced banks&amp;rsquo; expected return on equity even as it raised equity prices, discouraging private bank investment. Under CMP, lower nominal rates instead benefit banks through lower financing costs, boosting bank net worth via an accelerator mechanism more strongly than under QE. This difference has distributional consequences: CMP would have delivered higher welfare gains only to the bottom 1% (low-wealth debtors benefiting from lower real rates on their liabilities) and the top 10% (benefiting from larger bank profits). Households in the broad middle — already employed, holding limited equity, neither heavy borrowers nor large business income recipients — would have been no better off and in some dimensions worse off under CMP. The paper thus concludes that QE had less adverse distributional effects than CMP would have had, absent the ELB constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and sensitivity analyses are conducted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper checks results against: (a) the full 10th–90th percentile range of the posterior distribution for all key findings on aggregate effects, income inequality, welfare gains, and QE vs. CMP comparisons, showing that qualitative findings are robust to parameter uncertainty; (b) a comparison between rigid-wage and flexible-wage model variants (Table A1), showing that the flexible-wage version generates countercyclical profits, a weak unemployment response, and a strong real wage response — inconsistent with empirical SVAR evidence — validating the modeling choice of high wage rigidity; (c) a structural VAR analysis on U.S. data confirming procyclical profits, weak real wage responses, and significant unemployment responses to monetary policy shocks; (d) a comparison of the OccBin method (endogenous ELB durations, Guerrieri and Iacoviello 2015) vs. the Kulish-Jones method (exogenous durations) for solving the occasionally binding constraint; (e) a check that wages implied by the calibrated wage function always remain in the bargaining set, validating the equilibrium wage assumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key differences between this paper and the closest prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Kaplan, Moll, and Violante (2018) and Bayer et al. (2020) have two-asset HANK models but omit frictional labor markets, so they cannot capture how monetary policy affects employment and thus the progressive unemployment channel. Gornemann et al. (2016) include search-and-matching labor markets but only one asset, so they cannot capture the capital income benefits to wealthy households. Broer et al. (2019) and Auclert et al. (2023) identify the countercyclical profit problem but their solutions (wage rigidity alone) produce procyclical profits that are too weak quantitatively. This paper combines fixed costs, wage rigidity, and a banking sector to produce procyclical profits quantitatively consistent with SVAR evidence. On unconventional policy specifically, Lenza and Slacalek (2018) and Casiraghi et al. (2018) study ECB QE with partial equilibrium methods and find inequality-reducing effects; Bivens (2015) and Montecino and Epstein (2015) reach opposite conclusions for U.S. QE. This paper is the first to study both QE and forward guidance jointly in a Bayesian-estimated HANK model with an explicitly binding ELB, and is to the author&amp;rsquo;s knowledge the first to estimate a HANK model with an occasionally binding ELB constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, UMP&amp;rsquo;s inequality effects are measure-dependent: policies that simultaneously stimulate employment and profits can reduce within-bottom-90% inequality while widening the top-vs.-rest gap. Policymakers who cite Gini reductions and those who cite rising top-income shares are both correct, pointing to different parts of the distribution. Second, forward guidance amplifies inequality effects as much as it amplifies aggregate effects, so its use carries a distributional cost concentrated at the top of the distribution. Third, QE had less adverse distributional effects than conventional monetary policy would have had, suggesting that concerns about QE&amp;rsquo;s inequality effects should be placed in context of the ELB constraint — the relevant comparison is not QE vs. no policy but QE vs. CMP with the ELB absent. Fourth, models that generate countercyclical profits will systematically understate benefits to the wealthy and potentially reach qualitatively different conclusions about whether monetary policy raises or reduces inequality. These findings are scoped to the U.S. Great Recession ELB episode, estimated with the specific HANK model structure and Bayesian posterior; findings may differ for different financial structures, more generous unemployment insurance, or different asset price dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What drives the Great Recession in the model and how is UMP modeled mechanically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At the posterior mode, the Great Recession is primarily attributed to a series of large negative risk premium shocks (shocks to banks&amp;rsquo; discount factor) around 2008–2009, which caused banks to sharply contract their investment, leading to the investment collapse (&amp;gt;20% below pre-crisis). QE is modeled following Gertler and Karadi (2011): the central bank issues bonds (sold to the private sector) and uses proceeds to purchase equity directly, converting non-productive asset demand into productive capital demand and raising equity prices and investment. Forward guidance is modeled as setting exogenous expected ELB durations longer than would be implied endogenously by the Taylor rule fundamentals, effectively mimicking future negative interest rate shocks and inducing inflationary pressure via intertemporal substitution. The expected ELB durations at the posterior mode range from 6 to 8 quarters through 2013, falling sharply to 1–2 quarters by late 2014–2015.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneous Agent New Keynesian (HANK) model&lt;/strong&gt;: As used in this paper, a DSGE model where households differ ex-post in idiosyncratic productivity, asset holdings (liquid deposits and illiquid equity), and employment status; combined with search-and-matching labor markets, a banking sector with leverage constraints, and a zero lower bound on the policy rate. The heterogeneity in wealth composition and income sources determines how aggregate policy shocks translate into distributional outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procyclical profits&lt;/strong&gt;: The property, established empirically via SVAR and reproduced in the model, that firm profits rise in response to expansionary monetary policy shocks. Standard New Keynesian models generate the opposite (countercyclical profits) because price rigidity compresses markups when demand rises. In this paper, the combination of large fixed costs in production, wage rigidity, and a banking sector financial accelerator is required to generate quantitatively realistic procyclical profit responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective lower bound (ELB) episode&lt;/strong&gt;: The period from 2009 Q1 to 2015 Q4 during which the Federal Reserve&amp;rsquo;s policy rate was constrained at zero. In the model, this is treated as a temporary alternative regime with exogenous expected durations; when the policy rate hits the ELB, the central bank can only affect the economy through asset purchases (QE) and forward guidance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forward guidance (as exogenous expected ELB durations)&lt;/strong&gt;: In this paper&amp;rsquo;s framework, forward guidance is operationalized as the central bank committing to maintain the policy rate at zero for a longer period than the endogenous (fundamentals-based) Taylor rule would prescribe. This is parameterized as an exogenous expected ELB duration that exceeds the endogenous one, creating anticipations of future negative interest rate shocks and thus stimulating activity through intertemporal substitution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalent welfare gain&lt;/strong&gt;: The fraction of lifetime consumption that a household in the counterfactual scenario (no UMP) would be willing to forgo in order to instead experience the outcomes under UMP. Used to compare welfare across heterogeneous households in a cardinal, utility-based metric rather than income alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business owner working status&lt;/strong&gt;: A third working status (alongside employed and unemployed), following Bayer et al. (2019), in which households receive a fixed fraction of aggregate profits as income without supplying labor. Business owners transition into and out of this status exogenously and are the highest-income group in the model, calibrated to match the top-decile&amp;rsquo;s share of liquid assets and the income composition data showing that capital and business income dominate the very top of the wealth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inversion filter&lt;/strong&gt;: The likelihood evaluation method used in this paper for Bayesian estimation, following Guerrieri and Iacoviello (2017). Rather than running a Kalman filter, structural shocks are backed out directly by inverting the linear solution of the model given the observed data and a given set of expected ELB durations. This avoids continuously updating the large state-transition matrix and makes estimation computationally feasible.&lt;/p&gt;</description></item><item><title>Understanding High-Wage Firms: Monopoly, Monopsony, and Bargaining Power</title><link>https://macropaperwarehouse.com/papers/understanding-high-wage-firms-monopoly-monopsony-and-bargaining-power/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/understanding-high-wage-firms-monopoly-monopsony-and-bargaining-power/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Why do some firms pay persistently higher wages for observably similar workers, and what role do firms&amp;rsquo; product-market power (monopoly/markups), labor-market power (monopsony/markdowns), and workers&amp;rsquo; collective bargaining power play in shaping wages and welfare? Prior literature studies labor-market power as a driver of wages/profits but abstracts from product-market power and bargaining, while the markups literature abstracts from imperfect labor competition and bargaining. The paper unifies all three in one structural framework.&lt;/p&gt;
&lt;p&gt;Central theoretical insight: A firm&amp;rsquo;s wage equals its marginal revenue product of labor (MRPL) times a &amp;ldquo;labor wedge&amp;rdquo; (the share of MRPL workers receive). The labor wedge decomposes into three components — price-cost markups, monopsony markdowns, and bargaining power — via equation (3): Lambda = kappa*(product market rents term) + (1-kappa)*lambda. With positive bargaining power (kappa&amp;gt;0) workers capture a share of markup-generated rents, so the labor wedge rises with markups (rent-sharing); this nests pure monopsony as the kappa=0 special case.&lt;/p&gt;
&lt;p&gt;Data and setting: French administrative micro-data. Firm balance sheets (FARE, 2008-2019, DGFiP); firm-product output prices (EAP survey, 2009-2019, INSEE, manufacturing firms &amp;gt;=20 employees or sales &amp;gt;5m euros); matched employer-employee data (DADS, 1995-2018) which crucially includes hours worked. Firm wage premia estimated via a k-means/BLM grouped AKM regression (Bonhomme, Lamadon, Manresa 2019). Markups and labor wedges estimated with the production-function/production approach (De Loecker-Warzynski 2012; Yeh et al. 2022) using translog functions and an Ackerberg-Frazer-Caves control function, separating the two by noting markups distort all input demands while labor wedges distort only labor demand.&lt;/p&gt;
&lt;p&gt;Two key empirical facts a standard monopsony model cannot explain: (i) high-wage firms charge higher output prices and markups; (ii) high-wage firms pay a larger share of MRPL as wages (higher labor wedges). Both persist within narrow industries and conditional on TFP, pointing to product quality and positive bargaining power.&lt;/p&gt;
&lt;p&gt;Main quantitative findings (French manufacturing, 2016 unless noted): Median markup 1.32 (IQR 1.14-1.60). Median labor wedge 0.62 (median monopsony markdown 0.46) — the gap is due to bargaining power and markups. Workers capture about 12% of firm profits (bargaining power kappa ~ 0.12-0.14; falls to ~0.05-0.13 under IV correction). Median markdown 0.46 implies a median firm-specific labor supply elasticity of 0.85. Accounting for hours matters: median labor wedge is 0.62 with effective hours, 0.65/0.68/0.71 across specifications, rising to 0.71 when labor is measured by employment (near Yeh et al.&amp;rsquo;s 0.70-0.73 US figures) — so omitting hours upward-biases labor wedges.&lt;/p&gt;
&lt;p&gt;Quantitative GE model (oligopoly/oligopsony, nested-CES, Atkeson-Burstein/Berger et al.): A 1% productivity shock has wage passthrough 0.97-0.99 versus 0.23 for an equal quality shock (because varieties are close substitutes, sigma=5.17), though quality still generates more wage-premium dispersion. Markups and markdowns reduce welfare by 46% in consumption-equivalent terms, with markups alone accounting for over 80%; misallocation explains about 63% of the markup welfare cost. Equalizing markups raises average wages 39% and wage variance 99% and welfare 24% (output-restriction effect dominates rent-sharing, so equalizing markups raises wage dispersion). Raising bargaining power from 0.12 to 0.50 matches the wage gains of removing markups but yields only 10% welfare gain (vs 38%); full bargaining power (kappa=1) raises welfare 13%, under one-third of the planner&amp;rsquo;s 46% gain. Bargaining power offsets the uniform-tax and misallocation distortions on labor demand but cannot fix markup distortions to capital/material demand.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core identification strategy for separating markups from labor wedges, and what are its main assumptions/threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author applies the production approach: estimate translog production functions per 2-digit manufacturing sector (via two-step GMM with an Ackerberg-Frazer-Caves control function for unobserved productivity) to recover firm-specific output elasticities. Markups distort the demand for ALL inputs while labor wedges distort ONLY labor demand, so choosing materials as a flexible, price-taken input lets markups be identified from the material cost share (mu = alpha_m * PY/(Pm*M)) and labor wedges from the wage-bill-to-materials ratio scaled by elasticity ratios (eq. 4). Key assumptions/threats: materials must be a flexible input firms take prices for (examined in Appendix B.7-B.8); unobserved productivity must satisfy scalar unobservability and monotonicity in material demand; unobserved output and input prices bias elasticities — addressed using observed EAP output prices (measuring output in quantities) plus the De Loecker et al. (2016) input-price control function, and additionally controlling for firm wage premia because monopsony markdowns create unobserved labor-price variation. Markup variation driven by idiosyncratic demand uncorrelated with TFP is controlled via export status, market shares, firm age, and a 3rd-order price polynomial. Gandhi-Navarro-Rivers concerns about identifying material elasticities are addressed in Appendix B.9.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the new identification challenge for estimating bargaining power, and how is it solved?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The rent-sharing literature estimates bargaining power kappa by regressing wages on quasi-rents using instruments (export demand, patent shocks) assumed orthogonal to the worker&amp;rsquo;s reservation wage. But in this model, when kappa=0 workers earn an endogenous monopsony wage (lambda*MRPL) that moves with the SAME firm-specific shocks (productivity, quality, amenities) that shift quasi-rents — so standard instruments violate the exclusion restriction. The solution: instead of the wage equation, exploit the labor-wedge equation (3), which relates labor wedges to markups and avoids unobserved monopsony wages. Conditional on markdowns, variation in product-market rents identifies kappa (when kappa=0 product-market rents do not affect the labor wedge). This shifts the core challenge from unobserved monopsony wages to unobserved amenities (mirroring IC3 in the rent-sharing literature), handled by a theory-consistent control function in which employment and the wage bill jointly proxy for amenities under a monotonicity assumption (labor supply increasing in amenities). Under multiplicative separability of wages and amenities, markdowns do not depend directly on amenities, so unobserved amenities do not bias kappa at all.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the bargaining-power estimates across specifications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pooled OLS gives ~0.135; adding firm fixed effects ~0.124; adding the amenity control function (columns 3-4) ~0.124-0.135, indicating amenities have little direct effect on markdowns; instrumenting product-market rents with their lags to correct correlated measurement error (columns 5-6) gives 0.130 and 0.059. Baseline kappa is taken as ~0.12 (specification 4). All 2-digit sectors have kappa below 0.3. These align with the rent-sharing literature&amp;rsquo;s typical 0.05-0.15, though external innovation-based instruments tend to find ~0.30.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper measure firm wage premia and why not use standard AKM?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Standard AKM firm effects assume time-invariant firm effects and rely on worker mobility; short panels yield noisy estimates with upward-biased variance. The author needs time-varying premia (to measure effective labor over time). He uses the BLM (Bonhomme, Lamadon, Manresa 2019) k-means approach: cluster firms by the similarity of their internal wage distributions (by 2-digit sector over overlapping 2-year windows), then run an AKM-style regression with firm-GROUP effects that vary by year, identified by workers switching between firm-groups — greatly increasing the number of switchers. DADS-Postes is used for clustering (broad coverage) and DADS-Panel for the wage-premium regression.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Firm wage premia dispersion accounts for 5.2% of wage dispersion; the 90-10 premium gap is ~30% (about 4 euros/hour, 25% of the median worker&amp;rsquo;s hourly wage), IQR 15%. Markdowns increase with firm wage premia (flat gradient) but DECREASE with firm size — larger firms have more monopsony power, consistent with oligopsony models. Firm-specific labor supply elasticities are 0.54/0.85/1.33 at the 25th/50th/75th percentiles. About 7% of firms have labor wedges above 1, and these tend to have much higher markups (rationalized by kappa&amp;gt;0). In the GE model, top-decile high-wage firms are ~15% more productive but have over 100% greater product quality than bottom-decile firms; amenities rise slightly more steeply with premia than productivity. Passthrough is substantially smaller for 90th-percentile firms (0.74 productivity, 0.18 quality) than for median/10th-percentile firms (~1.06/~0.26).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the dispersion of wage premia decomposed across sources of firm heterogeneity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Introducing one source at a time into the GE model and comparing variance to baseline (Table 6): varying only product quality reproduces 161.5% of baseline variance, only TFP 153.3%, and only amenities 40.8%. Product quality is the largest single contributor to wage-premium dispersion, closely followed by productivity, then amenities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the productivity passthrough differ so much from the quality passthrough?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Total passthrough is 0.97 for a 1% productivity shock vs 0.23 for an equal quality shock (~4x). The decomposition (Table 5) attributes most of the gap to the direct effect (1.07 vs 0.26): with high within-market substitutability (sigma=5.17), consumers are very price-sensitive, so productivity (which lowers price) moves sales and labor demand far more than quality. Higher sigma raises productivity passthrough but lowers quality passthrough. For sufficiently low sigma the ranking can reverse. The variable-market-power channel also matters: higher productivity raises markups, increasing rent-sharing (+0.06 via labor wedge) but also output restriction (-0.09 via markup), with output restriction dominating; firm-size effects (sectoral price -0.10, sectoral wage +0.03) further adjust passthrough. Amenity shocks have direct effect -0.26 (mirror of quality) but total -0.28, amplified because better amenities lower hiring costs and expand the firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does worker bargaining power affect welfare, and what are the limits?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bargaining power offsets two distortions firm market power imposes on aggregate labor demand: a uniform tax (Lambda/mu, lowering labor demand proportionally) and a misallocation tax (Theta, from dispersion in wedges). There exists a kappa-bar that exactly cancels the uniform tax, and kappa-bar falls as markups rise (high markups make bargaining more effective). With full bargaining power and common markups, the markdown-driven misallocation tax is fully neutralized. BUT bargaining only acts through labor demand; markups also distort capital and material demand, which bargaining cannot fix. Quantitatively: raising kappa from 0.12 to 0.50 matches the wage gain of removing markups but yields only 10% welfare gain (vs 38%) and far less dispersion increase; full kappa=1 raises welfare 13%, under one-third of the planner&amp;rsquo;s 46% gain. So bargaining power is a partial, not full, remedy for firm market power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare accounting for markups vs markdowns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Comparing the decentralized economy to the social planner&amp;rsquo;s (Table 7, column 3): eliminating both markups and markdowns raises wage-premium dispersion 113%, average wages 303%, and welfare 46% (consumption-equivalent). Over 80% of the welfare gain comes from removing markups. Equalizing markups alone (column 4) gives 24% welfare, +39% wages, +99% wage variance, implying ~63% of the markup welfare cost is misallocation. Equalizing markdowns alone (column 5) has little welfare effect (2%), though a wide markdown level reduces welfare significantly (column 2).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and caveats does the author flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Caveats: (1) Multiplication bias — mismeasured output elasticities enter both labor wedges and product-market rents multiplicatively, mechanically biasing kappa upward (Appendix B.10); IV with lags only fixes classical, not serially-correlated, measurement error. (2) Labor adjustment costs get absorbed into the labor wedge and bias kappa; firm fixed effects do not fully fix this (Appendix B.11). (3) The markdown estimation imposes that all markdown variation reflects firm size and amenities — more general than kappa=0 approaches but restrictive in this dimension. (4) The model uses collective (not individual) bargaining and abstracts from sequential-auction wage-setting (Cahuc-Postel-Vinay-Robin); robustness to hiring-wages-only following Di Addario et al. (2020) is shown (Appendix B). (5) Worker types assumed perfect substitutes; an Appendix E two-skill extension gives similar results. (6) Empirical patterns hold without TFPQ controls (Figure D.3) and by firm size (Figure D.4).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus the labor-market-power literature (Berger et al. 2022; Lamadon et al. 2022) it adds product-market power and bargaining, showing their pure-monopsony labor wedge is a kappa=0 special case. Versus the markups/welfare literature (De Loecker et al. 2020; Edmond et al. 2023) it adds imperfect labor competition and bargaining. Versus recent integrated product+labor power models that use wage-posting and no bargaining (Kroft et al. 2024; Deb et al. 2024), it adds the rent-sharing channel where markups raise (not just lower) the labor wedge. Versus production-approach markdown estimation (Yeh et al. 2022; Mertens 2020), it shows their estimates are labor wedges (not markdowns) once kappa&amp;gt;0, and that omitting hours upward-biases them. Versus the rent-sharing literature (Card et al. 2018; Kline et al. 2019; Van Reenen 1996), it shows their instruments violate exclusion under endogenous monopsony wages and proposes the labor-wedge-equation alternative. The closest exception incorporating unions is Azkarate-Askasua and Zerecero (2025).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Strengthening worker collective bargaining power can raise welfare mainly by offsetting markup-induced distortions to labor demand and redistributing rents, but it raises between-firm wage inequality and cannot restore full efficiency because it leaves markup distortions to capital/material untouched (full kappa closes under one-third of the planner gap). The wage effects of innovation depend on whether it improves productivity or quality and on the degree of product differentiation. Scope conditions: estimates are for French manufacturing under firm-level collective bargaining institutions (firms &amp;gt;=50 employees legally bargain annually); results rely on the production-approach assumptions (flexible/price-taken materials, scalar unobservability) and on data including hours and output prices that many countries lack — researchers should interpret labor-wedge/markup moments cautiously without hours data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>University Research and the Market for Higher Education</title><link>https://macropaperwarehouse.com/papers/university-research-and-the-market-for-higher-education/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/university-research-and-the-market-for-higher-education/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper proposes that university R&amp;amp;D is determined endogenously by competition for tuition and talented students in the market for higher education, and asks why universities fund research internally with tuition despite negligible returns to patenting. Motivation: between 2000 and 2018 U.S. universities accounted for 13% of aggregate R&amp;amp;D spending and 53% of all basic-research spending, yet in 2018 over 25% of university research was internally funded (25.54% in 2018; federal government 52.97%) while between 1991 and 2018 the median university earned patent licensing revenue totaling less than 2% of its R&amp;amp;D expenditure. Internal funds therefore come essentially from tuition.&lt;/p&gt;
&lt;p&gt;Approach: (1) four stylized facts from administrative microdata (IPEDS, NSF HERD survey covering 916 universities / 99.1% of sector R&amp;amp;D, AUTM patent-licensing survey, Web of Science / Leiden bibliometrics); (2) a causal natural experiment; (3) a general-equilibrium model of the higher-education sector with heterogeneous universities choosing teaching and research, calibrated to U.S. data; and (4) policy counterfactuals.&lt;/p&gt;
&lt;p&gt;Causal evidence: the authors exploit the 1998-2003 doubling of the NIH budget (from $13.6bn to $27.1bn) using a Bartik shift-share instrument built from each university&amp;rsquo;s pre-period (1993-1997) share of federal life-science grants, regressing the change in net tuition (1993-1997 to 2004-2008) on the instrumented change in R&amp;amp;D per student, with state-clustered standard errors and state-specific trends. The benchmark estimate is that a $1.00 increase in R&amp;amp;D spending per student raises tuition by $0.15 (s.e. 0.05) — universities recoup up to 15% of R&amp;amp;D through higher tuition. Across specifications the effect ranges $0.10-$0.15; it is driven by research universities (non-liberal-arts), is statistically insignificant for liberal arts colleges, and a placebo using student-amenities spending shows no significant effect. The point estimate is about 60% larger at private non-profits than publics, but that difference is not statistically significant.&lt;/p&gt;
&lt;p&gt;Model and mechanism: education quality q = k^ωk * z̄^ωz * eT^ωe depends on intangible knowledge capital k (accumulated via research, k&amp;rsquo; = k^γk * eR^γe), peer ability z̄, and teaching spending. Universities maximize discounted education quality, funding research from tuition. Equilibrium features an endogenous college hierarchy with two-dimensional sorting by ability and family income. The research share sR rises with the steepness of the college quality-ladder Σq/Σk; when students are highly stratified or tuition rises sharply with rank, universities invest in research even if the direct contribution to teaching (ωk) is small — research persists even as ωk→0 (acting as a pure signal). Incentives fall when intangible capital is highly dispersed across colleges.&lt;/p&gt;
&lt;p&gt;Calibration matches the joint distribution of research, tuition, and student ability, plus untargeted R&amp;amp;D dispersion; simulated NIH expansion yields $0.18 per $1 in steady state and $0.11 along the transition, bracketing the empirical $0.10-$0.15.&lt;/p&gt;
&lt;p&gt;Policy findings (long-run, vs baseline): removing all need-based federal tuition subsidies cuts university research by 8.1% (replacing progressive with revenue-neutral flat tuition subsidy: -2.2%); progressive aid compresses revenue dispersion, steepens the quality-ladder, and raises the research share (+0.8 pp). Removing all federal research grants cuts research by 69.1% — only 6.9 pp below the government&amp;rsquo;s 76% funding share, implying crowding-out: the meritocratic grant structure concentrates funds at top schools, flattening the ladder and cutting the research share by 16.4 pp. A revenue-neutral flat research subsidy would instead raise research by 14.8%, human capital by 9.6%, and output by 11.1%.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A Bartik/shift-share IV exploiting the 1998-2003 NIH budget doubling. Each university&amp;rsquo;s change in R&amp;amp;D is instrumented by its pre-period (1993-1997) share of all federal life-science research grants. Relevance: NIH was the bulk of federal life-science funding before the shock and did not substantially change award criteria, so high-share schools received mechanically larger funding increases. Exogeneity requires that universities did not systematically invest in life-science research in the pre-period in anticipation of the expansion. The estimation is in long-differences comparing steady states; standard errors are clustered at the state level with state-specific tuition trends. Threats: the NIH expansion occurs at a common point in time, so it may correlate with other contemporaneous market changes; initially larger or higher-quality research universities might have raised tuition for reasons unrelated to R&amp;amp;D. The authors address this with group-specific time trends (public/private, pre-existing life-science status, school size, initial quality via faculty-student ratio) and pre-trend controls (1987-1992 faculty-student ratio, FTE size, life-science status). A limitation the authors acknowledge: they cannot test the effect on subsequent student ability because ability proxies are only available after the intervention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The college quality-ladder Σq/Σk (the cross-sectional elasticity of education quality with respect to intangible capital) is the sufficient statistic for research incentives. Equation (14) decomposes it into three channels: (i) the direct teaching contribution of research ωk; (ii) attracting better students, ωz × Σz̄/Σk; and (iii) charging higher tuition, ωe × ΣR/Σk. Channels (ii) and (iii) flow from competition for talented students and tuition and can dominate even when ωk is tiny. Empirically, Σz̄/Σk maps to the cross-sectional elasticity of student ability w.r.t. research (Figure 3) and ΣR/Σk to the elasticity of tuition w.r.t. research (Figure 4), so the calibration disciplines these channels with observable cross-sectional relationships.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The tuition effect is concentrated in research universities (non-liberal-arts), with a larger, highly significant point estimate; for liberal arts colleges the NIH shock has no statistically significant effect on tuition (the authors caution the LAC sample is smaller — ~32% of institutions, ~24% of FTE — and more heterogeneous, so power may be insufficient). The effect appears ~60% stronger at private non-profits than publics, but the difference is not statistically significant. Across the model, top schools and bottom schools both invest less in research when intangible capital is highly dispersed (top schools face weak incentives to improve already-secure rank; bottom schools find climbing too costly).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Empirically: adding pre-trend controls (column 3) leaves estimates intact; splitting by NLA vs LAC; and a placebo replacing R&amp;amp;D with student-services (amenities) spending, which yields no significant effect, rejecting spurious cross-category correlation. In the model: (1) the limiting case ωk→0 where research is a pure signal — the research share falls from 8.8% to 2.4% of tuition but stays strictly positive, and policy effects retain 50% (tuition-subsidy removal: -0.4 pp vs -0.8) and 66% (research-subsidy removal: +10.8 vs +16.4 pp) of their magnitude; (2) allowing some teaching expenditure to also enter intangible-capital production (γT&amp;gt;0), where the research share falls from 8.8% to 4.7% and policy effects moderate (-0.4 pp and +7.1 pp). In both, existing tuition policies still boost research and federal research grants still crowd it out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on equilibrium higher-education models — Epple, Romano &amp;amp; Sieg (2006) (quality maximization, exogenous endowment hierarchy, finite universities with market power) and Cai &amp;amp; Heathcote (2022) (competitive, constant-returns technology) — but endogenizes university R&amp;amp;D alongside teaching. A theoretical contribution is proving existence of a unique dynamic equilibrium with quality maximization and an endogenous college-quality hierarchy with a continuum of colleges; Cai &amp;amp; Heathcote argued no quality-maximization equilibrium exists when colleges are ex-ante identical (all want to be at the top), which this paper resolves via the endogenous knowledge hierarchy. It contributes to the economics of science / university-R&amp;amp;D literature by adding market-driven incentives, and to the basic-research-subsidy literature (Akcigit et al.) by showing universities have private incentives to do basic research, implying the need for government subsidy may be smaller than the standard Nelson/Arrow/Rosenberg view holds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two main implications. First, a novel complementarity between equity and innovation: progressive need-based tuition aid compresses revenue dispersion across colleges, makes them more similar, steepens the quality-ladder, and raises research (+8.1% relative to a no-subsidy world; flat subsidy gives only ~one-quarter of that, +2.2%). Second, current meritocratic federal research grants partially crowd out internal research and raise educational inequality by concentrating resources at top schools; removing them cuts research by 69.1% (only 6.9 pp below the 76% federal share, the gap being the crowding-out). A revenue-neutral flat research subsidy would raise research by 14.8%, human capital 9.6%, and output 11.1%, eliminating the equity-innovation trade-off because it lowers research cost without altering market structure. Scope conditions: these are long-run steady-state comparisons in a calibrated model of 4-year public and private non-profit U.S. institutions; magnitudes depend on the hard-to-measure ωk and on the research-technology specification, as the robustness exercises show.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why do universities fund research from tuition rather than patents, and does the model rationalize it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because patent licensing is too small (median &amp;lt;2% of R&amp;amp;D, 1991-2018) to fund the &amp;gt;25% of R&amp;amp;D that is internal, and unrestricted operating funds are composed almost entirely of tuition (much of it from unrecovered facilities-and-administration costs on sponsored projects — roughly $7bn in 2018). The model rationalizes diverting tuition to research because research raises education quality and thus students&amp;rsquo; willingness to pay, so in a competitive sector students accept it. The model also replicates the joint pattern that higher-R&amp;amp;D universities are higher-ranked, attract wealthier and abler students, and charge higher tuition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the sources of inefficiency in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two. First, borrowing constraints prevent efficient sorting of students by ability (a social planner would send the ablest to the best colleges, but students are limited by parental capacity to pay). Second, university knowledge has positive spillovers to the real economy (calibrated ιk = 0.1) that colleges do not internalize, causing under-investment; however, quality-maximizing colleges face extra competitive incentives to do research, so net under- or over-investment is ambiguous and depends on stratification relative to spillover strength.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;College quality-ladder (Σq/Σk)&lt;/strong&gt;: The equilibrium cross-sectional elasticity of education quality with respect to a university&amp;rsquo;s intangible knowledge capital — a sufficient statistic for a university&amp;rsquo;s private incentive to invest in research. Steeper ladder (more stratification, tuition rising more with rank) means stronger research incentives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intangible (knowledge) capital k&lt;/strong&gt;: Institution-specific intangible capital accumulated by investing in research (k&amp;rsquo; = k^γk eR^γe). It is primarily frontier knowledge and ideas exposed to students, but also networks, recruiting, labs, and methods; it can act purely as a reputation signal in the limiting case ωk→0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Research share (sR)&lt;/strong&gt;: The share of a university&amp;rsquo;s tuition revenue allocated to research in equilibrium (≈8.8% under existing policies). It increases with college forward-lookingness (βc) and the steepness of the quality-ladder, and decreases with the dispersion of intangible capital across colleges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowding-out of internal research&lt;/strong&gt;: In the paper&amp;rsquo;s sense, the phenomenon whereby federal grants, by concentrating funds at top schools, raise the dispersion of research (Σk), flatten the quality-ladder (Σq/Σk), lower the research share, and thereby reduce universities&amp;rsquo; internal research spending — so total research rises less than the government&amp;rsquo;s funding share (69.1% decline vs 76% share on removal).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equity-innovation complementarity&lt;/strong&gt;: The model&amp;rsquo;s finding that progressive need-based tuition aid, by compressing revenue dispersion and making colleges more similar, steepens competition and raises university research — so equity-promoting policy also boosts basic research, rather than trading off against it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Education-innovation gap (ωk calibration)&lt;/strong&gt;: Biasi &amp;amp; Ma&amp;rsquo;s (2021) measure of how frontier-current a university&amp;rsquo;s curriculum is, interpreted in the model as log(k). A one-unit decrease is associated with a 0.011% rise in graduate income; normalized by its school-level standard deviation of 0.85, it is used to pin down ωk via ωk·α = .011/.85·Σk.&lt;/p&gt;</description></item><item><title>Wage Adjustment in Efficient Long-Term Employment Relationships</title><link>https://macropaperwarehouse.com/papers/wage-adjustment-in-efficient-long-term-employment-relationships/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/wage-adjustment-in-efficient-long-term-employment-relationships/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a tractable theoretical model of wage dynamics in long-term employment relationships, situated between two polar extremes in the existing literature: continual Nash renegotiation (Mortensen and Pissarides 1994) and wage adjustment only when participation constraints bind (MacLeod and Malcomson 1993). The central motivation is that neither polar extreme matches well-documented empirical facts about wage adjustment — wages are adjusted neither continuously nor as rarely as participation constraints alone would imply.&lt;/p&gt;
&lt;p&gt;The model&amp;rsquo;s key ingredients are: (1) match-specific productivity that evolves as a geometric Brownian motion, generating persistent idiosyncratic shocks; (2) on-the-job search, whereby employed workers receive outside job offers at rate s*lambda; and (3) renegotiation costs modeled as breakdown probabilities (Delta_W for workers, Delta_F for firms) that apply whenever a party unilaterally initiates a renegotiation. These breakdown risks create a wedge between what each party can guarantee by threatening to renegotiate and the full Nash share, thereby generating inaction regions within which the wage remains unchanged. When either party&amp;rsquo;s surplus falls to the boundary of this inaction region, wage adjustment occurs by mutual consent at zero cost, keeping separations bilaterally efficient. The result is a &amp;ldquo;drunken walk&amp;rdquo; for wages: constant most of the time, adjusting minimally when productivity shocks or outside job offers drive the system to the boundary.&lt;/p&gt;
&lt;p&gt;An analytical general solution for firm and worker surpluses is derived — a methodological innovation, since prior work with persistent idiosyncratic shocks has required numerical methods.&lt;/p&gt;
&lt;p&gt;The model is calibrated at monthly frequency to: a 5% annual real interest rate; a 1% per month exogenous separation rate (from Farber 1999); a 6% steady-state unemployment rate; a 2.5% per month employer-to-employer (E-to-E) transition rate (from Fujita, Moscarini, and Postel-Vinay 2021); a standard deviation of annual log base wage changes among job stayers of 0.053; and an incidence of total compensation (base plus bonus) freezes of 17% (both from Grigsby et al. 2021). Worker bargaining power is set to beta=0.2, which delivers a wage pass-through elasticity of 0.22 (in range of Lamadon et al. 2022 and Kline et al. 2019), hiring costs of 1.4 months of wages (consistent with Oi 1962 and subsequent work), and a base pay share of compensation of 97% at the median (matching Grigsby et al. 2021). The breakdown probability calibrates to Delta=0.33 for both workers and firms.&lt;/p&gt;
&lt;p&gt;Key quantitative findings:&lt;/p&gt;
&lt;p&gt;First, the calibrated model generates a hump-shaped separation hazard peaking at just over 0.08 at around 3 to 5 months of tenure and declining thereafter, closely matching Farber (1999) — a nontargeted moment. Cumulative wage growth after 10 years of tenure is approximately 15%, lying between Topel&amp;rsquo;s (1991) estimate of over 25% and Altonji and Williams&amp;rsquo; (2005) estimate of 11%.&lt;/p&gt;
&lt;p&gt;Second, the model-implied distribution of annual base wage changes among job stayers features over 30% with zero change, substantially more wage increases than cuts, and limited downward flexibility — all key features documented in microdata (Altonji and Devereux 2000; Grigsby et al. 2021). The distribution of total compensation (base plus bonus) is far more symmetric and has lower incidence of freezes (targeted at 17%), consistent with Grigsby et al.&amp;rsquo;s finding that bonus pay drives most compensation flexibility. The sequential auctions special case (without renegotiation costs) greatly overstates pay freezes, underscoring that renegotiation costs are the mechanism generating empirically realistic intermediate wage adjustment.&lt;/p&gt;
&lt;p&gt;Third, the model delivers a near-memorylessness property for hiring wages: because idiosyncratic shocks and outside job offers necessitate ex post wage adjustments that preserve bilateral efficiency, subsequent wages become independent of the initial hiring wage once the first adjustment occurs. Quantitatively, this largely negates Hall&amp;rsquo;s (2005) result that rigid hiring wages can generate substantial unemployment fluctuations: in the calibrated model with empirically realistic adjustment, the allocative effect of entry wage flexibility on labor market tightness is much smaller than in Hall&amp;rsquo;s special case.&lt;/p&gt;
&lt;p&gt;Fourth, the model provides a novel theory of recruitment and retention bonuses. Because persistent productivity shocks are best met with adjustments to the flow wage, while transitory outside offers are best met partly with lump-sum bonuses (flow wage increases are credibly capped by the firm&amp;rsquo;s inaction boundary), the model predicts non-base pay as an equilibrium outcome. Counterfactual experiments show that eliminating firms&amp;rsquo; ability to pay retention bonuses reduces total match surplus at the date of new matches by approximately 15.1% and raises the employment-to-unemployment separation rate by approximately 9.5%; eliminating both retention and recruitment bonuses raises these figures to 16.0% and 10.3%, respectively.&lt;/p&gt;
&lt;p&gt;The paper also extends the baseline model to accommodate positive inflation (nominal wages held fixed absent renegotiation), using a perturbation method due to Fleming (1971), generating a spike at zero nominal wage change that decays with inflation — consistent with the large empirical literature on nominal wage adjustment.&lt;/p&gt;
&lt;p&gt;The implication for macroeconomics is that efficient long-term relationships with realistic sporadic wage adjustment cannot be the source of cyclical unemployment volatility, pointing toward either violations of bilateral efficiency (asymmetric information, wage-cut costs) or volatile labor demand as the necessary ingredient.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is primarily theoretical and quantitative, not empirical, so it does not employ a conventional identification strategy. The model is calibrated to match a set of moments from existing microdata (Farber 1999; Fujita et al. 2021; Grigsby et al. 2021) and then evaluated on nontargeted moments such as the shape of the separation hazard by tenure. Threats to the model&amp;rsquo;s quantitative conclusions include: (a) the calibration sets beta=0.2 somewhat informally (targeted to four informal moments rather than formally estimated); (b) the baseline restricts mu=sigma^2/2 so that log match productivity is driftless, and Delta_W=Delta_F (symmetric breakdown risk) — the paper checks in the appendix that relaxing mu gives essentially unchanged main results; (c) the model abstracts from risk aversion, general human capital accumulation, and permanent firm heterogeneity, any of which could alter wage dynamics or calibrated parameter values; (d) the Grigsby et al. (2021) moments used for calibration pertain to a period of very low inflation, which the paper treats as approximately a zero-inflation environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the drunken walk and why is it called that?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The &amp;lsquo;drunken walk&amp;rsquo; is the wage path that emerges from the model. The wage remains constant whenever both parties&amp;rsquo; surpluses lie strictly within their respective inaction regions (neither party can credibly threaten to renegotiate). When idiosyncratic productivity hits the upper or lower boundary of the inaction set, the wage adjusts minimally upward (to restore the worker&amp;rsquo;s surplus to the threshold) or minimally downward (to restore the firm&amp;rsquo;s surplus to the threshold). The path therefore wanders irregularly, making small adjustments only when forced to by the boundaries, analogously to a drunken walk — a term echoing the dynamic contracting literature (Thomas and Worrall 1988), where the same path arises from insurance motives rather than renegotiation costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper characterize the surplus analytically and why is this novel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key innovation is that bilateral efficiency decouples the total match surplus (determined as an optimal stopping problem) from the division of that surplus between firm and worker. Total surplus S(x) is characterized analytically as a function of match productivity x alone, solving an ODE with boundary conditions (value-matching and smooth-pasting at the separation threshold). Given S(x), the firm surplus J(w,x) and worker surplus V(w,x) satisfy ordinary differential equations (not PDEs) for any fixed wage w, because wages change only at boundaries. This reduces the wage determination problem to one of iterating over constants rather than functions, allowing analytical general solutions (Propositions 2, 3, 4) that prior work with persistent idiosyncratic shocks could not obtain, requiring numerical methods instead (Yamaguchi 2010; Lise et al. 2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two special cases studied and what do they reveal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The costly renegotiation case (s=0, no on-the-job search) isolates adjustment driven purely by idiosyncratic productivity shocks and breakdown risk. In this case, the wage adjustment boundaries simplify to an upper bound from the worker&amp;rsquo;s threat and a lower bound from the firm&amp;rsquo;s threat; there is a fundamental asymmetry in that workers cannot credibly threaten a wage increase in the face of complete breakdown risk (Delta_W=1), since they receive no outside offers. The sequential auctions case (beta=0, Delta_F=1, on-the-job search only) recovers and extends Postel-Vinay and Robin (2002) to persistent productivity shocks with analytical solutions. In this case, wage adjustment is one-sided in a surprising direction: wage increases are triggered by reductions in match productivity, because lower productivity reduces the recruitment compensation that a worker could extract if an outside offer arrived, lowering her match value and necessitating a raise. This case greatly overstates pay freezes relative to data, confirming that renegotiation costs are essential to match empirical wage adjustment frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the memorylessness property and what are its implications for Hall (2005)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The memorylessness property states that, conditional on the occurrence of a wage adjustment, the subsequent path of wages is independent of the initial hiring wage. Once the wage is adjusted, the history is &amp;lsquo;forgotten.&amp;rsquo; This arises because ex post wage adjustments are determined solely by contemporaneous productivity and the bilateral efficiency requirement, not by the history of wages up to that point. The implication for Hall (2005) is that the allocative effect of hiring wage rigidity on unemployment fluctuations — which rests on the hiring wage having an indefinite legacy (no adjustment ever needed in Hall&amp;rsquo;s special case of zero idiosyncratic shocks, zero on-the-job search, and full breakdown risk) — is largely negated once realistic wage adjustment is introduced. The decomposition in equation (27) shows that the entry wage effect on firm surplus and labor market tightness is much smaller in the baseline calibration than in Hall&amp;rsquo;s special case, and that general equilibrium effects (firms anticipating future wage adjustments in booms) further moderate volatility. This dovetails with the empirical literature initiated by Beaudry and DiNardo (1991) finding that economic conditions at the start of a job have little explanatory power for current wages once one controls for the history of conditions since job start.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the model&amp;rsquo;s theory of recruitment and retention bonuses and why does it matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bonuses arise from the asymmetry between the type of shocks and the type of compensation instrument best suited to absorb them. When match productivity changes persistently, adjusting the flow wage is efficient; but when an outside offer arrives temporarily, the value delivered to retain a worker cannot always be committed credibly via flow wages — the firm can only raise the base wage up to the threshold at which the firm would immediately trigger another renegotiation to cut it back. Any remaining value above that threshold must be delivered as a lump-sum retention bonus. Analogously, when recruiting a worker from another firm, the new employer has an upper bound on the flow wage it can credibly offer; remaining value goes to a recruitment bonus. This provides an endogenous theory of non-base pay. The allocative stakes are large: eliminating retention bonuses reduces match surplus at new matches by 15.1% and raises the E-to-U separation rate by 9.5%; eliminating both retention and recruitment bonuses raises these figures to 16.0% and 10.3%. Even though bonuses are transitory and account for only a small share of overall compensation (the base pay share is 97% at the median in the calibration), they are allocatively important — the paper calls this an instance of the general principle that marginal variation can be allocatively consequential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented or analyzed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main model is deliberately parsimonious and abstracts from worker and firm heterogeneity. However, the paper notes that the model can accommodate permanent worker type differences in efficiency units: if x, b, and vacancy costs all scale with efficiency units, the log wage change distribution is identical across worker types while the initial wage scales proportionally. The paper also analyzes two sources of heterogeneity in wage outcomes that emerge endogenously: variation in wage change incidence with match tenure (separation hazard that is hump-shaped in tenure) and variation in base-wage versus total-compensation changes (base wages change less frequently and are more asymmetric than total compensation). The appendix contains an extended model allowing general drift mu, encompassing specific human capital accumulation, with results described as essentially unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are performed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Key robustness exercises include: (1) The appendix provides the extended model with general mu (not restricted to mu=sigma^2/2), encompassing specific human capital accumulation; main results are stated to be essentially unchanged. (2) Recalibrated versions of the two special cases (s=0 for costly renegotiation; Delta_F=1 and beta=0 for sequential auctions) are examined separately to understand which mechanism drives empirical fit. (3) An alternative special case with Delta_W=Delta_F=1 and beta&amp;gt;0 is confirmed to generate a similarly counterfactual share of pay freezes (~75%), reinforcing that wage-adjustment-only-at-participation-constraints is empirically rejected. (4) The inflation extension in Section 3 uses an approximate analytical solution (Taylor expansion to first order in pi) following Fleming (1971) to show the model generates sensible nominal wage change distributions and a decaying zero-spike with inflation. (5) Proposition 2 result (ii) establishing the expected duration of wage spells provides an internal consistency check linking the allocative effects of wages to their duration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;MacLeod and Malcomson (1993) is the closest theoretical predecessor: it studies renegotiation by mutual consent with efficient long-term relationships and generates a drunken walk. This paper extends it by adding idiosyncratic productivity shocks and on-the-job search and making the model quantitative with analytically tractable solutions, moving beyond MacLeod-Malcomson&amp;rsquo;s polar case (Delta=1). Postel-Vinay and Turon (2010) study a similar environment to the sequential auctions special case but with i.i.d. productivity shocks, requiring numerical methods; this paper obtains analytical solutions even with persistent shocks. Postel-Vinay and Robin (2002) and Cahuc et al. (2006) are nested as special cases. Hall (2005) is nested and shown to be quantitatively non-generic: its result on hiring wages and unemployment fluctuations relies on special-case assumptions that are empirically rejected. Gertler and Trigari (2009) achieve large unemployment fluctuations via time-dependent staggered wage adjustment; this paper studies state-dependent adjustment and finds the opposite result. Grigsby et al. (2021) provide the key calibration moments on the incidence of pay changes; the paper replicates their finding that total compensation is more flexible than base pay and provides a theoretical interpretation. Balke and Lamadon (2022) study long-term contracts with directed search but without wage inaction, which is a central object here. Dupraz et al. (2022) model wage rigidities that generate inefficient separations; this paper instead maintains bilateral efficiency and generates wage rigidity endogenously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central policy-relevant conclusion is that, within a model of efficient long-term relationships with realistic sporadic wage adjustment, hiring wage flexibility (or rigidity) is much less consequential for unemployment fluctuations than Hall (2005) suggested. This implies that policies aimed at wage flexibility at the point of hiring are unlikely to substantially moderate unemployment fluctuations if the broader employment relationship is bilaterally efficient. The model instead points to wage-cut costs, asymmetric information, or impediments to matching outside offers as the necessary ingredients for hiring-wage stickiness to matter for unemployment. The allocative importance of non-base pay (retention and recruitment bonuses) suggests that regulations or institutional arrangements that restrict bonus pay could meaningfully retard match formation and raise separations, even when bonuses appear small as a share of total compensation. The scope conditions are bilateral efficiency, risk neutrality, and the absence of aggregate shocks (the paper focuses on idiosyncratic shocks in a stationary equilibrium, with only a perturbation analysis for aggregate shocks in the allocation-of-entry-wages section).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the user cost of labor framework reveal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section 1.6 extends the user cost of labor concept of Kudlyak (2014) — the shadow flow price of labor in long-term relationships — to this environment. The user cost in this model contains components absent from simple Diamond-Mortensen-Pissarides: turnover costs due to on-the-job search (proportional to the firm surplus of a new match, contributing sλ*J(w0,x0)), and the value of future productivity drift and variance (which act as a source of moderation of user cost). The key message is that idiosyncratic shocks and on-the-job search diminish the importance of the initial wage in the firm&amp;rsquo;s effective flow cost of labor, because future wage adjustments are anticipated. This provides a flow-based interpretation of the memorylessness property and complements the work of Doniger (2021) and Bils et al. (2023) on quality-adjusted labor costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does inflation affect wage adjustment in the extended model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the extension (Section 3), the nominal wage is held fixed absent renegotiation, so the real wage drifts downward at the inflation rate pi. This creates an additional source of value to the firm (and loss to the worker), valued at -pi&lt;em&gt;w&lt;/em&gt;J_w. Because J_w&amp;lt;0 (higher wages reduce firm surplus), inflation raises firm value and consequently shifts the adjustment boundaries inward: for a given productivity, firms are less likely to demand nominal wage cuts and workers are more likely to demand nominal wage increases. The zero-change spike in the distribution of nominal wage changes decays as inflation rises, a well-established empirical feature. The analytical solution uses a first-order Taylor expansion in pi (following Fleming 1971), which the authors note may also be extendable to approximate solutions for aggregate shocks.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Drunken walk (wage dynamics)&lt;/strong&gt;: The equilibrium wage path in the model: wages remain constant for extended periods and adjust minimally — only enough to prevent a unilateral renegotiation — when idiosyncratic productivity shocks or outside job offers drive firm or worker surplus to the boundary of their respective inaction sets. The name reflects the irregular, boundary-regulated wandering of wages over time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Renegotiation costs (breakdown risk)&lt;/strong&gt;: The cost of unilaterally initiating a wage renegotiation, modeled as a probability Delta_W (Delta_F) that the match breaks down if the worker (firm) forces a renegotiation. These costs generate inaction regions in which neither party can credibly threaten a unilateral renegotiation, so the wage remains unchanged. They are the key parameter governing the frequency of equilibrium wage adjustment, nesting both continual bargaining (Delta=0) and adjustment only at participation constraints (Delta=1) as polar cases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inaction set&lt;/strong&gt;: For any current wage w, the set of match productivities x within which neither the firm nor the worker can credibly issue a unilateral threat to renegotiate. The wage remains constant when productivity lies in the interior of both parties&amp;rsquo; inaction sets. The boundaries of these sets are the thresholds x_W(w) and x_F(w) at which wage adjustments are triggered by mutual consent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Memorylessness (of hiring wages)&lt;/strong&gt;: The property that, once a wage adjustment occurs, the subsequent path of wages is independent of the initial hiring wage. This arises because ex post adjustments are determined solely by contemporaneous productivity and the bilateral efficiency requirement. As a result, the legacy of any hiring wage is truncated to the duration of the first wage spell, negating the allocative importance of hiring wage rigidity for unemployment fluctuations in Hall&amp;rsquo;s (2005) sense.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Recruitment and retention bonuses&lt;/strong&gt;: Lump-sum payments made by the current or prospective employer when an employed worker receives an outside job offer, in situations where the value to be delivered to retain or recruit the worker exceeds what can credibly be committed via increases to the flow base wage (which face a ceiling imposed by the firm&amp;rsquo;s inaction boundary). The model predicts these bonuses as an equilibrium outcome of bilateral efficiency, arising from the asymmetry between persistent productivity shocks (best absorbed by flow wage changes) and transitory outside offers (partially absorbed by lump-sum bonuses).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bilateral efficiency (in long-term employment relationships)&lt;/strong&gt;: The property that firm and worker jointly maximize total match surplus, so that separations occur if and only if total surplus is exhausted, and wages are set to preserve this condition. In this paper, bilateral efficiency is preserved on the equilibrium path because costless mutual-consent wage adjustments preempt costly unilateral renegotiations. The term is used specifically for bilateral efficiency of individual relationships (not equilibrium efficiency of aggregate allocations).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;User cost of labor&lt;/strong&gt;: The shadow flow price of labor in a long-term employment relationship, extending Kudlyak (2014) and the Jorgenson (1963) capital user cost concept to this environment. It equals flow output at a new match and consists of the flow wage plus flow-equivalent discounting and separation costs, minus the capital gains from anticipated future wage adjustments induced by productivity drift, variance, and on-the-job search. Idiosyncratic shocks and on-the-job search reduce the importance of the initial wage in this user cost, providing a flow-based expression of the memorylessness property.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage pass-through elasticity&lt;/strong&gt;: The elasticity of the equilibrium wage with respect to a change in match-specific productivity — the log change in wages induced by a one log-point rise in match productivity. In the calibrated model this equals 0.22, reflecting that efficient renegotiation shares only part of idiosyncratic productivity gains with the worker (bounded by the worker&amp;rsquo;s bargaining power beta=0.2 and the renegotiation cost structure). This is the model&amp;rsquo;s analogue to empirical rent-sharing elasticities in Lamadon et al. (2022) and Kline et al. (2019).&lt;/p&gt;</description></item><item><title>Warming with Borders: Forced Climate Migration and Carbon Pricing</title><link>https://macropaperwarehouse.com/papers/warming-with-borders-forced-climate-migration-and-carbon-pricing/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/warming-with-borders-forced-climate-migration-and-carbon-pricing/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper asks how the threat of forced climate migration — international displacement driven by climate-induced natural disasters — should alter optimal carbon taxation. The motivation is twofold. First, climate change is intensifying natural disasters that disproportionately afflict developing nations, generating large cross-border population flows that existing integrated assessment models (IAMs) ignore. Second, migration and climate policy are simultaneously among the most contested political issues, yet their interaction has received almost no joint economic analysis.&lt;/p&gt;
&lt;p&gt;The paper proceeds in two stages. First, it documents empirically that natural disasters cause international migration. Using a global annual panel (165 countries, 1980–2013) from EM-DAT and UN migration flow tables, the paper estimates a fixed-effects regression of log-migration flows from developing (origin) to developed (host) countries on disaster frequency, controlling for GDP per capita and population. The key coefficient implies a semi-elasticity of approximately 2.3%: a unit increase in natural-disaster occurrence is associated with a 2.3% rise in migration to host regions. To link disaster frequency to carbon concentrations, a time-series cointegration analysis yields an elasticity of 13.49 for climatological and hydrological disasters (6.74 when meteorological disasters are added), implying an overall elasticity of climate refugees to CO2 concentrations of 11.87 (5.93 with meteorological events).&lt;/p&gt;
&lt;p&gt;Second, these empirical estimates calibrate a quantitative multi-region integrated assessment model (IAM) in which energy-related emissions generate two externalities simultaneously: output damage through temperature, and population reallocation from origin to host regions. The model features a North–South structure (Kyoto Annex I countries as host; rest of world as origin), Cobb-Douglas production with capital, labor, and energy (coal-proxy), region-specific climate damage parameters drawn from Hassler et al. (2019), and a climate module following Golosov et al. (2014). Social welfare in host regions can optionally include a direct disutility from immigration (parameterized using data on European Pay-to-Go programs and the 2016 EU–Turkey Agreement). The model is simulated over 300 years starting from 2015, with 10-year periods.&lt;/p&gt;
&lt;p&gt;The paper then analytically characterizes and quantitatively estimates optimal carbon prices under three policy regimes: (1) unilateral host-only action, (2) globally cooperative (first-best), and (3) a Nash equilibrium with all regions active.&lt;/p&gt;
&lt;p&gt;The central quantitative finding is an asymmetry across policy regimes. Under unilateral host-region action, accounting for forced climate migration raises the optimal carbon price by approximately 22% (from $44.72 to $54.73 per ton of carbon when calibrated to climatological and hydrological disasters only; to $49.77, an 11% increase, when meteorological events are included). The dominant mechanism is the &amp;ldquo;Labor Effect&amp;rdquo;: migrants move without capital and dilute per capita income in host regions because environmental resources and capital are finite, making the negative welfare consequences exceed the positive labor-supply benefit under a Cobb-Douglas technology with climate damages. The social cost of immigration (disutility of anti-immigration sentiment) adds only marginally to the carbon price ($54.99 vs. $54.73 per ton under the Pay-to-Go calibration). When border control is modeled explicitly, a planner facing US-calibrated deportation costs ($4.6 × 10^5 per immigrant) prefers tightening the carbon tax over using border control, validating the main finding. Only when border control is costless does the optimal strategy switch to low carbon taxes and restricted immigration.&lt;/p&gt;
&lt;p&gt;In contrast, the globally optimal SCC is nearly unchanged by forced climate migration ($118.62 without FCM vs. $123.03 with FCM), because the Global Labor Effect balances out: costs of population growth in the host are offset by the adaptation benefit of relocating people to less climate-vulnerable areas. Under Nash equilibrium, host SCCs rise modestly ($44.72 to $49.89 under C&amp;amp;H disasters), while origin SCCs fall slightly ($73.81 to $72.51) as migrants, once relocated, face lower climate damages. The welfare cost to host-region natives from applying the no-FCM policy when FCM is in fact present amounts to a 0.193% permanent consumption equivalent.&lt;/p&gt;
&lt;p&gt;Policy implication: in the absence of a global climate agreement (the prevalent situation), developed countries have substantially stronger unilateral incentives to price carbon than existing IAMs suggest, because they indirectly bear the economic costs of climate-induced immigration. The global SCC, however, is not materially affected, so the case for international coordination rests on the same foundation as before.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical strategy exploits the quasi-random timing of natural disasters within an origin country using a two-way fixed-effects (country and year) panel regression. The dependent variable is the log of annual unilateral migration flows from each origin country to the pooled group of host countries (43 OECD-type destinations). The independent variable is the frequency (or log frequency) of climate-related natural disasters in the origin country in the same year. Country fixed effects absorb time-invariant push/pull factors; year fixed effects absorb common global shocks. Main threats discussed: (1) Endogeneity of contemporaneous GDP and population, addressed by using first lags of controls. (2) Reporting bias in EM-DAT (disasters in early years may be under-recorded), addressed by computing the ratio of warming-related to geophysical disasters (reporting bias should be type-orthogonal) and by restricting to large disasters (&amp;gt;=1,000 affected or &amp;gt;=100 deaths). (3) The paper focuses exclusively on the contemporaneous (same-year) migration response, treating lagged effects as lower bounds. (4) The semi-elasticity estimates are used as calibration inputs, not as causal estimates of structural parameters — the author acknowledges the causal chain from concentrations to disasters is not fully established.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the four theoretical components of the unilateral host SCC and how do they combine?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The unilateral host SCC (equation 12) is the sum of: (1) Standard Output Damages — the present discounted value of climate damage to final output, the only component in standard IAMs; (2) Emissions Reallocation — the reduction in origin-region emissions as migrants move to the host, which lowers global concentrations and benefits the host, making this component negative (it reduces the carbon price); (3) Immigration Social Cost — the direct disutility of newly arrived immigrants borne by host natives (parameterized by gamma), which adds to the carbon price when gamma &amp;gt; 0; and (4) Labor Effect — the net welfare consequence of a larger host labor force, which comprises a positive externality (higher output) and a negative externality (dilution of per capita consumption due to finite environmental resources and capital). Under Cobb-Douglas production with climate damages and capital (Result 1), the net Labor Effect is always a negative externality that raises the carbon price. In the quantitative exercise, the Labor Effect dominates all other FCM-related components and accounts for essentially the entire 22% increase in the unilateral SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the global SCC remain nearly unchanged when forced climate migration is included?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The global planner internalizes the welfare of both host and origin regions. The &amp;lsquo;Global Labor Effect&amp;rsquo; contains two offsetting terms: costs to host natives from capital dilution and per capita income reduction, and benefits to origin-region emigrants who move to a less climate-vulnerable, more economically developed area. These effects largely cancel. In addition, migration reallocates economic activity away from high-damage origin regions, lowering expected global climate damages. Migration costs calibrated to equalize consumption per capita across regions (absent climate change) prevent the global planner from strategically using pollution to trigger welfare-improving migration. Quantitatively, the global SCC rises only slightly, from $118.62 to $123.03 per ton of carbon (less than 4%), and may even fall after roughly four decades as the adaptation benefit grows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the social cost of immigration (anti-immigrant sentiment) parameterized and calibrated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The parameter gamma represents the marginal social cost of immigration to native households — their willingness to pay to prevent a marginal unit of immigration. Two calibration approaches are used: (A) Pay-to-Go programs: using data on European Assisted Voluntary Return programs in 2015, the paper derives gamma = 7.1 × 10^3 (in terms of final good per billion migrants). (B) EU-Turkey Agreement: using costs from the 2016 deal managing the Syrian refugee influx, the paper derives gamma = 7.3 × 10^3. The similarity of the two estimates provides cross-validation. The baseline quantitative exercise disables this feature (gamma = 0), treating it as a sensitivity; a UK Brexit-era survey value implies a four-fold increase in the unilateral SCC but is judged unrepresentative of permanent preferences. The paper is explicit that these are positive descriptions of political preferences, not normative endorsements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in the migration response is documented empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three dimensions of heterogeneity are explored: (1) Income: Unlike for slow-onset climate migration (where middle-income countries drive the response), poorer countries show a stronger migration response to disasters (positive and significant interaction between disaster frequency and a poor-country dummy, column 4 of Table B.1). This is interpreted as evidence that migration costs are less binding when disaster severity forces departure. (2) Disaster type: Climatological and hydrological disasters have higher and statistically significant migration-response coefficients than meteorological disasters (Table B.5). This differential is why the paper presents results under two calibrations (C&amp;amp;H disasters vs. C&amp;amp;H&amp;amp;M disasters). (3) Disaster severity: Restricting to large disasters (&amp;gt;=1,000 affected or &amp;gt;=100 deaths) yields an even larger migration response (column 5 of Table B.1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run on the empirical results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper runs an extensive set of checks reported in Online Appendix B: (1) Zero-inflated negative binomial (ZINB) model to handle zeros in the dependent variable. (2) Bilateral migration flows with origin-destination fixed effects. (3) Three-year non-overlapping windows (to reduce zero mass in independent variable), which more than doubles the estimated coefficients. (4) Per capita migration as the dependent variable. (5) Disaster frequency weighted by share of affected population. (6) Inverse hyperbolic sine (IHS) transformation. (7) Excluding China and India. (8) Excluding Singapore and South Korea. (9) Controlling for conflict (battle-related deaths). (10) Controlling for a climate vulnerability index. (11) Controlling for the second lag of disasters. (12) Polynomial regression to check for acceleration. (13) Poisson specification. (14) Checking that an upward trend in disaster ratios relative to geophysical events is not attributable to reporting bias. Results are consistent across all specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the Nash equilibrium result, and how does it differ from both the unilateral and first-best settings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the Nash equilibrium, each region implements its own best-response carbon policy. Host regions&amp;rsquo; NE SCC resembles the unilateral SCC (Section 4) except that the &amp;lsquo;Emissions Reallocation&amp;rsquo; component drops out, because when all regions are strategically active, the host cannot treat origin emissions as exogenously reduced by migration. Quantitatively, host NE SCC rises from $44.72 (no FCM) to $49.89 (with FCM, C&amp;amp;H disasters) — a roughly 11.5% increase. Origin region NE SCC falls slightly from $73.81 to $72.51, because origin planners care about the welfare of their emigrants who now live in lower-damage host regions. Without FCM, the origin SCC is 1.6 times higher than the host SCC (reflecting greater vulnerability and larger population in origin). With FCM, this gap narrows. The NE global SCC is lower than the first-best because each region only partially internalizes the global externality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the border control extension interact with the optimal carbon tax?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the host planner can choose both a carbon tax and a border control stringency (share of migrants admitted), the optimal carbon tax with FCM is lower than in the no-border-control case, because restricting migration inflows reduces both the Labor Effect cost and the Immigration Social Cost. At the same time, restricting inflows reduces the Emissions Reallocation benefit. In equilibrium, the marginal cost of deportation equals the net benefit of keeping an additional immigrant out. Quantitatively, when border control costs are calibrated to US Department of Homeland Security data ($4.6 × 10^5 per detained immigrant), the carbon tax remains essentially equal to the no-border-control case and migration inflows are also nearly unchanged — the planner finds it optimal to abate emissions rather than pay deportation costs. Only when border control is costless does the planner switch to a low carbon tax and high migration restriction. This sensitivity analysis validates the main finding under realistic border enforcement costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to, and differ from, Cruz and Rossi-Hansberg (2024)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cruz and Rossi-Hansberg (2024) use a highly spatially disaggregated model with endogenous migration to quantify welfare costs of climate change under an exogenous global carbon tax. The key differences are: (1) This paper derives optimal carbon taxes — both globally and regionally — rather than taking them as exogenous. (2) This paper provides closed-form analytical characterizations of the SCC under multiple policy regimes, enabling clear decomposition of mechanisms. (3) Migration in this paper is exclusively &amp;lsquo;forced&amp;rsquo; (disaster-driven), not microfounded by economic incentives (though Appendix F relaxes this); Cruz and Rossi-Hansberg treat migration as fully endogenous to economic conditions. (4) This paper explicitly analyzes strategic interactions (Nash equilibrium) between regions. (5) This paper can account for anti-immigration sentiment (gamma) and border control policies. The approaches are thus complementary: Cruz and Rossi-Hansberg offer richer spatial geography and fully endogenous migration; this paper offers analytical tractability and policy-regime analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The principal implication is that developed countries (host regions) have approximately 22% stronger unilateral incentives to impose a carbon tax than existing IAMs indicate, once climate-induced international displacement is accounted for. This result holds under climatological and hydrological disasters calibration and US-level border enforcement costs; it is smaller (~11%) when meteorological events are added and even smaller when border control is assumed freely available. The global SCC is barely affected, so the normative case for a global agreement is not strengthened or weakened in magnitude, but the analytical structure of the globally optimal tax is qualitatively different. Scope conditions: the model abstracts from internal migration, micro-founded voluntary migration, endogenous TFP growth, and capital mobility across regions. Results are robust to Stern discounting, more catastrophic damage functions, and Negishi weights. The welfare cost of ignoring FCM in policy design is modest in magnitude (0.193% consumption equivalent) but positive and policy-relevant as a systematic downward bias in host-country incentives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the microfounded migration extension show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Online Appendix F relaxes the forced-migration-only assumption by introducing economically motivated migration: individuals in the origin choose migration based on consumption differentials across regions, subject to migration costs calibrated to eliminate non-climate migration at steady state. The host unilateral SCC rises to $79.52 per ton of carbon under microfounded migration, compared to $54.73 under forced-only climate migration and $44.72 with no migration (Table F.1). This indicates the 22% increase in the main analysis is a lower bound: broader climate-related migration (including voluntary economic responses to climate shocks) would generate even larger incentives for host regions to tighten carbon pricing. However, this extension sacrifices analytical tractability and closed-form solutions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the welfare cost of ignoring FCM?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 6 reports the welfare cost of applying the sub-optimal &amp;rsquo;no FCM&amp;rsquo; carbon tax to a world in which FCM is actually occurring. The cost is measured as the percentage increase in consumption in every period that would be needed to make host-region natives as well-off as they would be under the correctly calibrated FCM-inclusive policy. Without immigration disutility, the cost is 0.193%. With the Pay-to-Go disutility calibration, it is 0.195%. These figures are small but positive and increasing in the social cost of immigration. They represent the aggregate efficiency loss to host-region natives from the systematic underestimation of the unilateral SCC in existing IAMs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the migration–concentrations link empirically constructed for model calibration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses an elasticity decomposition: the elasticity of climate refugees to CO2 concentrations is the product of two elasticities. The first — the elasticity of migration to disaster frequency — is estimated from the panel regression and equals 0.88 after pooling countries into two regions. The second — the elasticity of disaster frequency to carbon concentrations — is estimated from a time-series cointegration analysis following Thomas and Lopez (2015), yielding 13.49 for climatological and hydrological disasters alone and 6.74 when meteorological events are included. The product gives overall elasticities of 11.87 and 5.93 respectively. These are then used to calibrate the linear migration function B (the flow of migrants per unit change in carbon concentrations), using historical average concentration increases, average migration flows relative to host population, and the elasticities. B = 5.03 × 10^-5 (C&amp;amp;H disasters) or 2.52 × 10^-5 (C&amp;amp;H&amp;amp;M disasters).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Forced Climate Migration (FCM)&lt;/strong&gt;: In the paper&amp;rsquo;s usage, the specific subset of climate migrants who are forced to move internationally because of climate change-induced natural disasters (rapid-onset events such as floods, storms, and heatwaves), as distinct from voluntary economic migration or migration driven by slow-onset climate variables such as temperature trends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Carbon (SCC)&lt;/strong&gt;: The monetary value of the present and future economic damage caused by a marginal one-unit increase in carbon emissions today, which under the Pigouvian framework equals the optimal carbon tax. The paper distinguishes three variants: the unilateral host-region SCC, the globally optimal (first-best) SCC, and the Nash-equilibrium SCCs for host and origin regions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor Effect&lt;/strong&gt;: A novel component of the unilateral SCC in the model, capturing the net welfare consequence of a larger host-region labor force due to FCM. It contains a positive sub-term (higher labor raises output) and a negative sub-term (capital dilution and reduction in per capita consumption because environmental goods are finite). Under Cobb-Douglas production with climate damages and capital, the net Labor Effect is always negative (raises the carbon price), as shown in Result 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Emissions Reallocation&lt;/strong&gt;: The reduction in origin-region emissions that mechanically follows when population — and therefore emission-generating activity — moves from the high-emission-intensity origin region to the host region. This component enters the unilateral SCC with a negative sign (it reduces the carbon price), because the host planner benefits from lower global concentrations induced by fewer emitters in the origin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Immigration&lt;/strong&gt;: The direct disutility experienced by host-country natives from the arrival of immigrants in the current period, parameterized by gamma, representing the native household&amp;rsquo;s marginal willingness to pay to prevent an additional unit of immigration. It is calibrated using data on European Pay-to-Go programs and the EU–Turkey Agreement. It adds to both the unilateral and Nash-equilibrium host SCCs, but quantitatively contributes only a small increment above the Labor Effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;North-South Calibration&lt;/strong&gt;: The paper&amp;rsquo;s two-region parameterization in which &amp;lsquo;host&amp;rsquo; corresponds to Kyoto Annex I countries (most European nations, the United States, Canada, Australia, New Zealand) and &amp;lsquo;origin&amp;rsquo; corresponds to the rest of the world. Host regions have higher GDP per capita, lower climate vulnerability parameters (theta), and higher emissions per capita; origin regions are more exposed to climate damages and more densely populated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash Equilibrium (non-cooperative) SCC&lt;/strong&gt;: The carbon price chosen by a local planner as the best response to other regions&amp;rsquo; optimal strategies, without the Emissions Reallocation component (since other regions&amp;rsquo; emissions are now also strategically set). In this setting, host SCCs rise relative to the no-FCM benchmark but less than under unilateral action; origin SCCs fall slightly because origin planners account for the welfare of emigrants residing in host regions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Integrated Assessment Model (IAM) with FCM&lt;/strong&gt;: The paper&amp;rsquo;s quantitative framework that combines a neoclassical multi-region growth model, a climate module following GHKT (Golosov et al. 2014), region-specific damage functions, and an endogenous migration flow driven by carbon concentrations. The model is solved by direct optimization over savings rates and energy-labor shares, simulated for 300 years, with each period representing 10 years.&lt;/p&gt;</description></item><item><title>Who Buys High and Sells Low: Trading against Expected Returns and Wealth Inequality</title><link>https://macropaperwarehouse.com/papers/who-buys-high-and-sells-low-trading-against-expected-returns-and-wealth-inequality/</link><pubDate>Thu, 01 Jan 2026 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/who-buys-high-and-sells-low-trading-against-expected-returns-and-wealth-inequality/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Wealth in the US is far more concentrated than income, even among the bottom 99%. In 2013, the next-49% (above the bottom 50%) earned 4.7 times the income of the bottom 50% but held 6.5 times the net worth (SCF 2013). Since housing is most Americans&amp;rsquo; primary vehicle of wealth accumulation, differences in housing returns could amplify wealth gaps. Prior work studied heterogeneity in risk-taking in housing; this paper instead studies the timing (mistiming) of housing trades: do some households consistently &amp;ldquo;buy high and sell low&amp;rdquo; relative to EXPECTED asset returns, and what does that do to portfolio returns and wealth inequality? Theory is ambiguous: pro-cyclical credit supply (Mian-Sufi, Rajan) predicts poorer, credit-constrained households buy more in booms (when expected returns are low); extrapolative expectations (Barberis et al., Kaplan-Mitman-Violante) predict richer, less-constrained households buy more in booms. So it is an open empirical question.&lt;/p&gt;
&lt;p&gt;Data and method: The author builds a novel annual balanced panel of real-estate ownership from CoreLogic (formerly DataQuick) assessor file (a 2012-2013 cross section, ~104 million records, ~94% of US population) plus transaction-deed records, working backwards from 2012-2013 to assign owners by year (owner on Dec 31). Owners&amp;rsquo; wealth/permanent-income is imputed from surnames: household wage income averaged at the surname level in the 1940 full-count Census (the latest full Census and first to ask income) is a strong predictor of those surnames&amp;rsquo; 2012-2013 wealth (Henry de Frahan and Sakong 2023). Surname population counts and racial shares come from the 2000 Census tabulations (in 2000, 151,671 surnames with 100+ people, covering 242M of 282M people = 85.8%). Two samples: a &amp;ldquo;long&amp;rdquo; sample 1988-2013 (148 counties, 674 jurisdictions, 11 states, ~21-25% of US population) and a &amp;ldquo;wide&amp;rdquo; sample 1998-2013 (36 states, &amp;gt;60% of US population). Expected asset returns are estimated following Cochrane (2011) by regressing one-year-ahead realized housing returns on the log rent-to-price ratio (rents from BLS owner-equivalent rent or imputed from IRS local income; house prices from CoreLogic HPI, with Case-Shiller and FHFA for robustness), at aggregate, CBSA, county and zip-code levels, using common or area-specific (heterogeneous) coefficients. The key estimand is the covariance between (residualized) log housing quantity held by a wealth group and the log expected asset return — the &amp;ldquo;active&amp;rdquo; timing component, decomposed via a lognormal first-order approximation (Calvet-Campbell-Sodini-style passive/active split). Specifications include group, time, and group-time-trend fixed effects to isolate cyclical-frequency timing from long-run trends and new construction.&lt;/p&gt;
&lt;p&gt;Main findings (with magnitudes): (1) Over 1988-2013, lower-wealth (lower 1940-income-percentile) surnames consistently held more housing pro-cyclically — buying when expected returns were low and selling when high. Portfolio expected returns from active trades are increasing in wealth (decreasing in pro-cyclicality), especially pronounced for the bottom 20% of the 1940 income distribution. (2) Using more disaggregated expected returns raises the estimated gradient almost monotonically: the coefficient on surname 1940 income percentile rises from 0.089 bp (aggregate) to 0.180 bp per percentile (zip code, heterogeneous coefficients, wide sample — the preferred specification). Aggregate returns bias the estimate downward toward zero. (3) The gradient is larger where expected-return volatility is higher: a one-standard-deviation higher expected-return volatility roughly doubles the wealth gradient (Table 3a, zip codes); meanwhile the extent of buy-high-sell-low behavior itself is statistically unrelated to volatility (Table 3b, near zero). (4) The positive overall return-on-wealth slope is driven by BETWEEN-race differences (non-White groups own housing highly pro-cyclically, consistent with Kermani-Wong); WITHIN race, portfolio expected returns are slightly DECREASING in wealth. (5) Quantitatively, projecting 1940 income percentiles onto the 2013 wealth distribution (via average home value and a housing Engel curve from the 2013 SCF), a 10% rise in net-worth percentile is associated with ~13 bp higher annual portfolio expected return; across the interquartile range this is a 65-basis-point per year differential — about two-thirds of the ~1% total realized-return spread Fagereng et al. (2020) find for financial wealth in Norway, here from timing alone. (6) A back-of-the-envelope calculation (APC out of labor income cy≈0.25 from PSID, wealth-to-labor-income ratio W/Y≈10 from SCF) implies the 65 bp differential raises the wealth share ~9% above the income share, accounting for roughly 20% (a fifth) of residual wealth concentration above income concentration across the interquartile range. Implication: time-series volatility of housing markets widens wealth inequality beyond income inequality; dynamic trade timing, not just average returns or asset heterogeneity, matters for wealth levels.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core conceptual distinction the paper insists on, and why does it use expected rather than realized returns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper measures &amp;lsquo;buying high and selling low&amp;rsquo; as the negative co-movement between the QUANTITY of an asset held and the EXPECTED asset return on it — not realized returns on completed trades. Three reasons: (1) Over a finite period some households get lucky/unlucky on unpredictable realized returns, but those wash out over the long run; only co-movement with the PREDICTABLE (expected) component survives to affect long-run wealth accumulation. (2) Expected returns are imputed as a log-linear function of the local rent-to-price ratio, observable at local levels, rather than realized returns on a specific property. (3) It computes returns on the whole stock of housing owned, not only traded units, because non-traders earning 0% realized return must be averaged in for wealth-inequality purposes. Example given: from 2007, aggregate housing had a realized return of -8% (-20% vs the 12% time-series average) but a +8% one-year expected return (-4% vs average); the paper focuses on the -4% expected, not the -20% realized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/measurement strategy and what are the main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on (a) imputing owner wealth from surname-level 1940 Census average wage income, validated against 2000 Census zip-code incomes (Table 1: strong, expected correlations, e.g., owner-occupant 1940 log wage loads ~1.6-1.8 on Census median income; investment-home owners&amp;rsquo; residence income loads positively even controlling for property-site income), and (b) estimating the covariance of residualized log quantity held with log expected asset returns at cyclical frequency, with group, time, and group-specific-trend fixed effects (equations 7-8) to strip out level differences, differential new construction, and long-run population/inequality/homeownership trends. Threats: surname-level estimates require additional assumptions to map to family-level behavior (handled via Henry de Frahan and Sakong 2023 framework; the author deliberately avoids 2010s surname income/consumption to prevent reverse causality with 1988-2013 trading); the samples are not nationally representative (more urban, larger boom-busts); expected returns are imprecisely estimated for short local time series; and new construction cyclicality could confound who-owns-when (argued orthogonal because the outcome is the portfolio expected-return differential — even if poorer residents buy new units in booms, they are acquiring risky assets when expected returns are low).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two competing theoretical mechanisms, and does the paper claim to distinguish which one operates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mechanism A: pro-cyclical credit supply (market- or government-driven, Rajan 2011; Mian-Sufi 2009) relaxes constraints in booms, so credit-constrained POORER households buy/own more housing in booms (when expected returns are low). Mechanism B: extrapolative expectations (Barberis et al. 2015; Kaplan-Mitman-Violante 2017) make booms coincide with optimism, and RICHER, less-constrained households are better positioned to add exposure, so they own more in booms. The two give opposite cross-sectional predictions. The paper emphasizes that its quantification of the wealth-inequality impact does NOT depend on WHICH mechanism drives the pattern or why households buy high — it measures the covariance regardless. Empirically it finds the poorer-buy-in-booms pattern dominates, consistent with the credit-supply channel, but does not structurally separate the mechanisms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three dimensions. (1) Geographic volatility: areas with more volatile expected returns (California, Florida prominently) show steeper wealth gradients in portfolio expected returns; one SD higher volatility roughly doubles the gradient (Table 3a). (2) Time period: the positive wealth slope holds both pre-subprime (1988-2002) and during the boom-bust, but is larger during the more-volatile subprime boom-bust. (3) Race: the overall positive slope of portfolio expected return on wealth is driven by BETWEEN-race variation — non-White groups own housing highly pro-cyclically (consistent with Kermani-Wong 2021, who attribute lower Black realized returns largely to foreclosures) — while WITHIN-race the gradient is slightly decreasing in wealth. The bottom 20% of the 1940 income distribution shows the most pronounced pro-cyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Quantity units: results robust to using number of properties (baseline), number of bedrooms, or square footage. Price indices: aggregate results similar using CoreLogic HPI, Case-Shiller, and FHFA (Table 2a columns: 0.080, 0.063, 0.057 bp). Samples: long (1988-2013) vs wide (1998-2013) give similar aggregate estimates. Rent source: BLS owner-equivalent rent vs IRS-income-imputed rents both yield strong predictability and similar gradients. Estimation of expected returns: common vs heterogeneous (area-specific) prediction coefficients both work, with heterogeneous generally larger. Validation of surname-wealth mapping via three sets of Census 2000 regressions (Table 1). Geographic disaggregation robustness (aggregate to CBSA to county to zip) shows monotone increase, and restricting to CBSA counties with BLS rent for apples-to-apples comparison (Online Appendix Table OA.3a) preserves results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It complements contemporaneous work on heterogeneity in REALIZED portfolio returns along income/race (Goldsmith-Pinkham-Shue 2020; Xavier 2021; Kermani-Wong 2021; Martinez-Toledano 2022; Wolff 2022) and the wealth-returns literature finding returns increasing in wealth (Bach-Calvet-Sodini in Sweden; Fagereng et al. in Norway; Garbinti-Goupille-Lebret-Piketty in France; Kuhn-Rios-Rull, Wolff in US). It differs by focusing on EXPECTED returns and the TIMING (covariance) channel rather than realized returns or asset heterogeneity, and by isolating the active-trade timing component on the whole housing stock. Its 65 bp interquartile differential from timing alone is ~two-thirds of Fagereng et al.&amp;rsquo;s ~1% total realized financial-return differential, highlighting that timing matters even absent asset heterogeneity. It also relates to cyclical homeownership-by-demographic literature (Goodman-Mayer 2018; Mabille 2023).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy/theoretical implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Implication: because expected housing returns are time-varying and predictable, and lower-wealth households trade against them, trade timing widens wealth inequality beyond income inequality — and areas/periods with more volatile housing markets amplify this. Dynamic, asset-price-driven mechanisms (not just average returns) matter for wealth LEVELS, not merely their cyclicality. Scope conditions: the result requires expected returns to be genuinely time-varying and predictable (if EtR were constant, the covariance term vanishes); the lognormal approximation requires positive asset quantities (holds for housing, would fail for risk-free borrowing); the quantification depends on cy≈0.25 (PSID), W/Y≈10 (SCF), and the housing Engel-curve projection; samples are urban-skewed and not nationally representative; and the cross-sectional volatility-inequality prediction is only suggestively, not rigorously, tested (data limits on local wealth inequality).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the formal decomposition (Propositions 2-3) deliver?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 2 decomposes long-run average wealth return into (i) a participation term — the product of differences in average asset shares times expected returns (the focus of the risky-participation literature) — and (ii) a covariance term between asset shares and expected returns (this paper&amp;rsquo;s focus). The covariance term is nonzero only if expected returns are time-varying and asset shares vary across households. Proposition 3 splits the share-return covariance into a &amp;lsquo;passive&amp;rsquo; part (price changes mechanically move shares opposite to expected returns) and an &amp;lsquo;active&amp;rsquo; part (deliberate quantity adjustment), via a first-order lognormal approximation; a sufficiently contrarian active change can flip the covariance positive. The paper targets the active component, equation (4): E(mu) times cov(residual log quantity, log expected return).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key caveats the author flags?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Estimates are fundamentally at the surname level; family/household interpretation needs extra assumptions. (2) Expected returns are noisily estimated, especially locally with short series; heterogeneous coefficients add error but allow meaningful heterogeneity. (3) The wealth-inequality quantification is explicitly &amp;lsquo;back-of-the-envelope&amp;rsquo; and depends on approximations (APC, W/Y ratio, Engel curve, household-vs-surname extrapolation assumption). (4) During the subprime boom-bust, realized returns were far more volatile than rent-to-price-predicted expected returns (Online Appendix Fig OA.1), so the expected-return measure deliberately understates realized volatility. (5) Aggregate expected returns bias the gradient toward zero, so even the preferred zip-code estimate is likely a lower bound if returns are heterogeneous at finer-than-zip levels. (6) Samples cover urban areas with larger boom-busts and are not US-representative.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>A Model of Post-2008 Monetary Policy</title><link>https://macropaperwarehouse.com/papers/a-model-of-post-2008-monetary-policy/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-model-of-post-2008-monetary-policy/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Since 2008 the US economy has gone through two zero-lower-bound (ZLB) episodes (Dec 2008–Dec 2015 and Mar 2020–Mar 2022). Standard New Keynesian (NK) and monetarist models struggle with three broad facts about US inflation during these episodes, emphasized by Cochrane (2018): (1) no significant deflation, (2) little inflation volatility, and (3) no significant inflation following large quantitative-easing (QE) balance-sheet expansions. A fourth challenge is that money-market rates (federal funds, T-bills) were often below the interest rate on reserves (IOR rate), which many read as evidence of full satiation of reserve demand — undercutting any model relying on a monetary friction. Diba and Loisel build a model that can qualitatively account for all four facts and then draw out implications for policy normalization and the operational framework (floor system).&lt;/p&gt;
&lt;p&gt;Model setup: They add banks and bank reserves to the basic NK model. Monopolistically competitive firms must borrow a fraction phi in (0,1] of their nominal wage bill from banks before producing (a cost channel); calibration uses phi=1. Households contain production workers and bankers; bankers produce real loans using their own labor and real reserves via a production function homogeneous of degree d in (0,1], so holding reserves reduces banking (labor) costs — i.e., reserves carry a convenience yield. The central bank sets TWO instruments directly: the IOR rate (I^m) and the nominal stock of reserves (M). A ZLB on the net IOR rate arises because non-interest vault cash is a perfect substitute for reserves. Calvo price rigidity (theta) is assumed.&lt;/p&gt;
&lt;p&gt;Key analytical results: Under a permanent IOR-rate peg with an exogenous (or QE-rule) money supply, the model delivers a UNIQUE steady state and local-equilibrium determinacy, provided 1 &amp;lt;= I^m &amp;lt; I = 1/beta. Setting the IOR rate pins down real reserve demand, and given the exogenous nominal stock this pins down the price level; steady-state inflation equals the money growth rate. This rules out the Benhabib-Schmitt-Grohe-Uribe deflationary equilibria. The log-linearized model yields an IS equation, a modified Phillips curve (output enters net of real reserves, with delta_m and slope kappa depending on banking-cost cross-derivatives), and a reserves-demand equation. The characteristic roots satisfy 0 &amp;lt; rho &amp;lt; 1 &amp;lt; omega_1 &amp;lt; omega_2, so anticipated shocks decay exponentially with horizon — the opposite of the basic NK model (where 0&amp;lt;omega_1&amp;lt;1&amp;lt;omega_2 makes effects grow exponentially with ZLB duration). Hence deflation converges to a finite value kappa·z*/[beta·sigma·(omega_1-1)(omega_2-1)] rather than exploding, explaining no severe deflation and low inflation volatility. (In the basic NK model under their calibration, deflation reaches about 21% per year for an expected ZLB duration of two years.)&lt;/p&gt;
&lt;p&gt;QE simulations (calibrated to US data, November 2010, start of QE2): Calibration: sigma=1 (log utility), eta=1 (unit Frisch), alpha=0.67, epsilon=6, theta=0.67, phi=1, net IOR rate = 25 bps p.a., benchmark net shadow-rate-minus-IOR spread (I - I^m) = 10 bps p.a. (alternatives 5 and 20 bps), beta=0.999 quarterly, reserves/loans ratio m/ell = 1/9, loan rate I^ell-1 = 3.25% p.a.; derived ical=0.0039, V_b=0.019. Two conditions make QE nearly non-inflationary: demand close to satiation (I^m close to I, Gamma_m near 0) and the expansion perceived as temporary. Results (Figure 1, 5-year expected duration): a single QE2 expansion ($1T to $1.6T over 3 quarters) lowers the I_t - I^m_t spread from 10 to 6.2 bps and raises annualized inflation by only 18 bps on impact. Double/triple/quadruple QE2 lower the spread to 4.5/3.5/2.9 bps and raise inflation by only 27/32/35 bps — strongly decreasing returns to QE. With a 5-bps steady-state spread the single-QE2 impact falls to 9 bps; with 20 bps it rises to 37 bps (inflation impact moves roughly one-for-one with the spread). Inflation impact scales roughly one-for-one with expected duration: single QE2 raises inflation 18 bps (5 yrs), 40 bps (10 yrs), 84 bps (20 yrs); up to 32xQE2 reaches 48/104/212 bps for 5/10/20 yrs (Table 1). The calibration makes omega_1 = 1.0003 (very close to 1) and omega_2 = 1.42.&lt;/p&gt;
&lt;p&gt;Implications: A permanent reserve expansion would be fully inflationary (proportional long-run price rise) unless accompanied by a rise in money demand (e.g., a higher IOR rate). The 2021-22 inflation surge may partly reflect expansions coming to be seen as permanent plus adverse supply shocks raising the shadow rate I via a Fisher effect. Forward guidance about expansion duration is a powerful inflation-control tool. An extension with liquid government bonds reconciles non-satiation with T-bill rates below the IOR rate without changing any inflation implications. Normalization (IOR hikes and balance-sheet contraction) is always deflationary — no Neo-Fisherian effect. Under a floor system, determinacy holds for any non-negative IOR response to inflation (Taylor principle not required) and for a wide range of output responses (threshold 15.7 on the output coefficient under their calibration).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core modeling innovation relative to the basic New Keynesian model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They introduce banks and bank reserves with a convenience yield: holding reserves reduces banks&amp;rsquo; labor cost of making loans (banker production function f^b homogeneous of degree d in (0,1] in banker labor and reserves), and firms must prepay a fraction phi of their wage bill via bank loans (a cost channel). Crucially the central bank sets BOTH the IOR rate and the nominal stock of reserves, two instruments the Fed controls directly. This gives the model a &amp;lsquo;monetarist element&amp;rsquo; while keeping NK price rigidity (Calvo theta).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the model deliver determinacy and avoid the NK ZLB pathologies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the central bank sets the money supply (exogenously or via a QE rule), the model has a unique steady state provided 1 &amp;lt;= I^m &amp;lt; 1/beta: setting the IOR rate pins down real reserve demand, and the exogenous nominal stock then pins down the price level. The third-order price-level dynamic equation has roots 0&amp;lt;rho&amp;lt;1&amp;lt;omega_1&amp;lt;omega_2, satisfying Blanchard-Kahn for one predetermined variable, so there is a unique bounded solution. Anticipated future shocks decay exponentially (weights omega_1^{-k}, omega_2^{-k} both &amp;lt;1), so deflation stays bounded and inflation volatility stays low. In the basic NK model the analogous roots are 0&amp;lt;omega_1&amp;lt;1&amp;lt;omega_2, so weights grow exponentially with ZLB duration, producing explosive deflation and volatility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What exactly are the three (four) facts the model targets, and which mechanism handles each?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) No significant deflation and (2) little inflation volatility at the ZLB — handled by determinacy under a money-supply-setting central bank, giving bounded, duration-insensitive deflation. (3) No significant inflation after QE — handled by near-satiation (Gamma_m near 0, small steady-state spread) plus the expansion being temporary, so a large nominal-reserve increase is absorbed by a tiny fall in the IOR-vs-shadow-rate spread rather than by higher prices. (4) Money-market/T-bill rates below the IOR rate — handled by an extension where government bonds provide liquidity services to non-bank entities, generating T-bill returns below the IOR rate without requiring full reserve satiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two key conditions for QE to be nearly non-inflationary, and how sensitive are the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Condition 1: demand for reserves is close to satiation, meaning I^m close to I (Gamma_m near 0) so the semi-elasticity of reserve demand is large and a flat Gamma_m absorbs large supply changes through small spread movements. Condition 2: the expansion is perceived as temporary. Sensitivity: the inflation impact moves roughly one-for-one with the steady-state I - I^m spread (single QE2 impact = 9, 18, 37 bps for spreads of 5, 10, 20 bps) and roughly one-for-one with expected duration (18, 40, 84 bps for 5, 10, 20 years). A permanent expansion would be fully (proportionally) inflationary in the long run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the central spread calibrated given the shadow rate is unobservable, and why is that a limitation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The shadow bond rate I is a rate on hypothetical bonds with no non-pecuniary services in zero net supply, hence unobservable. Using Nagel (2016) and the repo-T-bill spread (8 bps in Nov 2010), assuming the convenience yield of borrowed Treasuries is half that of T-bills held outright, they back out a net shadow rate I-1 of about 30-35 bps and an I - I^m spread of about 5 bps; to be conservative they set the benchmark spread to 10 bps (alternatives 5 and 20). The authors flag the unobservability of the relevant spread as a genuine limitation of the model&amp;rsquo;s quantitative QE implications and call for future work with observable spreads.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the liquid-government-bond extension reconcile non-satiation with T-bill rates below the IOR rate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers derive utility from holding government bonds (a proxy for pension/money-market funds that hold bonds and supply financial services). Banks could use bonds instead of reserves for liquidity but choose not to in equilibrium, so the extended model&amp;rsquo;s equilibrium coincides with the benchmark for all common endogenous variables except the lump-sum transfer T_t. This lets the bond/T-bill return fall below the IOR rate (driven by strong non-bank demand, e.g., collateral or international reserve use) while reserve demand remains unsatiated, leaving all inflation results from Sections 3-4 intact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the model imply for monetary-policy normalization and Neo-Fisherian effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the log-linearized model under exogenous instruments, current and expected future IOR-rate hikes and balance-sheet contractions ALWAYS exert deflationary pressure: in the inflation solution (Equation 25), the coefficient on i^m_{t+k} is negative and on reserve growth mu_{t+k} is positive, because the unstable eigenvalues omega_1, omega_2 are positive real numbers &amp;gt;1 and delta_m·chi_y &amp;lt; 1. So the model has no Neo-Fisherian region (unlike some NK equilibria in Schmitt-Grohe-Uribe 2017 and Bilbiie 2022). The authors stress this hinges on the eigenvalues being positive reals; with complex or negative eigenvalues (as in MIU models) the sign could flip by horizon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the model say about the floor system and the Taylor principle?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under a floor system (nominal reserves exogenous, IOR rate set by a Taylor rule I^m = R(Pi, y)), local-equilibrium determinacy holds for ANY non-negative IOR response to current inflation (r_pi &amp;gt;= 0) — the Taylor principle is not required; even an IOR-rate peg works. If the rule also responds to output, a sufficient condition is r_y &amp;lt; (1 - delta_m·chi_y)/(delta_m·chi_i), whose right-hand side equals 15.7 under their calibration — comfortably above typical output coefficients (about an order of magnitude smaller), so determinacy is likely to prevail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks support the determinacy result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Appendix C replaces the exogenous nominal reserve stock with a QE rule (reserves react to output and the price level): determinacy no longer holds for all parameter values but holds for all reasonable calibrations. Appendix D adds household cash via a cash-in-advance constraint: determinacy still holds under an exogenous IOR rate and exogenous monetary base, except for implausible calibrations. The QE simulation results are also stated to be insensitive to most parameters (e.g., raising theta to 0.75 only makes inflation impacts smaller) and to plausible variations in the loan-rate and reserves/loans targets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on Diba and Loisel (2021), which showed a small monetary friction resolves NK puzzles/paradoxes under an IOR peg. Reserve/banking-cost modeling is close to Curdia and Woodford (2011) and Ireland (2014), but with new analytical results (determinacy proof, closed-form inflation/output solution) and three differences: banking costs tied to time spent on banking, borrowers are firms borrowing the wage bill, and reserve demand is not satiated. It complements asset-side QE models (Gertler-Karadi 2011, Sims et al. 2023) by focusing on the liability side. Versus Andolfatto (2015), which links low inflation to full satiation, this paper generates low inflation WITHOUT full satiation. The determinacy analysis overlaps most with Piazzesi, Rogers, Schneider (2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable caveats the authors themselves raise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They state the model cannot explain why QE1 (starting from about $45 billion of reserves in 2008) was non-inflationary, since Gamma_m was unlikely to be flat at such low reserve levels; they attribute QE1&amp;rsquo;s non-inflationary effect to a rise in reserve demand (interbank-market collapse, IOR introduction Oct 2008, later Basel III liquidity-coverage and stress-test requirements). The unobservable shadow rate limits quantitative precision. Results are qualitative for the inflation facts. The Discussion subsection explicitly notes some views &amp;lsquo;go beyond the formal results.&amp;rsquo;&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Banks of a Feather: The Informational Advantage of Being Alike</title><link>https://macropaperwarehouse.com/papers/banks-of-a-feather-the-informational-advantage-of-being-alike/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/banks-of-a-feather-the-informational-advantage-of-being-alike/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Can banks effectively monitor their peers under asymmetric information? Effective peer monitoring matters for functioning interbank markets and, by implication, financial markets and the transmission of monetary policy. If banks monitor effectively, central banks can stay in a &amp;ldquo;night-watchman&amp;rdquo; role (Goodfriend and King 1988); if they systematically fail to identify solvent counterparties, central banks should be more active (Freixas and Jorge 2008). The paper argues that PORTFOLIO SIMILARITY between two banks is the key to their reciprocal monitoring ability: a lender uses private information about its own loan portfolio to assess the quality of a peer&amp;rsquo;s portfolio, so it is better informed the more similar the two exposures.&lt;/p&gt;
&lt;p&gt;Data and setup: Quarterly bilateral bank-to-bank and bank-to-firm exposures from the German credit register, 2009-2018, covering 2,054 lending and 2,035 borrowing banks, balanced into 2,644,640 lender-borrower-quarter combinations; 701,533 true credit relations (102,044 within the same banking network, 2,087 within the same holding company). Interbank exposure represents 21% of German banks&amp;rsquo; total borrowing and 20% of total lending; ~1.4 trillion euros average quarterly exposure by end-2018. The authors build three novel measures: (1) Portfolio quality = 1 minus the exposure-weighted average probability of default (PD) from proprietary supervisory filings (a forward-looking, private quality proxy); (2) Portfolio opacity = exposure-weighted standard deviation of PDs different banks assign to the same borrower (peers&amp;rsquo; disagreement); (3) Portfolio similarity = cosine similarity of two banks&amp;rsquo; exposure vectors across 10 industries (WZ 73 one-digit) and 9 regions (first zip digit). Estimation uses a Heckman (1977) two-step sample selection model: a Probit selection equation for the extensive margin (whether a credit relation exists) and an OLS outcome equation for the intensive margin (percentage change in bilateral exposure), with lagged credit relation as exclusion restriction, plus lender, borrower and quarter-year fixed effects. Independent variables are standardized.&lt;/p&gt;
&lt;p&gt;Main findings (signs, magnitudes, scope): Portfolio quality validation - it negatively and significantly predicts next-quarter NPL ratios up to 2 years ahead, explaining 16-17% of cross-sectional NPL variation and 71-77% with fixed effects. For the AVERAGE bank, lending does NOT respond to borrower Portfolio quality (coefficients negative, mostly insignificant), but DOES respond to the backward-looking NPL ratio: a one-SD higher borrower NPL ratio lowers the probability of receiving a loan by 118 basis points (vs. unconditional 26.53%) and reduces amounts by 133-236 bp (avg. quarterly change 1.46%). Higher borrower Portfolio opacity reduces lending (extensive -38 bp; intensive -57 to -111 bp). The key result: interacting similarity with quality reverses this for similar pairs. For HIGH-similarity pairs (3 SD above mean), a one-SD increase in borrower Portfolio quality raises matching probability by 50 bp and lending by 408 bp; a deterioration cuts lending by 348-368 bp (avg. change between similar banks 10.95%). For LOW-similarity pairs, higher Portfolio quality LOWERS lending (matching -80 bp; amount -563 bp), and lending rises after quality deteriorates (370/342 bp), which Section 6 shows is a demand effect. The NPL-ratio response vanishes for similar pairs. Portfolio similarity itself raises lending: one-SD more sectoral similarity raises intensive-margin lending ~100-259 bp, regional similarity ~84-114 bp - jointly comparable in magnitude to relationship lending, the strongest known predictor. For opaque borrowers, high-similarity lenders lend MORE (extensive +23 bp; intensive +129 to +162 bp). A variance decomposition (Lemmon et al. 2008 ANCOVA) finds common/bank-pair characteristics explain 98.0% of extensive-margin variation and 18.9% of intensive-margin variation; lender, borrower and market characteristics explain only 1.2/0.8/0.1% (extensive) and 35.6/44.2/9.1% (intensive). Implication: peer monitoring works, but only among similar banks; this raises interbank efficiency at the cost of higher systemic risk and too-interconnected-to-fail concerns.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core estimation is a Heckman (1977) two-step sample selection model: a first-stage Probit for the extensive margin (existence of a bilateral credit relation) and a second-stage OLS for the intensive margin (log change in bilateral exposure), with the inverse Mills ratio carried into the second stage. The exclusion restriction is the lagged existence of a credit relation (Credit relation_{i,j,t-1}), which strongly predicts a current relation (first-stage t-statistic 335; t=293 in the similarity specification) because German interbank exposures are long-lived, yet carries no information on whether exposure will rise or fall next quarter. The chief threats are: (1) demand vs. supply confounding - observed lending is equilibrium, so a negative quality-lending link could reflect borrowers&amp;rsquo; demand rather than lenders&amp;rsquo; screening; addressed in Section 6. (2) Correlated portfolio quality of similar banks - a lender cutting lending in response to its OWN deteriorating portfolio could be misread as a reaction to a similar borrower&amp;rsquo;s portfolio; addressed via a matched sample in Section 7. The paper also notes both Portfolio quality and NPL series are persistent, so the predictive regressions should be read as &amp;lsquo;gentle evidence,&amp;rsquo; not strict causal proof.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the authors separate supply effects from demand effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They adapt Degryse et al. (2019). They define an adjusted exposure change bounded in [-2,2] (Chodorow-Reich 2014; Davis-Haltiwanger 1992) that captures both margins, then regress it on lending-bank-time fixed effects (proxying supply) and borrowing-bank-class x industry x region x time fixed effects (proxying demand, assuming homogeneous demand across lenders). The estimated lender-time fixed effects, demeaned and aggregated to the borrowing-bank level, give a borrower-specific liquidity-supply shock. Regressing this on borrower Portfolio quality, NPL ratio and opacity shows supply is restricted when quality deteriorates, NPL rises, or opacity increases. This confirms the puzzling positive lending-to-low-quality result for dissimilar pairs is a DEMAND effect: low-quality borrowers, shunned by similar lenders, demand more liquidity and turn to dissimilar lenders. The authors stress this borrower-level approach supports but cannot replace the bank-pair analysis, since it cannot include pair characteristics like similarity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do they rule out that lenders are just reacting to their own correlated portfolio quality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the full sample, the correlation of Portfolio quality between two above-average-similarity banks is 0.0499 versus only 0.0150 for below-average-similarity pairs. They build a matched subsample (nearest-neighbour matching, assigning each &amp;lsquo;similar&amp;rsquo; pair - both similarities above the 75th percentile in 2009Q1 - three &amp;lsquo;dissimilar&amp;rsquo; pairs below the 25th percentile with the closest Portfolio-quality correlation) so that within-pair quality correlation is the same for similar and dissimilar pairs, and redefine similarity as binary. If lenders only reacted to their own portfolio, the similarity x quality interaction should vanish in this sample. Instead, the interaction stays positive and mostly significant (and NPL x similarity too); weaker significance in some fixed-effect models reflects the smaller sample, since coefficient sizes are comparable to the main results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms, and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mechanism: information on a peer&amp;rsquo;s asset quality is private and costly to obtain; a lender proxies a peer&amp;rsquo;s portfolio quality by the average quality of the industries/regions it lends to, and can do this more cheaply when it already lends to the same industries/regions (similar portfolio). So similar lenders are better informed. Empirically distinguished by: (a) the average bank reacts to the public NPL ratio but not to private Portfolio quality, while similar pairs react strongly to Portfolio quality and barely to NPL - showing similar lenders access private information; (b) the similarity x quality and similarity x opacity interactions; (c) the supply-shock decomposition separating screening from demand; (d) the matched sample ruling out own-portfolio reactions. A competing mechanism, risk shifting (Elliott et al. 2018) - banks deliberately courting correlated counterparties to raise bailout probability - cannot be ruled out and may co-drive preferential lending between similar peers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) By similarity: similar pairs (3 SD above mean) react to forward-looking Portfolio quality and lend more to higher-quality and more-opaque peers; dissimilar pairs (3 SD below mean) react only to the backward-looking NPL ratio and end up lending more to low-quality borrowers via demand. (2) By opacity: lending between similar banks is especially important for opaque borrowers, who otherwise struggle to refinance; opaque banks are shunned by dissimilar lenders and turn to similar ones, while low-quality banks are shunned by similar lenders and turn to dissimilar ones. (3) Sectoral vs. regional similarity: both matter; sectoral similarity tends to have larger intensive-margin effects (e.g., 259 vs. 94 bp in Model 3). (4) Lender&amp;rsquo;s own quality: lenders cut lending when their own Portfolio quality falls (one-SD drop reduces amounts by 215-226 bp within-bank), consistent with prior work (Acharya-Merrouche 2013).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and additional analyses are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Multiple fixed-effect layers: cross-section, lender/borrower fixed effects, and added quarter-year fixed effects (Models 1-4 across tables). (2) Control set: lagged Capital ratio, Liquidity ratio, ROA, Loans-to-assets, Size, relationship lending and reverse relationship lending over an 8-quarter window, difference in liquidity surplus, same-network and same-holding-company dummies. (3) Supply-vs-demand decomposition (Section 6). (4) Matched-sample analysis breaking the quality correlation (Section 7). (5) Validation of Portfolio quality via NPL-predictive regressions and a panel Granger causality test (Juodis et al. 2021; Half-Panel Jackknife Wald &amp;gt; 300; Dumitrescu-Hurlin Z &amp;lt; -50), significant 5-50 quarters ahead. (6) Two-digit WZ 73 industry classification (100 industries) in Appendix B. (7) Variance decomposition (ANCOVA, Type III sums of squares) quantifying explanatory power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends peer-monitoring literature (Goodfriend-King 1988; Rochet-Tirole 1996; Flannery-Sorescu 1996; Furfine 2001) by showing that even among banks, the more similar the lender, the better its monitoring - identifying Perignon et al. (2018)&amp;rsquo;s &amp;lsquo;informed lenders&amp;rsquo; as similar-portfolio banks. Versus relationship-lending work (Affinito 2012; Braeuning-Fecht 2017; Cocco et al. 2009), it shows that with a similar portfolio NO long-standing relationship is needed to obtain quality information, and that similarity mitigates opaque banks&amp;rsquo; hampered access on top of relationships. It augments lender/borrower/market-characteristic studies by adding dyadic (common) covariates. Unlike prior work using aggregate bank-level ratios, CDS spreads, or rating-agency disagreement, it uses granular real-exposure data and proprietary supervisory PDs to measure private quality and peer-perceived opacity directly. It links to systemic-risk/contagion literature (Allen-Gale 2000; Fecht et al. 2011; Elliott et al. 2018), showing banks over-expose to similar counterparties despite indirect-contagion risk, surfacing an efficiency-vs-systemic-risk trade-off akin to focus-vs-diversification in Acharya et al. (2006).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Peer monitoring is real but partial: only similar banks effectively screen on private, forward-looking quality, while others fall back on inferior public proxies (NPL ratios). This bears on the central-bank &amp;rsquo;night-watchman vs. active&amp;rsquo; debate - because monitoring fails for dissimilar pairs, a purely hands-off stance may be insufficient. The headline trade-off: stronger lending between similar banks raises interbank informational efficiency and monitoring, but the above-average direct exposure between similar (correlated) banks multiplies systemic risk and too-interconnected-to-fail concerns, and reflects a lack of diversification. Scope conditions: results are specific to the German banking system (2009-2018), a tiered market dominated by private, savings, and cooperative banks with mostly long-term interbank loans (45% over a year, only 15% overnight); the data lack interest rates, so the analysis covers quantities/existence of lending, not prices; effects are estimated on bank-pairs that lent at least once; and the supply-identification assumes homogeneous borrower demand across lenders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key caveats the authors themselves flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) No interest-rate data, so price effects of similarity, quality and opacity are untested. (2) Portfolio quality and NPL series are persistent, so the forward-looking predictive evidence is &amp;lsquo;gentle,&amp;rsquo; not definitive. (3) The supply-shock approach gives borrower-level (not pair-level) shocks and cannot incorporate similarity. (4) Risk shifting cannot be ruled out as a co-driver of preferential lending between similar peers. (5) Portfolio quality is built using the median PD across IRB banks, excluding borrowers exposed only to Standardised-Approach banks. (6) The balanced sample includes only pairs that lent at least once, ignoring pairs that could theoretically but realistically would not lend (consistent with tiered-market evidence).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>CBDC as Imperfect Substitute to Bank Deposits: A Macroeconomic Perspective</title><link>https://macropaperwarehouse.com/papers/cbdc-as-imperfect-substitute-to-bank-deposits-a-macroeconomic-perspective/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/cbdc-as-imperfect-substitute-to-bank-deposits-a-macroeconomic-perspective/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: As central banks worldwide explore retail central bank digital currency (CBDC), the macroeconomic consequences depend heavily on how CBDC interacts with bank deposits. Prior work spans a wide range of conclusions — from &amp;ldquo;no effect&amp;rdquo; (Brunnermeier and Niepelt 2019) to disintermediation that reduces lending and output (Keister and Sanches 2022; Chiu et al. 2022) to large output gains (Barrdear and Kumhof 2021, +3% GDP). Bacchetta and Perazzi argue these differences hinge on (i) how substitutable CBDC is with checking deposits, (ii) how easily banks replace lost deposits with other funding, (iii) the interest rate on CBDC, and (iv) the competitive structure of banking. The paper provides quantitative welfare estimates in a model where CBDC and deposits are imperfect substitutes and banks are in monopolistic competition.&lt;/p&gt;
&lt;p&gt;Model setup: A closed-economy steady-state model (akin to Gali 2015 and Del Negro-Sims 2015) with households, &amp;ldquo;bank owners,&amp;rdquo; firms, banks, government, and central bank. Money reduces a transaction cost on consumption (Schmitt-Grohe-Uribe 2004 style). Deposits and CBDC combine via a CES composite liquid asset characterized by three CBDC design dimensions: its interest rate (rc), its relative liquidity (alpha_c/alpha_b, the CES weight), and its substitutability with deposits (elasticity epsilon_cb). Crucially, with monopolistic competition each bank takes the average deposit rate as given, so the equilibrium deposit rate is unaffected by CBDC (Lemma 1); and because firms can fund at the risk-free rate, bank credit extension and loan rates are also unaffected by CBDC in steady state. Calibration (US-based): risk-free rate 4%, deposit spread 2%, loan spread 1%, reserve ratio 5%, deposit management cost 25 bps, interest semi-elasticity of money demand -0.05, inverse Frisch elasticity gamma=1, wealth/consumption=4. The two extreme ownership cases are zeta=1 (&amp;ldquo;case a,&amp;rdquo; households fully own banks) and zeta=0 (&amp;ldquo;case b,&amp;rdquo; a zero-measure set of bankers receives all profits).&lt;/p&gt;
&lt;p&gt;Main findings (welfare in consumption-equivalent basis points): Welfare can improve via three channels — (1) seigniorage allowing lower distortionary labor taxes, (2) a lower opportunity cost of holding money (raising money holdings, cutting transaction costs, stimulating labor and consumption), and (3) redistribution of bank deposit rents from bankers to the general population. The optimal CBDC rate trades off seigniorage versus opportunity-cost reduction and is decreasing in the labor tax rate and decreasing in the share of banks owned by households (Proposition 3). The first two channels alone yield only modest gains: +9 bps at a 25% labor tax and +20 bps at 45%. Adding the redistribution channel (&amp;ldquo;case b&amp;rdquo;) raises non-bankers&amp;rsquo; welfare to +54 bps (25% tax) and +59 bps (45% tax); the headline maximum is about 60 bps. From Table 2 (epsilon_cb=20, equal liquidity): consumption rises +27 bps (case a) / +54 bps (case b) at 25% tax, and +41 / +62 bps at 45% tax. All benefits require historically normal interest rates (baseline 4%); near the zero lower bound seigniorage, money&amp;rsquo;s opportunity cost, and deposit rents all vanish, so the welfare gain falls roughly linearly to zero with the deposit spread.&lt;/p&gt;
&lt;p&gt;Policy/theoretical implications: CBDC is a tool to mitigate two distortions — distortionary taxation and the gap between the opportunity cost and the (low) production cost of money — plus a redistributive lever against the concentration of bank rents. The pure efficiency gains are modest; the larger gains come from redistribution and are larger where labor taxes (e.g., EU-14 averaging &amp;gt;40% vs. US ~25%), the Frisch elasticity, or the interest semi-elasticity of money demand are higher.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model&amp;rsquo;s identification/derivation strategy, since this is a theoretical paper rather than an empirical one?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There is no econometric identification; results come from a calibrated closed-economy steady-state general equilibrium model. The &amp;lsquo;identification&amp;rsquo; of the welfare channels is analytical: three propositions (proved in an online appendix) characterize how seigniorage and the optimal CBDC rate depend on CBDC liquidity (alpha_c), substitutability (epsilon_cb), and the labor tax rate, and numerical experiments on a US-calibrated economy quantify the welfare changes. The key structural assumption enabling the results is monopolistic competition in banking plus a financial-market funding alternative for banks at the risk-free rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the introduction of CBDC leave the deposit rate and bank lending unchanged in this model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Lemma 1: under monopolistic competition each individual bank takes the aggregate deposit rate as given and does not internalize how aggregate deposit demand shifts with CBDC, so its optimal deposit rate (eq. 30) is invariant to CBDC&amp;rsquo;s interest rate or liquidity. CBDC lowers aggregate deposit demand, so banks simply rely more on other liabilities (bonds/equity). Lending is unaffected because the marginal cost of bank funding remains the risk-free rate (banks can borrow from the market), so the loan rate (eq. 32) and quantity of loans do not change. This contrasts with monopoly/Cournot banking (Andolfatto 2021; Chiu et al. 2022) where CBDC moves the deposit rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three welfare channels and how is each maximized?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Seigniorage: higher central-bank seigniorage finances lower distortionary labor taxes; maximized by setting rc to raise seigniorage revenue (peak occurs at rc &amp;lt; rb in the cases analyzed). (2) Opportunity cost of money: paying high interest on CBDC raises money holdings and cuts the transaction cost, stimulating labor and consumption; maximized by setting rc equal to the risk-free rate so households drop deposits entirely and drive the transaction cost toward zero. (3) Redistribution: CBDC lets non-bankers capture deposit rents previously held by bankers (via tax cuts or interest on CBDC), maximal when zeta=0 and rc near the risk-free rate. Channels (1) and (2) conflict, generating the optimal-rate tradeoff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does seigniorage look like as a function of the CBDC rate, and what do Propositions 1-2 say?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Seigniorage is non-monotonic in rc: a higher rc lowers seigniorage per unit of CBDC but raises CBDC demand. Proposition 1 (under alpha_b^{epsilon_cb}*epsilon_cb &amp;gt; 1 and negligible CBDC management cost): the seigniorage-maximizing rc exceeds the deposit rate rb; if epsilon_cb&amp;gt;1.5 the optimal rc decreases in CBDC liquidity alpha_c; and the peak seigniorage rises with both alpha_c and epsilon_cb. Proposition 2: within that parameter region, maximum seigniorage is achieved as epsilon_cb to infinity (perfect substitutes) with rc set infinitesimally above rb — i.e., outcompete deposits. In the numerical cases shown, the seigniorage peak occurs at rc &amp;lt; rb, moving closer to rb as CBDC liquidity rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity / cross-country variation does the paper document?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two dimensions. (i) Labor tax level: US ~25% vs EU-14 averaging &amp;gt;40% (Trabandt-Uhlig 2011). Higher taxes raise the value of the seigniorage/tax-cut channel, lower the optimal CBDC rate, and raise welfare gains (efficiency gains +9 bps at 25% to +20 bps at 45%). (ii) Bank ownership (zeta): &amp;lsquo;case a&amp;rsquo; (households own banks) gives small gains (7-8 bps at 20% tax to 18-20 bps at 45%); &amp;lsquo;case b&amp;rsquo; (bankers own banks) gives large gains (52-53 bps at 20% to 58-60 bps at 45%) via redistribution. The optimal CBDC rate is higher in case b than case a and rises with the tax rate (Proposition 3 / Figure 3).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness / alternative-parameter checks are run (Table 3)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Frisch elasticity (gamma=0.25 i.e. Frisch=4, and gamma=4 i.e. Frisch=0.25): higher Frisch raises case-a gains (e.g., +28 bps at 25% tax) but case-b gains are roughly independent of Frisch. Interest semi-elasticity of money demand set to -0.12 (Benati et al. 2021 for Switzerland): with 45% taxes, gains reach +35 bps (case a) and +85 bps (case b) — this parameter has the biggest impact. Other variations with small effects: deposit/loan management costs, reserve ratio (0% vs 10%), bank-profit tax tau_b (15% vs 35%; lower tau_b means more inequality and larger CBDC gain), loan elasticity epsilon_l, working-capital share phi, wealth/consumption ratio (2 vs 4). Loan-side parameters and household wealth essentially do not matter because lending is unaffected by CBDC. With lump-sum (non-distortionary) taxes, case-a gains shrink (the seigniorage-tax channel is inactive) while case-b gains are essentially unchanged. At the zero lower bound the welfare gain is approximately linear in the deposit spread and zero when the spread (net of management cost) is zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from the closest prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus Barrdear and Kumhof (2021): shares the transaction-cost money-demand approach but estimates a much smaller welfare benefit; their large +3% GDP gain comes mainly from the central bank buying public debt and lowering the government bond rate — a channel absent here. Versus Brunnermeier-Niepelt (2019): they get equivalence (no effect) under specific funding conditions; here CBDC does affect outcomes through seigniorage, opportunity cost, and redistribution. Versus Andolfatto (2021, monopoly bank) and Chiu et al. (2022, Cournot): in those the CBDC rate moves the deposit rate, whereas monopolistic competition here insulates the deposit rate (Lemma 1). Versus Chiu-Davoodalhosseini (2021): the opportunity-cost channel is shared. The paper abstracts from cyclical issues (cf. Burlon et al. 2022 DSGE; Piazzesi et al. 2022 monetary-policy use of rc) by focusing on steady state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and scope conditions on the welfare results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Steady-state only — no transitional or cyclical analysis. (2) Requires historically normal interest rates; near the ZLB all three channels are inert. (3) Liquidity and substitutability are treated as fixed design constraints in the welfare optimization, with only rc as the policy lever, because they may be technologically hard to set. (4) The headline ~60 bps gain relies on the extreme &amp;lsquo;case b&amp;rsquo; (zero-measure bankers own all banks) and on the welfare function ignoring bankers — i.e., it is largely a redistribution result, not a pure efficiency result. (5) The model deliberately shuts down CBDC effects on bank lending (banks fund at the risk-free rate), so disintermediation-of-credit channels stressed elsewhere are absent by construction. (6) Bank profits in the model equal net interest income (~1.5-2% of consumption), comparable to US bank NII but higher than actual bank profits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Is cash incorporated, and does it change the conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline model excludes cash, but an appendix adds cash as a third zero-interest money in a nested CES (cash and CBDC combine, then that composite substitutes for deposits). The paper shows that if the &amp;lsquo;composite interest&amp;rsquo; of cash-plus-CBDC equals the rc of the two-instrument baseline, economic outcomes are unchanged: households rebalance across the three instruments so the equilibrium transaction cost and total cost of holding money are the same.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Does the Phillips Curve Lie Down as We Age?</title><link>https://macropaperwarehouse.com/papers/does-the-phillips-curve-lie-down-as-we-age/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/does-the-phillips-curve-lie-down-as-we-age/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The paper asks whether population aging flattens the Phillips curve through a previously unexplored channel — age-related differences in the elasticity of substitution across product varieties. Existing work on demographics and monetary policy emphasizes wealth, liquidity, and life-cycle savings channels. The authors instead argue that if older consumers are less willing to substitute across varieties of goods (i.e., they have a lower elasticity of substitution), then firms selling to them have more market power, adjust prices less responsively to marginal cost, and the slope of the Phillips curve falls. Because advanced economies are simultaneously aging and exhibiting a flattening Phillips curve, this offers a structural, demographically-driven explanation.&lt;/p&gt;
&lt;p&gt;Data and empirical strategy: The empirical analysis uses barcode (UPC) level retail purchase data from the NielsenIQ Homescan Consumer Panel, 2004-2019. The panel is rotating and nationally representative, surveying between 40,000 and 60,000 households per year (average 57,355 households/year), capturing over 900 million transactions and 1,117 product modules. Purchases are aggregated into five age groups (25-34, 35-44, 45-54, 55-64, 65+) within more than 1,000 disaggregated product modules. The elasticity of substitution within modules is estimated by age using the Feenstra (1994) / Broda and Weinstein (2006) supply-and-demand identification (applied as in Jaravel 2019), with Equation (4) estimated by weighted least squares and aggregate elasticities formed as expenditure-share-weighted averages of module elasticities. Each module must have at least 20 purchasing households.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: The youngest cohort (25-34) consistently has the highest elasticity and the oldest (65+) the lowest; the middle groups (35-64) are non-monotonic. Median elasticity is 5.73 for the oldest and 7.02 for the youngest, in line with prior estimates (Broda-Weinstein 2010, Hottman et al. 2016). The maximum gap (oldest vs. youngest) is 1.29 for medians and 1.55 for means — larger than the 0.375 difference Faber and Fally (2022) find between richest and poorest income quintiles. A decomposition (Table 1) attributes the 65+ vs. 25-34 gap to one-third lower within-module elasticities and two-thirds a composition effect (older baskets weighted toward lower-elasticity products); for other age groups vs. 65+, 55-60% comes from the within-module elasticity term. The age pattern survives income controls and is most pronounced in the top two income quartiles (over 70% of expenditure share), so the authors conclude the age gradient is not driven by income.&lt;/p&gt;
&lt;p&gt;Mechanism and theory: They extend a Rotemberg (1982) price-adjustment model to multiple consumer types. The log-linearized Phillips curve slope (Eq. 7/19) is the population-weighted average elasticity, sum_a (sigma_a - 1) s_a / phi. A lower share-weighted average elasticity flattens the curve: firms facing less price-sensitive (older) demand have more market power, can delay price changes, so inflation responds less to marginal cost. They note this does not hold in a first-order Calvo approximation with constant returns, but show in an Online Appendix menu-cost model that for empirically relevant parameters a lower elasticity reduces the probability of price adjustment, extending the result.&lt;/p&gt;
&lt;p&gt;Quantitative exercise: Calibrating phi = 122 to match a 2022 Phillips-curve slope of 0.055 (the Gagliardone et al. 2023 midpoint of an estimated 0.05-0.06 range), then feeding in 1984 consumption shares yields a slope of 0.056 — a 2.3% reduction over 1984-2022. Benchmarked against the literature&amp;rsquo;s roughly 50% (halving) decline in the slope (Furlanetto and Lepetit 2024), the demographic channel accounts for about 4.5% of the observed flattening (2.3/50 = 4.5). The authors describe this as not large but a genuine contributing factor.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for the elasticity of substitution, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They use the Feenstra (1994) and Broda-Weinstein (2006) double-difference approach. For each product module they specify a CES demand equation relating changes in expenditure shares to changes in prices (slope -(sigma_m - 1)) and an inverse supply equation. Differencing both relative to a reference barcode k eliminates the time-varying intercepts (alpha_mt, phi_mt). Assuming the differenced demand and supply errors are uncorrelated, the two are combined into a single moment condition (Eq. 4) involving squared and cross-product terms of differenced prices and shares, estimated by weighted least squares; sigma_m and the inverse supply elasticity omega_m are backed out from the estimated theta coefficients subject to sigma_m &amp;gt; 1 and omega_m &amp;gt; 0. The key identifying assumption is the orthogonality of demand and supply shocks (changes in unobserved quality vs. supply-side shocks). A second threat the authors directly address is that age correlates with income, so age differences in elasticity could reflect income; they rebut this by re-estimating within income halves. They use only continuing barcodes (present in t and t-1) to measure period-to-period changes, and exclude non-UPC &amp;lsquo;magnet&amp;rsquo; items like fresh produce.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the age effect distinguished from an income effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Income in the Homescan data is reported in discrete bins with a two-year lag, so the authors instead construct per-capita expenditure as an income proxy (following Faber and Fally 2022), regressing log total expenditure on household-size dummies and household attributes and netting out size effects; an appendix table shows this proxy is monotonically increasing in reported income bins. Re-estimating elasticities within the lower and upper 50% of the (expenditure-proxied) income distribution (Table 2), the falling-with-age pattern remains apparent conditional on being high income — indeed the gap across ages is even starker at higher incomes. Since upper-income households account for the large majority of expenditure within each age group, the pooled estimates track the upper-income pattern. The authors conclude the age gradient stems from a factor of age unrelated to income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two channels behind the age-elasticity gap, and how are they separated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A decomposition (Table 1) splits the overall elasticity gap between each younger group and the 65+ group into (i) a &amp;lsquo;difference from sigma&amp;rsquo; term that varies module elasticities while holding expenditure weights fixed (older people have lower elasticities within the same modules), and (ii) a &amp;lsquo;composition&amp;rsquo; term that holds module elasticities at the 65+ values and varies expenditure weights (older baskets tilt toward lower-elasticity modules). For the largest gap (65+ vs. 25-34), about one-third is the within-module elasticity effect and two-thirds is composition; for the other age groups vs. 65+, 55-60% is the within-module elasticity effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does a lower elasticity flatten the Phillips curve mechanically in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the multi-type Rotemberg model the non-linear pricing FOC (Eq. 5) scales marginal cost by consumption weighted by each cohort&amp;rsquo;s elasticity. Log-linearizing around zero-inflation steady state gives a slope equal to the share-weighted average (sigma-bar - 1)/phi. A lower sigma means products are less substitutable, firms have more market power and are less sensitive to marginal-cost changes, so they can absorb cost changes or delay passing them through without losing demand — making larger but less frequent price changes. Marginal cost must move relatively more to generate the same inflationary pressure, hence a flatter curve. As the old (lower sigma) consume a rising share of output, sigma-bar falls and the curve flattens.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Doesn&amp;rsquo;t the Calvo model undercut the result, since elasticity doesn&amp;rsquo;t enter its Phillips-curve slope?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To a first-order approximation around zero-inflation steady state with constant returns to scale, the elasticity of substitution does not affect the Calvo Phillips-curve slope, because the price-adjustment probability is exogenous and independent of pricing power. The authors address this two ways. First, with decreasing returns the Calvo slope does depend on elasticity (a higher elasticity flattens it via marginal-cost dispersion), an effect absent under Rotemberg because there is no price/cost dispersion. Second, and more importantly, in a one-period menu-cost model (Online Appendix B) they show the firm&amp;rsquo;s willingness to pay the fixed cost and update prices is increasing in sigma for empirically relevant parameters (6 &amp;lt; sigma &amp;lt; 11, phi around 0.5 implying a 5-10% profit share). Since Calvo is a special case of dynamic menu costs, a lower elasticity maps to a lower adjustment probability and thus a flatter curve, so the result extends beyond Rotemberg.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the quantitative exercise actually compute, and what are its limits?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is explicitly not a full-scale evaluation — it was added at a reviewer&amp;rsquo;s suggestion. They write the five-group slope (Eq. 8), calibrate phi = 122 so that 2022 elasticities and consumption shares reproduce a slope of 0.055 (Gagliardone et al. 2023 midpoint of 0.05-0.06, estimated from Danish firm-level marginal-cost data 1999-2019), then substitute 1984 consumption shares (holding elasticities fixed) to get 0.056. The resulting 2.3% slope decline, divided by the roughly 50% decline the literature reports (Furlanetto-Lepetit 2024 survey, with large uncertainty), gives about 4.5% of the observed flattening. The exercise varies only consumption shares, not the estimated elasticities themselves, over time, and the literature&amp;rsquo;s 50% benchmark is itself uncertain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented beyond the age gradient?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By income (Table 2): at lower income, mean elasticities rise slightly until 55-64 and are lowest for 65+; at higher income the age differences are starker than pooled. Median elasticities across income but within age are similar for ages 45+, but below 45 the lower-income group has smaller elasticities than the upper-income group. By year (Appendix Table 6): elasticities by age and year are reported for 2004-2019, with the oldest group lowest in essentially every year. The number of estimable modules differs across groups (e.g., Age 25-34: 378; 35-44: 632; 45-54: 743; 55-64: 768; 65+: 742), with fewer modules at younger and lower-income groups due to the 20-household threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It departs from the wealth/liquidity HANK literature (Kaplan-Violante 2018, McKay-Wolf 2023) and from age-and-monetary-policy work that runs through wealth and savings: Eggertsson et al. (2019) on aging savers pushing down the natural rate, Berg et al. (2021) on age-dependent interest-rate sensitivity via wealth, Leahy-Thapar (2022) on the age structure of entrepreneurs, and Juselius-Takats (2021) on demographics affecting the level of inflation. Closest is Mangiante (2023), who shows older households&amp;rsquo; baskets are weighted toward higher-price-rigidity products; this paper instead emphasizes that older households are themselves intrinsically less price-sensitive (lower within-module elasticity), a distinct price channel. It is consistent with Bornstein (2021) (older consumption more persistent) and Aguiar-Hurst (2007) (older households shop more, pay lower prices). It also speaks to the structural-stability literature (Rubio-Ramirez and Fernandez-Villaverde 2007): the aggregate elasticity is not a fixed structural parameter but depends on demographic composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the monetary-policy transmission mechanism depends on the Phillips-curve slope, ignoring the age distribution can bias the conduct and assessment of monetary policy efficacy; transmission will also have heterogeneous effects across age groups; and, all else equal, aging advanced economies should expect a flattening Phillips curve. Scope conditions: the channel is qualitatively important but quantitatively modest (about 4.5% of the observed flattening); the estimate covers retail/UPC purchases only and excludes services (where older households spend more and where price rigidities are higher per Cravino et al. 2022 and Mangiante 2023, so the composition effect may be understated); the flattening result is model-dependent (clean under Rotemberg, requiring the menu-cost argument to extend to Calvo); and the normative implications for optimal monetary policy are left as an open question.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and caveats does the paper provide?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Income re-estimation within income halves; per-capita expenditure validated as an income proxy against reported bins; a 20-household-per-module threshold; use of continuing barcodes only; exclusion of magnet items; year-by-year elasticity estimates (Appendix Table 6) showing stability of the ranking; the menu-cost extension to address Calvo; and explicit acknowledgment that services are missing from the data and that the quantitative benchmark (50% slope decline) is uncertain. The authors note the middle age groups are non-monotonic, so the result is a young-vs-old contrast rather than a strictly monotone age gradient.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Financial Fragility and the Fiscal Multiplier</title><link>https://macropaperwarehouse.com/papers/financial-fragility-and-the-fiscal-multiplier/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-fragility-and-the-fiscal-multiplier/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Does fiscal stimulus still work when it is financed through a banking system that is undercapitalized and holds large quantities of risky domestic government bonds? This was a first-order policy question in Southern Europe (Spain, Italy, Portugal — &amp;ldquo;SIP&amp;rdquo;) during the 2011–2013 European sovereign debt crisis, and the authors argue it is relevant again as central banks raise rates after the Zero Lower Bound. Motivating stylized facts: Spanish banks held domestic sovereign debt equal to more than 150% of Tier-1 capital (Italian banks ~200%, Greek banks ~250% at end-2011); CDS spreads on Italian and Spanish sovereign debt rose from ~100 bps in January 2010 to above 400 bps in 2012–2013 (Portugal exceeded 1000 bps at end-2011); VAR evidence shows sovereign-spread pass-through to corporate lending rates is nearly complete within six months. Gennaioli et al. (2018) document that 12.7% of emerging-market commercial bank assets are (mostly domestic) government bonds, extending relevance beyond Europe.&lt;/p&gt;
&lt;p&gt;Model setup: The authors first build a tractable two-period general-equilibrium model with leverage-constrained banks (Gertler-Karadi 2011 incentive-compatibility constraint), long-term debt, and endogenous sovereign default risk to derive analytical propositions. They then build and Bayesian-estimate an infinite-horizon New Keynesian DSGE model of a small open economy in a monetary union (in the spirit of Burriel et al. 2010), calibrated/estimated to Spain. Default risk is modeled as a non-strategic default driven by a stochastic maximum feasible level of taxation (Schabert-van Wijnbergen; Corsetti et al. 2013); the default probability draws from a generalized beta distribution. Long-term bonds use the Woodford (2001) decaying-coupon structure. Estimation uses quarterly Spanish data for 2003Q1–2010Q4 (10 observable series including real GDP, consumption, government spending, exports, imports, inflation, real wage, hours, deposit rate, and the NFC loan rate). The model is estimated WITHOUT sovereign risk because risk was minor over the estimation window. Key calibrated/estimated parameters: weighted steady-state leverage ratio phi-bar = 6.48; lambda_b/lambda_k = 0.5; posterior-mean corporate-loan diversion rate lambda_k-bar = 0.64 (implying lambda_b-bar = 0.32), both higher than the literature&amp;rsquo;s typical values (below 0.4 and 0.2), indicating financial frictions are relatively important for Spain. Steady-state default probability set to 50 quarterly basis points (~2% per year); default elasticity of 0.003 (small relative to Schabert-van Wijnbergen&amp;rsquo;s 0.01).&lt;/p&gt;
&lt;p&gt;Main quantitative findings: Simulating a financial crisis (a one-off 5% &amp;ldquo;MIT&amp;rdquo; increase in the corporate-loan diversion rate, persistence 0.7, output recovering after ~20 quarters) followed by a deficit-financed stimulus of 0.5% of quarterly GDP, the discounted cumulative multiplier is: +0.25 with short-term debt and no sovereign risk (row 1); +0.15 with long-term debt (20-quarter duration) and no sovereign risk (row 2); and -0.65 with both long-term debt and sovereign default risk (row 3). Adding long-term debt explains ~11% of the 90-bp decline; adding sovereign risk explains ~89%. Combining both ingredients lowers the multiplier by at least 0.60 percentage points versus including only one. Nonlinearities: the multiplier falls with stimulus size — for a delayed (4-quarter lag) stimulus, going from 0.5% to 4% of quarterly GDP lowers the multiplier by 0.58 pp (-0.65 to -1.23); for an immediate stimulus by 0.29 pp (-0.14 to -0.43). It falls only mildly with crisis size (delayed: -0.63 to -0.70 as the shock rises from 2% to 15%). Implementation timing: an immediate stimulus has multiplier -0.14 versus -0.65 for a 4-quarter delay, a 0.51-pp gap (the paper states &amp;ldquo;at least 0.30 pp&amp;rdquo; lower for a 4-quarter lag). Policy implications: implement stimuli fast after announcement, clean up bank balance sheets before stimulating, and keep stimuli small when banks are undercapitalized.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the new mechanism (&amp;ldquo;channel&amp;rdquo;) the paper identifies, and how does it differ from prior crowding-out stories?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A new credit-availability/crowding-out channel running through bank balance sheets. A deficit-financed stimulus raises the bond supply and (via higher debt) sovereign default risk, depressing bond prices. Undercapitalized, leverage-constrained banks holding existing government bonds suffer capital losses, which reduce net worth and tighten the incentive-compatibility (leverage) constraint, forcing them to cut corporate lending and crowding out private investment. The novelty versus prior bank-sovereign-nexus work (e.g., Corsetti et al. 2012, where banks do not hold government debt and causality runs only from sovereign problems to lending rates) is the feedback loop / &amp;lsquo;doom loop&amp;rsquo;: capital losses on existing bonds raise rates on newly issued bonds, aggravating the sovereign problem, causing further capital losses and further lending contraction. This amplification cycle requires both long-term debt and endogenous default risk to be quantitatively important.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three terms in the analytical decomposition of the lending response (equation 9)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the two-period model, the change in corporate lending dk0/dg0 decomposes into: (1) direct crowding out by new spending (-lambda_b) — lending must fall to free balance-sheet capacity to absorb newly issued bonds (Kirchner-van Wijnbergen 2016); (2) a funding-cost effect — higher deposit/funding costs raise the required return on loans, reducing loan demand (zero under the small-open-economy assumption); and (3) the key innovation — capital losses on existing long-term bond holdings b_{-1} from the bond-price drop (dq/dg0 &amp;lt; 0) reduce net worth, tightening the constraint and contracting lending further. The third term exists only with multi-period bonds and grows with maturity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the contribution of each ingredient (maturity vs. sovereign risk) quantified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By trimming the model stepwise (Table 1). Moving from short-term/no-risk (mu_D = 0.25) to long-term/no-risk (mu_D = 0.15) explains 11% of the total 90-bp decline. Adding sovereign default risk (mu_D = -0.65) explains the remaining ~89%. Thus sovereign risk is the dominant driver, but it bites significantly only in the presence of longer-maturity debt — at short maturities both with- and without-risk multipliers equal 0.25 (Figure 8).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does implementation timing matter, and what is the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A financial crisis lowers domestic prices relative to foreign (Eurozone) prices, improving competitiveness/terms of trade. A stimulus raises domestic prices, causing expenditure switching toward foreign goods and lower exports. An immediate stimulus is implemented while domestic goods are still cheap (crisis-induced), partially offsetting the loss; a delayed stimulus arrives after domestic prices have recovered, so the relative-price deterioration is larger and more persistent. Additionally, forward-looking banks anticipate the future debt issue, so the bond price falls (by almost 0.5% extra) and net worth contracts before implementation, producing negative output effects in the pre-implementation period. The cumulative multiplier falls from -0.14 (immediate) to -0.65 (4-quarter delay).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity / dimensions of variation are documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Debt maturity: the multiplier declines with average duration (Figure 8), more steeply with sovereign risk present. (2) Stimulus size: the multiplier falls substantially with size (Table 4), more for delayed stimuli (-0.58 pp) than immediate (-0.29 pp). (3) Financial-crisis size: the multiplier falls only mildly as the lambda_k shock rises from 2% to 15% (delayed: -0.63 to -0.70; immediate: -0.13 to -0.19) — quantitatively small. (4) Implementation lag: monotonically lower multiplier with longer lag (Figure 10). Heterogeneity across SIP countries is documented descriptively in the stylized facts (sovereign exposures and CDS spreads).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/estimation strategy, and what are its limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two-stage: first partial calibration (standard literature values plus first-moment targets such as steady-state labor supply and the leverage ratio phi-bar = 6.48 from Bank of Spain OMFI assets-over-capital, halved per Gertler-Karadi 2013); second, Bayesian estimation of remaining deep parameters via first-order approximation on 2003Q1–2010Q4 Spanish data. The NFC loan-rate series identifies the corporate-loan diversion rate (posterior mean 0.64). A key limitation acknowledged by the authors: the model is estimated WITHOUT sovereign default risk (because risk was minor in the estimation window, following Bocola 2016), and sovereign-risk parameters are calibrated rather than estimated. Statistical significance of the sovereign-risk effect is assessed by checking whether with-risk IRFs (bond prices, investment, output) lie outside the 90% HPD bands of the no-risk model — they do (Figure 7).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is sovereign default modeled, and does default actually hit bank net worth in equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Default is non-strategic (Aguiar-Amador 2013 language): each period a stochastic fiscal limit (max feasible taxation) is drawn from a generalized beta distribution; if required taxes exceed it, the government applies a haircut (1 - theta_t) on outstanding liabilities. Notably, the default gains are rebated to unconstrained households via lower lump-sum taxes and used to recapitalize banks in randomized fashion, so aggregate bank net worth is unaffected ex post by realized default (a modeling choice to avoid a discontinuity). The economically active channel is therefore ex ante: anticipated default risk lowers the bond price q_t, which lowers the market value of banks&amp;rsquo; existing holdings and tightens the leverage constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run (Appendix E)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The multiplier is recomputed for alternative values of: the steady-state corporate-loan diversion rate, the ratio of government bonds to corporate loans, the steady-state leverage ratio, the household bond-adjustment-cost coefficient, and the fraction of constrained households. Without sovereign risk the multiplier changes very little (for both short- and long-term debt), though it decreases when the fraction of constrained households is reduced. Alternative calibrations of the default-probability function change the multiplier more when debt is long-term and risky. The central conclusion — the multiplier falls substantially once sovereign default risk is added — holds across all alternative parameterizations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus Gornicka et al. (2020): both find a positive multiplier absent sovereign risk or long-term debt; the difference (negative multiplier) arises because Gornicka et al.&amp;rsquo;s sample pools all excessive-deficit-procedure countries regardless of whether they were in a sovereign crisis, whereas this paper focuses on a crisis country (Spain almost lost bond-market access in May 2012). Versus Corsetti et al. (2012/2013): those have one-directional causality (sovereign problems -&amp;gt; lending rates) and banks do not hold government debt, so the doom-loop feedback is absent. Versus Gertler-Karadi (2013), Bocola (2016), Kirchner-van Wijnbergen (2016), Kollmann et al. (2013): these let banks hold government bonds but treat sovereign risk as absent or exogenous; this paper endogenizes default probability via the fiscal-limit model, creating the amplification cycle. Versus van der Kwaak-van Wijnbergen (2014): that paper studies recapitalizations, not fiscal-policy effectiveness. Empirical support: Homar-van Wijnbergen (2017) find fiscal policy has no significant recovery effect when banks are not recapitalized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three main policy recommendations and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(i) Implement stimuli as soon as possible after announcement (minimize the announcement-implementation lag), because effectiveness deteriorates with delay; (ii) clean up / recapitalize commercial bank balance sheets early in a crisis BEFORE embarking on fiscal stimulus; (iii) keep stimuli small when banks are undercapitalized, since the multiplier declines with size. Scope conditions: these apply specifically to economies where banks are undercapitalized AND hold large quantities of long-term domestic sovereign debt subject to (endogenous) default risk — i.e., a combined banking-sovereign crisis (Spain/Southern Europe 2011–2013, and emerging markets with large domestic bond holdings). Absent sovereign risk or long-term debt, the multiplier is positive and standard.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why can the cumulative multiplier be negative even though the direct spending effect is positive?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The impulse-response (Figure 6) shows the output effect is negative before implementation (anticipation tightens bank balance sheets), turns positive at implementation, then turns negative again within a year as the balance-sheet/crowding-out channels dominate, fizzling to zero by ~40 quarters. When the negative areas (discounted) outweigh the positive, the cumulative discounted multiplier (Mountford-Uhlig 2009 definition, equation 32) turns negative (-0.65 in the base case), meaning the stimulus is self-defeating.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Financial Stability with Fire Sale Externalities</title><link>https://macropaperwarehouse.com/papers/financial-stability-with-fire-sale-externalities/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-stability-with-fire-sale-externalities/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Asset fire sales were a defining feature of the 2007-08 crisis, and post-crisis reforms (Basel III liquidity requirements, Money Market Mutual Fund reforms) were introduced to mitigate fire sale externalities by reducing distressed debt obligations and forcing larger liquidity buffers. The paper asks whether policies that successfully mitigate fire sale externalities actually improve financial stability, since it is not obvious how banks re-optimize in response.&lt;/p&gt;
&lt;p&gt;Model setup (no empirical data — this is a theoretical paper): The authors build a three-period (t = 0,1,2) Diamond-Dybvig (1983) model of financial intermediation augmented with (i) cash-in-the-market pricing in a financial market as in Allen and Gale (1998), and (ii) limited commitment as in Ennis and Keister (2009), following Li (2017). A unit continuum of ex ante identical depositors have CRRA preferences with relative risk aversion γ &amp;gt; 1. Each depositor is impatient with known probability π. There are two assets: a short-term storage asset (1 unit yields 1 next period) and a long-term asset (1 unit at t=0 yields R &amp;gt; 1 at t=2). The bank invests fraction x in the long-term asset and 1−x short. Long-term assets can be sold at t=1 at an endogenous price p to risk-neutral investors who receive endowment ws (market liquidity) and have outside return R* &amp;gt; 0. Runs are introduced via a sunspot s ∈ {α, β} with run probability q; runs are partial (stop after fraction π is served), following Ennis and Keister. The authors assume R* = R, which implies p ≤ 1 in equilibrium. Financial fragility is measured by q-bar, the maximum run probability q for which the run strategy is an equilibrium (run condition c1 ≥ c2β).&lt;/p&gt;
&lt;p&gt;Main analytical findings: (1) Without intervention, banks over-invest in long-term assets relative to the socially efficient level because each competitive bank takes p as given and does not internalize that selling long-term assets in a run depresses p (the fire sale externality); the equilibrium price is inefficiently low. (2) The bank&amp;rsquo;s best response is in Case I (no excess liquidity, fire sale occurs) when 0 &amp;lt; q &amp;lt; q_l, and Case II (excess liquidity held) when q_l ≤ q &amp;lt; 1 (Lemma 1). There is a unique q_c at which the market-clearing price p* turns from decreasing to increasing in q (Lemma 3). (3) Comparative statics on market liquidity ws (Proposition 1): when the relevant q-bar lies in Case II (low ws), q-bar is strictly increasing in ws, so a small rise in market liquidity raises fragility; when q-bar lies in Case I (high ws), q-bar is strictly decreasing in ws. The mechanism (Lemmas 4-5) is that a higher p* raises c1 via intertemporal substitution; the c2α/c2β effect is always dominant, flipping the sign of dq-bar/dws between cases. (4) The intervention: a regulator controls (x, c1), internalizing the effect on p, while the bank still chooses (c2α, c1β, c2β) taking p as given. The regulator chooses lower x and higher c1 than the bank in Case I (Lemma 6: c1 ≤ c1R, x ≥ xR), raising the market-clearing price (Proposition 2: p* ≤ pR* in Case I). (5) Key result (Proposition 3): q-bar_R ≥ q-bar when both solutions are in Case I (intervention always raises fragility); ambiguous otherwise. When ws (or R) is high, intervention raises fragility (q-bar_R &amp;gt; q-bar); when ws or R is low, intervention involves excess liquidity and lowers fragility (q-bar_R &amp;lt; q-bar). Proposition 4 gives a sufficient condition for q-bar_R &amp;gt; q-bar via four thresholds ws1≤ws≤ws2 and ws3&amp;lt;ws&amp;lt;ws4. When ws is sufficiently high, p = pR = 1, the externality vanishes, and q-bar = q-bar_R. (6) Welfare (Proposition 5): WR(q-bar) ≤ W(q-bar) when both in Case I, and for some parameter values otherwise — intervention does not always improve welfare and can worsen it when market liquidity is large.&lt;/p&gt;
&lt;p&gt;Policy implication: Mitigating fire sale externalities does not necessarily increase stability. Because the regulator takes q as given, it ignores that its own intervention can raise q-bar. Policymakers must internalize the fragility effect and balance externality mitigation against increased fragility, especially when market liquidity is high.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Is there an identification strategy or empirical data? What are the threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. This is a purely theoretical paper with no data, sample period, or estimation. The quantitative content consists of analytical comparative-statics results (Lemmas 1-6, Propositions 1-5) and numerical illustrations rendered as figures (Figures 4-9) for specific parameter combinations of (ws, R, q, γ, π). There is no econometric identification; the analog of robustness is the set of modeling assumptions and the parameter regions over which results hold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the core economic mechanism, and how does intervention raise fragility?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The regulator internalizes the fire sale externality by reducing the bank&amp;rsquo;s long-term holdings x and holding more short-term assets, which reduces asset supply in a crisis and raises the market value p of each long-term asset (this mitigates the externality and is the intended benefit). But two competing effects act on long-term payments c2β: the higher price raises the value of remaining long-term assets, while there are fewer long-term assets left for c2β (whose period-2 return R is fixed, so the price increase does not help c2β as it does c1β). The net effect on c2β is ambiguous. Simultaneously, reducing x lowers the relative cost of t=1 consumption, optimally pushing the regulator to raise short-term payment c1. Since the run condition is c1 ≥ c2β, raising c1 while c2β may fall makes early withdrawal more attractive, raising q-bar. When market liquidity is high, the net effect always increases fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of &amp;rsquo;excess liquidity&amp;rsquo; and how does it reverse the result at low market liquidity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Excess liquidity (Case II: πc1 &amp;lt; 1−x, holding more short-term assets than needed for the first π payments) is the bank&amp;rsquo;s/regulator&amp;rsquo;s hedge against runs. When ws is low, the anticipated fire sale price is low, so the regulator chooses to hold more excess liquidity than the bank. Excess liquidity supplies additional resources to pay c1β and further reduces asset supply (raising p), leaving more resources for c2β. This makes the net effect on c2β favorable enough that q-bar falls. Thus at low market liquidity the regulator can simultaneously mitigate the externality and reduce fragility; at high market liquidity, excess liquidity is small or zero and the fragility-increasing channel dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity / regime dependence is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results depend critically on the regime (Case I = no excess liquidity / fire sale; Case II = excess liquidity; Case III = excess liquidity, no fire sale, which never arises in equilibrium). The sign of dq-bar/dws flips between Case I (decreasing) and Case II (increasing). The intervention&amp;rsquo;s effect on fragility flips with market liquidity ws and long-term return R: low ws or low R → intervention reduces fragility; high ws or high R → intervention raises fragility; very high ws → externality vanishes (p = pR = 1) and intervention is neutral (q-bar = q-bar_R). The switch from Case I to Case II is governed by thresholds q_l (bank) and q_l,R (regulator), with q_l,R &amp;lt; q_l because the regulator internalizes the price and is more inclined to hold excess liquidity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness / generality checks are discussed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several modeling-assumption relaxations are argued not to change results qualitatively: (i) the assumption R* = R (giving p ≤ 1) can be generalized to allow p &amp;gt; 1, which does not undermine findings in the p &amp;lt; 1 range; (ii) partial runs can be generalized to multiple waves via a richer sunspot space without changing mechanisms; (iii) depositors not observing the bank&amp;rsquo;s portfolio can be replaced by observing it only after the withdrawal decision, with identical results; (iv) the simultaneous-move game is shown equivalent to a dynamic game in which the regulator moves first, as long as depositors cannot observe regulator choices; (v) the assumption that interventions convey no information to depositors can be relaxed (justified by the complexity of post-crisis regulation, e.g., the 848-page Dodd-Frank Act) without undermining the structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the fire sale externality literature (Lorenzoni 2008; Gale and Gottardi 2015; He and Kondor 2016; Davila and Korinek 2018 on over/under-investment; Acharya et al. 2011 and Gale and Yorulmazer 2020 on distorted portfolios; Perotti and Suarez 2011, Walther 2016, Kara and Ozsoy 2019 on optimal capital/liquidity regulation). It also builds on the bank-run literature (Bryant 1980; Diamond-Dybvig 1983) and on general-equilibrium / endogenous-portfolio extensions (Allen-Gale 2004; Farhi et al. 2009; Eisenbach-Phelan 2021; Cooper-Ross 1998; Ennis-Keister 2006; Li 2017). The stated novel contribution is being the first to show that policies designed to correct fire sale externalities can worsen financial fragility, achieved by jointly endogenizing the portfolio choice, the general-equilibrium asset price, and the equilibrium probability of a run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Macroprudential interventions that regulate short-term liabilities and portfolio choice to curb fire sale externalities can increase the equilibrium probability of runs. The scope condition is market liquidity: the harmful trade-off (mitigate externality but raise fragility, and sometimes lower welfare) arises specifically when market liquidity ws is high (and/or R high); when ws is low, the regulator&amp;rsquo;s optimal excess-liquidity holding lets intervention both mitigate the externality and reduce fragility. A central caveat is that the regulator takes q as given and so does not perceive that its policy raises q-bar; the prescriptive takeaway is that policymakers must internalize q-bar (the endogenous run probability) when designing such policies, balancing externality mitigation against fragility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are the quantitative results exact magnitudes or signs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s results are predominantly signs and ordinal comparisons (e.g., x ≥ xR, p* ≤ pR*, q-bar_R ≥ q-bar, monotonicity in ws and p) plus closed-form threshold expressions (q_l, p_l, p_u, the four ws thresholds in Proposition 4) given in the text and appendices. Specific numeric magnitudes appear only as illustrative figure values (e.g., the example in Figure 9 where intervention raises fragility when ws is near 0.2); the paper does not report calibrated point estimates beyond such illustrative figures.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Fire sale externality&lt;/strong&gt;: In this model, the inefficiency arising because each competitive bank takes the t=1 asset price p as given and does not internalize that its long-term holdings and crisis-time asset sales depress p, harming other banks. It leads banks to over-invest in long-term assets and sell more than the efficient amount, pushing the equilibrium price below its efficient level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash-in-the-market pricing&lt;/strong&gt;: The price of long-term assets at t=1 is set by the limited cash (endowment ws) that risk-neutral investors bring to the market rather than by fundamental value; when banks must sell, scarce market liquidity forces the price down (p ≤ 1 under the R*=R assumption).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial fragility (q-bar)&lt;/strong&gt;: Measured as q-bar, the maximum run probability q for which the partial-run strategy profile is part of an equilibrium, i.e., the largest q satisfying the run condition c1 ≥ c2β. Higher q-bar means the banking system is more fragile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Excess liquidity&lt;/strong&gt;: Short-term asset holdings beyond what is needed to pay the first π withdrawals (πc1 &amp;lt; 1−x; Case II). It is a precautionary buffer that supplies resources for crisis payments c1β, reduces asset supply, and raises the fire sale price; the regulator holds more of it than the bank when market liquidity is low.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Case I vs Case II vs Case III&lt;/strong&gt;: Regimes of the bank&amp;rsquo;s best response: Case I = no excess liquidity, fire sale occurs (small q, high ws); Case II = excess liquidity held with fire sale (large q, low ws); Case III = excess liquidity so large that no fire sale occurs — shown never to be an equilibrium because it implies c2β &amp;gt; c2α &amp;gt; c1 (no run condition).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regulator/intervention&lt;/strong&gt;: A planner that chooses (x, c1) internalizing the effect of these choices on the asset price p, while the bank still chooses (c2α, c1β, c2β) taking p as given and the regulator cannot direct depositors&amp;rsquo; withdrawal decisions; it represents the two policy instruments of regulating short-term liabilities and portfolio choice.&lt;/p&gt;</description></item><item><title>Fiscal Distress and Banking Performance: The Role of Macroprudential Regulation</title><link>https://macropaperwarehouse.com/papers/fiscal-distress-and-banking-performance-the-role-of-macroprudential-regulation/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/fiscal-distress-and-banking-performance-the-role-of-macroprudential-regulation/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper studies a transmission channel from sovereign fiscal weakness to banking performance that the literature has largely overlooked: government-provided deposit insurance, rather than banks&amp;rsquo; holdings of sovereign bonds. The motivation comes from the Eurozone crisis (especially Greece), where doubts about a government&amp;rsquo;s ability to honor its deposit-insurance pledge made bank deposits risky and weakened the banking system. The central question is whether allowing macroprudential policy (bank capital requirements) to adjust optimally to the degree of fiscal stress can sever the standard positive co-movement between sovereign and bank credit risk.&lt;/p&gt;
&lt;p&gt;The authors build a quarterly DSGE model based on Clerc et al. (2015) and Mendicino et al. (2018), featuring a rich financial sector with multiple agency problems, capital regulation, government deposit insurance, and endogenous bank default from idiosyncratic and aggregate loan-portfolio shocks. Their novel ingredient is that the Deposit Insurance Agency may honor only a fraction p of insured deposits when government finances are fragile; the unhonored portion is bailed in and becomes a junior claim on the failed bank&amp;rsquo;s repossessed assets. The key fiscal-robustness measure is gamma = p*k (fraction of deposits effectively insured), with robustness rising in gamma. The model is calibrated to Greece using Eurostat and Bank of Greece data over 2000-2010 (pre-crisis, to keep the steady state well behaved). Baseline calibration: gamma0 = 0.34 (set to match the average bank-deposit-vs-German-bund spread); capital requirements of 8% for corporate and 4% for mortgage loans; repossession cost mu = 0.3 (30% asset-value loss); idiosyncratic shock SDs sigma_m = 0.11 (households) and sigma_e = 0.487 (entrepreneurs); bank risk-shock SDs sigma_F = 0.0331 and sigma_H = 0.0163 set so steady-state bank default = 2%. Given the low default rate, the steady-state expected depositor bail-in is only 0.155% and the annualized deposit risk premium is 0.41%.&lt;/p&gt;
&lt;p&gt;Main findings: (1) Holding capital requirements fixed, greater fiscal frailty (lower gamma) raises the deposit spread, bank and corporate default rates, and lowers credit and GDP; welfare is a monotone decreasing function of fiscal frailty (1 - gamma). (2) The optimal level of corporate capital requirements rises uniformly as deposits become riskier — from phi_F = 0.1048 at gamma = 0.34 to phi_F = 0.1075 at gamma = 0.05. (3) Crucially, implementing this optimal increase lowers the bank default rate, producing a NEGATIVE correlation between sovereign and financial credit risk — reversing the standard positive correlation in the literature — while also making the output and credit contraction milder than under fixed requirements; the indirect (credit) channel is the bigger contributor to the output gain, not just direct default-cost savings. (4) Fiscal frailty exacerbates the effects of other risk shocks, but optimal macroprudential adjustment mitigates the response, and this insulation is more pronounced when financial uncertainty (risk-shock variance) is high; optimal requirements rise at an increasing rate with risk-shock variance. (5) A bankruptcy-law reform lowering repossession costs (illustrated as 30% to 10%) unambiguously raises welfare, supports LOWER optimal capital requirements, raises credit and output, lowers bank default, and improves insulation to risk shocks. Policy implication: under a banking union with pooled (weighted-average) fiscal capacity, fiscally weak countries see lower optimal requirements (benefit) and fiscally strong countries higher requirements (lose) — rationalizing why southern EU countries favored banking union and northern ones resisted.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism linking fiscal distress to banking performance, and how does it differ from the existing literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanism operates through the LIABILITY side of bank balance sheets via deposit insurance, not the asset side (banks holding sovereign bonds). When government finances are fragile, the Deposit Insurance Agency honors only a fraction p of insured deposits; the rest is bailed in and reclassified as a junior claim on the failed bank&amp;rsquo;s repossessed assets. This raises the riskiness of insured deposits, increases banks&amp;rsquo; cost of funding, reduces lending, raises borrowers&amp;rsquo; and hence banks&amp;rsquo; default probability. The extant literature (Bocola 2016; Broner et al.) focuses exclusively on the asset-side channel (bond prices weakening bank balance sheets) or fiscal-to-bank crowding out; this paper studies the deposit-insurance/liability channel, which played a real role in the Greek crisis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is fiscal robustness modeled formally?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fiscal robustness is gamma = p&lt;em&gt;k, where k is the (fixed, non-choice) fraction of nominally insured deposits and p is the fraction of the insurance pledge actually honored. The realized return on total bank debt is R-tilde_D = R_D minus (1 - gamma)&lt;em&gt;Omega, where Omega is the default loss per unit of bank debt. gamma can follow a feedback rule gamma_t = gamma0 + gamma1&lt;/em&gt;(RB_t - RB&lt;/em&gt;) + gamma2*(b_t - b*) + epsilon_t, with gamma1 &amp;lt; 0 (more public-debt repayment lowers fiscal space) and gamma2 &amp;gt; 0; in the baseline these feedback terms are switched off (gamma1 = gamma2 = epsilon = 0) so the analysis isolates differences in gamma0. Because taxation is lump-sum, the true optimal p is always unity; the authors treat reductions in fiscal capacity as exogenous rather than micro-founding the constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the key qualitative result that overturns a standard assumption in the literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The literature treats the positive correlation between sovereign credit risk and bank (financial) credit risk as a robust feature. This paper shows that if capital requirements adjust optimally to rising fiscal frailty, the optimal requirement RISES, which lowers the bank default rate, thereby generating a NEGATIVE correlation between sovereign and financial credit risk. So the standard positive co-movement is an artifact of holding macroprudential policy fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why do higher capital requirements support, rather than depress, output here?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;One might fear that higher requirements reduce bank lending and depress output. In the model&amp;rsquo;s general equilibrium, however, higher requirements make banks safer, which mitigates the rise in the deposit spread and the decline in deposits and bank credit. The net effect is that the recession is less severe than without policy adjustment. The authors find the INDIRECT effect (supporting a higher level of financial intermediation/credit) is a bigger contributor to the output gain than the DIRECT effect (saving on default costs).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the steady-state welfare analysis show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare is a negative, monotone function of fiscal frailty (1 - gamma): more fragility is socially detrimental. The reason for monotonicity is that deposit insurance is cheap to provide (funded by lump-sum taxes, so optimal gamma = 1) and there is no good substitute because depositors do not monitor banks. Under optimal capital requirements, welfare is higher for any given gamma, and the welfare benefit of adjusting requirements grows as fiscal frailty rises (the gap between the optimal-policy and fixed-policy welfare lines widens at lower gamma).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the quantitative magnitudes of the dynamic stabilization, and why are they small?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In response to a one-SD negative bank risk shock, moving from baseline gamma = 0.34 (optimal phi_F = 0.1048) to high fragility gamma = 0.05 worsens GDP and bank default. Adjusting phi_F optimally to 0.1075 mitigates this. The quantitative effects are SMALL because uninsured deposits are nearly risk-free in the calibration (steady-state bank default only 2%, expected bail-in only 0.155%, high asset recovery), and because the economy is assumed to start at the optimal capital requirement. The authors note that if the economy instead started at the suboptimal Basel III minimum of 8% (CAR = 0.08), failing to adjust requirements would be considerably more consequential — the gap would be quantitatively bigger (shown in online appendix A1.5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do incomplete deposit insurance and risk-shock variance interact?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Holding requirements fixed, raising the variance of the entrepreneurial risk shock (sigma_e) modestly lowers mean output and raises its volatility; a lower gamma (higher bail-in risk) exaggerates all these effects, so the two uncertainty sources interact in a destabilizing way. Optimal macroprudential policy partly contains this. For corporate-bank risk-shock variance (sigma_F), the bank-default response is non-monotone: to the left of sigma_F = 0.0331 the default rate is higher under optimal policy (banks are sub-optimally OVER-capitalized there), and to the right it is lower (banks sub-optimally UNDER-capitalized). Optimal phi_F rises at an increasing rate with risk-shock variance, so countries with greater financial/aggregate volatility need higher capital requirements; combining high uncertainty with high fiscal frailty magnifies optimal requirements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the model imply for banking union, and what is the scope condition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If the banking union&amp;rsquo;s fiscal capacity is the weighted average of members&amp;rsquo;, fiscally strong countries face HIGHER optimal capital requirements on joining (worse off, due to the costly credit/output side of requirements) and fiscally weak countries face LOWER requirements (better off). This rationalizes southern EU countries favoring banking union and northern countries resisting (unwilling to share fiscal capacity for bailouts). The explicit scope condition: this is only ONE factor among many in the banking-union decision — a narrow fiscal perspective. Moreover, even removing the fiscal dimension (e.g., via an EU-wide deposit insurance scheme), differences in economic uncertainty across countries still make banking union problematic because optimal requirements differ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness exercises are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Six: (i) Extending government guarantees to all bank debt (gamma = 1) — full insurance mitigates the effect of bank risk shocks. (ii) Open-economy version with external public debt (Abad 2018 framework; debt burden 5% then 15% of GDP, gamma1 = -0.012, persistence rho_RB = 0.57): higher external-debt servicing costs reduce welfare, consumption, investment but RAISE output, deposit spreads, bank default, and optimal requirements — output rises because higher non-distortionary taxes create a negative wealth effect that makes households work more; higher external indebtedness mitigates the GDP/default impact of a bank risk shock. (iii) Lower repossession costs (30% to 10%) — higher welfare, lower optimal requirements, higher credit/output, lower default, better risk-shock insulation. (iv) Alternative welfare weights (baseline savers 0.5863, borrowers 0.4137) — no qualitative change; a higher weight on savers lowers welfare under optimal requirements (savers have lower marginal utility) and calls for higher optimal requirements to protect savings. (v) Dynamics around the suboptimal Basel III minimum CAR = 0.08 instead of the optimal level — yields bigger quantitative effects. (vi) A short-cut for the asset-side channel: combining a negative bank net-worth shock (-1% of steady-state output) with a negative public-debt-servicing-cost shock (-1%) — outcomes are worse except output, which falls by less due to the wealth-effect labor-supply response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main threats to the analysis / caveats the authors acknowledge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model deliberately omits the asset-side channel (banks holding long-term government bonds), which would require an extra state variable; they approximate it only via the combined-shock short cut in appendix A1.6. Fiscal capacity is not micro-founded — gamma is treated as exogenous, and because taxation is lump-sum the true optimal gamma is always 1, so there is no genuine fiscal trade-off generating an interior solution. Calibration of the deposit-insurance parameters (k and p separately) is speculative because no data exist; gamma0 = 0.34 is backed out from the deposit spread. DSGE methods are unsuitable for large crisis deviations, so calibration uses pre-crisis 2000-2010 data. The banking-union result is explicitly only one narrow fiscal consideration among many.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds directly on the Clerc et al. (2015) and Mendicino et al. (2018) three-layers-of-default DSGE models, adding incomplete deposit insurance tied to fiscal capacity. It contributes to the strand studying transmission of fiscal fragility to bank lending (Bocola 2016; Broner et al. 2013/2014) but via deposit insurance rather than bond exposure or selective default. Stavrakeva (2017) also finds a positive relationship between fiscal capacity and minimum capital requirements (in a model with moral hazard and pecuniary externalities) but does not pursue the macroeconomic implications. Farhi and Tirole (2017/2018) is the main exception that considers prudential policy and contagion, but their focus is on how banking union overcomes national regulators&amp;rsquo; supervisory leniency (a doom loop from fundamentals), a different question.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Fiscal robustness (gamma = p*k)&lt;/strong&gt;: The fraction of bank deposits that is EFFECTIVELY insured, equal to the nominally insured share k times the fraction p of the pledge the Deposit Insurance Agency actually honors. Robustness increases in gamma; 1 - gamma measures fiscal frailty. Baseline gamma0 = 0.34.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incomplete deposit insurance / depositor bail-in&lt;/strong&gt;: In this model the government, when fiscally fragile, honors only fraction p of insured deposits; the unhonored portion is added to the uninsured tranche as a junior claim on the failed bank&amp;rsquo;s repossessed assets. From a creditor&amp;rsquo;s view, one unit of dishonored insured debt equals one unit of uninsured debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Optimal capital requirement (phi_F)&lt;/strong&gt;: The corporate-loan capital requirement that maximizes the unconditional second-order approximation of the social welfare function. It rises with fiscal frailty (0.1048 at gamma = 0.34, 0.1075 at gamma = 0.05) and rises at an increasing rate with risk-shock variance. Its relation to welfare is hump-shaped, reflecting a trade-off between bank default and underinvestment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sovereign-financial credit-risk correlation reversal&lt;/strong&gt;: The paper&amp;rsquo;s central result: the standard POSITIVE co-movement between sovereign and bank default risk becomes NEGATIVE once capital requirements are allowed to adjust optimally to fiscal frailty, because higher optimal requirements lower the bank default rate even as fiscal risk rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Direct vs indirect effects of fiscal frailty&lt;/strong&gt;: Direct effects are output lost to default and savings on default costs from higher requirements; indirect effects work through the level of deposits and bank credit (financial intermediation). The indirect (credit) channel is found to be the larger driver of why optimal requirements support output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Repossession cost (mu)&lt;/strong&gt;: The fraction of a defaulting unit&amp;rsquo;s asset value lost to creditors upon repossession, set to 0.3 (30%) in the baseline. Lowering it (e.g., to 10% via bankruptcy-law reform) raises welfare, supports LOWER optimal capital requirements, and improves insulation against bank risk shocks.&lt;/p&gt;</description></item><item><title>Global Factors in Noncore Bank Funding and Exchange Rate Flexibility</title><link>https://macropaperwarehouse.com/papers/global-factors-in-noncore-bank-funding-and-exchange-rate-flexibility/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/global-factors-in-noncore-bank-funding-and-exchange-rate-flexibility/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The paper asks how far global factors drive the foreign-borrowing component of advanced-economy banks&amp;rsquo; non-core funding, and whether exchange rate flexibility (and macroprudential policy) can insulate national banking systems from those global factors. This speaks to the long-running &amp;ldquo;trilemma vs. dilemma&amp;rdquo; debate (Rey 2015 vs. Mundell 1963; Miranda-Agrippino and Rey 2020) over whether a flexible exchange rate buys monetary/financial autonomy under open capital accounts. Non-core funding (funding other than deposits — repos, debt securities, foreign borrowing) matters because, per Shin and Shin (2011), Hahm et al. (2013) and Jorda et al. (2017), it is an elastic, crisis-predictive funding source closely tied to credit booms and leverage.&lt;/p&gt;
&lt;p&gt;Data and method: A balanced quarterly panel of 31 advanced (high-income) economies, 2004:Q1-2022:Q1, &amp;gt;2,000 country-quarter observations (most specifications drop Iceland as an outlier, leaving 30 countries, 72 periods, 2,160 obs). The non-core ratio is foreign liabilities (IFS line 26c) over deposits (lines 24+25); mean 78%, SD ~94%. The loan-to-deposit ratio (mean 122%, SD ~58%) is a robustness outcome; the two are correlated at ρ=0.92. Sample is ~53% fixed exchange rate (Ilzetzki et al. 2019 coarse classification, monetary union counts as fixed); average Chinn-Ito index 0.95, so capital accounts are essentially fully open. Identification combines the Pesaran (2006) Common Correlated Effects (CCE) estimator with the Mean Group (MG) estimator in a three-step procedure: (1) CCE-MG with observed global factors plus cross-section averages to absorb unobserved factors; (2) extract principal components (number set by Ahn-Horenstein 2013 criterion) from the composite residual; (3) re-estimate with PCs, allowing PC loadings to differ by exchange rate regime.&lt;/p&gt;
&lt;p&gt;Main findings with magnitudes: (1) The non-core ratio is highly persistent (lagged dependent variable significant at 1% throughout; coefficient 0.659 in the baseline MG-PC specification) and overwhelmingly driven by global factors; the number of common factors in the non-core ratio is estimated at 3, and the three PCs explain ~80% of the explained variance (PC1 0.795, PC2 0.585, PC3 0.138 — note these sum to &amp;gt;1 and are reported as the lower panel of Table 3). (2) Standard two-way fixed effects leave strong residual cross-sectional dependence (CD test rejects), so are likely biased; the CCE step drives the residual CD statistic to a non-rejection 0.797 (p=0.425) with zero residual factors. (3) Central result: global factors raise non-core ratios more for fixers than floaters — the PC1 loading is 0.984 for fixers vs. 0.302 for floaters; PC2 is significant for fixers, PC3 for floaters; a test on the summed PC loadings (statistic 7.12) confirms larger loadings for fixers. So flexible exchange rates partially insulate. (4) Insulation is stronger away from crises: in the no-crisis 2010-2019 sample the fixer-floater gap in PC1 widens and PC3 (a crisis factor) turns insignificant. (5) Among domestic variables, only the lagged dependent variable, a more appreciated real exchange rate, and higher money/GDP significantly raise non-core ratios; country-specific factors play a minor role overall.&lt;/p&gt;
&lt;p&gt;Mechanisms and implications: Relating PCs to observables, PC1 loads most on world macroprudential stringency (tighter regulation lowers non-core ratios), PC2 on the US shadow rate (positive in-sample, reflecting QE/QT dynamics), PC3 on financial-crisis dummies. VIX, oil prices and the US real exchange rate carry expected signs but smaller effects. Using BIS Locational Banking Statistics (23 of 30 countries), the global-factor effect works mainly through interbank borrowing (cross-border liabilities to banks), a flighty source; currency denomination matters little. Tighter macroprudential policy provides complementary insulation, especially for fixers against PC2 and PC3 (which together explain ~21% of non-core variation): for fixers the PC2/PC3 loadings of ~1.47/1.55 under loose regulation fall to essentially zero under tight regulation; for floaters macroprudential tightness adds no insulation. Policy upshot: the Mundellian trilemma is broadly supported for bank funding — flexible exchange rates and tighter macroprudential rules each dampen transmission of the global financial cycle to bank balance sheets, though not against crisis shocks.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors estimate a dynamic interactive-fixed-effects panel where the non-core ratio depends on its lag, country-specific variables, observed global factors, and unobserved common factors with country-specific (heterogeneous) loadings. Identification proceeds in three steps: (1) a CCE-MG regression (Pesaran 2006; Chudik-Pesaran) that includes observed global factors directly and approximates unobserved factors via cross-section averages of the dependent and independent variables, identifying the country-specific slopes off the variation in regressors orthogonal to common factors; (2) extraction of principal components from the composite residual u-hat that encapsulates the entire factor structure (number of PCs = 3, the estimated number of common factors in the non-core ratio); (3) re-estimation with the PCs, with loadings split by exchange rate regime. The main threat is that omitted/unobserved common factors correlated with the regressors cause strong cross-sectional dependence and biased, inconsistent estimates — exactly what they show afflicts two-way fixed effects (CD test rejects weak dependence; 2 residual factors remain). They verify the CCE step removes this: residual CD statistic 0.797 (p=0.425) and zero estimated residual factors, so the composite captures the full factor structure. They use one-quarter lags of all observables to limit endogeneity, and the rank condition is met with six cross-section averages exceeding the number of factors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;After establishing the PCs statistically, the authors give them economic content by regressing each standardized PC on observed global factors (Table 6). PC1 loads most strongly on world macroprudential stringency (coefficient -2.957 on the non-core ratio direction, i.e., tighter global regulation lowers non-core ratios), R2=0.971. PC2 is driven by the US shadow rate (coefficient 1.171, positive), R2=0.921. PC3 is driven by financial-crisis dummies — adding a US banking crisis dummy (2007:Q4-2011:Q4) raises the PC3 regression R2 and the crisis dummy (coefficient 2.050) dominates the macroprudential variable. The positive PC2-US-rate relation seems to contradict the GFC literature (lower US rates usually raise cross-border flows), but they explain it via QE: lower shadow rates from bond purchases flatten the yield curve and push banks to fund via long-term bond issuance rather than short-term interbank borrowing; since their non-core measure is dominated by interbank borrowing, lower shadow rates reduce it. They show the sign flips to the conventional negative when using the loan-to-deposit ratio (Appendix Table 11) or a pre-2007 (pre-QE) sample (correlation -15.7%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two main dimensions. (1) Exchange rate regime: PC loadings are larger for fixers than floaters — PC1 loading 0.984 (fixers) vs. 0.302 (floaters); PC2 significant for fixers, PC3 for floaters; the summed-loading difference test statistic is 7.12 (p in the test reported as 0.011 for PCF1&amp;gt;PCF0). (2) Macroprudential stance: countries that tightened macroprudential policy more than the median country are less affected by PC2 and PC3. The insulation from tight macroprudential policy is concentrated in fixers — for fixers the PC2 (PC3) loading of ~1.47 (1.55) under loose regulation falls to essentially zero under tight regulation; for floaters, macroprudential tightness gives no additional insulation. Beyond this, country-specific slopes are confirmed necessary by slope-heterogeneity tests (the delta tests reject homogeneity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Five (Table 4): (1) dropping the United States (since observed global factors are US-dominated) — results hold, PC1+PC3 affect floaters, PC1+PC2 affect fixers. (2) Including Iceland — results similar but less precise and some residual cross-sectional dependence reappears. (3) Dropping COVID (sample ends 2019:Q4) — virtually unchanged, slightly lower significance. (4) A pure no-crisis sample 2010:Q1-2019:Q4 — PC1 and PC2 still larger for fixers, the fixer-floater PC1 gap widens (insulation stronger outside crises), and PC3 turns insignificant for both groups (consistent with PC3 being a crisis factor). (5) Loan-to-deposit ratio as alternative outcome — PC1 and PC2 significant for floaters, PC1 only for fixers; the apparent lack of flexible-rate insulation to PC1 here is driven by the crisis episodes, and disappears when GFC/COVID are dropped. The three-step CCE diagnostics (first-stage CD non-rejection, zero residual factors) hold across columns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends the global-financial-cycle literature (Rey 2015; Miranda-Agrippino and Rey 2020; Bruno and Shin 2015; Obstfeld et al. 2019) and the non-core-funding literature (Shin and Shin 2011; Hahm et al. 2013) by focusing specifically on the non-core-to-core funding ratio of advanced-economy banking systems rather than capital flows or interest rates. Relative to Amiti et al. (2017) — who find global factors explain cross-border flows mainly in expansions — and Cerutti et al. (2019) — who find the global component explains less than a quarter of capital-flow variation — this paper finds global factors overwhelmingly dominate the non-core ratio. Methodologically it differs by combining Pesaran&amp;rsquo;s CCE estimator with PC extraction and MG estimation to identify and economically label the global factors, rather than relying on two-way fixed effects, which it shows are biased here by uneliminated cross-sectional dependence. It sides with the trilemma camp (exchange rate flexibility insulates, at least partially) against the strong &amp;lsquo;dilemma&amp;rsquo; view.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Flexible exchange rates partially insulate bank non-core funding from the global financial cycle, and tighter macroprudential regulation provides complementary insulation — supporting the Mundellian trilemma for bank balance sheets. Scope conditions: (1) insulation works against regulatory/financial/real drivers (PC1, PC2) but NOT against financial-crisis shocks (PC3), which hit fixers and floaters similarly; (2) insulation is stronger away from global crises; (3) macroprudential insulation operates mainly for fixed-rate countries; (4) the global financial cycle cannot be summarized by a single observable (VIX or otherwise) — it is best captured by composite principal components, so policymakers should monitor a bundle of real, monetary and financial indicators. The authors explicitly caution the currency-denomination-doesn&amp;rsquo;t-matter result and the broader findings are advanced-economy-specific and may not extend to emerging markets with larger currency mismatches and more volatile exchange rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Through which liability channel does the global-factor effect operate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using BIS Locational Banking Statistics (23 of 30 countries) in fixed-effects regressions of cross-border liability components on the three PCs (Table 7), all three PCs are positively correlated with total cross-border liabilities. The effect materializes through both domestic- and foreign-currency liabilities (currency denomination matters little — sample correlations 80% foreign-currency, 82% domestic-currency) and, crucially, through cross-border liabilities vis-a-vis other banks (interbank borrowing, correlation 89% with the non-core ratio). Liabilities to nonbank financials (correlation 80%) and other sectors (correlation 18%) are hardly, or even negatively, related to the PCs. Interbank funding is emphasized as a particularly flighty source.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why use the CCE/MG estimator instead of two-way fixed effects, and what is the cost?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two-way fixed effects assume additive country and time effects and cannot absorb unobserved common factors that load heterogeneously across countries or are correlated with regressors; in this data they leave strong residual cross-sectional dependence (CD test rejects; two residual factors), implying biased and inconsistent slopes. The CCE estimator approximates unobserved factors by cross-section averages without needing to know the exact number of factors, and the MG estimator allows country-specific slopes (confirmed necessary by slope-heterogeneity tests). The pooled CCE estimator failed to remove residual cross-country correlation in every specification and was inferior to MG. A cost is that the PCs span observed and unobserved factors and lack a clean one-to-one economic meaning, which the authors address by separately regressing PCs on observables (Section 5.1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the descriptive evidence show before the regressions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The non-core ratio and loan-to-deposit ratio co-move strongly (ρ=0.92). The non-core ratio is generally higher for fixed-rate countries, shows long-term trend shifts and co-movement across regime groups, rose before the GFC to a global peak of 70% in 2008, then fell to about 30% by 2022, with short-term fixer-floater divergence only in 2015-2020. The benchmark non-core ratio correlates 88% with the overall BIS cross-border liability variable.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Go big or buy a home: The impact of student debt on career and housing choices</title><link>https://macropaperwarehouse.com/papers/go-big-or-buy-a-home-the-impact-of-student-debt-on-career-and-housing-choices/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/go-big-or-buy-a-home-the-impact-of-student-debt-on-career-and-housing-choices/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Folch and Mazzone ask how undergraduate student debt shapes three intertwined post-college decisions — whether to pursue a post-bachelor (graduate) degree, the trajectory of earnings, and whether/when to buy a home. The motivation is the steep rise in student borrowing: between 1993 and 2016 the share of undergraduates who ever borrowed rose from 45% to 68%, and median cumulative borrowing rose from $14,329 to $29,115 (2020 dollars). The puzzle the paper resolves is why debt strongly distorts education and earnings yet has a negligible net effect on home ownership timing.&lt;/p&gt;
&lt;p&gt;Data and empirical strategy: The authors use restricted-use Baccalaureate and Beyond Longitudinal Study (B&amp;amp;B) data, focusing on the B&amp;amp;B:08/18 cohort (followed up to ten years post-graduation), merged with college-level IPEDS/College Scorecard data. The sample is restricted to US citizens/residents who earned a bachelor&amp;rsquo;s at ages 21-25, first enrolled 2001-2004, did not transfer, and excludes private for-profit colleges (~9,000 graduates in B&amp;amp;B:08/18; ~8,000 in B&amp;amp;B:16/17). In 2008, 72% of graduates held debt averaging $23,640; in 2016, 66% averaging $28,843. To address endogeneity of debt, they instrument with the change during enrollment in an institution-level grant-to-aid ratio (institutional grants / (grants + loans)), exploiting supply-side shifts in grants unlikely to be anticipated at application. The first stage is strong: one SD increase in grant-to-aid while enrolled predicts an ~18% decline in debt (about $4,250 lower balances), with F-statistics around 22-29.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: Increasing debt balances by 10% ($2,364 relative to average $23,640) reduces the probability of obtaining a post-bachelor degree by about 1 percentage point (from a baseline of 22% four years after graduation and 45% ten years after). The same 10% increase raises initial post-graduation earnings — about +3.6% four years out ($1,440) and +$1,392 one year out — but reverses to a 5.3% decline ($2,828) ten years out. Graduate-school enrollment falls by about 0.85% (1 year) and 0.83% (4 years) per 10% debt increase. The net effect on first-time home ownership timing is statistically insignificant.&lt;/p&gt;
&lt;p&gt;Mechanisms: A life-cycle Roy model (Borjas 1987) with Ben-Porath (1967) human capital accumulation, housing, and financial frictions rationalizes this. Debt affects home ownership through two offsetting channels: (1) a traditional wealth effect that deters ownership, and (2) discouragement of further education that pushes graduates into early labor-market entry, accelerating ownership for that subgroup; these roughly cancel. Education choices are especially wealth-sensitive because post-bachelor attendance carries large non-monetary (amenity) returns valued at $3,929 on average (vs. $1,155 housing amenity), while the medium-run graduate wage premium is roughly 30% controlling for ability and human capital.&lt;/p&gt;
&lt;p&gt;Policy implications: Traditional mortgage-style fixed repayment imposes high burdens right after graduation, distorting human capital investment. Income-based repayment (modeled on PAYE, 10% of discretionary income, 20-year term with forgiveness) raises post-bachelor enrollment (from 35% to 42.4%) and home ownership, but adversely sorts lower-ability workers into graduate school via the implicit subsidy and dampens human capital investment through a Ben-Porath labor-supply/tax channel. The assessment is partial equilibrium.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;OLS of outcomes on log cumulative undergraduate debt is biased because unobservables (ability, true family contribution) drive both debt and outcomes. The authors instrument debt with the change during enrollment in an institution-level grant-to-aid ratio = institutional grants/(grants+loans). They use the CHANGE rather than the level (Eq. 2) because students may sort into colleges on the level of grants; mid-enrollment changes are unlikely anticipated. The exclusion concern is that grant-to-aid correlates with unobserved student characteristics affecting outcomes. They address relevance (first-stage F ~22-29; one SD raises grant-to-aid predicts ~18%/$4,250 lower debt) and conduct a balancing test (Table A.2) regressing the instrument on predetermined attributes — only financial need is significant (at 5%), and an F-test fails to reject joint insignificance. A residual threat is that idiosyncratic grant fluctuations could contract graduate slots at the same institution (supply-side); only 3.9% pursue graduate study at their undergrad institution, and splitting by Carnegie research vs. non-research institutions (Table A.8) leaves results intact. Another threat — relocation driving the housing/grad-school substitution — is addressed by re-estimating on 2009 and 2018 (years with state of residence): non-movers are 79% and 64%, and results closely mirror the full sample (Table A.7).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two channels through which debt affects home ownership, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Channel 1 is the traditional wealth effect: debt reduces wealth available for a downpayment, deterring ownership. Channel 2 is an indirect education channel: debt discourages graduate enrollment, pushing graduates into earlier labor-market entry where higher savings and lower balances facilitate earlier purchase, raising ownership for that subgroup. The two nearly cancel, yielding a negligible net effect. Empirically they are distinguished via ability sub-populations (Table 5): the housing response is negative for low-ability students but positive for high-ability students, and high-ability students cut enrollment more in response to debt. The structural model confirms it: for graduates who will not attend graduate school (Table A.10 Panel A), housing responds positively to debt; the substitution is also visible in life-cycle profiles where indebted bachelor holders have higher early ownership that reverses by age 30.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Ability heterogeneity is central. Two proxies are used: high-school grades, and time-to-degree (graduating within four years = high ability, five-plus years = low ability, following Hendricks and Leukhina 2018). High-ability graduates respond more in enrollment to debt; the housing response is positive for high-ability and negative for low-ability graduates (Table 5). In the model, the non-monetary value of graduate school is highly heterogeneous across the income distribution: poorer workers weigh almost only monetary returns, while high-income graduates value graduate school at the equivalent of hundreds of thousands of dollars in lifetime income, and debt shifts this distribution sharply leftward, especially for less wealthy individuals (Fig. 4).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Restricting the instrument sample to institutions with at least 6 observed graduates (preferred spec, dropping 5-10% of obs; robust to alternative cutoffs); a balancing test (Table A.2); relocation/non-mover re-estimation for 2009/2018 (Table A.7); splitting by Carnegie research vs. non-research institutions (Table A.8); testing completion conditional on enrollment (no detectable effect, Table A.6); home value conditional on ownership (insignificant, Table A.9); a binary &amp;rsquo;ever borrowed&amp;rsquo; instrument specification implying smaller income effects (Table A.1); varying max sample age to 23 or 30 (similar results); age-dependent unemployment risk calibration leaving results unaffected; and a gradual house-price-trend exercise (1.4%/yr for 12 years, Table A.17) confirming the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;On earnings, the paper aligns with Rothstein and Rouse (2011), Luo and Mongey (2019), Field (2009), and Alon et al. (2023) showing debt raises initial earnings (their ~$500 per $1,000 is larger than Rothstein-Rouse&amp;rsquo;s ~$200, Luo-Mongey&amp;rsquo;s $70-160, and Alon et al.&amp;rsquo;s ~$210 — attributed to their Great Recession entry cohort and pre-ICL period); the ten-year reversal of ~$1,200 per $1,000 is close to Alon et al.&amp;rsquo;s ~$1,270. On graduate school, it complements Zhang (2013) and Chakrabarti et al. (2023); they find a $10,000 debt increase reduces probability of a post-graduate degree by 3.4%. On home ownership, it contrasts with Mezza et al. (2020), who find ~1pp reduction per $1,000; the null is attributed to sampling — excluding for-profit and two-year programs and dropouts (over one-fourth of US graduates) selects higher-ability, lower-debt individuals for whom the education-substitution channel offsets the wealth channel. The structural contribution extends the initial-conditions/lifetime-inequality literature (Huggett et al. 2011; Griffy 2021) by modeling multiple wealth dimensions and graduate-education choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the structural model add and how well does it fit?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model lets the authors control for ability explicitly and run the &amp;lsquo;ideal&amp;rsquo; regression on simulated data (Table 9): indebted graduates have 0.22% higher earnings per 1% additional borrowing one year out but 0.11% lower ten years out, qualitatively replicating data point estimates within/near the 95% CIs. It fits earnings profiles, enrollment (slightly over a third pursue further education), and home ownership (reaching ~85% by age 50 in model and data). The model attributes excess sensitivity of education to wealth to the amenity value of graduate school operating as a luxury good (parameter xi). Quantitatively, discrete-choice effects are somewhat stronger than data, partly because only one graduate-school type exists and bequests/inter-vivo transfers are omitted, steepening the home-ownership profile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the IBR policy results and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under universal PAYE-style income-based repayment (tau=10% of discretionary income above a threshold, capped at the 10-year Stafford payment, 20-year term with forgiveness), post-bachelor enrollment rises from 35% to 42.4% and home ownership grows (50-plus ownership up &amp;gt;13%), but total retirement wealth rises only ~3% — the ownership gain is mostly a shift from liquid to housing wealth driven by reduced precautionary saving. Enrollment among non-indebted graduates falls from above 60% to ~40% (because the implicit subsidy is decreasing in income), while the most-indebted tercile&amp;rsquo;s enrollment jumps from ~3.5% to ~42%. IBR adversely sorts lower-ability workers into graduate school and dampens human capital investment via a Ben-Porath/proportional-tax channel (consistent with de Silva 2025, Fu et al. 2025). Fiscally, ~4% of individuals (6% of borrowers) get forgiveness averaging &lt;del&gt;$55,000 (&lt;/del&gt;$42,000 net of 24% tax), about $1,700 averaged across the cohort, or ~$20 per half-year period — small enough that behavioral feedback is negligible. SCOPE: the assessment is partial equilibrium, abstracting from general-equilibrium wage, return-to-education, and aggregate-demand adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the earnings effect reverse sign over time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Higher debt (lower net wealth) shifts the trade-off between current and future income: indebted graduates front-load earnings — choosing higher-paying occupations or careers rather than working more hours (labor-supply evidence is weak, Table A.5) — to ease debt payments on current consumption. The &amp;lsquo;smoking gun&amp;rsquo; for the later decline is that debt reduces graduate-school enrollment both short- and long-run, forgoing the ~30% graduate wage premium and reduced human-capital accumulation; the model adds that early career sorting is hard to reverse because re-enrolling entails partial loss of accumulated human capital.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Heterogeneity in Manufacturing Growth Risk</title><link>https://macropaperwarehouse.com/papers/heterogeneity-in-manufacturing-growth-risk/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/heterogeneity-in-manufacturing-growth-risk/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Since the Great Recession, quantifying downside risks to economic activity (rather than only expected outcomes) has become central for policymakers and investors. A large &amp;ldquo;growth-at-risk&amp;rdquo; literature documents that tightening financial conditions sharply raise downside risks to aggregate output while leaving upside potential roughly unchanged (Adrian, Boyarchenko and Giannone, 2019). This paper argues that the aggregate focus misses important structure: aggregate fluctuations can originate from industry-specific shocks, and recessions sharply raise cross-industry dispersion in growth (Bloom, 2014). The authors ask how downside output-growth risk from tight financial conditions differs across U.S. manufacturing industries, and which industry characteristics explain that heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and method.&lt;/strong&gt; They use monthly industrial production (IP) growth for 74 U.S. manufacturing industries at the four-digit NAICS level over January 1973–July 2020 (Federal Reserve G.17; same industry selection as Chang and Hwang, 2015), and the Chicago Fed&amp;rsquo;s National Financial Conditions Index (NFCI) as the financial-conditions gauge. The method is a two-level (multi-level) quantile regression. Level 1 (following Adrian et al., 2019) regresses the τ-th quantile of average h-month-ahead IP growth on the current NFCI and current IP growth, industry by industry, focusing on h=3. Level 2 (inspired by Petersen and Strongin, 1996) regresses the estimated level-1 NFCI quantile coefficients cross-sectionally on standardized, time-invariant industry characteristics (capital, materials, energy, production-labor and overhead-labor intensities; a correlation-based labor-hoarding measure; four-firm concentration ratio; industry size measured by value-added share; and a durability dummy). Inference uses a stationary bootstrap (1,000 replications) that propagates level-1 estimation uncertainty into level 2. Industries split into 45 durables and 29 nondurables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative findings.&lt;/strong&gt; Deteriorating financial conditions hit downside risk far harder than the center or upside of the growth distribution. On average across industries, a one-standard-deviation positive NFCI shock lowers three-month-ahead IP growth by 0.237% at the median and 0.773% at the 5% quantile, and raises the 95% quantile by 0.042%. The average 5% NFCI coefficient is -0.77 across all industries versus -0.31 (linear) and -0.24 (median); 47 of 74 industries (63.5%) have significant 5% coefficients, only 5 (6.8%) have significant 95% coefficients. Durables are about twice as sensitive in the left tail: average 5% coefficients are -0.96 (durables) versus -0.48 (nondurables), with 75.6% of durables versus 44.8% of nondurables significant at 5%. Some industries (computer, aerospace, food, dairy) are essentially unaffected across the whole distribution. The relationship is nonlinear for 46 of 74 industries (62.2%) at the 5% quantile (77.8% of durables, 37.9% of nondurables). Galvao et al. (2018) slope-homogeneity tests reject coefficient equality across industries for lower quantiles. Subsample analysis (1973-84 / 1985-2006 / 2007-2020) shows tail effects strongest in the most recent period (average 5% coefficient -1.38 vs -0.73 and -0.49), weakest during the Great Moderation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Explaining heterogeneity / implications.&lt;/strong&gt; In the all-manufacturing second level, large industries and durable-goods producers have significantly more vulnerable downside growth, while capital-intensive, overhead-labor-intensive, and labor-hoarding industries are less vulnerable. Within durables, size, materials intensity (more vulnerable) and overhead labor intensity (less vulnerable) matter; within nondurables, energy intensity (more vulnerable) and labor hoarding (less vulnerable) matter. Implication: industry-targeted stabilization policy may be more effective than nationwide policy given the heterogeneity, and investors can build industry-rotation strategies less exposed to financial-market shocks.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical/identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy is descriptive-predictive rather than causal. Level 1 estimates industry-specific quantile regressions of average h-month-ahead IP growth on the current NFCI and current IP growth (Koenker-Bassett check-function minimization via the Frisch-Newton interior-point algorithm). Level 2 regresses the estimated NFCI quantile coefficients on standardized industry characteristics via OLS. The key inferential innovation is a stationary bootstrap (Politis-Romano 1994; block length via Politis-White 2004 with Patton et al. 2009 correction, expected block ~36.76 set by the NFCI series) that jointly resamples industry IP and NFCI and feeds level-1 estimation uncertainty into level-2 confidence bands. Main threats: (i) the relationship is associational, not identified as causal — the NFCI is endogenous to the macroeconomy; (ii) generated-regressor problem in level 2 (coefficients are estimates), addressed by the bootstrap; (iii) small cross-sections (45 durables, 29 nondurables, even fewer at the three-digit level) reduce power to detect characteristic effects; (iv) time-invariant characteristics are averaged over varying available windows, abstracting from time variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is nonlinearity established, and against what benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Quantile coefficients are compared to OLS linear coefficients (constant across quantiles) using 95% bootstrap bands generated under a null that the data-generating process is a VAR(4) for the NFCI and IP growth (the Adrian et al. 2019 approach). Quantile estimates falling outside those bands are evidence of nonlinearity. 46 of 74 industries (62.2%) have a 5% coefficient significantly different from OLS; the total manufacturing sector is also nonlinear, mirroring Adrian et al. (2019) for aggregate GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three layers. (1) Durables vs nondurables: durables roughly twice as sensitive in the left tail (avg 5% coefficient -0.96 vs -0.48). (2) Within sectors: e.g. motor vehicles, motor bodies and motor parts have significant 5% coefficients below -2; resin and fiber below -1.5; while computer, aerospace and food are insignificant/unaffected. (3) Across the distribution: strong effects at low quantiles, near-zero at high quantiles (avg 95% coefficient 0.04). Industries with large negative 5% coefficients also tend to have larger positive 95% coefficients (higher conditional volatility under tight conditions), most clearly iron, motor vehicles, fiber and resin — though upside gains are generally smaller than the downside increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Which industry characteristics explain the heterogeneity, and in which direction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All-manufacturing (74 industries): negative effects on lower-quantile NFCI coefficients (i.e. more downside vulnerability) from industry size and durability; positive effects (less vulnerability) from overhead labor intensity, labor hoarding, and capital intensity. Durables: significant negative effect of materials intensity, negative (small) effect of size, positive effect of overhead labor intensity; production labor intensity significant at some higher quantiles. Nondurables: significant negative effect of energy intensity, positive effect of labor hoarding. Energy intensity, production labor intensity and concentration ratio are NOT significant for total manufacturing or durables in the way Petersen-Strongin found for cyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What economic mechanisms are offered for each characteristic effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Size: mean reversion — an industry larger than average is more likely to see growth fall (Braun-Larrain 2005). Durability: durable production is inherently more cyclical (Petersen-Strongin 1996). Labor hoarding / overhead labor: firms retain trained (especially nonproduction) workers due to sunk hiring/training costs (Becker 1962; Oi 1962; Parsons 1986), lowering the incentive to cut production in downturns. Capital intensity: higher fixed-to-variable cost ratio reduces incentive to cut output, and tangible capital provides collateral easing financing (consistent with Braun-Larrain 2005). Materials intensity (durables): higher share of variable costs raises cyclicality; also links to the negative materials-intensity/TFP relation of Baptist-Hepburn (2013).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(i) Additional controls (Gilchrist-Zakrajsek variables: term spread, real federal funds rate, credit spread, excess bond premium, plus extra IP lags) — qualitatively similar, wider bands. (ii) Unobserved heterogeneity via Ando-Bai (2020) interactive-fixed-effects panel quantile model (one common factor optimal) — highly similar. (iii) Alternative NAICS disaggregation: three-digit (21 industries; capital intensity dropped for multicollinearity; only labor hoarding and durability significant) and six-digit (101 industries; more characteristics significant, including production labor intensity and concentration ratio). (iv) Longer horizons h=6 and h=12 — qualitatively similar but weaker/less significant as horizon lengthens. (v) Subsample analysis of both the growth-risk coefficients and the characteristic construction windows (1973-84, 1985-2006, 2007-2020; and start dates 1958/1973/1987) — effects relatively stable; size and labor-hoarding effects weaken in recent periods while overhead labor and durability stay significant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this relate to and differ from Petersen and Strongin (1996) and Adrian et al. (2019)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends Adrian et al. (2019) from aggregate to industry-level growth-at-risk, documenting substantial cross-industry variation that is invisible at the aggregate level — to the authors&amp;rsquo; knowledge the first disaggregate growth-at-risk study. It extends Petersen-Strongin (1996), who used a linear cyclicality framework, by allowing a flexible/nonlinear quantile relationship specifically with financial conditions. Findings broadly echo Petersen-Strongin for downside risk (materials intensity most important in durables; labor hoarding for nondurables — their only significant nondurable effect), but deviate by NOT finding energy intensity, production labor intensity, or concentration ratio significant in durables, and by adding size and capital intensity (cf. Braun-Larrain 2005) as relevant for total manufacturing. The agreement is attributed to business and financial cycles being closely intertwined (Claessens et al. 2012).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because vulnerability is highly heterogeneous, industry-level stabilization policy may be more effective than nationwide policy (OECD 2003), and policies can be targeted using the signalling characteristics (size, durability, materials/energy intensity vs capital/overhead-labor intensity and labor hoarding). Investors can build industry-rotation strategies less exposed to financial shocks. Scope conditions: evidence is U.S. manufacturing only, associational not causal, conditional on the NFCI as the financial-conditions measure, strongest at the three-month horizon and in the post-2007 subsample, and characteristic effects rest on relatively small cross-sections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there caveats the authors themselves flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes: after splitting into durables/nondurables, fewer characteristic effects are significant, which the authors attribute to smaller cross-sections rather than absence of effects; the two-level model is estimated sequentially (two-step) not simultaneously; characteristics are treated as time-invariant averages (justified by stable cross-industry rankings, though production labor intensity shows a downward trend); and upside potential, while present, is generally smaller than the increased downside risk.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Growth-at-risk / downside growth risk&lt;/strong&gt;: The lower-quantile (e.g. 5%) of the conditional distribution of future output growth given current conditions; here the 5% quantile of average three-month-ahead industry IP growth conditional on the NFCI, capturing how bad growth could plausibly get under tight financial conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multi-level quantile regression&lt;/strong&gt;: The authors&amp;rsquo; two-step procedure: level 1 estimates industry-specific quantile regressions of future IP growth on the NFCI and current IP growth; level 2 regresses the estimated NFCI quantile coefficients cross-sectionally on industry characteristics, with a bootstrap carrying level-1 uncertainty into level-2 inference.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;NFCI (National Financial Conditions Index)&lt;/strong&gt;: Chicago Fed weekly index of U.S. money, debt, equity, and (shadow) banking conditions built from a large dynamic factor model; positive values mean tighter-than-average financial conditions, negative values looser-than-average. Averaged to monthly here.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor hoarding&lt;/strong&gt;: Retention of employees during downturns because of sunk search, hiring and training costs; measured here as the negative correlation between changes in materials usage and changes in production-worker hours (a value of -1 = no hoarding), so higher values indicate more hoarding and predict less cyclical, less vulnerable growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overhead labor intensity&lt;/strong&gt;: Cost of nonproduction (overhead) labor relative to value added. Because nonproduction workers embody more firm-specific investment, they are more subject to labor hoarding, so overhead-labor-intensive industries have less vulnerable downside growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Durable vs nondurable goods sector&lt;/strong&gt;: Federal Reserve classification (45 durable, 29 nondurable industries here). Durable-goods production is more cyclical and, in this paper, about twice as sensitive in the left tail of the growth distribution to adverse financial conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Slope homogeneity test&lt;/strong&gt;: Galvao et al. (2018) Swamy-type and standardized Swamy-type tests for a quantile-regression fixed-effects panel, used to formally reject equality of NFCI quantile slopes across industries, especially at lower quantiles.&lt;/p&gt;</description></item><item><title>How Does Public Sector Employment Affect Household Saving Rates? Evidence from China</title><link>https://macropaperwarehouse.com/papers/how-does-public-sector-employment-affect-household-saving-rates-evidence-from-china/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-does-public-sector-employment-affect-household-saving-rates-evidence-from-china/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The paper asks whether and why the type of employment — specifically public-sector employment — affects household saving rates in China. This matters because Chinese household saving rates are extraordinarily high in international comparison (the paper reports an average gross household saving rate of roughly 35% in China versus only about 5% in OECD countries over the period considered), and the high rates remain a puzzle. Household saving feeds investment and long-run growth, its cyclicality can amplify or dampen crises, and via the &amp;ldquo;global saving glut&amp;rdquo; hypothesis Chinese saving has financed global imbalances and the US current account deficit. Prior literature on Chinese saving emphasizes economic transition, income growth/uncertainty, demographics (one-child policy), and culture, but neglects the role of employment type. Notably, the international finding (e.g., Bettoni and Santos, 2021, calibrated on Brazilian data) is that public employment REDUCES saving because of lower job/income uncertainty and higher compensation, so less precautionary saving. China appears to run the opposite way.&lt;/p&gt;
&lt;p&gt;Data and strategy: Micro-level longitudinal data from the China Household Finance Survey (CHFS), a nationally representative survey covering 29 provinces (excludes Tibet, Xinjiang, Inner Mongolia). The authors use the 2013, 2015, and 2017 waves, restrict to urban households whose head is aged 16-60, and restrict the non-public control group to those with an above-one-year labor contract. The final sample is 5,539, 5,785, and 4,545 observations per wave (15,869 total; 25.18% public-employed). The saving rate is defined as (income minus consumption)/income, with the sample restricted to saving rates above -200% to remove extreme values. Crucially, SOE employees are classified as NON-public (following You and Zhang, 2016) because post-1990s SOE reform made them market players. Public employees = government workers (about 20% of public employees) plus Shiyedanwei (fiscally-financed public institutions: education, health, research). The empirical toolkit: (1) Correlated Random Effects (CRE) panel regressions with rich controls, plus IV-CRE using the head&amp;rsquo;s CPC membership as instrument; (2) Propensity Score Matching (one-to-one, k-nearest neighbor, radius, kernel) and a PSM-CRE panel model; (3) Heckman two-step treatment-effects model for self-selection; (4) a within-household differences estimator exploiting employment transitions; (5) life-cycle interaction analysis.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: Public-employed households save more, by roughly 3 to 8 percentage points depending on method and sample. Raw descriptive gap: mean/median saving rates are 23.16%/33.89% for public vs. about 5.6 and 4.8 pp lower for non-public. Baseline CRE: the public-employment dummy adds 3.589 pp (col 1); each additional public-employed member adds 2.028 pp (col 3). IV-CRE coefficients rise to 8.094 and 4.878 (significant only at 10%; first-stage F = 38.65 and 49.68). PSM cross-sectional ATEs are about 5-8 pp (mostly significant at 1%). PSM-CRE: 3.928 pp. Heckman: 3.557 pp, with an insignificant inverse Mills ratio (so self-selection is not driving the result). Employment-transition (within-household): households switching from non-public to public raise their saving rate by 14.245 pp relative to non-switchers (135 transitioning vs. 1,831 stable households). Life-cycle: the public-employment x age interaction is negative; the saving-rate gap is significant for heads roughly aged 24-38 (strongest for the young/middle-aged), with a U-shaped age-saving profile turning around age 35-40. Robustness on the definition of &amp;ldquo;public&amp;rdquo;: holding Bianzhi raises saving by 8.5 pp; broadening to include SOEs gives 4.5 pp.&lt;/p&gt;
&lt;p&gt;Mechanisms and implications: The saving rate reflects both motive and capacity. On motives, public-employed households save more for children&amp;rsquo;s education (about 25% report saving for education/training vs. 19% non-public; 16.2% plan to send children to study abroad vs. 12.9%) and inheritance (about 16% vs. 11.4%); heterogeneity shows the effect is concentrated in one-SON households (Wei-Zhang competitive saving) and in households with high education-expense shares. On capacity, better social security coverage reduces public employees&amp;rsquo; out-of-pocket expenditure needs (e.g., negative food-income interaction) and frees disposable income for saving; social-security interaction terms are negative, indicating public employment&amp;rsquo;s effect is dampened where social security is already held. Policy implication: changes to the public-employment share affect aggregate household saving, and reducing the benefit/guarantee disparity between public and non-public jobs could lower the high saving of public-employed households. Scope: results are Chinese institution- and culture-specific, possibly extendable to other East Asian Confucian societies, and may erode as ongoing public-sector reforms cut public employees&amp;rsquo; benefits.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core empirical claim and how large is the effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Households headed by a public employee have higher saving rates than non-public-employed households, by approximately 3 to 8 percentage points depending on method and sample. Point estimates: baseline CRE 3.589 pp (dummy) and 2.028 pp per additional public-employed member; PSM-CRE 3.928 pp; Heckman 3.557 pp; PSM cross-sectional ATEs about 5-8 pp; IV-CRE 8.094/4.878 pp (only 10% significant).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three threats are addressed: (1) confounders affecting both employment choice and saving (education, risk aversion, financial literacy, social security) — handled with rich CRE controls; (2) endogeneity/reverse causality (households with strong saving desire may sort into a sector) — handled with IV using the head&amp;rsquo;s CPC membership; (3) self-selection into public jobs — handled with PSM and a Heckman two-step treatment-effects model. The within-household employment-transition estimator further nets out fixed household characteristics. Main residual threat: the IV&amp;rsquo;s exclusion restriction cannot be formally tested (just-identified, instruments do not exceed endogenous variables); the authors argue CPC membership is plausibly excludable since many students join the CPC before graduation and many CPC members work in the private sector. The Heckman IMR is insignificant, indicating self-selection is not the driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the instrument (CPC membership) argued to be valid?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Relevance: about 3 in 10 public employees are CPC members vs. 1 in 10 private employees; first-stage F-statistics are 38.65 and 49.68, well above weak-instrument thresholds. Exogeneity (argued, not tested): no direct channel from CPC membership to saving decisions because many college students join the CPC and many members work in private sectors. The orthogonality (third) condition cannot be tested due to just-identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two main mechanisms, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Saving motive and saving capacity. Motive: from the 2013 CHFS bank-deposit-purpose question and study-abroad plans, public-employed households more often save for children&amp;rsquo;s education (about 25% vs. 19%), inheritance (about 16% vs. 11.4%), health (10.25% vs. 8.49%), and housing (15% vs. 13.78%). Capacity: better social security reduces expenditure needs and frees disposable income — shown by consumption regressions (negative public-employment x income interaction for food, positive for education/travel/luxury) and by social-security interaction terms that are negative and by smaller public-employment coefficients in the with-social-security subsample. The two are distinguished by combining stated-motive data with consumption-category and social-security interaction analyses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Life-cycle: the saving gap is significant and strongest for heads aged about 24-38 (young/middle-aged) and narrows with age; the public-employment x age interaction is negative. (2) Child gender: the positive effect comes primarily from one-SON households (one-son public coefficient 6.067 significant; one-daughter insignificant; interaction with son gender 5.872), consistent with Wei-Zhang competitive/marriage-market saving. (3) Education-expense share: the effect is larger for households spending a higher share on children&amp;rsquo;s education (above-median 7.536 vs. below-median 4.471). (4) Definition of public sector: Bianzhi holders 8.5 pp; including SOEs 4.5 pp.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) IV-CRE to address endogeneity. (2) Alternative saving-rate measures: winsorizing at the bottom 1% instead of the -200% cutoff, and a log(income)-log(consumption) definition (saving relative to consumption); the positive effect holds (CRE 0.043, PSM-CRE 0.243). (3) Alternative thresholds (-100%, -300%) give similar results. (4) Different scopes of &amp;lsquo;public sector&amp;rsquo; (Bianzhi-only narrow; SOE-inclusive broad). (5) Regressing each saving-motive dummy on public employment plus controls to avoid being misled by raw means. (6) Number-of-public-members measure as an alternative to the head dummy. (7) Multicollinearity checked via correlation matrix; regressions without singletons reportedly robust.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contrasts directly with Bettoni and Santos (2021), who (using Brazilian micro data) find public employment LOWERS saving via reduced precautionary motive. This paper finds the opposite for China and argues the precautionary channel is only part of the story; Chinese-specific cultural factors (Confucian social status, competitive saving for sons, status investment in children) and capacity effects (better social security freeing disposable income) dominate. It complements He et al. (2018), who use SOE reform to document precautionary saving, and Lugauer et al. (2019) and Chen et al. (2019) on dependent children and social norms. Methodologically it extends the Chinese saving literature by foregrounding employment type, a political/occupational dimension prior work largely neglected.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the employment-transition (within-household) result show and what is its caveat?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Households whose head switches from non-public to public employment raise their saving rate by 14.245 pp relative to non-public households without a transition. This nets out time-invariant household characteristics, supporting causality. Caveat: the transition sample is small (135 transitioning households vs. 1,831 stable), and the coefficient is much larger than cross-sectional estimates, so it should be read as directional confirmation rather than a precise magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Changes in the public-employment share will affect aggregate household-sector saving; policymakers wishing to lower China&amp;rsquo;s high saving could reduce the benefit/guarantee disparity between public and non-public jobs. Scope conditions: results are specific to Chinese institutions and Confucian culture, may extend to other East Asian societies, and may weaken over time as ongoing public-sector reforms cut public employees&amp;rsquo; benefits, shrinking the public/non-public gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the stated limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) External validity is limited by Chinese-specific institutional and cultural settings, though possibly applicable to similar East Asian cultures. (2) Ongoing reduction of public employees&amp;rsquo; benefits through public-administration reform may change saving behavior and reduce the documented gap over time. The dataset also covers only employed heads aged 16-60, so it does not capture post-retirement saving behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the control variables show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Higher household assets reduce the saving rate; higher income percentiles raise it (monotonically); male-headed households save more; a U-shaped age profile (low around middle age 35-40); high-school education lowers saving while university education is insignificant; larger household size, being married, and more dependent children all reduce saving; risk aversion raises saving while risk-loving and financial literacy are insignificant. In the Heckman first-stage probit, higher education, CPC membership, and risk aversion raise the probability of public employment, and the mother&amp;rsquo;s (not father&amp;rsquo;s) education and CPC membership significantly predict the head&amp;rsquo;s public employment.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Public employee (paper&amp;rsquo;s definition)&lt;/strong&gt;: In this paper, employees who work directly for central/local government (about 20% of public employees) plus those in Shiyedanwei (fiscally-financed public institutions such as education, health, and research). SOE employees are deliberately EXCLUDED and classified as non-public, because post-1990s SOE reform made them resemble market players rather than public-sector actors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shiyedanwei&lt;/strong&gt;: Public institutions and state organs mainly financed by fiscal spending (e.g., schools, hospitals, research institutes). Their staff are counted as public employees in this study, with relatively low unemployment risk and higher compensation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bianzhi&lt;/strong&gt;: The authorized number of established posts/personnel in government and its affiliated institutions (per Brodsgaard, 2002). Employees holding Bianzhi are fully fiscally dependent — employment and wage guaranteed by the government — and thus the most secure subgroup of public employees; their saving-rate premium is the largest (8.5 pp).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Saving capacity vs. saving motive&lt;/strong&gt;: The paper&amp;rsquo;s framing that a household&amp;rsquo;s saving rate is jointly determined by the desire to save (motive: education, inheritance, status) and the ability to save (capacity: how much disposable income is freed after needs, raised by better social security that lowers expenditure needs).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Iron rice bowl&lt;/strong&gt;: The pre-reform notion of guaranteed lifetime job security in state employment; invoked to explain why public-sector jobs in China historically carried very low unemployment risk, a status partially eroded by SOE reform for SOE workers (but retained by core public employees).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Correlated Random Effects (CRE) model&lt;/strong&gt;: A Mundlak (1978) random-effects specification that adds time-averages of time-varying regressors, allowing correlation between explanatory variables and the unobserved individual effect; chosen over fixed effects because employment type varies little within households across waves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competitive saving motive&lt;/strong&gt;: The Wei-Zhang (2011) idea that households with a son save more to improve his marriage-market competitiveness amid China&amp;rsquo;s high male sex ratio. The paper finds this motive is concentrated among public-employed one-son households.&lt;/p&gt;</description></item><item><title>Inflationary Household Uncertainty Shocks</title><link>https://macropaperwarehouse.com/papers/inflationary-household-uncertainty-shocks/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/inflationary-household-uncertainty-shocks/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Macro-uncertainty is widely believed to depress activity, but existing measures are tied to financial markets, professional forecasters, or economic policy, while a key transmission channel runs through households&amp;rsquo; propensity to consume, save, and work. Direct, macro-usable measures of household uncertainty are scarce. Ambrocio asks whether household uncertainty shocks behave like the negative demand shocks documented for the US (Leduc and Liu, 2016), and finds they do not in Europe.&lt;/p&gt;
&lt;p&gt;Data and measurement: The paper builds a novel household uncertainty index (HUN) from the European Commission&amp;rsquo;s harmonized consumer survey, defined as the average fraction of &amp;ldquo;Don&amp;rsquo;t know&amp;rdquo; responses across the four forward-looking questions used to construct the pre-2019 Consumer Confidence Indicator (general economic situation, unemployment, household financial position, likelihood to save). The survey is monthly, covers all EU member states (and candidates), averaging over 40,000 households per month, conducted in the first two to three weeks of each month. HUN is constructed for January 2002 to December 2019. On average 3-6% of Euro area households respond &amp;ldquo;Don&amp;rsquo;t know&amp;rdquo; per round; at the national level the range runs from 2 to over 10 percent (e.g. Spain, France, Italy). HUN is standardized so 100 = mean and 10 points = one standard deviation. The Euro area HUN peaks around EU enlargement, the Global Financial Crisis, the European Sovereign Debt Crisis, and Brexit.&lt;/p&gt;
&lt;p&gt;Empirical strategy: Following Leduc and Liu (2016), the author estimates monthly VARs with an uncertainty measure, unemployment, inflation, and the short rate, three lags, Bayesian estimation with Minnesota priors (ECB BEAR toolbox). Shocks are identified recursively with uncertainty ordered first, justified by the early-month survey timing and household inattention.&lt;/p&gt;
&lt;p&gt;Main findings (with magnitudes/signs/scope): (1) For the Euro area, household uncertainty shocks are inflationary, with a delayed rise in unemployment only after about 20 months. By contrast, financial (Eurostoxx-50 implied volatility, IVOL) uncertainty shocks resemble negative demand shocks (raise unemployment, lower inflation), and policy (Baker-Bloom-Davis EPU) shocks have ambiguous inflation effects. (2) FEVDs: household or financial uncertainty shocks each account for about 20% of inflation forecast-error variance at roughly a 4-year horizon (policy uncertainty substantially less); household shocks account for about 10% of unemployment variation, financial and policy 20-30%. (3) Counterfactuals zeroing out the monetary-policy response to uncertainty: cumulated 48-month inflation IRF for HUN moves from 2.02 (baseline) to 1.66 (still inflationary); EPU from -0.79 to 0.68 (becomes inflationary); IVOL from -2.66 to -1.33 (less deflationary) - indicating monetary policy responds to financial/policy but not household uncertainty. (4) Cross-country (17 Euro-area countries excluding Ireland and Malta plus 8 non-Euro-area), cumulated 48-month inflation responses range from nearly 6% deflation (Lithuania) to over 12% inflation (Bulgaria); deflationary in Austria, Finland, Portugal, inflationary in Italy, Spain, Sweden. The cross-country inflation response correlates positively and significantly with average markups (De Loecker and Eeckhout, 2020; 13 countries, 2002-2016), regression slope ~1.86, robust to labor-market, institutional, and economic-structure controls.&lt;/p&gt;
&lt;p&gt;Mechanism and implications: Results support a pricing-bias (precautionary pricing) channel: under nominal rigidities and monopolistic competition, firms raise prices when uncertainty rises because under-pricing is more costly than over-pricing. A calibrated New Keynesian model (Rotemberg pricing, third-order perturbation) matching country markups reproduces the deflationary-to-inflationary range for supply-side uncertainty; varying price rigidity and the monetary-policy response to uncertainty can jointly generate inflationary household and deflationary financial uncertainty shocks. Supply-side (productivity-volatility) uncertainty matches the data features better than demand-side uncertainty.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Recursive (Cholesky) identification in monthly VARs with the uncertainty measure ordered first, justified because the consumer survey is conducted in the first two weeks of the month (so contemporaneous monthly movements in other variables plausibly cannot affect HUN) and because households are inattentive and under-react to news. The main drawback is the assumption that the uncertainty measure is not contemporaneously affected by other shocks. The author argues monthly data mitigates this (Carriero et al., 2021, find limited contemporaneous feedback to uncertainty at this frequency) and shows results are robust to ordering uncertainty last and to the Carriero et al. (2021) time-varying-volatility identification (which allows uncertainty to respond contemporaneously). He also notes the recursive scheme can be read as a proxy-SVAR with the first variable as instrument, yielding more conservative (attenuated) impulse responses than a proxy SVAR.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central mechanism is the pricing bias (precautionary pricing) channel under nominal rigidities and monopolistic competition: firms set higher prices when uncertain because ending up with too-low a price (selling more at thin margins) is costlier than too-high a price. This is distinguished from the standard precautionary-savings/negative-demand interpretation. Empirically: (i) household uncertainty is inflationary while financial uncertainty is deflationary; (ii) the cross-country inflation response correlates positively and significantly with average markups - the key comparative-static predicted by theory (elasticity of substitution governs markups); (iii) counterfactual VARs show monetary policy response, not the measure itself, drives part of the sign difference. The NK model then confirms only supply-side (not demand-side) uncertainty generates the observed positive markup-inflation relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Large cross-country heterogeneity: cumulated 48-month inflation responses range from nearly 6% deflation (Lithuania) to over 12% inflation (Bulgaria); deflationary in Austria, Finland, Portugal and inflationary in Italy, Spain, Sweden. Splitting into core / periphery / non-Euro-area shows little difference in average response; geographically, Southern European responses are marginally higher than Northern. The cross-country variation is well explained by average markups: a regression of the cumulated inflation IRF on markups yields a positive slope (~1.86, significant) and country-group dummies are insignificant once markups are controlled for.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Ordering uncertainty last - results virtually unchanged. (2) Carriero et al. (2021) time-varying-volatility identification - household uncertainty still inflationary. (3) Adding consumer sentiment (CSI) to the VAR - sentiment acts like a positive demand shock (lower unemployment, higher inflation), HUN remains inflationary, so results are not driven by first-moment sentiment. (4) A VAR with all three uncertainty measures (IVOL, EPU, HUN) - HUN still inflationary; policy uncertainty becomes inflationary in this setup. (5) Replacing the short rate with the Wu-Xia (2016) shadow rate to capture unconventional policy - results hold. (6) Adding linear trends and month-specific (seasonal) intercepts - results hold. (7) Alternative HUN built only from the two macro questions (HUN-Macro) and common-factor versions (HUN-F10, HUN-F16) - still inflationary. (8) Household belief dispersion (DIS) shocks instead of HUN are mildly deflationary, distinguishing uncertainty from disagreement. (9) Markup regressions remain significant controlling for labor-market, institutional-quality, and economic-structure variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It directly contrasts with Leduc and Liu (2016), who use the Michigan Consumer Survey and find US household uncertainty shocks resemble negative demand shocks (higher unemployment, lower inflation); here European household uncertainty shocks are inflationary. The inflationary result aligns with Mumtaz et al. (2018) (US state-level) and Mumtaz and Theodoridis (2015) (US shocks on the UK), while Carriero et al. (2018) find no significant price effect for the US. It builds on the pricing-bias literature (Born and Pfeifer, 2014, 2021; Fernandez-Villaverde et al., 2015; Bianchi et al., 2018) and on multi-source-uncertainty models. Relative to Bianchi et al. (2018), who find supply-side uncertainty deflationary and demand-side neutral under low price rigidity, this paper&amp;rsquo;s baseline (price duration over 3 quarters, calibrated shock volatilities) yields both demand- and supply-side uncertainty inflationary; their result is recoverable under low rigidity. The HUN measure newly exploits an under-explored source (households) with long time and broad country coverage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The monetary-policy response to uncertainty matters for whether an uncertainty shock is inflationary or deflationary: counterfactuals show that when policy does not respond to household uncertainty it stays inflationary, while financial and policy uncertainty (to which policy does respond) shift toward inflation when that response is removed. In the model, very small monetary-response coefficients to uncertainty are sufficient to flip the sign (a_vb=0.0002 yields near-zero, 0.0004 yields about -1.1% deflation, against a 1.37% baseline). Scope conditions: results are specific to Europe / the Euro area&amp;rsquo;s common monetary policy; the counterfactual is subject to the Lucas critique (assumes the policy change is small enough not to alter agents&amp;rsquo; behavior); and the paper explicitly does NOT evaluate whether monetary policy should respond - optimal policy is left for future research, noting that raising rates under uncertainty aggravates the output decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the New Keynesian model add and how is it calibrated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A basic NK model with habit-forming risk-averse households, monopolistically competitive firms with Rotemberg price-adjustment costs, productivity (supply-side) and preference (demand-side) stochastic-volatility shocks, and a Taylor rule that can respond to uncertainty. The elasticity of substitution is calibrated to match average markups (baseline Euro area, eta=3.13; range Portugal-to-Italy 1.84-8.82 markups); baseline price stickiness matches a Calvo price duration of just over 3 quarters; shock-volatility variances are calibrated to match the VAR cumulated inflation IRF. Solved by third-order perturbation; IRFs are generalized impulse responses at the stochastic steady state (500-quarter burn-in). Findings: markup variation generates a wide deflationary-to-inflationary range for supply-side uncertainty (matching Italy high / Finland low) but not for demand-side; inflation responses are hump-shaped in price rigidity, with low rigidity giving deflationary supply / inflationary demand shocks and high rigidity reversing this; supply-side uncertainty better matches the markup-inflation correlation, suggesting HUN proxies uncertainty about productive capacity rather than relative consumption desires.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the notable caveats and limitations the author flags?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(i) The Rotemberg-vs-Calvo choice is not innocuous: Oh (2020) shows Rotemberg costs make uncertainty shocks more deflationary, so a Calvo model would likely be even more inflationary. (ii) The counterfactual monetary-policy exercise is subject to the Lucas critique. (iii) The empirical link between price rigidity and inflationary responses across countries is not tested - left for future research. (iv) The model has simple financial and labor markets; labor-market frictions known to matter for uncertainty transmission are abstracted from. (v) Some country HUN indices (Cyprus, Lithuania, Slovakia) may have unaddressed structural breaks. (vi) Cross-country markup regressions have only 13 observations, creating degrees-of-freedom limits in the slope-interaction specifications. (vii) HUN correlates positively (about 0.49) with the new European Commission uncertainty index and shows no detected structural break from the 2019/2021 survey-question change.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Household uncertainty index (HUN)&lt;/strong&gt;: A survey-based measure equal to the average fraction of respondents answering &amp;lsquo;Don&amp;rsquo;t know&amp;rsquo; across the four forward-looking questions (general economic situation, unemployment, household finances, likelihood to save) of the European Commission harmonized consumer survey; interpreted as households&amp;rsquo; uncertainty about the economy, and argued to proxy supply-side (productive-capacity) uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pricing bias (precautionary pricing) mechanism&lt;/strong&gt;: The transmission channel whereby firms in monopolistically competitive markets with nominal rigidities raise prices under higher uncertainty, because ending up with a too-low price (large volume, thin margins) is more costly than a too-high price; this makes uncertainty shocks inflationary, amplified by stronger nominal rigidities and higher markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inflationary vs. deflationary uncertainty shock&lt;/strong&gt;: In this paper, household uncertainty shocks raise inflation (inflationary) whereas financial (IVOL) uncertainty shocks lower it like negative demand shocks (deflationary); the sign depends on the relative strength of the pricing-bias channel versus precautionary savings and on whether monetary policy responds to that source of uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual monetary-policy IRF&lt;/strong&gt;: Impulse responses computed by zeroing out the direct (contemporaneous and lagged) response of the policy-rate equation to uncertainty in an estimated recursive VAR (Bachmann-Sims, Kilian-Lewis), isolating how much of the inflation response is attributable to the systematic monetary-policy reaction to that uncertainty source.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply-side vs. demand-side uncertainty&lt;/strong&gt;: In the NK model, demand-side uncertainty is a shock to the volatility of preference shocks and supply-side uncertainty a shock to the volatility of productivity shocks; only supply-side uncertainty reproduces the empirical positive markup-inflation correlation, leading the author to interpret HUN as closer to supply-side uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Disagreement (DIS) vs. uncertainty&lt;/strong&gt;: DIS is the average cross-household dispersion of survey views (a measure of disagreement/polarization), distinct from HUN (frequency of &amp;lsquo;Don&amp;rsquo;t know&amp;rsquo;); the two are negatively correlated, and DIS shocks are mildly deflationary, paralleling Born et al. (2020a)&amp;rsquo;s distinction between belief dispersion and forecast-error uncertainty.&lt;/p&gt;</description></item><item><title>Information Transparency of Firm Financing</title><link>https://macropaperwarehouse.com/papers/information-transparency-of-firm-financing/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/information-transparency-of-firm-financing/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Noël and Sun build an information-based theory of capital structure designed to explain the diversity of observed firm financing behavior and the coexistence of distinct optimal financial contracts. The motivating puzzle is that real-world financing methods (external equity, corporate bonds/bank loans, business credit lines/cards) differ systematically in how much firm-specific information investors require — equity and rated debt are &amp;ldquo;transparent&amp;rdquo; with firm-specific terms, while credit lines have general qualification standards and common interest rates. The paper asks three questions: what drives a firm&amp;rsquo;s optimal financing choice, why do equity, transparent debt, and opaque debt coexist as optimal contracts, and what is a firm&amp;rsquo;s optimal debt-to-equity ratio.&lt;/p&gt;
&lt;p&gt;This is a pure theory paper (no data or sample period). The model has a continuum of ex-ante heterogeneous firms, each with internal funds n (support [0, ī]), productivity θ, and survival/success rate α, all i.i.d. With investment i, output is θ·min[i,ī] with probability α and 0 with probability 1−α. The model nests two information problems: (1) adverse selection over a firm&amp;rsquo;s quality (α, θ), which a costly verification technology can reveal at cost γ &amp;gt; 0; and (2) an ex-post agency problem, since a firm can hide output and auditing recovers only a fraction σ ∈ (0,1) of hidden output. Internal funds n are public. Firms choose among four options: opaque contract, separating contract, transparent contract, or self-funding. Investors are risk-neutral with outside storage return r &amp;gt; 0. Assumption 1 (αθ̲ &amp;gt; 1+r &amp;gt; σᾱθ̄) ensures all projects are worth investing and all firms prefer some external financing.&lt;/p&gt;
&lt;p&gt;Main results (proved as a unique perfect Bayesian equilibrium):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Three contract types arise endogenously: equity (investors get a fraction of output / ownership, payout depends on θ), transparent debt (firm-specific interest rate (1+r)/α reflecting survival rate), and opaque debt (common interest rate (1+r)/αΩ). The transparent contract is implementable by either equity or transparent debt when n ≤ nT(αθ); only transparent debt when n &amp;gt; nT(αθ).&lt;/li&gt;
&lt;li&gt;The separating (signaling without costly verification) contract does NOT survive for any firm except possibly the lowest type (α̲, θ̲); even that type is strictly better off pooling on opaque debt.&lt;/li&gt;
&lt;li&gt;The unique equilibrium has θΩ = θ̲ and αΩ = E[α] (existence requires verification cost condition (26): γ/(σᾱθ̲ī) ≥ (1−σ)θ̲(ᾱ−E[α])/(1+r−σθ̲E[α])). It is either pooling on opaque debt or mixing (transparent + opaque), never pooling on transparent. There is a threshold cost γ̄ ∈ (0,∞) above which the transparent set is empty and the equilibrium becomes pooling.&lt;/li&gt;
&lt;li&gt;Firm characteristics drive choice: all firms with αθ ≤ θ̲·E[α] use opaque debt regardless of internal funds; transparent contracts require sufficiently high quality satisfying condition (27) AND intermediate internal funds. Firms with n ∈ [n1(α,θ), nT(αθ)] are indifferent between equity and transparent debt; those with n ∈ (nT(αθ), n2(α,θ)] strictly prefer transparent debt; very low or very high n firms use opaque debt.&lt;/li&gt;
&lt;li&gt;Partial capital structure irrelevance: only a strict subset of firms (those satisfying (27) with n ∈ [n1, nT(αθ)]) are indifferent between equity and transparent debt (a Modigliani-Miller equivalence within an asymmetric-information setting).&lt;/li&gt;
&lt;li&gt;Debt weakly dominates equity: debt implements the optimal contract for all firms; equity does so only for the strict subset above. The optimal debt-to-equity ratio is not a smooth function of internal funds and need not be unique (a continuum is optimal for indifferent firms). The theory reconciles the conflicting empirical evidence of Myers (2001) (equity issues minor, mostly debt, across broad U.S. firms) versus Frank and Goyal (2003) (equity significant, often exceeding investment, for publicly-traded firms).&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model environment and the two layers of information frictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A continuum of ex-ante heterogeneous firms, each with public internal funds n ∈ [0, ī] and private quality (α, θ): productivity θ and survival/success rate α. Output is θ·min[i, ī] with probability α and 0 otherwise. Friction 1 is adverse selection over (α, θ), resolvable only via a costly verification technology (cost γ &amp;gt; 0) used before contracting. Friction 2 is an ex-post agency/moral-hazard problem: a firm can hide actual output, and auditing recovers at most a fraction σ ∈ (0,1) of hidden output — so the contract must induce truthful reporting. Investors are risk-neutral with storage return r &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the separating (signaling) contract collapse in equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A separating contract must satisfy two incentive-compatibility constraints simultaneously: the financing firm&amp;rsquo;s own truthful-output-reporting constraint (identical to the transparent contract&amp;rsquo;s IC), AND a constraint that no other firm type wants to mimic it. Proposition 3 proves the first constraint makes the second impossible to uphold for all firms except possibly the lowest type (α̲, θ̲). Firms with lower expected quality but higher actual productivity (θ̃ ≥ θ) want to mimic at low funds; higher-risk firms (α̃ &amp;lt; α) want to mimic at high funds. Since any optimal separating contract is also an optimal transparent contract minus the cost γ, any firm that could separate would never use the costly transparent contract — but no firm can successfully separate. Even the lowest type prefers opaque debt (Proposition 7), so no separating contract is used in equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the opaque contract necessarily debt and never equity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With opaque financing investors do not learn firm quality. A binding incentive-compatibility constraint reduces to zO = σθΩ·iO, and the participation constraint (which binds for all n &amp;lt; ī) gives payout zO = ((1+r)/αΩ)·(iO − n) — a fixed general interest rate (1+r)/αΩ on external funds. This is a debt contract. Equity is impossible because investors cannot be convinced to take ownership shares of output without firm quality being revealed to them. Opaque debt resembles a business line of credit: general qualification standards (Assumption 1) and a common interest rate reflecting E[α], independent of firm-specific information.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;When are equity and transparent debt equivalent, and what distinguishes the information each reveals?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For firms with n ≤ nT(αθ), both the firm&amp;rsquo;s IC constraint (2) and investors&amp;rsquo; participation constraint (3) bind. The optimal transparent contract is then implementable equivalently by equity (payout = a fraction of output, depends on θ) or transparent debt (firm-specific interest rate (1+r)/α, depends on α). This is a Modigliani-Miller-style equivalence obtained under asymmetric information. Conditional on survival, equity investors care about θ (commercial information — technology, product lines, outlook), while transparent-debt investors care about α (creditworthiness — financial condition), matching real-world distinctions between equity due diligence and credit-rating/bank scrutiny. The equivalence holds even if verifying α and θ costs differently, as long as both constraints bind.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity in financing behavior does the model generate (cross-section)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Per Table 1 and Theorem 1: (a) Equity users have high quality (αθ), are lower-intermediate in internal funds (n ∈ [n1(α,θ), nT(αθ)]), reveal both α and θ, and have the highest financial leverage. (b) Transparent-debt users have high quality, intermediate funds, reveal α and θ, with firm-specific interest rate reflecting α. (c) Opaque-debt users span all quality types and all funds levels (often very low or very high funds), reveal only general information (E[α], θ̲), face a common interest rate, and have lower leverage. Better-quality but funds-constrained firms are most likely to use transparent financing; firms with αθ ≤ θ̲E[α] always use opaque debt regardless of funds, masking inferior quality by pooling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What dynamic firm-financing patterns can the (static) model rationalize?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors interpret each capital-structure decision as a reaction to updated (n, α, θ). They reconcile: (1) startups using equity (high αθ, low n relative to capacity); (2) share buybacks (rising n moving a firm from the equity-indifference region into transparent-debt or opaque-debt regions); (3) small businesses starting with a credit line then adding equity/loans/bonds as n or quality rises into the transparent region; (4) firms issuing equity when prices are high (high price signals improved quality αθ, and funds raised via equity strictly increase in αθ); (5) firms using two or three financing types simultaneously, because the theory is per-project — different projects/purposes (e.g., main operations vs. routine liquidity) can optimally use transparent and opaque contracts at the same time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model reconcile the Myers (2001) vs. Frank-Goyal (2003) empirical discrepancy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Myers (2001) reports that for broad U.S. nonfarm/nonfinancial corporations, external finance is a small share (mostly under 20%) of capital formation with equity issues minor and the bulk being debt. Frank and Goyal (2003) find that for publicly-traded U.S. firms (excluding financials, regulated utilities, major-merger firms), external finance is large (often exceeding investment) and net equity issues commonly exceed net debt issues. The theory explains both: equity finance is optimal only for high-quality, intermediate-funds firms, and amounts raised increase in quality, so publicly-traded (high-quality) samples show large, equity-heavy external finance, while broader samples include many debt-only and self-funded firms, yielding smaller, debt-dominated external finance. Verification cost γ varying over time, industry, and country also generates cross-dataset behavioral differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the structure of the optimal debt-to-equity ratio?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 10: it varies with firm characteristics and is not a smooth function of internal funds, and may not be unique. In a pooling equilibrium it equals σθ̲E[α]/(1+r−σθ̲E[α]) for n ≤ nO (constant across quality) and ī/n − 1 (strictly decreasing) for n &amp;gt; nO. In a mixing equilibrium, firms not satisfying (27) follow the same formula; firms satisfying (27) traverse: the constant ratio for n &amp;lt; n1; a continuum [0, σαθ/(1+r−σαθ)] over the equity/transparent-debt indifference region n ∈ [n1, nT(αθ)]; then the constant ratio; then ī/n − 1. The non-uniqueness over the indifference region is precisely the &amp;lsquo;partial capital structure irrelevance.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the equilibrium switch between mixing and pooling?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Theorem 1(iv): all else equal, as the verification cost γ rises, the set of transparent-contract users shrinks and opaque-debt users expand. There is a threshold γ̄ ∈ (0,∞) above which no firm uses transparent financing, so the equilibrium is pooling on opaque debt; below it, the equilibrium is mixing. Existence of the unique PBE itself requires condition (26), ensuring γ relative to the tightest discipline σᾱθ̲ī is sufficiently high so that all firms with productivity θ̲ (any α) choose opaque debt, pinning down θΩ = θ̲ and αΩ = E[α].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from prior optimal-contracting and capital-structure literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior costly-state-verification models (Diamond 1984; Gale-Hellwig 1985; Williamson 1986) yield debt as optimal with homogeneous entrepreneurs; adverse-selection models (Leland-Pyle 1977; Stiglitz-Weiss 1981; Myers-Majluf 1984 and others) and agency models (Jensen-Meckling 1976; DeMarzo-Sannikov 2006; DeMarzo-Fishman 2007) treat the frictions separately. This paper&amp;rsquo;s novelty is nesting BOTH adverse selection and the agency problem in a model of heterogeneous firms (along quality AND internal funds). That combination is what makes signaling/separating contracts fail and forces costly verification (transparency) for adverse-selection resolution, and it generates the coexistence of equity, transparent debt, and opaque debt, lends theoretical support to the pecking-order hypothesis (debt weakly dominates equity), and yields partial — not full — Modigliani-Miller irrelevance. It also contributes to the literature on optimal information control (Hirshleifer 1971, 1972; Diamond 1985; Dang-Gorton-Holmström-Ordoñez 2017; Monnet-Quintin 2017) by endogenizing the information-disclosure decision within contract design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the key scope conditions and caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results hold under Assumption 1 (all projects worth investing; all firms prefer external financing — so &amp;rsquo;lowest quality&amp;rsquo; is not literally any inferior business). The model is static and per-project; &amp;rsquo;low n&amp;rsquo; means low funds relative to project capacity ī, not necessarily a small or young firm. The most severe misreporting penalty (recovering fraction σ) is imposed to make incentive compatibility least costly. ī can be made to vary across projects without changing main results. The verification cost γ is the central comparative-statics parameter governing whether the equilibrium is mixing or pooling. Equilibrium existence requires condition (26) on γ. There is no empirical estimation — quantitative claims are model-derived equilibrium objects, not data estimates.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Information transparency&lt;/strong&gt;: Defined in the paper as whether investors require business information considered confidential to the firm to aid their investment decisions. Equity and transparent debt are &amp;rsquo;transparent&amp;rsquo; because the firm pays cost γ to reveal its true (α, θ); opaque debt merely reflects general information about the pool of qualifying firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Opaque debt&lt;/strong&gt;: A pooling debt contract carrying a common interest rate (1+r)/αΩ independent of firm-specific information, reflecting the lowest productivity θΩ and the expected survival rate αΩ = E[α] of all qualifying firms. Resembles a real-world business line of credit; the only contract implementable for firms needing small external funds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transparent debt&lt;/strong&gt;: A debt contract whose firm-specific interest rate (1+r)/α reflects the firm&amp;rsquo;s verified survival rate α (creditworthiness). Resembles corporate bonds or bank loans with firm-specific rates set after credit-rating-style scrutiny.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transparent (equity) contract&lt;/strong&gt;: The optimal transparent contract implemented as equity: investors receive a fraction of actual output (ownership), with payout depending on productivity θ. Available only to high-quality firms with lower-intermediate internal funds (n ∈ [n1, nT(αθ)]); these firms are indifferent between equity and transparent debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separating contract&lt;/strong&gt;: A contract by which a firm signals its true quality (α, θ) WITHOUT paying the verification cost γ, designed so no other type mimics it. Proved not to survive in equilibrium for any firm except possibly the lowest type, which itself prefers opaque debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial capital structure irrelevance&lt;/strong&gt;: A Modigliani-Miller-style equivalence holding only for a strict subset of firms — those satisfying condition (27) with n ∈ [n1(α,θ), nT(αθ)] — who are indifferent between equity and transparent debt. Outside this subset the financing choice is determinate, so irrelevance is &amp;lsquo;partial,&amp;rsquo; not universal.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Verification cost γ&lt;/strong&gt;: The cost of the technology (e.g., a rating agency, or the firm&amp;rsquo;s own effort to convince investors) that ascertains true firm quality (α, θ) before contracting. Its level governs whether the equilibrium is mixing (low γ) or pooling on opaque debt (γ above threshold γ̄), and existence of the unique PBE requires γ sufficiently high relative to σᾱθ̲ī (condition 26).&lt;/p&gt;</description></item><item><title>Interest Rate Pegs and the Reversal Puzzle: On the Role of Anticipation</title><link>https://macropaperwarehouse.com/papers/interest-rate-pegs-and-the-reversal-puzzle-on-the-role-of-anticipation/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/interest-rate-pegs-and-the-reversal-puzzle-on-the-role-of-anticipation/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper revisits the &amp;ldquo;reversal puzzle&amp;rdquo; — the counterintuitive result, first documented by Carlstrom, Fuerst and Paustian (CFP, 2015), that in standard New Keynesian models the effect of forward guidance (technically implemented as a perfectly anticipated interest rate peg) can switch from expansionary to contractionary as the duration of the peg increases. The authors&amp;rsquo; central claim is that the appearance of the puzzle hinges on agents&amp;rsquo; degree of anticipation of the peg, and they examine three polar/intermediate cases: perfect anticipation, no anticipation, and imperfect anticipation.&lt;/p&gt;
&lt;p&gt;Model and setup: The laboratory is the medium-scale DSGE model of Carlstrom, Fuerst and Paustian (2017), which features funding constraints and market segmentation (only financial intermediaries can hold long-term public and private bonds, subject to a leverage constraint from a hold-up problem and net-worth adjustment costs; households face a loan-in-advance constraint on investment). These frictions break Wallace neutrality so that QE has real and inflationary effects. The model has standard New Keynesian features: habit consumption, monopolistic competition, Erceg-Henderson-Levin (2000) sticky prices and wages with Christiano-Eichenbaum-Evans (2005) indexation, investment adjustment costs, and a Taylor rule with interest-rate smoothing. It is estimated with Bayesian methods on eight euro-area observables over 1998Q1-2013Q4, with a subset of parameters calibrated to CFP (β=0.99, capital share α=0.33, depreciation δ=0.025, price/wage markup elasticities ε_p=ε_w=5, steady-state leverage 6). The initial impulse in all experiments is the launch of a QE programme, modeled as a single shock to an AR(2) process for the real market value of long-term bonds (purchases last 6 quarters). Without a peg, QE raises inflation (the orthodox result).&lt;/p&gt;
&lt;p&gt;Main findings: (1) Perfect anticipation (perfect-foresight solution): reversals are a robust phenomenon. As peg duration P rises, the inflation response first grows and then explodes near a critical value; in the baseline this critical value is eight quarters. For P of 9-14 quarters inflation reverses sign (deflation instead of inflation); for 15-23 quarters the sign flips back to positive; for 24-50 quarters it turns negative again. Thus output and inflation responses oscillate with P. The authors give analytical intuition via the forward solution: complex unstable eigenvalues of matrix J, written in polar form, mean powers of J enter the solution as trigonometric functions of P (de Moivre&amp;rsquo;s formula), producing the oscillation. (2) No anticipation (extended-path method, agents expect E_t[ε_{t+n}]=0 each period and are &amp;ldquo;surprised&amp;rdquo;): the reversal puzzle is absent for all durations 0-50; the initial inflation response is always positive, because powers of J no longer enter the solution. (3) Imperfect anticipation (Markov-switching model solved with Maih&amp;rsquo;s 2015 RISE toolbox): two regimes — Taylor rule (regime 1) vs. peg (regime 2, where ρ=τ_Π=τ_y=0). Agents know transition probabilities, so the frequency F2 and average duration AD2 of the peg are known; frequency is interpreted as the degree of anticipation. Generalized impulse responses (50,000 draws) for average durations of 4, 11.5, 19, 37, 50 quarters and frequencies of 10%, 15%, 20%, 30%, 40%, 50% show: at the empirically relevant frequency of 10% (post-WWII US ZLB experience, ~7 years in 73) and at 15% and 20%, no reversals occur for any average duration. Reversals appear only at implausibly high frequencies: at 30% only for AD2=4 quarters; at 40% for AD2=4, 11.5, 19 quarters; at 50% for all average durations.&lt;/p&gt;
&lt;p&gt;Implications: A Markov-switching treatment of pegs/ZLB delivers more plausible model outcomes than perfect foresight and is a promising tool for policy simulations to avoid the reversal pathology, since under realistic anticipation forward guidance is less powerful and reversals do not arise.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What exactly is the reversal puzzle and where did it originate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is the counterintuitive result that the macroeconomic effect of forward guidance — implemented technically as a perfectly anticipated interest rate peg — can switch from expansionary to contractionary depending on the peg&amp;rsquo;s duration, producing sizeable deflation instead of inflation. Carlstrom, Fuerst and Paustian (2015) first analyzed and named it. Similar sign reversals are noted in Lindé-Smets-Wouters (2016) and Binning-Maih (2017).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/solution strategy for each anticipation case, and what distinguishes them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Perfect anticipation: perfect-foresight (deterministic) solution where the peg is implemented via binary dummy shocks (ε^TR in {0,1}) set to one for P pre-announced quarters; agents know all future ε_{t+n}, so powers of the eigenvalue matrix J enter the forward solution. No anticipation: the extended-path method, running a deterministic simulation each period with the previous period as initial condition and steady state as terminal condition, imposing E_t(ε_{t+n})=0 — agents are surprised the peg continues, so powers of J drop out. Imperfect anticipation: a Markov-switching framework (Maih 2015) with non-zero transition probabilities between a Taylor-rule regime and a peg regime; the peg is a recurring stochastic event whose frequency and average duration are known to agents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the formal mechanism for the oscillation under perfect foresight?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The forward-looking (explosive) variables solve as w2,t = -E_t{Σ J^{n-1} Ω22^{-1} Q2 Φ ε_{t+n}}. Some diagonal elements of J (the unstable generalized eigenvalues) are complex; in polar form z_jj = r(cos φ + i sin φ), and by de Moivre z_jj^k = r^k(cos kφ + i sin kφ) for k=0,&amp;hellip;,P-1. Because nonzero anticipated future shocks bring in powers of J, the solution involves trigonometric functions of the peg length P, so simulations approach an asymptote, switch sign, approach another asymptote, switch again — hence oscillation as P grows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why are reversals absent under no anticipation, given the same complex eigenvalues?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Complex eigenvalues are only a necessary, not sufficient, condition. Under no anticipation E_t(ε_{t+n})=0, so the solution for w2,t no longer depends on powers of J; the simulations do not &amp;lsquo;move along&amp;rsquo; the trigonometric functions, so the explosive complex eigenvalues cannot induce cyclical/explosive effects. A sufficient degree of anticipation is necessary for reversals to occur.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are frequency and average duration of the peg pinned down in the Markov-switching model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;p12 is the transition probability from Taylor regime (1) to peg regime (2); p21 from 2 to 1. Average peg duration AD2 = 1/p21. Frequency F2 = AD2/(AD1+AD2) with AD1 = 1/p12. Table 2 maps the (AD2, F2) grid to the implied p12, p21. The authors check the mean-square-stability condition for each calibration before computing generalized impulse responses from 50,000 draws.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the empirically relevant peg frequency and how is it justified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;About 10%, based on the post-WWII US zero-lower-bound experience (7 years at the ZLB out of 73 years), the same value used by Dordal-i-Carreras, Coibion, Gorodnichenko and Wieland (2016). The paper stresses that even at double this value (20%) reversals are absent for all average durations considered.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the reversal pattern under imperfect anticipation differ from perfect anticipation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The patterns differ. Under perfect foresight the lowest sub-range of durations (0-8 quarters) shows no reversal, whereas under imperfect anticipation at frequencies of 30% and 40% a reversal occurs for the lowest average duration (4 quarters). Reversals also appear &amp;lsquo;grouped&amp;rsquo; across adjacent average durations. The regime-specific IRFs explain this: given the peg regime (regime 2), higher average durations lead to reversals at low frequencies; given the no-peg regime (regime 1), only frequencies of 30%+ permit reversals and there lower average durations reverse. The GIRF blends both regimes, so its resemblance to a regime&amp;rsquo;s IRF depends on how frequently that regime occurs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are performed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An extensive grid search (Appendix D) varies each structural parameter one at a time around benchmark values under perfect foresight. Reducing forward-lookingness (lower β) or raising habit, changing depreciation δ or investment adjustment cost ψi, varying the Calvo price/wage parameters (θp, θw) and indexation (ιp, ιw), and varying Taylor-rule coefficients (ρ, τπ, τy) all only change the peg duration required for the reversal to appear, not its existence. Notably, even shutting down price and wage indexation jointly (ιp=ιw=0) does not eliminate reversals in this medium-scale model, because other endogenous state variables (capital, wages, net worth) generate complex eigenvalues. More aggressive inflation stabilization (higher τπ) or longer Calvo durations (&amp;gt;0.9) require a longer peg before reversal appears.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is complementary to CFP (2015), who showed reversals require complex eigenvalues from endogenous states and that switching from sticky-price to sticky-information removes the puzzle; this paper instead goes beyond perfect foresight to show the degree of anticipation is key. It differs from De Graeve-Ilbas-Wouters (2014), Maliar-Taylor (2019), and Bundick-Smith (2020), who rely on realistic calibration to weaken forward guidance; here the resolution comes from realistic modeling of expectations. Unlike de Groot and Mazelis (2020) — who modify the linearized solution so agents are fully aware of the peg — the Markov-switching approach treats the peg as a recurring stochastic event. Methodologically closest is Chen (2017), who compares perfect-foresight and Markov-switching implementations of the ZLB; consistent with her, the authors find Markov-switching delivers more plausible outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the ZLB and forward guidance must be accounted for in model simulations, and these are often modeled as interest-rate pegs, policy evaluations risk spurious reversals. The Markov-switching approach circumvents this pathology and yields qualitatively plausible outcomes. Scope conditions: the result holds for empirically relevant peg frequencies (up to ~20%, double the 10% benchmark) across average durations of 4-50 quarters; reversals can still arise but only under extreme, arguably implausible frequencies (30%+). The conclusions are derived within the CFP (2017) segmented-markets model estimated on euro-area data, with QE as the initiating impulse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the QE programme modeled and what is its transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;QE is a single shock to a persistent AR(2) process for the real market value of long-term bonds held by the public, generating an inverse hump shape with purchases lasting 6 quarters before gradual return to steady state. Transmission: lower bond supply to FIs raises bond prices and lowers yield-to-maturity and the term premium; FI net worth and leverage fall but net-worth mobility is limited by adjustment costs, so FIs raise demand for (perfect-substitute) investment bonds, raising their price, relaxing households&amp;rsquo; loan-in-advance constraint, boosting investment, output, and inflation; monetary policy then raises the policy rate under the Taylor rule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there caveats about the no-anticipation case as a &amp;lsquo;solution&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The authors state the no-anticipation case is obviously not a suitable solution to the puzzle — it is an unrealistic polar case (agents are surprised every period). Both polar cases (perfect and no anticipation) are unrealistic, which motivates the imperfect-anticipation Markov-switching analysis as the realistic middle ground.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Reversal puzzle&lt;/strong&gt;: The counterintuitive switching of forward guidance&amp;rsquo;s effect from expansionary to contractionary (deflation rather than inflation) as the duration of a perfectly anticipated interest rate peg increases; in this paper, the inflation response oscillates in sign across peg durations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Degree of anticipation&lt;/strong&gt;: The extent to which agents expect a future interest rate peg. The paper&amp;rsquo;s central organizing concept: in the stochastic case it is operationalized by the frequency of the peg regime, since a higher frequency makes agents consider a peg more likely.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Interest rate peg&lt;/strong&gt;: A regime in which the central bank abandons the Taylor rule and holds the nominal short-term rate fixed for a period — the technical implementation of forward guidance and the ZLB in this analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect anticipation (Markov-switching implementation)&lt;/strong&gt;: A scenario where agents attach non-zero transition probabilities to entering and exiting a recurring peg regime, so individual peg episodes are stochastic in occurrence and duration but their frequency and average duration are known.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Frequency of the peg (F2)&lt;/strong&gt;: The long-run share of time the economy spends in the peg regime, F2 = AD2/(AD1+AD2); interpreted as the degree of anticipation, with ~10% taken as the empirically relevant post-WWII US ZLB value.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Complex eigenvalues / forward solution&lt;/strong&gt;: Unstable generalized eigenvalues of the solution matrix J that are complex-valued; their polar-form powers introduce trigonometric functions of peg length P into the forward solution — a necessary but not sufficient condition for reversals, which require sufficient anticipation to activate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wallace neutrality breakdown&lt;/strong&gt;: The property, induced by FI funding constraints and bond-market segmentation in the CFP (2017) model, that asset purchases (QE) affect real activity and inflation rather than being neutral as in the standard New Keynesian model.&lt;/p&gt;</description></item><item><title>Liquidity Crises and the Market-Maker of Last Resort</title><link>https://macropaperwarehouse.com/papers/liquidity-crises-and-the-market-maker-of-last-resort/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/liquidity-crises-and-the-market-maker-of-last-resort/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper develops a theoretical model to explain why financial markets can suffer self-fulfilling liquidity crises and how a central bank acting as a &amp;ldquo;market-maker of last resort&amp;rdquo; (MMLR) can mitigate them. The motivation is policy-driven: during the 2008-09 crisis and the COVID-19 pandemic, the Fed, ECB, and other central banks purchased assets at above-market prices (e.g., Maiden Lane I/II/III and the TALF) to support markets, a function distinct from the traditional lender-of-last-resort (LLR) role. The authors note that formal theoretical analysis of MMLR remains sparse (citing Buiter et al. 2023) and aim to fill that gap.&lt;/p&gt;
&lt;p&gt;Model setup: It is an overlapping-generations (OLG) model with two-period-lived agents and fully rational expectations. There are two assets: a risk-free storage technology with gross return 1-δ (0&amp;lt;δ&amp;lt;1, a negative net return capturing the cost of self-insurance) and a non-depreciating Lucas tree in unit measure paying a constant dividend r (0&amp;lt;r&amp;lt;1). Young agents receive a unit endowment and save (natural buyers); old agents sell their tree to finance consumption (natural sellers). The tree price p_t is set by decentralized Nash bargaining with β denoting the seller&amp;rsquo;s (old agent&amp;rsquo;s) bargaining power. Old agents face an i.i.d. idiosyncratic liquidity shock γ∈{0,1} with probability q; if hit (γ=1) they must pay one unit of the good or suffer a utility penalty ω times the shortfall, with ω&amp;gt;1 (focus on large ω). A key parameter restriction is 0&amp;lt;r&amp;lt;δ&amp;lt;1, which rules out a trivial case where liquidity crises could never occur.&lt;/p&gt;
&lt;p&gt;Main results: Because trading is by bilateral bargaining (not Walrasian), the model has multiple Pareto-rankable stationary rational-expectations equilibria, each sustained by self-fulfilling beliefs about future prices; lower-price equilibria are Pareto-inferior, more pessimistic, and entail lower consumption. Three benchmark equilibria are derived: (1) an efficient stationary equilibrium with p_t=1 (zero storage), which exists for large ω if seller bargaining power β exceeds a threshold β̃=(1-δ)(1-r)/[δ+(1-δ)(1-r)]; (2) an inefficient stationary equilibrium at p_t=p*=1-r/δ, which exists for any β∈(0,1) and large ω; and (3) a nonstationary equilibrium where prices asymptotically approach p* via p_{t+i}=p*-(1-δ)^i(p*-p_t), requiring β below a threshold β*. The authors introduce a nonfundamental &amp;ldquo;sunspot&amp;rdquo; shock that occurs each period with small probability π, inducing pessimistic beliefs that lower the price below the continuation path (to C(p_{t-1})) and leave old agents illiquid (W&amp;lt;1) — a liquidity crisis with flight-to-quality (increased costly storage), run-like behavior, and fire-sale-like price collapse. Crucially, along non-crisis recovery paths all later generations remain liquid, and the increased output loss from storage is exactly offset by greater price appreciation (the wealth difference across adjacent non-crisis periods nets to zero).&lt;/p&gt;
&lt;p&gt;Policy: An &amp;ldquo;aggressive&amp;rdquo; MMLR — government issuing bonds to young agents and buying trees via Nash bargaining with a positively sloped excess-utility function — can support the unique first-best (p=1) allocation, but the authors argue this is likely politically infeasible (looks like a Wall Street bailout) and fragile (requires persistent intervention if β&amp;lt;β̃). A &amp;ldquo;conservative&amp;rdquo; MMLR embedding a &amp;ldquo;no-bailout&amp;rdquo; constraint (buy low / sell high) can support p=p*, eliminating utility-cost (crisis) inefficiency but leaving storage-cost inefficiency. Finally, replacing bilateral bargaining with a centralized Walrasian auction yields a unique, efficient equilibrium (p_t=1) with no storage and no liquidity crises, motivating regulatory pushes toward centralized/transparent trading (e.g., Dodd-Frank swap execution facilities, Treasury central clearing proposals). The model abstracts from moral hazard and from distinguishing fundamental vs. nonfundamental price declines.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism that generates multiple equilibria and liquidity crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The combination of (a) decentralized Nash bargaining as the trading mechanism and (b) the concavity of the indirect utility function when ω&amp;gt;1. With ω&amp;gt;1, the liquidity penalty makes storage relatively more valuable to a poorer young agent, so an equal fall in the tree price today and tomorrow reduces young agents&amp;rsquo; wealth and shifts demand from the tree toward storage. This makes pessimistic beliefs self-fulfilling: a fall in p_t justified by expected low p_{t+1} is itself an equilibrium. With ω=1 (no liquidity penalty) Proposition 1 shows there is a single stationary equilibrium and no nonstationary equilibria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How exactly is a liquidity crisis defined in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An old agent is &amp;rsquo;liquid&amp;rsquo; if end-of-trading wealth W(p_t,p_{t-1})≥1, which is enough to fund a unit liquidity shock. A liquidity crisis is a state where W&amp;lt;1, so an old agent hit by γ=1 cannot fund the shock and incurs the utility penalty. The crisis is triggered by a nonfundamental sunspot that makes the young pessimistic, pushing the price to a crisis-deviation value C(p_{t-1}) satisfying p_underbar &amp;lt; C(p_{t-1}) &amp;lt; κ^o(p_{t-1}), which renders the date-of-crisis old agents illiquid.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three benchmark equilibria and their existence conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Efficient stationary p_t=1 ∀t: exists for large ω if β&amp;gt;β̃=(1-δ)(1-r)/[δ+(1-δ)(1-r)]; under the tighter condition β&amp;gt;1-δ it exists for all ω&amp;gt;1; not an equilibrium if β&amp;lt;β̃ for large ω. (2) Inefficient stationary p_t=p*=1-r/δ: exists for any β∈(0,1) and large ω; here κ^o(p*)=κ^y(p*)=p* so all agents are liquid. (3) Nonstationary equilibrium p_{t+i}=p*-(1-δ)^i(p*-p_t) approaching p*: requires β&amp;lt;β*=(1-δ)p*/[δ+(1-δ)p*] and appropriate starting prices; along this path W=1 for all i≥1 so all agents are liquid.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the recovery after a crisis leave subsequent generations liquid even though prices recover only gradually?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Although a crisis raises costly storage (flight to quality) and prices recover only asymptotically, the authors decompose wealth in adjacent non-crisis periods and show the reduction in output from increased storage is exactly offset by a greater rate of price appreciation: W_{t&amp;rsquo;+i}-W_{t&amp;rsquo;+i-1}=(p_{t&amp;rsquo;+i-2}-p_{t&amp;rsquo;+i-1})(1-δ) + (p_{t&amp;rsquo;+i-1}-p_{t&amp;rsquo;+i-2})(1-δ) = 0. So later generations remain liquid (W=1) until the next crisis hits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What distinguishes the &amp;lsquo;aggressive&amp;rsquo; from the &amp;lsquo;conservative&amp;rsquo; MMLR policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Aggressive MMLR (Proposition 6): government traders act with an excess-utility function having strictly positive slope in p_t (prefer buying at higher prices), which can enforce p=1 and support the first-best. The authors deem it politically infeasible (appears to subsidize/bailout Wall Street) and fragile (if β&amp;lt;β̃, sustaining p=1 requires persistent intervention). Conservative MMLR (Proposition 7): government adopts a &amp;rsquo;no-bailout&amp;rsquo; excess-utility function strictly decreasing in p_t and increasing in expected future price (buy low, sell high), supporting p=p* and ruling out p=1 as an equilibrium. It eliminates utility-cost (crisis) inefficiency but not storage-cost inefficiency, and p* remains a natural equilibrium even if political support wavers (absent a current crisis).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role does the Walrasian alternative play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 8 shows that if trading occurs via a centralized Walrasian auction rather than bilateral bargaining, there is a unique equilibrium with p_t=1 ∀t, no storage, and no liquidity crises. The multiplicity arises in the bargaining model precisely because there is no market to sell storage and buy more trees, permitting interior solutions p_t∈(0,1). This yields the normative implication that regulators should favor centralized, transparent trading venues (cited examples: national bid/offer dissemination for stocks, Dodd-Frank swap execution facilities, proposals for Treasury central clearing).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is bargaining power β interpreted, and what is its normative significance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;β∈[0,1] is the old agent&amp;rsquo;s (seller&amp;rsquo;s) bargaining power, taken as a primitive standing in for unmodeled market characteristics (e.g., the seller of an MBS may have superior information, or fire-sale conditions may disadvantage sellers). Bargaining power inheres in the role (seller vs. buyer), not the individual; the same agent has power β when old/selling and 1-β when young/buying. High β supports the efficient p=1 equilibrium; low β makes the economy prone to crises. The authors note the Hosios-type efficiency condition on β from labor-search models is not relevant here.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from the closest prior work, Choi and Yorulmazer (2023, &amp;lsquo;CY&amp;rsquo;)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both study multiple equilibria in financial markets and the MMLR&amp;rsquo;s role in removing multiplicity. Differences: CY&amp;rsquo;s model is fundamentally static, whereas this is a dynamic stochastic equilibrium model used to generate periodic crises from exogenous bouts of pessimism. Price determination differs: CY uses the cash-in-the-market paradigm (Allen and Gale 1994), whereas this paper uses decentralized Nash bargaining, in which the Walrasian equilibrium is unique and efficient but many Pareto-inferior bargaining equilibria coexist, letting the authors ask whether MMLR can eliminate some or all inferior equilibria. The paper also relates to Holmström-Tirole (self-insurance via low-yield assets is suboptimal; government has a role), but there the friction is a pledgeability/principal-agent problem, whereas here suboptimality comes from a small-probability inferior equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Is the Nash bargaining assumption robust to an alternative bargaining solution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors check Kalai (1977) proportional bargaining. Holding the Kalai weight ν constant, there exist two values of ν supporting the efficient and inefficient equilibria of Propositions 2 and 3 (with parameters r=0.2, δ=0.25, ω=200, q=0.1, the young&amp;rsquo;s proportional weight is 0.243 in the efficient equilibrium and 0.555 in the inefficient one). For the nonstationary equilibrium of Proposition 4, the ratio of old-to-young excess utility changes over time, so no single constant ν supports it; the Nash solution, by contrast, holds over a range of weights. Inefficient equilibria are supported under both Nash and Kalai.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and scope conditions on the policy conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is highly stylized: two-period OLG rules out LLR analysis (old agents do not live long enough to repay loans). In practice policymakers must distinguish price declines due to equilibrium shifts from those due to changing fundamentals (the authors say both were likely active in 2007-08), and must determine the &amp;lsquo;correct&amp;rsquo; equilibrium price, which is nontrivial. The model abstracts entirely from moral hazard in public backstopping (citing Farhi-Tirole 2012, Gradstein 2022). The aggressive policy supporting p=1 is fragile and politically vulnerable; the conservative no-bailout policy only removes crisis (utility-cost) inefficiency, leaving storage-cost (flight-to-quality) inefficiency intact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What real-world MMLR interventions does the paper map its model to?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Maiden Lane LLC (March 2008, Bear Stearns mortgage assets to facilitate the J.P. Morgan merger), Maiden Lane II and III (October 2008, addressing AIG&amp;rsquo;s exposure to RMBS and CDOs), and the TALF (supporting certain asset-backed securities). It also cites Buiter et al. (2023) documenting extensive MMLR use by the Fed, ECB, Sveriges Riksbank, Bank of Japan, and Bank of Canada during COVID-19 (repo participation, corporate bond and commercial paper purchases, restarting TALF).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Loan Evergreening through Banks' Lenses: Evidence from Credit Product-Level Data</title><link>https://macropaperwarehouse.com/papers/loan-evergreening-through-banks-lenses-evidence-from-credit-product-level-data/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/loan-evergreening-through-banks-lenses-evidence-from-credit-product-level-data/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Banks reluctant to recognize losses on troubled borrowers engage in &amp;ldquo;loan evergreening&amp;rdquo;—rolling over or extending credit to delay loss recognition. This misdirected lending has been blamed for Japan&amp;rsquo;s Lost Decade and Europe&amp;rsquo;s post-crisis stagnation by steering credit to unproductive firms. Observing &lt;em&gt;how&lt;/em&gt; banks do this, and their regulatory motives, is empirically hard. The paper studies a specific, previously hard-to-observe evergreening strategy that arises from banks&amp;rsquo; incentive to avoid loan-loss provisions, which increase convexly as repayment delays lengthen.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification innovation.&lt;/strong&gt; The authors depart from the firm-profitability-based zombie-lending literature and instead look at credit products. They identify evergreening as instances where a firm receives a new &lt;em&gt;bullet loan&lt;/em&gt; (interest-only until maturity) of similar amount to its contemporaneous &lt;em&gt;amortizing loan&lt;/em&gt; repayment to the same bank in the same month. They compute the ratio (new bullet loan / amortizing repayment) and observe an &amp;ldquo;excess mass&amp;rdquo; around 1; cases with a ratio between 0.5 and 1.5 are classified as evergreening. Bullet loans are common (~25% of firms with amortizing loans also have one); 70% of bullet loans have maturity ≤181 days. This strategy carries less capital consumption than restructuring, which forces higher provisioning.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and setting.&lt;/strong&gt; Two monthly datasets from the Central Bank of Uruguay, 2006–2018: the exhaustive Credit Registry (loan-level: borrower, sector, amount, currency, maturity, delinquency) and bank balance-sheet/income data. Sample: 1,950,189 amortizing-loan observations, 14 banks, 39,698 firms. Public credit register means all banks can see borrowers&amp;rsquo; delinquency elsewhere.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Validation of the measure.&lt;/strong&gt; The share of evergreening is countercyclical (correlation with GDP growth = −0.55, highly significant), tripling from mid-2007 to early 2010. By end of sample, ~2% of amortizing-loan observations receive evergreening (0.5%–2% range overall—lower than the ~10% in zombie-lending literature, but measuring a different, narrower strategy). A placebo-style test: the dairy sector (hit by a large negative external shock around 2014 from China&amp;rsquo;s slowdown and Venezuela&amp;rsquo;s crisis) shows evergreening more than doubling, well above the whole economy and the comparable but unaffected livestock sector.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings (linear probability models with rich fixed effects, including Firm×Month FE).&lt;/strong&gt; (1) &lt;strong&gt;Determinants:&lt;/strong&gt; Solvency (capital/RWA) is the only consistently relevant bank determinant; lower solvency → more evergreening. A one-SD lower solvency (SD = 0.083, or 8.3pp) raises evergreening probability by 0.546pp, an over-50% increase relative to the ~1% unconditional mean. Solvency matters &lt;em&gt;more during booms&lt;/em&gt;, contradicting gambling-for-resurrection accounts. Loan-level: short-term loans (+0.7pp), higher USD share (0%→100% gives +0.8pp), being the firm&amp;rsquo;s top/main bank (+0.65pp), and longer relationships all raise evergreening likelihood. (2) &lt;strong&gt;Credit:&lt;/strong&gt; Evergreening is associated with ~7pp (7.3pp) higher amortizing credit growth from the same bank over 12 months (excluding the bullet loan), and a 7.5pp higher probability of any credit increase (23.4% above the 32% baseline). (3) &lt;strong&gt;Relationship survival:&lt;/strong&gt; No effect on probability of relationship ending. (4) &lt;strong&gt;Performance:&lt;/strong&gt; Without Firm×Month FE, evergreening predicts +1.1pp higher future delinquency at 12 months, concentrated in low-solvency banks and ex-ante non-performing firms; the effect peaks ~16 months out (~2pp). With Firm×Month FE the sign reverses—a multi-bank firm is &lt;em&gt;less&lt;/em&gt; likely to become delinquent with the bank that evergreened than the one that did not. (5) &lt;strong&gt;Access to new lenders:&lt;/strong&gt; Single-relationship firms receiving evergreening are more likely to obtain a second bank after ~18 months. (6) &lt;strong&gt;Crowding-out:&lt;/strong&gt; No aggregate displacement, but at the 5-digit-industry level, banks more engaged in evergreening are more likely to fully cut credit to non-evergreened firms in that industry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Implications.&lt;/strong&gt; The measure is an early-warning tool for supervisors; the strategy is regulatory arbitrage that avoids the provisioning penalty of formal restructuring.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors identify evergreening as a new bullet loan whose amount approximately matches a contemporaneous amortizing-loan repayment to the same bank-firm in the same month (ratio between 0.5 and 1.5). The bank-borrower-month granularity lets them saturate the determinants regression with bank and Firm×Month fixed effects, so firm-level credit demand and characteristics are absorbed, isolating bank/loan supply-side drivers. Main threats: (a) misclassification—the measure misses evergreening done via larger bullet loans or other instruments; the authors argue this biases results downward (attenuation). (b) The legality/intent of any single bullet loan is ambiguous (many legitimate reasons exist), but they rely on the statistical excess mass at ratio≈1 to argue the vast majority of selected cases are genuine evergreening. (c) Omitted bank-level confounders—addressed via Oster (2019) coefficient-stability: the bias-adjusted Solvency coefficient at R-squared=1 is −5.556, and unobservables would need to be ~11x (δ=10.9) more correlated with Solvency than observables to nullify the result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two motives. (1) Provision/capital management (regulatory arbitrage): provisions rise convexly with repayment delay, so banks issue bullet loans to keep firms current and avoid provisioning. Supported by the dominance of Solvency, the short-term-loan effect, and the Firm×Month-FE result that a firm receives evergreening from its &lt;em&gt;non-delinquent&lt;/em&gt; bank (preventing the delay rather than reacting to it). (2) Relationship/reputation lending à la Hu and Varas (2021): banks evergreen to camouflage problems so the borrower can attract outside funding. Supported by the finding that single-relationship firms gain access to a second bank ~18 months after evergreening. The booms-matter-more result distinguishes this from gambling-for-resurrection (Bruche and Llobet 2014), which predicts weak banks pushing losses forward mainly in bad times.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cyclical: Solvency&amp;rsquo;s importance is stronger in booms (at average ~4% GDP growth the coefficient is −5.267; a one-SD higher GDP growth of ~2.6pp shifts it to about −7.14). By bank: low-solvency banks evergreen riskier (ex-post worse) firms, so the evergreening→future-delinquency link is concentrated among low-solvency lenders and weakens/reverses for high-solvency banks (one SD above median: ~0.6pp lower delinquency, not significant). By relationship structure: single-bank firms drive the positive evergreening→delinquency result; multi-bank firms show the opposite (less likely delinquent with the evergreening bank). By ex-ante status: the delinquency effect is present for currently-performing firms and even stronger (triple interaction) for currently non-performing ones. The solvency effect on the &lt;em&gt;probability&lt;/em&gt; of evergreening is concentrated in the top/main bank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Brodeur et al. (2020) specification-check: each of six bank controls is regressed against all 1,023 combinations of the other ten controls; only Solvency is consistently significant (always negative, t&amp;gt;1.65), while Size, Credit, Liquidity, Provisions never/almost never cross, and RoA&amp;rsquo;s significance is not robust. (2) Oster (2019) selection-on-observables bound (δ=10.9). (3) Progressive addition of fixed effects (Bank, Month, Firm, Firm×Month, Bank×Month)—Solvency coefficient stays stable (~−6) while R-squared rises from 0.7% to 45.5%. (4) Unreported Probit yields negative, significant Solvency. (5) Intensive-margin result re-run with a binary &amp;lsquo;credit went up&amp;rsquo; outcome to guard against outliers, and dynamics traced from x=1 to 24 months. (6) Delinquency result decomposed (columns 7–8) to show the sign reversal is driven by Firm×Month FE, not just the changed sample. (7) Appendix numerical provisioning example and a stylized theoretical model of the restructure-vs-evergreen tradeoff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on Peek and Rosengren (2005) and Caballero et al. (2008) on Japanese zombie lending but shifts the lens from firm profitability to bank credit products. Among granular-data papers: Bonfim et al. (2020, Portugal) find low profitability and exclusive relationships drive refinancing of troubled borrowers, with supervisory inspections deterring some; Bergant and Kockerols (2020, Ireland) find capital-constrained banks forbear more to riskier borrowers, effective only short-run; Mourad et al. (2020, Brazil) and Tantri (2021, India) study restructuring/renewals. This paper&amp;rsquo;s distinctive contribution is identifying a &lt;em&gt;regulatory-arbitrage&lt;/em&gt; strategy (bullet-to-repay-amortizing) that is more flexible and less provisioning-costly than restructuring, and tracing its determinants and consequences for credit supply, performance, access to new lenders, and other firms. It also speaks to theory: contra Bruche and Llobet (2014) gambling-for-resurrection (since the practice is used by well-capitalized banks and matters more in booms), and in favor of Hu and Varas (2021) relationship/reputation mechanism for single-relationship firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The measure serves as an early-warning indicator for supervisors, who can flag bullet-loans-matching-repayments as potential evergreening and (as has occurred) require restructuring. Scope: the strategy is narrow (0.5%–2% of observations) and not restricted to deeply distressed firms—7.8% of evergreening cases involve &amp;gt;60-day delays, almost identical to the 7.4% in the full sample—so it is partly preemptive provision management, not only zombie support. Crowding-out concerns are muted in aggregate but real at narrowly-defined (5-digit) industry level, where high-evergreening banks cut credit to other firms. The authors note relevance is heightened post-COVID with more firms in distress.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the provisioning/regulatory mechanism in detail?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Uruguayan regulation, borrowers are rated 1A/1C/2A/2B/3/4/5 by days past due; provisioning ranges from 0.5–1.5% (1C) up to 100% (rating 5, &amp;gt;180 days). The paper defines delinquent as ratings 3–4 (&amp;gt;60 days, &amp;lt;180 days) and excludes rating 5. In the stylized example (1,000-peso loan, zero collateral), total capital consumption (provisions + capital requirement) rises sharply with deterioration: ~84.6 at 1C to 236.4 at rating 3 and 540 at rating 4. Restructuring forces a worse rating than if the borrower had stayed current, so it carries even more capital consumption than the bullet-loan evergreening strategy—the core regulatory-arbitrage incentive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the theoretical model show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A stylized decision tree: facing a troubled borrower, the bank either restructures immediately (cost R) or extends an evergreen bullet loan. If it evergreens, with probability α the supervisor detects it and imposes restructuring plus penalty S; with probability 1−α it is not caught, and then the borrower repays with probability 1−β or defaults (forcing restructuring R) with probability β. The bank prefers evergreening when R &amp;gt; [(1−α)(1−β)/α]·S. Evergreening is less attractive when α→1 (supervisor catches often) or β→1 (loan almost surely needs restructuring). The model is not calibrated; it formalizes why low detection probability and modest penalties make evergreening attractive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there caveats about the magnitude and comparison to zombie-lending estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The 0.5%–2% prevalence is far below the ~10% typical of zombie-lending studies, but the authors stress the two are not comparable—they capture a specific regulatory-arbitrage strategy, not broad firm-level distress, and the strategy is also used for firms not (yet) delinquent. Misclassification (missing larger or differently-structured evergreening) biases estimates downward. The intensive-margin credit-growth effect loses significance after ~19 months as standard errors grow (fewer observations at long horizons), and the two-year credit effect, while similar in magnitude, is no longer statistically significant.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Loan evergreening strategy (as defined here)&lt;/strong&gt;: A new bullet loan granted to a firm of an amount similar to its contemporaneous amortizing-loan repayment to the same bank in the same month (ratio between 0.5 and 1.5), used to extend the duration of exposure without increasing it and to delay loss/provision recognition. This is the paper&amp;rsquo;s specific, product-level operationalization, distinct from generic zombie lending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bullet loan&lt;/strong&gt;: A loan whose principal is repaid in full at maturity with only interest paid before then. In this paper, bullet loans (70% with maturity ≤181 days) are the instrument banks use to repay existing amortizing loans and keep the firm current.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Amortizing loan&lt;/strong&gt;: A loan whose principal is repaid gradually over its life. The benchmark credit product whose scheduled repayment is matched against new bullet loans to detect evergreening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Solvency&lt;/strong&gt;: Defined in the paper as regulatory capital over risk-weighted assets. It is the single consistently significant bank-level determinant of evergreening (lower solvency → more evergreening), and its importance rises in economic booms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regulatory arbitrage (provisioning avoidance)&lt;/strong&gt;: Using the bullet-to-repay-amortizing strategy to keep a borrower from being rated as delinquent, thereby avoiding the convex increase in loan-loss provisions and capital consumption that delinquency or formal restructuring would trigger. Restructuring is shown to consume even more capital than this strategy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Delinquent&lt;/strong&gt;: In this paper, a borrower delayed by more than 60 days in repayment (ratings 3–4 under Uruguayan regulation, i.e., 60–180 days past due); rating-5 loans (&amp;gt;180 days) are excluded from analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Top bank&lt;/strong&gt;: The bank providing the highest amount of amortizing credit to a firm; such main-relationship banks are substantially more likely to provide evergreening, and the solvency effect is concentrated among them.&lt;/p&gt;</description></item><item><title>Macroprudential Policy in the Euro Area</title><link>https://macropaperwarehouse.com/papers/macroprudential-policy-in-the-euro-area/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/macroprudential-policy-in-the-euro-area/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. There is now broad consensus that monetary authorities should hold a financial-stability mandate and that macroprudential policy should be part of it, yet evidence on the macroeconomic effectiveness of these policies and their interaction with monetary policy remains thin and inconclusive. The paper addresses this gap for the euro area, a case of special interest because of its international structure and because, within the short life of the euro, member states experienced major episodes of financial instability (the great financial crisis, GFC, and the sovereign debt crisis). The contribution is twofold: (1) build a novel aggregate index of the euro-area macroprudential policy stance and document its stylized facts since 1999; (2) be the first to identify, within a structural econometric framework, both unanticipated (surprise) and anticipated (news) exogenous macroprudential policy shocks and trace their macroeconomic effects.&lt;/p&gt;
&lt;p&gt;Data and method. The authors use MaPPED (Macro-Prudential Policies Evaluation Database), built by ECB staff and national central banks. For euro-area countries it records 1205 policy actions between 1995 and 2019 across 11 instrument types (capital buffers, lending standards, maturity mismatch tools, limits on credit growth, exposure limits, liquidity rules, loan loss provisions, minimum capital requirements and risk weights, leverage ratio, and &amp;lsquo;other measures&amp;rsquo;). Actions are signed (+ tightening, − loosening, 0 ambiguous) and weighted following Meuleman and Vander Vennet (2020): activation 1, change in level 0.25, change in scope 0.10, maintaining level/scope 0.05; deactivation resets the cumulative index to zero. This yields around 470 instrument-level indices, summed within each country and then aggregated across countries using GDP-share weights to form the EAMPP index. The empirical model is a seven-variable Bayesian SVAR at quarterly frequency over 1999:Q1–2019:Q2, estimated in levels with 4 lags and a Minnesota prior using the hyperparameters of Kurmann and Otrok (2013). Variables: the narrative EAMPP (which excludes countercyclical/financial-cycle-reactive policies so it is exogenous in the Romer-Romer sense), total credit to the private non-financial sector, real GDP, core CPI, inflation expectations (ZEW 6-month survey), VSTOXX, and a monetary policy rate (EONIA 1999–2009, Wu-Xia shadow rate thereafter). The surprise shock is identified by a Cholesky ordering with EAMPP first; the news shock is identified via the Barsky-Sims (2011) forecast-error-variance maximization (horizon k=0 to k=24), orthogonal to the surprise shock and not affecting EAMPP contemporaneously.&lt;/p&gt;
&lt;p&gt;Main findings. Stylized facts: EAMPP shows a positive starting value (policies predating the euro), a small positive trend up to the GFC, a loosening on average at the start of the GFC in 2009, then a clear upward (tightening) trend over the following seven years driven by sovereign-debt-crisis concerns and Basel III/CRR-CRDIV; the level in 2016 is almost twice as tightening as pre-crisis. The largest quarterly EAMPP change occurred in 2013:Q3 (CRR/CRDIV announcements). Policy announcements averaged about 13 per quarter in 1999–2015 versus about 2 per quarter in 2016–2019. Macroprudential and monetary policy moved oppositely; their correlation is about −0.90, negative and significant. SVAR results: a tightening surprise shock persistently raises the policy index, lowers total credit (on impact, accentuating over the medium term), reduces output in a way negatively correlated with credit (lowering credit pro-cyclicality), and lowers VSTOXX over the medium term after an initial rise. The effect on core CPI is negligible and on inflation expectations insignificant, so no price-stability trade-off; the monetary policy rate declines (accommodative complement). The news shock produces a gradual, persistent tightening, reduces credit, lowers credit pro-cyclicality, has muted effect on VSTOXX, and an insignificant price effect; the policy rate first rises then turns negative over the medium term. FEV decomposition: the two shocks combine to explain about half of credit variability after 24 quarters; neither shock exceeds 12% of core-CPI forecast variance and combined they never exceed 15% of prices. News shocks explain about 20% of credit forecast variance within the first quarter. Granger-causality and serial-correlation tests support exogeneity of both shocks.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two shocks driving non-systematic macroprudential variation are identified within a seven-variable Bayesian SVAR (1999:Q1–2019:Q2, 4 lags, Minnesota prior). The surprise (unanticipated) shock is identified by a Cholesky decomposition with EAMPP ordered first, so it can affect EAMPP contemporaneously. The news (anticipated) shock uses the Barsky-Sims (2011) forecast-error-variance maximization: it is the orthonormal column that maximizes the cumulated forecast error variance of EAMPP over horizons k=0 to k=24, subject to not affecting EAMPP contemporaneously and being orthogonal to the surprise shock. A key prior step is constructing a narrative EAMPP that drops all policies with a countercyclical design (those reacting to the financial cycle), making the remaining index exogenous in the Romer-Romer (2010) sense. The main threats are: foresight/anticipation contaminating shock identification (addressed by using announcement rather than enforcement dates and by identifying news shocks); reverse causality and contemporaneous effects that plague recursive/GMM panel approaches; and informational insufficiency (whether the series are genuine shocks), which the authors test via Granger causality against forward-looking credit-standard surveys and serial-correlation tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanism is that a tightening macroprudential stance curbs total credit to the private non-financial sector, which is the most robust predictor of financial crises, thereby moderating systemic risk and the build-up of excess credit during booms. Crucially, output responds in a way negatively correlated with credit, so the policy lowers the pro-cyclicality of credit (the key financial-stability gain). Surprise and news shocks are distinguished by their dynamics and by the FEV decomposition: news shocks dominate at short horizons (agents react quickly to signals, ~20% of credit forecast variance in the first quarter), while surprise shocks build gradually to a comparable share at medium-to-long horizons. The monetary-policy interaction is read off the policy-rate response: it moves accommodatively (declines) after a surprise tightening, complementing macroprudential policy without a price trade-off.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity or differences across shock types are documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The two shock types differ. The surprise shock causes an immediate credit drop that accentuates over the medium term and an accommodative (declining) monetary policy rate; VSTOXX first rises then falls below baseline. The news shock causes a gradual, persistent policy tightening, a credit decline that moderates before dropping again over the medium term, a muted VSTOXX response, and a monetary policy rate that first increases (complementing the tightening and reflecting a small initial price rise) then turns negative over the medium term. Core prices show a small initial increase under the news shock before declining, whereas the surprise shock barely affects core CPI. Both shocks ultimately lower credit pro-cyclicality and have insignificant effects on price stability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Several. (1) Alternative macroprudential target variables replacing total credit: a systemic-risk index (CISS) — results barely change; bank credit — results similar, with a more pronounced decline in bank credit; household credit — results similar but the household-credit decline is stronger, while under the surprise shock the credit decline becomes insignificant and output rises initially. (2) Replacing VSTOXX with VDAX (German analogue) — qualitatively the same. (3) Longer FEV truncation horizons k=30 and k=40 — quantitatively and qualitatively similar. (4) Including policies with missing announcement dates (182 of 1205 actions) in the empirical analysis — results barely change. (5) Granger-causality tests: the identified shocks are regressed on up to 3 principal components (explaining ~98.4% of variance) of seven forward-looking loan-officer credit-standard surveys; the null of no Granger causality cannot be rejected at any reasonable level (p-values range roughly 0.37–0.99). (6) Serial-correlation test regressing each shock on its own two lags: p-values 0.47 (surprise) and 0.77 (news), so no serial correlation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It relates to (a) empirical work on macroprudential effectiveness and its monetary-policy interaction (Cerutti et al., Alam et al., Akinci and Olmstead-Rumsey, Kuttner and Shim, Budnik and Kleibl, etc.), most of which uses cross-country panels with GMM and cannot make clean causal claims; and (b) the SVAR/news-shock identification literature robust to foresight (Barsky and Sims 2011; Leeper et al. 2013; Kurmann and Otrok 2013; Ben Zeev et al. 2019). The two prior SVAR studies extracting exogenous macroprudential variation are Kim and Mehrotra (2017, four Asia-Pacific countries) and Klingelhofer and Sun (2019, China), both using recursive Cholesky orderings. Like Klingelhofer and Sun, the authors find macroprudential shocks explain a meaningful share of credit but little of prices. Unlike those studies, they find a strong macroprudential-monetary link (EAMPP-policy-rate correlation about −0.90, versus roughly +0.25 for Asia-Pacific in Bruno et al. 2017), and they are the first to identify both surprise and news macroprudential shocks. The narrative exclusion of cyclically-reactive policies follows Romer and Romer (2010), Richter et al. (2019), and Rojas et al. (2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Macroprudential policy in the euro area effectively safeguards financial stability over the medium term by reducing credit growth, credit pro-cyclicality, and systemic risk, without a significant trade-off against price stability (the ECB&amp;rsquo;s primary target). Because more than one objective cannot be met with one instrument, monetary policy complements macroprudential policy: it can move accommodatively to offset output/credit declines, yielding an effective overall policy mix. Scope conditions: the conclusions are specific to the euro area over 1999:Q1–2019:Q2, a sample dominated by the GFC and sovereign debt crisis and by deflationary pressures (which is why the strong, negative macroprudential-monetary correlation may not generalize, e.g., to Asia-Pacific where the correlation is positive); the narrative EAMPP only captures proactive, long-run-financial-stability-motivated policies; and price-stability effects, while insignificant overall, carry wide estimate uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the paper use announcement dates rather than enforcement dates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because foresight problems arise from inside and outside lags (Leeper et al. 2013): about 54% of euro-area policy tools in MaPPED experience a delay between announcement and implementation. Using the enforcement date would contaminate the identification of an &amp;lsquo;unanticipated&amp;rsquo; shock, since agents would already know about the policy from its announcement, making the shock no longer exogenous. The authors assume agents react from the announcement moment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there notable caveats about the index and impulse responses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The first EAMPP value is not zero because 185 of 1205 policy actions were implemented before 1995, and MaPPED does not provide announcement dates for 182 of 1205 actions (assumed equal to enforcement dates only for the stylized-facts section; removed in the empirical analysis). GDP-share weights use the 2008–2015 average; time-varying weights have very limited impact since GDP shares are stable. Impulse responses report median with 16th and 84th posterior percentiles. The EONIA-shadow-rate splice is justified by a 0.98 correlation between the two over 2004:Q4–2008:Q4.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Macroprudential, Monetary Policy Synergies and Credit Supply: Evidence from Matched Bank-Firm Loan-Level Data in Brazil</title><link>https://macropaperwarehouse.com/papers/macroprudential-monetary-policy-synergies-and-credit-supply-evidence-from-matched-bank-firm-loan-level-data-in-brazil/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/macroprudential-monetary-policy-synergies-and-credit-supply-evidence-from-matched-bank-firm-loan-level-data-in-brazil/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Reserve requirements (RRs) were largely abandoned as a monetary tool in advanced economies after inflation targeting, but emerging markets (EMs) — especially Brazil — kept using them countercyclically before, during and after the GFC and COVID-19 (53 EMs eased RRs during the pandemic). Despite their wide use, there was scarce loan-level evidence on whether RRs actually manage domestic credit cycles through credit supply, and on whether they have synergies with the short-term policy rate. The paper fills this gap.&lt;/p&gt;
&lt;p&gt;Data and strategy: The authors use quarterly matched bank-firm loan-level data from Brazil&amp;rsquo;s credit registry (SCR), augmented with bank controls and firm employment data from RAIS, covering 2008Q1-2015Q2 (30 quarters). After cleaning and a 10% random firm sample, the working sample is 2,595,398 observations spanning 90,440 firms and 83 commercial banks. Identification rests on three moves: (1) firm-quarter fixed effects on multiple-bank-relationship firms (Khwaja-Mian/Jimenez approach) to absorb credit demand; (2) a bank-level counterfactual exposure variable, ΔResReq (the Camors et al. 2019 construction), measuring how much each bank is differentially &amp;ldquo;taxed&amp;rdquo; by RR rule changes given its ex-ante deposit mix, holding policy fixed at pre-September-2008 rules; ΔResReq averages -1.64 (sd 2.61) at bank level. (3) High-frequency monetary policy surprises (Kuttner 2001) from 30-day interest-rate swaps around Copom announcements, interacted with ΔResReq to identify policy synergies.&lt;/p&gt;
&lt;p&gt;Main findings (signs, magnitudes, scope): A 1 pp tightening of RRs reduces a bank&amp;rsquo;s credit to a firm by 0.52-0.56 pp next quarter (no firm-quarter FE), and -0.67 pp with firm-quarter FE — coefficient stability across saturations suggests exposure is orthogonal to demand. Private domestic banks are roughly twice as responsive: -1.39 pp (Table IV) and -1.68 pp in the synergies specification (Table V). With a simultaneous one-standard-deviation surprise policy-rate tightening, the response rises to -1.90 pp — evidence of monetary-macroprudential synergy. A comparable interest-rate surprise alone contracts credit 0.63 pp; a 1 pp Selic increase, 0.71 pp. Bank capital matters: a private domestic bank one sd above mean capital/assets cuts credit only 0.85 pp (vs 1.68 pp), implying capital-liquidity substitution — but only during tightening, not loosening. After controlling for heterogeneity, there is no significant tightening-vs-loosening asymmetry for private domestic banks; the asymmetry found in cross-country work is driven by less-responsive government and foreign banks (foreign banks fully mitigate loosening). Economic policy uncertainty (EPU, Baker-Bloom-Davis) weakens transmission: a 1 pp loosening raises credit 1.50 pp, but only 1.22 pp when EPU is one sd (71 points) higher — about 19% mitigation. Using an aggregate macroprudential index instead of bank exposure yields qualitatively similar but weaker effects (a 1 sd index move gives -1.43 pp vs -2.02 pp for the intensity-sensitive aggregate counterfactual), so cross-country index studies underestimate RR effects and overestimate asymmetries. At the firm level, firms do not insulate themselves (no leakage). Real effects on employment are modest and not economically significant: no significant hiring effect; a 1 pp RR loosening reduces firings by ~1.6% (all banks) / ~2% (private domestic), requiring an 8.33 pp loosening to prevent one additional firing.&lt;/p&gt;
&lt;p&gt;Implications: RRs are an effective state-contingent (Pigouvian) tax to manage domestic credit booms and busts via credit supply, can stimulate credit even with the policy rate unchanged (useful at the ELB or under &amp;ldquo;fear of floating&amp;rdquo;), and should be eased more aggressively when EPU is high.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three layers. First, firm-quarter fixed effects on firms with multiple bank relationships absorb firm-level credit demand (Khwaja-Mian/Jimenez et al. 2014), so the within-firm-quarter comparison isolates supply. Second, a bank-level counterfactual exposure variable, ΔResReq, measures differential RR &amp;rsquo;taxation&amp;rsquo; from each bank&amp;rsquo;s ex-ante deposit mix relative to pre-September-2008 rules, holding policy fixed — this separates RR supply effects from the policy rate and from aggregate credit-cycle dynamics. Third, high-frequency monetary policy surprises (one-day swap changes after Copom) provide exogenous variation in the policy rate for the synergy interaction. Main threats: (a) banks could shift their liability mix toward less-affected deposits (evasion) — addressed in Appendix A.3 (no significant deposit reallocation); (b) more-exposed banks could be differentially exposed to other macro shocks — addressed via &amp;lsquo;horserace&amp;rsquo; interactions with local and global variables (Tables VI-VII); (c) policy-rate endogeneity — addressed by using surprises; (d) excess/voluntary reserves as omitted variable — addressed in A.8-A.9 (insignificant). Coefficient stability when adding firm-quarter FE (Oster 2019) supports exogeneity of ΔResReq to demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core mechanism is RRs acting as a countercyclical Pigouvian tax that withdraws liquid funds during tightening (constraining supply) and injects cash during loosening (stimulating supply). The synergy mechanism is that simultaneous policy-rate tightening amplifies the RR credit-supply contraction (-1.68 to -1.90 pp for private domestic banks). The EPU mechanism is that high policy uncertainty makes banks more cautious, reducing the amplification of stimulus policy (loosening becomes ~19% less effective). These are distinguished by interacting ΔResReq separately with policy-rate surprises, with EPU, and with bank characteristics, all within the saturated firm-quarter FE model, and by running separate loosening vs tightening subsamples (16 loosening quarters, 14 tightening quarters).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By bank ownership: government and foreign banks are less sensitive to RRs (government banks lend countercyclically; foreign banks respond to home-country policy and fully mitigate loosening effects), while private domestic banks are about twice as responsive as the average bank. By capital: higher-capital private domestic banks are insulated from RR tightening (one sd above mean capital cuts the response from -1.68 to -0.85 pp), consistent with capital-liquidity substitution (Acosta-Smith et al. 2019); this insulation appears only during tightening, not loosening. By state of EPU: transmission is weaker when economic policy uncertainty is high. NPL share is not associated with lower credit growth during tightening as it is during loosening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(A.3) Bank-level panel regressing changes in savings/demand/time deposits on lagged exposure — no significant reallocation, so banks are not evading the policy. (A.4) Replicating Table V with the actual Selic change instead of surprises — a 1 pp RR tightening plus 1 sd (0.97) Selic tightening gives -2.02 pp (vs -1.9 pp with surprises). (A.5) Dropping influential policy quarters (2008Q4, 2009Q1, 2010Q1-Q2, 2010Q4, 2011Q1) — results unchanged. (A.6-A.7) Adding controls for ex-ante liability structure (shares of savings/time/demand deposits) — baseline qualitatively and quantitatively unchanged. (A.8-A.9) Controlling for / interacting with excess voluntary reserves (averaging 0.08% of liabilities) — insignificant and leaves estimates unchanged. Tables VI-VII horserace against local (inflation, GDP, current account, EPU) and global (Fed funds, US shadow rate, VIX, commodity prices, other macropru policies) variables — estimates stable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It uses the same counterfactual exposure variable as Camors et al. (2019), who studied RRs as a tax on dollar deposits in Uruguay; and relates to Epure et al. (2018) on Romania and the global financial cycle. Unlike that literature, which focuses on FX/dollar-denominated deposits and global-cycle spillovers, Brazil&amp;rsquo;s low foreign-debt banking sector lets the authors isolate RRs targeting the DOMESTIC credit cycle. They claim to be the first loan-level paper to estimate RR effects on domestic credit cycles while disentangling and documenting monetary-policy synergies, the first to link higher EPU to lower macroprudential effectiveness, and the first to assess bank capital&amp;rsquo;s mitigating role for RR tightening. Against the cross-country macroprudential-index literature (Cerutti-Claessens-Laeven 2017, Akinci-Olmstead-Rumsey 2018, Alam et al. 2019), which finds borrower-targeted tools stronger than bank-targeted RRs and tightening more effective than loosening, this paper shows the index approach ignores policy intensity and bank exposure, thereby underestimating RR effects and overestimating asymmetries. On real effects, modest employment results echo Richter, Schularick, and Shim (2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;RRs are effective for managing domestic credit booms and busts through credit supply, and can stimulate credit even when the policy rate is unchanged — relevant for EMs at the effective lower bound or constrained by &amp;lsquo;fear of floating&amp;rsquo; from using the policy rate countercyclically. Synergies with the policy rate are relevant and significant mainly during tightening (statistically weaker, for firms, during loosening). Because high EPU mutes the stimulus, policymakers trying to unfreeze credit (e.g., COVID-19) must ease RRs more aggressively when policy uncertainty is high. Scope conditions: results are estimated on Brazil 2008-2015, on multiple-bank-relationship firms, for credit in local currency, with the strongest responses concentrated in lower-capital private domestic banks; real effects on employment are modest and not economically significant in either direction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there leakage or general-equilibrium concerns at the firm level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors test whether firms insulate themselves by substituting toward less-affected banks (Jimenez et al. 2017 found full insulation for Spanish dynamic provisions). Using firm-level regressions (equation 10), they find firms associated with more-exposed banks are NOT insulated from either loosening or tightening — strong effects survive at the firm level — so the transmission channel does not &amp;rsquo;leak,&amp;rsquo; confirming RRs are effective at dampening credit booms in aggregate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the relationship between the policy variables and the credit cycle in the raw data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Changes in RRs track aggregate bank credit countercyclically: the correlation between the system-wide counterfactual RR variable and aggregate credit is 0.50, far above the 0.14 correlation between credit growth and CPI inflation, supporting the financial-stability (not inflation) motivation. The correlation between RR changes and the Selic policy rate is 0.31, motivating the need to disentangle the two instruments.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Merger guidelines for the labor market</title><link>https://macropaperwarehouse.com/papers/merger-guidelines-for-the-labor-market/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/merger-guidelines-for-the-labor-market/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. Antitrust review of mergers has historically focused almost entirely on harm to consumers (product-market monopoly), ignoring harm to workers (labor-market monopsony). Following the July 2021 White House executive order and the DOJ&amp;rsquo;s monopsony-based challenge to the Penguin Random House (PRH)/Simon &amp;amp; Schuster (SS) publishing merger, the agencies are now putting buyer power at the center of policy. The paper asks: how should Herfindahl-based merger-review thresholds, designed for product markets, perform if applied to local labor markets, and what efficiency gains would a merger need to leave workers unharmed?&lt;/p&gt;
&lt;p&gt;Model and data. The authors extend Berger, Herkenhoff, and Mongey (2022, &amp;ldquo;BHM&amp;rdquo;) to allow multi-plant (post-merger) ownership. The model has a representative household supplying labor through a nested-CES system (within-market substitutability governed by eta, across-market by theta, with eta &amp;gt; theta &amp;gt; 0), firms competing in quantities (Cournot/oligopsony), heterogeneous firm productivity, decreasing returns to scale, and capital. Firms set wages as a variable markdown on the marginal revenue product of labor; the markdown depends on the firm&amp;rsquo;s local payroll share. Markets are defined as 3-digit NAICS by commuting zone. Calibration is taken directly from BHM using confidential US Census data (LBD). Key estimated values: theta = 0.42 and eta = 10.85 (the elasticity-substitution parameters; the paper also reports theta = 0.45 in one passage), productivity dispersion sigma_z, returns to scale alpha, etc. The average market has 113 firms, an HHI of 0.11 (about nine equal firms), the average firm share is ~0.02, and the employment-weighted average markdown is 0.72 (workers paid 72% of marginal revenue product), equivalent to a labor-supply elasticity of 2.57.&lt;/p&gt;
&lt;p&gt;Theory. Proposition 1 shows that, absent efficiency gains, a within-market merger equalizes the two merged plants&amp;rsquo; markdowns at the level implied by their combined share, depresses both merging plants&amp;rsquo; wages, lowers the market wage index and employment, and reduces total worker pay. Non-merging firms&amp;rsquo; shares rise and they expand, so the actual rise in concentration is smaller than a &amp;ldquo;naive&amp;rdquo; calculation (adding pre-merger shares) would predict. Under the monopsony limit (infinitely many firms, or eta = theta), mergers have no effect.&lt;/p&gt;
&lt;p&gt;Main quantitative findings. (1) Model validation: replicating Arnold (2020), the model generates a change in log employment of -9.0 (vs Arnold -14.4, about three-fifths), log earnings -0.7 (vs -0.8), log payroll -10.5 (vs -12.1); earnings fall -4.4% in high-concentration vs -1.1% in medium-concentration markets (Arnold: -3.1% and -0.8%); the naive-concentration regression coefficient is 0.893 (Arnold 0.834), both below one. (2) PRH/SS simulation (PRH 37% share, SS 12%): with no efficiency gains the merger cuts author wages by 5%; the Required Efficiency Gain (REG) for worker-surplus neutrality is 17%. A merger of the two largest publishers gives -10% wages and a 30% REG; the two smallest Big Five give a 13% REG. (3) Applying product-market thresholds to labor markets via a 200,000-market simulation: under the stricter 1982 guidelines (block if post-merger HHI &amp;gt; 1800 and Delta-HHI &amp;gt; 100), the average REG of permitted mergers is 4.68%; under the looser 2010 guidelines (HHI &amp;gt; 2500, Delta-HHI &amp;gt; 200) it is 5.96%. Thus at the standard assumed 5% efficiency gain, 1982-permitted mergers raise the wage index (+0.04%) while 2010-permitted mergers lower it (-0.14%) and harm workers. (4) The Gross Downward Wage Pressure Index (GDWPI) equals (1/theta - 1/eta) times the other plant&amp;rsquo;s payroll share. Among mergers with GDWPI &amp;gt; 5% at both plants, more than 80% require a REG of at least 5.8% (20th-percentile REG = 5.8%, median 6.4%); among GDWPI &amp;gt; 10% at both plants, more than 80% generate a welfare loss under an assumed 5% efficiency gain.&lt;/p&gt;
&lt;p&gt;Implications. Product-market thresholds are too lenient for labor markets because labor is harder to substitute than products (low theta). The framework lets regulators trade off Type I error tolerance and efficiency-gain priors to set concentration thresholds.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/estimation strategy for the key parameters, and what are the threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is not separately estimated; calibration is inherited wholesale from BHM (2022). The crucial labor-supply substitution parameters theta (across-market) and eta (within-market) are estimated in BHM from tradeable firms&amp;rsquo; market-share-dependent employment responses to corporate tax changes, identifying how much firms with different market shares move employment when after-tax returns change. Productivity dispersion sigma_z matches the payroll-weighted HHI, alpha matches labor&amp;rsquo;s share, gamma the capital share, Z mean firm size, and phi mean worker earnings. Main threats: (i) theta and eta are estimated from tradeable (largely manufacturing) firms and held fixed economy-wide, while the authors acknowledge no economy-wide substitutability estimates exist outside manufacturing; (ii) markets are defined by NAICS3-by-CZ rather than occupation (the conceptually preferred unit), because occupation codes are unavailable for the universe of workers; (iii) the whole exercise relies on the calibrated structure being the right laboratory.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the model validated out of sample?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By replicating Arnold (2020), who estimates causal labor-market effects of US mergers. The authors draw and merge two firms per market, impose a pre-merger employment cutoff (tilde-n = 46, about five times average firm size) so that median pre-merger employment matches Arnold&amp;rsquo;s sample (116), and run Arnold&amp;rsquo;s exact regressions on simulated data. The model reproduces the sign and roughly the magnitude of employment and wage declines, the concentration interaction (effects more than three times larger in high-concentration markets), and the sub-one naive-concentration coefficient. This is out-of-sample because none of these moments were targeted in calibration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the central welfare metric and policy quantity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Worker Surplus Neutrality: a merger is worker-surplus neutral if the market-level wage index W_j is unchanged (using a household problem in which profits are NOT rebated, to mirror the product-market consumer-surplus standard). The key policy object is the Required Efficiency Gain (REG, Delta-star): the common post-merger productivity gain at both plants needed to keep W_j constant. By Proposition 1.5 the REG is always positive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms, and what is downward wage pressure specifically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Market power comes from costly worker mobility within (eta) and across (theta) markets. When two plants merge, hiring at Plant 1 raises the market wage and thus the wage the merged firm must pay its inframarginal workers at Plant 2 (and vice versa). The merged firm internalizes this cross-plant cost, which acts like a per-worker &amp;rsquo;labor cannibalization tax,&amp;rsquo; lowering the marginal benefit of hiring at both plants, so it hires less and pays less. Downward wage pressure at Plant 1 equals n_2j times the derivative of w_2j with respect to n_1j; in share form DWP_1j = w_1j (1/theta - 1/eta) s_2j. The GDWPI normalizes this by the wage: GDWPI_1j = (1/theta - 1/eta) s_2j, bounded in [0, theta^-1 - eta^-1], interpretable as a wage tax rate. Larger partner share and higher within-market substitutability (eta) raise downward pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Effects vary strongly with concentration: earnings fall -4.4% in high-concentration markets vs -1.1% in medium-concentration markets (model). Effects depend on the merging firms&amp;rsquo; shares: assuming a 5% efficiency gain, fewer than 12.1% of mergers in which the smaller firm&amp;rsquo;s payroll share exceeds 5% yield a worker-surplus gain. REGs differ across publisher pairings in the PRH case (17% for PRH+SS, 30% for the two largest, 13% for the two smallest). The model also generates wide firm-level variation in markdowns (small firms near competitive, large firms marked down well below 0.72).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What do the confidence/threshold figures show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fixing a 5% efficiency gain, the simulation reports the fraction of mergers yielding a worker-surplus gain by concentration cell. 89.5% of mergers with post-merger HHI &amp;lt; 500 and Delta-HHI &amp;lt; 50 yield gains. Under the 2010 highly-concentrated definition (HHI &amp;gt; 2500, Delta-HHI &amp;gt; 100 in the cited cell), fewer than 34.8% yield gains. A merger with small-firm share 4% and large-firm share 18% has a 69.7% chance of a worker-surplus gain at 5% efficiency, rising to 97.7% at a 10% efficiency gain. This lets a regulator pick thresholds for a desired Type I error tolerance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How sensitive are results to the assumed efficiency gain?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Highly. Under 1982 guidelines, permitted mergers change average W_j by -0.40% at 1% efficiency, &amp;hellip; up to +0.04% at 5% efficiency; blocked mergers fall -7.39% (1%) to -5.99% (5%). Under 2010 guidelines, permitted mergers fall -0.63% (1%) to -0.14% (5%); blocked mergers fall -10.37% (1%) to -8.61% (5%). The 5% benchmark (Farrell-Shapiro) is itself questioned: Blonigen and Pierce (2016) find roughly zero or negative merger productivity gains, implying even the 1982 thresholds may be too lenient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends BHM by adding multi-plant ownership and merger analysis. Relative to Nocke and Schutz (2018a,b) and Nocke and Whinston (2022), who derive product-market merger comparative statics under Bertrand competition (and, for Nocke-Whinston, CRS), this paper derives results for the LABOR market under nested-CES supply, Cournot competition, decreasing returns to scale, and endogenous household income. Relative to Naidu, Posner, Weyl (2018) and Marinescu-Hovenkamp (2019), who translate downward-wage-pressure concepts but assume symmetric firms, this paper provides a downward-wage-pressure test with firm heterogeneity across and within markets and shows it can be computed from readily available payroll shares and existing eta/theta estimates. It empirically benchmarks to Arnold (2020) and Prager-Schmitt (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Product-market HHI thresholds are too lenient when applied to labor markets: at an assumed 5% efficiency gain, 1982 thresholds (1800/100) keep permitted mergers worker-surplus neutral while 2010 thresholds (2500/200) do not. Scope conditions: (i) results hinge on the assumed efficiency gain (which empirical evidence suggests may be well below 5%); (ii) the framework treats product-market effects as &amp;lsquo;out of market&amp;rsquo; and should be combined with consumer-harm analysis; (iii) parameters are economy-wide benchmarks that may not fit a specific industry; (iv) market definition (NAICS3-by-CZ) matters, though the low estimated theta makes it consistent with a hypothetical-monopsonist test. The framework can be modified to add monopolistic pricing or variable markups (e.g., Deb et al. 2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Are there internal inconsistencies a reader should note?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Table 1 reports theta = 0.42 (and 1.49 as the data moment), but the text at one point states &amp;rsquo;theta = 0.45, and eta = 10.85, giving theta^-1 - eta^-1 = 2.29.&amp;rsquo; The 2010 threshold is described in the abstract/Section 3 as Delta-HHI &amp;gt; 200 but the headline simulation result (4.68% vs 5.96%) compares &amp;lsquo;1800/100&amp;rsquo; against &amp;lsquo;2500/200&amp;rsquo;, and one passage lists the 2010 thresholds as (2500, 200) while the highly-concentrated text uses Delta-HHI of 200 for presumption and 100 in a figure cell. These are presentational; the substantive ranking (1982 stricter, 2010 more lenient) is robust.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;!-- flags: Internal parameter inconsistency: Table 1 reports theta=0.42 but text states theta=0.45 in the GDWPI bound passage (theta^-1 - eta^-1 = 2.29)., Threshold reporting: 1982 simulation uses Delta-HHI&gt;100 while Section 3 text also references Delta-HHI thresholds of 100/200; the headline comparison is 1800/100 vs 2500/200., Efficiency-gain assumption of 5% (Farrell-Shapiro) is load-bearing for the 'workers harmed under 2010 guidelines' conclusion; paper itself notes empirical evidence (Blonigen-Pierce 2016) of near-zero gains. --&gt;</description></item><item><title>Monetary and Macroprudential Policies under Dollar-Denominated Foreign Debt</title><link>https://macropaperwarehouse.com/papers/monetary-and-macroprudential-policies-under-dollar-denominated-foreign-debt/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-and-macroprudential-policies-under-dollar-denominated-foreign-debt/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Emerging economies have rapidly accumulated foreign-currency (mostly dollar) debt — the dollar share of 14 emerging economies&amp;rsquo; foreign debt rose from 75% in 2010 to 81% in 2018. Such debt is dangerous because sudden stops in capital inflows cause sharp currency depreciation that mechanically raises the domestic-currency value of the debt. The paper asks: when a country holds dollar-denominated foreign debt, does macroprudential policy mitigate depreciation and downturns during sudden stops, how should monetary policy be conducted, and how should the two policies cooperate? Existing sudden-stop models (with loan-to-value/debt-to-income collateral constraints and pecuniary externalities) do not model the channel by which depreciation inflates the value of dollar debt.&lt;/p&gt;
&lt;p&gt;Model setup: The author builds a small open economy in the tradition of Bianchi and Mendoza (2018), with three innovations: (1) foreign debt is denominated in foreign currency; (2) home tradable exports face a downward-sloping foreign demand (price elasticity rho &amp;gt; 1); (3) New Keynesian (Rotemberg) price stickiness to give monetary policy a role. The borrowing constraint is occasionally binding and the borrowing limit is denominated in domestic currency, creating a currency mismatch between foreign borrowing and the limit. The author deliberately abstracts from the collateral-asset-price pecuniary externality (assets valued at book value) to isolate a new balance-of-payments (BOP) externality. The model is solved with a global numerical method; each period is a year. Calibration targets the average of the 14 countries: discount factor beta = 0.92 (to hit mean foreign-debt-to-GDP of 40%), R* = 1.04, labor share = 0.66, imported-input share targeting import-to-GDP of 22%, theta = 8, price-adjustment cost psi = 50, export price elasticity rho = 3, tight borrowing limit kappa = 0.2 set so the unconditional crisis probability is 7.2%; productivity and interest-rate processes are from Mendoza (2010, Mexican data).&lt;/p&gt;
&lt;p&gt;Key mechanism: When the borrowing constraint binds, large debt repayment with limited new borrowing forces net capital outflows, which require larger net exports and thus real depreciation (because exports face downward-sloping demand). Depreciation raises the domestic-currency value of debt repayment, forcing further outflows and a second-round depreciation — an amplification loop. Because households take the exchange rate as given, they socially overborrow ex ante (&amp;ldquo;ex ante BOP externality&amp;rdquo;) and use too many imported inputs during crises (&amp;ldquo;ex post BOP externality&amp;rdquo;), both producing inefficiently large depreciation. Social costs are twofold: imported inputs become inefficiently expensive (lowering output, explaining the output drop without working-capital financing), and an inefficiently large share of output is exported (lowering consumption).&lt;/p&gt;
&lt;p&gt;Main findings: The optimal discretionary monetary policy (without taxes) is contractionary both when the constraint is slack (to discourage overborrowing via real appreciation raising the effective interest rate) and when it binds (to discourage imported-input use). But anticipation of crisis-time intervention lowers the ex ante effective interest rate and induces larger borrowing, destabilizing the economy. In crisis dynamics, without taxes the real exchange rate depreciates 10% under inflation targeting vs 6% under discretion; output drops 6.2% under targeting vs 14.4% under discretion. With macroprudential taxes, depreciation is 6% (targeting) vs 2% (discretion), and output drops 3.8% (targeting) vs 9.2% (discretion). Under taxes, foreign debt at the stochastic steady state is 6-7% smaller. Welfare (permanent-consumption metric, benchmark = inflation targeting without taxes): discretion without taxes is worse by 0.02%; evaluated at the simulation-mean foreign bond (-0.45), discretion with taxes gives +0.07% and targeting with taxes gives +0.03%. If the simulation starts with a binding constraint, the welfare gain under discretion with taxes can reach about 0.2%. Implication: the optimal mix is an ex ante macroprudential tax on foreign borrowing to correct overborrowing plus ex post monetary intervention to mitigate depreciation; monetary intervention improves welfare only when paired with the macroprudential tax.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core theoretical mechanism (the &amp;ldquo;amplification loop&amp;rdquo;) and why does it require a currency mismatch?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the borrowing constraint binds, the country must repay outstanding foreign debt with only limited new borrowing, producing net capital outflows that must be matched by larger net exports via the balance-of-payments identity. Since exports face downward-sloping foreign demand, this requires real depreciation. Depreciation raises the domestic-currency value of the foreign-currency debt repayment (-e_t b*_{t-1}), but new borrowing is capped by the domestic-currency-denominated limit kappa*k, so the depreciation forces a cut in new borrowing, generating further outflows and a second-round depreciation. The loop continues. The currency mismatch — foreign-currency debt against a domestic-currency borrowing limit — is crucial: the author states explicitly that if the borrowing limit were denominated in foreign currency, the amplification loop would not occur.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two externalities and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;ex ante BOP externality&amp;rdquo; distorts borrowing in normal times: households do not internalize that reducing foreign debt today would reduce next-period net capital outflows and mitigate depreciation if the constraint binds, so they overborrow. The &amp;ldquo;ex post BOP externality&amp;rdquo; distorts imported-input use when the constraint is binding: households do not internalize that cutting imported inputs would improve the trade balance and mitigate depreciation, so they use socially excessive imported inputs. Both are formalized through the planner&amp;rsquo;s Lagrange multiplier gamma^SP_t (social value of real appreciation through BOP adjustment), which is strictly positive given rho&amp;gt;1 and negative net foreign assets. The ex ante term appears in the foreign-bond Euler equation; the ex post term appears in the imported-input first-order condition and is positive only when the constraint binds (mu^SP_t &amp;gt; 0).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the optimal discretionary monetary policy contractionary in both states, and what does &amp;ldquo;contractionary&amp;rdquo; mean here?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The target inflation is zero (Rotemberg cost), so positive inflation is &amp;ldquo;expansionary&amp;rdquo; and negative inflation &amp;ldquo;contractionary.&amp;rdquo; When the constraint is slack but may bind, contractionary policy causes real appreciation, which raises the effective interest rate on foreign borrowing (via the exchange-rate term in the Euler equation), discouraging borrowing and partially correcting overborrowing. When the constraint binds, contractionary policy discourages production and imported-input use, improving the trade balance and partially correcting the ex post externality. Proposition 1 and Corollary 1 establish that strict inflation targeting is not optimal and that the optimal discretionary policy is contractionary in both states. Crucially, this period-by-period optimality does not imply discretion dominates inflation targeting in welfare, because it ignores how anticipation of future intervention shapes ex ante borrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does adding a macroprudential tax change the optimal monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With an optimal time-consistent macroprudential tax on foreign borrowing available, Proposition 2 / Corollary 2 show the optimal discretionary monetary policy becomes pi_t = 0 when the constraint is not binding (the tax now corrects overborrowing, so the eta^EE term is zero and monetary policy focuses only on minimizing price-adjustment cost) but remains contractionary (pi_t &amp;lt; 0) when the constraint binds — because the ex ante tax cannot correct the ex post externality of excessive imported inputs during a crisis. The macroprudential tax is strictly positive whenever there is positive probability the constraint binds next period, and rises with outstanding debt; it is notably higher under discretion (by about 0.6% before a crisis) to offset the extra overborrowing induced by anticipated intervention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the quantitative crisis-dynamics evidence across the four regimes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Crisis is defined as the current account exceeding two standard deviations above its long-run mean; crisis events are picked under inflation targeting without taxes. Real exchange rate depreciation: 10% (targeting, no tax), 6% (discretion, no tax), 6% (targeting, with tax), 2% (discretion, with tax). Output drop: 6.2% (targeting, no tax), 14.4% (discretion, no tax), 3.8% (targeting, with tax), 9.2% (discretion, with tax). Macroprudential taxes reduce pre-crisis debt and capital-flow reversals; discretion raises pre-crisis debt through anticipation of intervention. Standard deviations (relative to targeting-no-tax = 100%): under discretion with tax, real exchange rate volatility falls to 37.9% and current-account/GDP to 82.0%, while output is 111.7% and consumption 88.3% — i.e., discretion lowers exchange-rate volatility but raises output/consumption volatility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the welfare results and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare is measured as permanent-consumption gain/loss relative to inflation targeting without taxes. Without taxes, discretion is slightly worse (-0.02%). Evaluated at the simulation-mean foreign bond (-0.45) with no borrowing-limit shock at the initial period: discretion with tax gives +0.07%, inflation targeting with tax gives +0.03%. When a borrowing-limit shock hits at the initial period (constraint binding): discretion without taxes gives +0.03% and with taxes +0.09%, with larger gains for larger initial debt; the gain can be as high as about 0.2% when the simulation starts with the constraint binding. Scope condition: monetary intervention during a crisis improves welfare ONLY when combined with an ex ante macroprudential tax; absent the tax, anticipation of intervention induces overborrowing and reduces welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from closely related prior work (Fornaro 2015, Ottonello 2015, Mendoza and Rojas 2019, Devereux et al. 2018, Coulibaly 2018)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fornaro (2015) and Ottonello (2015) introduce nominal wage rigidities and emphasize the BENEFIT of depreciation (boosting exports, reducing unemployment); this paper emphasizes the NEGATIVE effect of depreciation through inflating the value of foreign-currency debt. Mendoza and Rojas (2019) model depreciation as REDUCING the debt-repayment burden (depreciation lowers the consumption-composite real interest rate); here depreciation increases the burden. Devereux et al. (2018) and Coulibaly (2018) are closest — both add NK price stickiness and study monetary-macroprudential combinations — but in those the collateral channel/asset price drives the externality; this paper&amp;rsquo;s contribution is to study optimal policy where depreciation raises the domestic-currency value of foreign debt and causes a severe crisis. The welfare result (inflation targeting dominates discretion without taxes, but discretion preferable with the optimal tax) mirrors Coulibaly (2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the optimal policy time-consistent, and how is the planner&amp;rsquo;s problem set up?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The BOP externalities themselves do not generate time inconsistency (the macroprudential tax in this model is time consistent, unlike pecuniary externalities from collateral asset prices). However, NK price stickiness can create time inconsistency via firms&amp;rsquo; forward-looking pricing, so the author assumes no commitment and solves for time-consistent policy in a Markov perfect equilibrium: each period&amp;rsquo;s planner optimizes taking future planners&amp;rsquo; rules as given while internalizing how current policy affects them, and the optimal rules coincide with those expected by past planners. The Ramsey planner maximizes household utility subject to the decentralized equilibrium conditions as implementability constraints. The nominal interest rate R_t is backed out from the Euler equation after other variables are pinned down.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What real-side outcome does the model explain without standard assumptions, and what is the consumption-labor trade-off in welfare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model explains the output drop during sudden stops WITHOUT working-capital financing (commonly assumed in the literature): the inefficiently expensive imported inputs caused by real depreciation directly reduce output. On welfare, although contractionary monetary intervention causes output and labor (hence labor disutility) to drop more under discretion, consumption does not drop as much because mitigated depreciation means smaller exports and a larger share of output consumed domestically. Period utility (consumption minus labor disutility) can therefore be slightly higher under discretion when combined with taxes. An appendix (Section F) with fixed labor and no labor disutility shows monetary intervention under discretion actually raises crisis-period consumption above inflation targeting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness/extensions does the paper note?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section E of the appendix studies the model WITH the asset-price pecuniary externality (as in Bianchi and Mendoza 2018), which the baseline shuts off via book-value asset valuation. Section A proves the constant tax tau_m = 1/(rho-1) corrects the terms-of-trade externality. Section F examines fixed labor supply with no labor disutility. The conclusion proposes three extensions: foreign-reserve accumulation and reserve interventions (as in Arce et al. 2019), endogenous choice of borrowing currency, and introducing financial intermediaries with currency mismatch (as in Aoki et al. 2018 and Mendoza and Rojas 2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a theoretical/quantitative DSGE exercise, not an empirical-identification paper, so there is no causal identification strategy in the econometric sense; the model is calibrated (not estimated) to standard literature values and the average of 14 emerging economies. Results depend on parameter choices, notably the export price elasticity rho = 3 (within Simonovska-Waugh&amp;rsquo;s 2.79-4.46 range) and the domestic-currency denomination of the borrowing limit, which is essential to the amplification loop. The author also notes that introducing imported-input taxes only during crises may be difficult to implement in practice, motivating reliance on monetary policy for ex post intervention.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Monetary Policy When Preferences Are Quasi-Hyperbolic</title><link>https://macropaperwarehouse.com/papers/monetary-policy-when-preferences-are-quasi-hyperbolic/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-when-preferences-are-quasi-hyperbolic/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Experimental and survey evidence robustly documents &amp;ldquo;present bias&amp;rdquo; — people are more impatient over the short run than the long run, producing preference reversals inconsistent with standard exponential discounting. Dennis and Kirsanov ask how this behavioral feature, modeled as quasi-hyperbolic (quasi-geometric) discounting, changes the optimal conduct of monetary policy. Prior macro work on quasi-hyperbolic discounting concentrated on growth models, consumption/saving, and multiple equilibria; almost none examined monetary policy. The paper fills this gap.&lt;/p&gt;
&lt;p&gt;Model setup: A nonlinear New Keynesian business-cycle model with monopolistically competitive firms that own capital, hire labor (Cobb-Douglas, alpha=0.33), and set prices subject to Rotemberg (1982) quadratic adjustment costs (omega=100, roughly a Calvo model with 1-year average price duration). Households consume a Dixit-Stiglitz bundle, supply labor, and save via one-period nominal bonds (zero net supply) and equities (fixed net supply of 1). Preferences are quasi-hyperbolic: the discount sequence is 1, beta&lt;em&gt;theta, beta&lt;/em&gt;theta^2, &amp;hellip; with theta in (0,1) the usual geometric factor and beta the present-bias factor (beta=1 restores geometric discounting; beta&amp;lt;1 is greater short-run impatience). Three shocks: technology, cost-push (elasticity/markup), and labor-supply. The central bank shares household momentary utility and sets the nominal bond return optimally under discretion (its discount factors gamma, xi may differ from household&amp;rsquo;s beta, theta); a Taylor-type rule is the comparison. The model is solved globally with Chebyshev polynomials and Gaussian cubature to obtain a unique interior solution to generalized Euler equations, avoiding log-linearization indeterminacy. A period is a quarter; theta=0.99, sigma=1 (log utility), Frisch elasticity nu=1, chi=1, depreciation delta=0.025, steady-state elasticity epsilon=11 (10% markup). The authors restrict attention to beta in [0.90, 1] because experimentally plausible values (beta around 0.60, per Meier-Sprenger 2015 and Wang-Rieger-Hens 2016, median ~0.60) generate implausible/extreme general-equilibrium outcomes.&lt;/p&gt;
&lt;p&gt;Main quantitative findings (benchmark, central bank benevolent, beta=gamma): (1) Greater present bias lowers saving and capital accumulation. Lowering beta=gamma from 1.0 to 0.9 reduces output by about 10% (10.02%), with capital falling much more (24.55%), labor much less (1.84%), consumption 6.02%, and the real wage 7.77%; cutting beta to 0.7 cuts output ~30% (roughly linear). (2) Discretionary policy still produces positive average inflation (inflation bias), but the bias is SMALLER under present bias: average inflation falls from 2.553% (beta=1) to 2.362% (beta=0.9) under discretion, because firms, whose equity holders discount hyperbolically, spread costly price changes over time — present bias acts like greater price rigidity, so smaller inflation surprises suffice. (3) Asset returns balloon: a nonpecuniary return to capital (1-beta)/beta * KK(Z) appears, raising the total return on capital rcap and spilling into bonds. At beta=0.9 (discretion) the net real return on capital reaches 48.928% and the real interest rate 48.926% (annualized), versus ~4.0% at beta=1 — well above observed real rates, so experimentally-sized present bias is wildly counterfactual in general equilibrium. (4) The Taylor rule increasingly underperforms optimal discretion as households become more impatient (suboptimal-policy cost lambda_S rises with present bias). (5) Quasi-hyperbolic and geometric discounting are NOT equivalent because of the nonpecuniary (time-inconsistency) return to capital.&lt;/p&gt;
&lt;p&gt;Policy implications: A benevolent central bank (sharing household preferences) keeps steady-state inflation under control across a wide range of discount factors. If instead the central bank does NOT adopt household time preferences and tries to discourage early consumption/delayed saving, it achieves only a marginal output gain at the cost of much higher average inflation. Conversely, delegating policy to a central banker who is MORE present-biased than households raises household welfare (akin to Rogoff&amp;rsquo;s conservative central banker), because it emphasizes the current-period cost of changing prices, lowering inflation volatility and average inflation toward zero.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the model&amp;rsquo;s solution strategy and why does it matter for the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is solved as a fully nonlinear global problem rather than log-linearized. The authors use Chebyshev polynomials (giving continuous decision rules and derivatives) and compute expectations via Gaussian cubature instead of finite-state Markov chains. They impose symmetry across households and firms in equilibrium (kt=Kt, ct=Ct, etc.; bonds in zero net supply Bt=0, stocks fixed St=1) and solve the interior solution to a system of generalized Euler equations, following Maliar and Maliar (2005). This matters because quasi-hyperbolic discounting creates strategic interaction between the household and its future self that can generate multiple equilibria (Krusell and Smith 2003); log-linearization can introduce indeterminacy (Maliar and Maliar 2006a). Allowing a large domain for wealth/capital is, per Cao and Werning (2018), key to ruling out local multiplicities. The result is a unique stable equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the central economic mechanism through which present bias affects asset returns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Equation (25): the total gross return on capital equals the pecuniary part (shadow rental rate rk + 1 - delta) PLUS a nonpecuniary part (1-beta)/beta * KK(Z), where KK(Z) is the derivative of next period&amp;rsquo;s capital decision rule with respect to current capital. This nonpecuniary term arises only under time inconsistency (it vanishes when beta=1): the firm/household uses capital accumulation to constrain its future self. Even small present bias makes this term large, raising rcap; because households arbitrage between stocks and bonds (bonds offer no nonpecuniary return), the real bond rate rises commensurately. This is why beta=0.9 pushes real rates to ~49% — counterfactual — and why the paper restricts to beta in [0.90,1].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does present bias REDUCE the discretionary inflation bias rather than raise it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Quasi-hyperbolic discounting weights the cost of changing prices today more heavily than future price-change costs (since firms&amp;rsquo; equity holders discount the future more). When shocks hit, firms make smaller price changes now and defer the rest, so present bias acts like an increase in price rigidity. The central bank then calculates that smaller inflation surprises are enough to boost output to the efficient level, so equilibrium average inflation falls (2.553% at beta=1 down to 2.362% at beta=0.9 under discretion). The structure of the policy trade-off (eq. 21) is unchanged by present bias; only the relative costs and benefits shift.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the three shocks differ in their interaction with present bias?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Technology shock (Fig 1): financial variables are affected most; relative to geometric baseline, consumption rises more and labor rises less, pushing real wages and real marginal costs up; the real and nominal interest rates rise by more due to increased demand for current consumption. Price-elasticity/cost-push shock (Fig 2): responses are generally more muted; labor rises less, consumption more, inflation falls by less (firms defer price changes); the real interest rate and nominal bond return are the most sensitive variables. Labor-supply shock (Fig 3): an adverse shock raises labor disutility, cutting labor, output, consumption, investment and capital while raising the real wage; inflation and real marginal costs are little affected, and policy eases (real and nominal rates fall); present bias mainly amplifies consumption/investment responses and raises impact responses, increasing unconditional volatility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What welfare measures are used and how do they move with present bias?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three consumption-equivalent costs: lambda_C (Lucas 1987 cost of business cycles), lambda_B (magnitude of the present bias), and lambda_S (cost of the suboptimal Taylor rule vs. optimal discretion). Greater present bias lowers the utility level U, raises lambda_C (e.g., 0.033 to 0.045 under discretion as beta=gamma goes 1.0 to 0.9), and raises lambda_B substantially (0 to 2.808). lambda_B rises much more than lambda_C, showing that discounting future consumption dominates cyclical-volatility effects. lambda_S also rises, meaning the Taylor rule becomes progressively more costly relative to discretion as households grow more impatient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the comparison of quasi-hyperbolic vs. geometric discounting (Table 3) show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Comparing quasi-hyperbolic (beta=gamma=0.99, theta=0.99) to a geometric model (beta=1, theta=0.992) calibrated to be comparable: the geometric model produces LOWER average capital, labor, output, consumption, investment, and real wage. Under quasi-hyperbolic discounting, household ownership of capital generates a nonpecuniary return that compensates for the lower rental rate and encourages higher saving, so the capital stock is larger even though the marginal product and rental rate of capital are lower. The two are genuinely non-equivalent because of the time-inconsistency-driven nonpecuniary return. Welfare cost of business cycles is higher under geometric than quasi-hyperbolic discounting and higher under the Taylor rule than optimal discretion; to be compensated for the Taylor rule&amp;rsquo;s suboptimality households would require a permanent consumption increase of 0.07% (geometric) or 0.10% (quasi-hyperbolic).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the policy-delegation result and its scope condition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In Section 6 the central bank&amp;rsquo;s discount factor gamma is allowed to differ from the household&amp;rsquo;s beta. Allowing the central bank to be MORE present-biased than households (lower gamma) raises household welfare: welfare is higher in column (2) (gamma=0.9, beta=1) than column (1) (both =1), and higher in column (3) (both=0.9) than column (4) (beta=0.9, gamma=1). The mechanism is that a more present-biased central banker emphasizes the current-period cost of changing prices — like greater price rigidity or a conservative (Rogoff 1985) central banker — yielding less volatile and lower average inflation (e.g., inflation drops to 0.699% in column 2). Effects on real variables are small; effects on nominal variables are larger and quantitatively significant. This parallels Dennis (2014), where distorting the discretionary central bank&amp;rsquo;s objective (risk-sensitivity) improved welfare. Scope: this holds because policy is conducted under discretion, which is suboptimal; under commitment the delegation logic would differ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Where does present bias enter, and not enter, the equilibrium conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It does NOT enter the household&amp;rsquo;s intratemporal labor-leisure condition (eq. 7) or the firm&amp;rsquo;s static conditions defining the rental rate and real wage (eqs. 12-13). It enters the bond and stock Euler equations (eqs. 8-9) and the Phillips curve (eq. 11) only by changing how next period is discounted (via beta*theta). Most importantly, it enters the firm&amp;rsquo;s capital-accumulation Euler equation (eq. 10) in TWO ways: changing the discount rate AND adding the nonpecuniary term (1-beta)*KK(Z), which disappears when beta=1. The Phillips curve&amp;rsquo;s structure is otherwise unaffected because, in the symmetric equilibrium, all firms set the same price so the relative price equals one.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness/extensions are considered?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Capital ownership: the main analysis has firms own capital, but Online Appendices 1-2 show households-own-capital (rented competitively) is equivalent even under quasi-hyperbolic discounting. Geometric-discounting benchmark is explored fully in Online Appendix 4. Numerical accuracy (consumption-Euler residuals) is reported in the appendix. The authors also vary the markup elasticity epsilon and note that values of 6 or 21 gave implausible steady-state inflation, so they use epsilon=11. They report results across beta=gamma of 1.00, 0.99, 0.95, 0.90 under both discretion and the Taylor rule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from the closest prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Graham and Snower (2013) study a sticky-WAGE NK model where households prefer positive inflation because it erodes real wages over time, overturning the Friedman rule. This paper uses sticky PRICES (Rotemberg), firm-owned capital, and finds present bias LOWERS average inflation under optimal discretion. Maeda (2018) extends Krusell-Smith to a cash-in-advance monetary economy and recovers the Friedman rule via cash constraints. Most prior quasi-hyperbolic macro work (Krusell-Smith 2003, Maliar-Maliar, Krusell-Kuruscu-Smith 2002) focused on growth, consumption/saving, multiplicity, or income distribution — not monetary policy. This paper is distinctive in focusing on optimal discretionary monetary policy, quantifying the inflation bias, and identifying the asset-return implications and the welfare case for delegating to a present-biased central banker.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Monetary Policy, Firm Heterogeneity, and the Distribution of Investment Rates</title><link>https://macropaperwarehouse.com/papers/monetary-policy-firm-heterogeneity-and-the-distribution-of-investment-rates/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-firm-heterogeneity-and-the-distribution-of-investment-rates/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Investment is a sizable and the most volatile component of aggregate GDP, so understanding the investment channel of monetary policy matters for policymakers. Prior work has overwhelmingly studied the effect of monetary policy on the &lt;em&gt;average&lt;/em&gt; investment rate. But an estimated average effect can reflect either a uniform rightward shift of the entire distribution (all firms invest a bit more) or a change in the &lt;em&gt;shape&lt;/em&gt; of the distribution (a few firms invest a lot more). The paper asks: how does monetary policy reshape the cross-sectional distribution of firm investment rates, and what does that reveal about the frictions driving (heterogeneous) transmission?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and empirical strategy.&lt;/strong&gt; Quarterly firm-level data from Compustat, sample 1986Q1–2018Q4, U.S. nonfinancial firms (financial firms, foreign firms, and firms with incomplete/questionable data excluded). Firm age is merged from WorldScope and Jay Ritter&amp;rsquo;s database. Accounting capital stocks are converted to real economic capital via a Perpetual Inventory Method (building on Bachmann and Bayer 2014). The investment rate is real capital expenditures (CAPX) net of sales of property/plant/equipment (SPPE), deflated and divided by the lagged real capital stock. The firm-level data are aggregated into quarterly investment-rate distributions and moments. Identification uses monetary policy shocks from the Gertler and Karadi (2015) Proxy SVAR (re-extracted with updated VAR data and high-frequency instruments). Estimation is via two-step quantile/bin local projections (eq. 1), with quarter dummies for seasonality and Newey-West standard errors. Shocks are scaled to reduce the 1-year Treasury yield by 25 basis points (100bp in some distribution figures for readability). As a validity check, an expansionary shock produces hump-shaped increases in investment (peak 1.4%) and GDP (peak 0.35%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings (three facts).&lt;/strong&gt; Fact 1: An expansionary shock changes the shape of the distribution — fewer zero and small investment rates and more large ones. The 75th percentile responds significantly more than the 25th (the interquartile range rises significantly); the share of firms in bins [0,2) and [2,4) falls significantly while higher positive bins rise, most sizably in bin [28,infinity); negative investment rates are not meaningfully affected. The spike rate (share with investment rate &amp;gt;10%) rises and the inaction rate (|i|&amp;lt;0.5%) falls. Fact 2: These shape changes are more pronounced and statistically significant among young firms (defined as less than 15 years old) than old firms; spike rates rise more and inaction rates fall more for young firms. These effects persist even among firms unlikely to be financially constrained (low leverage, high liquidity, or dividend payers), arguing against a purely financial explanation. Fact 3: A decomposition (eq. 3) into extensive vs. intensive margins shows the extensive margin accounts for around 60% (intensive 40%) of the effect on the average investment rate, and around 60% (intensive 40%) of the &lt;em&gt;heterogeneous&lt;/em&gt; average effect across age groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and mechanism.&lt;/strong&gt; The authors build a general-equilibrium New Keynesian heterogeneous-firm model with fixed and convex capital adjustment costs, maintenance investment, and firm entry/exit (life cycles), in the spirit of Khan and Thomas (2008) and Winberry (2021). Calibrated to U.S. data (quarterly, beta=0.99), it replicates all three facts. Fixed costs generate lumpy investment and an extensive-margin channel: an interest-rate cut raises the discounted benefit of investing, inducing some firms to switch from inaction to a sizeable investment. Young firms are on average farther from their optimal capital (higher marginal product of capital under decreasing returns), so they are induced to invest more easily — generating heterogeneity &lt;em&gt;without any financial friction&lt;/em&gt;. This implies observational equivalence with the financial accelerator, but with opposite cyclicality: fixed costs imply &lt;em&gt;procyclical&lt;/em&gt; policy effectiveness, whereas financial acceleration implies &lt;em&gt;countercyclical&lt;/em&gt; effectiveness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate/policy implications.&lt;/strong&gt; Monetary policy is most effective when many firms are &amp;ldquo;close to paying the fixed cost.&amp;rdquo; The decline in business dynamism / firm aging since the 1980s has made monetary policy about 12% less effective at stimulating investment; policy is also less effective in recessions than booms (about 22% more effective in a large boom than a deep recession).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use exogenous monetary policy shocks from the Gertler and Karadi (2015) Proxy SVAR, re-extracted after updating both the VAR time-series data and the high-frequency (high-frequency surprise) instruments. These shocks are fed into two-step local projections: in the first step they construct time series of distributional objects (quantiles, interquartile range, the share of firms in each investment-rate bin, the spike rate, the inaction rate); in the second step (eq. 1) they regress the h-period change in each object on the shock, with calendar-quarter dummies to absorb seasonality and Newey-West standard errors for heteroskedasticity and autocorrelation. The validity check is that the shocks produce plausible hump-shaped aggregate responses (investment peak 1.4%, GDP peak 0.35%). The key threats are the standard ones for high-frequency-identified monetary shocks (the shock series being a valid instrument / external to the outcome) and the aggregation step; the paper does not run firm-level panel regressions with firm fixed effects here but instead works on aggregated distributional time series, so threats relate to the time-series identification of the GK shocks rather than firm-level confounding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two margins: the intensive margin (firms changing the size of investment conditional on adjusting) and the extensive margin (firms changing whether to invest at all). Empirically they are separated via the decomposition in equation (3), which classifies observations into spikes (i&amp;gt;10%) and normal (i&amp;lt;=10%) and writes the average rate as the spike fraction times the conditional spike rate plus the complementary term. The extensive-margin component isolates the change in the average rate coming only from changes in the spike rate; the intensive component isolates changes in conditional investment rates. Two covariance terms are dropped as negligible. The shape change in the distribution (fewer small, more very-large investments, negatives unaffected), plus the rising spike rate and falling inaction rate, are the empirical fingerprints of the extensive margin. The decomposition attributes about 60% of the average effect to the extensive margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Heterogeneity by firm age (young = less than 15 years old, old = 15+). Young firms show larger and more statistically significant shape changes (bigger drop in bin [0,2), bigger rise in bin [28,infinity)), larger spike-rate increases, and larger inaction-rate declines. The disproportionate right-tail (upper-quantile) response holds in both groups but is much more pronounced for young firms. The extensive margin explains roughly 60% of the young-vs-old gap in average effects. Appendix C reports similar but quantitatively weaker results when comparing small vs. large firms instead of young vs. old. The heterogeneous age effect survives within groups unlikely to be financially constrained (low leverage, high liquidity, dividend payers) and is also present among likely-constrained firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the model decompose the heterogeneous extensive-margin effect, and what is the &amp;lsquo;heterogeneous size effect&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using eq. (22), the heterogeneous extensive-margin effect splits into (i) a &amp;lsquo;heterogeneous hazard rate increase&amp;rsquo; — an interest-rate cut raises young firms&amp;rsquo; hazard (adjustment probability) more than old firms&amp;rsquo;, because young firms have a higher marginal product of capital and are farther from optimal size, so the discounted benefit of investing rises more for them; and (ii) a &amp;lsquo;heterogeneous size effect&amp;rsquo; — among new adjusters, young firms choose higher conditional investment rates than old firms, so there would be a heterogeneous average effect even if hazard rates rose identically. Both are quantitatively important.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role do the different adjustment costs play, and how is the model calibrated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model has fixed adjustment costs (random, uniform on [0, xi-bar]), convex adjustment costs (parameter phi), and maintenance investment (parameter chi). In isolation, the fixed cost generates 55% of the heterogeneous average effect and the convex cost only 29%, with the remaining 16% from their interaction (the heterogeneous size effect needs both: hazard changes require fixed costs, differing conditional rates require convex costs). Five parameters (sigma_z=0.07, k0=2.27, xi-bar=0.90, phi=2.20, chi=0.34) are fitted to five moments: standard deviation of investment rates (data 0.20 / model 0.18), average investment rate (0.12/0.13), autocorrelation of investment rates (0.38/0.38), relative size of entrants (0.29/0.29), and relative spike rate of old firms (0.40/0.40). Fixed parameters include beta=0.99, psi=0.58, theta=0.21, nu=0.64, delta=1.93% (giving a 7.7% annual aggregate investment rate), rho_z=0.95, pi_exit=1.625%, phi(Rotemberg)=90, gamma=10, Taylor inflation coefficient phi_pi=1.5, smoothing rho_r=0.75, external capital adjustment cost kappa=11.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What untargeted moments validate the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model reproduces (i) firm life-cycle profiles — average investment rate highest for newborns and falling with age, decomposed into frequency of adjustment (extensive) and conditional investment rate (intensive), both higher for young firms; (ii) plausible aggregate monetary-policy responses; and (iii) the interest-rate elasticity of aggregate investment. All three investment frictions are needed for the life-cycle profiles: fixed costs generate adjustment frequencies below one, convex costs keep young firms&amp;rsquo; conditional investment rates plausible (no instant jump to optimal size), and maintenance investment makes hazard rates decline with age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Robustness to alternative quantile choices (Figure A.1); alternative spike thresholds of 8% and 12% (Figure A.8); using the spike rate vs. hazard rate to identify extensive-margin adjustments in the model (Figure A.12, very similar results); replication of heterogeneous spike/inaction effects within groups unlikely to be financially constrained (Figure A.6) and within likely-constrained firms (Figure A.7); small-vs-large firm comparison (Appendix C); and comparison of extensive-margin contributions across different shocks (aggregate TFP, wage-markup) in Appendix E.4, showing the extensive-margin contribution can differ substantially when a shock directly affects adjustment costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the empirical investment-channel literature (Christiano et al. 2005; Gertler and Gilchrist 1994; Ottonello and Winberry 2020; Jeenas 2023; Cloyne et al. 2023) which focused on aggregate or average investment rates; its novelty is documenting effects on the &lt;em&gt;entire distribution&lt;/em&gt; and its moments. Against Cloyne et al. (2023), who interpret stronger young-firm responsiveness through the financial accelerator, this paper shows a non-financial friction (fixed adjustment costs) generates the same age heterogeneity — an observational-equivalence point — though it stresses its findings are &amp;lsquo;consistent with&amp;rsquo; and &amp;rsquo;not necessarily at odds with&amp;rsquo; the financial accelerator (the intensive margin, stronger among young firms, may reflect financial acceleration). On the lumpy-investment theory side it extends Khan and Thomas (2008), Winberry (2021), Koby and Wolf (2020), Reiter et al. (2013, 2020), Fang (2023) by adding firm life cycles. Relative to contemporaneous work by Lee (2023), which examines spike rates of small vs. large firms, this paper studies young vs. old firms and the entire distribution; relative to Gourio and Kashyap (2007), who study unconditional spike-rate cyclicality, this paper studies responses to monetary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Monetary policy stimulates aggregate investment mainly because a few firms switch from inaction to sizeable investment (extensive margin), not because many firms invest a little more. Effectiveness is state-dependent: it is higher when many firms are &amp;lsquo;close to paying the fixed cost&amp;rsquo; — i.e., in booms and in high-business-dynamism economies with many young, growing firms. Scope conditions/quantification: the post-1980s decline in business dynamism / firm aging has made policy about 12% less effective; the impact effect on aggregate investment is 1.44% in baseline, 1.61% (about 11.5% larger) under a high-dynamism calibration (13% entrant share, as in 1984) and 1.32% (about 8.5% smaller) under low dynamism (3.375% entrant share); policy is about 22% more effective in a large boom than a deep recession. Critically, the cyclicality direction differs from the financial accelerator: fixed costs imply &lt;em&gt;procyclical&lt;/em&gt; effectiveness, financial acceleration implies &lt;em&gt;countercyclical&lt;/em&gt; — a distinction that matters for policy and aligns with evidence (Tenreyro and Thwaites 2016) that policy is weaker in recessions. A key caveat from general equilibrium: a higher young-firm share does not automatically raise effectiveness, because higher investment demand raises the price of capital and crowds out investment; state dependence only arises when the price elasticity of aggregate investment is sufficiently low (as in their model).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and open questions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The extensive-margin channel cannot rationalize the entire young-old responsiveness gap — the intensive margin is also quantitatively relevant and may reflect financial acceleration. The roughly-60% extensive-margin share of the heterogeneous effect cannot be rationalized by the classical Bernanke-Gertler-Gilchrist (1999) financial accelerator, which operates on the intensive margin. The spike rate is used as an empirical proxy for the model&amp;rsquo;s unobservable hazard rate. The paper leaves open why young firms grow slowly, how the relevant frictions respond to economic policy, and how policy effects are shaped by these frictions, pointing to non-financial constraints like productivity/demand uncertainty (Jovanovic 1982; Chen et al. 2023) as further avenues.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Nonmonetary News in Fed Announcements: Evidence from the Corporate Bond Market</title><link>https://macropaperwarehouse.com/papers/nonmonetary-news-in-fed-announcements-evidence-from-the-corporate-bond-market/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/nonmonetary-news-in-fed-announcements-evidence-from-the-corporate-bond-market/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;When the Federal Reserve unexpectedly tightens policy, do riskier assets fall relative to safer ones (the standard prediction), or do investors read tightening as a signal that fundamentals are stronger than they believed, leading riskier assets to outperform? Smolyansky and Suarez answer this through the cross-section of the roughly $9 trillion U.S. corporate bond market, arguing it offers cleaner identification than survey-based evidence because asset prices already reflect all macro news just before an FOMC release—largely sidestepping the omitted-variable critique of Bauer and Swanson (2023) and Karnaukh and Vokata (2022).&lt;/p&gt;
&lt;p&gt;Data: transaction-level secondary-market trades from the regulatory version of TRACE (Aug 2002–May 2023), merged with Mergent FISD for bond characteristics. The sample covers 165 scheduled FOMC meetings and over 400,000 bond returns (Table 2 reports 474,771) across roughly 35,000 unique fixed-coupon, USD, U.S.-issuer bonds with 2–30 years to maturity. Monetary policy surprises are measured following Hanson and Stein (2015) as the change in the 2-year nominal Treasury yield over a t-1 to t+1 window, capturing both current-rate surprises and forward guidance. Credit risk is the average S&amp;amp;P/Moody&amp;rsquo;s/Fitch rating mapped to a 1–21 notch scale. The key regression interacts the 2-year yield change with the bond&amp;rsquo;s credit rating, with meeting-by-years-to-maturity, meeting-by-SIC2-industry, and meeting-by-callability fixed effects, so it compares same-maturity bonds differing only in credit risk. Standard errors are two-way clustered by meeting and firm.&lt;/p&gt;
&lt;p&gt;Main finding: the interaction coefficient is positive (~0.2). For a hypothetical 100 bp rise in the 2-year yield, a one-notch worse rating (e.g., BBB to BBB-) is associated with a 0.2 percent higher return—riskier bonds outperform after surprise tightening. Expressed as spreads: for a 25 bp surprise rise, two bonds 10 notches apart (AA+ vs BB, average duration ~5) see the BB-AA+ spread narrow by about 10 bps. The authors call this magnitude &amp;ldquo;moderately sized,&amp;rdquo; noting it is the net effect after standard monetary and reaching-for-yield forces that push the other way.&lt;/p&gt;
&lt;p&gt;The result is driven by the forward-guidance component, not current-rate surprises. Decomposing the 2-year change into a current fed-funds surprise and the 2-year-minus-fed-funds spread, only the spread (medium-term path) matters; the fed-funds coefficient is insignificant and oppositely signed. Riskier bonds also outperform when 1- and 2-year forward rates rise, when the 10-year-minus-2-year curve steepens, and following rises in both the 2-year real (TIPS) rate and breakeven inflation, suggesting non-monetary news reflects both outlook and risk-premia/risk-distribution news.&lt;/p&gt;
&lt;p&gt;Sub-period: the effect is stronger pre-pandemic (~0.3, Aug 2002–Dec 2019) and statistically insignificant post-pandemic (Jan 2020–May 2023), plausibly because the aggressive 2022 anti-inflation tightening let standard monetary effects dominate. Results are stable excluding/isolating the 2008-09 crisis. Following Cieslak-Schrimpf and Jarocinski-Karadi, essentially all of the baseline effect comes from meetings where stock returns and Treasury yields move in the same direction (about one third of observations), the signature of non-monetary news. Policy implication: FOMC communications—especially forward guidance—transmit substantial non-monetary information, complicating the read of asset-price reactions to policy.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy exploits the cross-section of corporate bond returns around FOMC announcements rather than time-series or survey responses. The regression interacts the 2-year Treasury yield change with a bond&amp;rsquo;s credit rating, saturated with meeting-by-years-to-maturity, meeting-by-industry (SIC2), and meeting-by-callability fixed effects, so identification comes from comparing same-maturity, same-industry, same-callability bonds that differ only in credit risk on a given meeting day. A positive interaction (riskier bonds outperform after tightening) is the opposite of what pure monetary/reaching-for-yield channels predict, so it isolates non-monetary news. The central threat the authors address is omitted-variable bias (Bauer-Swanson): they argue asset prices already embed incoming macro news just before the FOMC release, so a short event window around the announcement largely neutralizes this. A second threat is a &amp;lsquo;coupon/duration effect&amp;rsquo;—higher-coupon bonds have lower duration and price sensitivity—addressed in Table 3 columns 2-3. A third is illiquidity/stale prices, addressed by using actual TRACE trade prices and liquidity-based robustness tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two opposing forces: (1) standard monetary news plus reaching-for-yield, under which tightening raises default/discount-rate risk and risk compensation, making riskier bonds underperform (predicting a negative coefficient); (2) non-monetary news, under which tightening signals a stronger outlook or a more favorable distribution of risks, making riskier bonds—more sensitive to economic strength and risk premia—outperform (positive coefficient). The estimated positive coefficient shows non-monetary news dominates on net. The authors further attribute non-monetary news to forward guidance: decomposing the 2-year yield into a current fed-funds surprise and the 2-year-minus-fed-funds spread shows only the spread drives results (fed-funds coefficient insignificant, wrong sign). They cannot fully separate &amp;rsquo;expected outlook&amp;rsquo; news from &amp;lsquo;risk premia/distribution-of-risks&amp;rsquo; news (they note these are likely highly correlated), but provide suggestive evidence both operate: yield-curve steepening (10y-2y) and breakeven inflation also predict riskier-bond outperformance, and the curve/risk channel points to risk-premia effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across sub-periods?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The effect is stronger in the pre-pandemic sample (Aug 2002–Dec 2019), with a coefficient of about 0.3 versus 0.2 for the full sample. It is not statistically significant in the post-pandemic period (Jan 2020–May 2023), which the authors attribute to early-pandemic turbulence and the aggressive 2022 tightening cycle, where standard policy-tightening effects likely overwhelm any non-monetary component. Results are stable when excluding the 2008-09 financial crisis (Jul 2008–Jun 2009), when restricting to pre-July 2008, and when restricting to the post-crisis pre-pandemic window (Jul 2009–Dec 2019), indicating the non-monetary effect is present across different economic environments and FOMC communication regimes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Coupon/duration: controlling for coupon rate interacted with meeting-by-maturity fixed effects, and &amp;lsquo;duration-adjusting&amp;rsquo; returns by subtracting a synthetic risk-free security&amp;rsquo;s return—results unchanged. (2) Liquidity: using only disseminated trades excluding agency/interdealer trades and trades under $100,000, and WLS weighted by each bond&amp;rsquo;s dollar volume—coefficients roughly unchanged and significant. (3) Alternative credit-risk measure: a market-based &amp;rsquo;log discount&amp;rsquo; (log price gap between a synthetic Treasury with the same cash flows and the corporate bond); a one-percentage-point larger discount is associated with ~0.1 percent higher return per 100 bp rise. (4) High-frequency window (15 min before to 45 min after): using 6- and 8-quarter Eurodollar futures and 2-year yields—same sign, somewhat smaller, with 2-year significant at 10%. (5) Online Appendix: bond fixed effects, excluding lowest-rated bonds, symmetry of rises vs cuts, extended return windows (up to 25 trading days), unscheduled meetings, and a CDS reconciliation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the Fed-information-effect literature (Campbell et al. 2012; Nakamura-Steinsson 2018) and identification via stock-yield comovement (Cieslak-Schrimpf 2019; Jarocinski-Karadi 2020), but responds to the omitted-variable critique (Bauer-Swanson 2023; Karnaukh-Vokata 2022) by using asset prices on tight windows. Versus Guo, Kontonikas, and Maio (2020), who find lower-rated bond indices underperform after tightening: differences are the sample start (2002 vs 1989, since FOMC issued post-meeting statements only after mid-1999) and frequency (transaction-level daily event study vs monthly indices); the authors show extending the window 3+ weeks (when FOMC Minutes are released) can flip the sign toward Guo et al. Versus Palazzo and Yamarthy (2022), who find CDS spreads of riskier firms widen after tightening: reconciled by showing the CDS reaction is driven by the pure monetary component while the corporate bond reaction is driven by non-monetary news, with CDS-bond basis volatility (Bai and Collin-Dufresne 2019) explaining divergence. Versus Anderson and Cesa-Bianchi (2024), Gertler-Karadi (2015), and others using only current fed-funds shocks: this paper emphasizes forward guidance, and notes Gertler-Karadi&amp;rsquo;s results may reflect their earlier, more pre-1999-tilted sample. It complements Golez and Matthies (2023), who use S&amp;amp;P 500 dividend strips.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;FOMC announcements—particularly the forward-guidance/expected-path component rather than current-rate decisions—convey substantial non-monetary information about the economic outlook and the distribution of risks. This matters for monetary policy transmission and communication design, and means asset-price reactions to FOMC news cannot be read as purely monetary. Scope conditions: results are concentrated in the pre-pandemic period and in meetings where stocks and yields comove (about one third of observations); they weaken or vanish when standard monetary effects dominate (e.g., the 2022 tightening). The authors stress this does not mean monetary news is unimportant, only that it is not always the dominant news type in all markets. They also note non-monetary effects are likely more detectable in recent samples given longer FOMC statements (late 1990s) and press conferences (2010s).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Does the outperformance reflect more than just risk premia?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors argue it is unlikely to be entirely risk-premia driven. In the Online Appendix (Table A11), following a surprise tightening the relative default rate of riskier versus less-risky bonds decreases the subsequent quarter, indicating that unexpected tightening provides a genuine positive signal about the expected credit outlook—an outlook channel, not only a risk-premia channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why use a two-day (t-1 to t+1) window and the 2-year yield?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The 2-year nominal yield (Hanson-Stein 2015) captures both current fed-funds surprises and forward guidance over the next several quarters. The t-1 to t+1 window is used because the market may not incorporate the full information content instantaneously (Gurkaynak-Sack-Swanson 2005; press conferences from 2011 add post-statement information), because illiquid corporate bonds may not trade late on day t, and because it lets the same window measure both Treasury and corporate bond reactions. Robustness uses a high-frequency 15-min-before to 45-min-after window.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;placeholder&lt;/strong&gt;: placeholder&lt;/p&gt;</description></item><item><title>Nonresponse Bias in Household Inflation Expectations Surveys</title><link>https://macropaperwarehouse.com/papers/nonresponse-bias-in-household-inflation-expectations-surveys/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/nonresponse-bias-in-household-inflation-expectations-surveys/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Inflation expectations measured from household surveys are central inputs to monetary policy, but roughly half of respondents to the RBNZ Household Inflation Expectations survey decline to answer the quantitative inflation-expectations question. Because these item non-responses are not random across demographic groups, aggregate and subgroup measures derived only from those who answer can be systematically biased. The paper quantifies that non-response bias and proposes a simple, operational method to correct aggregate and subgroup inflation-expectation indices and disagreement measures.&lt;/p&gt;
&lt;p&gt;Data and strategy: Micro-data from the RBNZ Household Inflation Expectations survey, quarterly, achieving about 1,000 household responses per wave, covering 1998Q2 to 2022Q4 with 89,834 individual responses treated as repeated cross-sections. The focal question asks the expected annual rate of inflation/deflation over the next 12 months. The survey switched from telephone to online mode starting 2018Q3. Outliers are removed using a 1.5xIQR rule (excluding 4,535 observations in the baseline). The empirical approach has three steps: (1) Probit models of the probability of responding on demographics (gender, age, region, ethnicity, income, employment) plus macro controls (lagged inflation and its square, a year trend, seasonal dummies, an online-mode dummy); (2) a Heckman sample selection model (selection equation = the baseline Probit extended with online-mode interactions; outcome equation = inflation-expectation bias regression) with four exclusion restrictions dropped from the outcome equation (region, employment, year trend, lagged inflation squared); (3) a regression-on-quarter-dummies index that adds the inverse Mills ratio to deliver bias-adjusted average and dispersion series. Estimates use survey weights, extending Heckman estimators to weighted form.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: Item non-responses average about 44% over the full sample, falling to about 24% after the move to online mode. Non-responses artificially raise average one-year-ahead inflation expectations by about 0.3 percentage points; the average selection adjustment is -0.288 over the full sample, ranging from -0.385 (2018Q1) to -0.138 (2022Q3). Females are about 20% less likely to respond than men; older, employed, higher-income individuals respond more; Maori and Pacific Islanders respond less. Online mode raises response probability by about 33%. Response rates rise non-linearly with lagged inflation: moving from 2% to 7% raises average response probability by about 12%, while it barely changes over the 0-4% range, with the slope turning steeply positive in the 5-7% range. There is a downward trend in response of about 1% more item non-response per year. The online switch narrowed the female-male response gap from 24.4% (telephone) to 5.5% (online) and rendered most ethnicity gaps insignificant. In the bias (outcome) regressions without selection (weighted), respondents over 25 show bias more than 0.23 pp above the under-25 base; Pacific Islanders 0.34 pp, Maori 0.15 pp, Asians 0.12 pp above the base ethnic group. After the Heckman correction, gender, ethnicity, and income differences become insignificant or shrink substantially, while age effects strengthen (older respondents over-predict; under the two-step estimator, bias for those over 35 is more than double the no-selection estimate). The online dummy in the outcome equation lowers predicted expectations by more than 2.27 pp (interpreted cautiously, as it also captures large 2020Q3-onward negative biases).&lt;/p&gt;
&lt;p&gt;Implications: Survey weights correct unit non-response but not item non-response, so published aggregates overstate expectations by ~0.3 pp. The correction lowers all subgroup means, decreases cross-subgroup disagreement for gender/income/ethnicity (increases it across age), and generally decreases within-subgroup dispersion. Correcting also makes the household-vs-professional-forecaster intercept gap statistically insignificant. Policy: online survey modes and inclusive, layered communication (especially during high-inflation periods of greater public attention) can reduce measurement error.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on a Heckman sample selection model. A Probit selection equation models the probability of answering the inflation-expectations question; its predicted probabilities yield the inverse Mills ratio, added to the outcome (bias) regression to correct for selection-as-omitted-variable bias. Identification is sharpened by exclusion restrictions: four variables (region, employment status, year trend, lagged inflation squared) enter the selection equation but are dropped from the outcome equation. The authors justify these because region and employment were found statistically insignificant in the outcome equation, and year trend and lagged inflation squared induced collinearity/variance inflation. The selection equation also includes online-mode interaction terms to better identify heterogeneity in response rates. Threats: the validity of the exclusion restrictions (the assumption that these variables affect participation but not the level of expectations bias) and the known sensitivity of the full-information ML Heckman estimator to collinearity; the authors address the latter by also reporting the two-step estimator.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms drive non-response. First, demographic propensity: young, female, low-income, and minority-ethnicity (Maori, Pacific Islander, Asian) respondents are less likely to answer, documented via Probit average partial effects. Second, state dependence on the inflation environment: response rates rise non-linearly when lagged inflation moves away from the target range (steeply positive slope at 5-7%), consistent with a &amp;lsquo;rational inattention&amp;rsquo; interpretation where agents notice inflation only when it becomes salient, and with the finding that inflation uncertainty co-moves with the inflation level (Binder, 2017). The authors also test whether non-response reflects lack of understanding using a 2018Q3-2021Q4 sub-question: only 5% of respondents indicated not understanding inflation, so 81% of non-responses are not due to lack of understanding, pointing instead to factors like cultural norms/uncertainty rather than literacy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Response heterogeneity: females respond ~20% less than males; response probability rises with age; Maori and Pacific Islanders respond markedly less; higher income and employment raise response; households with dependent children and non-freehold owners respond less; being the main grocery shopper slightly lowers response. Bias heterogeneity before correction: age, ethnicity (Pacific Islanders 0.34 pp, Maori 0.15 pp, Asian 0.12 pp), and income show differences. After Heckman correction, gender, ethnicity, and income differences become insignificant or shrink substantially, while age effects strengthen (older respondents over-predict inflation, with an upward-sloping age profile). Online mode reduces demographic gaps: the female-male response gap fell from 24.4% to 5.5%, and most ethnicity gaps became insignificant online.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Four Probit specifications with progressively richer covariates (occupation, grocery shopping, dependent children, home ownership) across sub-periods, with baseline effects stable. (2) Two Heckman estimators, two-step and ML, mostly consistent (the main divergence is gender, insignificant under two-step). (3) Comparison against random imputation, which reproduces the distorted no-selection picture. (4) Six outlier-detection rules (fixed -2/15 interval, 1.5xIQR, 3xIQR, hybrid IQR, top/bottom 5% by quarter, top/bottom 5% overall): Probit estimates are insensitive to the outlier definition. (5) A separate Probit on outlier responses shows similar demographic patterns (low-income young minority females give more outlier responses) but with differing magnitudes and trend/inflation effects, indicating outlier responses and non-responses are related but distinct. (6) An Appendix-E forward-looking Phillips curve exercise where adjusted subgroup expectations are always preferred to unadjusted.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the heterogeneity-of-expectations literature (Bruine de Bruin et al. 2010; Pfajfar and Santoro 2010; Malmendier and Nagel 2016; D&amp;rsquo;Acunto et al. 2023) documenting demographic differences in expectations, and on studies finding non-response from young/female/low-income groups (Blanchflower and MacCoille 2009; Leung 2009). Its distinctive contribution is showing that part of the observed gender/ethnicity/income differences in expectations is an artifact of non-response (selection) rather than true belief differences, and proposing an operational correction. Unlike imputation methods (e.g., the US Michigan Survey&amp;rsquo;s distribution-based imputation), the Heckman approach accounts for the socio-demographic composition of responders. Unlike methods requiring randomized incentives or special survey-design features (McGovern et al. 2018; Comerford 2023), it works on long-running repeated cross-sections lacking such features. It differs from attrition-focused work (Burgi 2023) by addressing item non-response in repeated cross-sections rather than panel attrition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, because survey weights correct only unit non-response, published aggregates overstate expectations by ~0.3 pp; central banks should apply an item-non-response correction. Second, response engagement rises when inflation deviates from target, so central banks could leverage high-inflation periods of elevated public attention for broader communication beyond financial-market audiences, using layered messaging. Third, moving surveys online substantially reduces non-response bias and improves representativeness, but requires ensuring digital accessibility to avoid new selection bias. Scope conditions: the non-linear inflation-response relationship is based on few episodes of out-of-range inflation, possibly confounded by Covid/recessions, so it should be interpreted with caution; the large online-mode coefficient on expectations also captures the post-2020Q3 negative biases from sluggish expectation adjustment; and RBNZ owns the survey and could change methodology accordingly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the adjusted index constructed operationally, and why is it attractive?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Average expectations are obtained by regressing micro inflation-expectations on quarter dummies (WLS); adding the inverse Mills ratio from the baseline Probit as an extra regressor yields the bias-adjusted average. Subgroup indices interact subgroup dummies with time dummies; an adjusted disagreement (dispersion) measure replaces the dependent variable with squared deviations from the quarterly mean. The approach is attractive operationally because updating each quarter only requires a new inverse Mills ratio from the pre-fitted, relatively stable Probit model, so the adjustment is unlikely to undergo severe revisions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the comparison with professional forecasters show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regressing one-year-ahead Survey of Professional Forecasters expectations on household expectations, the unadjusted household series gives a negative, significant intercept (-0.294, confirming households&amp;rsquo; upward divergence), but using the adjusted household average makes the intercept insignificant (-0.019), suggesting the household-professional gap is partly a non-response artifact. The slope remains below one (0.759 unadjusted, 0.740 adjusted), consistent with Carroll (2003), so household expectations still do not scale one-to-one with professional forecasters.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Policy transition risk, carbon premiums, and asset prices</title><link>https://macropaperwarehouse.com/papers/policy-transition-risk-carbon-premiums-and-asset-prices/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/policy-transition-risk-carbon-premiums-and-asset-prices/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Central bankers, regulators, and investors increasingly worry about climate &amp;ldquo;transition risks&amp;rdquo; — abrupt shifts in climate policy, green technology breakthroughs, or consumer-preference shifts that re-price assets (Carney&amp;rsquo;s &amp;ldquo;tragedy of the horizon&amp;rdquo;). Rather than use the fixed NGFS-style stress-test scenarios, the authors ask how &lt;em&gt;policy transition risk&lt;/em&gt; — modeled as stochastic, reversible jumps between climate-policy regimes — endogenously affects carbon pricing, asset prices, risk premiums, the risk-free rate, and the speed of the green transition.&lt;/p&gt;
&lt;p&gt;Model setup: A global two-sector continuous-time DSGE macro-finance model of the climate and economy (building on Hambel, Kraft, van der Ploeg 2024). Two sectors produce perfectly substitutable final goods via Cobb-Douglas in capital and a CES energy composite of fossil fuel and renewables; sector 1 is &amp;ldquo;green&amp;rdquo; (renewables-intensive) and sector 2 is &amp;ldquo;brown&amp;rdquo; (fossil-intensive). Investment carries quadratic intertemporal adjustment costs and brown-to-green capital reallocation carries quadratic intrasectoral costs (a dollar of brown converts to less than a dollar of green). Temperature rises in cumulative emissions (TCRE specification). Households have Epstein-Zin recursive preferences; dividends are leveraged consumption (D=C^phi, phi&amp;gt;1). Capital is exposed to Brownian shocks plus Barro-style macro-disaster jumps; learning-by-doing lowers renewable costs. The core model has a two-state policy Markov chain — BAU (no carbon pricing) and CAP (carbon pricing internalizing damages and enforcing a Tcap=2C cap; if the cap is breached, fossil use is forced to zero). Policy tips with transition intensity calibrated at lambda_x = 4% per year from BAU to CAP. Model solved by finite differences; 20,000 simulated paths to 2100. Calibration: RRA gamma=2.977, EIS psi=1.5, time preference delta=0.0346, initial GDP $116tn, initial brown-capital share S0=0.876, TCRE=1.8 C/TtC, T0=1.27C.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: (1) Under pure BAU, the green transition is slow and temperatures reach on average 3.9C above pre-industrial by 2100; risk-free rate and risk premiums are almost unaffected (TFP damage alone cannot generate a temperature premium). (2) With policy transition risk, by 2100 about 28% of paths stay below 1.8C, 46% land between 1.8C and 2.5C, and the rest exceed 2.5C; roughly 45% of paths adhere to the 2C cap; 94% of paths have active climate policy by 2100. On the illustrative path tipping to CAP in 2045, a carbon price of &lt;del&gt;$700/tC (&lt;/del&gt;$190/tCO2) is imposed; the green share price jumps +22% and the brown price drops -21.5% on impact. In the ~4% of paths where CAP is adopted in 2021, the carbon tax starts at ~$218/tC ($60/tCO2), about 50% larger than Pigouvian pricing without an enforced cap — because the cap forces policymakers to catch up. (3) The model generates a sizable, positive carbon premium (brown minus green risk premium) that is initially near zero but becomes large when temperature is close to or above the 2C cap and the economy is still carbon-intensive; the dominant channel is the asymmetric temperature-shock impact on the brown sector&amp;rsquo;s price-dividend ratio (third term of eq. 3.4). Without transition risk (first-best Pigouvian pricing), the carbon premium is slightly negative. (4) The mean risk-free rate starts at 0.8% and is largely stable, but its lower quantile falls sharply when temperature approaches/exceeds the cap as precautionary saving rises. (5) Extensions table: in the pure PIGOU scenario (no cap, no transition risk) climate disasters roughly double the optimal carbon tax from $45/tCO2 (2025) to $91, and adding irreversible climate tipping raises it to $121; in the core BAU-&amp;gt;CAP model the average optimal CO2 tax rises from $73 to $108 (disasters) to $134 (tipping). News effects on share prices are far larger for policy tips than for climate or technology tips (climate tipping events move prices ~3-5%; a BAU-&amp;gt;CAP tip moves the brown price ~-27% and brown price-dividend ratio ~-13%, green price +18%, green PDR +42%).&lt;/p&gt;
&lt;p&gt;Implications: Policy transition risk makes average policy more ambitious than BAU but less than first-best; it produces risk-driven carbon premiums that accelerate the green transition, raises precautionary saving, and depresses the risk-free rate near the cap. Physical risks alone (assumed symmetric across sectors) cannot generate a sizable carbon premium but do raise carbon prices and create a temperature risk premium on all assets.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy, and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a calibrated structural (DSGE) model, not an empirical identification design, so &amp;lsquo;identification&amp;rsquo; here means the model mechanism that generates carbon premiums plus calibration to external sources. The carbon premium is generated purely endogenously by making the brown sector more fossil-/carbon-intensive than the green sector, with physical risks assumed to load symmetrically on both capital stocks so any premium asymmetry comes from policy transition risk and temperature exposure rather than from differential physical-risk loadings. The main threats the authors acknowledge are: (i) calibration choices for negative-emissions cost curves and transition probabilities are &amp;rsquo;tentative&amp;rsquo; and partly curve-fit/ad hoc; (ii) exogenous and stark policy states (two or three regimes with given/partly exogenous transition intensities) are a simplified representation of the political process; (iii) global-economy calibration sits uneasily with national-election interpretations of policy tipping. They argue the forward-looking households/firms make the model robust to the Lucas critique.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms, and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three channels for the carbon premium appear in equation (3.4): (1) a stochastic-discount-factor/transition-shock term scaling in transition intensities lambda_x; (2) a diffusive term from the volatility of the brown-capital share affecting the brown price-dividend ratio more (largest when S(1-S) is high, i.e. share neither very high nor very low) and from higher consumption-capital-ratio volatility in the brown sector combined with leverage; (3) a temperature-shock term that becomes large near 2C because the policy transition to CAP becomes potentially devastating (forced phase-out of fossil fuel) and hits the brown PDR much more than the green PDR. The authors state the third (temperature-near-cap) effect is quantitatively the most important. The premium is risk-driven, distinguished from preference-driven mechanisms (Pastor et al. 2021; Pedersen et al. 2021; Zerbib 2022) in which green investors accept lower returns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Heterogeneity is across states and paths rather than across firms in data. The carbon premium and risk-free-rate response depend nonlinearly on temperature (large near/above 2C) and on the brown-capital share S (large transition effect when S is high). Across simulated paths the outcomes diverge widely: ~28% below 1.8C, ~46% between 1.8C and 2.5C, the rest above 2.5C by 2100. The price impact of news differs sharply by type: policy tips dominate climate tips and technology tips. The risk-free rate&amp;rsquo;s lower quantile falls much more in high-temperature paths.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and extensions are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Extensions: (a) recurring temperature-dependent climate disasters (intensity rising linearly in T, lambda_c-hat=0.096, lambda_c(T0)=0.122, expected loss 1.5% vs 25% for macro disasters, alpha_c=65.7); (b) irreversible climate tipping via a 3-state chain raising TCRE from 1.8 to 2.1 to 2.4 C/TtC and adding permanent damages d=0,0.025,0.05; (c) a negative-emissions/technology-breakthrough state (2-state chain, ~50% chance of competitive technology by 2050, intensity 0.0224, cost curve fit to Rebonato et al. 2023); (d) a richer 3-state policy chain BAU/PIGOU/CAP with reversible and partly endogenous transition probabilities (switch to active policy rising toward 75% if T&amp;gt;1.5C; lobbying makes switches depend on brown/green capital shares), giving an 18-state (2x3x3) Markov chain. Core qualitative results (positive carbon premium driven by policy risk near the cap, precautionary saving lowering the risk-free rate) survive all extensions; the carbon premium is smaller in the 3-state model because only ~30% of paths reach CAP. A model variant with exhaustible fossil resources (cap 3000 GtC) found the exhaustibility constraint non-binding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends Hambel et al. (2024), which used a two-sector economy for climate disasters/tipping and first-best carbon prices but did not study policy transition risk or carbon premiums. It builds general-equilibrium structure on the partial-equilibrium reduced-form insights of Hsu et al. (2023) on the pollution premium (who report a 4.42% annual pollution premium). It is most closely related to Barnett (2024), also a DSGE transition-risk model, but adds richer interactions among climate tipping, political risk, and technology breakthrough, imperfect energy substitution, and intrasectoral adjustment costs; Barnett instead emphasizes a climate-policy-driven &amp;lsquo;run on fossil fuel&amp;rsquo;. It provides a risk-based mechanism for the carbon premium documented empirically by Bolton and Kacperczyk (2021, 2023) and Hsu et al. (2023), while noting contrary evidence (Pastor et al. 2021; Bauer et al. 2022; Aswani et al. 2024; Zhang 2025 — who finds the premium turns negative in the U.S. after a data-lag correction; Hambel and van der Sanden 2024). Calibration of policy scenarios follows Moore et al. (2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under policy transition risk, average climate policy is more ambitious than BAU but less ambitious than first-best; policymakers may set carbon taxes even higher than first-best to &amp;lsquo;catch up&amp;rsquo; for time lost by predecessors when the economy is close to the temperature cap. Carbon premiums encourage firms to shift investment from brown to green and accelerate the transition. Scope conditions: carbon premiums are large only when the economy is still carbon-intensive (high brown-capital share) AND temperature is near or above the 2C cap; if policymakers implement first-best Pigouvian taxes while ignoring transition risk, the carbon premium is slightly negative. Physical-risk symmetry across sectors is assumed; if physical risk hit sectors differently there would be additional carbon-premium effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What happens to asset prices at the moment of each type of tipping?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At a tip to more ambitious carbon pricing, green share prices rise and brown share prices fall (and conversely when policy weakens). At a climate tip, both green and brown share prices fall (~3-5% each in the illustrative path). When negative-emissions technology becomes available, green prices jump down and brown prices jump up while the carbon price falls (because the brown sector may use fossil fuel again). The brown asset becomes worthless once the transition completes and the brown capital stock is run down; partial stranding occurs when the cap is crossed and fossil use is banned. News effects on prices are much larger for policy than for climate or technology tipping.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What drives the risk-free rate dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The risk-free rate (eq. 3.2) combines discounting, consumption-smoothing, standard diffusion and macro-disaster precautionary saving, an uninsurable temperature-risk term (small because consumption volatility is close to capital volatility, and it vanishes under CRRA), and a novel policy-transition-risk term that makes the rate jump with the policy state. Increased transition risk raises precautionary saving and lowers the rate, especially when temperature is close to its cap (where forced fossil phase-out makes expected consumption growth drop). As the transition completes and brown capital shrinks, precautionary saving falls and the rate stabilizes. Mean rate ~0.8%, stable; lower quantile falls over time.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Policy transition risk&lt;/strong&gt;: In this paper, the risk arising from stochastic, reversible jumps between discrete climate-policy regimes (no / modest / ambitious carbon pricing), modeled as a Markov chain with given or partly endogenous transition intensities — distinct from fixed NGFS-style scenarios. Financial markets price these regime-change risks even in the BAU state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon premium&lt;/strong&gt;: Defined as the difference between the brown and green risk premiums (r^p_2 minus r^p_1). In the model it is a purely risk-driven, endogenous object arising because policy/temperature shocks hit the carbon-intensive brown sector&amp;rsquo;s price-dividend ratio more than the green sector&amp;rsquo;s; it is large near the temperature cap and slightly negative under first-best pricing without transition risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CAP policy state&lt;/strong&gt;: The &amp;lsquo;ambitious carbon pricing&amp;rsquo; regime in which policymakers set the carbon tax to internalize warming damages AND enforce a hard temperature cap Tcap=2C; if the cap is breached, fossil-fuel use is forced to zero (F1=F2=0) and carbon prices exceed the usual social cost of carbon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PIGOU policy state&lt;/strong&gt;: The &amp;lsquo;modest carbon pricing&amp;rsquo; regime (added in the extended 3-state chain) that internalizes all global-warming externalities, including risks of climate disasters and tipping, but does NOT impose a temperature cap — yielding lower carbon taxes than CAP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TCRE (transient climate response to cumulative emissions)&lt;/strong&gt;: The proportionality coefficient (theta/vartheta) translating cumulative net emissions into temperature change; calibrated at 1.8 C/TtC in the core model and allowed to jump irreversibly to 2.1 and 2.4 C/TtC under climate tipping.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Temperature/transition risk premium&lt;/strong&gt;: A positive risk premium carried by all risky assets stemming from physical climate risk (disasters and tipping) that rises with the level of temperature; distinct from the carbon premium, which is the brown-minus-green differential and is driven mainly by asymmetric policy-transition exposure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial asset stranding&lt;/strong&gt;: The situation when the temperature cap is crossed and fossil fuel may no longer be burned, so the brown sector — though still operable with renewables — loses the value of its fossil-based capital, causing the brown share price to fall.&lt;/p&gt;</description></item><item><title>Precautionary Saving against Correlation under Risk and Ambiguity</title><link>https://macropaperwarehouse.com/papers/precautionary-saving-against-correlation-under-risk-and-ambiguity/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/precautionary-saving-against-correlation-under-risk-and-ambiguity/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: How much to save is a central household financial decision, and uncertainty drives the &amp;ldquo;precautionary saving motive.&amp;rdquo; The precautionary-saving literature has mostly studied one-dimensional (single-attribute) risk, yet households face multidimensional risk: both wealth and health conditions matter for saving. Because wealth and health are plausibly related, the authors argue the correlation between two risky attributes should be incorporated into precautionary-saving analysis. They further note that correlation between two attributes is harder to quantify than a single attribute&amp;rsquo;s risk (less experience, fewer observations), so they also introduce ambiguity about the correlation. The paper&amp;rsquo;s purpose is to characterize how the correlation between two risky attributes (wealth and health) affects optimal savings under multivariate preferences, both when correlation is known (risk) and when it is ambiguous.&lt;/p&gt;
&lt;p&gt;Model setup: A purely theoretical two-date model (t=0, t=1). The individual has time-separable lifetime utility from a bivariate utility function u(x,y) over wealth x and health y, increasing and concave in both (u^(1,0)&amp;gt;=0, u^(0,1)&amp;gt;=0, u^(2,0)&amp;lt;=0, u^(0,2)&amp;lt;=0); the sign of the cross derivative u^(1,1) is left unrestricted. The risk-free interest rate is zero and there is no time discounting, so the analysis isolates the effect of risk on saving. At t=1 the individual faces &amp;ldquo;good&amp;rdquo; and &amp;ldquo;bad&amp;rdquo; income risks (epsilon_G, epsilon_B occurring with probabilities 1-p, p) and &amp;ldquo;good&amp;rdquo;/&amp;ldquo;bad&amp;rdquo; health risks (delta_G, delta_B with probabilities 1-q, q), all four mutually independent. Correlation between income and health risk is captured by a parameter k: the probability of simultaneous bad income and bad health is kpq. When k=1 the risks are independent (joint probability = pq); k&amp;gt;1 (k&amp;lt;1) indicates positive (negative) correlation; correlation increases in k. The individual chooses saving s to maximize lifetime utility (equation 1). &amp;ldquo;Good&amp;rdquo; vs &amp;ldquo;bad&amp;rdquo; risks are ranked by stochastic dominance (FSD, Nth-order NSD, and Ekern&amp;rsquo;s Nth-degree risk increase).&lt;/p&gt;
&lt;p&gt;Main findings (theoretical propositions, no estimated magnitudes): (1) Proposition 1 — when income risk is ranked by Nth-order and health risk by Mth-order stochastic dominance, optimal savings increase (decrease) in correlation k if (-1)^(n+m) u^(n+1,m)(x,y) &amp;gt;= (&amp;lt;=) 0 for n=1..N, m=1..M. This condition defines &amp;ldquo;mixed correlation aversion (seeking).&amp;rdquo; In the special case N=M=1, optimal savings increase in k if u^(2,1)&amp;gt;=0, i.e., the individual is &amp;ldquo;cross prudent&amp;rdquo; (decrease if cross imprudent, u^(2,1)&amp;lt;=0). Intuition: cross-prudent individuals dislike the simultaneous occurrence of bad income and bad health, which becomes more likely as k rises, so they save more. (2) Proposition 2 (ambiguous correlation, smooth ambiguity model of Klibanoff et al. 2005, 2009) — if the second-order utility phi exhibits decreasing absolute ambiguity aversion (DAAA) and u exhibits mixed correlation aversion or seeking, then ambiguous correlation raises the optimal amount of savings relative to the risky benchmark with correlation k_O = sum q_theta k_theta. The result combines a &amp;ldquo;timing of uncertainty effect&amp;rdquo; (governed by beta(s_O)&amp;gt;=1 iff phi exhibits DAAA) and the sign of a covariance term. (3) Proposition 3 extends the same result to Nth-/Mth-degree risk increases: under DAAA and (-1)^(N+M) u^(N,M)&amp;gt;=(&amp;lt;=)0 and (-1)^(N+M) u^(N+1,M)&amp;gt;=(&amp;lt;=)0, ambiguous correlation raises savings.&lt;/p&gt;
&lt;p&gt;Implications: Whether correlation raises or lowers precautionary saving depends entirely on the signs of higher-order cross derivatives of utility, and under ambiguity additionally on the absolute-ambiguity-aversion coefficient. The authors link results to experimental evidence (Attema et al. 2019 find both cross prudence and imprudence; correlation aversion in gains, seekingness in losses) and to empirical work on public health systems, which by changing the wealth-health correlation affect precautionary saving (e.g., Rosen and Wu 2004; Atella et al. 2012; Chou et al. 2003; Jappelli et al. 2007), broadly consistent with cross prudence.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core mechanism linking correlation to saving, and how is it formalized?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Correlation between income and health risk is parameterized by a single scalar k that scales the joint probability of the simultaneous bad outcome to kpq (with k=1 = independence, k&amp;gt;1 = positive correlation, k&amp;lt;1 = negative correlation), following the representation of Doherty and Schlesinger (1990). The derivative of expected period-1 utility with respect to k reduces (Lemma 1) to pq times [E[f(eps_B,del_B)] - E[f(eps_G,del_B)] - E[f(eps_B,del_G)] + E[f(eps_G,del_G)]], so the sign of the response to correlation is governed by a cross-difference whose sign maps directly onto the signs of higher-order cross derivatives of u. As k rises, the simultaneous occurrence of two bad outcomes becomes more likely; agents who dislike that combination (mixed correlation averse / cross prudent) save more to protect against it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What exactly is &amp;lsquo;mixed correlation aversion (seeking)&amp;rsquo; and how does it relate to correlation aversion and cross prudence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An individual is mixed correlation averse (seeking) if (-1)^(n+m+1) u^(n,m)(x,y) &amp;gt;= (&amp;lt;=) 0 for all n=1..N, m=1..M. It is a bivariate extension of Caballe and Pomansky&amp;rsquo;s (1996) univariate mixed risk aversion, and generalizes Epstein and Tanny&amp;rsquo;s (1980) correlation aversion (which corresponds to u^(1,1)&amp;lt;=0). Cross prudence (u^(2,1)&amp;gt;=0, per Eeckhoudt et al. 2007) is the third-order version of correlation aversion. The paper&amp;rsquo;s saving conditions use mixed correlation aversion (seekingness) excluding the second-order correlation-aversion term, expressed via the derivative pattern (-1)^(n+m) u^(n+1,m) &amp;gt;= (&amp;lt;=) 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the &amp;lsquo;good&amp;rsquo; vs &amp;lsquo;bad&amp;rsquo; ranking of risks made rigorous?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Through stochastic dominance. eps_G dominates eps_B in the sense of Nth-order stochastic dominance (NSD) iff E[u(w+eps_G,h)]&amp;gt;=E[u(w+eps_B,h)] for all u with (-1)^(n+1) u^(n,0)&amp;gt;=0, n=1..N (mixed risk aversion in wealth); analogously for health via Mth-order dominance (MSD). FSD corresponds to N=M=1. The paper also uses Ekern&amp;rsquo;s (1980) Nth-degree risk increase, where the first N-1 moments coincide (e.g., a 2nd-degree increase is a Rothschild-Stiglitz mean-preserving spread; a 3rd-degree increase is an increase in downside risk per Menezes et al. 1980).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is ambiguity about correlation modeled, and what drives the ambiguity result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The individual perceives a finite set of possible correlations {k_1&amp;lt;&amp;hellip;&amp;lt;k_Theta} with subjective second-order probabilities q_theta, and evaluates them via the recursive smooth ambiguity model of Klibanoff et al. (2005, 2009) using an increasing, concave, thrice-differentiable second-order utility phi (concavity = ambiguity aversion). Evaluating the FOC at the benchmark s_O (the optimum under the mean correlation k_O = sum q_theta k_theta) decomposes the effect into a &amp;rsquo;timing of uncertainty effect&amp;rsquo; (Osaki and Schlesinger 2014), captured by beta(s_O) which is &amp;gt;=1 iff phi exhibits decreasing absolute ambiguity aversion (DAAA), plus a covariance term Cov(phi&amp;rsquo;(v), v_s). Under mixed correlation aversion/seeking, v(s,k) and v_s(s,k) move in opposite directions in k (Lemma 3), so because phi&amp;rsquo; is decreasing the covariance is positive; combined with DAAA this yields higher savings (Proposition 2).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of decreasing absolute ambiguity aversion (DAAA)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;DAAA (lambda(z) = -phi&amp;rsquo;&amp;rsquo;(z)/phi&amp;rsquo;(z) decreasing in z) is the ambiguity analogue of decreasing absolute risk aversion. The Appendix proves (following Osaki and Schlesinger 2014) that beta(s)&amp;gt;=1 iff the ambiguity precautionary premium Psi_A &amp;gt;= the ambiguity premium pi_A, which is equivalent to DAAA. DAAA ensures the timing-of-uncertainty effect pushes toward more saving. The authors caution that empirical/experimental evidence on the sign of absolute ambiguity aversion is thin; Berger and Bosetti (2020) is cited as an exception finding evidence for DAAA, and the authors say more evidence is needed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the theoretical predictions connect to experimental and empirical observations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Experimentally, Attema et al. (2019) measure multivariate risk preferences (wealth and longevity as a health proxy) and observe both cross prudence and cross imprudence, and correlation aversion in the gain domain with correlation seekingness in the loss domain. So the model implies savings can rise or fall with correlation depending on the individual. Empirically, the wealth-health correlation is shaped by public health systems: a more protective system separates wealth and health risk (lowers correlation). Rosen and Wu (2004) find poor health leads to safer investment (consistent with cross prudence); Atella et al. (2012) find households invest more in risky assets when health risk is mitigated by a protective national health system; Chou et al. (2003, Taiwan) find public health insurance reduced precautionary saving (a correlation decrease); Jappelli et al. (2007, Italy) find higher precautionary saving where health care quality is lower (a correlation increase); Ayyagari and He (2017) and Christelis et al. (2020) find Medicare/Medicare Part D increased risky investment. These are described as consistent with cross prudence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from the closest prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Versus Eeckhoudt and Schlesinger (2008), which studies how risky shifts in future income affect saving via higher-order stochastic dominance, this paper adds correlation between two attributes and multivariate preferences. Versus Courbage and Rey (2007), who compare a certain-health vs risky-health setting, this paper compares two settings where health is risky in both but the income-health correlation differs, using the simpler Doherty-Schlesinger (1990) correlation representation. Versus Osaki and Schlesinger (2014) and Gierlinger and Gollier (2017), who study ambiguity in future income, this paper introduces ambiguity into the correlation rather than into income itself. The mixed-correlation-aversion concept builds on Jokung (2011) and Eeckhoudt et al. (2007, 2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because public health systems alter the correlation between wealth and health (e.g., medical-expense coverage separates the two risks, lowering correlation), they affect precautionary saving. The directional prediction is conditional: under cross prudence, lower correlation (more generous public health coverage) reduces precautionary saving and a positive wealth-health correlation raises saving above the independence benchmark; under cross imprudence the signs reverse. Under ambiguity the prediction additionally requires DAAA plus the relevant cross-derivative sign pattern. The authors stress that because experimental evidence shows both cross prudence and imprudence, no unconditional policy prediction follows &amp;ndash; e.g., for cross-imprudent individuals ambiguous correlation might lower savings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and directions for future research?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The results are sufficiency conditions tied to signs of higher-order cross derivatives, which are hard to interpret and whose empirical signs are not firmly established (experimental evidence is insufficient). The model is a stylized two-date setup with zero interest rate, no time discounting, additive time-separable utility, interior unique optimum, and a single scalar correlation parameter. The authors note the framework extends straightforwardly to multi-period models and suggest studying settings where the value and uncertainty of correlation change over time.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Real Effects of Exchange Rate Depreciation: The Roles of Bank Loan Supply and Interbank Markets</title><link>https://macropaperwarehouse.com/papers/real-effects-of-exchange-rate-depreciation-the-roles-of-bank-loan-supply-and-interbank-markets/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/real-effects-of-exchange-rate-depreciation-the-roles-of-bank-loan-supply-and-interbank-markets/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation. The paper asks how exchange rate movements affect the real economy and what role the banking system&amp;rsquo;s foreign-asset exposure plays in transmitting exchange rate shocks. The motivation is concrete: with the Federal Reserve’s “tapering” of quantitative easing, the euro lost slightly more than 20% against the US dollar between 2014:Q2 and 2015:Q1, a sharp, persistent and largely unanticipated move. Standard open-economy models predict depreciations raise output via the trade balance, but recent work questions this classical trade channel and emphasizes firm/bank balance-sheet channels. The paper complements this by examining how a depreciation reshapes the composition of bank credit and, ultimately, regional output—working through banks’ net foreign asset (NFA) exposure rather than trade.&lt;/p&gt;
&lt;p&gt;Data and empirical strategy. The authors build two datasets. The first is a matched bank-firm panel from the German credit registry (quarterly; reporting threshold 1 million euro, 1.5 million before 2014; ~two-thirds of German bank loans), merged with Bundesbank bank balance-sheet data and Amadeus firm accounts, yielding more than 300,000 bank-firm observations (Table 1: 344,777 for the loan-growth variable). The second matches INKAR region-level data on 401 German administrative regions with local savings-bank balance sheets, exploiting that savings banks lend within a fixed administrative district. Identification uses a difference-in-differences design around 2014:Q2-2015:Q1. The dependent variable is the log change in bank b’s credit to firm f from the pre-depreciation average (2013:Q2-2014:Q1) to the post average (2015:Q2-2016:Q1). Identification rests on banks’ differential pre-shock USD NFA share; firm fixed effects (sample restricted to firms borrowing from at least two banks) absorb loan demand (Khwaja-Mian, 2008), and bank fixed effects are added in the interaction model. Regressions are weighted by credit exposure.&lt;/p&gt;
&lt;p&gt;Main quantitative findings. (1) Only large banks with higher USD NFA expand lending after the depreciation. In the full sample the NFA coefficient is positive but just below 10% significance; for systemically important banks (SIBs) it is 5.651 (significant at 5%): a SIB with a 1-percentage-point higher NFA share than the median SIB has a 5.65 pp smaller credit contraction, and given the overall ~-7% credit decline, a SIB with a 1.24 pp higher NFA share than the median turns overall credit growth positive. (2) The effect is driven by interbank lending: dropping financial-sector borrowers makes the NFA coefficient negative and insignificant; for financial borrowers it is positive (significant at 10%), and for SIBs lending to financial borrowers the coefficient is 10.915 (1%). (3) Credit shifts toward export-intensive firms, not riskier firms: the NFA × export-intensity interaction is 0.092 (10%); a firm at the 75th vs 25th export-intensity percentile sees a credit-growth differential of about 2.4 pp per 1 pp higher NFA; Z-Score and leverage interactions are insignificant. (4) Large banks act as a central intermediary: NFA × borrowing-bank export-portfolio share is 0.268 (10%), implying a 6.9 pp credit-growth differential between borrowing banks at the 75th vs 25th portfolio-export-share percentile per 1 pp higher NFA, driven by small borrowing banks. (5) Small banks with high interbank dependence and high export-firm portfolio shares raise lending (coefficient 0.609, 5%). (6) Regional real effects: for high-interbank-dependence regions, the export-share coefficient is 0.030-0.031 (10%/5%), implying regions at the 75th vs 25th export-share percentile grow 1.2 pp more cumulatively over the two post-depreciation years relative to the two pre years; no effect (even negative) in low-dependence regions.&lt;/p&gt;
&lt;p&gt;Mechanisms and implications. The depreciation raises NFA-rich banks’ net worth (Appendix B: NFA coefficient on equity growth is 4.571 for SIBs, 1%), expanding their lending capacity. They channel this mostly via interbank loans to small, geographically constrained banks holding many exporters, which pass liquidity to export firms whose demand rises post-depreciation. Investment (not employment) of more-affected firms rises (Appendix C). The policy implication: exchange-rate depreciations can have sizeable real effects via interbank liquidity even when local banks have no direct foreign exposure; estimates are likely downward-biased since cooperative and private banks are excluded.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A difference-in-differences design around the 2014:Q2-2015:Q1 euro depreciation. The dependent variable is the log change in bank-to-firm credit from a four-quarter pre-average (2013:Q2-2014:Q1) to a four-quarter post-average (2015:Q2-2016:Q1); this pre/post averaging mitigates serial correlation (Bertrand et al., 2004) and seasonality (Duchin et al., 2010). Cross-bank identification rests on differential pre-shock USD NFA shares. The Khwaja-Mian (2008) within-firm approach restricts to firms borrowing from at least two banks and includes firm fixed effects to absorb loan demand and isolate supply; bank fixed effects are added in the interaction model. The key threat is that the depreciation be endogenous to German bank lending—addressed by arguing the shock was driven largely by Fed tapering (exogenous to German lending) and ECB policy calibrated for the euro area as a whole, not Germany. A second threat is that NFA correlates with other exposures (e.g., interest-rate risk, since rates also fell); column (4) of Table 3 controls for interest-rate exposure and the NFA coefficient survives (if anything increases). A third threat is the parallel-trends assumption, addressed by placebo tests around 2002 and all quarters 2001-2014 where the NFA coefficient is never positive and significant at 5%+. Selection between firms and banks is argued away by low correlations between firm characteristics and bank NFA (-4% leverage, -0.5% export shares, 7% size).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the two competing hypotheses on credit allocation and how are they distinguished?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;H1 (export channel): the depreciation disproportionately increases credit supply to firms with higher ex-ante export intensity, because exporters’ cash flows and creditworthiness improve. H2 (risk-taking channel): the depreciation disproportionately increases lending to riskier firms, because higher net worth loosens capital constraints (Martynova et al., 2020). They are distinguished by interacting bank NFA with (a) industry-median export intensity and proxies (size, TFP, labor productivity, capital intensity) for H1, and (b) Altman Z-Score and leverage for H2. The export interaction is positive and significant (0.092, 10% in Table 5 col 1), all four proxies are positive/significant, and in a horserace using residuals orthogonal to export intensity (col 6) only export intensity (and capital intensity) survives. The Z-Score and leverage interactions are insignificant. Conclusion: H1 confirmed, H2 rejected—no evidence of increased risk-taking.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the interbank intermediation mechanism established?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In three steps. First (Table 2), dropping financial borrowers kills the NFA effect while restricting to financial borrowers preserves it (col 7: 1.947, 10%; col 9 for SIBs: 10.915, 1%), showing the lending increase is interbank, not corporate. Second (Table 6), restricting to large lenders and financial borrowers, the NFA × borrowing-bank export-portfolio-share interaction is 0.268 (10%), a 6.9 pp differential per 1 pp NFA between borrowing banks at the 75th vs 25th portfolio export-share percentile—driven by small borrowing banks (col 2: 0.359 significant; col 3 large borrowers: 0.046 insignificant). Third (Table 7), small banks with high export-firm portfolio shares raise lending (full sample 0.452, 10%), and splitting by interbank dependence the effect is significant only for high-dependence small banks (0.609, 5%) and insignificant for low-dependence (0.141), confirming interbank liquidity—not pre-existing excess liquidity—drives the result. A double interaction (col 4: 0.025, 10%) shows small banks pass the liquidity especially to export-intensive firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Large vs small banks: only large/SIB banks with high NFA respond; small banks do not (Table 2 cols 3,5). Section 4.3 shows this is because only the largest banks have economically meaningful NFA (SIB average USD NFA/assets 4.6% vs 0.3% for others); dropping the 5 largest NFA banks among SIBs renders the coefficient insignificant (4.899) and dropping the 10 largest turns it negative and imprecise (-3.257). So it is NFA level, not size per se, that drives the response. Firm heterogeneity: export-intensive firms gain, riskier firms do not. Interbank-dependence heterogeneity: regional GDP and small-bank lending effects appear only for high-interbank-dependence banks/regions. Firm real outcomes (Appendix C): investment of exporters rises only when relationship banks have high interbank dependence (col 6: 0.146, 10%); employment effects are insignificant throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 3: (1) broadening NFA to include CHF, JPY, GBP (5.850, 5%); (2) disaggregating into gross USD assets (3.829, 5%) and gross USD liabilities (4.369, 10%, counter-intuitive but attributed to 89% asset-liability correlation acting as a proxy); (4) adding interest-rate exposure as a control (NFA rises to 6.847, 5%); (5) eight-quarter pre/post windows (4.996, 5%); (6) a 2002 placebo where NFA is insignificant, plus all-quarters-2001-2014 placebos never positive-and-significant at 5%+, supporting parallel trends. Table 8 col 5 runs a regional placebo around 2002 with no disproportionate growth. Appendix D between-firm regressions (controlling for demand via Abowd et al. 1999 firm fixed effects) confirm more-exposed firms get higher overall credit (0.868, 5%), though the export interaction there is insignificant (all exposed firms benefit, no extra amplification for exporters in the between-firm dimension). Appendix B confirms the net-worth channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is closest to Agarwal (2019), who exploits the 2015 Swiss franc appreciation and shows banks with high foreign-currency liabilities changed domestic credit and growth. This paper differs by: (i) studying a depreciation rather than appreciation; (ii) using disaggregated bank-firm credit-registry data covering non-listed firms (Agarwal uses listed firms); (iii) identifying interbank lending as the dominant channel explaining the credit increase; (iv) showing banks use interbank liquidity to lend especially to exporters; and (v) documenting higher regional GDP growth. It also contrasts with Bruno and Shin (2019), who find Mexican firms reliant on high-dollar-funding banks suffer credit and export declines after the taper tantrum; here the same taper tantrum has a positive credit effect because USD appreciation raises the value of USD assets where domestic banks hold significant foreign-currency exposure. It contributes to the interbank-markets-and-monetary-policy literature (Abbassi et al., 2014; Freixas et al., 2011; Allen et al., 2014) by showing monetary policy can affect interbank markets indirectly via the exchange rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Exchange-rate depreciations can have sizeable real effects through bank-balance-sheet and interbank channels, distinct from the trade channel, and these effects reach banks with no direct foreign exposure via interbank liquidity reallocation. Scope conditions: the result requires (a) a banking sector with significant, imperfectly hedged net foreign-currency (USD) assets concentrated in large banks; (b) an export-intensive economy where credit to exporters has aggregate bite (Germany has one of the world’s largest net-exports-to-GDP ratios); (c) a geographically segmented banking system (German savings banks) that lets regional output be linked to local-bank exposure; and (d) the depreciation being large, persistent, and largely exogenous/unanticipated (driven by Fed tapering). The 1.2 pp regional growth differential is between high- vs low-export-share regions among high-interbank-dependence regions only. The authors stress estimates are likely downward-biased because cooperative and private credit banks are omitted from the regional analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the most important caveats and limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Export turnover is reported by only a minority of Amadeus firms, so export intensity is proxied by industry medians, introducing measurement error. (2) Regional GDP is nominal (no regional CPI), justified by low, stable German inflation. (3) Within-firm regressions capture only the intensive margin; new and terminated relationships are handled separately in Appendix D between-firm regressions. (4) Firm-level real-outcome regressions (Appendix C) have small samples covering a small subset of German firms and compare 2014 vs 2012 (firm data end 2014), so they are interpreted as merely indicative. (5) The gross-foreign-liability robustness result is counter-intuitive and attributed to high asset-liability correlation. (6) The paper studies a depreciation only; asymmetric responses to appreciation and the source of the exchange-rate move (domestic vs foreign monetary policy) are left for future research.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Shock Propagation within Multisector Firms</title><link>https://macropaperwarehouse.com/papers/shock-propagation-within-multisector-firms/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/shock-propagation-within-multisector-firms/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper documents a novel channel through which trade shocks propagate across industries: the internal networks of U.S. multisector firms (the working paper circulated as &amp;ldquo;Import Competition and Firms&amp;rsquo; Internal Networks&amp;rdquo;). The motivation is that prior China-shock research traced effects through input-output networks and agglomeration but overlooked multisector firms, which account for 71% of total U.S. manufacturing employment and 25% of overall U.S. employment. When a firm owns establishments in several industries with differing exposure to Chinese import competition, it is ex ante ambiguous whether an unexposed plant gains (worker reallocation toward it), loses (dampened firm-level production from complementarities or financial constraints), or is unaffected (independent plants).&lt;/p&gt;
&lt;p&gt;Data: the Longitudinal Business Database (LBD), the Census administrative panel covering the universe of non-farm establishments with at least one paid employee. The sample is multisector firms operating at least one manufacturing establishment, including both manufacturing and non-manufacturing plants, restricted to establishments active in 1991; main period 1991-2007 (pre-trend window 1976-1991). The core sample has roughly 573,000 establishments and 62,000 firms. The average firm has 427 workers (median 22), operates in 3 SIC-4-digit sectors, and has 9 establishments (2 manufacturing, 7 non-manufacturing); over half of establishments exited during 1991-2007.&lt;/p&gt;
&lt;p&gt;Strategy: direct China shock is industry-level growth in Chinese import penetration 1991-2007 (Acemoglu-Autor-Dorn-Hanson-Price measure). The key new variable, the &amp;ldquo;indirect shock,&amp;rdquo; is an employment-share-weighted average of direct China shocks hitting the firm&amp;rsquo;s OTHER industries (own industry excluded). Both shocks are instrumented using Chinese import penetration into eight other high-income countries (following Autor et al. 2014). Dependent variable is the Davis-Haltiwanger-Schuh arc-growth rate of establishment employment (bounded -2 to 2). Regressions are weighted by initial employment with county and SIC-2- or SIC-4-digit industry fixed effects; standard errors two-way clustered by state and firm.&lt;/p&gt;
&lt;p&gt;Main findings: both direct and indirect shocks significantly reduce establishment employment growth at the 1% level. The indirect effect is an order of magnitude stronger - an interdecile increase in the indirect shock lowers the arc-growth rate by 0.126 (= -0.166 x 0.759), roughly 12 times the 0.011 reduction from an interdecile direct shock (OLS Table 2 col 2). IV estimates are larger: direct coefficient about -0.102 to -0.108, indirect about -0.131 to -0.208 (Table 3). The effect operates primarily through the extensive margin (establishment exit), not the intensive margin; the entry margin is statistically and economically insignificant. The shock spills over both across manufacturing industries within a firm (manufacturing-only indirect coefficient about -0.13 to -0.18) and from manufacturing to non-manufacturing establishments (non-manufacturing indirect coefficient between -0.25 and -0.135). The effect accumulated mainly during the 1990s and stabilized after 2001. Mechanisms: plants that use inputs from sister establishments respond more strongly (within-firm downstream linkages); firms with wider scope absorb the shock more easily; larger establishments respond more. No support for upstream-supply linkages, capital/skill intensity, firm size, or financial-constraint channels. At the sector level, the indirect shock significantly lowers manufacturing employment growth (indirect coefficient about -0.747, significant at 10%; exit margin significant at 1%), so spillovers survive aggregation.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Each establishment&amp;rsquo;s direct exposure is its SIC-4-digit industry&amp;rsquo;s growth in Chinese import penetration 1991-2007 (numerator = change in real U.S. imports from China; denominator = 1991 domestic absorption). The indirect shock is the 1991-employment-share-weighted average of direct shocks in the firm&amp;rsquo;s OTHER industries, excluding the establishment&amp;rsquo;s own industry. To purge U.S. demand-driven import growth, both shocks are instrumented by Chinese import penetration into eight other high-income countries (Australia, Denmark, Finland, Germany, Japan, New Zealand, Spain, Switzerland). Threats addressed: (1) selection/pre-existing trends - a pretrend test on 1976-1990 employment growth shows no relationship (coefficient -0.013, insignificant); (2) the indirect effect could reflect connectedness to sectors in general rather than the firm&amp;rsquo;s specific sectors - a placebo test randomizing sister-establishment sector affiliations over 500 draws yields an insignificant placebo indirect coefficient (-0.001); (3) a common clustered shock hitting all of a firm&amp;rsquo;s industries - direct and indirect shocks (and their IVs) show no significant correlation; (4) demand-shock correlation across countries - results hold when dropping computer, construction, and apparel industries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mechanisms are tested via heterogeneous treatment effects (Table 7), interacting the indirect shock with firm/establishment characteristics under SIC-4-digit FE. Within-firm trade: a &amp;lsquo;Use=1&amp;rsquo; dummy (establishment&amp;rsquo;s industry uses inputs from sister establishments&amp;rsquo; industries, from BEA I-O tables) significantly amplifies the indirect effect (interaction -0.090, significant at 5%), consistent with downstream plants losing relation-specific production; a &amp;lsquo;Supply=1&amp;rsquo; dummy (upstream linkage) is insignificant. Economies of scope: interactions with number of SIC-4 sectors and with 1-minus-HHI are both significant at 5% and positive (wider scope cushions the shock). Establishment size: larger plants respond more strongly to the indirect shock (significant), rationalized via Holmes-Stevens - large plants make standardized goods facing fierce Chinese competition - but firm size is insignificant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Spillovers occur both across manufacturing industries within a firm and from manufacturing to non-manufacturing establishments, with similar magnitudes (manufacturing indirect coefficient about -0.13 to -0.18; non-manufacturing about -0.135 to -0.25). Effects are stronger for establishments using inputs from sister plants, weaker for firms with broader scope, and stronger for larger establishments. Effects accumulated mainly in the 1990s and stabilized after 2001; subperiod analysis confirms the indirect shock was much stronger in 1991-1999 (indirect coefficient about -0.27 to -0.50) than 1999-2007.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pretrend test (1976-1990, no trend); placebo random networks (500 draws, insignificant); no direct-indirect shock correlation; disaggregated industry FE up to SIC-8-digit using NETS data (indirect coefficient stays about -0.063 to -0.065, significant at 1%); controlling for other-sector within-firm characteristics (log wages, wage and employment-share growth 1976-1991); shift-share robust standard errors following Adao et al. 2019 (which are smaller than the two-way-clustered baseline); dropping outliers by firm size and by indirect-shock deciles; dropping affiliation and industry switchers; dropping demand-shock-prone industries (computer/construction/apparel); an alternative weight using only manufacturing employment in the denominator; unweighted regressions; and an entry-margin augmentation (entry remains insignificant, exit dominates).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the China-shock literature (Autor-Dorn-Hanson 2013; Acemoglu et al. 2016; Pierce-Schott 2016; Asquith et al. 2019) but introduces within-firm sectoral networks as a new propagation channel, arguing the China shock&amp;rsquo;s impact may be larger than previously estimated. It extends the firm-internal-network literature (Giroud-Mueller 2019; Hyun-Kim 2020 on regional shocks; Cravino-Levchenko 2017 and Boehm et al. 2019 on cross-country shocks) to sector-level shocks. Versus Ding (2020), who studies manufacturing multi-industry firms with at least one directly-exporting industry, this sample is over 12 times larger and includes non-manufacturing plants. The extensive-margin (exit) finding aligns with Asquith et al. (2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the indirect channel propagates the China shock to plants with no direct exposure - including non-manufacturing establishments - and operates through permanent establishment exit, the documented economic, social, and political consequences of import competition may be even larger than estimates ignoring within-firm networks suggest. The authors stop short of quantifying the channel against other channels (supply chains, financial networks, migration, local adjustment) and note that designing optimal trade/industry policy under within-firm linkages requires a full structural model, which they leave to future work. Scope: results pertain to U.S. multisector firms with at least one manufacturing plant over 1991-2007, which cover three-quarters of manufacturing but only about 20-25% of overall employment, so sector-level estimates are less precise once non-manufacturing is included.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does the entry margin matter and what is found?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Establishment exit is more permanent than intensive-margin cuts, so it signals persistent damage. The baseline decomposition lacks an entry margin; the authors augment the sample with post-1991 entrants (assigning arc-growth of 2, weighting by midpoint employment). The exit margin remains highly significant and accounts for the overall effect, while the entry margin is quantitatively small and statistically insignificant - multisector firms do not adjust to the China shock by opening new plants.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is found at the sector level and why does it matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To rule out that laid-off workers are simply rehired by other plants in the same industry, the authors define sector employment as total employment of all plants (including single-sector firms) and build a sector-level indirect shock weighting each other sector by its within-firm importance averaged across firms. For manufacturing, the indirect sector shock is large and significant at the 10% level (coefficient about -0.747), with the exit margin significant at 1% (about -0.371). Results are strongest for manufacturing and less precise when non-manufacturing is included, because the sample covers about three-quarters of manufacturing but only about 20% of overall employment. Spillovers thus survive aggregation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;!-- flags: Working paper circulated under a different title ('Import Competition and Firms' Internal Networks'; CES 21-28) than the published JMCB title ('Shock Propagation within Multisector Firms'); confirmed same paper by authors and content., Census disclosure rounding: observation counts (e.g., 573,000; 62,000) and coefficients are rounded per Census Bureau disclosure rules, so exact magnitudes carry rounding. --&gt;</description></item><item><title>Studying Generational Risk in a Large-Scale Life-Cycle Model</title><link>https://macropaperwarehouse.com/papers/studying-generational-risk-in-a-large-scale-life-cycle-model/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/studying-generational-risk-in-a-large-scale-life-cycle-model/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Hasanhodzic and Kotlikoff ask a question prior work assumed away: how large is generational risk, and can pay-go Social Security actually mitigate it? Earlier studies (Diamond, Bohn, Krueger-Kubler, etc.) presumed generational risk is large enough to merit policy and showed Social Security can in principle share it, but did not directly measure its size. This paper measures it directly, with and without Social Security, in a realistically large overlapping-generations (OLG) model.&lt;/p&gt;
&lt;p&gt;Model setup: an 80-period annual OLG model with aggregate shocks. Agents work 45 periods (retire at R=45) and live 80, have isoelastic (CRRA) preferences with risk aversion gamma=2 (gamma=5 under the extra-large shocks calibration), annual discount factor beta=0.96 (quarterly 0.99). Production is Cobb-Douglas; log TFP is trend-stationary AR(1) (quarterly rho=0.95, sigma=0.01; annualized rho=0.814, sigma=0.019). Two calibrations add a normal capital-depreciation shock. Households invest in risky capital or one-period safe bonds (zero net supply); &amp;ldquo;soft&amp;rdquo; increasing borrowing costs (Chen-Mangasarian function, slope b) shut down private risk-sharing to expose generational risk in its purest form while still delivering a realistic risk and growth premium. Policy is pay-go Social Security with a fixed payroll tax tau=15% (also tested at 1%). The model is solved to high precision via a projection method (building on Marcet 1988; Judd, Maliar, Maliar 2011) over an 81-variable state space (79 cohort cash-on-hand values plus the TFP and depreciation shocks). Generational risk measures are evaluated 300 years into the transition; cohort utility uses generations born after year 300 of a 750-year run. The U.S. data targets cover the return to national wealth and one-month Treasuries, 1947-2015, and detrended NNP/consumption, 1929-2020.&lt;/p&gt;
&lt;p&gt;Four calibrations: (1) baseline (TFP shock only, matched to output/consumption variability); (2) larger shocks (adds depreciation shock to match variability of the return to national wealth); (3) extra-large shocks (bigger depreciation shock to match U.S. equity-market return variability, a la Krueger-Kubler); (4) negative risk-free-rate baseline (steeper borrowing costs giving a roughly negative 2% safe rate, to test Blanchard 2019).&lt;/p&gt;
&lt;p&gt;Main findings (compensating-consumption differentials needed to reach long-run average lifetime utility): generational risk is 1.396% under baseline, 2.128% under larger shocks, and 15.303% under extra-large shocks (without Social Security). The authors view baseline 1.396% as small (on the order of a good-sized distortion) and prefer the baseline calibration. Social Security slightly WORSENS baseline generational risk (rising to 1.462%), but reduces it by 8% in the larger-shocks and 19% in the extra-large-shocks calibrations. So Social Security&amp;rsquo;s risk-pooling value depends on calibration. Contemporaneous risk (absolute consumption adjustment for full risk sharing among living cohorts) is tiny: 0.206% baseline, 0.933% larger shocks, 0.437% extra-large; Social Security raises it to 0.310% in baseline but lowers it under the other two.&lt;/p&gt;
&lt;p&gt;On welfare and Blanchard&amp;rsquo;s conjecture: pay-go Social Security at a 15% tax cuts long-run expected utility by 18% in baseline and larger-shocks, and by 56% in extra-large shocks, via crowding out (long-run capital falls 28% baseline, 56% extra-large). Under the negative-safe-rate calibration there is still an 18% long-run welfare loss; the average growth rate is zero in all simulations. The authors find no support for Blanchard&amp;rsquo;s (2019) claim that deficits can be Pareto-improving when safe rates run below growth: even under Blanchard-favorable conditions, crowding out swamps risk sharing (e.g., 17.83% utility loss at 15% tax, 1.17% at 1% tax). Macro shocks are second-order for policy: the capital transition under Social Security with shocks closely tracks the no-shock (deterministic) path, echoing Lucas (1987).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What exactly is the paper&amp;rsquo;s primary measure of generational risk?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is the average absolute percentage adjustment to a cohort&amp;rsquo;s annual consumption needed to equate that cohort&amp;rsquo;s realized lifetime utility to the long-run cross-cohort average realized lifetime utility. Formally, for each generation born in period t they compute lambda_t = U-bar / U_t (U_t is realized lifetime utility, U-bar the average over generations born in years 301-750), then take the mean absolute deviation of lambda from 1. It captures both being born in a bad state and being hit by a bad sequence of lifetime shocks. A value near zero means birth date barely matters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does annualizing to 80 periods matter relative to two-period models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With one year per period, an agent experiences 45 annual wage shocks and 79 annual investment-return shocks that largely average out, and can self-insure by adjusting saving annually. In a two-period model a single negative TFP shock hits a worker&amp;rsquo;s entire lifetime earnings or a retiree&amp;rsquo;s whole old-age return. The authors note, however, that because TFP shocks are positively autocorrelated, amplifying multi-period shocks could in principle generate more risk, not less, so the result is not mechanical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is private risk-sharing handled, and why shut it down?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In three of four calibrations the authors impose &amp;lsquo;soft&amp;rsquo; increasing borrowing costs (Chen-Mangasarian function, parameter b) calibrated so the marginal borrowing cost is 15-20 times the safe rate (b=28 baseline, 25 larger shocks, 45 for negative-safe-rate cases). This nearly closes the bond market, isolating generational risk with no private or public mitigation. The extra-large calibration omits borrowing costs because its large depreciation shock alone delivers a realistic risk premium (and to match Krueger-Kubler). Notably, adding borrowing constraints has little impact on key macro aggregates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does Social Security INCREASE generational risk in the baseline (single-TFP-shock) case?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Five reasons given: (1) benefits depend on the prevailing wage, so autocorrelated TFP wage shocks now interact with capital-return shocks through retirement, extending nonlinear discounting past retirement; (2) crowding out lowers wages and raises risky returns, so the same percentage TFP shock is larger in absolute terms, making realized resources more variable; (3) Social Security is a random floor on old-age living standards, encouraging less risk-averse consumption and a higher propensity to consume; (4) positive TFP autocorrelation (high benefits today predict high benefits tomorrow) further raises the propensity to consume; (5) Social Security alters the stochastic distribution of the 79 cohort cash-on-hand state variables, producing complex consumption changes. This echoes Rios-Rull&amp;rsquo;s (1994) paradox that better micro insurance can amplify macro fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper test Blanchard&amp;rsquo;s (2019) &amp;lsquo;deficits may be free&amp;rsquo; conjecture and what does it find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It uses Blanchard&amp;rsquo;s own ex-ante Pareto criterion but with 80 periods (vs his 2), realistic risk aversion, and dropping his assumption that half of wages are perfectly safe. Calibrations engineered with negative safe rates and large growth premiums (e.g. risky ~2%, safe ~negative 2%) still show Social Security reducing long-run expected utility: 17.83% loss at a 15% tax (1.17% at 1%) in the standard-premium case, falling to 12.51%/12.582% (15% tax) under even-larger growth premiums, but always negative. Crowding out dominates any risk-sharing gains. The authors find no support for the conjecture. They note Blanchard&amp;rsquo;s Pareto gains, when they arise, depend critically on his assumption that half of wages are certain, leaving workers ideally placed to insure the elderly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity across cohorts is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Baseline generational risk has mean 1.396%, s.d. 1.293%, max 4.949% (no Social Security). Decomposed: generations with worst luck need roughly +5.0% positive adjustment; those with best luck need roughly negative 5.1%. Extra-large shocks produce extreme spread: max positive adjustment 66.14%, max negative 44.10%. A separate exercise (Table 8) shows the cost of uncertainty depends on birth state due to mean reversion: those born with low capital actually prefer uncertainty (negative 1.482%) because capital and wages will rise, while those born with high capital would pay 2.374% to lock in their state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the welfare-cost-of-uncertainty and precautionary-saving findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under larger shocks, the compensating variation between the stochastic steady state and a no-shocks steady state is only 1.12% (newborns would need 1.12% more consumption each year to match a never-shocked long run), despite that calibration overstating macro variability. This is small because precautionary saving raises the stochastic economy&amp;rsquo;s average capital stock 18.4% above the no-shocks steady state: the uncertain long run is &amp;lsquo;riskier, but richer.&amp;rsquo; A decomposition removing the 0.77% average age-specific consumption difference leaves a 0.34% residual (about one quarter of 1.12%) reflecting age-pattern and cohort-sequence heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper build on and differ from Krueger-Kubler (2006)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Five differences: (1) many more periods (80 vs 9) permit better shock-averaging and more precise autocorrelation treatment plus more self-insurance opportunities; (2) two calibrations the authors view as more realistic than KK (who chose theirs partly to favor a Pareto improvement), using borrowing costs rather than excessively large depreciation shocks to get a realistic risk premium; (3) ex-ante rather than ex-interim expected utility; (4) explicit measurement of generational risk with and without Social Security; (5) testing whether a large growth premium can sustain an intergenerational Ponzi scheme at scale. Like KK, they find a negative net long-run welfare impact of pay-go Social Security.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the model deliberately omit, and why?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is &amp;lsquo;intentionally bare bones to maximize the potential for generational risk&amp;rsquo;: no variable labor supply (which would help cohorts self-insure), no progressive income taxation (which redistributes from winning to losing generations), and no social insurance other than Social Security. It also omits capital-adjustment costs (which would raise asset-return volatility) because incomplete markets make firm investment policy ill-defined when differently-aged shareholders disagree; the depreciation shock is a crude proxy for adjustment-cost-driven asset-return shocks. The authors flag correlated idiosyncratic shocks (Harenberg-Ludwig) as important future work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How well does each calibration match the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Baseline matches output (model 3.72% vs data 3.33%) and consumption (2.10% vs 1.75%) variability but understates the s.d. of the return to national wealth by an order of magnitude (0.14% vs 4.89%). Larger shocks reproduces the return-to-wealth s.d. (4.61-4.62% vs 4.89%) and a realistic wage/return correlation (negative 0.054) but overstates macro-aggregate variability. Extra-large shocks matches equity Sharpe ratio (model 0.333 vs target 0.286; risk premium 4.63%, return s.d. 13.92%) but overstates return-to-capital variability nearly three-fold and consumption variability sixteen-fold. The model&amp;rsquo;s overall risk premium ranges 3.55-6.03% vs 5.43% in data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the role of the bond market across calibrations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The one-period bond market only operates in the extra-large shocks calibration (borrowing costs close it in the others). There, the young short bonds and the old lend: because the young&amp;rsquo;s resources are mostly human capital (less risky than, and negatively correlated with, stock returns), the young use bonds to insure the old. Workers effectively borrow to hold equity, which the authors rationalize via student loans, credit cards, mortgages alongside 401(k) equity, or implicit long-term firm contracts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What policy implications follow, and what are their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If macro shocks are calibrated to realistic macro-aggregate volatility (the authors&amp;rsquo; preferred baseline), generational risk is small (about 1.4%) and pay-go Social Security slightly worsens it while imposing an 18% long-run welfare loss via crowding out; deterministic models (e.g. Auerbach-Kotlikoff 1987) then suffice to capture the long-run impact of intergenerational redistribution. Social Security&amp;rsquo;s risk-mitigation value emerges only under calibrations that overstate macro volatility (larger/extra-large shocks). The scope condition is decisive: the case for Social Security as generational insurance hinges on which calibration one finds realistic, and the authors&amp;rsquo; preferred reading implies a weak case. They also caution the conclusions may not extend to models with correlated idiosyncratic risk.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Time Averaging Meets Heckman, Lochner, and Taber and Ben-Porath</title><link>https://macropaperwarehouse.com/papers/time-averaging-meets-heckman-lochner-and-taber-and-ben-porath/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/time-averaging-meets-heckman-lochner-and-taber-and-ben-porath/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: How does endogenizing retirement (career-length) choice change the labor-supply and human-capital implications of the canonical Heckman, Lochner, and Taber (1998a, HLT) life-cycle general-equilibrium model, and what does this imply for social-security reform, labor-income taxation, aggregate labor-supply elasticities, and inequality? HLT already contains two ingredients of Ljungqvist-Sargent (2006) &amp;ldquo;time-averaging&amp;rdquo; models — credit markets and within-period labor-supply indivisibilities — but shuts time-averaging down by assuming inelastic labor supply until a mandatory retirement age of 65. The authors &amp;ldquo;activate&amp;rdquo; time-averaging by letting workers choose when to retire and by adding a pay-as-you-go social security system. This matters because the micro-foundation of the high aggregate labor-supply elasticity that Prescott invoked (switching from Rogerson&amp;rsquo;s employment lotteries to time-averaging) hinges on whether workers sit at corner solutions for career length.&lt;/p&gt;
&lt;p&gt;Model setup: A perfect-foresight OLG model in discrete annual time; agents live from age 18 to 80. Eight agent types index four innate ability levels (theta in {1,2,3,4}) crossed with two education levels (high school S=1, college S=2). Each type has a Ben-Porath (1967) human-capital technology. An aggregate CES/Cobb-Douglas production function combines physical capital and two human-capital aggregates. Within-period labor is indivisible (work full time omega=1 or not omega=0). Utility is time-separable with intertemporal elasticity 1/gamma and a fixed disutility B of working. The baseline social security program has payroll tax rate tau_p=0.10, eligibility age eta_p=65, and benefit P=8 (about 40% of average earnings), paid only to retirees; collecting nothing while working after 65 creates an implicit tax that pins all workers to a corner at age 65.&lt;/p&gt;
&lt;p&gt;Calibration: Most parameters are borrowed or backed out from HLT (delta=0.96, gamma rounded from 0.9 to 1, tau_l=tau_k=0.15, tuition zeta=1.02 thousand 1992 dollars). New parameters: disutility B=0.8, fraction of capital held by in-model agents kappa=0.388, efficiency-decline logistic parameters phi1=0.2, phi2=75. The model targets a capital-output ratio of 4 and an after-tax interest rate of 0.05; the calibrated model reproduces HLT&amp;rsquo;s baseline and post-skill-biased-technological-change (SBTC) steady states closely (e.g., baseline interest rate 0.0588 matched; aggregate human capital H1≈274/249, H2≈280/287 in HLT/our model).&lt;/p&gt;
&lt;p&gt;Main quantitative findings (with scope conditions): (1) Social security reform that pays benefits from 65 regardless of work removes the implicit tax wedge. At fixed prices all workers extend careers (high school +2.4 years on average; college +7.6 years to age 72.6); in general equilibrium effects are attenuated — high school workers actually retire ~1 year early (average 63.9) while college workers retire later (average 70.8). (2) Tax experiment along Prescott (2002) lines: raising tau_l with revenue rebated lump-sum produces a Laffer curve peaking at tau_l=0.54; without rebates the Laffer curve peaks at tau_l=0.73 (general equilibrium) and the small-open-economy version is nearly linear. (3) The aggregate labor-supply elasticity is zero at low tax rates (corner at 65), then rises above 1 and levels around 1.2 over a wide middle range before rising again past tau_l=0.7. (4) Ben-Porath nonconvexities create &amp;ldquo;tipping points&amp;rdquo;: e.g., high school ability-3 workers are indifferent between two starkly different career strategies over tax range 0.42-0.52, and at high tax rates workers jump discretely from long careers with high human capital to much shorter careers with little/no on-the-job investment.&lt;/p&gt;
&lt;p&gt;Implications: College-educated (steeper-earnings-profile) workers&amp;rsquo; labor supplies are more resilient to tax and social-security reforms than high school workers&amp;rsquo;. High tax rates with lump-sum rebates can produce a &amp;ldquo;dual labor market&amp;rdquo; / bifurcation, raising lifetime earnings inequality (Gini) while welfare conditioned on schooling converges, all at a growing efficiency cost.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the core methodological contribution relative to HLT?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors retain HLT&amp;rsquo;s primitives (credit markets, indivisible within-period labor, Ben-Porath human capital, aggregate production) but replace HLT&amp;rsquo;s exogenous mandatory retirement at 65 with endogenous career-length choice, and add a pay-as-you-go social security system. The social security system with an implicit tax on working past 65 puts all workers at a corner solution at age 65, so the model reproduces HLT&amp;rsquo;s outcomes. This provides a choice-theoretic rationalization for retirement behavior that HLT hard-wired. They state HLT could have used this time-averaging model with endogenous retirement to obtain the same quantitative findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is there no separate identification/empirical strategy in the usual sense?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This is a calibrated/quantitative general-equilibrium model, not a reduced-form causal study. Parameters are borrowed or &amp;lsquo;backed out&amp;rsquo; from HLT (who estimated human-capital technologies via nonlinear least squares on NLSY 1979-1993 earnings profiles for white male civilians, plus CPS 1963-1993 and NIPA aggregates). New parameters are calibrated to be compatible with HLT: B and the efficiency-decline parameters (phi1, phi2) are jointly set so all agents retire at 65 in baseline; kappa=0.388 is set to match HLT&amp;rsquo;s interest rate given a capital-output ratio of 4; sigma (dispersion of nonpecuniary college cost) is calibrated to match the 8% rise in the relative college skill price between HLT&amp;rsquo;s two steady states; ability-specific means mu_theta target college enrollment rates from Taber (2002, Table 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the three forces that make high school workers retire earlier than college workers under the social security reform?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, the social security system redistributes from high-ability to low-ability agents (equal benefit, proportional payroll tax), and the income effect on low-ability (mostly high school) workers reduces their labor supply; removing social security entirely (recalibrating kappa from 0.388 to 0.767) shows lowest-ability high school workers extend careers most. Second, per Ljungqvist-Sargent (2014), the more elastic an earnings profile to accumulated work, the longer the career; giving high school workers college workers&amp;rsquo; more productive human-capital technology lengthens their careers. Third, a time-averaging &amp;lsquo;apprenticeship&amp;rsquo; effect: college is treated as a fixed pre-work requirement Z tacked onto an optimal working span, so at an interior solution optimal career length = baseline length + Z; this accounts for roughly a 4-year career-length difference between high school and college workers in the relevant perturbed economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do the effects of a labor tax increase depend on how revenue is spent, and what is the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Prescott (2002): if revenue is rebated lump-sum (a good substitute for private consumption), the income effect of the tax is suppressed and the substitution effect dominates, sharply reducing labor supply (Laffer peak at tau_l=0.54). If revenue is squandered or spent on poor substitutes, income and substitution effects roughly cancel under balanced-growth preferences, so labor supply is little affected (Laffer peak at tau_l=0.73 in GE; nearly linear/flat in the small-open-economy version where capital inflows hold the interest rate constant at 0.059). With lump-sum rebates the equilibrium interest rate is U-shaped in the tax rate and the Laffer curve eventually approaches zero (output collapses); without rebates the interest rate rises monotonically to offset what would otherwise be capital inflows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the Ben-Porath nonconvexities and the &amp;rsquo;tipping points&amp;rsquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Returns to on-the-job human-capital investment can only be harvested over a long enough career, so the value function over retirement ages can become non-concave with two local maxima: a long career with high end-of-life human capital versus a short career with little/no investment. As a determinant (tax rate, disutility, technology productivity) changes incrementally, the optimal response can be discontinuous — a discrete jump to a much shorter career and much less human-capital accumulation. Example: at tau_l=0.45 high school ability-3 workers have two optima, retirement at 65 (high human capital) and early retirement at age 50 (low human capital); they are indifferent over tax range 0.42-0.52. The nonconvexity is intrinsic to the Ben-Porath technology and arises even in a laissez-faire economy with interior career-length solutions, not only because of the social-security corner.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the indifference between career strategies handled in equilibrium (heterogeneity and computation)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When otherwise-identical agents become indifferent between two career strategies, the regularity condition of a unique solution fails. The authors extend the equilibrium definition to allow equilibrium fractions of identical agents choosing different strategies; market clearing pins down these fractions (a &amp;lsquo;convexification&amp;rsquo;). Computationally they identify the &amp;lsquo;most indifferent&amp;rsquo; worker type (smallest gap between the two local maxima; threshold 0.05%) and vary the fraction retiring at each age until GE conditions are satisfied. They also introduce continuous retirement ages via cubic-spline interpolation of the value function, validated against a closed-form analytical formula for agents who do not accumulate human capital (largest deviation only about half a month at tau_l=0.61).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across the eight worker types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;College enrollment rises with ability in baseline (about 0.11, 0.34, 0.56, 0.86 for ability groups 1-4 in the authors&amp;rsquo; model). Group 4 has the second-highest average disutility of attending college, so 14% of group 4 become high school workers despite large advantages, and group 4&amp;rsquo;s enrollment falls most sharply with higher taxes. Group 1 has the highest disutility and lowest college human capital, so only ~11% attend college, falling below 1% above tau_l=0.45. End-of-life human capital of lower ability groups (1,2) falls monotonically with taxes, while higher ability groups (3,4) initially raise human capital as the interest rate falls. High school ability-1 workers eventually stop working entirely at the highest tax rates, with lifetime labor earnings falling to zero, relying on lump-sum transfers and social security.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper find for aggregate labor-supply elasticity, and why is ~1.2 notable?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With lump-sum rebates, after an initial range of zero elasticity (all at the corner of retiring at 65), the elasticity quickly rises above 1 and levels around 1.2 over a substantial middle range, then rises again after tau_l=0.7 (as physical capital gets scarce and the interest rate rises steeply). The ~1.2 is notable because in the Ljungqvist-Sargent (2014) framework with the same utility, the analytical aggregate elasticity is exactly one regardless of the learning-by-doing wage exponent; the model obtains ~1.2 despite college workers being stuck at the corner until tau_l≈0.6, because falling college enrollment shifts would-be college workers into earlier-retiring high school careers. Without rebates the elasticity is suppressed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the inequality findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two measures: present value of lifetime labor earnings and lifetime utility. The pre-tax earnings Gini is roughly flat for the first five percentage points above baseline (all still retiring at 65), then rises nearly one-to-one with the tax rate until tau_l=0.65, flattens as college ability groups 2 and 3 switch to short careers, drops when group 4 (highest earners) switches, then rises again as college workers&amp;rsquo; relative earnings surge (driven by the rising college skill premium compensating for tuition and nonpecuniary costs). Using the Holter-Ljungqvist-Sargent-Stepanchuk (2025) ex post-ex ante welfare measure, higher taxes with lump-sum transfers shrink welfare inequality conditional on schooling even as income inequality grows, at an efficiency cost that accelerates above tau_l=0.4.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How do taxation results differ under the social security reform versus the baseline social security system?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Laffer curves under the reform (Figure 12a) closely resemble the baseline (Figure 2a). The key difference is that under the reform workers are at interior career-length solutions, so high school workers&amp;rsquo; average retirement age falls with the very first tax increments (rather than staying stuck at 65), and college workers raise average retirement ages over a mid-range of taxes. At sufficiently high taxes the two economies become identical (above tau_l=0.74 with, 0.72 without rebates), because the implicit post-65 tax wedge becomes irrelevant once everyone retires early. Under the reform, college workers&amp;rsquo; careers are &amp;lsquo;anchored&amp;rsquo; near the age where human-capital efficiency depreciates rapidly rather than by the official retirement age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the paper relate to and differ from Fan, Seshadri, and Taber (2024)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;FST (2024) independently endogenize career lengths in a Ben-Porath model estimated on SIPP data for male high school graduates, with nine worker types differing in disutility B(theta), learning ability A(theta), and initial human capital H(theta). A key difference: FST impose identical Ben-Porath exponents across all workers, so the Ljungqvist-Sargent force (more elastic earnings profiles imply longer careers) is largely absent; and FST do not impose balanced-growth preferences, so income effects of higher wages do not cancel. The authors suspect the sharp declines in career length with higher productivity in FST&amp;rsquo;s first two rows reflect income effects, and that time-averaging strengthens income effects. In the authors&amp;rsquo; own balanced-growth model, the level of wages does not affect labor supply — only the terms on which human capital can be accumulated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness/sensitivity checks and appendices are reported?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Appendix C: sensitivity analysis of disutility B and the efficiency-decline function e(n); searching over (B, phi1) that keep all agents retiring at 65 yields end-point coordinates approximately (0.59, 0.09) and (0.9, 0.31), with the baseline (B=0.8, phi1=0.2) chosen as an intermediate pair subject to no noticeable efficiency decline before the 60s. Appendix D: alternative social security reforms raising benefits — college workers keep retiring at 65 while high school workers retire ever earlier. Appendix F.1: elasticity of the aggregate human-capital composite Q. Appendix G: replacing the Ben-Porath technology with exogenous earnings-experience profiles yields less polarization (lower Gini) and a lower aggregate labor-supply elasticity. The authors also note an unresolved discrepancy: their present-value earnings are 6.9-7.0% (high school) and 7.1-7.2% (college) lower than HLT&amp;rsquo;s Table II, but college enrollment is little affected since differences are similar across schooling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and policy scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results depend on balanced-growth preferences (income/substitution effects of wage levels cancel), on HLT&amp;rsquo;s estimated human-capital technologies and nonpecuniary college-cost distributions, and on the auxiliary kappa device for targeting the capital-output ratio. The disutility B and efficiency-decline parameters are not pinned down by data when workers sit at the 65 corner, hence only a sensitivity analysis. Limited heterogeneity (only 8 types) means aggregate smoothness comes from convexification rather than from a continuum of switching agents. The central policy warning — that high enough tax wedges or distortions can dislodge even high-productivity workers into a &amp;lsquo;dual labor market&amp;rsquo; with earlier retirement and less human-capital accumulation, risking an implosion of activity — applies within this calibrated structure.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Uncertainty Shocks and the Cross-Border Funding of Banks: Unmasking Heterogeneity</title><link>https://macropaperwarehouse.com/papers/uncertainty-shocks-and-the-cross-border-funding-of-banks-unmasking-heterogeneity/</link><pubDate>Wed, 01 Jan 2025 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/uncertainty-shocks-and-the-cross-border-funding-of-banks-unmasking-heterogeneity/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: How does country-specific uncertainty explain variation in the cross-border funding of banks? Studying this link is practically relevant given rising reliance on international borrowing under financial globalization and the role of international banking in transmitting the Global Financial Crisis (GFC). The few prior studies on uncertainty and cross-border bank funding (Cerutti et al. 2017; Choi and Furceri 2019) focus on a single uncertainty measure and aggregate flows. Bénétrix and Curran&amp;rsquo;s innovation is to decompose both the funding source (banks vs. non-banks) and the type of uncertainty measure, &amp;ldquo;unmasking&amp;rdquo; heterogeneity that aggregate panel studies hide.&lt;/p&gt;
&lt;p&gt;Data and setup: International bank funding is measured as cross-border liabilities (loans plus debt securities) of banking systems reporting to the BIS Locational Banking Statistics (LBS), decomposed into liabilities vis-a-vis banks and non-banks (non-bank flows derived as the difference between all-sector and bank liabilities). The core sample is 24 reporter countries (excluding small states/financial centers driven by global shocks, e.g. Russia/China omitted for short coverage), quarterly 2003Q1–2018Q4. The crisis period is defined as 2008Q3–2012Q2 (start = TED spread record/Lehman; end = Draghi&amp;rsquo;s &amp;ldquo;whatever it takes&amp;rdquo;), with pre-crisis 2003Q1–2008Q2 and post-crisis 2012Q3–2018Q4 sub-samples. A newly compiled uncertainty dataset spans three classes: volatility-based (implied volatility at 1-month and 3-month maturities from Bloomberg OVM; realized volatility from national equity indices), news-based (EPU and the World Uncertainty Index WUI from policyuncertainty.com), and forecast-based (forecast dispersion = standard deviation of GDP-growth forecasts across forecasters, from Bloomberg ECFC). Coverage: 24/24 countries for realized vol, implied vol, and WUI; 16/24 for EPU; 15/24 for forecast dispersion.&lt;/p&gt;
&lt;p&gt;Empirical strategy: Two parts. First, descriptive dynamics of banking and uncertainty series (moments, persistence via AR(1)). Second, dynamic panel regressions with country fixed effects and Pesaran-Smith mean-group (MG) estimators, plus country-by-country regressions, of log cross-border liabilities on log uncertainty and a lagged dependent variable (so beta is an elasticity); standard errors clustered by source country. Multivariate models add lagged conditioning factors (real GDP growth, stock-market growth, policy rates, credit growth, exchange-rate growth, inflation, external debt/GDP). A GFC dummy and uncertainty-GFC interaction capture the time dimension.&lt;/p&gt;
&lt;p&gt;Main findings with magnitudes: Uncertainty is associated with less cross-border borrowing; effects are sizable but heterogeneous. A 1% rise in 3-month implied volatility can contract funding by up to 4.1%; across implied/realized volatility (same sample) elasticities run 1.5%–4.1% depending on measure, sector, and estimator. Volatility-based measures show the largest elasticities, then news-based. Contractions are largest for non-bank funding and smallest for aggregate (suggesting bank/non-bank substitution that mutes the aggregate). Economically, a one-standard-deviation uncertainty shock typically cuts aggregate funding by between $573 billion and $889 billion (the bounds correspond to 1-month vs. 3-month implied volatility; average aggregate funding is $820B, average non-bank funding $223B). Country regressions give similar but more often insignificant results. Over time: volatility-based uncertainty matters only during the GFC (interaction term strongly negative), while news-based uncertainty (EPU, WUI) is the only measure whose first two moments rose since the GFC and is the only one that dampens funding outside the crisis, particularly for European countries (EU15/euro area). Mechanisms discussed but not tested: deleveraging/precautionary saving, liquidity management, demand vs. supply channels (weaker supply channel for advanced &amp;ldquo;safe&amp;rdquo; countries).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is explicitly descriptive/documentary, not structural (&amp;lsquo;The goal of this paper is to document empirical evidence, not to model mechanisms&amp;rsquo;). Identification comes from dynamic panel fixed-effects and mean-group regressions of log cross-border liabilities on log uncertainty with a lagged dependent variable, plus country-by-country regressions. The main threat is reverse causality (uncertainty and bank flows co-determined). The authors mitigate this following Bruno and Shin (2015b) by re-estimating with uncertainty lagged one period (similar results, in the online appendix) and by lagging conditioning factors one quarter. They argue the lagged dependent variable absorbs much variation, leaving less for uncertainty and ameliorating omitted-variable bias, but they do not claim causal estimates. They do not use instruments; the multilateral (vs-the-rest-of-the-world) data is used to avoid purely idiosyncratic counterparty shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Four dimensions. (1) Funding sector: non-bank funding grows faster and is more volatile than bank funding, which is more volatile than aggregate; non-bank funding grew faster than bank funding in 75% of countries over the full period (54% pre-crisis, 75% during, 75% post-crisis). Uncertainty contractions are largest for non-banks, smallest for aggregate. (2) Uncertainty measure: volatility-based show the largest elasticities, then news-based; forecast dispersion is weakest/often insignificant. (3) Country: riskier countries (emerging markets like Brazil/Turkey; peripheral euro members Italy/Portugal/Spain) show significance for bank flows, while safe havens (Germany, USA) show significance for non-bank flows; some countries (Singapore, Norway, Switzerland) are largely unaffected; Finland and Japan show positive (wrong-signed) responses. (4) Time: volatility-based uncertainty matters only during the GFC; news-based matters outside it, especially for Europe.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the candidate mechanisms and are they tested?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mechanisms are discussed but explicitly left for future research. Deleveraging/precautionary saving: under higher uncertainty banks shrink balance sheets and borrow less abroad. Liquidity management: uncertainty creates liquidity concerns, so banks may borrow more or less depending on term horizons. Rebalancing: volatility-based uncertainty (tracking equity risk) may drive borrowing from a risk-management/rebalancing perspective, while news-based uncertainty may operate through liquidity. Demand vs supply: higher uncertainty can cut a country&amp;rsquo;s banks&amp;rsquo; demand for funds or foreign supply of funds; advanced/safe-haven countries are argued to face a weaker supply channel because the rest of the world keeps trusting them, consistent with safe havens reducing non-bank funding demand while aggregate is little changed (a shift between bank and non-bank funding).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does volatility-based uncertainty produce the strongest results even though it is narrower than news-based?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A priori the broader news-based measures might be expected to matter more, but the authors find volatility-based the strongest. They reason that cross-border banking decisions place greater weight on financial-system conditions, which volatility-based uncertainty (tracking the stock market) captures directly; banks holding securities may need to rebalance, diversify, or recapitalize via international borrowing/lending in response to equity risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Bivariate vs multivariate: adding conditioning factors (GDP, stock market, inflation, policy rate, exchange rate, credit, external debt) leaves the negative uncertainty relation; multivariate panel elasticities narrow to roughly -2.2% to +0.5% vs bivariate -4.1% to +0.3%, MG largely unchanged. (2) Balanced 13-country fixed sample (panels C/D of Table 1) to compare measures on identical samples; similar negative, heterogeneous results. (3) One-period lag of uncertainty to address reverse causality (similar). (4) Crisis dummy plus interaction and separate pre/post-crisis estimation. (5) Alternative forecast-based measures (forecast-error dispersion, mean absolute forecast error) gave similar results. (6) An earlier version purged realized/implied volatility of the VIX to get idiosyncratic volatility (similar). (7) Persistence robust to including a constant; AR(1)/half-life analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from Choi and Furceri (2019)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It is closest in spirit to Choi and Furceri (2019), who find a negative relation between banking flows and uncertainty using realized volatility and EPU on bilateral, aggregate flows (assets and liabilities). Bénétrix and Curran instead decompose flows into bank vs non-bank sub-components and use a broad set of uncertainty measures (implied volatility at two maturities, realized volatility, EPU, WUI, forecast dispersion), arguing this avoids the limitations of relying only on backward-looking realized volatility or cross-country-incomparable EPU. The nuanced result that news-based uncertainty matters outside the GFC (because only it rose since the crisis) departs from existing panel studies like Choi and Furceri. From Cerutti et al. (2017) they take the relevant takeaway that cross-border flows decline when the US VIX rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the dynamic/descriptive findings on the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cross-border funding grew over two decades, especially pre-GFC; non-bank funding dominates growth during/after the crisis and is the most volatile, aggregate the least (e.g., Singapore and Finland std devs of 4.1 and 21). Cross-country average growth of non-bank liabilities is 2.2% vs 1.3% for bank liabilities. 64% of countries show positive autocorrelation in aggregate liabilities for the full period, while ~60% show negative autocorrelation for the two sub-components; pre-crisis ~80% show negative aggregate autocorrelation. Means/medians of flows are u-shaped (positive-negative-positive across pre/during/post), std devs n-shaped. For uncertainty, volatility-based moments peak during the crisis; only news-based (EPU, WUI) rose during and since the crisis. Uncertainty shocks are short-lived (half-lives about one quarter); ordering from least to most persistent: forecast-based, WUI, EPU, 1-month implied vol, realized vol, 3-month implied vol.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable country-specific results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;3-month implied volatility elasticities range -14.1% to 11.5% (non-negative ones all insignificant); 1-month range -11.4 to 10.3; realized volatility -18.7 to 14 (with some significant positive estimates: Japan +4.7 overall, Finland +13.6 and +13.9 for overall/bank). EPU ranges -11.2 to 20.9 (positive significant for Japan in aggregate/bank, Brazil non-banks); WUI tighter, -4 to 2.7 (max contraction 4% for Austria bank funding; India positive). Forecast dispersion -30.7 to 4.4 (or -8.2 to 4.4 excluding Brazil); significant negative for UK (all sectors) and Brazil/Italy/UK (non-banks). France, Portugal, Ireland show robust negative responses; Portugal is significantly negative for all measures and sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Policymakers should note that uncertainty mattered most during the GFC and European Sovereign Debt Crisis, and that news-based uncertainty has a distinct, sizable dampening effect on cross-border flows since the Great Recession, particularly for European nations (EU15/euro area), because only news-based uncertainty rose post-crisis. A single uncertainty measure does not fit all, since banking systems differ in structure, ownership, cross-border activity, size, and local-economy exposure. Scope conditions: results are associations not causal effects; effects are concentrated in the crisis window for volatility measures; non-European and emerging markets show no significant news-based effect outside the crisis; the sample is 24 countries, 2003Q1–2018Q4, multilateral liabilities only.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and limitations the authors acknowledge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Data limitations prevent regression analysis on intragroup, financial, and non-financial flow sub-components (explored only preliminarily). Non-bank liabilities are derived as a residual (all sectors minus non-banks) because bank-counterparty data are partly missing, though the authors argue the impact is minimal. Uncertainty coverage is unbalanced across measures (EPU 16, forecast dispersion 15 of 24 countries). Implied volatility (OVM) and forecast (ECFC) series could not be automated and required manual snapshots. The AR(1) persistence choice may miss nonlinearities/structural breaks and gives an upper bound on persistence. Country-level coefficients are often statistically insignificant given the strong lagged dependent variable. Mechanisms/channels are not tested and left for future work.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Asset Exemption in Bankruptcy, Access to and Cost of Credit</title><link>https://macropaperwarehouse.com/papers/asset-exemption-in-bankruptcy-access-to-and-cost-of-credit/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/asset-exemption-in-bankruptcy-access-to-and-cost-of-credit/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Under U.S. Chapter 7 bankruptcy, an individual entrepreneur has most unsecured debt discharged and only her non-exempt assets liquidated, producing an &amp;ldquo;insurance effect.&amp;rdquo; But this protection does not extend to assets voluntarily pledged as collateral, so a borrower can undo the insurance by posting sufficient collateral. The paper asks how asset exemption interacts with the decision to post collateral to shape access to and the cost of credit. The novel insight is that, because the opportunity cost of pledging collateral (forgoing the exempt assets one would otherwise keep in default) is lower for safe entrepreneurs than for risky ones, collateral becomes a more effective sorting device as exemption rises. Existing empirical work (Gropp et al. 1997; Berkowitz and White 2004; Berger et al. 2011) finds exemption reduces access and raises rates, but does not exploit the interaction between collateral and exemption.&lt;/p&gt;
&lt;p&gt;Model setup: A competitive credit market with risk-neutral entrepreneurs heterogeneous in success probability (safe type-H with pH, risky type-L with pL, pH &amp;gt; pL) and in pledgeable wealth w over [w, w-bar]. Each needs one unit of credit; lenders face opportunity cost r and cannot observe type. Lending contracts are triples (cost of credit RB, collateral C, access probability pi). Exemption eta shields wealth up to eta from liquidation but not wealth posted as collateral; liquidated wealth is worth only lambda &amp;lt; 1 to lenders. Competition is modeled as a three-stage game (a la Hellwig 1997) so that a subgame-perfect equilibrium exists and delivers the contract most preferred by safe types. The setup extends Besanko and Thakor (1987) by allowing any exemption between zero and infinity, adding the third (acceptance) stage, and adding wealth heterogeneity.&lt;/p&gt;
&lt;p&gt;Main theoretical results: With zero exemption, pooling is the only equilibrium and no rationing occurs. With positive exemption, the equilibrium involves separation (at least for intermediate wealth): safe entrepreneurs self-select into contracts with effective collateral and face a lower cost of credit, while risky ones post no collateral. As in Besanko and Thakor, separation entails rationing for safe entrepreneurs too wealth-constrained to meet collateral requirements. The key novelty: conditional on posting collateral, as exemption rises, access to credit rises and the cost of credit falls—collateral becomes a more powerful screening tool. The overall effect of higher exemption on aggregate rationing is ambiguous, because more safe entrepreneurs choose to separate (lowering their access probability) even as each separating safe type is rationed less; the net effect depends on the wealth distribution.&lt;/p&gt;
&lt;p&gt;Data and empirical strategy: The 2003 wave of the Survey of Small Business Finances (SSBF), 4240 firms, restricted to 1761 creditworthy firms that were financed at least once (96% always financed). Cross-state exemption variation is collapsed to a high/low dummy across nine census divisions (West North Central and West South Central coded high). Firm type is identified by whether it posts collateral (posters = type-H). An endogenous switching / inverse Mills ratio approach (Maddala 1983) handles self-selection in the cost-of-credit equation; access to credit is estimated by probit with a collateral-by-exemption interaction.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: Descriptively, high-asset firms face loan rates 1.5 pp lower and rationing 3.8 pp lower. Collateral-posting firms pay 0.7 pp lower rates overall; this differential grows from 0.53% in low-exemption to 1.20% in high-exemption subsamples. The Mills-ratio coefficients are negative and significant, confirming collateral conveys private information. In the access regression, posting collateral is positively associated with rationing, but firms posting collateral are less likely to be rationed in high-exemption divisions (predicted access falls 0.6% on average from posting collateral, but rises 1.5% in high-exemption areas). Reduced-form OLS: collateral firms pay 0.30% less, with the discount rising 0.55% moving low-to-high exemption. The simultaneous structural system implies a 34-basis-point average reduction in cost of credit from guarantees, three times larger in high-exemption states (75 vs 17 bp). Heckman selection correction does not alter conclusions. All main model predictions cannot be rejected.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification rests on three pillars. (1) Firm type is identified by the collateral decision: the model implies only type-H (safe) firms post collateral, so posters are treated as type-H and non-posters as type-L. (2) Cross-sectional variation in asset exemption across census divisions (a high/low dummy, with West North Central and West South Central coded high) provides exogenous variation in the strength of collateral as a sorting device. (3) The cost-of-credit equation uses an endogenous switching model (Maddala 1983) identified by the non-linearity of the inverse Mills ratio, under the model-based assumption that observed loan rates are determined by the endogenous collateral decision. Threats: (a) Selection bias from restricting to creditworthy/financed firms—addressed with a Heckman selection model that leaves conclusions unchanged. (b) Coarse exemption measurement—location is only observed at the nine-census-division level rather than by state, and unlimited-exemption states must be aggregated, so the high/low dummy is a proxy; an alternative averaging procedure is reported to give the same results. (c) SSBF data are partly imputed; estimates use Rubin (1987) multiple-imputation combination rules (STATA mi estimate), which inflates variance and can reduce significance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central mechanism is the opportunity cost of posting collateral: in default a borrower who pledged assets loses them all, whereas without pledging she would keep the exempt part. This opportunity cost rises with exemption and is lower for safe borrowers (lower default probability), so collateral sorts types more sharply as exemption rises. Empirically this is distinguished through the collateral-by-exemption interaction: the cost-of-credit discount from posting collateral, and the access-to-credit advantage of posters, both should strengthen with exemption. The negative, significant inverse Mills ratio coefficients show the collateral choice reveals private information about type; the estimated lambda_1L,v being roughly double lambda_1H,v indicates safe firms choose contracts with lower cost-of-credit variance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By wealth: high-asset firms face rates 1.5 pp and rationing 3.8 pp lower. The collateral cost discount is concentrated among low-asset firms (0.9 pp) versus high-asset firms (0.04%). The collateral-rationing association also depends on wealth: among low-asset firms, rationing is 4.4% higher for collateral posters, but for high-asset firms there is no difference. By exemption: the collateral cost differential grows from 0.53% (low) to 1.20% (high). Among collateral posters, the rationed fraction falls 1.1% moving low-to-high exemption, with a larger drop for low-asset firms (-1.9%) than high-asset firms (-0.5%). In the structural cost-of-credit table, wealth reduces the cost of credit for non-posters only in high-exemption areas and for posters only outside high-exemption areas—consistent with firms undoing exemption via collateral.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three. (1) A reduced-form OLS loan-rate regression with collateral, exemption, and their interaction confirms posters pay less (about 0.30% on average) and the discount grows 0.55% moving to high exemption; signs match predictions (beta_3 &amp;lt; 0, beta_4 &amp;lt; 0, beta_2 &amp;gt; 0). (2) A simultaneous structural two-equation system jointly determining cost of credit and guarantees yields a 34-bp average reduction in cost from guarantees, three times larger in high-exemption states (75 vs 17 bp). (3) A Heckman-style selection model accounting for the application/creditworthiness/financing stages leaves all conclusions intact. The imputation-robust (mi estimate) procedure is also applied throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It confirms Gropp et al. (1997), Berkowitz and White (2004), and Berger et al. (2011) that higher exemption raises both rationing and the cost of credit. Its contribution is to use the theoretical model as an identification tool for the joint, interactive effect of exemption and the collateral decision—a prediction absent in prior empirical work. The collateral-as-quality-signal interpretation aligns with Jimenez et al. (2006) for Spanish firms and with Berger et al. (2011) on ex ante asymmetric information. Theoretically, it complements Manove et al. (2001) (too little exemption induces lazy bank screening) by showing that lower creditor protection via exemption gives lenders incentive to screen with collateral. It differs from Krasa et al. (2008) and Tamayo (2015), where creditor protection is an exogenous fraction of retained assets; here that fraction is endogenous because collateral can undo exemption. The model setup extends Besanko and Thakor (1987) with arbitrary exemption levels, a third acceptance stage (Hellwig 1997), and wealth heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Asset exemption levels materially affect credit-market functioning. Positive exemption lowers access and raises the cost of credit on average. But raising exemption enhances collateral&amp;rsquo;s power as a sorting device, so safe entrepreneurs who signal by posting collateral gain better access and larger rate discounts as exemption rises. The net effect of higher exemption on aggregate credit rationing is ambiguous and depends on how collateralizable wealth is distributed across entrepreneurs: more safe types separate (each facing a lower access probability) even as each separating safe type is rationed less. Scope conditions: results apply to individual entrepreneurs under Chapter 7 where exemption does not protect pledged collateral; the insurance/opportunity-cost channel requires exemption to be non-zero (at zero exemption only pooling, no rationing, and collateral conveys no signal); and the empirical magnitudes are estimated for small U.S. firms financed at least once in 2001-2003.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable caveats and data limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The dataset does not record the amount of collateral posted, only whether collateral was posted, so type is inferred from a binary decision. Firm location is observed only at the nine-census-division level, forcing a coarse high/low exemption dummy rather than state-level variation. The sample is restricted to firms financed at least once, raising selection concerns (addressed via Heckman). Much SSBF data are imputed. The model abstracts from positive, non-negligible transaction costs of posting collateral (only a negligible cost is assumed to select the unique separating equilibrium with CL = 0); incorporating such costs is left as an extension.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Insurance effect (of exemption and discharge)&lt;/strong&gt;: The protection an entrepreneur enjoys under Chapter 7 because most unsecured debt is discharged and only non-exempt assets are liquidated; in the paper this protection can be voluntarily undone by posting assets as collateral.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Opportunity cost of posting collateral&lt;/strong&gt;: The exempt wealth a borrower forgoes by pledging assets: in default a collateral-poster loses everything pledged, whereas a non-poster keeps the exempt part. This cost rises with the exemption level and is lower for safe (low-default-probability) entrepreneurs, making collateral an informative sorting device.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real guarantees (G)&lt;/strong&gt;: The effective amount of wealth a lender can actually recover in default, G = max(min(w_eta, RB/lambda), C): increasing in collateral C and decreasing in exemption eta. The model is stated in terms of guarantees rather than nominal collateral.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separating vs. pooling equilibrium&lt;/strong&gt;: Under positive exemption, safe entrepreneurs self-select into high-guarantee, lower-rate (possibly rationed) contracts while risky ones take no-collateral contracts (separation); under zero exemption all borrow under one contract with no rationing (pooling). The model selects the subgame-perfect outcome most preferred by safe types.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Type-H / type-L identification via collateral&lt;/strong&gt;: The empirical convention, derived from the model, that firms posting collateral are safe (type-H) and those not posting are risky (type-L), since in equilibrium only safe firms post collateral.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous switching / inverse Mills ratio approach&lt;/strong&gt;: The estimation method (Maddala 1983) that corrects for self-selection in the collateral decision; negative, significant Mills-ratio coefficients indicate collateral posting conveys private information lowering the cost of credit, identified by the Mills ratio&amp;rsquo;s non-linearity.&lt;/p&gt;</description></item><item><title>Does a Financial Crisis Impair Corporate Innovation?</title><link>https://macropaperwarehouse.com/papers/does-a-financial-crisis-impair-corporate-innovation/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/does-a-financial-crisis-impair-corporate-innovation/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: Why do financial crises leave such deep and protracted economic wounds, with crisis-stricken economies failing to revert to pre-crisis growth trends even a decade later? Imai and Sawada test one specific channel: that crisis-induced disruptions in financial intermediation impair firms&amp;rsquo; ability to fund innovation projects, stalling technological progress and thereby pushing the economy onto a permanently lower growth path. They study this in the context of Japan&amp;rsquo;s 1997-1998 financial crisis, which featured a sharp decline in bank credit, the collapse of three major banks (Hokkaido Takushoku Bank, Long-Term Credit Bank, Nippon Credit Bank), and a failure to recover the pre-crisis growth trend. Laeven and Valencia (2020) estimate the crisis&amp;rsquo;s fiscal cost to Japanese taxpayers at 8.5% of GDP and its economic cost (GDP deviation from trend, 1997-2001) at 45% of GDP.&lt;/p&gt;
&lt;p&gt;Data and strategy: The authors link three firm-level longitudinal datasets. Innovation output is measured from the Institute of Intellectual Property (IIP) Patent Database (Japan Patent Office data): patent applications, granted patents (only ~30% of Japanese applications are granted, taking 7-8 years), and citation-weighted patents using forward citations accumulated in a 17-year window after application. The core sample period is 1994-2003 (a 10-year window around the crisis), with forward citations tracked up to 2018; this long post-crisis window is a deliberate design choice that lets truncation-prone citation data mature. Bank dependence is proxied by the ratio of total loans to total assets (drawn from Nikkei Financial Quest financial statements). Bank-failure exposure is identified from the Corporate Borrowings Database: firms borrowing more than 10% of total bank loans from a failed bank in the year before its failure are coded as client firms. Patent applicants are matched to financial data via NISTEP company-name identification codes, covering roughly 75% of patents by NISTEP-ID firms and 58% of all applications.&lt;/p&gt;
&lt;p&gt;Two empirical designs: (1) A DiD interacting the loan-to-assets ratio with a Crisis dummy (=1 for 1997-2001), with firm, industry-year, and prefecture-year fixed effects, firm controls (log sales, log age, ROA, cash-to-assets, tangible-to-assets) lagged one year and also interacted with the crisis dummy. (2) A bank-failure DiD adding a Bank Failure dummy (=1 for HTB clients 1997-2001, LTCB/NCB clients 1998-2001).&lt;/p&gt;
&lt;p&gt;Main findings with magnitudes: Bank-dependent firms cut both the quantity and quality of innovation more sharply and persistently after the crisis; the loan-ratio-x-crisis interaction is negative and significant for applications, grants, and citations, and robust to the fully saturated fixed-effects model. In the event-study, high bank-dependence (top quartile) firms gained roughly 50% fewer patents over 1997-2003 relative to low-dependence firms (marginally significant), with no pre-trend in 1994-1995. The effect is concentrated in small and medium firms (insignificant for large firms). Decomposing loan maturity, the short-term-loans-x-crisis interaction is negative and robustly significant while the long-term-loans interaction is not, pointing to rollover risk as the main mechanism. For bank failures, the average effect across all firms is small and insignificant, but for small firms it is negative and significant: bank failures are associated with declines of about 12% in granted patents and 17% in cited-weighted patents; the dynamic counterfactual implies small firms whose main bank failed would have been granted about 50% more patents absent the failure, with effects peaking ~2 years after failure and recovering to pre-failure levels within about 4 years.&lt;/p&gt;
&lt;p&gt;Implications: Post-crisis innovation performance depends on the degree to which firms rely on monitored, difficult-to-replace relationship lending. The crisis-induced decline in innovation among opaque, bank-dependent firms is offered as a plausible explanation for Japan&amp;rsquo;s long-term post-1990s productivity and growth stagnation.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What are the two identification strategies, and what is the key identifying assumption?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, a difference-in-differences design interacting a continuous bank-dependence proxy (loan-to-assets ratio) with a Crisis dummy (=1 for 1997-2001), identifying off differential responses of more- vs. less-bank-dependent firms. Second, a bank-failure DiD interacting a Bank Failure dummy (for clients borrowing &amp;gt;10% of bank loans from HTB/LTCB/NCB before failure) with the crisis period. The key identifying assumption is parallel trends: clients of failed banks and clients of surviving banks would have followed the same innovation path absent the failures. The authors support this with event-study coefficients showing no significant pre-trends (1994-1995 for bank dependence; 3-4 and 2 years before failure for bank failures).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main threats to identification and how are they addressed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Bank-dependent firms might be concentrated in declining or cyclically sensitive industries or worse regions — addressed by adding industry-year and prefecture-year fixed effects, so estimates come from firms in the same industry and prefecture; results are insensitive. (2) The decline might reflect poor financial performance or other firm correlates — addressed by interacting the crisis dummy with firm-level controls (size, age, ROA, tangible-to-assets, cash-to-assets); results hold. (3) Exposure to the late-1990s East Asian crisis via exports — addressed by interacting an overseas-sales-to-total-sales ratio with the crisis dummy (losing over half the sample); results robust (Table A2). (4) &amp;lsquo;Cleansing&amp;rsquo;/zombie-lending selection (failed banks served unviable firms) — addressed by dropping non-innovative firms and restricting to manufacturing (least affected by zombie lending); effects persist. (5) Omitted-variable bias for bank failure — assessed via coefficient-stability arguments (Altonji et al. 2005, Oster 2019); estimates stable to inclusion/exclusion of controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the main mechanism and how is it distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The bank lending channel: crises raise the cost of intermediated funds, disproportionately hurting firms reliant on bank finance. The authors further pin down rollover risk by decomposing loans into short-term (residual maturity &amp;lt;=1 year) and long-term relative to assets and interacting each with the crisis. The short-term-loan interaction is negative and robustly significant; the long-term-loan interaction is negative but not robustly significant and becomes insignificant when both are included. This indicates the impairment operates mainly through firms&amp;rsquo; exposure to short-term rollover risk rather than long-term debt levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Effects are concentrated in small and medium-sized firms (terciles by 1996 sales). For large firms the bank-dependence-x-crisis interaction is insignificant. Bank-failure effects are insignificant on average but negative and significant for small firms (about -12% granted patents, -17% cited-weighted patents), and small/insignificant for medium and large firms. The interpretation is that smaller, opaque firms face more severe asymmetric-information problems and find it hardest to replace an informed relationship lender when their main bank fails.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Progressive fixed effects (firm+year; +industry-year; +prefecture-year); crisis-dummy interactions with firm controls; dropping non-innovative firms (never applied/granted patents); restricting to manufacturing (least zombie-affected); R&amp;amp;D-intensity-based industry exclusions; an alternative small-firm definition (first quartile vs first tercile — application results similar, citation results weaken since these firms&amp;rsquo; patents are rarely cited); using R&amp;amp;D expenditure (Toyo Keizai self-reported) as an alternative outcome (bank-dependent firms cut R&amp;amp;D more, Table A1); interacting overseas-sales ratio with crisis (Table A2); separating loans from other debts (loans interaction more robust than other-debt interaction, Table A3); and an industry-linear-trend specification (qualitatively unchanged, unreported).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Did the financial health of the main bank matter, beyond the binary failure event?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No robustly. Using percentage change in main banks&amp;rsquo; share prices from 1993-1998 (interacted with the crisis dummy) to proxy bank weakness, the authors find no robust evidence that clients of weaker-but-surviving banks innovated differently. They conclude differences in main-bank financial health are second-order relative to firm-level heterogeneity in bank dependence (Table A4).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on Japanese bank-health-to-real-activity studies (Peek and Rosengren, Gibson, Amiti-Weinstein, etc.) but tracks much longer-horizon, persistent effects on innovation rather than short-term investment/employment. Relative to Nanda and Nicholas (2014, Great Depression patenting), it uses linked bank-firm data with industry-year and region-year fixed effects to control for demand shocks, and argues 1990s Japan (scarcer breakthrough opportunities) may be more relevant to contemporary settings than the technologically fertile 1930s US. Unlike Hardy and Sever (2021), which uses only US-office patents granted to foreign firms (selection concerns) at industry level, this paper uses all domestically granted Japanese patents at the firm level. It follows Duval, Hong, and Timmer (2020) on balance-sheet heterogeneity and Huber (2018) on bank failures, but adds invention-quality measurement via long forward-citation windows that the 2008-crisis literature cannot yet exploit. It complements Hombert and Matray (2017) on relationship lending and small-firm innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the dynamics of the bank-failure effect on small firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the event study, pre-failure coefficients (3-4 and 2 years before) are small and insignificant. Post-failure coefficients are largely negative, with the largest, significant declines about 2 years after failure (consistent with lags in producing innovation). Innovation performance recovers to pre-failure levels within about 4 years, but cumulative losses are large — implying small firms would have received roughly 50% more patents absent the failure. Effects are qualitatively similar excluding non-innovative firms or non-manufacturing firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy/theoretical implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The adverse real effects of a systemic banking crisis can linger because opaque, bank-dependent firms&amp;rsquo; innovation declines persistently, plausibly contributing to Japan&amp;rsquo;s long-run post-crisis productivity and growth stagnation. Scope conditions: the effect is specific to small, opaque, bank-dependent firms reliant on relationship and especially short-term bank finance; it does not generalize to large firms; the mechanism is loss of monitored, difficult-to-replace relationship lending plus rollover risk, not generic financial weakness or main-bank fragility; and the setting (heavily bank-centered Japanese financial system, scarce breakthrough opportunities) shapes external validity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are notable caveats and data limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bank dependence is proxied by total loans (including loans from non-financial parents/affiliates) over assets rather than pure bank borrowings, because the cleaner Corporate Borrowings Database omits pre-1996 OTC firms; the authors verify total loans only slightly exceed bank borrowings and results hold on the cleaner sub-sample. Patent-financial matching covers ~58% of all applications. Cumulative bank-dependence effects (~50%) are only marginally significant. R&amp;amp;D-based outcomes are hampered by a 2000 Japanese accounting-standard change and inconsistent firm reporting. Citation data are truncated, motivating the long 17-year (and 15-year for 1994-2003) windows.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>Macro and micro of external finance premium and monetary policy transmission</title><link>https://macropaperwarehouse.com/papers/macro-and-micro-of-external-finance-premium-and-monetary-policy-transmission/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/macro-and-micro-of-external-finance-premium-and-monetary-policy-transmission/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper establishes basic facts about the external finance premium (EFP) faced by euro area firms borrowing from banks, and studies how monetary policy is transmitted to it. The EFP — the extra cost a firm pays for external funds versus the opportunity cost of holding cash — is a central object in financial-accelerator theory (Bernanke-Gertler, Kiyotaki-Moore), but its determinants below the country level have rarely been measured directly. The motivation is that euro area policy discussion treats country-level sovereign spreads as sufficient summary statistics for financial conditions, yet there is little micro evidence on whether country variation actually captures the bulk of loan-level variation.&lt;/p&gt;
&lt;p&gt;Data and strategy: The authors use AnaCredit, a loan-level database of all euro area firm loans of at least €25,000, restricted to all new, unsecured loans (so they are not directly affected by Covid government guarantees) in the ten largest euro area economies (Austria, Belgium, Germany, Spain, Finland, France, Ireland, Italy, Netherlands, Portugal), which cover 93% of both the number and value of new euro area loans and 95% of euro area GDP. The sample spans January 2019 to December 2023 and contains about 36 million loans (35,919,600 in the contract tables). Loans are matched to Orbis (firm controls), ECB IBSI and supervisory data (bank balance sheets and capital), CSDB (bank bond yields) and iMIR (aggregate loan rates). The EFP is the loan spread over a maturity-matched OIS rate. They sequentially decompose it via weighted least squares (loan-size weighted) into country-time, then bank-time, then firm-time fixed effects, with contract-level effects as a residual — so each fixed effect is a value-weighted index at that level. Sequence runs aggregate-to-granular so any covariance is attributed to higher aggregation levels, making covariate explanatory power a lower bound.&lt;/p&gt;
&lt;p&gt;Decomposition findings: Country-time effects capture 48.5% of the variance; bank-time 23.8%; firm-time 16.3% (bringing country+bank+firm to 88.6%); residual contract-level variation is 11.4%. Banking relationships are highly local — 96% of bank-firm pairs are in the same country (84% value-weighted). At the country level, the relevant covariate is the euro-area average sovereign spread, not the country-specific one: local spreads explain 48% of country-level variation while the EA average explains nearly 80%, and local spreads add no power beyond the EA average — pointing to a common (global) risk factor. The EFP is roughly 2.6 times larger than the sovereign spread. The EFP is countercyclical (higher with lower GDP and higher unemployment). Bank-level: weaker banks (less capitalized, less liquid, more exposed to risky assets, higher funding costs, larger) charge higher EFPs; the 95-5 quantile range of Tier 1 capital implies almost 100 bps higher EFP. Firm-level: smaller, younger, more leveraged, less profitable firms pay more — the 5-95 leverage range implies 90 bps higher EFP, the probability-of-default range about 20 bps, and old (50yr) vs young (5yr) about 30 bps. Crucially, bank-, firm- and contract-level variation remains largely unexplained (R-squared on bank regressions ~0.01-0.05; firm ~0.11-0.18; contract ~0.0001-0.0003).&lt;/p&gt;
&lt;p&gt;Monetary policy transmission: Using Jorda local projections on high-frequency identified ECB surprises (Altavilla et al. 2019: Target, Forward Guidance, QE factors from OIS changes around announcements), a null EFP response means exact pass-through. A one-SD Target surprise (8 bps) raises the EFP about 10 bps (peaking 3-5 months); a one-SD QE surprise (€500 bn) lowers the EFP about 20 bps, split roughly equally across bank and firm levels. Effects are asymmetric: policy-rate tightening (not easing) and QE (not QT) are amplified through the EFP. Tightening amplification is mostly at the bank level (bank lending channel, driven by weaker banks); QE additionally narrows the EFP at the firm level (firm balance-sheet channel, helping fragile firms). QT, while fully passed through to tighten lending, leaves the EFP unchanged — attributed to QT&amp;rsquo;s slower, more predictable, &amp;ldquo;loud-bang-less&amp;rdquo; implementation versus QE&amp;rsquo;s large-envelope announcements (a difference-in-difference on QE envelope months shows a significant EFP decline after envelope announcements). Implication: as the ECB shrinks its balance sheet (lowering liquidity), rate hikes become more likely to generate financial amplification via the EFP, since less-liquid banks respond more to rate hikes. The QT result is caveated by limited sample evidence.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the empirical strategy for decomposing the EFP, and why does the order of fixed-effect extraction matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The EFP (loan spread over maturity-matched OIS) is decomposed sequentially via weighted least squares (each observation weighted by loan size) into country-time, then bank-time, then firm-time fixed effects, with contract-level effects as the residual (Equations 1-3). Each fixed effect is effectively a value-weighted index of spreads at that level. The sequence MUST run from aggregate to granular: starting with loan-level effects would soak up all variance. Because aggregate effects are estimated first, any covariance (e.g., a particular firm type clustering at a particular bank, or a country with a strong/weak banking system) is attributed to the higher aggregation level. This means variance attributed to higher levels may be slightly overstated relative to joint estimation, but covariate explanatory power can be read as a lower bound. The authors avoid simultaneous estimation for two reasons: it is computationally infeasible to estimate ~10 million fixed effects jointly and retrieve their values (which are the dependent variables in the second stage), and the sequential method makes clear exactly where covariances land. A check absorbing firm/bank effects via differencing while explicitly estimating country-time effects yields a 98% correlation between sequential and jointly estimated country-time fixed effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the variance decomposition result, and what is its headline interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Country-time effects capture 48.5% of loan-level variance, bank-time 23.8%, firm-time 16.3% (country+bank+firm = 88.6%), and residual contract-level variation 11.4%. The headline: country-level variation — the usual focus of euro area policy — is the single largest component but only about half the story. Policymakers and researchers must look at more disaggregated (bank and firm) data to understand financial conditions. The &amp;lsquo;proverbial glass is half full.&amp;rsquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why is the euro-area average sovereign spread, not the country-specific spread, the relevant covariate at the country level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regressing country-time EFP fixed effects on sovereign spreads: country-specific spreads explain 48% of country-level variation, while the EA-average spread explains nearly 80%; adding local spreads on top of the EA average yields no additional explanatory power (the local-spread coefficient is insignificant). This is consistent with variance along the time (t) dimension being much larger than across countries (c), suggesting a common factor — likely global risk aversion — drives country-level EFP variation. The EFP is roughly 2.6 times larger than the sovereign spread (specification 2). Heterogeneity: aggregate (EA) spreads matter most for large firms (multi-country operators) and short-maturity loans; for small firms and long-maturity loans the country-specific spread becomes relevant (verified with a Patton-Timmermann monotonicity test).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What evidence supports the bank lending channel at the bank level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bank-time EFP is regressed on bank balance-sheet and funding-cost variables. Higher EFP is associated with weaker banks: less capitalized, more exposed to risky assets, less liquid, and with higher funding costs. The 95-5 quantile range of Tier 1 capital implies almost 100 bps higher EFP. These covariates (except the interbank rate, which is common across banks and captures time variation) are bank-specific, so they reflect the bank&amp;rsquo;s own balance sheet rather than its average borrower — the essence of the bank lending channel. Larger banks also charge higher rates, which the authors suggest may reflect market power. Caveat: R-squared values are very low (~0.01-0.05), so most bank-level loan-rate behavior remains unexplained.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What evidence supports the firm balance-sheet channel at the firm level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Firm-time EFP (net of country and bank effects) is regressed on firm fundamentals. Smaller, younger, more leveraged, and less profitable firms pay higher EFPs — a clear balance-sheet/financial-accelerator mechanism. Magnitudes from specification (4): the 5-95 leverage range implies 90 bps higher EFP; the probability-of-default distribution implies about 20 bps; old (50yr) versus young (5yr) firms differ by about 30 bps. This is notable because the sequential extraction attributes all bank-firm covariance to banks, yet firm-level drivers still appear. Caveats: covariates explain only about a fifth of firm-time variation, and part of the fit comes from including probability of default (itself a financial price).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is found at the contract level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;After controlling for country, bank, and firm effects, residual contract-level variation arises only for firms borrowing multiple times in the same month at different rates. Regressing on loan size and maturity, both are statistically significant but collectively explain a negligible share (R-squared ~0.0001-0.0003). The authors call this a &amp;rsquo;nothing to see here&amp;rsquo; result and conjecture that unobserved contract characteristics — likely loan covenants — drive it; because these would correlate with size and maturity, there is omitted-variable bias, so they do not interpret the coefficients. Notably these are unsecured loans, so covenants are not about explicit collateral.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy for monetary policy transmission, and what are its limits?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors estimate Jorda (2005) local projections of cumulative changes in the bank-time and firm-time EFP (h = 0..5 months) on high-frequency identified ECB monetary policy surprises from Altavilla, Brugnolini, Gurkaynak, Motto and Ragusa (2019) — rotated factors from OIS changes in a narrow window around announcements, interpretable as Target, Forward Guidance, and QE surprises (the QE sign is flipped so larger = larger easing). A null EFP response indicates exact pass-through of the policy rate to the loan rate, not ineffectiveness. Limits: at the country level, the analysis acknowledges it does not condition on exogenous variance, so causal claims at the country/macro covariate level are &amp;rsquo;not strongly grounded&amp;rsquo;; the paper frames the country-level work as comovement/fact-finding. The local-projection monetary-policy results are stated as causal. Forward-guidance surprises are too small in this sample (the ECB deliberately withheld guidance) to generate identifying variation, so FG results are relegated to the appendix.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main asymmetries in monetary policy transmission to the EFP?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two sign/instrument asymmetries: (1) Policy-rate tightening (but not easing) is amplified via the EFP, mostly at the bank level, driven by weaker (less capitalized, less liquid, higher-NPL) banks. The weaker amplification from rate cuts is linked to limited policy space near the effective lower bound, which binds for cuts but not hikes. (2) QE (but not QT) is amplified via the EFP, reducing it at both bank and firm levels, with the firm-level reduction indicating a firm balance-sheet channel that helps fragile firms. Magnitudes: a one-SD Target surprise (8 bps) raises EFP ~10 bps (peak 3-5 months); a one-SD QE surprise (€500 bn) lowers EFP ~20 bps, split roughly equally bank/firm. QT is fully passed through to tighten lending but leaves the EFP unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why does QT leave the EFP unchanged while QE moves it, and how is this tested?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors consider three channels: (i) QE&amp;rsquo;s signalling channel (signalling an accommodative stance near zero rates) has no QT equivalent; (ii) QE is announced in financial distress while QT occurs in calmer periods — but these concern &amp;lsquo;periods&amp;rsquo; not &amp;lsquo;surprises,&amp;rsquo; and in the event-study framework many QT surprises actually fall within the QE period as smaller-than-expected QE, so policy-cycle explanations don&amp;rsquo;t apply directly; (iii) the operationally relevant channel: QE arrives via large &amp;rsquo;envelope&amp;rsquo; announcements generating sizeable stock and flow effects (&amp;lsquo;a loud bang&amp;rsquo;), whereas QT is implemented slowly, predictably, and designed to be &amp;lsquo;as unsurprising and gentle as possible,&amp;rsquo; muting both effects. They test the third channel with a difference-in-difference comparing EFP changes around the five/six QE envelope announcement months (APP/PEPP announcements/recalibrations: September 2019, and March, April, June, December 2020) versus all other months. Both bank- and firm-level panels show no pre-trend divergence but a significant EFP decline after the envelope announcement, beyond the risk-free curve. Caveat: QT results rest on limited accumulated evidence and need reassessment; deviations from gradual balance-sheet normalization could have significant effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the bank/firm channel split corroborated via cross-sectional interactions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Equation (9) adds interactions of monetary policy surprises with bank/firm fragility characteristics (reporting h=3). Consistent with the bank lending channel, transmission of rate-tightening and QE-easing surprises is amplified for banks with weaker regulatory positions, less liquid assets, and higher funding costs. Consistent with the firm balance-sheet channel, the EFP is reduced more strongly for fragile firms (by size, age, leverage, profitability). Two implications: QE narrowed not just sovereign spreads but also the EFP on loans to more fragile firms; and because less-liquid banks respond more to rate hikes and QT lowers system liquidity, QT and rate hikes interact — as the ECB shrinks its balance sheet, rate increases are more likely to generate financial amplification via the EFP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run on the country-level results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three main checks (appendix): (A1) excluding 2020 (the Covid year) entirely leaves results unchanged, so country results are not Covid-driven; (A2) restricting to loans where bank country equals firm country strengthens the result, so the irrelevance of local spreads is not driven by bank-vs-firm country matching; (A3) a long macro sample built directly from aggregate iMIR data spanning April 2005 to December 2023 yields similar results, addressing the short-T concern and validating the bottom-up micro construction. Results are also robust to using 2-year or 10-year sovereign spreads, and main results hold under OLS rather than WLS (though equal-weighting overweights small loans — the smallest 90% of loans are just 1.3% of the market). Westerlund-style cointegration tests address potential non-stationarity/spurious regression.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds on the financial-accelerator literature (Bernanke-Gertler 1989; Kiyotaki-Moore 1997; Bernanke-Gertler-Gilchrist 1999) resting on a failure of Modigliani-Miller due to information asymmetries. Unlike the applied EFP literature that proxies the premium with bond spreads (Gilchrist-Zakrajsek 2012; Gilchrist-Mojon 2018) — relevant only to firms able to issue bonds, a significant limitation in the bank-intermediated euro area — this paper measures the EFP directly from bank loan rates. Unlike standard microdata work that saturates regressions with fixed effects (Khwaja-Mian 2008; Amiti-Weinstein 2018; Degryse et al. 2019) to separate supply from demand and then discards those fixed effects, this paper makes the fixed effects themselves the objects of study. On asymmetry, it adds to the literature on asymmetric monetary policy over the cycle (Keynes 1936; Cover 1992; Tenreyro-Thwaites 2016) and to the scant literature comparing instrument effectiveness during easing vs tightening (Wei 2022; Crawley et al. 2022), and complements Todorov (2020) showing QE shrinks risk premia for less creditworthy bond-market borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Country-level sovereign spreads are inadequate summary statistics for euro area financial conditions — they capture only half the EFP variance — so monitoring must extend to bank and firm levels. (2) QE is effective at the micro level, narrowing the EFP especially for fragile banks and firms; it is a &amp;lsquo;fine substitute&amp;rsquo; for interest-rate policy. (3) QT&amp;rsquo;s gentle, predictable implementation has so far avoided EFP amplification, but this is contingent on that specific implementation modality — a fast or surprising QT (a tightening-direction &amp;rsquo;envelope&amp;rsquo;) could have significant effects on firm and household lending conditions. (4) Interest-rate and balance-sheet policies are complementary: as the balance sheet shrinks and liquidity falls, rate hikes become more amplifying via the EFP. Scope conditions: country-level/macro comovements are not conditioned on exogenous variance so are not strong causal claims; sovereign spreads are asset prices, not fundamentals; QT conclusions rest on a limited sample and need reassessment; the policy result reflects the specific ECB communication and operational modalities observed in 2019-2023.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What significant caveats and unexplained findings does the paper itself flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is explicitly framed as a &amp;lsquo;fact-finding effort&amp;rsquo; rather than a complete causal narrative. Most bank-, firm-, and essentially all contract-level variation remains unexplained by an extensive list of covariates (low R-squared). The finding that larger banks charge more (market power) is presented as an interpretation worth studying, not established. Country-level comovements are not causal. The QT/EFP-unchanged result rests on limited evidence. Contract-level drivers (likely loan covenants) suffer omitted-variable bias and are left uninterpreted. The authors repeatedly invite future work on causal mechanisms and sub-country determinants.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>News-Driven Household Macroeconomic Expectations: Regional vs. National Telecast Information</title><link>https://macropaperwarehouse.com/papers/news-driven-household-macroeconomic-expectations-regional-vs.-national-telecast-information/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/news-driven-household-macroeconomic-expectations-regional-vs.-national-telecast-information/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The paper asks whether and which television news topics shape French households&amp;rsquo; one-year-ahead macroeconomic expectations (inflation, unemployment, economic situation), over and above information already in national statistics, and whether REGIONAL (not just national) news matters. This is important because media are the primary information intermediary between households and the economy, household expectations feed into consumption/spending decisions and thus monetary-policy transmission, and the literature had largely ignored that households&amp;rsquo; information sets may depend on local/regional economic conditions.&lt;/p&gt;
&lt;p&gt;Data and sample: Monthly data, January 2004 to December 2019. Household expectations come from INSEE&amp;rsquo;s monthly consumer-confidence survey (~2,000 households interviewed by phone each month, each interviewed three consecutive months). The author uses three qualitative questions (future prices, unemployment, economic situation) to build national and regional &amp;ldquo;balances of opinions,&amp;rdquo; plus a quantitative inflation-expectation question (answered on average by only 56% of monthly respondents, which prevents building regional quantitative series). News data come from the French National Audiovisual Institute archives of TF1 and France 2 (national, 8pm newscasts watched daily by roughly 20% of households) and France 3 (7pm regional newscasts). National and regional newscasts discuss roughly 24 and 11 stories per day, respectively. Human archivists assign standardized expert keywords/topics. The author constructs coverage indicators for 73 topics (12 aggregate + 61 socio-economic), selected if discussed in more than 75% of months. Two coverage measures are built: count-based (frequency of stories) and a novel time-based &amp;ldquo;viewer time exposure&amp;rdquo; (seconds spent on a topic). Metropolitan France is split into 13 administrative regions (Corsica/overseas excluded).&lt;/p&gt;
&lt;p&gt;Empirical strategy: Penalized predictive regressions (LASSO, Tibshirani 1996), following Larsen et al. (2021), with the rigorous data-driven plug-in penalty of Belloni et al. (2012, 2014) and post-LASSO OLS with Newey-West HAC standard errors. News variables are lagged one month (to avoid simultaneity/look-ahead); statistical controls lagged two months (except EPU index and diesel price, lagged one). National statistical controls include 10-year bond yield, CPI, exchange rate, unemployment rate, industrial production, EPU index, diesel price; milk and bread prices added for inflation regressions. Regional regressions are run separately per region adding national plus regional news and three regional controls (job seekers, dwelling permits, business failures). Household-level regressions use OLS (quantitative) and probit (binary) with demographic, year, and region effects.&lt;/p&gt;
&lt;p&gt;Main findings (with magnitudes): From 73 candidate topics, 14 are selected, with on average about four topics per regression in addition to statistical series, confirming news carries information not in national statistics. Average inflation expectations are significantly driven by news on energy and taxes; decomposing energy shows OIL news is consistently selected (gas to a lesser extent, not robust to statistics). Future-economic-situation expectations load on purchasing power, living cost, and economic plan; unemployment expectations load negatively on economic crisis and oppositely on economic life. Regional results: both regional AND national labor-market news predict the unemployment balance of opinions; regional lay-off and unemployment topics are consistently selected, and more regional unemployment coverage makes households more pessimistic about NATIONAL unemployment. At the household level, one additional energy story raises the probability of expecting price increases by 0.19% and one additional fiscal-policy story by 0.10%; one additional regional-unemployment story raises the probability of expecting more unemployment by 0.36% (0.33% in panel specification; energy 0.17% and fiscal policy 0.08% in panel). The unemployment balance-of-opinions dispersion across regions averages 24 percentage points. Independent/self-employed workers are most sensitive to regional unemployment news; the effect is weaker for young and below-first-quartile-income households. Implications: news topic fluctuations carry expectation-relevant information complementary to official statistics, regional news reveals a geographical dimension to household attention consistent with endogenous information acquisition / rational inattention, and this matters for using inflation expectations as a monetary-policy tool.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/empirical strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy is predictive: LASSO (with the Belloni et al. rigorous plug-in penalty) selects, from 73 candidate news topics plus statistical controls, those with predictive power for one-year-ahead expectations, followed by post-LASSO OLS with Newey-West HAC standard errors. The paper is explicit that it estimates a predictive relationship, not a structural causal effect. Threats addressed: simultaneity/look-ahead bias is handled by lagging news one month and statistics two months (one for diesel/EPU/milk/bread, which households observe in real time); overfitting and spurious selection are reduced by the data-driven penalty (more parsimonious than cross-validation, robust to heteroscedasticity). A residual threat is that news coverage and expectations could both respond to an unobserved underlying economic state; the author partially addresses this by showing news survives inclusion of official national and regional statistics and that &amp;lsquo;partial adjusted R2&amp;rsquo; attributable to news is non-zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core mechanism is endogenous/limited-capacity information acquisition: households cannot absorb all information and incorporate a subset heard from media intermediaries. Expectation-specificity is the key empirical discriminator: energy/oil and tax/fiscal-policy news affect ONLY inflation expectations; labor-market topics (lay-off, unemployment) affect MAINLY unemployment expectations; broad topics (economic crisis, living cost, economy) affect economic-situation and unemployment expectations. The regional dimension is distinguished by separating France 3 regional newscasts from TF1/France 2 national newscasts and running region-specific LASSO, showing regional labor-market news is selected even after controlling for national news and official regional indicators.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regional heterogeneity: balances of opinions and news topic coverage vary substantially across the 13 regions (e.g., unemployment balance-of-opinions min-max gap averages 24 pp; lay-off/unemployment air-time differs markedly by region). Sentiment heterogeneity: economic crisis carries negative sentiment, economic life positive, yielding opposite-signed coefficients. Household heterogeneity: by employment sector, independent/self-employed workers are MOST sensitive to regional unemployment news (vs public and private sector employees); the regional-unemployment-news effect is less significant for young households and not significant for those below the first income quartile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Count-based vs time-based (&amp;lsquo;viewer time exposure&amp;rsquo;) coverage measures give nearly identical selections and R2; time-based is somewhat more parsimonious and more significant for energy on inflation. (2) Outlier-robust inflation-expectation measures (5%, 10%, 15% trimmed means and the median) preserve the energy/tax/fiscal-policy results. (3) Including perceived inflation as a regressor: it is selected but insignificant and does not change energy/tax results; a separate analysis shows news matter for inflation EXPECTATIONS directly, not via perceptions (the selected topic sets are nearly mutually exclusive). (4) Household-level panel exploiting the up-to-three-month repeated interviews (household fixed-effects / random-effects probit) confirms results (energy 0.17%, fiscal policy 0.08% for prices; regional unemployment 0.33% for unemployment). (5) Energy decomposition by source confirms oil (and lesser gas) drives the energy effect. (6) Bootstrapped confidence intervals and demographic-stability checks address the concern that regional series differences are noise or demographic composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It builds directly on Larsen et al. (2021), adopting their topic-based LASSO approach, and on Carroll (2003), Doms and Morin (2004), Pfajfar and Santoro (2013), Lamla and Lein (2014), Draeger and Lamla (2017), Ehrmann et al. (2015) on media and expectations. Four novelties distinguish it: (1) it uses TELEVISION content rather than newspaper corpora (television being the main source of household economic information per Blinder-Krueger, Curtin); (2) it separates REGIONAL from national newscasts to identify regional drivers of expectation heterogeneity; (3) it uses HUMAN-EXPERT-assigned topics rather than algorithmic topic models (more accurate for short TV stories, allows distinguishing sub-topics like deficit, lay-off, tax); (4) it adds a time-based &amp;lsquo;viewer time exposure&amp;rsquo; coverage measure capturing duration, not just frequency. The regional finding extends Kuchler-Zafar (2019) and Malmendier-Nagel (2016) extrapolation results: households extrapolate not just personal experience but their region&amp;rsquo;s labor-market experience to national expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Understanding which news households incorporate is key for using inflation expectations as a monetary-policy tool; energy/oil and tax/fiscal news drive inflation expectations, so central-bank communication and expectation management must account for media salience of these topics. The regional finding implies a geographical dimension to household attention relevant for modeling information frictions (rational inattention, sparsity, sticky information with endogenous updating). Scope conditions: results are predictive (not causal), specific to France 2004-2019, rest on expert-assigned TV topics, and the regional analysis applies to qualitative balances of opinions only (the quantitative inflation question&amp;rsquo;s 56% response rate prevents regional quantitative series). Whether households OVERWEIGHT local labor markets is explicitly stated to be beyond the paper&amp;rsquo;s scope.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What other significant findings, extensions, or caveats appear?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Correlations between national and regional news indicators are limited, confirming regional news carries information absent from national news (only country-wide topics like tourism, tax, economic crisis, demonstration, and prices are highly correlated). Regional peaks reflect identifiable local events (the 2013 &amp;lsquo;Red Beanies&amp;rsquo; movement and 2016 agricultural crisis in Brittany). Past inflation and official statistics are heavily selected for inflation/price expectations (consistent with Larsen et al.); milk and bread price changes matter for quantitative inflation expectations but not the qualitative price balance, suggesting households extrapolate frequently-bought items for quantitative answers. Electricity is absent from selection despite a larger basket weight than gas, plausibly due to France&amp;rsquo;s regulated electricity prices. The author notes media exhibit a documented negative-news asymmetry (Soroka 2006), so sentiment-neutral topics tend to carry predominantly negative news.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Balance of opinions&lt;/strong&gt;: A monthly index computed as the difference between the share of households expecting one macroeconomic direction and the share expecting the opposite (e.g., for unemployment, share expecting an increase minus share expecting a decrease; for prices, share expecting an increase minus share expecting prices to stay the same, since households rarely expect deflation). Used as the qualitative expectation measure at national and regional levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Viewer time exposure&lt;/strong&gt;: The paper&amp;rsquo;s novel time-based coverage measure: the monthly number of seconds viewers are exposed to a given news topic, as opposed to the count-based measure (number of stories). It captures both frequency and duration, reflecting the importance given to a story and its effect on viewer recall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expert-assigned topics&lt;/strong&gt;: News topics assigned by trained archivists of the French National Audiovisual Institute using a standardized grid (relying on title, image, and sound), rather than algorithmic topic models. The author argues these are more accurate for short TV stories and allow distinguishing specialized sub-topics (deficit, lay-off, unemployment) that algorithms would pool.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous information acquisition&lt;/strong&gt;: Used in the paper&amp;rsquo;s own sense as the theoretical frame in which households with limited capacity to acquire/process information choose what to attend to based on expected benefits — invoked to explain why households incorporate regional labor-market news (believing they are more affected by local conditions). Linked to rational inattention, sparsity, and sticky-information models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rigorous (plug-in) LASSO penalty&lt;/strong&gt;: The data-driven penalty of Belloni et al. (2012, 2014) for choosing the LASSO regularization parameter, preferred over cross-validation because it yields a more parsimonious variable selection, lowers overfitting, and is robust to heteroscedasticity; followed by post-LASSO OLS with Newey-West HAC standard errors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographical dimension of attention&lt;/strong&gt;: The paper&amp;rsquo;s term for its central regional finding: households&amp;rsquo; information collection and attention have a spatial structure, whereby they incorporate regional news (especially on local lay-offs and unemployment) into their NATIONAL expectations, producing geographical heterogeneity in aggregate beliefs.&lt;/p&gt;</description></item><item><title>The Credit Channel of Public Procurement</title><link>https://macropaperwarehouse.com/papers/the-credit-channel-of-public-procurement/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-credit-channel-of-public-procurement/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question and motivation.&lt;/strong&gt; Public procurement accounts for roughly one-third of government spending (12.6% of GDP and 30% of total government expenditures in OECD countries in 2019). The standard view is that procurement helps firms grow by raising their &lt;em&gt;revenues&lt;/em&gt;. Gabriel asks whether procurement also operates through a previously underexplored &lt;em&gt;credit&lt;/em&gt; channel: if a procurement contract is a secure future cash-flow stream, firms can pledge it as collateral to obtain more credit. This matters especially in bank-dependent economies (in Portugal and several OECD countries, &amp;gt;80% of nonfinancial corporate debt is bank loans; &amp;lt;1% of Portuguese firms access capital markets), and for small/financially constrained firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and strategy.&lt;/strong&gt; The author web-scrapes &amp;gt;1 million Portuguese electronic procurement contracts (2009-2019) from the official BASE registry, matching winners&amp;rsquo; tax IDs to firm balance-sheet/income data (IES via BPLIM) and to the monthly Credit Registry (CRC) with loan-level collateral types. Focusing on contracts awarded via public contests (a silent sealed-bid first-price-auction-like setting) for quasi-exogenous variation yields 138,561 contract-winner pairings and 35,675 unique winner-year observations. Average contract award is ~€202,170 (median ~€33,762-34,762), average duration ~297 days, ~3.6 contestants. Identification uses Jordà (2005) local projections (Eq. 1) regressing credit growth (scaled by lagged assets) on the award amount (scaled by lagged assets), with firm and industry×year fixed effects, SEs clustered at the firm level. The identifying assumption is that winning via public contest is not systematically correlated with firm characteristics; conditional on fixed effects, winner/non-winner differences largely disappear (except total assets, which is controlled).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings (with magnitudes).&lt;/strong&gt; Winning an additional €1 of procurement raises total firm credit by up to €0.07 (3.3 cents drawn credit on impact, plus ~4 cents in potential/undrawn credit lines; total ~7 cents in the award year), and raises cash and bank deposits by ~6 cents. Interest rates fall by over 0.3 percentage points on impact, indicating the increase is supply-driven (winners&amp;rsquo; average implicit rate ~6.9%, median ~5.1%). A back-of-envelope calculation gives ~2.5 pp credit growth one year out (vs. ~5 pp in Spain per di Giovanni et al. 2024). The credit increase is almost entirely collateralized; in monthly data, firm personal guarantees (which include future procurement cash flows) account for &amp;gt;66% of the credit increase at month 4, and adding state guarantees, cash-flow-based lending explains ~75%. On the real side: +6 cents of non-current assets/investment (mostly PPE) per euro, persistent employment gains, ~70% rise in sales income one year post-award, positive net income of ~5 cents per euro. cash-flow-based lending is ~44% of firm credit in the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity and aggregate.&lt;/strong&gt; Investment responses are concentrated in small/constrained firms (β ≈ €7.3 for small/micro vs. −€1.2 for big firms 2 years out; difference significant at 1%); credit responses do not differ significantly by size. Regionally (Eq. 2, NUTS-III, region+year FE, clustered at region), €1 of procurement raises regional GVA by ~€1.3 (€1.32 on impact), implying ~€0.32 crowding-in of private production; the credit channel accounts for ~5% (5.5%) of this. Procurement boosts private R&amp;amp;D but not TFP, with only modest, short-lived inflation and no broad regional credit expansion (suggesting credit redistribution toward winners).&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification strategy and what are the main threats to it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author exploits public contests, which resemble a silent sealed-bid first-price auction with a costly single bid: the hiring entity does not know who bids and firms do not know their competitors or how many there are, so the winner is not ex-ante predictable. He estimates Jordà (2005) local projections (Eq. 1) of credit growth on the award amount, both scaled by lagged total assets, with firm and industry×year fixed effects and firm-clustered SEs. The key identifying assumption is that winning via public contest is not systematically correlated with other firm characteristics. Threats: (i) selection if contracts go to more productive firms (would overstate effects) or displace private opportunities (would understate); (ii) anticipation, if firms foresee winning and adjust early. He addresses anticipation by including pre-event horizons h=-2, h=-3 (annual) and pre-months (monthly), finding no significant pre-trends, and by focusing on contests (where outcomes are unknown, unlike direct awards) and using yearly aggregation (the announce-to-decision gap was ~4 months in 2020). Figure C.1 shows unconditional winner/non-winner differences mostly vanish once fixed effects are included, except total assets (which is controlled). Appendix C.1 adds a local-projections difference-in-differences robustness check following Dube et al. (2023).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the credit channel mechanism and how is it distinguished from a demand story?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The mechanism is cash-flow-based lending: procurement contracts represent secure future cash flows that firms pledge as collateral (personal/firm guarantees), easing borrowing constraints. It is distinguished from a credit-demand story by the price of credit: a demand-driven increase would raise interest rates, but rates fall by &amp;gt;0.3 pp on impact, consistent with a supply-driven expansion. Two micro-mechanisms raise perceived creditworthiness: (i) collateral value of the contract itself, and (ii) a signaling/certification effect where government endorsement reduces bank information asymmetry. Monthly collateral decomposition (Figure 5) shows the credit increase is overwhelmingly backed by firm personal guarantees (&amp;gt;66% at month 4; ~75% including state guarantees), with asset-based collateral mostly insignificant, directly supporting the cash-flow collateral channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is the signaling/certification mechanism tested separately?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In Appendix Table C.3 (discussed in Section 3.5) the author compares first-time award recipients to firms with previous awards. First-time winners enjoy significantly higher and more persistent responses in credit, employment, and investment, which he interprets as a reputation/certification effect that partially resolves a banking information-asymmetry problem (banks learn the firm has government demand). This is distinct from the pure collateral mechanism, which is tested with the monthly collateral-type decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By firm size (Commission Recommendation 2003/361/CE: small = headcount &amp;lt;50 and turnover/balance-sheet &amp;lt;€10m): credit responses do not differ significantly between small and big firms, but investment and employment responses are much larger and more persistent for small/constrained firms (investment β ≈ €7.3 small vs. −€1.2 big at 2 years, difference significant at 1% and growing with horizon; HAC p-values for employment differences are 0.05 at 1yr and 0.00 at 2yr). This is rationalized via the financial-accelerator hypothesis (Bernanke et al. 1999) and investment-cash-flow sensitivity literature (Fazzari et al. 1988). Employment heterogeneity mirrors Giroud and Mueller (2017). By sector: Construction and Medical Equipment (~60% of 2019 procurement value) account for much of the credit response but show no significant persistent differences in investment/employment. By award history: first-time winners respond more strongly (reputation effect).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the monthly analysis add over the annual analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using monthly credit/collateral data within the first year (relevant since the median contract lasts &amp;lt;1 year), the credit increase begins at award inception, rises sharply in the first month, and peaks ~3 months after the award (aligning with the annual ~3+ cents/euro). The increase is almost entirely collateralized (unsecured credit shows a muted response) and of sound quality (non-performing credit barely moves). Both long- and short-maturity credit rise, with long-term credit responding more strongly. Crucially, no significant credit movement appears up to three months before signing, reinforcing the no-anticipation conclusion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the aggregate/regional results and how are they estimated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author aggregates procurement by spending location to NUTS-III regions and estimates local-projection multipliers (Eq. 2) with region and year fixed effects, SEs clustered at region, sample matched 2010-2016 (25 regions × 6 years), procurement winsorized at the 95th percentile. A €1 increase in regional procurement raises GVA by ~€1.3 (€1.32 on impact, interpreted as an open-economy relative multiplier à la Nakamura-Steinsson 2014), implying €0.32 crowding-in of private production. Eq. 3 interacts procurement with winners&amp;rsquo; credit (following Basso and Rachedi 2021): the positive significant interaction means credit amplifies the multiplier; a 1% credit-to-GVA increase raises the multiplier by 11% on impact, and since winners&amp;rsquo; credit is ~0.5% of GVA, the credit channel adds ~(0.11×0.5)% ≈ 5.5% (&lt;del&gt;5%). National-accounts regressions (Table 4) show procurement raises private value added (&lt;/del&gt;€1.2 on impact), private investment, private R&amp;amp;D (innovation), and modest short-lived inflation, but not TFP; aggregate nonfinancial-firm credit is subdued, suggesting credit redistribution toward winners rather than broad expansion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks and caveats are noted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Robustness: anticipation tests at multiple pre-horizons (annual and monthly); a local-projections diff-in-diff specification (Dube et al. 2023) in Appendix C.1; fixed-effects conditioning that removes most winner/non-winner differences; winsorizing the regional regressor at the 95th percentile (results sensitive to outliers). Caveats explicitly acknowledged: (i) no loan-level data, so the implicit interest rate is total interest expense / lagged effective credit, and financial covenants cannot be observed (if present, estimates would be conservative); (ii) under Portugal&amp;rsquo;s Public Procurement Code (Ch. IX), contracts above ~€500k may require a guarantee up to 5% of value, often a bank guarantee that appears as firm-guaranteed credit—but the central message still holds; (iii) procurement coverage is incomplete (web-scraped data ≈ one-third of total procurement, ~3% of GDP), so regional coefficients should be read with caution; (iv) the regional credit measure may not capture the full cumulative credit response and credit increases could partly reflect non-procurement factors; (v) collateral values are not market-adjusted and are often capped at the loan amount.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It contributes to three literatures. (1) Firm-level effects of fiscal policy/procurement (Barrot-Nanda 2020; Goldman 2020; Cox et al. 2024; Ferraz et al. 2021; Lee 2021): prior work emphasizes revenues as the driver; Gabriel adds a new credit/collateral transmission mechanism across all industries. The closest contemporaneous work is di Giovanni et al. (2024) for Spain, who document a positive procurement-credit correlation; relative to them, this paper provides detailed evidence on the credit-supply channel and its investment implications, measures contract heterogeneity, and—unlike their welfare/allocation-system focus—provides the first local procurement multiplier estimates with the credit channel&amp;rsquo;s share. (2) Government spending and fiscal multipliers, including stronger fiscal effects under tight credit (Ferraresi et al. 2015; Aghion et al. 2014). (3) Financial frictions and collateral type, shifting from asset/liquidation-value collateral (Kiyotaki-Moore 1997) to cash-flow-based collateral (Lian-Ma 2021; Ivashina et al. 2022; Drechsel 2022; Caglio et al. 2022); the novelty is cash flows from sales to the government as collateral. Notably his investment elasticity for small firms (~5 cents/euro cumulative at one year) is smaller than Hebous and Zimmermann&amp;rsquo;s (2021) ~13 cents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the policy implications and their scope conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two implications: (1) Targeting design—because small/financially constrained firms respond more strongly and persistently in investment and employment, targeting procurement to such firms (as pushed by the European Commission/Parliament for SMEs) likely raises aggregate investment and employment, not just efficiency. (2) Financial stability—letting firms pledge procurement contracts as collateral diversifies collateral away from real-estate/asset-based booms (which deplete project information and lead to deep downturns, Asriyan et al. 2022), so procurement could temper collateral-induced financial fluctuations. Scope conditions: external validity is greatest for countries where procurement is a large GDP share and firms rely heavily on bank credit (true for many developed and developing economies, e.g., Portugal where &amp;lt;1% of firms access capital markets); the effect grows more important the more bank-dependent firms are. The interest-rate decline is a firm-level result and should not be read as procurement lowering equilibrium interest rates economy-wide; a procurement shock can be a reallocation of spending rather than higher total spending/deficit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What is the nature of the real-side response and why is the sales response not larger?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Winning raises non-current assets by ~6 cents per euro (mostly PPE/tangible, not intangibles or financial investments), comparable to Hebous-Zimmermann&amp;rsquo;s ~10 cents and to real-estate-collateral elasticities (~6 cents, Chaney et al. 2012; Catherine et al. 2022). Employment rises persistently beyond the first year (Ferraz et al. 2021), though without a matching rise in value added. Sales income rises ~70% one year post-award—less than a one-for-one mapping of public demand to sales—for two reasons: a &amp;lsquo;duration effect&amp;rsquo; (contracts spread revenue over years; some last up to a decade) and a &amp;lsquo;capacity constraint effect&amp;rsquo; (firms prioritize government contracts, diverting other business to competitors, which also shows up in regional GVA), potentially mitigated by sub-contracting. Despite higher costs of goods sold, net income stays positive at ~5 cents per euro, so contracts are profitable.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>The Macroeconomic Effects of a European Deposit (Re-)Insurance Scheme</title><link>https://macropaperwarehouse.com/papers/the-macroeconomic-effects-of-a-european-deposit-re-insurance-scheme/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomic-effects-of-a-european-deposit-re-insurance-scheme/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;Research question and motivation: The first two pillars of the European Banking Union (single supervision and single resolution) are in place, but the third pillar — a European deposit insurance scheme (EDIS) — is still missing. Recent policy proposals favor a reinsurance design, where European deposit insurance steps in only after national deposit insurance (DI) funds are depleted. The paper asks how well such a deposit reinsurance scheme absorbs macroeconomic and financial shocks relative to alternatives, and quantifies its stabilization, welfare, and moral-hazard implications.&lt;/p&gt;
&lt;p&gt;Model and method: The authors build a two-country regime-switching open-economy DSGE model with bank default, calibrated to Germany (home) and the euro area excluding Germany (foreign). Banks face idiosyncratic log-normal asset-return shocks and limited liability, so they can default and leave depositors (facing state-verification/monitoring costs) with losses. National DI funds collect risk-weighted contributions from banks and compensate insured depositors; when a fund is exhausted (DI_t &amp;lt;= 0), the share of insured deposits drops to zero and the economy enters a &amp;ldquo;constrained&amp;rdquo; regime. Four regimes capture whether home and/or foreign national DI is unconstrained or constrained, with Markov-switching transition probabilities (sigmoid functions). Two bank-government linkages are modeled: banks finance sovereign debt, and the fiscal authority provides tax/debt-financed guarantees on bank insolvencies. Three reinsurance arrangements are compared once national DI is exhausted: (A) no backstop, (B) national fiscal backstop, (C) EDIS. Most series are calibrated for 1999:Q1-2019:Q4 using ECB/Eurostat/OECD, Bundesbank, IMF, and micro data (Bloomberg, Eikon, Datastream). Key preset parameters: capital share 0.3, household habit 0.8, trade elasticity 1.5, home bias in traded goods 0.6, Basel III steady-state bank capital requirement 10.5 percent, LTV ratio 0.35, bank monitoring costs 0.3, DI and EDIS contribution sensitivity 0.45. Twelve remaining parameters are set by first-moment matching (total distance 2.836). The EDIS fund target is 0.8 percent of insured deposits; the simulated bank risk shock doubles the standard deviation of idiosyncratic bank asset returns to deplete national DI.&lt;/p&gt;
&lt;p&gt;Main quantitative findings: In response to an adverse home bank risk shock that depletes national DI (regime switch in period three), EDIS stabilizes the affected economy better than the fiscal or no backstop. Peak-to-trough GDP declines 0.3-0.4 percent across scenarios (deepest under no-backstop). Home output decline is about 10-20 percent smaller with EDIS; home consumption falls about 0.4 percent peak-to-trough with EDIS; investment declines are 30-40 percent smaller and bank loans 30-50 percent smaller with EDIS versus the other scenarios. The abstract/intro summarize the investment/consumption/loan gains as roughly 20-35 percent lower in the trough. The debt-to-GDP ratio rises markedly under the fiscal backstop but stays broadly stable under EDIS, since costs are covered by bank contributions rather than public debt. Costs of EDIS: banks contribute to both national DI and EDIS, raising the total burden and making national-fund recovery slowest under EDIS; foreign banks must contribute more, reducing margins and foreign lending. In a robustness analysis taking IRF differences one year after the shock, the baseline EDIS effect on home GDP is +0.1 ppt (range 0.05 to above 0.3 ppt across parameters) and on foreign GDP +0.06 ppt (range 0.02-0.2 ppt). Welfare (consumption equivalents, 100 x lambda_w, vs fiscal backstop baseline): differences are small but EDIS benefits savers in constrained economies, with the largest union-wide gains when both economies are constrained (regime 4). Risk-weighting contributions by country-specific default costs (baseline home share ~32 percent, foreign ~68 percent) renders EDIS risk-neutral in the long run so it does not foster additional moral hazard; only non-risk-weighted contributions induce structurally higher risk-taking that macroprudential policy can correct. The link between steady-state capital requirements and activity is hump-shaped with an optimum at 12 percent; the best stabilization comes when both EDIS and macroprudential policy are active and capital requirements are at 10.5 percent. A novel bank-run extension (state-dependent monitoring costs of 0.3 vs 0.6, plus a sunspot shock) shows runs deepen the output trough by about 40 percent relative to the no-run case, and that EDIS can prevent a self-fulfilling run by stopping the economy from entering the &amp;ldquo;in-between&amp;rdquo; region.&lt;/p&gt;
&lt;p&gt;Implications: A European deposit reinsurance scheme can deliver union-wide welfare gains and macro-financial stabilization, but regulators must design contribution and deductibility rules to avoid overburdening banks and constraining credit, ensure EDIS can pay out instantaneously once introduced, and recognize that costs and benefits are unequally distributed across countries, savers, and borrowers.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the modeling/identification strategy and what are its main limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strategy is a calibrated two-country regime-switching DSGE model (solved with the RISE toolbox), not an empirical causal-identification design. Identification of mechanisms comes from comparing counterfactual policy scenarios (no backstop, national fiscal backstop, EDIS) under the same bank risk shock. The authors themselves flag that the analysis is counterfactual: the euro area has not actually experienced explicitly exhausted national DI funds (the closest episode being October 2008 government deposit pledges). The main limitations are parameter uncertainty (the model is calibrated, not fully estimated) and the fact that the home/foreign calibration to Germany and the rest of the euro area does not imply general validity for other member states, motivating the robustness analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the four regimes and how does regime switching work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regimes are defined by whether each country&amp;rsquo;s national DI is unconstrained (fund positive, insured share = kappa-bar) or constrained (fund &amp;lt;= 0, insured share = 0): Regime 1 both unconstrained; Regime 2 home constrained; Regime 3 foreign constrained; Regime 4 both constrained. Transition probabilities follow sigmoid (Markov-switching) functions: the probability of entering the constrained regime is one when the fund level hits zero (scaling alpha2 = 200), and the probability of switching back becomes one when bank default rates drop below a financial-stress threshold (scaling alpha1 = 300).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main mechanisms distinguishing EDIS from the fiscal backstop?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under the fiscal backstop, depositor losses enter the national government budget constraint, raising the debt-to-GDP ratio and affecting taxes/expenditure. Under EDIS, losses are covered by internationally shared, risk-weighted bank contributions, so public debt stays broadly stable. The trade-off: EDIS imposes a higher total burden on banks (they fund both national DI and EDIS), slows national-fund recovery the most (because EDIS contributions are deductible from national payments, stretching the refilling of two funds), and transmits the contribution burden to foreign banks, reducing their margins and lending. For the foreign economy, EDIS has an expansionary trade/financial channel that dominates in the first ~5-6 quarters and a contractionary higher-contribution channel that dominates in the medium-to-long run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented across the two countries?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Germany (home) has a higher home bias in bank equity (~80 percent) attributed to Landesbanken, savings and cooperative banks, and lower bank default risk (lower sigma of idiosyncratic asset-return shocks). The rest of the euro area (foreign) is the riskier banking sector with a higher default-shock standard deviation, so under risk-weighted contributions it bears the larger EDIS share (~68 percent vs ~32 percent home). Welfare effects differ: EDIS raises entrepreneurial welfare in the riskier foreign country but lowers it in the safer home country; savers in constrained economies gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What robustness checks are run and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors re-simulate the same home bank risk shock over minimum/maximum plausible ranges for calibrated and matched parameters, taking IRF differences one year out. The positive EDIS effect on home GDP is robust across all ranges where national DI depletes (0.05 to above 0.3 ppt; baseline 0.1 ppt); the foreign GDP effect ranges 0.02-0.2 ppt (baseline 0.06 ppt). Influential parameters include the goods home-bias/openness (more open economies gain less from EDIS), the LTV ratio, bank monitoring costs, and the idiosyncratic asset-return shock standard deviation (larger sigma means a more severe crisis and larger EDIS benefit). Higher fund target rates or insured-deposit shares can prevent depletion, in which case EDIS does not intervene and its effect is zero. Higher household-to-banker transfers and banker survival rates raise net worth, lower default risk, and shrink the EDIS effect. A sensitivity analysis on monitoring costs affects only quantitative, not qualitative, conclusions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How is welfare measured, and what does the contribution-weight analysis find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Welfare is computed in the stochastic steady state (Coeurdacier et al., 2011) using a second-order approximation, expressed in consumption equivalents (lambda_w), aggregating borrowers and savers with Pareto weights (welfare weight zeta = 1). Conditional welfare is reported by regime relative to a fiscal-backstop baseline; EDIS gains are largest in regime 4 (both constrained), and deductibility (EDIS 1) is welfare-improving especially in the affected country versus no deductibility (EDIS 2). Varying the contribution split via alpha_RW shows low alpha_RW (contributions falling on the riskier foreign banks) is welfare-optimal union-wide (&amp;rsquo;excessive risk-sharing&amp;rsquo;), but deviations toward a more moderate split impose negligible welfare cost. Higher contributions in a country raise intermediation costs, cut loans and deposits, and lower borrower welfare there.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the paper conclude about EDIS and moral hazard?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because individual bank contributions are weighted by aggregate observable default risk, the steady-state default threshold is unaffected by deposit-insurance coverage, so under risk-weighted contributions EDIS does not induce additional moral hazard in the long run (defaults, firm loans, and corporate borrowing rates are unchanged by higher insurance shares in steady state). Moral hazard arises only if contributions are not risk-weighted or if long-run insurance payments do not match contributions, in which case low capital regulation fosters extra risk-taking and long-run macroprudential policy can correct it. Cyclically, EDIS can still temporarily foster risk-taking because insurance payouts are large during a crisis while contributions accrue with a lag, enlarging the complementary role for macroprudential policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does the bank-run extension work and what is the key result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The RS-FF (regime-switching financial friction) model makes monitoring costs state-dependent (0.3 in low distress, 0.6 in high distress, with the high-distress threshold set at a 2.5 percent quarterly default rate, following Linde et al. 2016). A sunspot shock can trigger a partial run in an &amp;lsquo;in-between&amp;rsquo; state where depositors wrongly believe they are in high distress; non-fundamental beliefs raise the default threshold above its fundamental level (omega* &amp;gt; omega), some sound banks face liquidity problems and default, making beliefs self-fulfilling. A run amplifies the recession: in the no-backstop run scenario the output trough is about 40 percent lower than the no-run case (default costs roughly double, deposits about one ppt lower), a relative magnitude (ratio ~2.7) close to Gertler et al. (2020). Crucially, EDIS, by compensating depositor losses, keeps the economy out of the &amp;lsquo;in-between&amp;rsquo; region and can prevent the self-fulfilling run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper differ from closely related prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It extends Mendicino et al. (2018) — a closed-economy model with bank default, deposit insurance, and optimal capital regulation — to an open two-country setting with a detailed government sector and a bank-financed deposit fund (rather than direct household transfers). Unlike Dedola et al. (2013), where financial-friction degrees are equal across countries, it allows heterogeneous bank riskiness. Unlike representative-global-bank models (Mendoza-Quadrini 2010; Kollmann et al. 2011; Kollmann 2013), it allows heterogeneous national banking sectors. Unlike Dubois (2021), which has a linear two-country bank-run model, its regime-switching nonlinearity permits an explicit reinsurance/backstop comparison. Relative to Amador and Bianchi (2022) (partial runs, U.S., no deposit insurance), it adds deposit insurance and EDIS risk-sharing and models runs as a combination of financial-regime switches and sunspot shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the short-term implementation costs of EDIS and how can they be mitigated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Filling the EDIS fund requires up-front bank contributions over about 3.5 years in the baseline. With deductibility, payments into national DI fall, temporarily lowering national coverage; households then demand higher deposit risk premia, reducing intermediation and activity. Removing deductibility keeps national coverage on target but the double burden lowers bank margins, lending, and raises defaults, though stress is shorter-lived. Extending the implementation horizon (e.g., to 7.5 years) lowers per-period contributions and mitigates peak default rates, but leaves coverage lower for longer, protracting the downturn. Policy options include ensuring EDIS pays out instantaneously once introduced and temporarily suspending contributions during acute distress.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;EDIS reinsurance scheme&lt;/strong&gt;: In this paper, a European deposit insurance arrangement that acts as a second line of defense, paying out only once a country&amp;rsquo;s national deposit insurance fund is exhausted (the constrained regime), financed by risk-weighted bank contributions deductible from national DI payments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Constrained vs unconstrained regime&lt;/strong&gt;: States distinguished by whether a national DI fund is positive (unconstrained, insured deposit share = kappa-bar) or depleted (constrained, insured share = 0); the model has four such regimes across home and foreign and switches between them via Markov sigmoid transition probabilities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-weighted contributions (&amp;lsquo;polluter-pays&amp;rsquo;)&lt;/strong&gt;: EDIS contributions allocated across countries in proportion to country-specific expected bank-default costs, so the riskier banking sector pays more; this design renders EDIS risk-neutral in the long run and prevents additional steady-state moral hazard.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deductibility of contributions&lt;/strong&gt;: The assumption that banks can subtract their EDIS payments from contributions to national DI funds, keeping total bank contributions from exceeding the no-EDIS level but slowing the refilling of both funds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank default threshold (omega)&lt;/strong&gt;: The realization of a bank&amp;rsquo;s idiosyncratic asset-return shock below which the bank defaults on depositors; its steady-state value is shown to be independent of deposit-insurance coverage, which is the analytical basis for the no-long-run-moral-hazard result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;In-between state / sunspot-driven partial bank run&lt;/strong&gt;: A region where a bank risk shock is large enough to bring the economy near the high-distress (high monitoring cost) state but not into it; a sunspot shock then makes depositors wrongly believe in high distress, raising the non-fundamental default threshold (omega* &amp;gt; omega) and triggering a self-fulfilling partial run that EDIS can prevent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hump-shaped capital-requirement effect&lt;/strong&gt;: The relationship between steady-state bank capital requirements and long-run output/intermediation/welfare, peaking at an optimum of 12 percent: below it, higher default costs dominate; above it, the equity-crowding-out of lending dominates.&lt;/p&gt;</description></item><item><title>Who bears the costs of inflation? Euro area households and the 2021-2023 shock</title><link>https://macropaperwarehouse.com/papers/who-bears-the-costs-of-inflation-euro-area-households-and-the-2021-2023-shock/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/who-bears-the-costs-of-inflation-euro-area-households-and-the-2021-2023-shock/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;This paper measures the heterogeneous first-order welfare effects of the 2021-2023 inflation surge across households in the four largest euro area countries (Germany, France, Italy, Spain). Motivation: euro area headline HICP inflation peaked at 10.6% (year-on-year) in October 2022, driven mainly by energy and food prices following Russia&amp;rsquo;s invasion of Ukraine; cumulatively over 2021-23 the price index rose roughly 14% in France and Spain, 16% in Italy and 20% in Germany. The classic question—who wins and who loses from surprise inflation, and through which channels—is the focus.&lt;/p&gt;
&lt;p&gt;Method: The authors build a tractable two-period overlapping-generations framework and use the envelope theorem to decompose the &amp;ldquo;money-metric&amp;rdquo; welfare change (in euros) into four additive, observable components requiring no functional-form or structural-parameter assumptions: (1) a direct component (raw inflation before fiscal support, holding wages and asset prices fixed; captures heterogeneous consumption baskets and the Fisher revaluation of net nominal positions, labor income, dividends and capital gains); (2) an unconventional fiscal policy component (ad-hoc energy price interventions and transfers); (3) an indirect component (short-run responses of nominal wages, pensions, taxes/fiscal drag, and asset prices); (4) a long-run adjustment component (relative prices returning to pre-shock ratios). They combine micro data—Household Budget Survey (2015 wave) for expenditure shares, HICP micro data for good-specific price changes (20 COICOP-based categories), the 2017 Household Finance and Consumption Survey (HFCS) for budget-constraint components, the Bruegel dataset for fiscal responses, and IMF (Dao et al. 2023) counterfactual prices—with event-study/high-frequency identification (on German HICP release days) for wage, pension, house, stock and bond price responses. Households are sorted into 15 groups: three age classes (25-44 young, 45-64 middle-aged, 65+ retirees) and five consumption (permanent-income proxy) quintiles per country. Welfare is expressed as a share of triennial (3-year) disposable income.&lt;/p&gt;
&lt;p&gt;Main findings: (i) Average country-level welfare losses were sizable and heterogeneous: around 3% of triennial income in France and Spain, 7% in Germany, and 9% in Italy. (ii) The episode resembles an age-dependent tax: retirees lost up to 14% (German and Italian high-income retirees), while roughly half of 25-44 year-olds were net winners; young French households gained up to 7% (about EUR 4,000 on average), young Spanish broke even; middle-aged households lost roughly 2-11%. Overall about one quarter of euro area households were net winners. (iii) Losses were quite uniform across consumption quintiles because rigid (sticky) rents hedged the poor; excluding rents, the poor suffer more due to higher energy/food exposure. (iv) Nominal net positions (NNP) were the key driver of cross-household heterogeneity—retirees hold large positive nominal assets, the young hold nominal mortgage debt. (v) Energy prices generated vast individual-inflation-rate variation, but unconventional fiscal policy (especially energy price caps, more so in France where it cut inflation ~2 p.p.) shielded households, reducing first-stage welfare costs by about one-fifth on average. Estimated asset-price elasticities to a 10% inflation surprise: house prices -1.38% (beta x delta = -3.995 x 0.035 = -0.138), stocks -0.410, bonds -0.726. Pensions, being indexed, rose faster than wages; fiscal drag taxed away gains in Italy and Spain (unindexed brackets), much less in France/Germany. The counterpart of household losses is a large government gain from eroded real public debt: governments in France, Italy and Spain were net winners (Italy +4.5 to 5.1% of triennial GDP), while Germany roughly broke even. Policy implication: in a monetary union where monetary policy cannot address country-specific dynamics, fiscal policy was crucial; and redistributing government inflation gains to households could substantially offset their losses.&lt;/p&gt;
&lt;h2 id="layer-2-deep-dive"&gt;Layer 2: Deep Dive&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;What is the identification/measurement strategy and what are its main threats?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core strategy is an envelope-theorem decomposition that yields analytical &amp;lsquo;sufficient-statistic&amp;rsquo; formulas for money-metric welfare change, requiring only observable budget-constraint quantities and price changes—no structural parameters or functional forms. The key assumption is that, to first order, substitution in consumption baskets and portfolio rebalancing after the shock have only second-order welfare effects, so observed pre-shock quantities (2015 HBS shares, 2017 HFCS positions) can be used. Four structural assumptions define the shock: (1) it is unanticipated; (2) the price-level jump is permanent but inflation is temporary (returns to zero from t=1); (3) the shock is long-run neutral in aggregate and across the distribution—all nominal variables and relative prices realign one-to-one with the new price level by t=1; (4) the government budget constraint accommodates either via the price level (active/FTPL) or via future real surpluses (passive). For asset-price responses they use high-frequency identification: regressing daily REIT, stock and bond returns on the inflation surprise (daily change in 1-year inflation-linked swaps) on German HICP release days, controlling for stock returns. Main threats: the first-order/second-order approximation could fail if substitution effects are large (the authors note that pre/post high-frequency micro data—unavailable to them—could test this); the use of 2015 expenditure shares and 2017 balance sheets to represent the pre-shock state; reliance on counterfactual price series (IMF, OMIE) for what prices would have been absent intervention; and the assumption that relative prices fully return to pre-shock ratios in the long run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the four channels and how are they distinguished empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;(1) Direct component: raw inflation effect on cost of living before fiscal support and before wage/asset-price adjustment; split into average inflation, the &amp;lsquo;pi difference&amp;rsquo; from heterogeneous baskets (C), net income/labor-income purchasing power (Y), net nominal positions (NNP), and dividends+capital gains (K). (2) Unconventional fiscal policy (UFP): energy price interventions (changes in good-specific tax/subsidy wedges, requiring counterfactual no-intervention price indices) plus ad-hoc transfers to households. (3) Indirect: short-run changes in nominal wages, minimum wages, pensions, fiscal drag, and asset prices (house, stock, bond) plus the direct effect of monetary-policy-driven interest-rate changes on deposits and debt. (4) Long-run: welfare from relative prices realigning to the new price level, discounted to t=0. They are computed sequentially in stages so each component&amp;rsquo;s contribution is isolated. NNP is the dominant driver of age heterogeneity; Y is the largest single contributor to losses but is fairly uniform across groups; C matters mainly for poor elderly in Italy and Spain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What heterogeneity is documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Age is the most pronounced dimension: retirees lose most (driven by large positive nominal asset holdings), the young least (often net winners via mortgage debt revaluation). German and Italian retirees lost up to 14% of triennial income; high-income retirees lost more than EUR 10,000 on average. By contrast, the consumption-quintile (permanent-income) gradient is weak because sticky rents hedge low-income renters; excluding rents reveals a negative inflation-income gradient (poor face higher inflation via energy/food). Cross-country: Italy highest cost (~9%), France lowest (~3%), due to (i) bigger raw price shock in Italy (energy import dependence/market structure), (ii) more effective fiscal offset in France, (iii) nominal wages lagging inflation much more in Italy, (iv) Italian middle-aged/elderly holding larger nominal positions while the young borrow less than in France. Within-bin heterogeneity (homeowners with mortgages vs renters) means about a quarter of households are winners overall; more than half of the young in France and Spain, ~50% in Germany, ~30% in Italy, and ~50% of Spanish retirees (extensive pension indexation) are winners.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What role did unconventional fiscal policy play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fiscal interventions reduced first-stage welfare losses by about one-fifth on average across countries and household types. Energy price caps were more important than transfers, especially in 2022 when caps were active in all countries. In France, interventions reduced the measured inflation rate by about 2 p.p.; in Italy interventions came ex-post via bonuses/transfers and so did not lower recorded inflation. Retirees benefited most, consistent with their higher energy/food shares and targeted measures. Government fiscal support outlays were approximately 1% of triennial GDP in all four countries, though in Italy and Spain a larger share (above 35% of costs) went to firms versus 14% (Germany) and 5% (France).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How are asset prices treated and what are the estimated elasticities?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;House prices: a two-step approach—daily REIT (FTSE EPRA NAREIT Eurozone Residential) returns regressed on inflation surprises (beta = -3.995 on the swap surprise) on German HICP release days, then quarterly house-price returns (2006Q1-2023Q4) regressed on lagged REIT returns (delta = 0.035); the product beta x delta = -0.138 means a 10% inflation surprise lowers house prices ~1.38%. Stock and bond elasticities are larger and negative: -0.410 and -0.726 respectively. The asset-price channel is quantitatively negligible in welfare terms because house elasticity is small and stock/bond holdings are concentrated only at the very top of the consumption distribution. Housing and stocks are therefore not good inflation hedges when inflation has a large cost-push component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What about wages, pensions, and fiscal drag in the indirect channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Nominal wage increases were modest, generating a welfare gain of only about 3% of disposable income against a direct loss on nominal wages of about 9.5%. Wages rose faster in France (sectoral agreements, over 4% vs 2-3% elsewhere) and for low-quintile German workers (large minimum-wage rise in October 2022). Pensions, being indexed to past inflation, grew more than wages in all four countries, so retirees gained substantially from the indirect channel, especially in Spain (pensions up 9.5% for most pensioners in 2023). However, fiscal drag (unindexed tax brackets in Italy and Spain) taxed away nominal gains—up to 2.5% for higher-quintile pensioners—whereas France and Germany had near-real-time bracket indexation, so drag was small. Higher ECB interest rates (tightening from July 2022) raised mortgage payments for young Spanish households with adjustable-rate mortgages, partly wiping out their NNP gains; the effect was small elsewhere (fixed-rate mortgages, limited deposit-rate pass-through).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What does the sectoral (government and foreign) analysis show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using Euro Area Sector Financial Accounts (2017), the household sector holds positive net nominal positions (total NNP/triennial GDP: 0.28 Germany, 0.31 France, 0.35 Italy, 0.13 Spain), governments hold negative positions, and the foreign sector is a creditor against all except Germany. From the NNP channel alone the household sector lost (as % of triennial GDP): -3.8 Germany, -2.9 France, -3.9 Italy, -0.5 Spain; governments gained +3.5, +4.8, +7.5, +4.5; the foreign sector gained +0.3 in Germany but lost -1.9, -3.6, -3.9 in France, Italy, Spain. Adding fiscal drag (revenue), fiscal support cost (~1% GDP), higher pension cost (~1% GDP, peak 1.7% Italy), and higher government energy purchase cost, total government gains were: Germany -0.6 to +0.5 (roughly breaks even), France +1.3 to 2.1, Italy +4.5 to 5.1, Spain +1.6 to 2.2% of triennial GDP. Cross-country differences in government gains are driven mainly by the outstanding stock of public debt. Redistributing these government gains to households could substantially offset household losses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How does this paper relate to and differ from prior work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;It applies the envelope-theorem money-metric approach used by Auclert (2019), Slacalek et al. (2020), Fagereng et al. (2022) and Del Canto et al. (2023), but studies a specific historical episode as an event study rather than identified shocks. It builds directly on Cardoso et al. (2022), who quantify the direct channel for Spain using bank-account data, by adding the other three channels (fiscal, indirect, long-run) and covering four countries. It contributes to the inflation-heterogeneity literature (Kaplan-Schulhofer-Wohl, Jaravel, Hobijn-Lagakos, Argente-Lee) by documenting inflation-rate differentials an order of magnitude larger than pre-pandemic US estimates, and confirms Doepke-Schneider (2006) that age is the key dimension via life-cycle net nominal positions. Unlike fully specified HANK models (Pugsley-Rubinton, Olivi et al., Yang), the sufficient-statistic approach cannot evaluate policy counterfactuals. Most contemporaneous euro-area papers stop at measuring differential inflation; this one quantifies full welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What are the main caveats and robustness considerations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The framework is first-order: it assumes consumption and portfolio adjustments have only second-order welfare effects, which the authors flag as testable with high-frequency micro data they lacked. Survey-based (HFCS) nominal asset measures are 2-3 times smaller than financial-account measures because surveys undersample the very rich, so the Section 4 micro results best represent the population excluding the wealth top. Expenditure weights come from the 2015 HBS (judged stable using 2005/2015 HBS and credit-card evidence); inflation expectations (0.4-1.7%/year) come from Consensus Economics early 2021. A robustness note: assuming 0.75%/year trend productivity growth (so part of nominal wage rises reflects trend, not catch-up) increases welfare losses by roughly 1.5% of disposable income. The retiree/young housing trade is modeled as selling/buying one tenth of housing (3/30 over the 3-year long run). The conclusion notes the episode coincided with high pandemic excess savings that cushioned purchasing-power erosion, and that the inflation tax effectively redistributes from retirees to the young, partially offsetting future fiscal adjustment.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;</description></item><item><title>"Compensate the Losers?" Economic Policy and the Origins of U.S. Partisan Realignment</title><link>https://macropaperwarehouse.com/papers/compensate-the-losers-economic-policy-and-the-origins-of-u.s.-partisan-realignment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/compensate-the-losers-economic-policy-and-the-origins-of-u.s.-partisan-realignment/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why have less-educated voters in the United States abandoned the Democratic Party over recent decades? The paper argues that the Democratic Party&amp;rsquo;s evolution on &lt;em&gt;economic policy&lt;/em&gt; — specifically its retreat from &amp;ldquo;predistribution&amp;rdquo; — is a central, previously understudied driver of partisan realignment by education.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conceptual Framework.&lt;/strong&gt; The authors distinguish between two categories of egalitarian economic policy: (1) &lt;em&gt;predistribution&lt;/em&gt; — policies that alter the pre-tax-and-transfer earnings distribution, including job guarantees, minimum wage increases, union support, and protectionist trade policies (following Hacker 2011); and (2) &lt;em&gt;redistribution&lt;/em&gt; — taxes and transfers. The paper&amp;rsquo;s central claim is that these two types of policy have sharply different educational gradients among voters, and that the Democratic Party moved away from predistribution beginning in the 1970s, triggering educational realignment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The authors harmonize over 1,000 surveys (N ≈ 2.2 million observations) spanning 1942–2020, drawn from Gallup, ANES, GSS, CCES, and historical survey archives housed at iPoll/Cornell. Education is translated into a common metric (adjusted years of schooling) using Census data, controlling for sex, race, year, and birth cohort to address the changing selectivity of educational categories over time. Congressional roll-call data come from the Comparative Agendas Project (CAP). Campaign finance data come from FEC filings, Congressional hearing records, and watchdog sources. DLC membership data are compiled from official Democratic Leadership Council records (available for 1985, 1986, 1991, 1993, and 1997 onward) and DLC-aligned Congressional caucus lists. House election returns are taken from King and Palmquist (1997) at the minor-civil-division-group (MCDG) level (~60 units per Congressional district), matched to 1980 Census demographic data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Voter preferences (demand side):&lt;/em&gt; The educational gradient for predistribution is large and negative: averaged across the four predistribution questions (job guarantee, minimum wage, union support, trade protection), each additional year of education reduces support by 0.044 standard deviations (p &amp;lt; 0.001). A college graduate relative to a high school graduate supports predistribution 0.176 standard deviations less — equivalent to roughly half the average Democrat-Republican gap in predistribution support (which is 0.34 standard deviations). This gradient has been stable since at least the 1940s. By contrast, the educational gradient for redistribution (higher taxes on the rich, views on own taxes, welfare spending) is close to zero (summary β = 0.004, not distinguishable from zero in the full sample). The difference between the two gradients is statistically significant (p &amp;lt; 0.001). These results replicate in white-only samples. Notably, the educational gradient on social issues — measured across nine questions on racial attitudes, gender roles, sexual norms — is positive (more education predicts more liberal positions) but has been largely &lt;em&gt;stable&lt;/em&gt; since the 1940s, not increasing, conditional on the long-run sample.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Party supply (supply side):&lt;/em&gt; Before 1976, predistribution topics accounted for roughly one-quarter of Democratic House roll-call votes when Democrats controlled the chamber. After 1976 (taking Jimmy Carter&amp;rsquo;s presidency as the start of the &amp;ldquo;New Democrat&amp;rdquo; era), this share falls by approximately nine to ten percentage points, while the redistribution share of votes holds steady. Between 1968 and 1980, the union share of total PAC donations to Democratic Congressional candidates falls from approximately 90 percent to 40 percent, coincident with 1970s campaign finance reforms that placed union and corporate PACs on equal legal footing and allowed corporations to exploit their naturally deeper pockets. Corporate PAC share of Democratic donations correspondingly rises from approximately 10 percent to 45 percent over the same period. In individual contributions to primary elections (data beginning in 1980), Democratic primaries rely on increasingly more-educated census tracts relative to Republican primaries; by 2018 Democratic primaries are financed from census tracts averaging 0.41 more years of education than Republican primaries (against a within-year standard deviation of 1.56 years).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;The New Democrat/DLC faction:&lt;/em&gt; The authors identify the anti-predistribution faction through official DLC membership records and aligned caucus lists. DLC membership as a share of Democratic House seats grows from near zero in the mid-1970s to approximately half by the early 2000s. Roll-call voting analysis (N = 3,428,405 vote-observations) shows DLC members are more conservative than other Democrats overall, and &lt;em&gt;especially&lt;/em&gt; so on predistribution: for a 10-percentage-point increase in the share of Republicans voting for a bill, the probability a DLC member votes in favor increases 36 percent more on predistribution bills than on other bills. DLC members show no differential conservatism on redistribution. They are also significantly more socially conservative — more likely than other Democrats to support the Defense of Marriage Act (by 16 pp), the Partial-Birth Abortion Ban (by 7 pp), and restrictive immigration bills (by 10 pp). DLC candidates receive significantly less from labor PACs and significantly more from corporate PACs, and draw their out-of-district individual donations from census tracts averaging more than 0.1 years more educated than non-DLC Democrats.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Voter reaction and the inflection point:&lt;/em&gt; Using the N ≈ 2.2 million partisan identification dataset, the authors estimate a structural break in the education-party identification gradient. From the 1940s through the mid-1970s, each additional year of education reduces the probability of identifying as a Democrat by approximately 3 percentage points. A Chow breakpoint test identifies 1976 as the inflection point. Since 1976, the gradient steadily rises; by 2000 it reaches zero; and today (as of the sample period end ~2020) each additional year of education &lt;em&gt;increases&lt;/em&gt; Democratic identification by approximately 3 percentage points — an almost exact reversal. The breakpoint for Republican identification occurs later, in 1992, consistent with the Democratic agenda changing first. A Gallup prosperity question (&amp;ldquo;which party will better keep the country prosperous?&amp;rdquo;) shows a parallel pattern: controlling for views on parties&amp;rsquo; economic performance explains approximately 44 percent of partisan realignment, interpreted as an upper bound on economic policy&amp;rsquo;s contribution.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Factional tests — hypothetical elections and actual results:&lt;/em&gt; In hypothetical general-election matchups from 1972–1992 Democratic primaries (in which most contests pitted a &amp;ldquo;New Democrat&amp;rdquo; against an &amp;ldquo;Old Democrat&amp;rdquo;), a voter with a college degree is roughly 3 percentage points &lt;em&gt;more&lt;/em&gt; likely to vote Democratic when the candidate is a New Democrat rather than an Old Democrat. In 1980s actual House elections using MCDG-level data, DLC candidates out-perform other Democrats in more educated neighborhoods by a magnitude large enough to erase approximately 90 percent of the general Democratic underperformance in highly educated areas. Combining these estimates, the party&amp;rsquo;s shift toward the DLC accounts for a lower bound of approximately 20 percent, and an upper bound (from the prosperity question) of approximately 50 percent, of educational realignment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The analysis focuses on the United States, 1942–2015 (with some post-2015 discussion in the conclusion). The faction analysis focuses on the Democratic side; Republican faction changes are discussed but not the primary focus. The paper is explicit that between 20–50 percent of realignment is explained, leaving room for other factors, including social issues. The analysis ends mostly before 2016 to avoid complications from the closure of the DLC in 2011 and shifting post-2010 party dynamics.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the paper&amp;rsquo;s central conceptual innovation, and how does it differ from prior realignment research?&lt;/strong&gt;
The paper separates egalitarian economic policies into &amp;ldquo;predistribution&amp;rdquo; (pre-tax-and-transfer market interventions such as minimum wages, job guarantees, union support, and protectionism) and &amp;ldquo;redistribution&amp;rdquo; (taxes and transfers) and shows these two types have sharply different educational gradients. Prior work typically aggregated all economic policies into a single index, which the authors argue masks essential heterogeneity. By documenting that the educational gradient is large and negative for predistribution but close to zero for redistribution — a pattern stable since the 1940s — the paper reframes the &amp;ldquo;voting against economic interest&amp;rdquo; puzzle: less-educated voters leaving the Democratic Party may be responding rationally to changes in the supply of the type of economic policy they actually prefer.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How large and stable is the educational gradient on predistribution, and how does it compare to social issues?&lt;/strong&gt;
The average coefficient on adjusted years of schooling across the four predistribution questions is -0.044 (p &amp;lt; 0.001), stable over eight decades. A four-year difference in education (high school vs. college) shifts an individual&amp;rsquo;s support for predistribution by 0.176 standard deviations in the conservative direction — about half the average Democrat-Republican gap in predistribution support (0.34 standard deviations). For social issues, the summary gradient is positive (+0.028, p &amp;lt; 0.001 for the full sample), but this gradient has been largely &lt;em&gt;stable&lt;/em&gt; since the 1940s across nine social issue questions, not increasing over time. This stability undermines the interpretation that rising social liberalism among the educated is a new phenomenon driving realignment, at least through the supply of parties&amp;rsquo; social positions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What happened to predistribution as a share of the Democratic House agenda after the 1970s?&lt;/strong&gt;
Using the Comparative Agendas Project classification, predistribution topics (labor regulation, industrial policy, public works, trade) accounted for roughly one-quarter of all House roll-call votes during years Democrats controlled the Speakership before 1977. After 1977, this share falls by approximately 9–10 percentage points (a decline of nearly half from its pre-1977 share), and the decline is statistically significant (p &amp;lt; 0.001). The redistribution share of votes holds essentially constant. Party platform data from Hopkins et al. (2022) show a sharp decline in Democratic use of terms like &amp;ldquo;minimum wage,&amp;rdquo; &amp;ldquo;full employment,&amp;rdquo; and labor-relations language beginning in the 1970s and 1980s, while Republican platforms use these terms sparingly throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How did 1970s campaign finance reforms change the financial composition of the Democratic Party?&lt;/strong&gt;
Before the early 1970s, unions enjoyed substantially more freedom than corporations under separate legal regimes governing PAC donations; mid-1970s reforms placed them on equal legal footing, enabling corporations to exploit their deeper pockets. The union share of total PAC donations to Democrats fell from approximately 90 percent in 1968 to approximately 40 percent by 1980, while the corporate share rose from approximately 10 percent to 45 percent. For Republicans, both series barely changed: unions had never donated substantially to the GOP, and the corporate share rose only modestly (from approximately 70 to 80 percent). The authors note the rapid decline cannot be attributed to falling union density in the economy, since both union and corporate PAC donations grew in absolute terms during this period; the relative shift was the result of the regulatory change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Who are the &amp;ldquo;New Democrats&amp;rdquo; / DLC, and when did they emerge?&lt;/strong&gt;
The DLC officially operated from 1985 to 2011, but members who would join it began entering Congress in large numbers in the 1970s (&amp;ldquo;Watergate Babies&amp;rdquo; of 1974, &amp;ldquo;Atari Democrats&amp;rdquo;). The DLC grew to approximately half of all Democratic House seats by the early 2000s. Members were drawn from suburban, affluent districts; their founder Al From explicitly criticized all four predistribution policies the paper studies (minimum wage, job guarantees, unions, and protectionism). The breakpoint test on DLC share in Congress identifies 1975 as the pivotal year — one year before the 1976 inflection point in partisan identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do DLC members vote differently from other Democrats, and how is this differential conservatism distributed across policy types?&lt;/strong&gt;
In roll-call regressions (N = 3,428,405 observations, with roll-call fixed effects), a 10 pp increase in the Republican vote share for a bill increases the probability a DLC member votes in favor by 1.48 pp more than for other Democrats (baseline result for all bills). For predistribution-classified bills, this excess alignment with Republicans is 36 percent larger than for non-predistribution bills. Crucially, DLC members are no more conservative than other Democrats on redistribution-classified votes (the interaction with redistribution is near zero and insignificant). DLC members are also differentially more conservative on social issues, a result that proves useful in separating economic from social-issue explanations of realignment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Do DLC members finance differently from other Democrats?&lt;/strong&gt;
Yes. In primary elections, DLC candidates receive approximately 9.7 pp less of their PAC financing from labor unions and approximately 6.7 pp more from corporate PACs (with state fixed effects) relative to non-DLC Democrats. Out-of-district individual contributions to DLC primary candidates come from census tracts averaging more than 0.1 years more educated than those for non-DLC Democrats, while within-district contributions show no significant difference (0.060 years, insignificant). This pattern suggests educated out-of-district donors, rather than local constituency demands, drive DLC candidates&amp;rsquo; anti-predistribution orientation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: When precisely did educational realignment in Democratic party identification begin, and what does the inflection-point analysis show?&lt;/strong&gt;
Using N ≈ 2.2 million observations from 1,006 surveys, a Bai-Perron breakpoint test on the year-by-year education gradient in Democratic party identification identifies 1976 as the inflection point (with robustness to alternative specifications yielding breakpoints of 1978–1980 for white-only samples and unadjusted years of schooling). Before 1976, each additional year of education reduces the probability of Democratic identification by approximately 3 percentage points (a stable, significantly negative relationship since the 1940s). After 1976, the gradient steadily rises; it reaches zero around 2000 and today is approximately +3 percentage points per year of education — nearly an exact reversal of the baseline. The corresponding Republican inflection point occurs in 1992, about 16 years later, consistent with the Democratic Party&amp;rsquo;s agenda changing first.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do hypothetical presidential matchup surveys test the DLC mechanism?&lt;/strong&gt;
The authors identify six Democratic primaries from 1972–1992 where a &amp;ldquo;New Democrat&amp;rdquo; and an &amp;ldquo;Old Democrat&amp;rdquo; were the top two contenders (e.g., Hart vs. Mondale in 1984, Clinton vs. Brown in 1992). Gallup and other surveys asked all respondents — regardless of party — whom they would vote for if either the New or the Old Democrat faced the eventual Republican nominee. A voter with a college BA is approximately 3 percentage points more likely to vote for the Democrat when the candidate is a New Democrat versus an Old Democrat (the &amp;ldquo;difference in differences&amp;rdquo; of hypothetical vote shares). This holds after controlling for state × election fixed effects and in five of the six election cycles studied (the 1976 exception is attributed to Mo Udall&amp;rsquo;s low name recognition, with 28 percent of respondents unfamiliar with him in a May 1976 poll). The result is attenuated but remains marginally significant when excluding non-white respondents, consistent with New Democrats&amp;rsquo; success with white voters due in part to their more conservative civil rights positioning.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What do actual House election results (MCDG-level data) show about DLC electoral performance by neighborhood education?&lt;/strong&gt;
Using 1980s House returns at the MCDG level (~60 neighborhoods per Congressional district), the authors regress Democratic vote share on neighborhood years of education interacted with a DLC candidate indicator, with Congressional district fixed effects. More-educated neighborhoods generally depress Democratic vote share (reflecting the still-negative overall educational gradient in the 1980s), but DLC candidates dramatically out-perform other Democrats in educated areas: the interaction coefficient is positive and significant, and its magnitude is large enough to erase approximately 90 percent of the general Democratic underperformance in highly educated neighborhoods. This result is robust to including District × Year fixed effects (so the identification comes from within-election, cross-neighborhood variation) and to adding controls for share white and share under age 35.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How much of educational realignment can the paper&amp;rsquo;s mechanism account for, and how is this calculated?&lt;/strong&gt;
Two bounding estimates are provided. Upper bound (~44–50%): controlling for a respondent&amp;rsquo;s view on which party is better for economic prosperity (from Gallup since 1950) explains approximately 44 percent of the change in the education-party identification gradient (specifically, the total difference in the unconditional gradient between the 1948–1967 baseline and 2001–2020 is 2.411 pp per year of schooling; after controlling for the prosperity question, the unexplained residual is 1.342 pp, leaving a share explained of 44.3 percent). Lower bound (~20%): the difference in the education gradient between matchups involving New versus Old Democrats in Table 4 (~0.75 pp) divided by the total realignment shift (~4 pp from pre-1976 to post-2008 for presidential voting) implies the faction shift accounts for at least approximately one-fifth of realignment. The authors interpret these as bounds because the prosperity question may partly capture party identification itself (upper bound concern), while the hypothetical matchup estimate misses the broader ideological shift not captured in a single election (lower bound).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Can social issues, Civil Rights realignment, or Republican changes better explain the 1970s inflection point?&lt;/strong&gt;
Three alternative explanations are addressed. (1) &lt;em&gt;Civil Rights:&lt;/em&gt; Regional analysis shows that educated white Southerners &lt;em&gt;left&lt;/em&gt; the Democrats in the 1940s–1960s (not the 1970s), consistent with their realignment being driven by Democrats&amp;rsquo; liberal turn on civil rights rather than economic policy. After the 1960s, the South follows all other regions in the pace of educational realignment. (2) &lt;em&gt;Republican changes:&lt;/em&gt; The Republican party identification inflection point occurs in 1992, about 16 years after the Democratic inflection in 1976. Reagan elections in 1980 and 1984 do not appear to have differentially attracted less-educated voters (the &amp;ldquo;Reagan Democrats&amp;rdquo; were not differentially less educated). (3) &lt;em&gt;Social issues:&lt;/em&gt; The New Democrats were actually &lt;em&gt;more&lt;/em&gt; socially conservative than other Democrats (more likely to vote for DOMA, anti-abortion bills, restrictive immigration legislation), yet they disproportionately attracted educated voters. This internal inconsistency rules out a pure social-issues explanation for why educated voters preferred the DLC faction. (4) &lt;em&gt;Religion:&lt;/em&gt; Flexibly controlling for religious affiliation explains essentially none of partisan realignment (Appendix Figure A.24).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What is the role of out-of-district individual donors in shifting Democratic Party positions?&lt;/strong&gt;
Out-of-district primary donors are analytically important because they influence candidate supply without being able to vote in the election, isolating the &amp;ldquo;within-party&amp;rdquo; financial influence of educated supporters. By 1980, out-of-district primary donors to Democratic candidates already come from census tracts more educated than those for Republican candidates, even as local Democratic voters and within-district donors remain less educated than Republican counterparts. Democratic candidates also receive a substantially higher share of out-of-district contributions than Republican candidates — by almost 10 percentage points (Appendix Table A.7). Out-of-district donors thus represent a channel through which educated, anti-predistribution preferences are transmitted into the Democratic Party&amp;rsquo;s candidate supply before the electoral realignment is visible in vote totals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: Are predistribution policies becoming less popular overall, which might independently push Democrats away from them?&lt;/strong&gt;
The paper tests this alternative in Appendix Table A.9 and finds no evidence that predistribution has become less popular relative to redistribution over time. Predistribution appears on average more popular than redistribution across the sample period. If anything, support for predistribution has held steady or slightly risen relative to redistribution over time, conditional on the paper&amp;rsquo;s survey harmonization. The stability of the educational gradient (shown in Appendix Table A.10 to be unchanged even using educational rank within cohort rather than raw years of schooling) further suggests the negative education-predistribution relationship is a relative, not absolute, phenomenon — consistent with rising average education and stable preferences by education rank.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Predistribution:&lt;/strong&gt; Policies that aim to change the distribution of earnings or income &lt;em&gt;before&lt;/em&gt; taxes and transfers are applied. In this paper, this comprises government job guarantees, minimum wage increases, support for unions and collective bargaining, and protectionist trade policies. Distinguished from redistribution in that it operates on pre-tax market income rather than post-tax outcomes. The paper uses this term following Hacker (2011): &amp;ldquo;a focus on market reforms that encourage a more equal distribution of economic power and rewards even before government collects taxes or pays out benefits.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Redistribution:&lt;/strong&gt; Policies that change post-market income through the tax and transfer system, including higher taxes on the rich, views on own tax burden, prioritization of tax cuts, and transfers to the poor (welfare spending). In the paper&amp;rsquo;s usage, redistribution is analytically distinct from predistribution and has a near-zero educational gradient, in contrast to predistribution&amp;rsquo;s strongly negative gradient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Educational Gradient:&lt;/strong&gt; The coefficient on adjusted years of schooling in a regression of an outcome variable (policy preference or partisan identification) on education, estimated separately by time period. The paper&amp;rsquo;s core finding is that the educational gradient for predistribution is stably negative (approximately -0.044 per year of schooling over the full sample), while the gradient for redistribution is close to zero, and the gradient for Democratic party identification shifts from approximately -0.03 to +0.03 per year of schooling between the 1940s and 2020.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;New Democrats / DLC (Democratic Leadership Council):&lt;/strong&gt; An explicitly anti-predistribution faction within the Democratic Party, identified through official DLC membership records and affiliated Congressional caucus lists. Founded formally in 1985 (operating through 2011), the DLC arose in part from the &amp;ldquo;Watergate Babies&amp;rdquo; cohort of 1974. DLC members were more conservative than other Democrats &lt;em&gt;especially&lt;/em&gt; on predistribution and social issues, relying differentially on corporate PACs and educated out-of-district donors. The paper treats DLC membership as a proxy for an anti-predistribution faction that gained bargaining power within the Democratic Party from the 1970s onward.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adjusted Years of Schooling (AdjYearsEduc):&lt;/strong&gt; The paper&amp;rsquo;s harmonized education variable across more than 1,000 surveys spanning eight decades. Because raw educational categories change over time and represent different selectivity (e.g., in 1940 only one-quarter of adults had completed twelfth grade, versus nearly 90 percent today), the authors use Census microdata to predict years of schooling as a function of self-reported educational category, sex, race, year, and birth cohort in ten-year bins. This provides a common unit of measurement across surveys with incompatible category systems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inflection Point (1976):&lt;/strong&gt; The structural break in the trend of the education-Democratic identification gradient, estimated using Bai-Perron (1998) methods on N ≈ 2.2 million observations. The data select 1976 as the year at which the previously stable negative gradient begins its upward trajectory. The corresponding Republican inflection point occurs in 1992. The paper argues that identification of this inflection point — not previously documented in the realignment literature — is made possible only by the large historical dataset assembled.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minor Civil Division Group (MCDG):&lt;/strong&gt; The granular geographic unit used in the House election analysis for the 1980s, with approximately sixty MCDGs per Congressional district. Matched to 1980 Census demographic data to assign average years of education. Used to test whether DLC candidates out-perform other Democrats in more-educated neighborhoods, within the same Congressional district and election year, to address the concern that DLC candidates sort into more-educated districts.&lt;/p&gt;</description></item><item><title>(Not) Thinking About the Future: Financial Information and Maternal Labor Supply</title><link>https://macropaperwarehouse.com/papers/not-thinking-about-the-future-financial-information-and-maternal-labor-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/not-thinking-about-the-future-financial-information-and-maternal-labor-supply/</guid><description>&lt;p&gt;This paper investigates whether information constraints — rather than fully forward-looking choices — contribute to mothers&amp;rsquo; reduced labor supply after childbirth, a key driver of gender inequality. The authors deploy two complementary methods in Switzerland: a representative descriptive survey of Swiss mothers aged 25–50, and a large-scale randomized controlled trial (RCT) among approximately 2,400 female public school teachers with children who work part-time.&lt;/p&gt;
&lt;p&gt;The descriptive survey first establishes that long-term financial factors are not top of mind for mothers making labor supply decisions: only about 11% of mothers spontaneously mention pensions or long-term career considerations when asked about their post-childbirth employment choices, compared to roughly half who mention child or own well-being. Beyond salience, the survey documents substantial misperceptions: 62% of women over-estimate pension receipt under part-time work by more than 10%, and a similar share believes wage growth under low part-time hours (40% FTE) is at least as high as under 80% employment. The authors label mothers with overly optimistic beliefs on both dimensions &amp;ldquo;cost-unaware&amp;rdquo;; 42% of the sample qualifies. Cost-unawareness is more prevalent among less-educated mothers and correlates with less financial interest and more gender-conservative attitudes.&lt;/p&gt;
&lt;p&gt;The RCT tests whether providing objective, individualized information shifts financial planning and labor supply. Teachers in treatment schools (two-thirds of all schools) were individually randomized into a treatment group viewing an informational video about the long-run earnings, pension, and life-event consequences of sustained part-time employment, plus access to a Future Calculator tool, or a placebo video on unrelated financial topics. The two-stage randomization (school-level first, then individual within treated schools) allows identification of both direct treatment effects and spillovers. Outcomes are measured in a Wave 1 post-video survey, a follow-up survey two months later, and linked administrative personnel records from the Department of Education one year post-intervention.&lt;/p&gt;
&lt;p&gt;Main findings: treated teachers are 31.26 percentage points (58% over the pure control mean) more likely to correctly rank the relative magnitude of long- versus short-term financial factors. Demand for financial planning tools rises by 0.39 standard deviations (SD) overall and by 0.31 SD among cost-unaware women specifically. In terms of stated labor supply plans, the treatment raises planned employment for the next academic year by 1.69 percentage points (ppt) in the full sample and by 4.95 ppt (9% over the pure control mean) among cost-unaware women. These plan effects persist two months later for cost-unaware women but fade for the full sample.&lt;/p&gt;
&lt;p&gt;Critically, stated plans translate into verified behavior: linked administrative data one year post-intervention show that cost-unaware teachers increase their contracted employment level by 3.87 ppt, or 7% over the pure control mean of 53.30% FTE. Cost-aware and overly pessimistic women do not reduce their labor supply upon learning they are better off than feared, an asymmetry consistent with agents responding more to perceived losses than gains. If the 3.87 ppt increase were sustained from age 40 onward, cost-unaware teachers would accumulate an additional 130,000 CHF in lifetime income and 40,000 CHF in pension wealth, shrinking the gender gap in lifetime income and pension receipt among teachers by approximately 18% each.&lt;/p&gt;
&lt;p&gt;The paper is scoped to Swiss female public school teachers — a population with linear pay scales, no part-time promotion penalty, and relatively low adjustment barriers — meaning the measured lifetime earnings and pension losses likely represent a lower bound relative to other occupations. Short-term RCT findings replicate among a sample of pregnant women in the general Swiss population, and the paper argues that similar labor supply adjustment magnitudes are feasible for a broader segment of part-time working mothers.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does it matter?
A: The paper asks whether mothers&amp;rsquo; post-childbirth reduction in labor supply is partly driven by information constraints — specifically, whether mothers fail to account for the full long-term financial consequences of working reduced hours. This matters because if the child penalty partly reflects uninformed choices rather than deliberate tradeoffs, standard policy tools (parental leave, childcare subsidies) may underperform precisely because their long-term financial benefits are not internalized.&lt;/p&gt;
&lt;p&gt;Q: How prevalent is cost-unawareness among Swiss mothers?
A: 62% of mothers in the descriptive survey over-estimate pension receipt under part-time work by more than 10%, a similar share believes wage growth under low part-time (40% FTE) is at least as high as under 80% employment, and 42% are overly optimistic on both dimensions simultaneously. Cost-unawareness follows an education gradient: 77% of low-education women over-estimate pension receipt versus 51% of high-education women.&lt;/p&gt;
&lt;p&gt;Q: What share of mothers spontaneously considers long-term financial factors when deciding on their labor supply?
A: Only about 11% of mothers mention any long-term financial factor (pensions, financial independence, long-term career considerations) in open-ended responses; the share is similarly low across education groups (6% low, 12% mid, 13% high). About 50% mention child or own well-being; roughly 30% raise short-term financial factors such as current childcare costs.&lt;/p&gt;
&lt;p&gt;Q: What are the actual long-term financial stakes of the average female teacher&amp;rsquo;s part-time employment pattern in Switzerland?
A: Compared to full-time employment, the average female teacher&amp;rsquo;s employment trajectory produces a 35% reduction in potential lifetime earnings (approximately 3.34 million CHF versus 5.12 million CHF). Monthly pension receipt under the part-time scenario is 31% lower overall and 43% lower from the occupational second-pillar scheme specifically — a gap comparable to the average 47.5% gender pension gap observed in the second pillar in Switzerland in 2024.&lt;/p&gt;
&lt;p&gt;Q: How was the RCT designed and what populations were included?
A: The study recruited 2,359 part-time working mothers employed as public school teachers in a German-speaking Swiss canton. A two-stage randomization assigned two-thirds of schools to treatment schools (within which teachers were individually randomized 50/50 to treatment or spillover control) and one-third to pure control schools. This design allows estimation of direct treatment effects and spillover effects. The intervention was timed to precede December–January, the period when teachers communicate their preferred employment levels for the next school year.&lt;/p&gt;
&lt;p&gt;Q: What was the treatment intervention?
A: Treated teachers watched an informational video following a representative female teacher considering an employment-level increase, covering the impact of part-time work on lifetime earnings, monthly pension receipt, and financial exposure after adverse events such as divorce; it also benchmarked these magnitudes against childcare costs. Treated teachers additionally received individualized access to the Future Calculator, an online projection tool developed with a Swiss bank, calibrated to teachers&amp;rsquo; deterministic salary and pension schedules.&lt;/p&gt;
&lt;p&gt;Q: Did treated teachers understand and retain the treatment information?
A: Yes. Treated teachers were 31.26 ppt (58% over the pure control mean) more likely immediately after the intervention to correctly rank long- versus short-term financial factors in a vignette. Two months later, the treatment group remained significantly more likely to apply the information correctly (22.63 ppt higher), indicating the knowledge was not short-lived.&lt;/p&gt;
&lt;p&gt;Q: How did demand for financial planning tools respond to the treatment?
A: The treatment raised a financial information/tools index by 0.39 SD overall. For cost-unaware women specifically, demand for financial tools rose by 0.31 SD; cost-aware and pessimistic women showed no significant change. There was no significant average treatment effect on sign-up for an incentivized financial consultation.&lt;/p&gt;
&lt;p&gt;Q: How large were the labor supply plan effects in the survey, and did they persist?
A: For the full sample, treated teachers planned a 1.69 ppt higher employment level for the next school year immediately after the treatment, and 3.13 ppt higher in 10 years. For cost-unaware women, the short-run planned increase was 4.95 ppt (9% over the pure control mean of about 55%), and plans for 5 and 10 years into the future rose by approximately 4 ppt (6–7% over the mean). The short-run effects for cost-unaware women persisted to the two-month follow-up, while full-sample short-run effects faded.&lt;/p&gt;
&lt;p&gt;Q: What do the linked administrative data show about actual labor supply one year post-intervention?
A: Cost-unaware women in the treatment group increased their contracted employment level by 3.87 ppt relative to the pure control group (7% over the pure control mean of 53.30% FTE), closely matching the planned increase stated immediately after the treatment. Cost-aware women and the full sample showed no statistically significant shift in actual hours.&lt;/p&gt;
&lt;p&gt;Q: What asymmetry did the authors observe between cost-unaware and cost-aware women?
A: Cost-unaware (overly optimistic) women increased their labor supply upon learning the true financial costs; cost-aware and overly pessimistic women did not reduce their labor supply upon learning they were better off than expected. The authors interpret this as consistent with agents responding more to perceived losses (bad news for cost-unaware women) than to gains (good news for pessimistic women), and with cost-aware women already having incorporated the financial logic into their decisions even without precise estimates.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated lifetime impact of the observed labor supply adjustment?
A: If cost-unaware teachers maintain the 3.87 ppt employment increase from age 40 to retirement, they accumulate an additional 130,000 CHF in lifetime income and 40,000 CHF in pension wealth on average. This would reduce the gender gap in both lifetime income and pension receipt among teachers by approximately 18% each.&lt;/p&gt;
&lt;p&gt;Q: What emotional and social mechanisms did the paper document?
A: The treatment initially produced significantly negative emotional responses (−0.41 SD on an emotions index overall; −0.68 SD for cost-unaware women), consistent with cognitive dissonance from information conflicting with prior beliefs. Two months later, the treatment group reported feeling more in control and less stressed, and cost-unaware women returned to a neutral emotional baseline. Treated women were also 19.61 ppt more likely to have discussed the topic with anyone, with the largest effect on conversations with partners or family.&lt;/p&gt;
&lt;p&gt;Q: Did the treatment affect household-level labor supply — specifically, did partners reduce their hours?
A: No. The authors found no evidence that partners of cost-unaware women planned to work less in response to the treatment, and women did not plan to adjust future fertility. This suggests the observed hours increase by treated cost-unaware women was not offset by partner adjustments within the household.&lt;/p&gt;
&lt;p&gt;Q: Were there social spillover effects within schools?
A: Treated teachers were 11.59 ppt more likely to report having discussed the video with colleagues. Two months later, cost-unaware control teachers in treated schools (the spillover group) showed some evidence of absorbing the general treatment message and adjusting short-term labor supply plans upward, and a noisy increase in actual employment of roughly one-third the magnitude of the direct treatment effect, though these estimates were imprecise.&lt;/p&gt;
&lt;p&gt;Q: Why might cost-unaware women be uninformed in the first place?
A: In both the descriptive survey and the RCT sample, cost-unaware women lean more gender-conservative in their attitudes and report less interest in financial topics. The authors interpret this as suggesting a lack of information (rather than mere salience or forgetting) drives cost-unawareness, implying that passive information delivery through employers or pension funds could be effective.&lt;/p&gt;
&lt;p&gt;Q: What constraints to labor supply adjustment did the authors explore?
A: In a hypothetical scenario exercise, the scenario producing the largest desired employment increase for both treatment and control groups was if the partner were more engaged (roughly double the adjustment relative to a scenario of higher pay for additional hours). The treatment group adjusted their desired employment level by an additional 0.62–2.03 ppt relative to pure control across all scenarios except relaxing conservative gender norms.&lt;/p&gt;
&lt;p&gt;Q: How generalizable are the findings beyond the teacher sample?
A: The short-term RCT findings replicated among a sample of pregnant women in the general Swiss population. The authors also document that potential net gains from increasing labor supply — net of additional childcare costs — are large for the broader population of part-time working Swiss mothers, supporting feasibility of similar-magnitude adjustments outside teaching. The teaching context likely represents a lower bound for lifetime earnings and pension losses in other professions due to the absence of a part-time promotion penalty in teaching.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The findings suggest that default exposure to individualized financial information about the long-term costs of part-time work — delivered by employers, pension funds, or the state — could improve decision quality and labor supply. More broadly, the results imply that policies designed to increase female labor supply (parental leave reforms, childcare subsidies) may underperform if mothers do not fully internalize the financial benefits of additional hours; ensuring that families solve the correct optimization problem is a precondition for unlocking the full potential of such policies.&lt;/p&gt;
&lt;p&gt;Child Penalty: The large and persistent reduction in women&amp;rsquo;s labor force participation and income following the birth of a first child, identified in the paper as the key driver of remaining gender inequality in the labor market in industrialized countries and a source of profound life-cycle financial consequences including reduced lifetime earnings and pension savings.&lt;/p&gt;
&lt;p&gt;Cost-Unaware: The authors&amp;rsquo; term for women who hold overly optimistic expectations about the financial consequences of part-time work — specifically, who over-estimate pension receipt under low part-time employment by more than 10% and who believe wage growth under low part-time is at least as high as under higher employment levels. In the descriptive survey 42% of mothers qualify on both dimensions.&lt;/p&gt;
&lt;p&gt;Future Calculator: An online individualized projection tool developed by the authors in cooperation with a Swiss bank, calibrated to teachers&amp;rsquo; deterministic salary and pension schedules, allowing users to estimate the long-term financial implications of different employment levels. Used both in the descriptive survey vignette and as part of the RCT treatment.&lt;/p&gt;
&lt;p&gt;Second Pillar (Occupational Pension Scheme, PP): Switzerland&amp;rsquo;s occupational pension scheme, the pillar most heavily affected by part-time work because contributions are directly proportional to earnings above a minimum annual earnings threshold. The paper documents an average gender pension gap of 47.5% in this pillar in 2024 and a 43% lower monthly pension receipt for the average female teacher&amp;rsquo;s part-time trajectory relative to full-time employment.&lt;/p&gt;
&lt;p&gt;Two-Stage Randomization: The experimental design used to separate direct treatment effects from spillover effects within schools. One-third of schools are assigned to a pure control group; in the remaining two-thirds, teachers are individually randomized into treatment or spillover control (untreated teachers in treated schools), enabling identification of both causal treatment impacts and social learning channels.&lt;/p&gt;
&lt;p&gt;Information Constraint: The paper&amp;rsquo;s central mechanism — mothers&amp;rsquo; failure to spontaneously account for the full long-term financial implications of reduced labor supply when making employment decisions, distinct from deliberate forward-looking tradeoffs. The authors document this both through the absence of long-term financial factors in open-ended decision narratives (only 11% of mothers mention them) and through systematic misperceptions of pension and wage outcomes.&lt;/p&gt;
&lt;p&gt;Cognitive Dissonance (as used in the paper): The authors use this term to describe the initial negative emotional response (−0.41 SD overall, −0.68 SD for cost-unaware women) when treated women learn that the true financial costs of part-time work are higher than they expected — information that conflicts with prior beliefs and prior choices, producing unpleasant emotions that subsequently reverse into lower stress levels two months later.&lt;/p&gt;</description></item><item><title>A Cognitive Theory of Reasoning and Choice</title><link>https://macropaperwarehouse.com/papers/a-cognitive-theory-of-reasoning-and-choice/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-cognitive-theory-of-reasoning-and-choice/</guid><description>&lt;p&gt;Bordalo, Gennaioli, Lanzani, and Shleifer develop a cognitive theory of choice in which a decision maker&amp;rsquo;s attention to the features of options is determined by her categorization of the current problem against a memory database of problems she solved in the past. The core claim is that before solving a problem, the decision maker asks &amp;ldquo;what kind of problem is this?&amp;rdquo; and resolves it by selecting the category — indexed by a prototype attention-plus-context vector and a time-discounted frequency — whose similarity to the current problem is maximized. This problem recognition step then pins down which features (price, quality, probabilities) receive attention, which in turn shapes valuation and choice.&lt;/p&gt;
&lt;p&gt;The model formalizes two-step choice. In step one (recognition), the decision maker jointly chooses an attention vector alpha_P and a category c* to maximize a separable similarity function S[(alpha_P, kappa_P), (alpha_c, kappa_c)] weighted by category frequency F_c, plus a Type I extreme-value shock that yields a logit probability over categories. In step two, she maximizes perceived value over the menu using the endogenously determined weights. Perceived hedonic value of feature i shrinks toward the menu average when alpha_{P,i} &amp;lt; 1; perceived probabilities compress toward uniform when the event-attention weight falls below 1, producing probability overweighting of unlikely events. Full attention recovers expected utility.&lt;/p&gt;
&lt;p&gt;The model yields three structural predictions that hold without changing tastes or information. First, within-person multi-modal attention: because categorization is stochastic, the same person can cluster on entirely different features (e.g., the base rate vs. the likelihood in an inference problem) across otherwise identical choice occasions. Second, systematic context-driven instability: when an irrelevant context feature kappa_{P,i} drifts away from a category&amp;rsquo;s diagnostic kappa_{c,i}, the probability of that category falls discontinuously, causing a discrete switch in the attention profile and hence in valuation. Third, experience-driven heterogeneity: people more frequently exposed to a category (higher F_c) are more likely to use it, producing persistent differences in price elasticities or probability weighting at constant income and tastes.&lt;/p&gt;
&lt;p&gt;Applied to riskless consumer choice, the paper introduces two categories — &amp;ldquo;buying&amp;rdquo; (full attention to price, partial to quality: alpha_{M_g}=1 &amp;gt; alpha_{Q_g}=alpha) and &amp;ldquo;consuming&amp;rdquo; (full attention to quality, partial to price: alpha_{Q_g}=1 &amp;gt; alpha_{M_g}=alpha). A jam problem categorized as buying yields valuation v = alpha&lt;em&gt;q - eta&lt;/em&gt;p; categorized as consuming, v = q - alpha&lt;em&gt;eta&lt;/em&gt;p. The valuation jumps discontinuously as context crosses a threshold kappa*, which shifts when relative category frequency F_{buy}/F_{con} changes. This framework accounts for context-dependent price elasticities (Wakefield and Inman 2003), poverty-driven excess price focus (Shah et al. 2018), de-commoditization through advertising, and mental accounting anomalies including opportunity cost neglect and the sunk cost fallacy — both arising because con neglects capital gains (alpha_{con,Delta_M}=0) and buy neglects quality shocks (alpha_{buy,Delta_Q}=0).&lt;/p&gt;
&lt;p&gt;Applied to statistical judgment, the paper introduces two categories — &amp;ldquo;frequency estimation&amp;rdquo; (attention alpha_1=1 to a single i.i.d. draw from a known DGP) and &amp;ldquo;agnostic inference&amp;rdquo; (attention alpha_S=1 to the share of heads as a sufficient statistic). The threshold N* separates recognition: for sequence length N_P &amp;lt; N*(F_{freq}/F_{inf}), the decision maker categorizes as frequency and correctly assesses odds; for N_P &amp;gt;= N*, she switches to inference and overweights balanced sequences, producing the Gambler&amp;rsquo;s Fallacy. The same competition between categories also accounts for base rate neglect, conjunction fallacy, and correlation neglect, with the bias strengthening as sequences grow longer.&lt;/p&gt;
&lt;p&gt;Applied to risky choice, bottom-up salience — sensory prominence and contrast — interacts with categorization. A publicity shock drawing attention to a low-probability contamination risk raises similarity to &amp;ldquo;consuming,&amp;rdquo; triggering a category switch that amplifies attention to quality broadly and reduces attention to price, producing large valuation drops disproportionate to the actual probability shift. This mechanism generates the framing effects of prospect theory without a stable S-shaped utility function: gains and losses frames correspond to different contexts activating different categories.&lt;/p&gt;
&lt;p&gt;Scope conditions: the theory applies when features and their values are fully known to the decision maker (no uncertainty about attributes), so the distortions take the form of altered sensitivity to known features rather than missing information. The set of categories C is taken as given in the formal analysis, though the authors discuss endogenization as future work.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central departure from standard rational inattention and noisy-perception models?&lt;/p&gt;
&lt;p&gt;A: Standard models (Sims 2003, Woodford 2012, Enke and Graeber 2023) produce unimodal, stably weighted valuations — the decision maker&amp;rsquo;s weighting of features is a smooth function of payoff-relevant costs or priors. In this paper, the weighting is determined by problem recognition, which is discrete and stochastic, producing within-person multi-modal attention: the same person can cluster on entirely different features across identical problems. The authors cite direct evidence from Bordalo, Conlon, Gennaioli, Kwon, and Shleifer [20] showing bimodal clustering on base rates vs. likelihoods in statistical problems, a pattern inconsistent with stable-weighting models.&lt;/p&gt;
&lt;p&gt;Q: How is perceived value distorted when the attention weight on a hedonic feature is below 1?&lt;/p&gt;
&lt;p&gt;A: The perceived value of hedonic feature i is u_i(alpha_P) = alpha_{P,i} * u_i + (1 - alpha_{P,i}) * u_bar_i, where u_bar_i is the average value of that feature across options in the menu. An attention weight of zero collapses perceived variation in that feature to zero; full attention recovers the true value. The implication is that under-attention shrinks the decision maker&amp;rsquo;s effective sensitivity to a known attribute, causing systematic under- or over-valuation relative to a rational benchmark while tastes (marginal utilities) are held fixed.&lt;/p&gt;
&lt;p&gt;Q: How is perceived probability distorted?&lt;/p&gt;
&lt;p&gt;A: With attention weight alpha_{P,W} on event W, the perceived probability of event e is P(e)^{alpha_{P,W}} / sum_{e&amp;rsquo;} P(e&amp;rsquo;)^{alpha_{P,W}}, which compresses the distribution toward uniform as alpha_{P,W} falls toward 0 and recovers the true distribution at alpha_{P,W}=1. In the jam example, under-attention to the small probability of spoilage causes the decision maker to overestimate the risk of contamination. For multi-dimensional event vectors the formula generalizes multiplicatively, allowing &amp;ldquo;editing out&amp;rdquo; of entire event dimensions (e.g., urn selection in a balls-and-urns problem) when their attention weight hits zero.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism for context-dependent price elasticity?&lt;/p&gt;
&lt;p&gt;A: When context kappa_P is below threshold kappa*(F_{buy}/F_{con}), the decision maker categorizes the problem as &amp;ldquo;buying&amp;rdquo; and her valuation is v = alpha&lt;em&gt;q - eta&lt;/em&gt;p, giving a high price sensitivity (coefficient eta) and attenuated quality sensitivity (coefficient alpha &amp;lt; 1). Above kappa*, she categorizes as &amp;ldquo;consuming&amp;rdquo; and valuation is v = q - alpha&lt;em&gt;eta&lt;/em&gt;p, reversing the emphasis. Because the threshold kappa* is increasing in relative frequency F_{buy}/F_{con}, a decision maker with more buying experience has a higher threshold and thus acts as more price-elastic at any given context level. These elasticity differences arise without any change in the true marginal utility of money eta or quality q.&lt;/p&gt;
&lt;p&gt;Q: How does the model generate the sunk cost fallacy and opportunity cost neglect as a unified phenomenon?&lt;/p&gt;
&lt;p&gt;A: Both anomalies arise because buying and consuming categories selectively neglect shocks. In the football example, recognizing the problem as &amp;ldquo;buying&amp;rdquo; activates alpha_{buy,Delta_Q}=0, so the blizzard quality shock Delta_q&amp;lt;0 is ignored and the decision maker drives to the game as if the shock did not occur — the sunk cost fallacy. In the wine example, recognizing the problem as &amp;ldquo;consuming&amp;rdquo; activates alpha_{con,Delta_M}=0, so the capital gain Delta_p is ignored and the decision maker reports a zero or purchase-price cost — opportunity cost neglect. The unifying mechanism is that each category attends only to the features diagnostic of its prototypical experiences: buying attends to price paid and normal quality; consuming attends to realized quality and partly to price, but not to capital gains.&lt;/p&gt;
&lt;p&gt;Q: What comparative static does the model predict for sunk cost susceptibility based on experience?&lt;/p&gt;
&lt;p&gt;A: People with higher F_{buy} (more buying experiences, e.g. poverty experiences or having recently purchased but not yet consumed the good) exhibit more sunk cost fallacy and less opportunity cost neglect. Conversely, season ticket holders face many consuming experiences relative to one buying event, raising F_{con} and thus reducing susceptibility to the sunk cost fallacy for sports events. Making the blizzard more salient in the description shifts similarity toward &amp;ldquo;consuming,&amp;rdquo; also reducing the sunk cost fallacy through a different channel (bottom-up salience rather than experience).&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s explanation for the Gambler&amp;rsquo;s Fallacy, and what distinguishes it from prior accounts?&lt;/p&gt;
&lt;p&gt;A: The Gambler&amp;rsquo;s Fallacy arises when sequence length N_P exceeds threshold N*(F_{freq}/F_{inf}), causing the decision maker to switch from the frequency category (which attends to the 50:50 fairness of the coin) to the inference category (which attends to the share of heads). Under inference, the decision maker treats balanced and unbalanced sequences as representatives of their &amp;ldquo;share of heads equivalence class,&amp;rdquo; and the class of balanced sequences is larger, so balanced sequences receive higher estimated probability — the Gambler&amp;rsquo;s Fallacy. This differs from Rabin and Vayanos (2010), where the bias stems from a belief that the coin is drawn from a pool; here the decision maker knows the coin is fair (kappa_{P,U}=0.5) but the inference representation causes question substitution rather than a wrong model of the DGP.&lt;/p&gt;
&lt;p&gt;Q: How does the model make the Gambler&amp;rsquo;s Fallacy testable beyond length effects?&lt;/p&gt;
&lt;p&gt;A: The model predicts the bias is stronger for decision makers who recently solved many inference problems (lower F_{freq}/F_{inf}), and weaker when the 50:50 nature of flips is made bottom-up salient in the choice context (because salience raises similarity to the frequency category, hindering recognition of inference). These cognitive proxies — experience frequencies and bottom-up salience — are orthogonal to the statistical content of the problem and thus allow identification of the mechanism separately from changes in information or incentives.&lt;/p&gt;
&lt;p&gt;Q: How does the model produce framing effects in risky choice without a stable S-shaped utility function?&lt;/p&gt;
&lt;p&gt;A: Gains and losses frames are modeled as different context vectors kappa_P that differentially increase similarity to a &amp;ldquo;safe outcome&amp;rdquo; category or a &amp;ldquo;risk&amp;rdquo; category. Recognizing the problem as the safe-outcome category shifts attention toward the certain option; recognizing it as the risk category shifts attention toward variance. The reversal of preferences between gain and loss frames (the Asian Disease problem, Tversky and Kahneman 1981) thus emerges from context-driven re-categorization rather than from a fixed probability weighting function. The novel prediction is that framing effects should be stronger for decision makers with more experience with the category activated by each frame, and weaker when bottom-up salience of the alternative frame&amp;rsquo;s features is raised.&lt;/p&gt;
&lt;p&gt;Q: How does bottom-up salience interact with top-down categorization in the contamination example?&lt;/p&gt;
&lt;p&gt;A: A publicity shock alpha_{delta,Q_b}&amp;gt;0 raises baseline attention to the spoiled-jam quality feature, increasing the similarity of the current problem to the &amp;ldquo;consuming&amp;rdquo; category (where quality is focal). This triggers a category switch for marginal agents, activating the full consuming attention profile — which attends to quality broadly, not just to contamination specifically, and reduces attention to price. The resulting valuation drop is therefore disproportionate to the actual probability of contamination and exhibits price insensitivity, because re-categorization shifts the entire attention profile rather than just updating a single probability.&lt;/p&gt;
&lt;p&gt;Q: How does the model relate to and distinguish itself from case-based decision theory (Gilboa and Schmeidler 1995) and analogical reasoning (Mullainathan 2002, Fryer and Jackson 2008)?&lt;/p&gt;
&lt;p&gt;A: In Gilboa-Schmeidler and related models, the decision maker uses past cases to resolve uncertainty about unknown attributes of current options; attention is full and the mechanism is extrapolation of payoffs from similar cases. In Mullainathan (2002) memory-based model, categories again serve to fill in missing information. In this paper, there is no uncertainty about attributes — features and their values are fully known — and the distortion instead takes the form of altered sensitivity to known features through selective attention. This allows the model to produce biases even in simple problems with full data disclosure, and to explain phenomena like base rate neglect and price insensitivity that are not primarily about missing information.&lt;/p&gt;
&lt;p&gt;Q: What does the model predict about within-person versus across-person distributions of valuations?&lt;/p&gt;
&lt;p&gt;A: Within a person, attention is multi-modal (bimodal in the two-category case) because categorization is stochastic. However, if many categories are possible across the population, the aggregate distribution of valuations can appear approximately unimodal even though each individual&amp;rsquo;s distribution is not. This distinction is empirically important: a researcher observing average choices may incorrectly infer smooth preference heterogeneity when the underlying mechanism is discrete category switching.&lt;/p&gt;
&lt;p&gt;Q: What cognitive proxies does the model propose for empirical identification?&lt;/p&gt;
&lt;p&gt;A: The theory links endogenous attention and choice to three observable (or measurable) proxies: (1) past experience frequencies F_c, measurable from administrative histories, surveys about past exposure, or experimental manipulation of training; (2) contextual similarity, measurable from field or experimental variation in irrelevant context features; and (3) bottom-up salience, experimentally controllable via prominence or contrast manipulations. The key identification logic is that these proxies are payoff-irrelevant — they do not change tastes, information, or the objective choice problem — yet predict systematic shifts in choice through their effect on recognition.&lt;/p&gt;
&lt;p&gt;Problem Recognition: The first step in the decision maker&amp;rsquo;s choice process, in which she jointly selects an attention vector alpha_P and a category c* by maximizing weighted similarity between the current problem (characterized by its context vector kappa_P) and the prototype of a past category (alpha_c, kappa_c), multiplied by the category&amp;rsquo;s time-discounted frequency F_c. Recognition is not about resolving uncertainty over attributes but about selecting which known attributes to attend to.&lt;/p&gt;
&lt;p&gt;Category: A partition element of the decision maker&amp;rsquo;s memory database, indexed by a prototype attention-plus-context vector (alpha_c, kappa_c) and a frequency scalar F_c. The prototype encodes both the context features diagnostic of experiences in that category (binary alpha_{c,i} for i in Phi_K) and the attention to hedonic and event features (alpha_{c,i} for i in Phi_H union Phi_E) used when solving problems in that category. Examples in the paper: &amp;ldquo;buying&amp;rdquo; and &amp;ldquo;consuming&amp;rdquo; for riskless choice; &amp;ldquo;frequency estimation&amp;rdquo; and &amp;ldquo;agnostic inference&amp;rdquo; for statistical judgment.&lt;/p&gt;
&lt;p&gt;Attention Weight (alpha_{P,i}): A scalar in [0,1] assigned to feature i of the current problem P. For hedonic features, alpha_{P,i}&amp;lt;1 collapses perceived variation toward the menu average; for event features, alpha_{P,i}&amp;lt;1 compresses perceived probabilities toward uniform. Full attention alpha_{P,i}=1 recovers expected utility. Attention weights are the endogenous output of the recognition step, not fixed preference parameters.&lt;/p&gt;
&lt;p&gt;Contextual Similarity S: A separable function measuring how close the current problem (alpha_P, kappa_P) is to a category prototype (alpha_c, kappa_c). It decreases in discrepancies in the attention vector (measured by a strictly increasing, convex function d) and in discrepancies in the values of context features diagnostic of the category (d_i(kappa_{P,i}, kappa_{c,i}) * alpha_{c,i}). Endogenous attention to context is set to reduce sensitivity to discrepancies, not to eliminate them.&lt;/p&gt;
&lt;p&gt;Mental Accounting (as categorization): In the paper&amp;rsquo;s account, non-fungibility, sunk cost fallacy, and opportunity cost neglect all arise because buying and consuming categories selectively attend to different monetary and quality features. The sunk cost effect is alpha_{buy,Delta_Q}=0; opportunity cost neglect is alpha_{con,Delta_M}=0. Mental accounts are not separate budget constraints but the by-product of category-specific attention profiles that were calibrated to normal-state experiences and do not generalize to shocks.&lt;/p&gt;
&lt;p&gt;Bottom-up Salience: Exogenous attention to a feature driven by sensory prominence (described by alpha_{delta,i} in the problem&amp;rsquo;s presentation vector) or payoff contrast (the DM attends more to features where her option&amp;rsquo;s value deviates more from the menu average relative to total menu variance). Bottom-up salience raises baseline attention to a feature before top-down categorization acts, and can trigger a category switch by raising similarity to the category for which that feature is focal.&lt;/p&gt;
&lt;p&gt;Gambler&amp;rsquo;s Fallacy via Question Substitution: In the model, the Gambler&amp;rsquo;s Fallacy arises when a long sequence length kappa_{P,N} causes recognition of the &amp;ldquo;agnostic inference&amp;rdquo; category, which focuses attention on the share of heads alpha_S=1. The decision maker then treats sequences as representatives of a &amp;ldquo;share of heads equivalence class,&amp;rdquo; and since the balanced class is larger than the unbalanced class, balanced sequences are assigned higher estimated probability. This is not a belief that the coin is unfair; it is question substitution induced by the inference representation.&lt;/p&gt;</description></item><item><title>A Housing Portfolio Channel of QE Transmission</title><link>https://macropaperwarehouse.com/papers/a-housing-portfolio-channel-of-qe-transmission/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-housing-portfolio-channel-of-qe-transmission/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper identifies and quantifies a &lt;em&gt;housing portfolio channel&lt;/em&gt; of quantitative easing (QE) transmission that operates through household portfolio rebalancing toward second homes (as opposed to the well-studied bank credit channel). The central question is whether, and how much, the ECB&amp;rsquo;s formal adoption of QE in January 2015 induced households with larger pre-existing bond holdings to shift wealth into residential real estate—specifically second homes held for investment—and what the downstream effects on regional housing market outcomes were.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Motivation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Germany is used as the empirical laboratory because it experienced a sustained housing boom from 2009 onward that was not accompanied by a household credit boom—a &amp;ldquo;housing boom without a credit boom.&amp;rdquo; The national house price-to-rent ratio rose markedly from 2009, especially accelerating after QE adoption in 2015, while the stock of mortgage credit to households as a share of GDP was flat or declining. This decoupling makes Germany well-suited for isolating a non-credit portfolio rebalancing mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Household-level data come from the Deutsche Bundesbank&amp;rsquo;s Panel on Household Finances (PHF), a triennial survey fielded in 2011, 2014, and 2017, from which the authors construct a panel of 1,651 households. The key exposure variable is each household&amp;rsquo;s pre-QE (2014) share of total wealth invested in bonds, both directly and indirectly via mutual funds and insurance. Regional housing outcomes (prices, rents, rental yields) are from Bulwiengesa AG for all 401 German administrative regions (Kreise) at annual frequency, and listing data come from Immoscout 24, Germany&amp;rsquo;s largest online real estate platform.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The household-level analysis uses a difference-in-differences (DiD) specification comparing changes in housing portfolio shares between the pre-QE wave (2014) and the post-QE wave (2017), against the pre-period change (2011 to 2014), with the degree of exposure measured by the 2014 bond share. The specification includes household and time fixed effects. A parallel-trends check using all three survey waves (Figure 2) shows that more- and less-exposed households tracked identically before QE adoption, diverging sharply thereafter. Two indirect placebo tests—using households&amp;rsquo; share in non-financial, non-housing assets as a spurious treatment, and using the change in non-financial assets as a spurious outcome—both return null results, supporting the identification assumption. For regional housing outcomes, the authors use a panel regression interacting lagged ECB debt-securities-to-GDP (the QE intensity measure) with a regional exposure variable—the 2008 pre-QE share of refugees housed in independent accommodations—across 401 regions from 2010 to 2017.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Benchmark portfolio rebalancing:&lt;/em&gt; A household with an ex-ante bond share that is 10 percentage points higher (roughly the interquartile range of the bond share distribution) increases its portfolio share of second homes by &lt;strong&gt;1.72 to 1.87 percentage points more&lt;/strong&gt; than a less-exposed household after QE adoption, conditional on household and time fixed effects. This result is statistically significant at the 1% level across multiple specifications and is robust to alternative bond share definitions, alternative portfolio denominators, and controlling for negative interest rate policy exposure (via initial deposit shares).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Equity rebalancing:&lt;/em&gt; Controlling for risk aversion does not attenuate the second-home result. Strikingly, households with larger ex-ante bond shares &lt;em&gt;reduce&lt;/em&gt;, rather than increase, their equity shares after QE (coefficient: −0.042, significant at 5%), ruling out the interpretation that the housing result merely picks up broad rebalancing toward all risky assets. This implies that cash purchases of second homes are funded by liquidating bonds, drawing down deposits, and also selling equities.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Heterogeneity—household characteristics:&lt;/em&gt; Rebalancing is stronger for (a) bank-advised households (triple-interaction significant at 5%), (b) financially more literate households (significant at 1%), and (c) households aged 40–60 (significant at 5%), consistent with a lifetime-income-peak, tax-optimization motive rather than a bequest motive. The result for age 61+ is positive but statistically insignificant.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Tax-motive heterogeneity:&lt;/em&gt; In Germany, rented-out second homes (or those declared for future letting) benefit from substantial tax deductions not available for owner-occupied primary residences, with the advantage rising in marginal tax rates. Rebalancing is stronger for higher-income households (triple interaction with income per capita positive and significant, especially after controlling for deposit shares) and for church-affiliated households, who face an additional 8–9% church tax surcharge on their regular tax bill, amplifying the tax gain from rental property deductions. For church members, the income-interaction triple coefficient is statistically significant; for non-church members it is not, directly linking the rebalancing gradient to the church tax burden.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Buy-to-let motive:&lt;/em&gt; The benchmark result is driven entirely by households that already owned a second home in the pre-QE period and were generating rental income from it (coefficient 0.821, significant at 1%); households without a pre-owned second home show a near-zero, statistically insignificant coefficient (0.000). This establishes that the rebalancing is driven by experienced buy-to-let investors, not vacation-home buyers or commuters.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Credit channel control:&lt;/em&gt; The portfolio rebalancing result is not driven by credit access or credit growth. The triple interactions of the bond-share × Post term with both (a) pre-QE leverage (mortgage credit to housing wealth) and (b) post-QE mortgage credit growth are statistically insignificant. Restricting the sample to households with no mortgage credit growth leaves the main coefficient essentially unchanged (0.175, significant at 1%). Nonetheless, an independent credit-channel effect is also present: mortgage credit growth has its own positive and significant effect on second-home share increases, confirming the two channels operate in parallel but independently.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Regional housing market outcomes—prices and yields:&lt;/em&gt; In regions more exposed to rental market tightness (higher refugee-in-independent-accommodation share), QE is associated with larger declines in rental yields. A one-standard-deviation increase in QE (approximately 4.3 pp higher ratio of ECB debt securities to GDP) reduces the rental yield in the 75th-percentile-exposure region relative to the 25th-percentile region by &lt;strong&gt;2 to 12 basis points per year&lt;/strong&gt; (depending on whether the refugee share or the renter share is used as the exposure measure). As ECB holdings rose from 7% of GDP in 2014 to 24% in 2017, the cumulative implied rental yield decline at the regional interquartile range is 8 to 48 basis points, sizable relative to the average regional rental yield decline of 140 basis points (from 7.4% to 6.0%) over the same period. House prices increase more than rents in more exposed regions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Regional housing market outcomes—listings:&lt;/em&gt; Using Immoscout 24 data, both sale and rental listings decline in more exposed regions as QE expands, but the &lt;em&gt;ratio&lt;/em&gt; of sale to rental listings falls significantly: sale listings decrease significantly more than rental listings in more exposed regions. This relative shift in supply toward the rental market is interpreted as evidence consistent with the buy-to-let motive documented at the household level and as potentially having benign implications for housing affordability through increased rental supply.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All household-level findings are conditional on the German institutional setting: Germany&amp;rsquo;s combination of a low-homeownership norm, substantial tax incentives favoring rental properties, triennial household survey data spanning one pre- and one post-QE wave, and a housing boom that was decoupled from household credit prior to 2015. The regional results apply to 401 German administrative regions (Kreise) over 2010–2017, using exposure instruments that are argued to capture rental-market tightness or depth rather than direct household bond holdings.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the housing portfolio channel of QE transmission, and how does it differ mechanically from the credit channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the housing portfolio channel, the ECB&amp;rsquo;s bond purchases reduce the net supply of bonds available to private investors, raising bond prices and reducing expected bond returns. Under the assumption that bonds and houses are substitutes in household portfolios, households with larger initial bond positions rebalance toward housing to restore their target allocation, bidding up house prices. This mechanism operates through changes in risk premia rather than through future short-term rates or bank reserves and loan supply. The credit channel, by contrast, operates through increased bank reserves enabling expanded mortgage lending. The authors show empirically that the two channels operate in parallel and independently, but that greater prior credit access and post-QE mortgage credit growth do not amplify the portfolio rebalancing effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the key exposure variable and why is it a valid identification strategy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The exposure variable is each household&amp;rsquo;s 2014 (pre-QE) share of total wealth invested in bonds, including both direct holdings and indirect holdings via mutual funds and insurance companies. The logic, drawn from the bank-portfolio-rebalancing literature (Rodnyansky and Darmouni, 2017; Luck and Zimmermann, 2020) and from the authors&amp;rsquo; own portfolio model, is that the larger a household&amp;rsquo;s bond share, the stronger its incentive to rebalance when the central bank reduces bond supply. Identification rests on the parallel-trends assumption: Figure 2 shows that before 2015, more- and less-exposed households (defined by a median split on the 2014 bond share) followed identical trends in second-home shares; the trends diverge sharply post-QE. Two indirect placebo tests corroborate this: using a spurious treatment variable (non-financial, non-housing asset share) and using a spurious outcome (change in non-financial, non-housing asset share) both yield null results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the benchmark magnitude of the portfolio rebalancing effect and how robust is it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: A 10-percentage-point higher 2014 bond share (the approximate interquartile range) is associated with a 1.72–1.87 percentage point larger increase in the second-home portfolio share post-QE relative to the pre-QE period (Table 3, columns 1–2, significant at 1%). This result is robust to: scaling second-home shares by a model-consistent denominator (bonds + housing + deposits, column 3); using total housing wealth instead of second-home wealth alone (column 4); using the count of second homes rather than their value share to rule out valuation-effect confounds (column 5); using direct bond holdings without imputation, or indirect holdings only, as alternative exposure measures (columns 7–8, where the coefficients are if anything larger at 0.403 and 0.420); controlling for a broad set of time-varying household characteristics including net worth, age, household size, financial literacy, and risk aversion (Table 4, range 0.19–0.23); and explicitly controlling for the deposit-share post-interaction to rule out the negative interest rate policy as a driver (column 6, main bond coefficient unchanged at 0.122).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Do households with higher bond exposure also rebalance toward equities after QE?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. Column (7) of Table 4 shows that households with larger ex-ante bond shares &lt;em&gt;reduce&lt;/em&gt; their equity shares after QE adoption (coefficient: −0.042, significant at 5%). This rules out the interpretation that the second-home finding merely captures broad rebalancing toward all risky assets due to general risk-appetite changes. Combined with the evidence that deposit shares also decline (though not precisely estimated), the result implies that households fund second-home purchases by selling bonds, drawing down deposits, &lt;em&gt;and&lt;/em&gt; reducing equity positions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Which household characteristics amplify the rebalancing, and what do they reveal about the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Five characteristics are shown to amplify rebalancing (Table 5 and Table 7): (1) being actively advised by a bank on asset allocation (triple interaction significant at 5%), consistent with banks that own real estate agencies steering clients toward property; (2) higher financial literacy (significant at 1%), consistent with more informed investors acting more quickly on QE-induced return differentials; (3) middle age (40–60), significant at 5%, but not older age (61+), ruling out bequest motives and pointing to households near their lifetime income peak optimizing their tax burden; (4) higher income per capita (positive and significant, especially among church members), reflecting the progressive German tax schedule that makes property-related deductions more valuable; and (5) church affiliation (the income-triple interaction is significant only for church members, who face an 8–9% church tax surcharge, amplifying the tax advantage of rental property ownership). Tenure status (renter vs. owner of main residence) shows that both groups rebalance, but the triple interaction is significant only at 10%, suggesting the effect is not confined to existing homeowners.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the buy-to-let motive established directly in the data, as opposed to vacation-home or commuter motives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors use variation in whether households owned a second home and generated rental income from it &lt;em&gt;before&lt;/em&gt; QE adoption (Table 8). Households that owned a second home and reported rental income in the pre-QE wave rebalance very strongly (coefficient 0.821 on Bonds × Post, significant at 1%). Households that owned a second home but did not generate rental income show a positive but imprecisely estimated coefficient (0.641, significant at 10% in a very small sub-sample of 138 households). Critically, households that did not own any second home prior to QE show a coefficient of essentially zero (0.000). This pattern establishes that rebalancing is driven by experienced buy-to-let investors rather than by households acquiring second homes for personal use, and is consistent with the income-seeking motive documented in the Australian context by Gargano and Giacoletti (2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the paper demonstrate that the effect is independent of the credit channel, while also acknowledging the credit channel operates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper employs three complementary tests (Table 6). First, triple interactions of the Bonds × Post coefficient with pre-QE leverage (mortgage-to-housing-wealth ratio) and with post-QE mortgage credit growth are both statistically insignificant (columns 5–6 of Table 5), meaning that greater credit access does not amplify the bond-share rebalancing effect. Second, restricting the sample to households with zero mortgage credit growth between 2014 and 2017 leaves the main coefficient unchanged at 0.175 (column 1 of Table 6). Third, including the two credit variables as additional controls only marginally reduces the bond-share coefficient without affecting its significance (columns 2–3 of Table 6). At the same time, column 3 of Table 6 shows that mortgage credit growth &lt;em&gt;does&lt;/em&gt; have its own statistically significant positive effect on second-home shares (coefficient 0.009, significant at 1%), confirming a separate, independently operating credit channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How is regional exposure to the channel proxied, given that household survey data cannot be aggregated to the regional level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because the 1,651-household panel provides only 3–4 observations per region on average across 401 German Kreise, the authors cannot construct representative regional averages of household bond shares. Instead, they use the pre-QE (2008) share of refugees housed in independent accommodation in each region as developed by Bednarek et al. (2021), arguing that a larger refugee share creates tighter rental housing market conditions and therefore makes buy-to-let investment more attractive. For robustness, they also use the 2011 census share of renters in each region as an alternative measure of rental market depth. Both regional exposure variables take higher values in urban areas (refugee share: 21% urban vs. 10% rural; renter share: 70% urban vs. 46% rural), consistent with household-level rebalancing being stronger in urban regions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the quantitative effects on regional rental yields, house prices, and rents?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 9 shows that a one-standard-deviation increase in QE (approximately 4.3 percentage points higher ECB debt securities-to-GDP ratio) reduces the rental yield in a region at the 75th percentile of the refugee-share exposure distribution relative to the 25th percentile by 2 basis points per year (using the refugee share) to 12 basis points per year (using the renter share). Comparing the 5th vs. 95th percentile of exposure, the yield differential is 5–24 basis points per year. Over the full 2014–2017 QE expansion (from 7% to 24% of GDP), the cumulative implied rental yield decline at the interquartile range of exposure is 8 to 48 basis points—sizable relative to the average regional decline of 140 basis points. House prices increase more than rents in more exposed regions. Using the Campbell-Shiller decomposition, about 70% of return variation is attributable to future price-to-rent increases, 36% to lower future rent growth (consistent with more rental supply), and only 5% to discount rate differentials.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What do the listing data reveal about the supply implications of the channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 10 shows that QE reduces both sale and rental listings in more exposed regions (both significant at 1%), consistent with the aggregate national decline visible from 2015 onward. Critically, the &lt;em&gt;ratio&lt;/em&gt; of sale listings to rental listings declines significantly in more exposed regions: sale listings fall more than rental listings (columns 3 and 6, significant at 1% with both exposure measures). This relative shift implies that the share of properties available for rent increases relative to properties available for sale in regions more exposed to the portfolio rebalancing channel, providing evidence of an expanded rental supply. This finding is interpreted as a potentially beneficial side effect of QE-induced buy-to-let investment for housing affordability, to the extent that a larger rental supply mitigates rent increases even as house prices rise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the theoretical model underlying the empirical analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model (Appendix C) features a representative local household with mean-variance preferences managing a portfolio of bonds, housing, and cash (equities are omitted for tractability). Preferred habitat investors segment both the national bond market and the local housing market. QE reduces the fixed net supply of bonds, raising bond prices and reducing expected bond returns. Under the substitutability of bonds and houses, households rebalance toward housing to restore optimal allocation, bidding up house prices; the larger the initial bond share, the larger the required rebalancing. Housing supply constraints determine how much rebalancing depresses expected housing returns (rental yields). The model does not unambiguously predict the response of the cash (deposit) share, motivating the empirical investigation reported in column (6) of Table 3.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the aggregate household balance sheet patterns consistent with the individual-level results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 1 shows that Germany&amp;rsquo;s aggregate household real estate share rose from 55% of total assets in 2014 to 56–57% in 2017–2018, while the bond share declined by roughly 0.5 percentage points. The homeownership rate declined by about 2 percentage points over the sample period (from 52.5% in 2014 to 51.4–51.5% in 2017–2018), consistent with an increasing share of landlords and renters—which is compatible with the buy-to-let mechanism since more than 60% of German renters lease from other households. Household leverage also declined (loans-to-assets from 13% in 2014 to 12% in 2017), consistent with portfolio rebalancing rather than credit-driven housing acquisition. The deposit share remained constant over the period, weighing against the negative-interest-rate policy as a driver of portfolio rebalancing.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Housing portfolio channel of QE transmission:&lt;/strong&gt; The paper&amp;rsquo;s central concept—a mechanism by which central bank bond purchases (QE) induce households holding bonds to rebalance their portfolios toward second homes held for investment (buy-to-let), operating through changes in risk premia (bond prices and expected returns) rather than through bank lending channels or future short-term interest rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex-ante bond share (QE exposure measure):&lt;/strong&gt; Each household&amp;rsquo;s share of total wealth invested in bonds (direct holdings plus indirect holdings via mutual funds and insurance) measured in the 2014 pre-QE survey wave. Used as a continuous household-level treatment intensity: the larger this share, the stronger the portfolio pressure to rebalance when the ECB reduces bond supply to the private sector. Corresponds roughly to 10 percentage points per interquartile range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Buy-to-let motive:&lt;/strong&gt; In the paper&amp;rsquo;s usage, the investment purpose of purchasing second homes specifically to rent them out—or to declare them for future letting—in order to exploit Germany&amp;rsquo;s substantial tax advantages for rented properties (depreciation allowances, deductibility of mortgage interest, management costs, and property taxes against rental income), which are unavailable for owner-occupied primary residences. Distinguished from vacation-home or commuter motives by the presence of pre-QE rental income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Segmented housing markets / preferred habitat investors:&lt;/strong&gt; Assumptions embedded in the paper&amp;rsquo;s theoretical model (following Flavin and Yamashita, 2002; Gete and Reher, 2018; Greenwald and Guren, 2021) that local real estate markets are insulated from national or international housing markets, and that some investors have a binding preference to hold bonds or local housing, so that QE-induced price changes in the bond market are not fully arbitraged away by shifting into liquid alternatives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Parallel trends (DiD validity):&lt;/strong&gt; The identifying assumption that, absent QE, households with larger and smaller initial bond shares would have followed the same trajectory in their second-home portfolio shares. The paper documents this graphically using all three survey waves (Figure 2) and supports it with two indirect placebo tests involving unrelated treatment and outcome variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regional rental yield:&lt;/strong&gt; The rent-to-price ratio at the regional (Kreise) level, derived from Bulwiengesa data. Used as the primary regional outcome variable because it jointly captures discount rate, rent-growth, and price-to-rent dynamics. A Campbell-Shiller decomposition decomposes its predictive content into three components: discount rates (5%), future rent growth (36%), and future price-to-rent ratio changes (70%) in the German regional panel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sale-to-rental listing ratio:&lt;/strong&gt; The ratio of sale listings to rental listings for apartments on Immoscout 24, used as a quantity-side outcome variable. A decline in this ratio in more-exposed regions is interpreted as evidence of a relative increase in rental supply, consistent with the buy-to-let motive and with potentially beneficial implications for housing affordability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Church tax (Kirchensteuer):&lt;/strong&gt; A German institutional feature—formally affiliated church members pay an additional 8–9% surcharge on their regular income tax bill (varying by state). Because the tax advantage of owning rental property is proportional to the marginal tax rate, church members face a higher effective marginal tax rate and thus derive larger tax benefits from buy-to-let investment, producing stronger QE-induced portfolio rebalancing for this sub-group.&lt;/p&gt;</description></item><item><title>A Model of Multiple Hypothesis Testing</title><link>https://macropaperwarehouse.com/papers/a-model-of-multiple-hypothesis-testing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-model-of-multiple-hypothesis-testing/</guid><description>&lt;p&gt;This paper develops an economic framework for determining when and how much multiple hypothesis testing (MHT) adjustment is warranted in research settings. The research question is: under what conditions do MHT adjustments arise as an optimal solution to incentive misalignment between a researcher and a mechanism designer (social planner)?&lt;/p&gt;
&lt;p&gt;The model is a two-stage game. In the first stage, a benevolent social planner commits to a hypothesis testing protocol. In the second stage, a researcher decides whether to conduct a pre-specified experiment based on private costs and benefits. The planner&amp;rsquo;s utility function combines an ambiguity-averse (maximin) component—limiting harm from mistaken conclusions—with an expected-utility component capturing the generic benefits of research production. The framework focuses on multiplicity arising from testing multiple treatments or estimating effects within multiple subpopulations; multiple outcomes are treated as an economically distinct case covered in a companion paper.&lt;/p&gt;
&lt;p&gt;The main theoretical result is that separate t-tests are uniformly globally optimal under linearity of the researcher&amp;rsquo;s payoff and welfare functions and normality of test statistics. The optimal critical value takes the explicit form: t(J, Σ) = Φ⁻¹(1 − C(J, Σ) / (b · |J|)), where |J| is the number of hypotheses, C(J, Σ) is the experiment cost, and b is the researcher&amp;rsquo;s per-rejection benefit. This formula nests two limiting cases. When costs are fully fixed (invariant to |J|), the formula delivers a Bonferroni correction. When costs scale proportionally with the number of hypotheses, no MHT adjustment is warranted—because the researcher already faces sufficient deterrent from the incremental cost of each additional test.&lt;/p&gt;
&lt;p&gt;The key economic mechanism is as follows. In the worst states of the world (where all treatments are harmful relative to the status quo), a research study has only downside risk for society. The planner must keep the researcher&amp;rsquo;s expected payoff from false positives low enough that she chooses not to experiment. If critical values were invariant to |J|, for sufficiently many hypotheses the researcher&amp;rsquo;s expected payoff from false positives alone would exceed costs, inducing unwanted experimentation. Some upward adjustment to critical values (i.e., tighter thresholds) is therefore generically optimal. The same logic implies that critical values should also adjust for sample size, since larger samples raise costs.&lt;/p&gt;
&lt;p&gt;The framework is calibrated to two empirical applications. For FDA clinical trial approval, using Sertkaya et al. (2016) data on approximately 31,000 U.S. pharmaceutical trials (2004–2012), fixed costs constitute approximately 46% of average total trial cost. At a benchmark significance level of 5% and benchmark sample size, the optimal level is approximately 3.2% for two tests, 2.6% for three tests, and asymptotes to approximately 1.4% as |J| → ∞. Sidak&amp;rsquo;s correction yields 2.5% and 1.7% for two and three tests respectively, and tends to zero as |J| → ∞—more conservative than the model implies. Optimal adjustments must also be less conservative for larger samples to preserve researcher incentives to bear the correspondingly larger costs.&lt;/p&gt;
&lt;p&gt;For program evaluation in development economics, the paper uses a unique dataset of funding proposals submitted to J-PAL from 2009 to 2021. The estimated cost elasticity with respect to the number of treatment arms ranges from 0.13 to 0.22 (p &amp;lt; 0.05), indicating costs rise significantly but far less than proportionally. The implied optimal significance levels are slightly less conservative than Bonferroni/Sidak corrections but more conservative than unadjusted testing.&lt;/p&gt;
&lt;p&gt;Scope conditions: the framework assumes pre-specified experiments (no p-hacking), linear payoffs, normally distributed statistics, and a researcher whose preferences are common knowledge. The analysis focuses on multiple treatments and subpopulations, not multiple outcomes. Results extend to imperfectly informed researchers and heterogeneous variances.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism by which MHT adjustments arise as optimal in this framework?
A: The planner must deter experimentation in the worst-case states—those where all treatments are harmful. If the testing protocol did not adjust for the number of hypotheses, a researcher testing sufficiently many hypotheses could earn enough expected payoff from false positives alone to justify experimentation, even when all treatments are truly harmful. Tighter critical values (higher thresholds) reduce the probability of false positives and thus cap the researcher&amp;rsquo;s expected payoff in the null space, deterring unwanted experimentation. This is the maximin optimality condition: the researcher&amp;rsquo;s expected payoff must be non-positive over the null space.&lt;/p&gt;
&lt;p&gt;Q: What are the two limiting cases of the optimal critical value formula, and what do they correspond to?
A: The optimal level of the separate t-tests is α(J, Σ) = C(J, Σ) / (b · |J|). When C(J, Σ) = ᾱ (costs are fixed, invariant to the number of hypotheses), this reduces to ᾱ/|J|, the Bonferroni correction. When C(J, Σ) = ᾱ · |J| (costs scale proportionally with the number of hypotheses), the optimal level equals ᾱ regardless of |J|—no MHT adjustment is warranted. The intuition for the second case is that proportional costs already deter excess testing; the researcher has no undue incentive to test many hypotheses because each additional test costs the same incremental amount.&lt;/p&gt;
&lt;p&gt;Q: Why do optimal critical values also depend on sample size, and what is the policy implication?
A: Since research costs C(J, Σ) increase with sample size (Σ captures design features including sample size), the optimal test level α(J, Σ) = C(J, Σ)/(b·|J|) rises with sample size. Equivalently, larger studies warrant less conservative significance thresholds. The policy implication is that a single uniform correction (e.g., Bonferroni at the 5% level) applied without regard to sample size is suboptimal: it is too conservative for large studies, which would over-deter valuable high-powered research.&lt;/p&gt;
&lt;p&gt;Q: What are the two optimality properties required of protocols in the paper&amp;rsquo;s main characterization?
A: The paper shows (Proposition 3.1) that a protocol is uniformly globally optimal—optimal for all values of the welfare weight λ and prior π—if and only if it is both maximin optimal and unbiased. Maximin optimality (Proposition 3.2) requires two conditions: the researcher&amp;rsquo;s expected payoff must be non-positive over the null space (deterring experimentation when all treatments are harmful), and expected welfare must be non-negative when some treatments are beneficial. Unbiasedness requires that the researcher&amp;rsquo;s maximum power strictly exceeds the test size, ensuring that experimentation is motivated when treatments are genuinely beneficial.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rationalize conventional hypothesis testing asymmetry (type I vs. type II error weighting) without extreme restrictions?
A: In Tetenov (2012), justifying 5%-level testing with minimax regret in a single-agent model requires the decision-maker to place 102 times more weight on type I than type II regret—an extreme restriction. In this paper, the asymmetry arises naturally from the planner&amp;rsquo;s desire to prevent harmful treatment implementation: the planner is willing to forgo some power (probability of detecting beneficial treatments) to ensure that harmful treatments are not implemented. The researcher&amp;rsquo;s private incentives and the planner&amp;rsquo;s objective diverge in a way that makes tight size control endogenously optimal.&lt;/p&gt;
&lt;p&gt;Q: What does the FDA empirical calibration imply quantitatively about optimal versus standard adjustments?
A: Using Sertkaya et al. (2016) data showing that fixed costs are 46% of average total trial cost for U.S. pharmaceutical trials, and using Pocock et al. (2002) to set J̄ = 3 (average number of subgroups), the paper calculates that at a benchmark level of ᾱ = 0.05: the optimal level is approximately 3.2% for two tests, 2.6% for three tests, and asymptotes to approximately 1.4% as |J| → ∞. By contrast, Sidak&amp;rsquo;s correction yields 2.5%, 1.7%, and zero, respectively. Both the unadjusted 5% and the Sidak/Bonferroni levels are therefore suboptimal—the unadjusted level is too permissive while standard FWER corrections are too conservative.&lt;/p&gt;
&lt;p&gt;Q: What do the J-PAL data reveal about optimal MHT adjustment in program evaluation?
A: Using the universe of J-PAL funding proposals from 2009 to 2021, the paper estimates the cost elasticity with respect to the number of treatment arms to be 0.13–0.22, which is statistically significant (p &amp;lt; 0.05) but far below 1 (the proportional case). This means costs rise with arms but much less than proportionally. As a result, optimal significance levels for program evaluation studies are slightly less conservative than Sidak/Bonferroni corrections (e.g., approximately 3.8–4.5% versus 2.5% at a two-arm study with ᾱ = 5%) but more conservative than unadjusted testing. The testing thresholds also vary moderately with sample size, with larger samples implying less conservative procedures.&lt;/p&gt;
&lt;p&gt;Q: When are cross-study MHT adjustments warranted according to the framework?
A: Cross-study MHT adjustments are warranted only when there are cost complementarities across those studies. If studies are conducted independently with separate cost structures, each study&amp;rsquo;s costs do not depend on the number of hypotheses tested in other studies, so no cross-study adjustment is optimal. This provides a principled resolution to the disputed question of whether researchers should correct for tests performed in other papers.&lt;/p&gt;
&lt;p&gt;Q: When is FWER control (e.g., Bonferroni or Sidak) the appropriate form of MHT adjustment?
A: Appendix B.2 shows that FWER control is appropriate when the researcher&amp;rsquo;s payoff is nonlinear—specifically when the researcher requires at least one positive finding to receive any benefit (e.g., to publish). In the baseline linear payoff model, average size control (Bonferroni) is the correct adjustment only when all costs are fixed. The broader insight is that the form of compound error control—whether average error rate or FWER—is itself determined by economic fundamentals rather than being a statistical choice made in advance.&lt;/p&gt;
&lt;p&gt;Q: How does the paper extend to cases of heterogeneous variances across hypotheses?
A: Proposition 5.2 shows that under heterogeneous variances, the optimal protocol uses separate t-tests based on sample-equalizing allocations—dividing the sample equally across treatment arms—with critical values t*(J, n(J)) = Φ⁻¹(1 − C(J, n(J))/(b·|J|)), where n(J) is the total sample size. This protocol remains maximin optimal and unbiased, preserving the main qualitative results.&lt;/p&gt;
&lt;p&gt;Q: What does the paper contribute relative to Tetenov (2016) on single-hypothesis testing?
A: Tetenov (2016) showed that in the single-hypothesis case, separate t-tests are maximin optimal and uniformly most powerful (UMP) unbiased. This paper extends that result to multiple hypotheses, but two major complications arise: first, maximin optimality in the multi-hypothesis case requires verifying that welfare is non-negative even when treatment effects have opposite signs, which requires a non-trivial argument absent in the single-hypothesis case; second, no protocol is UMP unbiased in the multi-hypothesis case, so the paper develops a weaker notion of unbiasedness (power exceeding size) that is sufficient to motivate experimentation.&lt;/p&gt;
&lt;p&gt;Q: Why do multiple outcomes require different procedures than multiple treatments or subpopulations?
A: Multiple outcomes and multiple treatments are economically distinct types of multiplicity. For multiple outcomes that are noisy proxies for a common underlying quantity, the optimal rule tests an index formed using statistical weights (as in Anderson, 2008). When outcomes capture distinct components of the planner&amp;rsquo;s utility, economic weights are appropriate. In contrast, multiple treatments or subpopulations lead to separate t-tests with cost-adjusted critical values. Conflating these two forms of multiplicity leads to incorrect inferences about what procedures are appropriate.&lt;/p&gt;
&lt;p&gt;Maximin optimality: A hypothesis testing protocol is maximin optimal if it maximizes the planner&amp;rsquo;s worst-case welfare across all parameter values, equivalent to two conditions: deterring researcher experimentation over the null space (where all treatments are harmful), and ensuring non-negative expected welfare when some treatments are beneficial.&lt;/p&gt;
&lt;p&gt;Unbiasedness (in the paper&amp;rsquo;s sense): A protocol is unbiased if the researcher&amp;rsquo;s maximum achievable power strictly exceeds the test size, ensuring that experimentation is motivated when treatments are genuinely beneficial. This is a weaker condition than UMP unbiasedness, which does not exist in the multi-hypothesis case.&lt;/p&gt;
&lt;p&gt;Uniform global optimality: A protocol is uniformly globally optimal if it maximizes the planner&amp;rsquo;s objective for all values of the welfare weight λ ≥ 0 and all priors π over the parameter space, making it robust to uncertainty about the relative importance of deterrence versus research motivation.&lt;/p&gt;
&lt;p&gt;MHT correction factor: Defined as C(J, Σ) / (C̄ · |J|), this factor captures how the cost per test varies as the number of hypotheses grows. It equals 1/|J| (Bonferroni) when all costs are fixed, and equals 1 (no correction) when costs are proportional to the number of tests; the empirically appropriate correction lies strictly between these extremes.&lt;/p&gt;
&lt;p&gt;Cost function C(J, Σ): The private cost borne by the researcher for conducting the experiment, which depends on both the set of treatments J and the experimental design Σ (including sample size). The degree of optimal MHT adjustment is a direct function of how this cost varies with the number of hypotheses tested.&lt;/p&gt;
&lt;p&gt;Global null space Θ₀(J): The set of parameter vectors θ for which the welfare effect of implementing any combination of treatments is strictly negative—i.e., the status quo of no treatment dominates all interventions. Maximin optimality requires deterring researcher experimentation over this set.&lt;/p&gt;
&lt;p&gt;Cost complementarities across studies: Cost structures in which conducting multiple studies together is cheaper than conducting them separately. Cross-study MHT adjustments are warranted if and only if such complementarities exist; absent complementarities, each study&amp;rsquo;s optimal threshold is set independently of others.&lt;/p&gt;</description></item><item><title>A Monetary-Fiscal Theory of Sudden Inflations</title><link>https://macropaperwarehouse.com/papers/a-monetary-fiscal-theory-of-sudden-inflations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-monetary-fiscal-theory-of-sudden-inflations/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do sudden inflations and currency crises occur, while symmetric sudden deflations never do? The paper asks whether treating nominal government bonds as analogous to ordinary corporate bonds — with an asymmetric payoff structure capped at face value on the upside but exposed to real losses when fiscal surpluses are insufficient — can generate a unified theory of these crises endogenously from a single model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intellectual Lineage and Approach.&lt;/strong&gt; The paper sits at the intersection of two literatures. The first is the Fiscal Theory of the Price Level (FTPL), originating with Leeper (1991), Sims (1994), and Sargent and Wallace (1985), which links the real value of nominal government debt to expected future surpluses. The second is the safe-asset literature, where Holmstrom (2015) and Gorton (2017) explain that assets can circulate as safe stores of value precisely because their backing is costly to investigate and consumers rationally remain uninformed. The paper applies this information-economics logic to nominal government bonds, so that consumers normally hold bonds without investigating the government&amp;rsquo;s true fiscal capacity, and only pay the cost to investigate when real repayment doubts become sufficiently severe.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Structure.&lt;/strong&gt; The model is a two-period reduced-form general equilibrium. In period 1, a representative consumer buys nominal government bonds at an interest rate set by the monetary authority. In period 2, the government must repay those bonds. The fiscal authority attempts to hit a price-level target P* by raising tax revenue, but faces a hard ceiling τ_max on the surplus it can collect — arising from Laffer limits on taxation, political constraints on austerity, or the need to fund financial-sector bailouts. The consumer has prior beliefs that τ_max is low (L) with probability π and high (H) with probability 1−π, and can pay a fixed utility cost γ to learn τ_max before deciding how many bonds to purchase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond Payoff Structure and Asymmetry.&lt;/strong&gt; The key mechanism is the asymmetric, bond-like real payoff of nominal government debt. If τ_max ≥ B1/P*, the government raises enough surplus to repay bonds fully in real terms at the price-level target; the real payoff is flat at face value (the &amp;ldquo;in-the-money&amp;rdquo; region). If τ_max &amp;lt; B1/P*, the government sets taxes to the ceiling τ_max and the price level rises above P* to balance the budget constraint, reducing the real payoff proportionally (the &amp;ldquo;default&amp;rdquo; region). Critically, because the nominal payoff is capped at face value, there is no upside region: governments will not run surpluses large enough to deliver a windfall to bondholders, so sudden deflations — analogous to a corporate bond being worth more than face value — cannot occur. This asymmetry is the direct source of the one-sided nature of crises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two Illustrative Mechanisms for Sudden Inflations.&lt;/strong&gt; The paper numerically and analytically characterizes two triggering scenarios:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Lower surplus expectations (fiscal stress narrative, corresponding to Burnside et al. 2001 on the 1997 Asian crisis)&lt;/em&gt;: As the probability π of a low future surplus (e.g., from a prospective banking-sector bailout) rises, the value of information about τ_max increases. In the numerical example (i = 0.05, γ = 0.13, L = 0.1), the value of information equals the cost γ at π = 0.15. For π above 0.15, consumers pay to investigate, learn τ_max = L, and refuse to purchase bonds beyond what will be repaid in real terms (B1 = τ_max = L = 0.1). The price level in period 1 rises discontinuously as a function of π at this threshold.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Interest rate increases (speculative attack narrative)&lt;/em&gt;: As the monetary authority raises the interest rate to defend a currency, consumers demand more bonds. Larger bond quantities increase the risk that surpluses will be insufficient, raising the value of fiscal information. In the numerical example (π = 0.5, γ = 0.24, 1+i ∈ [1, 1.2]), the value of information equals γ at 1+i = 1.1 (i.e., i = 10%). For interest rates above this threshold, consumers learn τ_max = L, restrict bond purchases to what will be repaid, and the price level in period 1 jumps discontinuously. Further interest rate increases above the threshold produce only upward drift in the price level, not additional monetary tightening effects — illustrating the limits of monetary policy in fiscally stressed environments.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Results.&lt;/strong&gt; Two formal theorems establish generality. Theorem 1 shows that, given bond demand B1(π) such that L &amp;lt; B1 for all π ∈ (0,1), there exist thresholds k and γ &amp;gt; 0 such that the period-1 price level P1 is discontinuous as a function of π on (0, k]. Theorem 2 establishes an analogous discontinuity in P1 as a function of the interest rate i, given that B1(i) &amp;gt; L for all i in the relevant range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The model is a two-period reduced form that abstracts from dynamics, multiple maturities, and secondary market trading. The informational friction is a fixed binary cost γ, not a richer signal structure. The results depend on the existence of a binding surplus ceiling τ_max; when the government is far from this ceiling (i.e., consumers&amp;rsquo; beliefs are far from the &amp;ldquo;default boundary&amp;rdquo;), shocks produce only small, smooth price-level changes. Large discontinuous price-level jumps require the economy to be near the kink point of the bond payoff curve.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the fundamental analogy that drives the paper&amp;rsquo;s theory, and what economic literature does it build on?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper analogizes nominal government bonds to corporate bonds (following Sargent 1982&amp;rsquo;s advice that &amp;ldquo;government debt is valued according to the same economic considerations that give private debt value&amp;rdquo;). Like a corporate bond, the nominal government bond pays its face value if the underlying project (government fiscal capacity) delivers a surplus at least equal to the face value, but pays only a share of the realized surplus if the surplus falls short. This bond-like payoff — flat on the upside, proportional to outcomes on the downside — is the direct source of asymmetric crisis dynamics. The paper combines this with Holmstrom (2015) and Gorton (2017)&amp;rsquo;s framework in which safe assets function because their backing is costly to investigate, so consumers rationally remain uninformed in normal times.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the key information friction, and how does it generate the switch between &amp;ldquo;normal times&amp;rdquo; and crisis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In normal times, consumers are confident that the government&amp;rsquo;s future maximum surplus τ_max is sufficient to repay bonds in real terms. The fixed utility cost γ of investigating the true surplus exceeds the benefit, so consumers remain uninformed and bonds trade at a price reflecting only uninformed prior beliefs. A crisis arises when the value of information V(.) rises above γ — either because the probability of a low surplus state rises (fiscal stress) or because the interest rate rises and consumers demand more bonds, bringing them closer to the repayment boundary. Once V &amp;gt; γ, consumers investigate and, upon learning τ_max = L (low surplus), refuse to hold bonds that will not be repaid in real terms, triggering a discrete upward jump in the price level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the bond payoff structure explain the absence of sudden deflations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The real payoff of a nominal government bond cannot exceed its face value: the bond is capped at face value on the upside because the government will not voluntarily raise tax surpluses to deliver a windfall to bondholders. In the event that surpluses turn out to be higher than needed (τ_max ≥ B1/P*), the government simply sets taxes to exactly repay the bonds at P* and returns no additional real value to bondholders. This is the flat portion of the payoff curve. Because there is no upside kink — no region where learning that τ_max is unexpectedly large causes the price level to fall sharply — there is no mechanism for sudden deflations symmetric to sudden inflations. The 1933 U.S. episode (Jacobson et al. 2019) is cited: when deﬂation from leaving gold would have required fiscal austerity for full real repayment, Roosevelt chose to exit the gold standard rather than allow deflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the first numerical example (lower surplus expectations) work quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline parameters are: i = 0.05, γ = 0.13, L = 0.1, H ≈ ∞, P* = 1, e1 = e2 = 1, B0 = 1, τ1 = 0.8, β = 1. The analysis is restricted to π ∈ (0, 0.3]. As π (probability that τ_max = L) rises, the value of information V(.) rises. At π = 0.15, V equals the cost γ = 0.13. For π &amp;gt; 0.15, consumers pay to investigate and, upon learning τ_max = L, purchase only B1 = L = 0.1 in bonds — the amount that will be repaid — causing the period-1 price level P1 to jump discontinuously from approximately 0.95 to approximately 1.13. For π ≤ 0.15, consumers remain uninformed and P1 rises only smoothly from below 1 as π increases (fewer bonds demanded as repayment risk rises, even without investigation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the second numerical example (interest rate increase) work quantitatively, and what does it imply for monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With π = 0.5, γ = 0.24, and 1+i ∈ [1, 1.2], as the monetary authority raises the interest rate, consumers demand more bonds, increasing real repayment risk and the value of information. At 1+i = 1.1 (i.e., i = 10%), V equals γ. For 1+i &amp;gt; 1.1, consumers investigate and learn τ_max = L; they then only purchase bonds up to the repayment limit, causing P1 to jump discontinuously to approximately 1.15. For interest rates above the threshold, further increases yield only a smooth upward slope in P1 (bond purchases are fixed in real amount but nominal revenue falls). This illustrates that the monetary authority&amp;rsquo;s ability to use higher interest rates to lower the price level is limited by the surplus constraint: once the interest rate is high enough to trigger consumer investigation and a fiscal crisis, raising rates further is inflationary rather than deflationary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the two regions of the deterministic model and how do they differ in fiscal and price-level dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the deterministic version (1-π = 0, so τ_max = L with certainty, and there is no uncertainty), the model produces two distinct regions. In the &amp;ldquo;insufficient surplus&amp;rdquo; region where τ_max &amp;lt; B1/P*, the fiscal authority sets taxes to their maximum τ_max, the real payoff of bonds is τ_max/B1 &amp;lt; 1, the period-1 price level P1 = B0/(βτ_max), and real bond revenue Π = βτ_max (constant in τ_max). Selling additional bonds does not raise additional real revenue because any extra bonds lead to a proportional rise in P2 and a fall in Q. In the &amp;ldquo;sufficient surplus&amp;rdquo; region where τ_max ≥ B1/P*, the government meets its fiscal target (τ2 = B1/P*), P2 = P* is hit, P1 = βB1/(B0P*), and Π = βB1/P* (increasing in B1). In this region, selling additional bonds does raise real revenue and lowers P1 as the government absorbs more money.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the two interest rate regions in the deterministic model, and what is their implication for monetary policy effectiveness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using B1 = B0(1+i) (debt rolled over at the chosen rate), the monetary authority has two interest-rate regions. In the &amp;ldquo;constrained&amp;rdquo; region where 1+i &amp;gt; τ_max P*/B0 (the surplus ceiling binds), raising i does not change the period-2 surplus (τ2 = τ_max), does not change real revenue (Π = βτ_max), and does not affect P1 — but raises P2 above the target P*. In the &amp;ldquo;unconstrained&amp;rdquo; region where 1+i ≤ τ_max P*/B0, raising i increases bond demand, increases real surplus backing, raises real revenue, and lowers P1 while P2 = P* is maintained. The boundary between these regions determines the limit of monetary policy: the monetary authority can reduce P1 by raising i only up to the point where the surplus ceiling would be hit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the paper relate to and extend prior FTPL literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper is grounded in the FTPL of Leeper (1991), Sims (1994), and Cochrane (2005, 2020), in which the price level is determined by the requirement that real government liabilities equal the present value of future surpluses. The paper&amp;rsquo;s contribution is to make the information structure endogenous: consumers&amp;rsquo; beliefs and their decision to acquire fiscal information determine whether or not the FTPL logic is operative. In normal times (consumers uninformed), the price level does not respond to changes in the maximum surplus — a result that resembles the &amp;ldquo;Ricardian&amp;rdquo; or non-FTPL regime. When consumers investigate and learn the surplus is insufficient, the connection between the surplus and the price level is restored, reproducing FTPL-type dynamics. This provides an endogenous, single-model rationale for the regime-switching behavior between FTPL and non-FTPL environments documented empirically in Bianchi and Melosi (2013, 2017) and Davig and Leeper (2006).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the welfare role of consumer ignorance in this framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Consumer ignorance of the government&amp;rsquo;s true surplus plays a dual role. On one hand, ignorance is individually rational in normal times because the cost γ of investigating exceeds the benefit V (.) when beliefs are comfortably away from the default boundary. On the other hand, following Dang et al. (2017), informed knowledge of the safe asset&amp;rsquo;s backing destroys the symmetric ignorance that supports the asset&amp;rsquo;s role as a safe store of value, reducing welfare. In this model the concern is repayment risk rather than adverse selection: the consumer fears not being repaid in real terms and chooses to investigate when that risk is sufficiently high, potentially triggering the very crisis they feared.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the scope conditions and limitations of the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is explicitly a two-period reduced form designed to illustrate the bond-payoff mechanism in the simplest possible setting. It abstracts from: multi-period bond maturities and secondary market trading; rich heterogeneity among consumers; endogenous monetary and fiscal policy responses beyond the simple rules specified; and the general equilibrium interactions between inflation, output, and labor markets. The information cost γ is modeled as a fixed binary cost rather than a continuous or richer signal structure. The results on discontinuous price-level jumps hold when bond demand is sufficiently large relative to L (i.e., L &amp;lt; B1), ensuring genuine repayment risk; when surpluses are very large relative to bond liabilities, no crisis dynamics arise.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Maximum Surplus (τ_max).&lt;/strong&gt; The paper&amp;rsquo;s name for the hard ceiling on the net tax revenue (taxes minus money transfers) the government can collect in the second period. This ceiling can arise from a Laffer limit on taxable income, political-economy constraints on austerity, or from a banking crisis requiring government transfers to bail out the financial sector. It is the paper&amp;rsquo;s analogue of a project&amp;rsquo;s liquidation value: the maximum the &amp;ldquo;project&amp;rdquo; (the government) can deliver to bondholders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond-Like Payoff of Nominal Government Debt.&lt;/strong&gt; The paper&amp;rsquo;s central structural claim: the real payoff to holding a nominal government bond is capped at face value on the upside (the government will not raise surpluses beyond what is needed to repay bonds at the price-level target) but falls proportionally below face value when τ_max is insufficient for full real repayment. This is precisely the payoff structure of a standard corporate bond — flat on the upside, proportional to recovery on the downside — and it is the source of the asymmetry between sudden inflations and the absence of sudden deflations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Value of Information (V(.)).&lt;/strong&gt; Defined as the difference in expected utility between a consumer who learns the true τ_max before making bond-purchase decisions and one who remains uninformed and acts only on prior beliefs π, 1−π. The consumer investigates if and only if V(.) &amp;gt; γ. V is zero when beliefs are certain (limπ→0 and limπ→1), can be hump-shaped in π, and is increasing in the interest rate i (through its effect on bond demand). The threshold condition V = γ defines the boundary between &amp;ldquo;normal times&amp;rdquo; (no investigation) and crisis (investigation and possible sudden inflation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Information Structure.&lt;/strong&gt; The paper&amp;rsquo;s term for the property that whether consumers choose to learn the government&amp;rsquo;s fiscal capacity is itself determined within the model by the parameters of the economy (the interest rate, prior beliefs, the cost of investigation). This contrasts with models that exogenously specify whether agents are informed or not. The endogenous information structure is the mechanism by which the paper generates the two apparent regimes (FTPL-active vs. FTPL-dormant) from a single unified model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default Boundary.&lt;/strong&gt; The kink point in the bond payoff curve at τ_max = B1/P*: the level of the maximum surplus at which the government exactly repays bonds in real terms at the price-level target. When beliefs or bond quantities place the economy near the default boundary, small changes in π or i can push the economy across it, triggering large price-level responses. When the economy is far from the boundary (τ_max comfortably above B1/P*), small shocks have only small smooth effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sudden Inflation / Currency Crisis (as defined in this paper).&lt;/strong&gt; A discrete, discontinuous jump in the period-1 price level P1 that occurs when consumers pass the threshold V(.) = γ and investigate the government&amp;rsquo;s fiscal capacity, finding surpluses to be insufficient. The mechanism is: informed consumers refuse to hold bonds they know will not be repaid in real terms at P*, forcing the price level to jump to clear the government&amp;rsquo;s budget constraint with fewer bonds outstanding. The paper treats sudden inflations and currency crises as the same mechanism in different institutional contexts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Repayment Risk Premium.&lt;/strong&gt; The markup above the risk-free rate that consumers require on government bonds to compensate for the probability that the government&amp;rsquo;s surplus will be insufficient for full real repayment (i.e., the probability that the economy is in the τ_max &amp;lt; B1/P* region). This premium is present even when consumers are uninformed (i.e., do not know which state of τ_max will occur), and is reflected in the consumer&amp;rsquo;s first-order condition for bond demand.&lt;/p&gt;</description></item><item><title>A Preferred-Habitat Model of Term Premia, Exchange Rates, and Monetary Policy Spillovers</title><link>https://macropaperwarehouse.com/papers/a-preferred-habitat-model-of-term-premia-exchange-rates-and-monetary-policy-spillovers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-preferred-habitat-model-of-term-premia-exchange-rates-and-monetary-policy-spillovers/</guid><description>&lt;h2 id="layer-1--core-argument"&gt;Layer 1 — Core Argument&lt;/h2&gt;
&lt;p&gt;The paper develops a two-country preferred-habitat model in which currency and bond markets are populated by different investor clienteles — currency traders with price-elastic demand for foreign assets, and bond investors whose preferences are habitat-specific by country and maturity — with segmentation partly overcome by global arbitrageurs who have limited capital and bear mean-variance risk. Risk premia in the model are time-varying, connected across markets, and consistent with the empirical violations of Uncovered Interest Parity (UIP) and the Expectations Hypothesis (EH): in particular, currency carry trade (CCT) and bond carry trade (BCT) strategies earn abnormally high expected returns in ways that co-vary across the two markets in a manner the standard frictionless model cannot generate. Through these time-varying, connected risk premia, large-scale bond purchases (QE) lower domestic bond yields, lower foreign bond yields, and depreciate the purchasing country&amp;rsquo;s currency; short-rate cuts also lower foreign yields, but with smaller effects than bond purchases. A key structural finding, quantified in the estimated model calibrated to US and Eurozone data, is that currency returns are nearly uncorrelated with long-maturity bond returns — an exchange-rate disconnect — yet the currency market is instrumental in transmitting bond demand shocks across countries, because arbitrageurs hedge their cross-currency positions in bond markets and vice versa. Sterilized foreign-exchange interventions have strong effects on the exchange rate but weak effects on bond yields, while QE/QT has weak effects on the exchange rate but sizeable effects on foreign bond yields — a sharp asymmetry that follows directly from the disconnect.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. Why do UIP and EH fail in the standard model, and what changes in this model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the standard model with perfect capital mobility, risk premia are constant, so the yield curve depends only on expectations of the domestic short rate and the exchange rate absorbs short-rate differentials exactly. In this model, arbitrageurs bear the residual risk when currency traders and bond clienteles are unwilling to absorb excess supply or demand at prevailing prices. Because arbitrageurs have limited capital (captured by a risk-aversion parameter &lt;em&gt;a&lt;/em&gt; ≥ 0 that can also represent capital or Value-at-Risk constraints in reduced form), they demand compensation — time-varying risk premia — for holding currency and maturity risk. When &lt;em&gt;a&lt;/em&gt; = 0, arbitrageurs are risk-neutral, UIP and EH both hold, and the model collapses to the standard frictionless benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. What are the three types of agents and what does each do?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Currency traders&lt;/em&gt; hold foreign assets and have a demand that is downward-sloping (price-elastic, with slope coefficient αe ≥ 0) in the log exchange rate; their demand also shifts with a stochastic currency demand factor γt. They can be interpreted as households engaged in expenditure switching or central banks managing reserve levels. &lt;em&gt;Bond investors&lt;/em&gt; form clienteles, each with a preferred-habitat demand for bonds of a specific country and maturity that is downward-sloping in the log bond price (slope αj(τ)) and shifts with a country-specific bond demand factor βjt; examples are pension funds and insurance companies whose liabilities are long-dated and denominated in their home currency. &lt;em&gt;Global arbitrageurs&lt;/em&gt; trade the currency and all bonds of both countries, maximizing mean-variance utility over instantaneous wealth changes; they bridge the segmented markets and their positions pin down equilibrium risk premia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What is the equilibrium structure and which factors drive prices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The equilibrium exchange rate and bond prices are log-affine functions of five stochastic factors: the home short rate iHt, the foreign short rate iFt, the currency demand factor γt, and the two bond demand factors βHt and βFt. These factors follow a mean-reverting (Ornstein-Uhlenbeck) system. The equilibrium is characterized by a scalar nonlinear system (25 equations in the general case) whose solution pins down the loadings of prices on each factor. This affine structure means each asset&amp;rsquo;s risk premium is the product of the arbitrageur&amp;rsquo;s risk-aversion coefficient, the factor covariance matrix, and arbitrageur net positions, which are themselves determined by market-clearing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. How does a conventional short-rate cut transmit domestically and internationally in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following a home short-rate cut, arbitrageurs find it attractive to enter the CCT — borrow home currency, invest in foreign currency. If currency traders&amp;rsquo; demand is price-elastic (αe &amp;gt; 0), arbitrageurs&amp;rsquo; equilibrium foreign-currency holdings rise, and the expected return on the CCT rises too (arbitrageurs must be compensated for the increased risk). This &lt;em&gt;attenuation effect&lt;/em&gt; means the foreign currency appreciates less than implied by UIP: the exchange rate response is dampened. Simultaneously, arbitrageurs enter the home BCT (borrow at the home short rate, invest in long home bonds); if home bond investors&amp;rsquo; demand is price-elastic (αH(τ) &amp;gt; 0), arbitrageurs&amp;rsquo; long-bond holdings rise and the BCT&amp;rsquo;s expected return rises, attenuating the transmission to domestic long-maturity yields (which fall less than EH would imply). A &lt;em&gt;propagation effect&lt;/em&gt; to foreign bond yields arises through arbitrageur hedging: by taking long positions in foreign currency (CCT), arbitrageurs become exposed to the risk that the foreign short rate drops and the foreign currency depreciates; long-maturity foreign bonds provide a natural hedge (their price rises when the foreign short rate drops), so arbitrageurs increase foreign bond demand, depressing foreign yields. This international transmission of conventional policy is absent from the standard model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. How does unconventional policy (QE/QT) transmit domestically and to the exchange rate and foreign yields?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following QE purchases of home bonds, their prices rise; arbitrageurs accommodate by holding fewer home bonds, which reduces their exposure to home short-rate risk. With less home-rate risk, arbitrageurs become more willing to hold foreign currency (which depreciates when the home short rate rises, offering a natural hedge against the home rate risk they have shed). The increased foreign-currency position in turn makes arbitrageurs more willing to hold foreign bonds (which hedge the foreign-currency position against foreign rate changes). The net result in the model is: QE lowers domestic bond yields, lowers foreign bond yields, and depreciates the home currency. The quantitative finding from the estimated model is that QE/QT effects on foreign bond yields are sizeable and stronger than those of conventional short-rate policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What explains the exchange-rate disconnect, and how can the currency market still transmit bond demand shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the estimated model, variance decompositions reveal that long-maturity bond yields in each country are driven primarily by bond demand factors (βHt and βFt), while the exchange rate is driven primarily by the currency demand factor (γt); short rates account for a small fraction of movements in both, and each factor type accounts for negligible variation in the other asset class&amp;rsquo;s price. The disconnect between bond yields and the exchange rate arises because bond demand shocks in the two countries move the exchange rate in &lt;em&gt;opposite&lt;/em&gt; directions — a home bond demand shock that lowers home yields also raises the exchange rate via arbitrageur hedging, while a foreign bond demand shock moves the exchange rate in the opposite direction. These offsetting effects make the exchange rate nearly uncorrelated with long-maturity bond yields. However, bond demand shocks in one country are transmitted to bond yields in the &lt;em&gt;other&lt;/em&gt; country through the currency market: arbitrageurs hedge their bond positions using the currency, so a shock to home bond demand moves arbitrageurs&amp;rsquo; currency positions, which in turn affects their willingness to hold foreign bonds. Cross-country bond yield comovement is therefore positive and sizeable, despite the exchange-rate disconnect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. What are the model&amp;rsquo;s implications for foreign exchange intervention?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A sterilized purchase of foreign currency by the home or foreign central bank — which shifts the currency demand factor — has strong effects on the exchange rate but weak effects on bond yields. This follows directly from the variance decomposition: the exchange rate loads heavily on the currency demand factor and bond yields load lightly on it. The asymmetry mirrors the QE result in reverse: QE shifts bond demand factors, which load heavily onto bond yields and lightly onto the exchange rate; FX intervention shifts the currency demand factor, which loads heavily onto the exchange rate and lightly onto bond yields. The model thus delivers a sharp policy instrument separation between QE/QT (primarily a bond yield tool) and FX intervention (primarily an exchange-rate tool), with each having spillovers in the other dimension that are quantitatively weaker.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. How is the relationship between currency risk premia and bond risk premia captured, and what empirical regularities does the model match?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model&amp;rsquo;s risk premia are linked through the shared arbitrageur portfolio: the price of each risk factor is proportional to the covariance between that factor and the arbitrageur&amp;rsquo;s overall portfolio return, so a shock that changes arbitrageurs&amp;rsquo; currency positions also changes the compensation required for bond positions, and vice versa. The estimated model is reported to match closely the violations of UIP (CCT profitability) and EH (BCT profitability) documented in the literature, and the ways in which these violations are connected — including findings that yield-curve slope differentials predict CCT profitability, and that CCT profitability declines when carried out with long-maturity rather than short-maturity bonds. These matches are described as consistent with the empirical regularities, not structural identification of the underlying causes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What is the role of segmented versus global arbitrage, and why does the distinction matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper considers both cases. Under &lt;em&gt;segmented arbitrage&lt;/em&gt;, separate arbitrageur pools operate in the currency market (risk aversion ae), home bond market (aH), and foreign bond market (aF); first-order conditions for each pool reflect only their own portfolio risk, so the prices of risk factors differ across markets. Under &lt;em&gt;global arbitrage&lt;/em&gt;, a single pool of arbitrageurs trades all assets, and their shared portfolio means the price of each risk factor is the same across currency and bond markets — this is the mechanism through which bond demand shocks in one country propagate through the currency market to bond yields in the other. Global arbitrage is the primary specification; segmented arbitrage serves as a benchmark to isolate the hedging-based transmission channel that requires global positions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. How does the model relate to and extend predecessor frameworks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model extends Vayanos and Vila (2021) — a closed-economy preferred-habitat yield curve model — to two countries by adding a currency market and a second country&amp;rsquo;s bond market, with arbitrageurs who are global rather than country-specific. In the currency dimension, the attenuation of UIP deviations parallels Gabaix and Maggiori (2015), which models exchange-rate dynamics with financially constrained intermediaries but without a yield curve. The two-country structure allows the paper to simultaneously study term premia (EH violations), exchange rate dynamics (UIP violations), and their connection, and to quantify the effects of QE, conventional monetary policy, and FX intervention within a single internally consistent framework estimated on US-Eurozone data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Preferred-habitat demand:&lt;/strong&gt; A bond investor&amp;rsquo;s demand for bonds of a specific country and maturity that does not arise from portfolio optimization over the full menu of available assets, but rather from institutional constraints or liability-matching motives (e.g., pension funds matching long-dated domestic liabilities). In the model, preferred-habitat demand is price-elastic with slope αj(τ) and shifts with a country-specific bond demand factor βjt; the elastic component means that as bond prices rise, clientele demand falls, so arbitrageurs must absorb the residual supply and require a risk premium to do so.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global arbitrageur:&lt;/strong&gt; An investor who trades the currency and bonds of both countries simultaneously, bridging the segmented currency and bond markets. In the model, global arbitrageurs maximize mean-variance utility over instantaneous wealth changes; their shared portfolio across all asset classes is the mechanism through which shocks in one market create hedging-driven demand in other markets, generating the cross-market linkages in risk premia and monetary policy transmission.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency carry trade (CCT):&lt;/strong&gt; A strategy that borrows at the home short rate and invests at the foreign short rate, profiting when the foreign currency does not depreciate enough to offset the interest rate differential. Under UIP, the CCT earns zero expected return; the model generates a positive expected CCT return — a currency risk premium — when arbitrageurs are risk-averse and currency traders&amp;rsquo; demand is price-elastic. In the paper&amp;rsquo;s notation, the CCT return is det/et + (iFt − iHt)dt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond carry trade (BCT):&lt;/strong&gt; A strategy that borrows at the short rate and invests in long-maturity bonds of the same country, profiting when long yields fall or when expected short rates are below current long yields. Under EH, the BCT earns zero expected return; the model generates a positive expected BCT return — a term premium — when arbitrageurs are risk-averse and bond clientele demand is price-elastic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange-rate disconnect:&lt;/strong&gt; The empirical and model finding that movements in the exchange rate are nearly uncorrelated with movements in long-maturity bond yields, even though both are endogenously determined in the same model. The disconnect arises in the estimated model because long bond yields are driven primarily by bond demand factors, while the exchange rate is driven primarily by the currency demand factor, and the two sets of factors move the exchange rate in offsetting directions so that their net effect on bond yield-exchange rate covariance is approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attenuation effect:&lt;/strong&gt; The dampening of monetary policy transmission to asset prices caused by the need to compensate risk-averse arbitrageurs for the increased risk they bear when accommodating the policy-induced excess demand. In the currency market, a home short-rate cut causes the CCT&amp;rsquo;s expected return to rise (arbitrageurs must be paid more to hold foreign currency), which means the foreign currency appreciates less than UIP predicts. In the bond market, a short-rate cut causes the BCT&amp;rsquo;s expected return to rise (term premia increase), so long yields fall less than EH predicts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Propagation effect:&lt;/strong&gt; The international transmission of a domestic monetary policy shock to foreign asset prices through arbitrageur hedging. A home short-rate cut causes arbitrageurs to increase their foreign-currency position (CCT); this exposes them to the risk of foreign short-rate declines (which depreciate the foreign currency), and long-maturity foreign bonds hedge this risk; so arbitrageurs increase foreign bond demand, depressing foreign yields. This channel is absent from the standard model where risk premia are constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Log-affine equilibrium:&lt;/strong&gt; The conjectured and verified form of the equilibrium in which the log exchange rate and log bond prices are affine (linear plus constant) functions of the five state factors (iHt, iFt, γt, βHt, βFt). This structure allows the model to be solved as a system of ordinary differential equations and scalar equations, and enables closed-form or numerically tractable characterization of risk premia, variance decompositions, and policy effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond demand factor (βjt):&lt;/strong&gt; A stochastic variable that shifts the intercept of bond clientele demand in country j, independent of maturity τ. A positive shock to βjt increases desired bond holdings of country-j clienteles at any given price, forcing arbitrageurs to shed country-j bonds, which lowers bond yields. The factor follows a mean-reverting process and in the estimated model is found to be the primary driver of long-maturity yields in both countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency demand factor (γt):&lt;/strong&gt; A stochastic variable that shifts the intercept of currency traders&amp;rsquo; demand for foreign assets, independent of the exchange rate level. A positive shock to γt increases desired foreign asset holdings of currency traders, so arbitrageurs reduce their foreign-currency position, which affects their bond positions through hedging. In the estimated model, γt is the primary driver of exchange-rate movements.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online accepted version (accepted manuscript). AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>A Theory of Supply Function Choice and Aggregate Supply</title><link>https://macropaperwarehouse.com/papers/a-theory-of-supply-function-choice-and-aggregate-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-theory-of-supply-function-choice-and-aggregate-supply/</guid><description>&lt;h2 id="research-question"&gt;Research Question&lt;/h2&gt;
&lt;p&gt;Modern macroeconomic models of aggregate supply universally restrict firms to price-setting — committing to a price and supplying whatever quantity the market demands. Flynn, Nikolakoudis, and Sastry ask: what happens if instead firms choose any supply function, a mapping that describes the price charged at each quantity of production? The paper develops the first general-equilibrium, macroeconomic theory of supply function choice and characterizes its implications for the slope of aggregate supply, monetary non-neutrality, and time-varying inflation-output tradeoffs.&lt;/p&gt;
&lt;h2 id="methodology"&gt;Methodology&lt;/h2&gt;
&lt;p&gt;The paper proceeds in two stages. In partial equilibrium, a single monopolistic firm with constant-returns-to-scale technology and constant-elasticity demand faces log-normal uncertainty about demand shifters, the aggregate price level, real marginal costs, and the stochastic discount factor. The firm chooses a non-parametric supply function — any implicit mapping f(p,q) = 0 — to maximize expected real profits. The paper shows that supply function choice is equivalent to conditioning price-quantity decisions on the realized nominal demand state z = ΨP^η. The authors prove (Theorem 1) that the optimal supply function is endogenously log-linear: log p = α₀ + α₁ log q, where the inverse supply elasticity α₁ is characterized in closed form.&lt;/p&gt;
&lt;p&gt;In general equilibrium, the authors embed supply function choice in an otherwise standard monetary business cycle model (in the tradition of Woodford 2003a and Hellwig and Venkateswaran 2009), featuring a representative household demanding differentiated goods, a money supply following a random walk with time-varying volatility, and idiosyncratic shocks to productivity, wages, and demand. They guess and verify a log-linear equilibrium and derive a scalar fixed-point equation for the equilibrium supply elasticity (Theorem 3).&lt;/p&gt;
&lt;p&gt;For quantification, the authors calibrate structural parameters (η = 8 from Hottman et al. 2016 scanner data; γ = 0.11 from Gagliardone et al. 2023 Belgian firm data; κ^M = 0.29 calibrated to match an average aggregate supply slope of 0.11 from Hazell et al. 2022) and estimate time-varying uncertainty via a GARCH model of quarterly US data on GDP growth, inflation, and real marginal cost growth from 1960 Q1 to 2024 Q4. Idiosyncratic demand uncertainty is set proportional to aggregate TFP uncertainty using the proportionality factor R = 6.5 from Bloom et al. (2018).&lt;/p&gt;
&lt;h2 id="main-findings"&gt;Main Findings&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Optimal supply function.&lt;/strong&gt; The optimal firm-level supply function is log-linear with inverse supply elasticity α₁ determined by the relative variances and covariances of demand, the price level, and real marginal costs. Three comparative statics drive the macroeconomic results: (1) higher idiosyncratic demand uncertainty (σ²_Ψ) flattens the supply function toward price-setting, because a fixed price insulates profit markups against demand variation; (2) higher price-level uncertainty (σ²_P) steepens the supply function toward quantity-setting, because setting a fixed quantity allows relative prices to adjust; (3) lower price elasticity of demand (less elastic demand, more market power) flattens the supply function, conditional on a sufficient condition that holds in US data whenever η &amp;gt; 2.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;From micro supply to aggregate supply.&lt;/strong&gt; With fixed log-linear supply functions, the economy has a unique log-linear equilibrium with an AD/AS representation (Theorem 2). The slope of aggregate supply ε^S_t depends on ω₁ (the transformed inverse supply elasticity), κ^M (firms&amp;rsquo; signal precision about the money supply), γ (income effects), and η (demand elasticity). Aggregate supply is maximally elastic — money is as non-neutral as possible — if and only if firms are pure price-setters (ω₁ = 0). Aggregate supply is perfectly inelastic — money is neutral — if and only if firms are quantity-setters (ω₁ = 1/η). A lower elasticity of demand flattens aggregate supply through general equilibrium strategic complementarities, a prediction opposite to the New Keynesian model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equilibrium supply slope and its determinants.&lt;/strong&gt; The equilibrium ω₁ solves a fixed-point equation (Theorem 3) in which macroeconomic uncertainty shapes firms&amp;rsquo; optimal supply functions, which in turn shape macroeconomic dynamics. Under the special case of balanced strategic interactions (ηγ = 1), the slope of aggregate supply has a clean closed form depending only on the ratio ρ_t = σ_{ϑ,t}/σ^M_{t|s} (idiosyncratic demand uncertainty relative to posterior monetary uncertainty). Critically, the equilibrium supply slope is invariant to the overall level of uncertainty — only the composition of uncertainty matters (Proposition 3). Even vanishingly small uncertainty can generate any level of monetary non-neutrality depending on uncertainty composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results — United States over time.&lt;/strong&gt; The model&amp;rsquo;s estimated slope of aggregate supply shows sharp variation since 1960. The slope is relatively flat and stable during the 1960s, the Great Moderation (1991–2007), the Great Recession (2008–2019), and the recovery from the Great Recession. It spikes dramatically during the 1970s oil crisis and the post-Covid inflation of the 2020s. Compared to Ball and Mazumder (2011), the model qualitatively matches the steepening during 1973–1984 (+58% in the model) vs. the data&amp;rsquo;s +175%, and a subsequent flattening of −25% vs. −32% in the data during 1985–2007. Compared to Cerrato and Gitti (2022), the model accounts for approximately 4/5 of the steepening between the pre-Covid and post-Covid periods (+112% model vs. +145% data). For the Hazell et al. (2022) comparison, the model accounts for approximately 1/2 of the estimated flattening from 1978–1990 to 1991–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results — Cross-country.&lt;/strong&gt; Using OECD annual data from 1960–2019, the model&amp;rsquo;s predicted slope of aggregate supply is not positively correlated with the average level of inflation across countries. For countries with the highest inflation rates, the model predicts a negative slope of aggregate supply, driven by very high correlation between price-level uncertainty and real marginal cost uncertainty. The model-predicted slope correlates positively with the reduced-form regression coefficient of inflation on real output growth across countries, even after instrumenting for demand. This predictive power is over and above what can be explained by the level or volatility of inflation alone.&lt;/p&gt;
&lt;h2 id="scope-conditions"&gt;Scope Conditions&lt;/h2&gt;
&lt;p&gt;All results are derived under log-normality of uncertainty, which ensures the log-linear structure of optimal supply functions. The quantification relies on GARCH-estimated uncertainty and treats idiosyncratic demand uncertainty as proportional to aggregate TFP uncertainty. The model abstracts from microeconomic nominal price stickiness (though the authors show in Appendix B that Calvo-style sticky prices can be incorporated). The baseline model requires the equilibrium condition on firm beliefs to be consistent (rational expectations). Multiple equilibria of the scalar fixed-point are possible in principle, bounded by at most five log-linear equilibria (Proposition 2).&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is wrong with assuming price-setting or quantity-setting as a primitive restriction on firm behavior?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Price-setting and quantity-setting are two isolated, generically non-optimal points in the larger space of supply functions. Corollary 2 establishes that price-setting is optimal only in the limit as idiosyncratic demand uncertainty becomes unboundedly large (σ²_Ψ → ∞), while quantity-setting is optimal only in the limit as price-level uncertainty becomes unboundedly large (σ²_P → ∞). In a macroeconomic environment where both sources of uncertainty are present in comparable magnitudes, both extreme policies perform poorly and the analyst who imposes either inadvertently restricts firms&amp;rsquo; strategies in ways that have large macroeconomic consequences — for example, making money neutral under quantity-setting even when information frictions are present, or making the slope of aggregate supply invariant to demand elasticity under price-setting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the formal equivalence between supply function choice and conditioning on realized demand?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The firm&amp;rsquo;s problem of choosing a supply function f(p,q) = 0 ex ante is mathematically equivalent to choosing a price-quantity plan (p(z), q(z)) indexed by the nominal demand state z = ΨP^η (Equation 4 in the paper). After the supply function is set, the firm produces where the supply function intersects the demand curve, which pins down the market-clearing outcome as a function of z. Choosing the supply function ex ante is therefore the same as choosing z-contingent prices and quantities without any parametric constraint. This links the model to rational expectations equilibrium in the spirit of Lucas (1972): firms use the demand for their product as a noisy signal to update beliefs and set their optimal price and quantity in response to realized demand conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How is the optimal inverse supply elasticity α₁ derived, and what is the 2SLS interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because the optimal supply function allows the firm to set a z-contingent price, the first-order condition at each realized demand state z = t equates expected marginal revenue and expected marginal cost (Equation 7). Under log-normality, this yields a log-linear relationship log p = α₀ + α₁ log q. The elasticity α₁ equals the ratio (d log p / d log z) / (d log q / d log z) = Cov[log z, log p**] / Cov[log z, log q**], where p** and q** are the full-information optimal price and quantity (Equation 9). This is formally equivalent to a 2SLS regression: the firm estimates how its optimal price should change with its optimal quantity, using the nominal demand state z as an instrument for the optimal quantity. The supply function is steep if nominal demand strongly predicts movements in the full-information optimal price (large reduced-form coefficient); it is flat if nominal demand primarily predicts movements in the full-information optimal quantity (large first-stage coefficient).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do uncertainty and demand elasticity shape the firm&amp;rsquo;s optimal supply function in partial equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Three key comparative statics apply when the supply function is upward-sloping. (1) Greater price-level uncertainty (σ²_P increases) steepens α₁ toward quantity-setting: not knowing competitors&amp;rsquo; prices makes aggressive dynamic pricing attractive because it allows the firm&amp;rsquo;s relative price to adjust ex post. (2) Greater idiosyncratic demand uncertainty (σ²_Ψ increases) flattens α₁ toward price-setting: demand uncertainty favors a fixed price to keep the markup over real marginal costs constant, accommodating demand with quantity variation. (3) A lower price elasticity of demand (more market power, lower η) flattens α₁: more market power reduces the cost of setting the &amp;ldquo;wrong&amp;rdquo; price, reducing the benefit of dynamic pricing. Corollary 1 provides a sufficient condition — σ_{M,P} ≥ 0, 2ησ_{M,P} + σ_{M,Ψ} ≥ σ_{P,Ψ}, and α₁ ≥ 0 — under which ∂α₁/∂η &amp;gt; 0, implying greater market power flattens supply; the paper verifies this condition holds in US data whenever η &amp;gt; 2.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the model generate an aggregate supply and demand representation from supply function choices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Theorem 2 establishes that, given any fixed log-linear supply functions with slope ω₁,t, there is a unique log-linear equilibrium. In this equilibrium, the price level and real output are jointly determined by an aggregate demand curve — shifting with the money supply but not productivity — and an aggregate supply curve — shifting with productivity but not the money supply. The inverse elasticity of aggregate supply is ε^S_t = γ(κ^M_t + ω₁,t(η − 1/γ)(1 − κ^M_t)) / ((1 − ω₁,t η)(1 − κ^M_t)), derived from aggregating firm-level pricing decisions. The slope depends on ω₁,t (micro supply), κ^M_t (signal precision about money), γ (income effects), and η (demand elasticity). An aggregate demand shock of ∆ log M raises the price level by ε^S_t ∆ log M / (ε^D_t + ε^S_t) and raises real output by ∆ log M / (ε^D_t + ε^S_t), where ε^D_t = γ is the inverse elasticity of aggregate demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the equilibrium fixed-point equation and why can there be multiple equilibria?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Theorem 3 shows that the equilibrium transformed inverse supply elasticity ω₁,t solves a quintic polynomial fixed-point equation (Equation 29) that depends on the variances of idiosyncratic demand shocks (σ²_ϑ,t), posterior uncertainty about productivity (σ^A_{t|s}), and posterior uncertainty about money (σ^M_{t|s}). Multiple equilibria can arise because of a self-reinforcing feedback: if firms set steep supply functions, prices respond more to demand, which raises price-level volatility, which in turn makes quantity-setting more attractive, further steepening supply functions. Proposition 2 establishes existence of at least one log-linear equilibrium and at most five. Idiosyncratic productivity and factor price uncertainty do not enter the fixed-point equation because the variance of real marginal costs per se does not affect optimal supply function choice — only the covariance of marginal costs with demand and the price level matters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What determines the slope of aggregate supply in the special case of balanced strategic interactions (ηγ = 1)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under ηγ = 1 — where strategic complementarities from relative price effects exactly offset strategic substitutabilities from aggregate consumption effects — the slope of aggregate supply has the closed-form expression ε^S_t = γ(κ^M_t / (1 − κ^M_t))(1 + 1/(γ²ρ²_t κ^M_t)) where ρ_t = σ_{ϑ,t}/σ^M_{t|s} is the ratio of idiosyncratic demand uncertainty to posterior monetary uncertainty (Corollary 5). Aggregate productivity uncertainty drops out entirely because firms do not use the demand state to infer aggregate productivity when strategic interactions are balanced. As ρ_t → ∞ (idiosyncratic demand dominates), the slope converges to the price-setting value γκ^M_t/(1 − κ^M_t). As ρ_t → 0 (monetary uncertainty dominates), the slope goes to infinity, corresponding to quantity-setting and monetary neutrality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the role of total uncertainty versus the composition of uncertainty?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 3 establishes a striking invariance result: if all standard deviations in the economy are scaled by a common factor λ &amp;gt; 0, the equilibrium supply elasticity and slope of aggregate supply are unchanged. The equilibrium outcomes depend only on the ratios of different sources of uncertainty, not their absolute magnitudes. This sharply distinguishes the model from menu-cost models, in which any increase in uncertainty unambiguously raises the benefit of price adjustment and steepens aggregate supply. A corollary is that idiosyncratic productivity uncertainty has no effect on the slope of aggregate supply in the supply function model, whereas it would steepen aggregate supply in Golosov-Lucas menu-cost models. Moreover, even a vanishingly small level of uncertainty can generate any level of monetary non-neutrality, because the equilibrium supply elasticity is discontinuous at zero uncertainty (ε^S_t (0) = {∞} while ε^S_t (λ) is bounded for any λ &amp;gt; 0).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does market power (demand elasticity) affect the slope of aggregate supply, and why does this differ from the New Keynesian prediction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the supply function model, a lower elasticity of demand (more market power, lower η) flattens aggregate supply by reducing general-equilibrium strategic complementarities. When other firms raise their prices following a demand shock, a given firm faces higher relative demand; the strength of this effect is parameterized by η. With supply functions (ω₁,t ≠ 0), this relative demand increase generates an additional price response, so higher η steepens aggregate supply. Crucially, this effect is exactly zero if and only if firms are pure price-setters (ω₁,t = 0) — meaning the prediction that market power affects aggregate supply is absent from price-setting models. This is the opposite of the New Keynesian prediction: in Woodford (2003b) with decreasing returns to scale, a higher elasticity of demand (less market power) steepens the Phillips curve, because more elastic demand amplifies the quantity response to price changes and thereby the marginal cost response to nominal cost shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the model rationalize the steepening of aggregate supply in the 1970s and 2020s?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The GARCH estimates of macroeconomic uncertainty show abrupt increases in inflation uncertainty during the 1970s oil crisis period and after the Covid-19 shock in the 2020s. In the model, a spike in aggregate price-level uncertainty (σ²_P increases) causes firms to choose steeper supply functions — closer to quantity-setting — endogenously. This steepens the aggregate supply curve so that demand shocks have larger nominal effects and smaller real effects. Quantitatively, relative to the base period, the model predicts a steepening of +58% during 1973–1984 and +112% during 2021–2023. The empirical comparisons are +175% (Ball and Mazumder 2011, 1973–1984) and +145% (Cerrato and Gitti 2022, 2021–2023). The model thus accounts for the direction and rough order of magnitude of both episodes but not their full extent. The quarterly time series of model-implied ε^S_t has a correlation of 0.93 with one-quarter-ahead inflation uncertainty and 0.62 with the quarterly level of inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the cross-country evidence help distinguish the model from alternatives based on the level of inflation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The cross-country analysis uses OECD data from 1960–2019 to construct country-level model-implied slopes of aggregate supply using the same structural parameters (η = 8, γ = 0.11, κ^M = 0.29) and country-specific GARCH uncertainty estimates from a one-lag VAR. The key finding is that the model-implied slope is not positively predicted by average inflation across countries (Panel A of Figure 5) — in fact, for the highest-inflation countries such as Chile, Israel, and Mexico, the model predicts a negative slope of aggregate supply, reflecting high correlation between price-level uncertainty and real marginal cost uncertainty. By contrast, the model-implied slope correlates positively with the reduced-form regression coefficient of inflation on real output growth (Panel B), and this positive correlation is also found using a model-derived instrument isolating exogenous monetary variation. This implies that relative uncertainties, not the mean or volatility of inflation per se, help account for cross-country heterogeneity in inflation-output tradeoffs beyond the predictions of Ball et al. (1988).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How can supply functions be integrated into larger linearized macroeconomic models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Section 4.5 provides a general framework. For any model in which firms face a demand function q_it = d(p_it, z^D_it) and a value function V(p_it, q_it, z^V_it), log-linearization around a deterministic steady state yields an optimal pricing rule ˆp_it = ω₁,it ˆz^D_it (Equation 35) for some scalar ω₁,it determined by the covariance structure of the linearized model. The coefficients ω₁,it enter the standard representation of aggregate dynamics (McKay and Wolf 2023) through the ideal price index ˆP_t = ∫₀¹ ˆp_it di. The additional &amp;ldquo;rational expectations&amp;rdquo; restriction is that ω₁,it must be consistent with the equilibrium law of motion for prices. The paper argues that supply functions can thereby be embedded in the broad class of linearized DSGE models used for quantitative work, including models with decreasing returns, monopsony, endogenous markups, sticky prices, investment, and quality choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What are the implications of supply function choice for monetary policy discretion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model implies a thorny tradeoff for monetary policymakers. If a central bank wishes to maintain discretion — the ability to surprise private agents — this increases firms&amp;rsquo; uncertainty about the money supply (higher σ²_M). Under balanced strategic interactions (ηγ = 1), greater posterior monetary uncertainty (σ^M_{t|s}) lowers the ratio ρ_t = σ_{ϑ,t}/σ^M_{t|s}, which flattens the aggregate supply curve (reduces ε^S_t) and thereby increases the real effect of monetary surprises. However, this also endogenously induces firms to set steeper supply functions — closer to quantity-setting — so that the aggregate supply curve steepens in response to the greater price-level uncertainty generated by such an environment. The paper therefore concludes that maintaining monetary policy discretion may be, at least partially, self-defeating.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inverse supply elasticity (α₁):&lt;/strong&gt; The percentage by which a firm increases its price in response to a one percent increase in production, characterizing the slope of the firm&amp;rsquo;s optimal supply function. It is endogenously log-linear and determined by the ratio of covariances relating the nominal demand state to the firm&amp;rsquo;s optimal price vs. optimal quantity under full information — formally equivalent to a 2SLS coefficient using nominal demand as an instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply function:&lt;/strong&gt; A mapping f(p, q) = 0 describing the locus of prices and quantities a firm commits to, as an implicit function over price-quantity pairs. Unlike price-setting (f depends only on p) or quantity-setting (f depends only on q), the general supply function allows prices to vary with realized demand, nesting both polar cases as limits of extreme uncertainty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nominal demand state (z):&lt;/strong&gt; The composite variable z = ΨP^η that indexes the demand curve. Firms observing their own output market clearing can use z as a noisy signal for inference about the aggregate price level, real marginal costs, and monetary conditions. The supply function is formally equivalent to conditioning price-quantity choices on z.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Slope of aggregate supply (ε^S):&lt;/strong&gt; The inverse elasticity of the aggregate supply curve in the AD/AS representation, measuring the relative within-period response of the price level versus real output to an aggregate demand shock. It depends on the slope of firm-level supply functions (ω₁) interacted with the information precision about the money supply (κ^M) and income effects (γ).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transformed inverse supply elasticity (ω₁):&lt;/strong&gt; The reparameterization ω₁ = α₁/(1 + ηα₁), where α₁ is the firm-level inverse supply elasticity and η is the price elasticity of demand. ω₁ = 0 corresponds to price-setting; ω₁ = 1/η corresponds to quantity-setting. The equilibrium value of ω₁ solves a fixed-point equation that maps macroeconomic uncertainty back into firms&amp;rsquo; optimal supply function choices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Balanced strategic interactions (ηγ = 1):&lt;/strong&gt; A parametric special case in which strategic complementarities from aggregate demand externalities (parameterized by η) exactly offset strategic substitutabilities from wage pressure (parameterized by 1/γ). Under this condition, the slope of aggregate supply has a closed-form solution that depends only on the relative uncertainty about idiosyncratic demand vs. the money supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relative uncertainty sufficient statistic (ρ_t):&lt;/strong&gt; The ratio σ_{ϑ,t} / σ^M_{t|s}, measuring firms&amp;rsquo; uncertainty about idiosyncratic demand shocks relative to posterior uncertainty about the money supply. Under balanced strategic interactions (ηγ = 1), ρ_t is the single sufficient statistic determining the equilibrium slope of aggregate supply. As ρ_t → ∞ (idiosyncratic demand uncertainty dominates), firms converge to price-setting and aggregate supply flattens; as ρ_t → 0 (monetary uncertainty dominates), firms converge to quantity-setting and aggregate supply becomes vertical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Invariance to total uncertainty:&lt;/strong&gt; A key property of the model: the equilibrium slope of aggregate supply is invariant to the overall scale of uncertainty (Proposition 3). Only the composition of uncertainty across idiosyncratic vs. aggregate sources and demand vs. productivity shocks matters. This distinguishes the model from menu-cost models, in which any increase in uncertainty raises the benefit of price flexibility and steepens aggregate supply regardless of uncertainty composition.&lt;/p&gt;</description></item><item><title>A traffic-jam theory of growth</title><link>https://macropaperwarehouse.com/papers/a-traffic-jam-theory-of-growth/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-traffic-jam-theory-of-growth/</guid><description>&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Finocchiaro and Weil ask whether financial development necessarily promotes long-run economic growth, or whether congestion externalities in R&amp;amp;D markets can offset — and even reverse — the growth benefits of easier credit access. The paper proposes that the empirical coexistence of expanding financial sectors and roughly constant per-capita GDP growth rates (approximately 2% annually in the United States over the last century) can be explained by the interplay of search frictions in two sequential markets: credit and innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors build a continuous-time endogenous growth model in which all growth is innovation-led. Firms must pass through four sequential stages — creation, fund-raising (Stage 0–1), R&amp;amp;D search (Stage 1–2), and high-productivity production (Stage 2–3) — before being exogenously destroyed. Both the credit market (firms searching for banks/venture capitalists) and the innovation market (firms searching for innovators after securing finance) are characterized by constant-returns-to-scale matching functions with endogenous market tightness. Nash bargaining determines the loan repayment, and free entry drives profits to zero in both markets. The model is then calibrated to annual U.S. data, with the risk-free rate r = 3.5%, separation rate s = 4%, symmetric bargaining power ω = 0.5, a productivity jump γ = 0.023 targeting a baseline growth rate of 2%, credit market duration for creditors just below one month and for firms slightly above one year (consistent with Wasmer and Weil, 2004), a two-year average patent approval time (USPTO 2020), 6% employment in finance (BLS 2020), and 0.5% employment in scientific R&amp;amp;D (BLS 2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core Mechanism.&lt;/strong&gt; The paper derives a &amp;ldquo;spillover function&amp;rdquo; Q(p,g) that links the equilibrium probability of finding an innovator (q) to the probability of finding a bank (p) and the growth rate (g). Because free entry holds profits at zero, easier credit — a higher p — forces q downward: if a firm spends less time raising funds, the innovation market becomes more congested (Qp &amp;lt; 0). This negative spillover between the two markets is the paper&amp;rsquo;s central traffic-jam analogy: relieving one bottleneck shifts congestion downstream.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; The GG curve — the locus of (p, g) pairs consistent with equilibrium — is hump-shaped under the symmetric cost condition c = ωn (flow search cost for firms in credit markets equals the firm&amp;rsquo;s share of search costs in innovation markets). Growth is maximized when expected credit search time equals expected innovation search time (1/p = 1/q). Beyond that interior optimum, further financial deepening lowers the growth rate. The calibrated economy sits to the right of the hump in a flat region (p &amp;gt; q), so that reducing credit frictions alone has a marginally negative effect on growth: eliminating credit frictions lowers g from 2.000% to 1.997%, a reduction of 0.003 percentage points. Reducing innovation frictions alone raises g modestly to 2.071% (+0.071 pp). Only a simultaneous reduction of frictions in both markets raises g meaningfully, to 2.122% (+0.122 pp). The quantitative effects are deliberately small, consistent with the near-constancy of long-run growth despite financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The non-monotonicity requires both markets to carry search frictions; when only one friction is present, financial development is unambiguously good for growth (Section 4.3). The hump-shape is established analytically in the symmetric case c = ωn; more generally, the paper shows (via back-of-envelope approximation) that the sign of the finance–growth link depends on whether c/ω is less than or greater than n. The quantitative insensitivity of growth to finance is amplified when the real interest rate is close to the growth rate and when potential growth γ is close to actual growth g: the elasticity of growth with respect to finance is proportional to (γ − g)/γ. Extensions to fixed bank entry costs (introducing a growth-to-finance feedback), endogenous innovator wages (Section 4.2), and frictionless innovation (Section 4.3) all confirm the benchmark conclusions under stated parameter conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the paper&amp;rsquo;s central theoretical claim about the finance–growth nexus?&lt;/strong&gt;
The paper claims that the finance–growth relationship is non-monotonic: financial development raises growth when credit is scarce (left of the hump on the GG curve) but lowers it when credit is readily available (right of the hump), because easier financing draws more firms into the innovation market, tightening it and reducing the probability of finding an innovator. This congestion spillover from the credit market to the innovation market is the &amp;ldquo;traffic-jam&amp;rdquo; mechanism. The non-monotonicity vanishes if either market lacks search frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the &amp;ldquo;spillover function&amp;rdquo; and why is it central to the model?&lt;/strong&gt;
The spillover function Q(p, g) is derived from the free-entry zero-profit condition for firms and expresses the innovation-matching probability q consistent with equilibrium for given credit-matching probability p and growth rate g. It has Qp &amp;lt; 0 (easier credit reduces q) and Qg &amp;lt; 0 (faster growth reduces q), capturing the two-way negative interaction between the markets. It is central because all equilibrium and comparative-statics results flow through it: the GG curve is defined by substituting Q into the growth equation g = γ/(1 + s/p + s/Q(p,g)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Under what condition is the GG curve hump-shaped, and what is the intuition?&lt;/strong&gt;
The GG curve is hump-shaped when the flow search cost for firms in the credit market c equals the firm&amp;rsquo;s share of innovation search costs ωn (Proposition 4). The intuition mirrors equalizing travel times across two congested roads: growth is maximized when expected credit search time (1/p) equals expected innovation search time (1/q). When credit is very tight (p small), a marginal increase in p raises the share of innovating firms faster than it tightens the innovation market, so growth rises. Once credit is abundant (p large), the congestion effect on innovation dominates and growth falls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What does the benchmark calibration predict about the quantitative effect of financial development on growth?&lt;/strong&gt;
The benchmark calibration, targeting 2% annual U.S. growth, places the economy to the right of the hump in a flat region of the GG curve (p &amp;gt; q). Eliminating credit market frictions alone reduces the annual growth rate by 0.003 percentage points (from 2.000% to 1.997%) while lengthening expected innovation search time from 2 years to 3.4 years. This marginally negative effect arises because the economy is already well to the right of the optimum. The results are deliberately small and consistent with the empirical near-constancy of growth alongside financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What combination of policies does the model recommend for raising growth?&lt;/strong&gt;
Only a simultaneous reduction of frictions in both the credit and the innovation market raises the growth rate meaningfully, to 2.122% in the calibration (+0.122 pp relative to the 2.000% benchmark). Isolated improvements in credit markets have a marginally negative effect; isolated improvements in innovation markets have a marginally positive effect (+0.071 pp). The authors interpret this as supporting the OECD view that growth-stimulating policies should be designed as a system rather than as isolated pro-growth measures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the elasticity of growth to finance depend on the gap between potential and actual growth?&lt;/strong&gt;
The authors show (referenced as available on request) that the elasticity of the growth rate with respect to financial factors is proportional to (γ − g)/γ, where γ is the potential growth rate (the productivity jump per innovation) and g is the actual equilibrium growth rate. When actual growth is close to potential — as in the benchmark calibration with γ = 0.023 and g = 2.000% — this factor is near zero, making growth nearly insensitive to changes in financial conditions. This provides a structural rationale for why empirically measured finance–growth effects are often small or nil in advanced economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does introducing fixed bank entry costs (Section 4.1) change the results?&lt;/strong&gt;
When banks bear a fixed licensing cost K (paid each time they enter the credit market), credit market tightness φ becomes an increasing function of (r − g)K: the annuity value of the fixed cost falls as growth rises, inducing more bank entry and reducing credit tightness. This introduces an upward-sloping PP curve (rather than a vertical one) and creates a direct positive feedback from growth to financial deepening. The qualitative conclusions on non-monotonicity are preserved: lower licensing costs shift the PP curve right and steepen it, with the equilibrium effect on growth remaining ambiguous due to the congestion spillover into the innovation market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What happens to the spillover function when innovators are paid (Section 4.2)?&lt;/strong&gt;
When innovators receive a Nash-bargained wage, the equilibrium wage (Equation 30) is increasing in innovator productivity (πγ), innovation market tightness (θn), and the growth rate, and decreasing in total credit market search costs K(φ). Easier credit raises both expected revenues and innovator wages for the firm. For innovator bargaining power α sufficiently small (and always for α &amp;lt; 1, as shown in the Appendix), the revenue effect dominates so that Qp &amp;lt; 0 is preserved: finance still creates bottlenecks in the innovation market, and the core non-monotonicity result carries through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the model predict when only one market has search frictions?&lt;/strong&gt;
When only the credit market is frictional and innovators are found instantly after financing is secured, improving credit market efficiency unambiguously raises growth (Section 4.3, Figure 4). The GG curve becomes g = γ/(s/p + 1), which is strictly increasing in p, and the PP curve shifts in a way that unambiguously raises equilibrium growth. The paper uses this case to isolate the source of non-monotonicity: the negative spillover from credit ease to innovation congestion requires frictions in both markets to operate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the paper relate to the empirical &amp;ldquo;too much finance&amp;rdquo; literature?&lt;/strong&gt;
The paper offers a distinct theoretical mechanism for the inverted-U relationship between credit and productivity growth documented by Arcand et al. (2015), Aghion et al. (2019), and Popov (2018), among others. While Aghion et al. (2019) explain the inverted-U through less-efficient incumbents surviving longer with better credit access, and Malamud and Zucchi (2019) emphasize how financing frictions differentially affect entrant and incumbent composition, Finocchiaro and Weil&amp;rsquo;s mechanism operates through congestion externalities in sequential search markets — a channel not previously formalized in the innovation-led growth literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Search frictions in credit markets:&lt;/strong&gt; Firms searching for financiers (banks or venture capitalists) and banks searching for firms face a matching technology with constant returns to scale; credit market tightness φ is the ratio of firms searching for banks to banks searching for firms, and the matching probability p(φ) is strictly decreasing in φ. Free entry drives bank profits to zero, pinning equilibrium tightness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Search frictions in innovation markets:&lt;/strong&gt; After securing financing, firms search for innovators who can upgrade their productivity by factor γ; innovation market tightness θ is the ratio of firms searching for innovators to innovators, and the matching probability q(θ) is strictly decreasing in θ. The number of innovators is held fixed (analogously to fixed labor supply in Mortensen-Pissarides).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spillover function Q(p, g):&lt;/strong&gt; Derived from the free-entry zero-profit condition for firms, Q expresses the equilibrium innovation-matching probability q as a function of the credit-matching probability p and the growth rate g. It has Qp &amp;lt; 0 and Qg &amp;lt; 0, meaning easier credit and faster growth both reduce q by tightening the innovation market. It is the formal embodiment of the traffic-jam mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GG curve:&lt;/strong&gt; The locus of (p, g) pairs consistent with the equilibrium growth equation g = γ/(1 + s/p + s/Q(p,g)). Under the symmetric cost condition c = ωn, the GG curve is hump-shaped: it rises from the origin, reaches a maximum interior growth rate, then declines toward an asymptote g∞ &amp;lt; γ. Its shape encodes the non-monotonic relationship between finance and growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PP curve:&lt;/strong&gt; The locus of equilibrium credit-matching probabilities consistent with free entry in the credit market. In the benchmark model it is a vertical line at p* = p(ω/(1−ω) · k/c), independent of q and g. When banks bear a fixed entry cost K, the PP curve becomes upward-sloping, introducing a direct positive feedback from growth to financial deepening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Potential growth rate γ:&lt;/strong&gt; The productivity jump per successful innovation; in a frictionless world (p = q = ∞) the economy grows at γ. Actual growth g falls below γ to the extent that search frictions delay the delivery of credit and innovation. The elasticity of g to financial factors is proportional to (γ − g)/γ, so when actual and potential growth are close, financial factors matter little for growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Congestion externality in R&amp;amp;D:&lt;/strong&gt; The mechanism by which financial deepening — raising p — drives more firms to seek innovators, tightening the innovation market and reducing q. This negative spillover (Qp &amp;lt; 0) is the paper&amp;rsquo;s central departure from models with only a single friction, where finance is always growth-enhancing.&lt;/p&gt;</description></item><item><title>A Welfare Analysis of Policies Impacting Climate Change</title><link>https://macropaperwarehouse.com/papers/a-welfare-analysis-of-policies-impacting-climate-change/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/a-welfare-analysis-of-policies-impacting-climate-change/</guid><description>&lt;p&gt;This paper extends and applies the marginal value of public funds (MVPF) framework to evaluate the welfare consequences of 96 climate-related tax and spending policies in the United States. The MVPF is a benefit-cost ratio in which the numerator captures all benefits to individuals (measured by their willingness to pay) and the denominator captures net government costs; policies with higher MVPFs are better spending policies, while those with lower MVPFs are more efficient revenue-raising instruments.&lt;/p&gt;
&lt;p&gt;The sample covers policies rigorously evaluated using quasi-experimental or experimental methods drawn from 18 major economics journals between January 1999 and December 2023. Policies fall into three primary categories: subsidies (wind production tax credits, residential solar, electric vehicles, hybrid vehicles, vehicle buybacks, appliance rebates, and weatherization), nudges and marketing, and revenue raisers (gasoline taxes, other fuel taxes, cap-and-trade). A selected set of international aid policies is also analyzed. The analysis applies a harmonized method for translating behavioral changes into emissions changes — using the EPA&amp;rsquo;s AVERT model for electricity-sector emissions — and a consistent set of externality valuations, including an EPA 2023 social cost of carbon (SCC) of $193 per ton of CO2 in 2020 (rising over time), with robustness checks at $76, $337, and $1,367.&lt;/p&gt;
&lt;p&gt;The primary methodological contribution is a new sufficient statistics approach to quantifying learning-by-doing (LBD) externalities. When marginal cost of production is an isoelastic function of cumulative production and demand is an isoelastic function of price, the time path of production satisfies a second-order ordinary differential equation whose solution yields society&amp;rsquo;s willingness to pay for LBD spillovers. LBD generates two types of externalities: a price externality (lower future consumer prices) and an environmental externality (increased future take-up of clean goods). The approach requires four inputs: price elasticity of demand, elasticity of marginal cost with respect to cumulative production, cumulative production at the time of the subsidy, and product cost at the time of the subsidy.&lt;/p&gt;
&lt;p&gt;The three main empirical findings are as follows. First, subsidies for production that directly displaces dirty electricity generation have the highest MVPFs. Wind production tax credits have an MVPF of 3.85 without LBD, rising to 5.87 with LBD. Residential solar subsidies have an MVPF of 1.45 without LBD, rising to 3.86 with LBD. EV subsidies have an MVPF of approximately 1.4 with LBD and approximately 1 without it. Consumer subsidies for appliances, weatherization, vehicle retirement, and hybrid vehicles have MVPFs around 1. Second, conservation nudges targeting electricity consumption can deliver MVPFs exceeding 5 in regions with relatively dirty electric grids, but fall below 1 in cleaner-grid regions such as California and the Northeast — and their effectiveness is expected to decline as grids decarbonize. Third, fuel taxes (gasoline, diesel, jet fuel) and cap-and-trade permit reductions are efficient revenue raisers, with nearly all having MVPFs below 1 and most below 0.7, reflecting the Pigouvian logic that current tax rates fall below the associated environmental externalities. Cap-and-trade permit reductions can produce MVPFs below zero, meaning revenue is raised while providing net positive welfare to individuals.&lt;/p&gt;
&lt;p&gt;The paper also constructs three cost-per-ton metrics — resource cost per ton, government cost per ton, and social cost per ton — and shows they can yield substantively different and sometimes opposite rankings relative to each other and to the MVPF. For example, EV subsidies carry a government cost per ton of $1,356 (among the highest in the sample) yet an MVPF above most consumer subsidies, because that metric omits non-CO2 benefits including LBD effects. The scope of the analysis is US historical policy, with the MVPF comparison most informative when social welfare weights across beneficiary groups are treated as roughly equal.&lt;/p&gt;
&lt;p&gt;Q: What is the MVPF framework and how does it differ from cost-per-ton analysis?
A: The MVPF equals benefits to individuals (sum of willingness to pay) divided by net cost to the government. It is designed for a decision-maker maximizing social welfare subject to a budget constraint, whereas cost-per-ton metrics serve a decision-maker minimizing cost subject to a fixed CO2 reduction target. A higher MVPF means more welfare gain per dollar spent; a lower MVPF means less welfare cost per dollar of revenue raised.&lt;/p&gt;
&lt;p&gt;Q: What are the three cost-per-ton definitions the paper distinguishes, and why do they differ?
A: Resource cost per ton measures the economic resources consumed per ton of CO2 abated, independent of subsidy incidence; government cost per ton measures net government outlays per ton, omitting all non-CO2 benefits; social cost per ton subtracts non-CO2 benefits from government costs. For appliance rebates, these three values are -$2, $474, and an intermediate figure — a range that reflects whether inframarginal transfers and non-CO2 co-benefits are counted.&lt;/p&gt;
&lt;p&gt;Q: What is the new methodological contribution regarding learning by doing?
A: The paper derives a sufficient statistics result showing that when marginal production cost is an isoelastic function of cumulative production and demand is isoelastic in price, the time path of production follows a second-order ordinary differential equation. Solving this equation yields society&amp;rsquo;s willingness to pay for LBD spillovers from four observable parameters: demand price elasticity, the LBD elasticity of marginal cost with respect to cumulative production, cumulative production at the subsidy date, and unit cost at that date. This allows LBD benefits to be incorporated into both MVPF and cost-per-ton calculations without requiring a fully calibrated dynamic model.&lt;/p&gt;
&lt;p&gt;Q: What LBD elasticities does the paper use, and where do they come from?
A: Drawing on Way et al. (2022), a 1% increase in cumulative solar production is associated with a 0.319% price reduction; for wind the elasticity is 0.194%, and for EV batteries it is 0.421%. These are treated as the isoelastic parameter in the sufficient statistics formula.&lt;/p&gt;
&lt;p&gt;Q: How does LBD affect the MVPF estimates for wind, solar, and EVs specifically?
A: For wind production tax credits, the MVPF rises from 3.85 to 5.87 when LBD is included. For residential solar, it rises from 1.45 to 3.86. For EV subsidies, the MVPF rises from approximately 1 to approximately 1.4. Without LBD, EV subsidies are in line with other consumer subsidies; LBD is the primary reason EVs outperform that group.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline social cost of carbon used, and how sensitive are results to alternative values?
A: The baseline SCC is $193 per ton of CO2 in 2020, following EPA 2023 guidance at a 2% discount rate. Robustness checks use $76, $337, and $1,367. Higher SCC values raise the MVPF of all subsidies in the sample, but the relative ordering — with wind PTCs above all other consumer subsidies — remains consistent across the full range.&lt;/p&gt;
&lt;p&gt;Q: How are EV subsidies evaluated, and what accounts for their MVPF exceeding other consumer subsidies?
A: The analysis uses the California EFMP program studied by Muehlegger and Rapson (2022), which finds a price elasticity of demand of -2.1 and 85% pass-through to consumers (15% captured by dealers). A $1 subsidy generates $0.85 in consumer WTP, $0.15 in dealer WTP, $0.17 in CO2 co-benefits, $0.05 in local pollution and accident co-benefits, offset by $0.10 in damages from increased electricity generation. Most benefits are non-environmental (inframarginal transfers and LBD effects on future vehicle prices), which is why the government cost per ton of $1,356 appears high while the MVPF is approximately 1.4.&lt;/p&gt;
&lt;p&gt;Q: What drives the high MVPFs for nudges in dirty-grid regions, and what is the implication for the future?
A: Conservation nudges in dirty-grid areas have high MVPFs (exceeding 5) because each kilowatt-hour of reduced consumption displaces generation from high-emission sources, amplifying the environmental benefit per dollar of program cost. In cleaner-grid regions like California and the Northeast, the same nudge displaces lower-emission generation, pushing the MVPF below 1. As grids decarbonize nationwide, the paper notes that nudge MVPFs will decline over time.&lt;/p&gt;
&lt;p&gt;Q: How do cap-and-trade permit reductions compare to fuel taxes as revenue-raising instruments?
A: Nearly all fuel taxes (gasoline, diesel, jet fuel) have MVPFs below 1, with most below 0.7, meaning they impose a welfare cost of only $0.70 per dollar of revenue raised. Cap-and-trade permit reductions can have MVPFs below zero, meaning they can raise revenue while simultaneously providing net positive welfare gains to individuals because environmental benefits from reduced emissions outweigh the permit costs borne by emitters.&lt;/p&gt;
&lt;p&gt;Q: What do the international subsidy findings suggest, and what are their limitations?
A: Subsidies for efficient charcoal cookstoves in Kenya (Berkouwer and Dean 2022) generate US-specific gains from CO2 reductions that are 37 times the net cost of the subsidy; including global benefits raises the MVPF to 323. However, the paper flags substantial uncertainty: estimated policy impacts vary widely within similar international categories, and the US-specific MVPF is highly sensitive to assumptions about the incidence of the social cost of carbon on US residents and US government tax revenue.&lt;/p&gt;
&lt;p&gt;Q: Why does the social cost per ton metric give opposite rankings within wind, solar, and EVs relative to the MVPF?
A: EVs have a social cost per ton of -$415 versus -$32 for wind PTCs, making EVs appear superior on that metric — the reverse of the MVPF ordering. The paper explains that when SCPT values are negative (policies that abate CO2 while also yielding positive non-CO2 net benefits), the metric loses its Lagrange multiplier interpretation: increased non-CO2 benefits make SCPT more negative while increased abatement makes it less negative, preventing meaningful cross-policy comparisons.&lt;/p&gt;
&lt;p&gt;Q: What is the overall policy ranking implied by the MVPF analysis?
A: From highest to lowest MVPF: international clean energy subsidies &amp;gt; wind production tax credits &amp;gt; residential solar subsidies &amp;gt; energy conservation nudges (dirty grids) &amp;gt; EV subsidies &amp;gt; consumer appliance and weatherization subsidies &amp;gt; hybrid vehicle subsidies &amp;gt; vehicle buyback rebates &amp;gt; energy conservation nudges (clean grids) &amp;gt; revenue raisers (gas taxes, fuel taxes, cap-and-trade). The paper notes that shifting $1 of government revenue from gas taxes (MVPF ~0.67) to wind PTCs (MVPF ~5.87) generates $5.20 in net welfare benefits to individuals, assuming equal social welfare weights across groups.&lt;/p&gt;
&lt;p&gt;Marginal Value of Public Funds (MVPF): A benefit-cost ratio equal to the sum of individuals&amp;rsquo; willingness to pay for a policy divided by its net cost to the government. Policies with higher MVPFs deliver greater welfare gains per dollar spent; those with lower MVPFs impose lower welfare costs per dollar of revenue raised. Used to compare spending and revenue-raising policies on a common welfare-maximizing basis.&lt;/p&gt;
&lt;p&gt;Learning-by-Doing (LBD) Externality: The spillover by which current production of a technology lowers its future marginal cost, generating future consumer surplus (price externality) and additional future uptake with associated environmental benefits (environmental externality). Treated in this paper as an uninternalized external benefit of subsidizing current production.&lt;/p&gt;
&lt;p&gt;Sufficient Statistics Approach to LBD: The paper&amp;rsquo;s methodological contribution — showing that when marginal cost is an isoelastic function of cumulative production and demand is isoelastic in price, the LBD welfare benefit can be computed from four observables: the demand price elasticity, the LBD cost elasticity, cumulative production at subsidy date, and unit cost at subsidy date, without requiring a fully specified dynamic model.&lt;/p&gt;
&lt;p&gt;Resource Cost per Ton (RCPT): Economic resources consumed to produce and use a product, divided by tons of CO2 abated. Appropriate for private firms minimizing abatement cost; independent of subsidy take-up rates and inframarginal transfers.&lt;/p&gt;
&lt;p&gt;Government Cost per Ton (GCPT): Net government outlay per ton of CO2 abated. The correct metric for a government focused exclusively on CO2 reduction at minimum fiscal cost; omits all non-CO2 welfare impacts, including co-benefits and LBD effects.&lt;/p&gt;
&lt;p&gt;Social Cost per Ton (SCPT): Government cost net of all non-CO2 benefits, per ton of CO2 abated. Intended to capture the social cost of abatement, but loses its Lagrange multiplier interpretation when values are negative, preventing valid cross-policy comparisons in that region.&lt;/p&gt;
&lt;p&gt;Social Cost of Carbon (SCC): The monetized damage from one additional ton of CO2 emissions. Baseline value of $193 per ton in 2020 from EPA 2023 at a 2% discount rate, rising over time. A key parameter driving MVPF levels across all policy categories; robustness checked at $76, $337, and $1,367.&lt;/p&gt;
&lt;p&gt;Pigouvian Efficiency of Environmental Taxes: The paper quantifies that fuel taxes have MVPFs below 0.7 because current tax rates fall below the associated Pigouvian optimum — i.e., taxing polluting goods raises revenue while reducing a pre-existing negative externality, so the welfare cost of the revenue is less than one dollar per dollar raised.&lt;/p&gt;</description></item><item><title>About Ledger</title><link>https://macropaperwarehouse.com/about/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/about/</guid><description>&lt;h2 id="what-this-is"&gt;What this is&lt;/h2&gt;
&lt;p&gt;Ledger tracks forthcoming papers in macroeconomics and monetary economics and publishes a faithful two-layer summary of each one. The goal is a fast path from &amp;ldquo;something new is out&amp;rdquo; to &amp;ldquo;I understand what it actually claims and how confident I should be.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="how-summaries-are-made"&gt;How summaries are made&lt;/h2&gt;
&lt;p&gt;Every summary goes through the same pipeline:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Metadata pulled from APIs.&lt;/strong&gt; Bibliographic fields (title, authors, journal, DOI, links) come from Crossref and OpenAlex — never guessed or filled in by a model.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Source text identified.&lt;/strong&gt; The &lt;code&gt;source_text_origin&lt;/code&gt; field on each paper records whether the summary was built from the full manuscript, the open-access HTML, or the abstract only. Layer 2 is constrained to what that source actually says.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Two-layer summary drafted.&lt;/strong&gt; Layer 1 is a plain-language preview carrying the load-bearing qualifiers. Layer 2 is a Q&amp;amp;A that goes as deep as faithful coverage requires.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Claim-grounding review.&lt;/strong&gt; Every claim must be traceable to a source span. Ungrounded claims block publication.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Human approval.&lt;/strong&gt; No summary reaches the live site without passing the review gate and a human merge decision.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="journals-covered"&gt;Journals covered&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Whitelist&lt;/strong&gt; (all papers): Journal of Monetary Economics · Journal of Money, Credit and Banking · Journal of Macroeconomics · Macroeconomic Dynamics · Journal of Economic Dynamics and Control · Review of Economic Dynamics · American Economic Journal: Macroeconomics · Journal of Economic Growth&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Filter&lt;/strong&gt; (macro/monetary papers only): American Economic Review · Econometrica · Journal of Political Economy · Quarterly Journal of Economics · Review of Economic Studies · AER: Insights · The Economic Journal&lt;/p&gt;
&lt;h2 id="corrections"&gt;Corrections&lt;/h2&gt;
&lt;p&gt;Found an error or a misrepresentation? The flag link on each paper page goes directly to the right place. Corrections from authors are especially welcome.&lt;/p&gt;</description></item><item><title>Abundance from Abroad: Migrant Income and Long-Run Economic Development</title><link>https://macropaperwarehouse.com/papers/abundance-from-abroad-migrant-income-and-long-run-economic-development/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/abundance-from-abroad-migrant-income-and-long-run-economic-development/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how persistent increases in international migrant income prospects affect long-run economic development in migrant-origin areas. The central question is whether Philippine provinces with persistent access to higher-income migration opportunities develop faster than provinces with less attractive migration opportunities, and through which channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Natural Experiment and Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors exploit the 1997 Asian Financial Crisis as a large-scale natural experiment. The crisis triggered sharp, heterogeneous, and persistent exchange rate changes across Philippine migrants&amp;rsquo; destination countries — ranging from a 4% depreciation against the Philippine peso (Korea) to a 57% appreciation (Libya), with Japan and Saudi Arabia in between (appreciations of 32% and 52%, respectively). Because Philippine provinces differed in the pre-crisis distribution of migrant income across destinations (measured using unusual POEA/OWWA administrative contract data covering all overseas worker contracts, including migrant incomes, origins, and destinations), these exchange rate shocks generated exogenous, province-level variation in a shift-share instrument: the predicted change in province migrant income per capita due to the 1997 shocks. Identification follows the &amp;ldquo;exogenous shares&amp;rdquo; framework of Goldsmith-Pinkham et al. (2020). Pre-trend tests across up to 12 years of pre-shock panel data find no evidence of differential trends across provinces. The five destinations with the highest Rotemberg weights — Saudi Arabia, Japan, United States, Taiwan, and Hong Kong — collectively account for 75% of the identifying variation. The exchange rate shocks and the exposure weights both exhibit strong persistence over two decades post-1997.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Philippine government administrative data (POEA/OWWA) on all overseas worker contracts, 1992–2015, matched at 95% rate, providing province-of-origin and destination-specific migrant income.&lt;/li&gt;
&lt;li&gt;Philippine Family Income and Expenditure Survey (FIES), up to twelve triennial rounds from 1985–2018 (74 provinces, ~40,000 households per round), for domestic income and expenditure.&lt;/li&gt;
&lt;li&gt;Six rounds of the Philippine Census of Population (1990–2015) for education, migration rates, and sectoral employment shares.&lt;/li&gt;
&lt;li&gt;Province-level consumer price index data (1994–2017) and firm-level export survey data for robustness checks.&lt;/li&gt;
&lt;li&gt;Unit of analysis: 74 Philippine provinces (consistent 1990 borders).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Six-fold magnification of migrant income&lt;/strong&gt;: Each unit of initial short-run shock (1997–1998) to migrant income per capita is magnified more than six-fold by 2009–2015. A one-standard-deviation shock (0.093) raises long-run migrant income per capita by 14.7% of the baseline mean (PhP 601 per capita, 0.2 standard deviations).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Domestic income gains predominate&lt;/strong&gt;: A one-standard-deviation shock raises domestic income per capita (excluding migrant income and remittances) by 6.4% of the baseline mean (PhP 1,676, 0.18 standard deviations). Remarkably, 73.6% of the long-run global income increase comes from domestic income and only 26.4% from migrant income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Global income and expenditure&lt;/strong&gt;: A one-standard-deviation shock raises global income per capita by PhP 2,277 (0.2 standard deviations, or 7.5% of the baseline mean) in 2009–2015. Expenditure per capita rises by PhP 1,159 (0.13 standard deviations). Effects emerge gradually over two decades.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Education&lt;/strong&gt;: A one-standard-deviation shock increases the college-educated share of the population by 0.46–0.51 percentage points (0.11–0.12 standard deviations) and secondary completion by 0.63 percentage points. There is no significant effect on primary completion.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Migration rates and skill composition&lt;/strong&gt;: A one-standard-deviation shock increases the migration rate by 0.19 percentage points (0.22 standard deviations), raises the share of skilled migrants by 1.84 percentage points (0.19 standard deviations), and increases average migrant annual salary by PhP 23,703 (0.16 standard deviations). New migration concentrates in higher-education-quartile occupations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Structural change&lt;/strong&gt;: The shock reduces primary sector employment shares by 1.2 percentage points per standard deviation (0.06 standard deviations), with over 70% of that shift absorbed by non-tradable goods and services sectors. Domestic income gains are driven almost entirely by non-agricultural income, and roughly 55% of the increase in entrepreneurial income is from service sectors.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Education&amp;rsquo;s contribution to income&lt;/strong&gt;: Model-based calculations assign 19.6% of the global income gain, 17.8% of the migrant income gain, and 20.2% of the domestic income gain to educational investments. Exchange rate persistence plus altered migration flows explain an additional 64.6% of the migrant income increase, so together these mechanisms account for 82.3% of the six-fold magnification. A demand multiplier (assuming 64% of migrant income returns to origin economies and a multiplier of 2.9, consistent with estimates from the literature) accounts for approximately 83.3% of the non-education-related portion of the domestic income increase.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Threats to Identification Ruled Out&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Import and export shift-share controls (constructed analogously using bilateral trade data and province-level industry employment shares) are uncorrelated with the migrant income shock and leave coefficient estimates unchanged. Province-level manufactured exports, agricultural income, the CPI, and national-level FDI inflows show no statistically significant response to the shock. Internal migration rates are unaffected. Geographic spillover controls and tourism controls do not alter results. Placebo regressions in the pre-period yield small, statistically insignificant coefficients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies formal, government-regulated temporary labor migration from the Philippines, where migrants sign contracts through POEA-licensed agencies and typically expect to return after one or more contracts. The findings apply specifically to settings where persistent (not transitory) migrant income shocks occur. Approximately 60% of contract migrants are female. The study period spans 1985–2018, with main long-run outcome analyses comparing 1994 (pre-shock) with 2009–2015 (post-shock).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What makes the 1997 Asian Financial Crisis useful as a natural experiment for this paper&amp;rsquo;s purposes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The crisis was largely unanticipated by policymakers, international organizations, and financial markets, making it implausible that pre-1997 migration destination choices reflected anticipation of the shocks. Exchange rate changes were heterogeneous across destinations (ranging from a 4% depreciation to a 57% appreciation), and crucially, these changes proved highly persistent over two decades — regression coefficients of long-run exchange rate changes on the initial 1997–1998 shock are close to and statistically indistinguishable from 1 in nearly all post-shock periods. Combined with the province-specific variation in migrant destination exposure, this generates persistent, exogenous, and heterogeneous shocks to migrant income prospects across provinces.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the shift-share variable, and how does it combine &amp;ldquo;shifts&amp;rdquo; and &amp;ldquo;shares&amp;rdquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: The shift-share variable Shiftshareo equals the sum over destinations d of (ωdo0 × ΔRd), where ωdo0 is province o&amp;rsquo;s pre-shock migrant income per capita from destination d (the &amp;ldquo;exposure weight&amp;rdquo; or &amp;ldquo;share&amp;rdquo;), and ΔRd is the fractional change in destination d&amp;rsquo;s exchange rate from before to after the crisis (the &amp;ldquo;shift&amp;rdquo;). It captures the predicted change in province-level migrant income per capita due to the 1997 exchange rate shocks, and is derived directly from a theoretical model of migration. Identification relies on the &amp;ldquo;exogenous shares&amp;rdquo; approach of Goldsmith-Pinkham et al. (2020): the pre-1997 exposure weights are treated as as-good-as-randomly assigned conditional on controls, because they reflect historical migration networks formed well before the crisis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why is the six-fold magnification of the initial migrant income shock so striking, and what does the structural model say about its sources?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: The coefficient on migrant income per capita (6.463 in Panel D of Table 1) implies that for each unit of initial short-run migrant income shock, migrant income per capita is more than six units higher in 2009–2015 — a far larger response than a one-for-one pass-through would predict. The structural model, which augments a Fréchet-based gravity model of migration with endogenous education investments, accounts for 82.3% of this magnification. Education investments explain 17.8% of the migrant income increase; persistent favorable exchange rates and resulting shifts in migration flows across destinations explain an additional 64.6%. The Fréchet elasticity of migration flows with respect to destination wages is estimated at θ = 3.42 via PPML, implying that even partial reorientation of migrants toward now-higher-wage destinations substantially raises aggregate migrant income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What evidence supports the parallel trends assumption in the pre-shock period?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The authors present event study diagrams (Figure 2) showing no differential positive pre-trends in either expenditure per capita or domestic income per capita prior to 1997 — for domestic income, there is a statistically insignificant negative trend from 1985–1991 and no trend in 1991–1994. Placebo regressions estimated on the pre-period only (1985, 1988, 1991 as &amp;ldquo;pre,&amp;rdquo; 1994 and 1997 as &amp;ldquo;post&amp;rdquo;) yield small, statistically insignificant coefficients on both domestic income and expenditure. Balance tests focusing on the five high-Rotemberg-weight destination shares (Saudi Arabia, Japan, US, Taiwan, Hong Kong) — which collectively account for 75% of the identifying variation — also show no significant pre-trends in key outcomes across provinces with varying levels of exposure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How do the authors rule out trade flows as an alternative mechanism for the estimated income effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: They construct separate import and export shift-share variables, analogous to the &amp;ldquo;China shock&amp;rdquo; of Autor et al. (2013), using baseline bilateral trade values (from COMTRADE, disaggregated to 36 ISIC industries), province-level employment shares in import and export industries (from the 1990 Census), and the same destination exchange rate shocks. These trade shift-share variables are uncorrelated with the migrant income shock after conditioning on baseline controls (Appendix Table A5). Including them as additional controls in Panel D of all main regression tables leaves the migrant income coefficient stable. Further, province-level manufactured exports per capita show no large or statistically significant response to the migrant income shock, agricultural income similarly shows no significant response, and consumer price indices are unresponsive — ruling out import price changes as a confound. FDI inflows at the national level also show no significant relationship with destination-country exchange rate shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the composition of the domestic income gains — where do they come from?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: Both wage income and entrepreneurial/rental income rise significantly and in similar magnitude, while &amp;ldquo;other income&amp;rdquo; (pensions, interest, dividends) shows no robust increase (Table 4). Non-agricultural income drives virtually the entire domestic income gain; agricultural income per capita is statistically insignificant (Table 5, columns 1–2). Within entrepreneurial income, approximately 55% of the increase is from service sectors, with manufacturing and primary sector entrepreneurial income showing insignificant effects at the 10% level (Table 5, columns 3–5). These patterns are consistent with the structural change finding: the shock shifts labor from primary sectors toward non-tradable goods and services rather than toward tradable manufacturing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the &amp;ldquo;global income&amp;rdquo; concept and what share does each component contribute?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: Global income per capita is defined as the sum of domestic income per capita (earned within the Philippine economy, excluding all international transfers) and migrant income per capita (the full income earned abroad by a province&amp;rsquo;s international migrants, calculated from contract data). Of the long-run global income increase, 73.6% comes from domestic income and 26.4% from migrant income. A one-standard-deviation shock raises global income by PhP 2,277 per capita in 2009–2015 (0.2 standard deviations, or 7.5% of the baseline mean).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do education effects translate into more and higher-skilled migration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: A one-standard-deviation migrant income shock increases college completion by 0.46 percentage points and secondary completion by 0.63 percentage points (with no significant effect on primary completion), consistent with the shock raising the return to higher education in the broader population. These better-educated workers then migrate at higher rates: the share of migrants who are skilled (college-educated) rises by 1.84 percentage points per standard deviation. Migration increases are concentrated in the two highest-education quartiles of occupations (engineers, medical professionals, teachers in the 4th quartile; caregivers, restaurant workers, performing artists in the 3rd quartile), with no significant effect in the two lowest quartiles. Average annual migrant salary rises by PhP 23,703 per standard deviation (0.16 standard deviations).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What mechanisms does the structural model invoke to explain the domestic income gains?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: The model treats domestic income changes as arising through at least two channels: (1) the education channel, which the model assigns 20.2% of the domestic income increase (using the estimated college completion response of 0.046 per unit shock, baseline skill-migration probabilities, and baseline skill premia for domestic income); and (2) a demand multiplier operating on the portion of migrant income remitted to origin provinces, combined with capital accumulation from sustained migrant income flows. Assuming 64% of migrant income returns to origin economies (estimated indirectly from KNOMAD/ILO and Survey on Overseas Filipinos data) and a multiplier of 2.9 (consistent with estimates from Kenya and India), this demand-plus-investment channel can explain approximately 83.3% of the remaining (non-education-related) domestic income increase of PhP 14.4 per unit shock. Under baseline assumptions (α = 0.64), the stylized dynamic model generates PhP 18.88 of domestic income by 2015 from a PhP 1 initial shock — close to the empirical estimate of PhP 18.02.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the authors assess SUTVA and internal migration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: They test whether the migrant income shock affects net internal migration rates at the provincial level (Appendix Table A6) and find no large or statistically significant impact. There is a small negative effect on outmigration of young adults (aged 16–24) that the authors judge cannot account for the documented income impacts. The Philippines&amp;rsquo; archipelago geography (over 7,000 islands) is noted as likely limiting inter-provincial economic spillovers; to the extent spillovers occur, they would be positive (demand spillovers from provinces experiencing income gains to neighboring provinces), making estimates conservative lower bounds. Direct tests controlling for the inverse-distance-weighted migrant income shock in neighboring provinces leave main estimates unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Are the exposure weights (migration shares) persistent, and does this support interpreting the shock as persistent?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: Yes. Regressions of dyadic migrant income per capita in post-shock years (2009, 2012, 2015) on dyadic migrant income per capita in 1995 yield coefficients ranging from 0.4 to 0.6, each statistically significantly different from zero (and from 1, indicating partial but substantial persistence). The exchange rate shocks ΔRd are even more persistent: regression coefficients on the initial 1997–1998 shock are close to 1 and statistically indistinguishable from 1 in nearly all post-shock periods (with the only exceptions in 2009–2012 during the Great Recession). Both components of the shift-share variable thus show persistence over two decades, supporting interpretation of the long-run effects as responses to a persistent (not transitory) income shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the policy implications and how do the authors connect findings to migration policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The findings suggest migration policy should be an important part of the development policy toolkit. The results are directly relevant to origin-country policies facilitating formal, contract-based labor migration (e.g., regulation of recruitment agencies, educational investments to raise worker skills and competitiveness for overseas employment) and destination-country policies governing legal immigration opportunities. The authors also note implications for overseas development assistance: development agencies could consider supplementing traditional foreign aid with programs that facilitate international labor migration. The paper&amp;rsquo;s context — formal, government-regulated migration through POEA and OWWA — is described as highly policy-relevant, with 94% of developing countries with populations exceeding 1 million having a dedicated government migration agency and 78% having policies promoting migrant remittances.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shift-share variable (Shiftshareo):&lt;/strong&gt; The paper&amp;rsquo;s primary independent variable, equal to the sum over all overseas destinations d of (ωdo0 × ΔRd) — the province&amp;rsquo;s pre-shock migrant income per capita from each destination (the exposure weight or &amp;ldquo;share&amp;rdquo;) multiplied by that destination&amp;rsquo;s exchange rate shock (the &amp;ldquo;shift&amp;rdquo;). It is the predicted change in province migrant income per capita due to the 1997 Asian Financial Crisis exchange rate shocks, and is derived directly from the theoretical model of migration (Equation A9). Identification treats the exposure weights as exogenous following the &amp;ldquo;exogenous shares&amp;rdquo; approach of Goldsmith-Pinkham et al. (2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exposure weights (ωdo0):&lt;/strong&gt; Province o&amp;rsquo;s pre-shock aggregate migrant income per capita earned in destination d, calculated from administrative POEA/OWWA contract data for 1995. These serve as the &amp;ldquo;shares&amp;rdquo; in the shift-share and capture the extent to which a province&amp;rsquo;s residents are exposed to a given destination&amp;rsquo;s exchange rate shock. They reflect historically-formed migration networks rather than anticipation of future shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global income per capita:&lt;/strong&gt; The sum of domestic income per capita and migrant income per capita. Domestic income is household income earned within the Philippine economy (wages, entrepreneurial, and other sources), explicitly excluding all income from international sources including remittances. Migrant income is the full income earned abroad by all international migrants from the province, calculated from contract data (not remittances sent home). Global income thus captures the full resource gain available to a province from the combination of domestic production and international migration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Magnification (of migrant income shock):&lt;/strong&gt; The empirical finding that the long-run coefficient on migrant income per capita (6.463 in Panel D, Table 1) far exceeds 1 — meaning each unit of initial short-run shock becomes more than six units of migrant income per capita in 2009–2015. The paper decomposes this magnification into contributions from persistent exchange rates, educational investments raising skill levels and migration, and shifts in migration flows toward now-higher-wage destinations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Brain gain:&lt;/strong&gt; The paper&amp;rsquo;s term for the process by which improved migrant income prospects raise educational investments among the broader population (not just among migrants), leading to higher skill levels among non-migrants as well. The paper distinguishes this from &amp;ldquo;brain drain&amp;rdquo; (where migration of skilled workers reduces origin-area human capital) and provides evidence of a &amp;ldquo;virtuous cycle&amp;rdquo;: education raises migration rates and migrant skill levels, which in turn raises migrant and domestic incomes, potentially funding further education.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rotemberg weights:&lt;/strong&gt; Province-destination-level weights (following Goldsmith-Pinkham et al. 2020) characterizing which destination-specific exchange rate shocks drive the estimates most. Saudi Arabia (0.20), Japan (0.19), United States (0.18), Taiwan (0.10), and Hong Kong (0.08) together account for 75% of the total Rotemberg weight. These weights guide which destination-specific exposure shares receive the most scrutiny in pre-trend and balance tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fréchet elasticity (θ):&lt;/strong&gt; The elasticity of migration flows from an origin province to a destination with respect to destination wages (in Philippine pesos), estimated at 3.42 via PPML using the exchange rate shocks. This parameter governs how much migration flows — and thereby migrant income — respond to the persistent exchange rate changes, and is central to the model&amp;rsquo;s decomposition of the six-fold magnification of migrant income effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Domestic income multiplier:&lt;/strong&gt; The ratio of long-run domestic income increase to the portion of the migrant income shock that returns to origin provinces. Assuming 64% of migrant income returns to origin economies (estimated from multiple administrative data sources), the implicit demand multiplier in the paper&amp;rsquo;s context ranges from about 2.9 to 3.4, consistent with multipliers found in related literature on cash transfers and credit supply shocks in low-income settings.&lt;/p&gt;</description></item><item><title>Across-Country Wage Compression in Multinationals</title><link>https://macropaperwarehouse.com/papers/across-country-wage-compression-in-multinationals/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/across-country-wage-compression-in-multinationals/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Many multinationals do not fully adjust wages to the local context of their foreign establishments; instead, they partially link the wages of foreign workers in a given position to the wages paid in the same position at headquarters — a practice the authors call &amp;ldquo;wage anchoring.&amp;rdquo; Using yearly establishment-level compensation data on roughly 1,200 multinationals operating across 174 cities worldwide (2000–2015) and matched employer-employee administrative data (RAIS) from Brazil, Hjort, Li, and Sarsons document that a 10 percent higher headquarters wage is associated with 1.63–2.8 percent higher wages for workers in the same occupation at foreign establishments, with the within-firm across-country correlation substantially exceeding the correlation between a given establishment&amp;rsquo;s wages and the local average paid by other multinationals for the same position. To establish a causal link between externally imposed headquarters wage changes and subsequent foreign establishment wage responses, the paper exploits two identification strategies: minimum wage shocks in the headquarters country or U.S. state and exchange rate fluctuations, both of which generate plausibly exogenous variation in headquarters wages that is then partially transmitted to foreign workers in the same position. Wage change transmission appears to be direct and to operate through firm-wide wage-setting procedures rather than through associated changes in technology or employment at foreign establishments, a conclusion the Brazil RAIS data support because total employment at multinationals&amp;rsquo; Brazilian establishments shows little change following positive external shocks to headquarters wages. Wage anchoring is strongest for low-skill occupations (cleaners, drivers, security guards), where a 10 percent higher headquarters wage is associated with a 2.8 percent higher foreign establishment wage, versus roughly 1.2 percent for middle- and high-skill occupations; the resulting spatial compression of wages is in line with how many multinationals themselves report setting pay across locations.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central phenomenon documented in this paper, and what are the two broad empirical components of the analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central phenomenon is &amp;ldquo;wage anchoring&amp;rdquo;: multinationals link wages at their foreign establishments to the wage level at headquarters for the same narrowly-defined occupation, so that the within-firm across-country wage distribution is more compressed than what local labor-market conditions alone would imply. The first empirical component is descriptive — documenting the high cross-sectional correlation between headquarters and foreign establishment wages within a firm×occupation cell, controlling for city×year effects and local wage benchmarks. The second component is causal — using minimum wage shocks in the headquarters country or U.S. state and exchange rate shocks to generate externally imposed changes in headquarters wages, and tracing whether and how quickly those changes are partially transmitted to foreign establishments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the primary dataset, what does it cover, and what are its key limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The primary dataset was compiled by an unidentified consulting company that gathers compensation information from client employers and harmonizes positions globally into 309 occupations across 16 skill levels and 26 occupational categories. It covers roughly 1,200 multinationals (private-sector firms and multinational public-sector employers such as NGOs and multilateral organizations), operating in more than 170 cities, with yearly observations spanning 2000–2015. The data report average nominal gross total monthly wages for domestic (non-expat) workers in each establishment-occupation-year cell. Key limitations: the panel is unbalanced because multinationals choose which establishments report each year and often rotate establishments in and out; matching between the headquarters and any given foreign establishment requires observing the same occupation in the same year at both, which reduces the headquarters-matched sample to 80 employers and 611 foreign establishments (Sample 3, the most comparable subsample). The publicly listed U.S. firms in the data account for about one-third of total revenue of all publicly listed U.S. firms, so the sample is skewed toward unusually large employers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do the authors define and measure &amp;ldquo;wage anchoring&amp;rdquo; in the descriptive section?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors regress log average wages of workers in occupation j at a firm f&amp;rsquo;s foreign establishment in city c in year t (wjfct) on log average wages for the same occupation at the firm&amp;rsquo;s headquarters (HQwjft), controlling for firm×occupation fixed effects, city×year fixed effects, and a local market wage benchmark measured either as the average paid by other multinationals in the same city-occupation-year cell or as a city×occupation×year fixed effect. The estimated coefficient on the headquarters wage — around 0.163 using the benchmark-wage control and about 0.09 using the more restrictive city×occupation×year fixed effect — measures how much of a headquarters wage difference is &amp;ldquo;passed through&amp;rdquo; to foreign establishment wages within the same firm and occupation. They further document that the within-firm wage slope (the difference between wages in consecutive skill levels within an occupational category) at foreign establishments is similarly anchored to the corresponding slope at headquarters, with a 10 percent greater consecutive-skill wage gap at headquarters associated with about a 1.4 percent greater gap at the foreign establishment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What exactly do the minimum wage and exchange rate identification strategies exploit, and what do they identify?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The minimum wage strategy compares multinationals whose headquarters are located in a country or U.S. state that experiences a minimum wage increase (&amp;ldquo;treated&amp;rdquo;) against multinationals whose headquarters are not exposed (&amp;ldquo;control&amp;rdquo;), conditioning on establishments being in the same foreign city. Within the treated group, it also exploits cross-occupation variation: within a given foreign establishment, workers in positions whose headquarters counterparts are more exposed to the minimum wage increase (because their wages are closer to the new minimum) experience larger foreign wage gains. The exchange rate strategy exploits appreciation of a non-U.S. headquarter country&amp;rsquo;s currency against the dollar: when the USD-measured headquarters wage of such a multinational increases following an appreciation, this tests whether foreign establishment wages in USD also rise. Because exchange rates increase and decrease, are less stable than minimum wages, and have different underlying drivers, the exchange rate design provides an independent corroboration of the minimum wage findings. Both strategies identify the effect of externally imposed headquarters wage changes on wages at the same firm&amp;rsquo;s foreign establishments in the same narrowly defined occupation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What evidence is marshaled against indirect pathways (technology changes, employment changes, offshoring) as the driver of foreign wage transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper presents three types of evidence against indirect pathways. First, including headquarters country×year fixed effects in the descriptive wage regressions — which absorbs any technology shocks originating in the headquarters country that affect all occupations uniformly — leaves the estimated wage anchoring coefficient essentially unchanged. Second, event study and panel regressions using the Brazil RAIS data show little change in total employment at multinationals&amp;rsquo; Brazilian establishments following positive external shocks to headquarters wages, which is hard to reconcile with employment-driven or offshoring-driven wage adjustment. Third, a causal forest analysis of the conditional average treatment effect of minimum wage shocks on foreign wages — estimated allowing responses to vary with a wide range of job, employer, sector, and location characteristics — finds that occupation characteristics and sector have little explanatory power for which establishments transmit more, while differences in transmission are more closely related to characteristics of the headquarter-establishment country pair (proximity, similarity, shared language), which are more naturally associated with administrative coordination than with technology or production-style linkages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does occupation skill level moderate wage anchoring, and what does this heterogeneity imply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Wage anchoring is strongest for low-skill occupations. In the descriptive correlations, a 10 percent higher headquarters wage is associated with 2.8 percent higher foreign wages in low-skill jobs (cleaners, drivers, data entry clerks, security guards) but only about 1.2 percent higher foreign wages in both middle-skill and high-skill jobs. The occupation heterogeneity is visible graphically (Figure 1 Panel C) and holds in regressions interacting the headquarters wage with skill-level indicators. A natural interpretation, consistent with the firm-wide wage-setting procedure explanation, is that firms are most likely to apply standardized pay rules to lower-level positions where local market customization may be seen as less important; higher-skill workers may be more likely to have individually negotiated contracts responsive to local conditions. The heterogeneity also implies that the spatial compression effect — wages in foreign establishments being pulled toward headquarters levels — is particularly pronounced at the lower end of the within-firm wage distribution, affecting positions like cleaners and guards in ways that can result in wages that are, relative to GDP per capita, an order of magnitude higher than what headquarters workers in the same position receive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the &amp;ldquo;spatial compression&amp;rdquo; implication and how does it relate to within-firm wage inequality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Wage anchoring implies that workers in the same occupation at foreign establishments located in lower-income countries receive wages that are compressed toward headquarters levels rather than fully adjusted to local wages. The paper shows that nominal wages at foreign establishments average about 89 percent of headquarters wages in the same occupation and year — and about 78 percent for establishments in countries poorer than the headquarter country — a ratio that is roughly stable across the within-firm headquarters wage distribution. This partial equalization is what the authors call &amp;ldquo;across-country wage compression&amp;rdquo;: it reduces the within-multinational cross-country wage dispersion relative to what would arise from purely market-based, locally responsive wage-setting. The spatial compression is consistent with how many firms self-report setting wages: a survey of primarily North American employers (Culpepper &amp;amp; Associates, 2011) found 29 percent report paying the same nominal wages across locations, and several large employers (Amazon, IKEA, Walmart) have self-imposed country-wide wage floors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What role do headquarter-establishment country-pair characteristics play in predicting which establishments exhibit stronger wage transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using a causal forest algorithm to estimate the conditional average treatment effect of a minimum wage shock at headquarters and then constructing above- versus below-median predicted treatment groups, the paper finds that differences in transmission are &amp;ldquo;generally not large&amp;rdquo; but that higher transmission is somewhat associated with characteristics of the headquarter-establishment country pair: pairs that are more closely connected and share more similarities (e.g., common language, closer geographic distance) transmit more. Some foreign-establishment-country characteristics such as inequality and urbanization also appear related. In contrast, occupation characteristics (such as offshorability), the sector the multinational operates in, and characteristics of the headquarter country alone have little explanatory power. The paper notes these findings do not conclusively rule out alternative explanations but are more consistent with administrative coordination channels than with technology- or employment-based ones.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What role do potential fairness preferences and firm-wide wage norms play in the paper&amp;rsquo;s interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors suggest several possible mechanisms through which firm-wide wage-setting procedures could operate. Firms may adopt uniform wage-setting to reduce the menu and information costs of localized wage-setting (Lemieux et al., 2012); to increase foreign worker morale, particularly if workers are averse to pay inequality relative to headquarters peers (Card et al., 2012; Dube et al., 2019); or to respond to fairness preferences from headquarters workers or consumers (Harrison &amp;amp; Scorse, 2010). Survey evidence from Alfaro-Urena et al. (2019) explicitly records that multinationals pay high wages abroad in part to &amp;ldquo;ensure cross-country pay fairness within the MNC.&amp;rdquo; Alternatively, the authors note that firm-wide wage-setting may represent a form of firm inertia or mistakes — an inability or unwillingness to fully adapt pricing and compensation to local contexts — consistent with DellaVigna &amp;amp; Gentzkow (2019). The paper presents this as an open question for future research rather than definitively adjudicating among the explanations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the Brazil RAIS data corroborate and extend the global multinationals findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The RAIS matched employer-employee administrative data cover all employees at each Brazilian establishment of the 44 multinationals in the global dataset that operate in Brazil, with individual-level information on wages, education, race, gender, age, and tenure. Because RAIS is an administrative census of formal-sector employment rather than a consulting firm&amp;rsquo;s client dataset, it provides independent corroboration of the main findings. The paper confirms using RAIS that wages of individual workers at multinationals&amp;rsquo; Brazilian establishments rise abruptly when their foreign headquarters experience positive external shocks. The RAIS data then enable the additional step of examining employment responses, where event study and panel regressions find little change in total employment at multinationals&amp;rsquo; Brazilian establishments following such shocks — evidence against employment- or technology-driven indirect pathways as the primary explanation for wage transmission.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Wage anchoring:&lt;/strong&gt; The practice by which a multinational ties wages at its foreign establishments, for workers in a given occupation, to the wage level at its headquarters for the same occupation. In this paper&amp;rsquo;s usage, anchoring does not mean wages are set identically across locations but that they are partially linked — externally imposed changes in headquarters wages are partially transmitted to foreign establishment wages — rather than being independently set based on local labor-market conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Across-country wage compression:&lt;/strong&gt; The reduction in the cross-country dispersion of wages within a multinational that results from wage anchoring. Because foreign establishment wages are partially pulled toward headquarters levels rather than fully adjusting to local wages, the multinational&amp;rsquo;s within-firm wage distribution is more compressed across countries than it would be under purely localized wage-setting. In the paper&amp;rsquo;s data, this compression is particularly pronounced for low-skill occupations in lower-income host countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm-wide wage-setting procedures:&lt;/strong&gt; Administrative practices, such as applying a single pay scale or a fixed wage ratio across all of a firm&amp;rsquo;s establishments regardless of location, that mechanically link foreign establishment wages to headquarters wages. The paper argues these procedures — rather than correlated technology shocks or employment adjustments — are the proximate driver of wage anchoring, on the basis of the employment non-response in Brazil, the persistence of anchoring after controlling for headquarters-country technology shocks, and the pattern of heterogeneity across country pairs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial transmission:&lt;/strong&gt; A load-bearing qualifier in this paper describing the magnitude of wage anchoring: headquarters wage changes arising from external shocks are not fully extended to foreign workers, but a fraction of the change is passed through. The estimated pass-through in descriptive regressions ranges from about 0.09 to 0.31 depending on specification and sample, and is highest (around 0.28) for low-skill occupations. The partial nature of transmission means that the spatial compression is real but incomplete.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage slope:&lt;/strong&gt; The difference between log average wages paid by an employer to workers in jobs of consecutive skill levels within an occupational category, at a given establishment. The paper documents that the wage slope at foreign establishments is correlated with the wage slope at headquarters — a 10 percent greater consecutive-skill wage gap at headquarters is associated with a roughly 1.4 percent greater gap at the foreign establishment — suggesting that the anchoring extends beyond the level of wages to the internal wage structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;External shocks to headquarter wages:&lt;/strong&gt; Minimum wage increases in the headquarters country or U.S. state, and exchange rate fluctuations that change the USD value of wages set in local currency. These shocks serve as instruments or quasi-experimental sources of variation in headquarters wages that are plausibly exogenous to conditions at foreign establishments, enabling causal identification of the effect of headquarter wage changes on foreign establishment wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Causal forest (heterogeneous treatment effect estimation):&lt;/strong&gt; A machine learning algorithm used in the paper to estimate the conditional average treatment effect of a minimum wage shock at headquarters, allowing the size of the foreign wage response to vary flexibly with a large set of characteristics (job, employer, sector, headquarter country, establishment country, headquarter-establishment country pair). The resulting predicted treatment effect scores are used to construct above- and below-median transmission groups, which are then compared across observable characteristics to identify what predicts stronger wage anchoring.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on NBER Working Paper 26788 (February 2020, Revised April 2025). Source text was truncated after the beginning of Section 4.1 (minimum wage event study analysis); all causal evidence descriptions draw on the introduction and Section 3–4 framing rather than the full Section 4 tables and Section 5 heterogeneity analysis. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Additionality and Asymmetric Information in Environmental Markets: Evidence from Conservation Auctions</title><link>https://macropaperwarehouse.com/papers/additionality-and-asymmetric-information-in-environmental-markets-evidence-from-conservation-auctions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/additionality-and-asymmetric-information-in-environmental-markets-evidence-from-conservation-auctions/</guid><description>&lt;p&gt;This paper investigates the problem of additionality — the likelihood that a conservation action is marginal to (i.e., caused by) an incentive — in the United States Department of Agriculture&amp;rsquo;s Conservation Reserve Program (CRP), one of the largest and most mature Payments for Ecosystem Services (PES) mechanisms in the world. The CRP pays landowners $1.6–$1.8 billion per year under 10-year contracts to retire cropland and plant grass mixes, trees, or wildlife habitats, using a discriminatory scoring auction in which landowners submit bids on a menu of heterogeneous contracts ranked by a scoring rule.&lt;/p&gt;
&lt;p&gt;The central argument is that additionality represents a form of asymmetric information. Landowners possess private knowledge about their counterfactual land use (whether they would have conserved anyway), while the auction screens only on their private cost of accepting the contract. Because lower-cost landowners are lower-cost partly because they expect to conserve regardless of the CRP, cost and additionality are positively correlated — generating adverse selection: the least costly participants to purchase are the least socially valuable. The status quo scoring rule implicitly assumes all landowners are fully additional (tau = 1), an assumption the paper tests and rejects.&lt;/p&gt;
&lt;p&gt;The authors construct a dataset linking confidential administrative CRP bid data across seven auctions from 2009 to 2021 to satellite-derived land use classifications from the Cropland Data Layer (30m resolution) and USDA administrative land use reports. They exploit a regression discontinuity (RD) in contract awards around the winning score threshold to estimate the causal effect of CRP contracts on land use at the margin. The first-stage is close to one. The key finding is that CRP contracts reduce cropping by approximately eight percentage points at the margin, but the 100%-additional benchmark predicts a reduction of roughly 33 percentage points (matching the share of land covered by a contract at the margin). Therefore, only approximately one quarter (22–29%) of marginal auction winners are additional — meaning three-quarters would have conserved without the CRP contract.&lt;/p&gt;
&lt;p&gt;To test for adverse selection, the authors use the 82% of rejected bidders in the 2016 auction (the most restrictive) for whom counterfactual land use is observed, constructing a landowner-specific additionality measure. They document a systematic positive correlation between bid rental rates (reflecting higher costs) and additionality, which persists conditional on rich observable characteristics including prior land use interacted with soil productivity. Contract choice further reveals additionality: tree-related contract bidders exhibit substantially lower additionality than base grassland contract bidders.&lt;/p&gt;
&lt;p&gt;To quantify welfare implications, the authors develop and estimate a joint structural model of bidding and additionality. Costs are inferred via revealed preferences in optimal bidding (following the empirical auctions literature), and additionality is estimated as a conditional expectation function of observable characteristics and unobserved costs, matched to observed land use among rejected bidders via Method of Simulated Moments. Social benefits are taken from the CRP literature and USDA revealed preferences.&lt;/p&gt;
&lt;p&gt;Key welfare findings: (1) Despite widespread non-additionality and adverse selection, a hypothetical uniform-price market for the base conservation contract generates social welfare gains of $14.37 per acre-year at the socially-optimal price. Setting price equal to the full social benefit B — ignoring counterfactual land use — causes welfare losses of $12.68 per acre-year, nearly eliminating the gains. (2) The status quo auction generates social welfare gains of approximately $120 million per auction relative to no market, but implements only 12% of the gains achievable under the efficient allocation. (3) Simple modifications to the scoring rule that incorporate expected additionality — via uniform adjustments and market-size reductions — close 37% of the gap between the status quo and the efficient allocation, increasing social welfare by over $300 million per auction. Nearly all gains arise from incorporating additionality into the scoring rule. These modifications are described as implementable by the USDA in practice.&lt;/p&gt;
&lt;p&gt;Q: What is additionality, and why does it matter for conservation markets?
A: Additionality is defined as the expected impact of contracting on a landowner&amp;rsquo;s conservation action — i.e., the probability that a landowner would not have conserved absent the incentive. Social surplus depends on both a landowner&amp;rsquo;s cost of accepting a contract and her additionality, but market mechanisms screen only on cost. When the lowest-cost participants are the least additional, standard procurement mechanisms fail to implement the efficient allocation, undermining the environmental and fiscal effectiveness of conservation programs.&lt;/p&gt;
&lt;p&gt;Q: What is the rate of additionality at the margin of CRP contract awards?
A: Approximately one quarter (22–29% depending on specification) of marginal auction winners are additional. The RD design shows contracts reduce cropping by about eight percentage points at the margin, compared to the 100%-additional benchmark of approximately 33 percentage points (the share of land covered by the contract at the margin). This implies three-quarters of marginal winners would have conserved without a CRP contract.&lt;/p&gt;
&lt;p&gt;Q: What is the empirical evidence for adverse selection?
A: Among rejected bidders in the 2016 auction — where additionality is directly observed for 82% of bidders — there is a systematic positive correlation between bid rental rates (reflecting higher costs of accepting the contract) and additionality. This correlation persists conditional on rich observable characteristics, including prior land use interacted with soil productivity estimates. Contract choice also reveals additionality: bidders selecting tree-related contracts have substantially lower additionality than those choosing base grassland contracts.&lt;/p&gt;
&lt;p&gt;Q: How does soil productivity relate to additionality?
A: USDA-constructed soil productivity estimates, which approximate the earning potential of a parcel, are predictive of additionality in practice, consistent with theory. Higher soil productivity is associated with lower additionality — landowners with less productive land are more likely to conserve regardless of the CRP. Soil productivity is not currently incorporated into the CRP scoring rule to rank bidders.&lt;/p&gt;
&lt;p&gt;Q: How is the RD design validated?
A: The histogram of normalized score distributions shows no bunching at the winning threshold, validating that bidders do not know the exact ex-post threshold realization. Pre-period RD coefficients are indistinguishable from zero in both the remote sensing and administrative land use data. The first stage (share of bidders with a CRP contract just above the threshold) is close to one. Treatment effect magnitudes are stable over the 10-year contract period with no evidence of attenuation, and there are no spillovers to non-bid fields.&lt;/p&gt;
&lt;p&gt;Q: What do the social welfare calculations show for a uniform-price market?
A: Despite widespread non-additionality and adverse selection, a hypothetical uniform-price market for the base conservation contract generates social welfare gains of $14.37 per acre-year at the socially-optimal uniform price. However, setting price equal to the full social benefit B — as the status quo implicitly does by assuming tau = 1 — causes welfare losses of $12.68 per acre-year, nearly eliminating all gains.&lt;/p&gt;
&lt;p&gt;Q: How does the status quo auction perform relative to the efficient benchmark?
A: The status quo auction generates social welfare gains of approximately $120 million per auction relative to no market. The efficient allocation, which awards contracts based on both landowner costs and expected social benefits (incorporating additionality), would be substantially larger. The status quo implements only 12% of the social welfare gains achievable under the efficient allocation.&lt;/p&gt;
&lt;p&gt;Q: Can the efficient allocation be implemented by any mechanism?
A: Not necessarily. Implementing the efficient allocation requires that the expected net social surplus function B·tau(c) - c be monotonically decreasing in cost, so that a standard incentive-compatible auction can rank bidders appropriately. If lower-cost landowners are sufficiently less additional that the allocation rule is non-monotone in cost, no incentive-compatible mechanism can implement the efficient allocation (per Myerson 1981). Empirically, the authors find that for the base contract the efficient allocation is in the implementable case (similar to their Figure 1a), but implementing it exactly via an incentive-compatible auction remains complex.&lt;/p&gt;
&lt;p&gt;Q: What alternative auction designs are proposed, and how much do they improve welfare?
A: The authors propose alternative scoring rules that incorporate expected additionality — through uniform adjustments to the scoring rule, reductions in market size, and differentiation among heterogeneously additional landowners based on observables such as soil productivity and contract choice. These simple modifications close 37% of the gap between the status quo and the efficient allocation, increasing social welfare by over $300 million per auction. Nearly all gains come from incorporating additionality into the scoring rule, with a large share accruing through simple uniform adjustments.&lt;/p&gt;
&lt;p&gt;Q: How is the structural model of bidding estimated?
A: Estimation proceeds in three steps. First, beliefs about the winning score threshold distribution are estimated by simulating auctions via resampling (following Hortacsu 2000). Second, landowner costs are estimated via Maximum Simulated Likelihood using revealed preference inequalities from optimal bidding in the scoring auction. Third, the additionality conditional expectation function is estimated via Method of Simulated Moments, matching observed additionality levels, its distribution across rejected bidders, its covariance with scores, and its distribution by contract choice.&lt;/p&gt;
&lt;p&gt;Q: What sources of scoring rule variation identify the model?
A: Three sources are used. A mid-mechanism policy change in the 2021 auction added carbon sequestration payments differentially across contracts, providing two bids from the same bidders under different scoring rules. A policy change around 2011 shifted Wildlife Priority Zone (WPZ) bonus points to be contract-specific. Air Quality Zone (AQZ) status shifts the level of the score. These sources provide variation in relative payments across contracts, though the authors note the variation is modest and rely also on parametric extrapolation.&lt;/p&gt;
&lt;p&gt;Q: What assumptions are required for identification and how robust are results?
A: Key assumptions include perfect compliance (validated by inspection of over 1,000 aerial photographs), no spillovers to non-bid fields (validated in Table 2), and stability of the additionality function tau(z,c,kappa) across auction years. The authors assess robustness to alternative functional forms of tau, conduct a non-parametric inversion exercise across cost quantiles, and construct alternative scoring rules using cross-auction and cross-tract variation to probe the stability assumption. Model-implied additionality at the RD margin (23%) closely matches the empirical RD estimate.&lt;/p&gt;
&lt;p&gt;Q: Are the adverse selection and additionality findings specific to the 2016 auction?
A: The 2016 auction provides the most complete view because bid fields are observed and 82% of bidders are rejected. But cross-auction evidence replicates the core patterns. RD estimates exploiting threshold variation across auctions show additionality ranging from 10–20% among lower bidders to 40–50% among higher bidders across auctions, consistent with adverse selection. Tree-contract null RD effects replicate across all auctions. Cross-tract cropping rates show similar observable heterogeneity across auctions.&lt;/p&gt;
&lt;p&gt;Q: What is the social welfare impact of the market for conservation existing at all?
A: Theoretically ambiguous because non-additional landowners may receive transfers without generating social value, and adverse selection may tilt the market toward low-additionality participants. Empirically, despite these concerns, there exist positive social welfare gains of $14.37 per acre-year at the socially-optimal uniform price for the base contract, indicating that conservation markets of this type can improve welfare even in the presence of substantial non-additionality and adverse selection.&lt;/p&gt;
&lt;p&gt;Additionality: The expected impact of contracting on a landowner&amp;rsquo;s conservation action — formally, tau(c) = E[1 - a_i0 | c = c_i], the probability that a landowner would not have conserved absent the incentive. A landowner is additional if she would have cropped without the CRP contract; the social benefit of contracting depends only on this incremental conservation impact.&lt;/p&gt;
&lt;p&gt;Adverse Selection: The positive correlation between landowner cost of accepting a contract and additionality. Because landowners with low costs are low-cost partly because they expected to conserve regardless of the program, lower-cost participants are less socially valuable. This upward-sloping contract value curve mirrors adverse selection in insurance markets as modeled by Einav, Finkelstein, and Cullen (2010).&lt;/p&gt;
&lt;p&gt;Contract Value Curve: The function B·tau(F^{-1}_C(q)) plotting the expected social value of contracting at each quantile q of the cost distribution. It lies below the social benefit B due to non-additionality and slopes upward due to adverse selection. The vertical distance between the contract value and marginal cost curves equals expected social surplus B·tau(c) - c.&lt;/p&gt;
&lt;p&gt;Efficient Allocation: The allocation that maximizes expected social surplus B·tau(c) - c by awarding contracts to landowners for whom this quantity is positive. Implementing this allocation via an incentive-compatible mechanism requires that B·tau(c) - c be monotonically decreasing in cost; if not, no standard mechanism can achieve it.&lt;/p&gt;
&lt;p&gt;Scoring Rule: The known function s(b_i, z^s_i) that converts a landowner&amp;rsquo;s multi-dimensional bid (rental rate and contract choice) and observed characteristics into a score, determining contract awards. The status quo scoring rule implicitly assumes full additionality (tau = 1), ranking bidders as if all conservation actions are marginal to the incentive.&lt;/p&gt;
&lt;p&gt;Source Text Origin: The classification of the text on which a summary is based — &amp;ldquo;pdf&amp;rdquo; or &amp;ldquo;oa-html&amp;rdquo; for full working paper text, or &amp;ldquo;abstract-only&amp;rdquo; which is blocked from summarization. Determines the validity and completeness of any summary produced.&lt;/p&gt;</description></item><item><title>Aggregate demand externality and self-fulfilling default cycles</title><link>https://macropaperwarehouse.com/papers/aggregate-demand-externality-and-self-fulfilling-default-cycles/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/aggregate-demand-externality-and-self-fulfilling-default-cycles/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do corporate defaults cluster in recurring episodes rather than occurring smoothly? The paper asks whether observable fundamental factors — firm characteristics and macroeconomic variables — are sufficient to account for the clustered default patterns documented in the data, and, if not, what theoretical mechanism can explain them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Motivation.&lt;/strong&gt; Using Moody&amp;rsquo;s historical default rate data, the authors document that the long-run average corporate bond default rate during 1866–2008 was approximately 1.50%, yet defaults were highly episodic: the worst three-year period during the Great Depression totaled 12.88%, and the three-year period 1873–1875 after the railroad boom reached 35.80%. A Markov switching regression on post-war default rate data (1951–2017) strongly rejects a linear no-switch model in favor of a two-regime model across all information criteria (AIC, HQ, SC, and log-likelihood). The estimated high-default regime has a mean default rate of 1.93% (unconditional mean µ/(1−ρ)) — roughly eight times the 0.23% mean of the low-default regime — and a standard deviation nearly six times larger. The high-default regime persists on average 5.81 years (transition probability of staying ≈ 0.83), while the low-default regime lasts approximately 7.52 years (staying probability ≈ 0.87).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors build a continuous-time general equilibrium model with Dixit-Stiglitz monopolistic competition (CES aggregation with elasticity σ) and an endogenous entry/exit/default mechanism. Households are risk-neutral and also act as entrepreneurs. At each instant, δµ new project blueprints are invented; entrepreneurs borrow to invest, then face an idiosyncratic liquidity shock z drawn from a Pareto distribution G(z). Entrepreneurs continue if z ≤ Z*, a cutoff determined by the continuation value of the firm, and default otherwise. Continuing firms become monopolists for a new variety until that variety becomes obsolete at a Poisson rate δ. Each operating firm must borrow working capital constrained by its firm value Vt (collateral constraint wtnjt ≤ θVjt). The entire equilibrium reduces to a two-dimensional dynamical system in (Mt, Vt), where Mt is the number of operating firms (state variable) and Vt is the firm value (control variable).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanism — Demand Externality and Positive Feedback.&lt;/strong&gt; Under CES aggregation, each firm&amp;rsquo;s gross revenue is y_jt^(1–1/σ) · Y_t^(1/σ), making individual firm revenue increasing in aggregate output Yt. A decline in Yt lowers firm profits and firm value Vt, which raises the default threshold Z* and increases the fraction of projects that are abandoned. Fewer operating firms further depress Yt, closing a positive feedback loop. This static strategic complementarity (through CES) is combined with dynamic strategic complementarity through the borrowing constraint: higher expected future firm value relaxes current working capital constraints, raising current production.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multiple Equilibria and Global Dynamics.&lt;/strong&gt; The two-locus phase diagram (˙Mt = 0 and ˙Vt = 0) yields multiple intersections — and hence multiple steady states — when productivity A lies in an intermediate range (A &amp;lt; A &amp;lt; Ā). When A &amp;gt; Ā, a single good saddle-point equilibrium exists. When A &amp;lt; A, no equilibrium can be sustained. In the intermediate range, a good steady state (low default rate, high firm value) coexists with a bad steady state (high default rate, low firm value). The good steady state is always a saddle; the bad steady state is a sink (locally indeterminate, κ &amp;lt; κ_Hopf) or a source (locally determinate but globally indeterminate, κ &amp;gt; κ_Hopf), depending on parameter κ = 1 + (θ + ρ)/δ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bogdanov-Takens Bifurcation.&lt;/strong&gt; Using global dynamical methods, the paper demonstrates richer indeterminacy than local analysis permits. Near the Bogdanov-Takens point (κ, Ā), the system can exhibit: (a) infinite equilibrium trajectories converging to the bad steady state; (b) saddle-loop bifurcation at κ = κ_SL ≈ 14.25 (under the baseline calibration); (c) stable or unstable periodic orbits for κ ∈ (κ_Hopf, κ_SL) — endogenous business cycles in a perfect-foresight equilibrium; and (d) multiple trajectories from near the source that converge to the good saddle equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Simulation of Clustered Defaults.&lt;/strong&gt; With a two-state Markov process for productivity (Ah = 10, Al = 9.34) and pessimistic sentiment shifts (the &amp;ldquo;ugly&amp;rdquo; state), the model replicates the cluster pattern: in the good/high-productivity state, the default rate is near zero; when productivity falls to low and sentiment turns pessimistic, the default rate can spike to approximately 12%, consistent with the Great Depression observation. Critically, the paper shows that the cluster pattern is generated only under global dynamics — restricting to local dynamics produces substantially smaller fluctuations in the default rate, confirming that the ugly (sink) equilibrium is essential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy.&lt;/strong&gt; A countercyclical subsidy to non-defaulting entrants — financed by a lump-sum tax, calibrated as tr(Vt) = τ(VG − Vt) — shifts the ˙Mt = 0 locus downward and can eliminate the bad steady state entirely, leaving only the good saddle-path equilibrium. The paper provides a closed-form sufficiency condition for τ (Proposition 7).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Multiple equilibria require: (i) productivity in the intermediate range A &amp;lt; A &amp;lt; Ā; (ii) the elasticity of substitution σ not too large (below a threshold σ̄ that itself depends on µ); (iii) the borrowing constraint binding (δ &amp;gt; θσ/((σ–1)κ), which can always be ensured by choosing δ sufficiently large). Clustered defaults in the simulation require the joint occurrence of a negative fundamental shock (productivity falling from high to low) and a shift to pessimistic sentiment; either factor alone generates only limited default amplification.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the core empirical motivation for the model, and what does the regime-switching analysis establish?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper documents that the corporate bond default rate, drawn from Moody&amp;rsquo;s data covering 1866–2008, clusters sharply in episodes: the long-run average is 1.50%, yet the worst three-year period of the Great Depression totaled 12.88% and 1873–1875 reached 35.80%. A Markov switching regression on 1951–2017 data strongly rejects a linear no-regime-switch model across all four criteria (log-likelihood, AIC, HQ, SC). The two-regime model identifies a high-default regime with unconditional mean 1.93% and standard deviation roughly six times the low-default regime&amp;rsquo;s, a persistence probability of approximately 0.83 (duration ≈ 5.81 years), and a low-default regime with unconditional mean 0.23% and persistence approximately 0.87 (duration ≈ 7.52 years). The regime-switching result supports the prior literature&amp;rsquo;s claim (Das et al. 2007; Duffie et al. 2009; Azizpour et al. 2018) that observable fundamentals alone cannot account for clustered defaults.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How does the Dixit-Stiglitz CES structure generate a demand externality that links aggregate output to individual firm default decisions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under CES aggregation with elasticity σ, each firm&amp;rsquo;s gross revenue equals y_jt^(1–1/σ) · Y_t^(1/σ) (equation 7), so aggregate output Yt directly enters individual firm revenue. Each firm takes Yt as given, yet the aggregation of all firms&amp;rsquo; output determines Yt. When aggregate output falls — because more firms have defaulted and exited production — each remaining firm&amp;rsquo;s revenue and profit fall, reducing the firm&amp;rsquo;s continuation value Vt. A lower Vt tightens the borrowing constraint (wtnjt ≤ θVjt), reduces working capital, and raises the probability that the firm&amp;rsquo;s idiosyncratic liquidity shock will exceed the default threshold Z*, producing further defaults. This positive feedback constitutes the demand externality: individual firms&amp;rsquo; decisions are strategic complements, both statically (through CES demand) and dynamically (through the borrowing constraint on working capital).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What is the two-dimensional dynamical system that summarizes the equilibrium, and what do the two loci look like in the phase diagram?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The entire equilibrium reduces to two differential equations in (Mt, Vt): ˙Mt = –δ[Mt – µG(Z(Vt))] and ˙Vt = κδVt[1 – F(Vt, Mt)], where F captures the ratio of monopoly profit to firm value including the borrowing constraint. The ˙Mt = 0 locus slopes strictly upward because a higher firm value Vt raises the default cutoff Z* and lowers the fraction of entrants who default, so more firms survive and Mt rises until absorption equals entry. This locus has a minimum at Mm = µG(zm) because firm value must exceed the threshold that sustains the credit market. The ˙Vt = 0 locus is non-monotonic: it first slopes upward (more firms raise aggregate demand and profit through the scale/externality channel) and then slopes downward (more firms tighten the labor market, raising wages and lowering profits). The two opposing channels make the ˙Vt = 0 locus hump-shaped, creating the possibility of two intersections and hence two steady states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. Under what conditions do multiple steady states exist, and what does each look like?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Multiple steady states exist when productivity A satisfies A &amp;lt; A &amp;lt; Ā, where A and Ā are closed-form thresholds given by Equations (A.3) and (A.4), and the elasticity of substitution σ is below a threshold σ̄ (Equation A.5). When A &amp;lt; A, neither locus intersects and no equilibrium is sustainable. When A &amp;gt; Ā, a single good saddle-point equilibrium exists. In the multiple-equilibria range, the good steady state has a higher firm value and a smaller fraction of firms defaulting; the bad steady state has a lower firm value and a higher default rate. Under the paper&amp;rsquo;s numerical calibration (A = 10, η = 6.5, Zmin = 0.88), the low default rate at the good steady state is approximately 1.5% and the high default rate at the bad steady state is between 12% and 13%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What are the local dynamics around each steady state, and how does parameter κ determine whether the bad steady state is a sink or a source?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 5 shows that the good steady state is always a saddle point, ensuring a unique convergent path for initial Mt near Mg_0. The bad steady state&amp;rsquo;s local nature depends on κ = 1 + (θ + ρ)/δ and the critical value κ_Hopf = 1 + ψ/(θMb_0Vb_0). When κ is between 1 and κ_Hopf, the Jacobian trace is negative and the bad steady state is a sink with one order of indeterminacy: given Mt close to Mb_0, infinitely many initial values of the control variable Vt satisfy all equilibrium conditions. When κ &amp;gt; κ_Hopf, the bad steady state is a source point; the economy diverges from it. Because κ does not affect the steady-state locations (Proposition 3), one can vary κ to change the dynamic character without moving the equilibria in the phase diagram.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What does the global dynamics analysis reveal that local analysis misses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Global analysis via Bogdanov-Takens bifurcation (Proposition 6) reveals three classes of dynamics absent from local analysis. First, even in the saddle-source case (locally determinate), there exist multiple equilibrium trajectories diverging from near the bad (source) steady state and converging to the good (saddle) steady state; these paths satisfy all equilibrium conditions including transversality but are incorrectly ruled out by local methods. Second, at the critical value κ_SL ≈ 14.25 (under the baseline calibration), a homoclinic saddle-loop orbit connects the saddle point to itself — all trajectories interior to the loop converge to the bad steady state. Third, for κ between κ_Hopf and κ_SL, periodic orbits arise in a perfect-foresight equilibrium with no external shocks. For example, at κ = 14.9, the phase diagram displays a unique periodic orbit around the bad steady state, with two distinct initial values of Vt for any given Mt near the orbit — endogenous, perpetual oscillations without any exogenous driving force. Numerical experiments confirm that Mt = 0.23 admits two rational-expectations values of Vt (2.09 and 3.55) on the saddle path alone, illustrating abundant indeterminacy even at the endpoint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How does the paper simulate the clustered default pattern and what is the role of the &amp;ldquo;ugly&amp;rdquo; equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper constructs a three-state Markov economy: &amp;ldquo;good&amp;rdquo; (high productivity Ah = 10, single saddle equilibrium, near-zero default rate), &amp;ldquo;bad&amp;rdquo; (low productivity Al = 9.34, saddle-path equilibrium, modestly elevated defaults), and &amp;ldquo;ugly&amp;rdquo; (low productivity, sink-path equilibrium, sharply elevated defaults). The ugly state is reached when, upon a productivity decline, firms adopt pessimistic expectations and the economy slides to the high-default sink instead of remaining on the low-default saddle path. Transition probabilities are set so that the average ugly-state duration is approximately 6 years and roughly 45% of periods are ugly, consistent with the regime-switching estimates. With Zmin = 0.2 and η = 15, the ugly-state default rate can reach approximately 12%, matching the Great Depression observation. The counterfactual experiment deletes the ugly state (pGU = 0) and resets pGB = 0.45: the resulting default rate stays close to zero with no cluster pattern, demonstrating that global dynamics (the ugly sink) rather than the fundamental shock alone generate the clustering.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. Can purely sentiment-driven cycles generate the clustered default pattern?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Section 6.2 fixes productivity at a low level (A = 9.53) and drives switches between the bad (saddle path) and ugly (sink path) states by pure sentiment shocks alone (πBU and πUB). The simulated default rate does spike upward when sentiment turns pessimistic, but the rises are generally more modest than in the combined fundamental-plus-sentiment exercise, and the default rate can no longer be characterized as countercyclical. The authors conclude that the realistic observed default cluster is the result of a combination of negative fundamental shocks and pessimistic sentiment shifts; either ingredient alone is insufficient to replicate all features of the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. How does the collateral constraint on working capital create dynamic strategic complementarity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Jermann and Quadrini (2012), Liu and Wang (2014), and Lian and Ma (2021), each operating firm must borrow to pay wages each period, subject to the constraint wtnjt ≤ θVjt. Since Vt is forward-looking (the discounted present value of the firm&amp;rsquo;s monopoly profit stream), optimistic expectations about future output raise Vt, relax the borrowing constraint, allow firms to hire more labor and produce more output today, and thereby validate optimism. This intertemporal complementarity means that the equilibrium is sensitive not only to current fundamentals but also to beliefs about the future, opening the channel for sentiment-driven multiple equilibria and self-fulfilling cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. What is the policy remedy for the bad equilibrium, and how does it work?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 7 establishes that a countercyclical lump-sum-tax-financed subsidy to non-defaulting entrants, tr(Vt) = τ(VG − Vt), with τ exceeding a computable threshold, eliminates the bad steady state. The subsidy works by effectively raising the value of continuing for a firm at any given Vt and Mt, shifting the ˙Mt = 0 locus downward until it lies below the ˙Vt = 0 locus everywhere in the relevant range, eliminating the second intersection and leaving only the good saddle-path equilibrium. The numerical illustration uses parameters from Section 6 with A = 9.67 and τ = 1/3 to demonstrate that the bad steady state vanishes and the phase diagram has a single equilibrium. The subsidy is self-limiting: in normal conditions when firm value is already high (Vt ≈ VG), the transfer is near zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. How does this paper differ from Cui and Kaas (2021), the most closely related predecessor?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Cui and Kaas (2021) show default cycles from self-fulfilling beliefs in a fully competitive firm environment, focusing on intertemporal default coordination. The present paper differs in three respects. First, firms engage in monopolistic competition under CES preferences, and the main novel mechanism is cross-firm default contagion through the demand externality — which can produce multiple equilibria even in a static setting, without any intertemporal coordination. Second, the paper examines the joint role of fundamental shocks and aggregate-demand externalities together, showing that multiple equilibria arise only in the presence of sufficiently low productivity (A &amp;lt; A &amp;lt; Ā), making indeterminacy contingent on external fundamentals rather than structural parameters alone. Third, the continuous-time framework with full global analysis via Bogdanov-Takens bifurcation allows characterization of periodic orbits and the interaction of the ugly sink path with Markov productivity regimes — dynamics not covered in Cui and Kaas (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. What is the markup prediction of the model, and is it consistent with empirical evidence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Dixit-Stiglitz CES with elasticity σ, the equilibrium markup of each intermediate good equals σ/(σ–1) at the firm level. However, the measured gross markup — which includes the effective collateral constraint — is predicted to comove positively with the default rate in the model, and hence the markup is countercyclical. The paper notes this is consistent with the well-documented empirical regularity in Bils (1987) and Rotemberg and Woodford (1999). Additionally, the model replicates the finding in Gilchrist and Zakrajšek (2012) that a low default rate is associated with a high firm entry rate.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Demand Externality (Dixit-Stiglitz type).&lt;/strong&gt; In the paper&amp;rsquo;s sense, this is the mechanism by which individual firms&amp;rsquo; revenues depend on aggregate output Yt through the CES aggregator: each firm&amp;rsquo;s gross revenue is y_jt^(1–1/σ) · Y_t^(1/σ). Each firm takes Yt as given, but the aggregation of all firms&amp;rsquo; output determines Yt. This creates a positive spillover: more operating firms raise aggregate output, which raises each firm&amp;rsquo;s revenue, and vice versa. The paper uses this as the central transmission channel for self-fulfilling defaults, in contrast to prior literature that emphasized debt networks or asymmetric information contagion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-Fulfilling Default Cycle.&lt;/strong&gt; A dynamic equilibrium path in which pessimistic expectations about aggregate output are validated: if firms anticipate that more other firms will default (lowering Yt), their own continuation value Vt falls, raising the probability that their idiosyncratic liquidity shock will exceed the default threshold, increasing actual defaults, further lowering Yt, and so on. The paper distinguishes this from shock-amplifier stories by constructing a model with multiple rational-expectations equilibria in which the aggregate default rate is determined in part by initial beliefs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bogdanov-Takens Bifurcation.&lt;/strong&gt; A mathematical tool for global dynamics analysis applied to two-dimensional continuous-time systems. In the paper, it is used to characterize system behavior when the parameters (κ, A) are near the point (κ̄, Ā) at which the Jacobian has two zero eigenvalues. Near this point, the system can exhibit saddle-loop bifurcations, Hopf bifurcations, homoclinic orbits, and stable or unstable periodic orbits — all of which are invisible to local linearization analysis. The paper uses this to establish that indeterminacy is more pervasive than local analysis suggests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Good / Bad / Ugly Steady States.&lt;/strong&gt; In the paper&amp;rsquo;s three-regime framework: the &amp;ldquo;good&amp;rdquo; state is the unique saddle-point equilibrium under high productivity Ah, with near-zero default rates; the &amp;ldquo;bad&amp;rdquo; state is the saddle-path equilibrium under low productivity Al, with modestly elevated defaults; the &amp;ldquo;ugly&amp;rdquo; state is the sink-path equilibrium under low productivity, characterized by self-fulfilling high default rates (up to ~12%). The ugly state is reached only when pessimistic sentiment coincides with the low-productivity regime, and it is the ugly state that generates the cluster pattern in simulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collateral Constraint on Working Capital.&lt;/strong&gt; The firm-level borrowing constraint wtnjt ≤ θVjt, where θ is the collateral ratio and Vjt is the firm&amp;rsquo;s continuation value. This constraint means that higher expected future profits — by raising Vt — relax the current borrowing limit, increase current labor demand and output, and create dynamic strategic complementarity between current and future production. It is this constraint, combined with the CES demand externality, that makes the dynamical system two-dimensional and generates the non-monotonic ˙Vt = 0 locus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global Indeterminacy.&lt;/strong&gt; The existence, given an initial state variable Mt, of multiple equilibrium trajectories — each satisfying all equilibrium conditions including transversality — that converge to different steady states or follow periodic paths. In the paper, global indeterminacy arises even when the system is locally determinate (e.g., in the saddle-source case): trajectories diverging from near the source steady state can converge to the saddle steady state along multiple paths, none of which is detectable by local linearization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Periodic Orbit (Endogenous Cycle).&lt;/strong&gt; In the paper, a closed trajectory in the (Mt, Vt) phase plane that the economy follows indefinitely in perfect-foresight equilibrium without any exogenous shocks. Such orbits exist for κ ∈ (κ_Hopf, κ_SL), are stable if S &amp;lt; 0 and unstable if S &amp;gt; 0 (where S is a computable quantity defined in Equation A.13). Their existence demonstrates that business cycles can arise purely from internal forces — the demand externality and borrowing constraint — consistent with the view in Beaudry, Galizia, and Portier (2020).&lt;/p&gt;</description></item><item><title>Aggregation and the Estimation of Quality Change</title><link>https://macropaperwarehouse.com/papers/aggregation-and-the-estimation-of-quality-change/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/aggregation-and-the-estimation-of-quality-change/</guid><description>&lt;p&gt;Errico and Lashkari address two intertwined problems in the measurement of aggregate price indices: how to account for quality change and variety entry/exit when the demand system is not CES, and how to identify flexible demand systems from prices and market shares alone when supply and demand shocks are correlated. The paper makes a theoretical contribution and a methodological one, then applies both to the measurement of US import price inflation over 1989–2016.&lt;/p&gt;
&lt;p&gt;The theoretical contribution generalizes the unified CES price index of Redding and Weinstein (2020a) and the Feenstra (1994) variety correction to the full class of smooth, invertible demand systems. The key insight is that the contribution of quality change to the aggregate price index depends on heterogeneous cross-product elasticities of substitution, not a single scalar as in the CES case. For practical implementation, the paper specializes to the Homothetic with Aggregator (HA) family of demand systems — which includes Kimball (1995), CRESH (Hanoch, 1971), and HSA (Matsuyama and Ushchev, 2017) — showing that within this family cross-product elasticities collapse to product-level elasticities, dramatically reducing dimensionality. The resulting approximate price index (Proposition 2) weights each product by its love-of-variety index 1/(epsilon_it − 1), departing from the uniform CES weighting.&lt;/p&gt;
&lt;p&gt;The methodological contribution is a dynamic panel (DP) identification strategy that exploits the Markov structure of quality shocks. The paper assumes that innovations to product quality are mean-zero conditional on lagged prices. Under flexible pricing, firms maximize current-period profits without regard to future demand shocks, so lagged prices are valid instruments for current prices. This permits identification of rich demand systems without external cost instruments and without the conventional assumption of uncorrelated supply and demand shocks. The conventional Feenstra–Broda–Weinstein (FBW) approach imposes zero correlation between quality shocks and prices; the paper shows that when quality and marginal cost are positively correlated, FBW produces downward-biased elasticity estimates (endogeneity bias).&lt;/p&gt;
&lt;p&gt;The empirical application constructs a dataset covering 155 time-consistent 5-digit NAICS industries over 1989–2018, matching US customs import data with domestic production data and treating country-of-origin varieties as the unit of observation. The paper estimates both CES and Kimball demand systems using the DP approach and compares them to FBW estimates.&lt;/p&gt;
&lt;p&gt;Key quantitative findings: First, DP-estimated CES elasticities are larger on average than FBW estimates (weighted mean 5.99 vs. 4.62), confirming a downward endogeneity bias in conventional methods. Second, Kimball mean elasticities exceed CES estimates (weighted mean 3.11 for Kimball vs. 5.99 for CES at the industry level, but the Kimball distribution has a mean of 17.0 and median 4.70), reflecting a heterogeneity bias — CES understates the dispersion of elasticities and thereby understates the elasticity relevant for the base (domestic) product whose market share is declining. Third, quality improvements in imported goods reduced the US import price index by approximately 20.2 percentage points cumulatively (0.67 p.p. annually) under Kimball demand, and 15.9 percentage points cumulatively (0.53 p.p. annually) under CES demand, over 1989–2018. The headline figure cited in the abstract is approximately 0.7 p.p. annually. The aggregate import price index (price plus quality components combined) fell by 8.25 p.p. cumulatively under Kimball and 4.01 p.p. under CES, compared to a BEA PCE index increase of 57.8 p.p. over the same period. Sectorally, machinery and electrical equipment account for roughly 60% of total quality gains (~200 p.p. cumulative). By country, China accounts for approximately 35% of cumulative quality gains, with non-OECD countries collectively contributing ~59%, and China&amp;rsquo;s quality upgrading accelerating after WTO accession.&lt;/p&gt;
&lt;p&gt;Validation using US automobile market data (1980–2018) confirms the DP identification assumption: controlling for current product characteristics, future characteristics are uncorrelated with current prices. The DP approach produces elasticity estimates and quality change measures similar to those obtained using real exchange rate cost-shock instruments, and the Kimball demand closely matches mixed logit (BLP) estimates of both price elasticities and price indices. CES estimates exhibit a measurable downward heterogeneity bias in this validation setting, which the paper traces theoretically and empirically to a positive covariance between demand elasticities and price volatility across products.&lt;/p&gt;
&lt;p&gt;Scope conditions: results apply to homothetic (income-invariant) demand; nonhomothetic extensions are provided as a generalization (Proposition 4) but not the primary focus. The import price index measures the cost of imports conditional on given domestic consumption; it does not capture full consumption-side welfare effects including substitution away from domestic varieties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core theoretical result on price index measurement beyond CES?&lt;/strong&gt;
Proposition 1 shows that for any smooth, invertible demand system satisfying the connected substitute property, the change in the log aggregate price index can be approximated as a weighted sum of log price changes and log expenditure share changes, with the expenditure share changes premultiplied by the inverse of the matrix Psi_t capturing cross-product elasticities of substitution. In the CES special case this reduces to the scalar (1/(sigma−1)) weight of the Redding-Weinstein (2020a) CUPI. The key departure in general demand is that the weight applied to each product&amp;rsquo;s expenditure share change is heterogeneous and depends on the full matrix of cross-product substitutabilities, not a single constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the HA (Homothetic with Aggregator) family simplify the theoretical results?&lt;/strong&gt;
For HA demand — which nests Kimball, CRESH, and HSA — Lemma 1 establishes that cross-product elasticities sigma_ij depend only on product-level elasticities epsilon_i through simple analytic formulas (e.g., epsilon_i * epsilon_j / epsilon-bar for HDIA), reducing the estimation problem from an N×N matrix to a vector of N scalars. Proposition 2 then gives an approximate price index in which each product&amp;rsquo;s expenditure share change is weighted by its love-of-variety index 1/(epsilon_it − 1), rather than a common CES scalar. This is the operative formula for the Kimball application.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the endogeneity bias in conventional elasticity estimation and how large is it?&lt;/strong&gt;
Conventional FBW methods assume supply and demand shocks are uncorrelated; when quality improvements are positively correlated with product prices (e.g., higher-quality goods command higher prices and also have higher marginal costs), FBW estimates are biased downward. The paper documents this: for CES demand, the DP-estimated weighted mean elasticity is 5.99 versus 4.62 under FBW, and for median estimates the DP value is 4.27 versus 2.58 under FBW, across 155 industries. The bias matters because underestimated elasticities imply underestimated quality changes and a smaller quality correction to the price index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the heterogeneity bias and how does it differ from the endogeneity bias?&lt;/strong&gt;
Even after correcting for endogeneity, CES demand imposes a single elasticity per industry, ignoring the cross-product distribution. The paper shows that the CES estimate is an average that does not correctly capture the behavior of the base product (the domestic US variety) whose market share is declining. Because the domestic variety tends to have a lower elasticity than the import average, CES understates this product&amp;rsquo;s love-of-variety index and thereby understates the quality correction attributable to rising import shares. Theoretically and empirically (Appendix E.4), this bias is larger when demand elasticities covary positively with price volatility across products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the dynamic panel identification assumption and why does it hold under flexible pricing?&lt;/strong&gt;
The paper assumes that quality shock innovations u_it are mean-zero conditional on lagged log prices: E[u_it | log p_it−1] = 0. Under flexible pricing, firms maximize current-period profits using current variables only; current prices are determined by current quality but are not chosen in anticipation of future quality shocks. Therefore lagged prices are uncorrelated with future quality innovations, making them valid instruments for current prices. This assumption is validated empirically in the automobile market: controlling for current product characteristics (horsepower, weight, fuel economy), future characteristics are not correlated with current prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the headline findings on quality change in US import prices?&lt;/strong&gt;
Under Kimball demand, quality improvements in imported goods reduced the US import price index by 20.2 percentage points cumulatively over 1989–2018, equivalent to 0.67 p.p. annually (the abstract rounds this to approximately 0.7 p.p. annually). Under CES demand, the quality contribution is 15.9 p.p. cumulatively (0.53 p.p. annually). The aggregate import price index combining price and quality changes fell by 8.25 p.p. under Kimball and 4.01 p.p. under CES over the same period. These figures imply that official import price statistics substantially overstate import price inflation by failing to account for quality improvements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Which sectors and countries drive the quality gains?&lt;/strong&gt;
Machinery and electrical equipment account for approximately 60% of total cumulative quality gains, with roughly 200 p.p. cumulative quality improvement in that sector. Computer and peripheral equipment (NAICS 3341) is a notable contributor — the official import-to-producer price ratio shows a nearly five-fold increase between 1989 and 2018, but after quality adjustment this ratio reverses direction. By country of origin, China accounts for approximately 35% of cumulative quality gains; other non-OECD countries collectively contribute approximately 59%; OECD countries contribute approximately 7%. China&amp;rsquo;s quality upgrading is documented to accelerate following its WTO accession.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Why does CES understate the quality correction relative to Kimball?&lt;/strong&gt;
The primary mechanism is that the US domestic variety — which serves as the numeraire for quality measurement — has a declining market share over the sample period. In Kimball demand, products with declining market shares are assigned lower elasticities (higher love-of-variety indices), amplifying the quality correction associated with import share gains. CES imposes a uniform elasticity, failing to capture this asymmetry. The paper shows that the key driver of the CES-Kimball gap in the import price index is CES underestimating the love-of-variety index of the base domestic product.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How is the identification approach validated in the automobile market?&lt;/strong&gt;
Using the Berry-Levinsohn-Pakes dataset extended by Grieco et al. (2024) for 1980–2018, the paper first verifies empirically that future product characteristics (horsepower, weight, fuel efficiency) are uncorrelated with current prices after controlling for current characteristics. It then compares DP estimates for both CES and Kimball demand against estimates obtained using real exchange rate (RER) variation as a cost-shock instrument, finding similar results in both cases. Finally, it compares Kimball and CES estimates against mixed logit (BLP) demand: Kimball closely matches BLP price elasticities and implied quality changes, while CES shows a downward heterogeneity bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the automobile market validation imply for the import price index methodology?&lt;/strong&gt;
Since Kimball demand matches the richer mixed logit demand in the auto setting — where product characteristics are observed — the validation provides evidence that Kimball demand serves as a good approximation to rich heterogeneous-elasticity models when characteristics are unavailable. The paper constructs price indices for the US auto industry based on mixed logit, mixed CES, Kimball, and standard CES, and shows that the Kimball index is closer to the mixed logit and mixed CES indices than is the standard CES index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper handle product entry and exit?&lt;/strong&gt;
Proposition 3 generalizes Proposition 1 to accommodate product entry and exit. The expression includes a variety correction analogous to Feenstra (1994) but generalized to non-CES settings via the mean love-of-variety index of entering and exiting products. In the CES special case this reduces exactly to the Feenstra (1994) correction. In the empirical application to US imports, entry and exit of country-of-origin varieties within industries is a relevant margin given the expansion of trading partners over the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the paper relate to Redding and Weinstein (2020a)?&lt;/strong&gt;
Redding and Weinstein (2020a) derive a price index formula under CES demand that accounts for taste shocks, applied to US retail scanner data where quality is constant at the barcode level. The present paper generalizes their CUPI formula beyond CES to general and HA demand systems, and extends their identification strategy to settings where demand changes partly reflect quality changes rather than pure taste shocks. The paper also shows that the CES assumption used in Redding-Weinstein may overstate the contribution of taste shocks to cost-of-living indices, since part of the expenditure share variation attributed to taste shocks under CES would be reassigned under heterogeneous-elasticity demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Does the paper address welfare implications beyond the import price index?&lt;/strong&gt;
The paper explicitly notes that the import price index does not capture the full consumption-side welfare effects of rising imports, since gains from lower import prices may be partly offset by substitution away from domestic varieties. The paper also notes that it abstracts from nonhomotheticity (income effects), pointing to Jaravel and Lashkari (2021) for that extension. The primary welfare-relevant quantity reported is the quality-adjusted change in the cost of the imported goods basket, which is the import price index in the conventional sense.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Love-of-variety index&lt;/strong&gt;: For a product i, defined as 1/(epsilon_it − 1) where epsilon_it is the product-level demand elasticity in an HA demand system. It measures the welfare value of having access to that variety and serves as the weight applied to expenditure share changes in the generalized price index formula (Proposition 2). In the CES special case all products share the same love-of-variety index 1/(sigma−1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homothetic with Aggregator (HA) demand&lt;/strong&gt;: A family of income-invariant (homothetic) demand systems — including Kimball (1995), CRESH (Hanoch, 1971), and HSA (Matsuyama and Ushchev, 2017) — in which preferences are represented by a utility function with a specific aggregator structure. The key property exploited in the paper is that cross-product elasticities of substitution sigma_ij depend only on product-level elasticities epsilon_i through simple analytic formulas, reducing the dimensionality of the estimation problem from an N×N matrix to N scalars.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogeneity bias (in elasticity estimation)&lt;/strong&gt;: Downward bias in estimated elasticities of substitution arising from a positive correlation between product quality shocks and prices. When higher-quality products command higher prices and also have higher marginal costs, conventional methods (FBW) that assume zero correlation between supply and demand shocks will attribute part of the price variation to supply, underestimating how much demand responds to price. The paper documents this bias as the gap between DP and FBW estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity bias (in elasticity estimation)&lt;/strong&gt;: Additional downward bias in CES elasticity estimates relative to the mean of Kimball elasticities, arising from CES imposing a single elasticity per industry when the true elasticities are heterogeneous across products. The bias is stronger for differentiated products and is theoretically traced to a positive covariance between demand elasticities and price volatility across products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic panel (DP) identification&lt;/strong&gt;: The paper&amp;rsquo;s proposed identification strategy, which exploits the Markov structure of quality shocks. The key moment condition is that quality shock innovations are mean-zero conditional on lagged prices, which holds under flexible pricing. Lagged prices (and higher-order lags and nonlinear transformations) serve as instruments for current prices, permitting identification of demand parameters without external cost instruments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quality shock (phi_it)&lt;/strong&gt;: An unobserved product characteristic that shifts demand for product i at time t, defined through the utility function as a scalar multiplying the quantity consumed. Quality is identified from residual demand — the component of demand not explained by price — following the approach of Khandelwal (2010) and Hallak and Schott (2011). The paper models quality shocks as following a stationary AR(1) process with product-specific means.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unified CES price index (CUPI)&lt;/strong&gt;: The price index formula of Redding and Weinstein (2020a) for CES demand, which decomposes the aggregate price change into a price component (expenditure-share-weighted price changes) and a quality/taste component proportional to (1/(sigma−1)) times expenditure share changes. The present paper&amp;rsquo;s Proposition 2 generalizes CUPI to HA demand by replacing the scalar 1/(sigma−1) with product-specific love-of-variety indices.&lt;/p&gt;</description></item><item><title>All Along the Watchtower: Military Landholders and Serfdom Consolidation in Early Modern Russia</title><link>https://macropaperwarehouse.com/papers/all-along-the-watchtower-military-landholders-and-serfdom-consolidation-in-early-modern-russia/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/all-along-the-watchtower-military-landholders-and-serfdom-consolidation-in-early-modern-russia/</guid><description>&lt;p&gt;This paper investigates the origins of serfdom in early modern Russia, arguing that the institution consolidated primarily through political economy dynamics between the crown and a landholding military class, rather than from economic fundamentals such as labor scarcity, land-labor ratios, or grain trade opportunities. The central argument is that the prolonged defense of Russia&amp;rsquo;s southern frontier against Crimean Tatar nomadic raids generated a class of military landholders who possessed both the coercive capacity and the political leverage to press the state into restricting peasant labor mobility.&lt;/p&gt;
&lt;p&gt;The mechanism runs as follows. The Russian state, lacking the fiscal capacity to pay soldiers directly, granted frontier lands along the Tula defense line to high-ranked soldiers in exchange for military service under the pomest&amp;rsquo;e system. These lands were selected for their defensive rather than agricultural value and sat on the forest-steppe boundary roughly 180 km south of Moscow. Since soldiers could not farm while on duty and could not compete in free labor markets given the area&amp;rsquo;s low agricultural attractiveness, the arrangement was only sustainable if peasants were bound to the land. Military landholders collectively petitioned the Tsar repeatedly — with petition volumes peaking during urban uprisings (9 petitions in 1648, 13 in 1682) when the government&amp;rsquo;s political vulnerability increased the military&amp;rsquo;s bargaining power — until serfdom was codified in the Law Code of 1649.&lt;/p&gt;
&lt;p&gt;The authors test this theory using newly digitized data from the 1678 household census, which records male population by six legally distinct peasant categories across 172 districts of Muscovy, combined with data on landholder estate counts and sizes. The primary empirical finding is that districts on the Tula defense line had approximately 40% of their population composed of serfs, compared to roughly 14% nationally — a difference of about 25 percentage points that survives the inclusion of geographic and climatic controls (grain suitability, temperature seasonality, precipitation, terrain ruggedness, river location, distance to Moscow, and regional fixed effects). Placebo tests confirm this pattern is specific to the most legally dependent peasant groups: the defense line is negatively associated with royal peasants and statistically insignificant for church peasants, free peasants, and non-Russian peasants.&lt;/p&gt;
&lt;p&gt;To address potential endogeneity of the defense line&amp;rsquo;s location, the authors construct an instrumental variable using a novel geospatial algorithm. The algorithm computes optimal nomadic invasion routes from Crimea to Moscow via topographic cost rasters (using flow accumulation values as proxies for river-crossing barriers), then intersects these routes with the historically stable forest-steppe boundary (identified through FAO/UNESCO soil types — Podzoluvisols versus Chernozems). Districts at this intersection were 70 percentage points more likely to host the actual defense line. Two-stage least squares estimates confirm and slightly exceed the OLS magnitudes, supporting the causal interpretation.&lt;/p&gt;
&lt;p&gt;The paper further tests two canonical alternative explanations and finds them insufficient. Domar&amp;rsquo;s (1970) labor-scarcity hypothesis predicts serfdom should be higher where population density is lower; the data show the opposite sign, contradicting this prediction. The Baltic grain trade hypothesis yields only a small, unstable positive interaction between river access to the Baltic and grain suitability, which disappears when the defense line variable is included. A horse race including all variables simultaneously shows the defense line coefficient at approximately 24 percentage points remains stable while alternative predictors become insignificant.&lt;/p&gt;
&lt;p&gt;Mechanism tests show that defense line districts had 3.2 more estates per 100 square kilometers than the national average of 2.3, with the excess concentrated in very small (up to 5 serf households) and small (6–25 households) estates — consistent with the state&amp;rsquo;s strategy of maximizing soldier count by allocating the minimum serf labor sufficient to sustain a cavalryman. A bigram similarity analysis of collective petitions versus the 1649 Law Code yields a correlation coefficient of 0.7 for the top twenty bigrams between a 1637 petition and Chapter 11 (restricting peasant mobility), with no comparable similarity to other chapters. Persistence is documented through 1719, 1795, and 1858 censuses: defense line districts maintained the highest serf concentration through to three years before emancipation in 1861.&lt;/p&gt;
&lt;p&gt;Q1: What is the paper&amp;rsquo;s central argument about the origins of Russian serfdom?
A: The paper argues that serfdom consolidated primarily due to political economy dynamics: the crown&amp;rsquo;s dependence on a landholding military class for frontier defense against steppe nomads gave that class sufficient political leverage to secure the legal restriction of peasant labor mobility. The military landholders&amp;rsquo; coercive capacity and proximity to their small estates made labor coercion a viable complement to their military function. This explanation dominates alternative accounts based on labor scarcity, grain trade, or soil quality in all specifications tested.&lt;/p&gt;
&lt;p&gt;Q2: What was the Tula defense line and why was it located where it was?
A: The Tula defense line (Great Abatis Line) was a chain of about 40 fort towns stretching over 500 km east-west, centered on Tula approximately 180 km south of Moscow, erected in the 1560s using felled trees, earth mounds, ditches, and watchtowers. Its location on the forest-steppe boundary was determined by two military-logistical constraints: it had to block the main nomadic invasion routes from Crimea, and it had to lie within the forest zone where timber was the cheapest construction material and which provided natural shelter. The paper documents that the defense line area did not differ from the rest of Muscovy in agricultural suitability, annual precipitation, seasonality, or terrain ruggedness — its distinctive feature was purely defensive.&lt;/p&gt;
&lt;p&gt;Q3: How large is the estimated effect of defense line proximity on serf concentration?
A: In the unconditional specification, defense line districts had a 30 percentage point higher share of serfs than the rest of the country. After adding geographic controls (grain suitability, seasonality, precipitation, terrain ruggedness, river dummy, distance to Moscow, and regional fixed effects), the coefficient stabilizes at approximately 25 percentage points. Given that serfs averaged about 14% of total population nationally but about 40% in defense line districts, the estimated effect is substantial relative to the baseline.&lt;/p&gt;
&lt;p&gt;Q4: How do the authors address endogeneity of the defense line location?
A: They construct an instrumental variable defined as the intersection of two variables: districts lying on the computed optimal nomadic invasion routes (covering 98 of 172 districts, or 57% of the sample), and districts on the forest-steppe soil boundary (38 districts, or 22% of the sample). Their interaction covers 23 districts and is the excluded instrument. In the first stage, this interaction term raises a district&amp;rsquo;s probability of hosting the actual defense line by 70 percentage points, while the linear terms become essentially zero once the interaction is included. The 2SLS second-stage estimates of the serf-share effect are slightly higher than OLS and statistically significant, confirming the direction and approximate magnitude of the OLS results.&lt;/p&gt;
&lt;p&gt;Q5: What does the paper find about Domar&amp;rsquo;s labor-scarcity hypothesis?
A: The paper finds no support for Domar&amp;rsquo;s (1970) prediction that serfdom should be more prevalent where labor is scarcer (lower population density). Controlling for grain suitability and geographic factors, population density enters with a positive and statistically significant coefficient at the 5% level — the opposite sign from what Domar&amp;rsquo;s theory predicts. When the defense line dummy is added, population density becomes insignificant while the defense line coefficient remains at approximately 25 percentage points, consistent with the baseline.&lt;/p&gt;
&lt;p&gt;Q6: What does the paper find about the Baltic grain trade hypothesis?
A: An exogenous measure of Baltic trade potential — a dummy for districts with river access to the Baltic, interacted with grain suitability — yields a small and marginally positive effect on serf share in Baltic districts with higher grain suitability. However, this effect disappears when the defense line dummy is included, and is also sensitive to alternative spatial clustering (becoming insignificant at the 300 km clustering radius even without the defense line dummy). The authors interpret this instability as inconsistent with grain trade being a primary driver of serfdom.&lt;/p&gt;
&lt;p&gt;Q7: What is the evidence for the estate-size mechanism?
A: Defense line districts had on average 3.2 more estates per 100 square kilometers than the national average of 2.3 per 100 square kilometers. Among estate-size brackets, very small (up to 5 serf households) and small (6–25 serf households) estates were disproportionately concentrated in defense line districts, while the location of medium-sized and large estates was statistically independent of the defense line. This pattern is consistent with the state&amp;rsquo;s strategy of allocating minimum viable serf endowments to maximize the number of soldiers supportable along the line.&lt;/p&gt;
&lt;p&gt;Q8: What is the textual evidence linking military petitions to the 1649 Law Code?
A: A bigram similarity analysis between a 1637 collective petition and Chapter 11 of the 1649 Law Code reveals a correlation coefficient of 0.7 for the top twenty bigrams. The five most common bigrams appear in both texts: &amp;ldquo;runaway peasants,&amp;rdquo; &amp;ldquo;commoner peasants,&amp;rdquo; &amp;ldquo;census books,&amp;rdquo; &amp;ldquo;search years,&amp;rdquo; and &amp;ldquo;tsar&amp;rsquo;s decree.&amp;rdquo; This correlation does not extend to other chapters of the Law Code that regulate non-peasant matters, establishing specificity of the legislative influence.&lt;/p&gt;
&lt;p&gt;Q9: How does the timing of collective petitions relate to political crises?
A: Over a corpus of 96 petitions between 1608 and 1698, landholders petitioned on average once per year, but activity spiked sharply during domestic uprisings: 9 petitions in 1648 (the &amp;ldquo;Salt Riot&amp;rdquo; urban uprising) and 13 petitions in 1682 (the musketeers&amp;rsquo; revolt). These peaks coincide with moments when the government&amp;rsquo;s political vulnerability increased the military&amp;rsquo;s bargaining power, and in both cases were followed by legislative concessions — the 1649 Law Code and new decrees in 1683–85 on harsher punishment for harboring runaways, respectively.&lt;/p&gt;
&lt;p&gt;Q10: What do the placebo tests show?
A: Regressions of non-serf peasant shares on the defense line dummy show that the defense line is negatively associated with royal peasants and statistically insignificant for church peasants, free peasants, and non-Russian peasants. A placebo test replacing military landholders with merchants and artisans shows no significant defense line effect on the latter group, while Moscow has an 11 percentage point higher merchant/artisan share. The specificity of the defense line effect to legally dependent peasants and military landholders supports the military-political mechanism rather than a generic frontier-area effect.&lt;/p&gt;
&lt;p&gt;Q11: How persistent was the spatial distribution of serfdom after 1649?
A: The authors estimate their baseline equation with serf share from the 1719, 1795, and 1858 censuses as dependent variables. Defense line districts maintained disproportionately higher serf densities in all three periods, including when the sample is restricted to the original Muscovite districts to exclude post-18th century territorial acquisitions. By 1858, three years before emancipation, the spatial distribution of serfs remained similar to that observed 200 years earlier at the time of serfdom&amp;rsquo;s consolidation — despite the defense line having been militarily obsolete for over a century.&lt;/p&gt;
&lt;p&gt;Q12: What explains the persistence of serfdom beyond its original military rationale?
A: The persistence reflects a mutually beneficial exchange between the crown and former military landholders. Landholders provided local state capacity — overseeing tax collection, administering military conscription, and adjudicating peasant disputes through estate courts — in lieu of a centralized bureaucracy. In return, the crown granted successive expansions of landholder rights: Peter I equalized military landholdings with hereditary estates in 1714, and Peter III in 1762 freed landholders from military service obligations while retaining their property rights over land and serfs. This fiscal-administrative dependency is also cited as a reason for the late timing and unfavorable-to-peasants terms of the 1861 emancipation reform.&lt;/p&gt;
&lt;p&gt;Q13: How does this paper&amp;rsquo;s explanation relate to Eastern/Western European institutional divergence?
A: The paper argues that while the military revolution in Western Europe generated fiscally capable centralized states with regular infantry armies, Russia&amp;rsquo;s peripheral nomadic threat prolonged the feudal cavalry model supported by land grants and serf labor. This delayed the formation of Weberian bureaucracy and entrenched what the authors term a &amp;ldquo;garrison state&amp;rdquo; — one whose institutions and social structure were shaped primarily by military-security considerations. The paper positions military factors alongside existing divergence explanations emphasizing land property rights, political institutions, demographic regimes, and Enlightenment ideas.&lt;/p&gt;
&lt;p&gt;Q14: What is the methodological contribution of the optimal invasion route algorithm?
A: The algorithm uses flow accumulation rasters (proportional to river width and basin size) as a cost function to compute the lowest-cost travel paths from Crimea to Moscow, iteratively penalizing cells within 15 km of each computed route and re-running the path search to generate four distinct routes per origin point (eight total, including routes from the Don River steppe). This produces a high-resolution, geographically continuous measure of military threat exposure that the authors argue provides statistical power in contexts where terrain ruggedness or simple distance measures lack variation — particularly relevant for flat plains with a single threat origin correlated with other variables.&lt;/p&gt;
&lt;p&gt;Pomest&amp;rsquo;e system: The institutional arrangement by which the Russian state granted frontier lands to high-ranked soldiers in exchange for military service, under the rule that &amp;ldquo;the land must not leave the service.&amp;rdquo; Unlike hereditary estates, pomest&amp;rsquo;e holdings were conditional on active service and could not be passed to heirs unless sons continued military service. This system enabled the formation of a permanent cavalry force despite the state&amp;rsquo;s low fiscal capacity, but required binding peasants to the land to make the arrangement viable for the soldier-landholders.&lt;/p&gt;
&lt;p&gt;Serfs (bobyli and dvorovye): In the paper&amp;rsquo;s 1678 census framework, serfs are defined as the two most legally dependent subgroups of private peasants — cotters (bobyli), who owned no property and worked full-time for their landlord in exchange for payment in kind, and servants (dvorovye), who performed household and support functions on the estate. These groups constituting about 14% of total population nationally were totally dependent on their landlord and could not retain the marginal product of any part of their labor. After the 1649 Law Code, villeins (krest&amp;rsquo;yane) gradually converged to this status as well.&lt;/p&gt;
&lt;p&gt;Collective petitions (chelobitnye): The primary institutional channel through which the military landholder class communicated collective interests and applied political pressure on the crown in 17th-century Muscovy. The paper documents 96 such petitions between 1608 and 1698, showing that their volume, timing (peaking during urban uprisings), and textual content (closely matching Chapter 11 of the 1649 Law Code) were the proximate mechanism by which landholders converted military leverage into legal codification of serfdom.&lt;/p&gt;
&lt;p&gt;Optimal defense line (instrumental variable): The paper&amp;rsquo;s constructed instrument, defined as the intersection of computed optimal nomadic invasion routes (based on topographic cost rasters approximating river-crossing barriers) and the forest-steppe soil boundary (Podzoluvisols/Chernozems boundary from the FAO/UNESCO Soil Map). This instrument captures the geographically and militarily determined placement of defensive fortifications, purging variation in actual defense line location that might reflect agricultural or economic value.&lt;/p&gt;
&lt;p&gt;Garrison state: Used by the authors (adapting Lasswell&amp;rsquo;s term) to describe a state whose institutions and social structure are shaped primarily by military security considerations. In the Russian context, this refers to the persistence of a feudal cavalry system, land-grant-based military compensation, and labor coercion that together delayed centralized state formation and Weberian bureaucracy relative to Western European states undergoing the military revolution toward regular infantry armies.&lt;/p&gt;
&lt;p&gt;Labor coercion complementarity: The paper&amp;rsquo;s mechanism whereby employers with high coercive capacity (proximity to weapons, military training) can deploy that same capacity to restrict workers&amp;rsquo; outside options and extract labor surplus. In the defense line context, soldiers&amp;rsquo; military skills and armament made them effective at preventing serf flight and enforcing labor obligations — creating a complementarity between military capacity and serfdom that was absent among merchants or church institutions with comparable landholdings elsewhere.&lt;/p&gt;</description></item><item><title>An endogenous gridpoint method for distributional dynamics</title><link>https://macropaperwarehouse.com/papers/an-endogenous-gridpoint-method-for-distributional-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-endogenous-gridpoint-method-for-distributional-dynamics/</guid><description>&lt;p&gt;This paper introduces the Distributional Endogenous Gridpoint Method (DEGM), a novel numerical technique for solving the distributional dynamics that arise in heterogeneous agent macroeconomic models. The core problem is how to efficiently update the distribution of agents over the state space as the economy evolves. The dominant existing approach — the &amp;ldquo;lottery method&amp;rdquo; of Young (2010) — discretizes the state space and represents policy functions as lotteries over nearby gridpoints, producing a transition matrix that is linear in optimal policies. This linearity renders the lottery method incapable of capturing nonlinear effects in distributional dynamics, a limitation that becomes quantitatively significant for higher-order perturbation solutions.&lt;/p&gt;
&lt;p&gt;DEGM extends Carroll&amp;rsquo;s (2006) endogenous gridpoint method from individual optimization to the distributional level. Rather than discretizing the density and integrating forward, DEGM works directly on the cumulative distribution function (CDF). The key insight is that when the policy function is monotone — as savings functions typically are — the endogenous gridpoints generated by the policy function trace out exact points on the post-policy CDF without requiring integration. Specifically, if A*_{i,j} = a*(A_i, Y_j) are optimal asset choices from grid point A_i at income Y_j, then the CDF values at those endogenous points are known analytically as F_t(A_i | Y_j). An interpolant using shape-preserving splines constructed through these points allows evaluation of the updated CDF at any point without integration. The income transition step is handled separately via standard quadrature over the discretized income process.&lt;/p&gt;
&lt;p&gt;The paper demonstrates DEGM&amp;rsquo;s performance with two applications. First, in the Aiyagari (1994) economy, DEGM converges to the stationary equilibrium an order of magnitude faster than the lottery method in terms of gridpoints. At nk=40 gridpoints, the lottery method deviates from the benchmark capital stock by 1.72% and the wealth Gini by 2.24% (for nh=5), while DEGM deviates by only 0.09% and 0.12% respectively. Both methods converge to the same solution as the number of gridpoints increases, but DEGM reaches this limit far faster.&lt;/p&gt;
&lt;p&gt;Second, the authors introduce a Krusell-Smith style model with aggregate investment risk (capital depreciation shocks calibrated following Barro, 2006, as a 0.4% quarterly probability of 7.5% capital destruction causing a 10% annual GDP drop) as a new baseline for studying aggregate nonlinearities with household heterogeneity. This model overcomes the near-linearity of aggregate capital dynamics in the original Krusell-Smith specification. Using a third-order perturbation solution with DEGM, aggregate investment risk lowers the capital stock by 5 to 11 basis points and increases wealth inequality by up to 11 basis points relative to the non-stochastic steady state, depending on idiosyncratic income risk calibration. The lottery method systematically mispredicts these effects: it always predicts a decrease in wealth inequality in the presence of investment risk, while DEGM predicts an increase. At third order, the lottery method predicts wealth Gini changes of +2.0 bp (persistent calibration) and -149.7 bp (transitory calibration), while DEGM predicts +10.7 bp and +2.1 bp respectively.&lt;/p&gt;
&lt;p&gt;The mechanism for increased inequality under investment risk is heterogeneous: for less wealthy households the substitution effect dominates (they reduce saving more in response to risky returns), while for wealthy households the income effect is stronger and precautionary saving motives dominate. The lottery method, by making the distributional transition matrix linear in policies, zeros out the second derivative of the transition matrix with respect to the policy function, missing the term capturing how the density at the pre-image of each asset level is affected nonlinearly. DEGM&amp;rsquo;s cubic spline interpolant captures all nonlinearities up to third order, enabling economically meaningful results that qualitatively differ from lottery-method predictions on wealth inequality.&lt;/p&gt;
&lt;p&gt;Q: What is the fundamental numerical problem that DEGM solves?
A: Evolving the distribution of agents forward over time in heterogeneous agent models requires evaluating a Kolmogorov forward equation, which naively demands numerical integration. The lottery method avoids integration by discretizing the state space and expressing transitions as a linear matrix operation, but this forces the distributional dynamics to be linear in optimal policies. DEGM avoids integration by exploiting policy function monotonicity: the endogenous policy gridpoints are the interpolation nodes, so the CDF update requires only interpolation, not integration. This preserves nonlinear effects up to the order of the splines used.&lt;/p&gt;
&lt;p&gt;Q: How does DEGM handle the borrowing constraint and the resulting mass point?
A: Savings policy functions are typically weakly monotone: constant at the borrowing constraint for sufficiently poor households, then strictly monotone above a threshold. DEGM accommodates this by starting the endogenous grid at the EGM solution corresponding to the borrowing constraint (the threshold a_j above which the policy is strictly monotone), restoring strict monotonicity on the relevant domain. The mass point at the borrowing constraint is captured by evaluating F_t(a_j, Y_j). Echoes of the borrowing constraint diminish as the number of income states increases, and in practice 10 income gridpoints are sufficient to smooth them.&lt;/p&gt;
&lt;p&gt;Q: How much faster does DEGM converge relative to the lottery method for the stationary equilibrium?
A: In the Aiyagari economy with nk=40 asset gridpoints, the lottery method&amp;rsquo;s capital stock deviates from the benchmark by 1.72% and the wealth Gini by 2.24% (nh=5), while DEGM deviates by only 0.09% and 0.12% respectively — roughly a 20-fold improvement in accuracy for the same gridpoints. At nk=80, the lottery method still shows 0.56%/0.78% deviations while DEGM shows 0.03%/0.00%. Although for a fixed number of gridpoints the lottery method is faster in wall-clock time (0.35s vs 0.82s at nk=40, nh=20), DEGM is faster for a given level of accuracy because it requires far fewer gridpoints.&lt;/p&gt;
&lt;p&gt;Q: Why does the lottery method fail at higher-order perturbations?
A: The lottery method constructs its transition matrix as a piecewise linear function of the optimal policy a*, so its second derivative with respect to a* is zero. As a result, it misses the second term in the second-order derivative of the end-of-period CDF: the term involving the derivative of the density at the pre-image of each asset level times the squared linear policy effect. This missing nonlinearity becomes quantitatively important at second and third order. DEGM&amp;rsquo;s cubic hermitian spline interpolant captures all nonlinearities up to third order, allowing it to correctly represent how the distribution responds nonlinearly to aggregate shocks.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the effect of aggregate investment risk on the capital stock and wealth inequality?
A: Using a third-order perturbation solution with DEGM, aggregate investment risk lowers the capital stock by 5 to 11 basis points from the non-stochastic steady state, depending on whether income risk is persistent or transitory (DEGM third-order: -4.7 bp persistent, -11.4 bp transitory). Wealth inequality increases by up to 11 basis points (DEGM third-order: +10.7 bp persistent, +2.1 bp transitory). The lottery method diverges dramatically at third order, predicting Gini changes of +2.0 bp and -149.7 bp for the persistent and transitory calibrations respectively, compared to DEGM&amp;rsquo;s +10.7 bp and +2.1 bp.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism through which aggregate investment risk increases wealth inequality?
A: The mechanism operates through heterogeneous saving responses across the wealth distribution. For less wealthy households, capital income is a small share of total income, so the substitution effect of risky returns dominates: higher investment risk reduces their incentive to save. For wealthy households, capital income is central, so the income effect is stronger and precautionary saving motives intensify. A capital depreciation shock upon realization compresses the wealth distribution, but the risk of such a shock increases inequality on average because it disproportionately reduces saving among poorer households.&lt;/p&gt;
&lt;p&gt;Q: How do the authors extend DEGM to handle aggregate risk and higher-order perturbations?
A: The authors follow Reiter (2009) in including the distribution and value functions in the state space, defining a nonlinear difference equation over these objects. Higher-order perturbation of this system proceeds using the algorithms of Andreasen et al. (2018) and Levintal (2017), with second-order terms solved via a generalized Sylvester equation using Kim et al.&amp;rsquo;s (2008) doubling algorithm. The implementation handles up to 3,200 variables at second order and 220 variables at third order. For the second-order solution, the Bayer-Luetticke (2020) state-space reduction and its refinement in Bayer et al. (2024) yield results identical to the full unreduced system.&lt;/p&gt;
&lt;p&gt;Q: What is the state-space reduction procedure and how much does it compress the system?
A: The full system uses 402 states and 412 controls (persistent calibration). A copula representation of the distribution reduces this to 213 states and 412 controls; adding DCT compression of the value function gives 213 states and 98 controls; further adding a factor representation from the first-order solution yields 111 states and 98 controls — a 75% reduction. The R-squared-like IRF statistic remains 1.00 across all reductions, and ergodic moments are identical (capital: 25.54, Gini: 0.61 for the persistent calibration).&lt;/p&gt;
&lt;p&gt;Q: Does DEGM produce different first-order impulse responses than the lottery method?
A: For first-order perturbations, DEGM and the lottery method converge to the same solution as the number of gridpoints increases, but DEGM converges faster. For the first-order dynamics of the wealth distribution (wealth Gini IRFs), DEGM reaches convergence with nk=40 gridpoints while the lottery method requires nk=160. For aggregate capital stock IRFs, both methods converge quickly at first order. Quantitative differences become significant only at second and higher orders.&lt;/p&gt;
&lt;p&gt;Q: What calibration is used for the investment risk model?
A: Capital depreciation deviates from its steady-state value by a shock with second moment sigma_delta = 0.005 and third moment tau_delta = 0.012. This corresponds to a 0.4% quarterly probability that a disaster destroys 7.5% of the capital stock and causes a 10% drop in annual GDP, consistent with the evidence in Barro (2006). The model is solved under both a persistent income calibration (beta=0.98, rho=0.98, sigma_epsilon=0.14, implied Gini=0.66) and a transitory income calibration (beta=0.99, rho=0.88, sigma_epsilon=0.18, implied Gini=0.42).&lt;/p&gt;
&lt;p&gt;Distributional Endogenous Gridpoint Method (DEGM): A numerical method for evolving the joint CDF of agents over the state space by constructing an interpolant at endogenous gridpoints A*_{i,j} = a*(A_i, Y_j) — the optimal policy values — at which CDF values are known analytically as F_t(A_i | Y_j), thus updating the distribution through interpolation rather than integration and preserving nonlinearities up to the order of the spline.&lt;/p&gt;
&lt;p&gt;Lottery Method (LM): Young&amp;rsquo;s (2010) standard technique that replaces the continuous distribution with a discrete counterpart and represents optimal policy functions as probability weights over nearby gridpoints, yielding a single transition matrix A* such that f_{t+1} = f_t * A*. The transition matrix is linear in optimal policies, which zeroes out the second derivative of the distributional dynamics with respect to policies and causes systematic misprediction of distributional dynamics under higher-order perturbation.&lt;/p&gt;
&lt;p&gt;Kolmogorov Forward Equation (Distributional Dynamics): The law of motion for the joint CDF F_t(a, y) describing how the distribution of households over assets and income evolves given optimal policies and the income transition process. In DEGM, this equation is split into a sub-period for asset choices (where endogenous gridpoints allow integration-free updating) and a sub-period for income transitions (handled by quadrature over the discretized income process).&lt;/p&gt;
&lt;p&gt;Higher-Order Perturbation Solution: A Taylor expansion of the model&amp;rsquo;s nonlinear equilibrium conditions around the non-stochastic steady state beyond first order. Second-order solutions capture precautionary motives and mean deviations from the steady state; third-order solutions additionally capture asymmetric effects of shocks, requiring DEGM&amp;rsquo;s nonlinear distributional representation to produce accurate results.&lt;/p&gt;
&lt;p&gt;Aggregate Investment Risk (Capital Depreciation Shocks): Shocks to the aggregate capital depreciation rate calibrated following Barro (2006) as a 0.4% quarterly probability of a disaster that destroys 7.5% of the capital stock and causes a 10% annual GDP drop. Proposed as a replacement for near-linear Krusell-Smith aggregate productivity shocks to generate genuine nonlinearities in aggregate capital dynamics while remaining equally parsimonious.&lt;/p&gt;
&lt;p&gt;State-Space Reduction: A sequence of compression techniques — copula representation of the wealth distribution, discrete cosine transform (DCT) compression of the value function, and factor representation from the first-order solution — that reduce the Reiter (2009) system from 402 states and 412 controls to 111 states and 98 controls (a 75% reduction) with no measurable loss of accuracy in impulse responses or ergodic moments.&lt;/p&gt;
&lt;p&gt;Shape-Preserving Interpolation: Interpolation methods (linear spline or piecewise cubic hermitian splines) that maintain the monotonicity of the CDF when constructing the interpolant from endogenous gridpoints. Cubic hermitian splines additionally preserve differentiability, making the distributional dynamics smooth enough for third-order perturbation and capturing all nonlinear effects that the lottery method misses.&lt;/p&gt;</description></item><item><title>An Equilibrium Analysis of the Effects of Neighborhood-Based Interventions on Children</title><link>https://macropaperwarehouse.com/papers/an-equilibrium-analysis-of-the-effects-of-neighborhood-based-interventions-on-children/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/an-equilibrium-analysis-of-the-effects-of-neighborhood-based-interventions-on-children/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How should governments design neighborhood-based policies to improve long-run outcomes for children, once one accounts for general equilibrium (GE) forces—endogenous rents, neighborhood quality, wages, and distortionary taxation—that small-scale experimental studies cannot identify?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper embeds neighborhood effects into a quantitative, heterogeneous-agent overlapping-generations (OLG) model with endogenous location choice and child skill development. The economy has three building blocks: (1) a dynastic life-cycle structure in which parents choose a neighborhood (from two options: a disadvantaged n=1 and an advantaged n=2) and allocate time to child development, with child skills produced by a nested CES aggregator combining parental time and neighborhood quality (proxied by per-capita income in the tract); (2) a GE Aiyagari incomplete-markets framework with endogenous labor supply, wage uncertainty, and progressive labor taxation; and (3) a government that finances housing vouchers or place-based wage subsidies by adjusting the labor income tax parameter, with all additional net expenses fully offset by tax revenue. Housing supply is upward-sloping (elasticity 1.75, from Saiz 2010), so rents are endogenous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and calibration.&lt;/strong&gt; The model is estimated by simulated method of moments to match U.S. data from the 2000s, drawing on the PSID, NLSY, ATUS, the 2012–2016 ACS, and the Opportunity Atlas (Chetty et al. 2018). Neighborhoods are mapped to Census tracts divided into bottom-10-percent and top-90-percent median household income groups within each commuting zone. Key targeted moments include the income gap between neighborhoods (108 percent higher mean individual income in n=2), the 30 percent higher incomes for children from low-income families raised in the better neighborhood, and a 32 percent gap in weekly parental time with children across neighborhoods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Validation.&lt;/strong&gt; Before policy counterfactuals, the calibrated model is validated against two bodies of reduced-form evidence. First, a simulated small-scale, single-generation, partial-equilibrium voucher experiment generates 23 percent higher income for children—close to the 31 percent MTO experimental estimate from Chetty et al. (2016), with the difference largely explained by a smaller poverty-rate contrast (18 vs. 22 percentage points) in the simulation. Second, a simulated 20 percent place-based wage subsidy generates 17–21 percent earnings gains for adult residents of n=1, consistent with Busso et al.&amp;rsquo;s (2013) quasi-experimental EZ estimates of 17–24 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — housing vouchers.&lt;/strong&gt; The welfare-maximizing voucher program features a 100 percent subsidy rate, targets households with children and wages below the 80th percentile (fourth quintile), and is financed by progressive labor taxes. In the long-run steady state this policy raises 12.5 percent more children in the advantaged neighborhood, increases labor productivity by 1.1 percent, reduces income inequality (variance of log after-tax lifetime earnings) by 6.3 percent—comparable in magnitude to the Sweden–U.S. after-tax inequality gap—and raises upward mobility by 27.7 percent (roughly half its standard deviation across U.S. Census tracts). The average marginal tax rate must increase by 15.7 percent to fund the program. Despite this, long-run welfare rises by 3.4 percent in consumption equivalence units. A decomposition shows that intergenerational dynamics add 11.5 percentage points to welfare (relative to a short-run, single-generation scenario), while taxation subtracts 10.2 percentage points, and rent plus neighborhood-quality effects together subtract only 1.4 percentage points—leaving the net long-run GE gain similar to the short-run partial-equilibrium gain of 3.5 percent. Crucially, non-targeting children generates welfare losses of 5.0 percent, confirming that restriction to households with children is essential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — place-based wage subsidies.&lt;/strong&gt; A 12 percent wage subsidy to workers in the disadvantaged neighborhood yields the highest steady-state welfare gain of 0.7 percent. This is approximately one-fifth of the gain achievable with the optimal voucher. The subsidy induces substantial resorting toward n=1, reducing the share of children in n=2 by 6.7 percent while raising neighborhood quality in n=1 by 19.7 percent. Income inequality falls by 8.7 percent and upward mobility rises by 20.4 percent. However, in a short-run partial-equilibrium setup, the wage subsidy has a negative welfare effect of −1.0 percent because it draws parents (and their children) into the disadvantaged area; the positive net effect only emerges through long-run intergenerational channels (+2.5 percentage points) and equilibrium neighborhood-quality adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political economy.&lt;/strong&gt; Because voucher gains are concentrated among young cohorts (those aged 16–43 at introduction), only 33 percent of incumbent adults would rationally vote for the housing voucher program. In contrast, the place-based wage subsidy provides positive average welfare gains for all age cohorts alive at introduction, yielding estimated majority support from over 63 percent of adults. This creates a fundamental political economy tradeoff: the policy with the larger long-run social gains lacks majority democratic support, while the policy with broader support delivers smaller long-run gains.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two market frictions that justify government intervention in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The first friction is the absence of intergenerational borrowing markets: parents cannot borrow against their child&amp;rsquo;s future income, which limits the parent&amp;rsquo;s willingness to pay the higher rent in n=2 to give their child a developmental advantage. Housing vouchers act as a tax-financed substitute for this missing contract by paying the rent premium and recovering the cost through taxes on the high-earning adults the children become. The second friction is a neighborhood externality: individuals do not internalize the effect of their own income on the neighborhood quality experienced by neighbors&amp;rsquo; children. Place-based wage subsidies partially correct this externality by subsidizing work in the disadvantaged area, raising local income per capita and thereby improving the neighborhood quality index for all children resident there.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How is neighborhood quality defined and modeled, and why is this specification chosen?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Neighborhood quality sn is defined as total income per capita (the sum of labor and capital income) for all residents of neighborhood n, including non-workers. This specification is intended to capture multiple mechanisms: school quality (which depends on local tax bases), role-model effects from productive adults, and social organization effects through adult supervision of children. The formulation includes retired and non-working residents, which means the arrival of children mechanically reduces neighborhood quality per capita in the model, partially capturing a crowding channel. Formally, the neighborhood spillover function takes the power form f(sn) = A * sn^ζ, where ζ governs the elasticity of child development to neighborhood quality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the paper validate the model&amp;rsquo;s key mechanism — the neighborhood effect on children?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: The validation mimics the MTO RCT within the calibrated model: the government provides a 100 percent rent voucher usable only in n=2 to households in n=1 with incomes below the 10th percentile, holding prices and neighborhood qualities fixed (as in a small-scale experiment). The model generates 25 percent voucher take-up and a 23 percent increase in children&amp;rsquo;s income in their late 20s. This compares to the experimental MTO estimate of approximately 31 percent. The paper attributes most of the gap to the smaller poverty-rate contrast in the simulation (18 percentage points) relative to MTO (22 percentage points), and shows that plotting the simulated result against the site-specific MTO estimates in a scatterplot of child income gains against neighborhood poverty reductions places the model prediction on the fitted line through the experimental data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the quantitative role of long-run intergenerational dynamics in the voucher program, relative to other GE channels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The decomposition in Table 5 isolates four GE channels. Starting from a short-run partial-equilibrium welfare gain of 3.5 percent (for the children of a single treated generation), allowing the economy to operate for the long run while holding prices and taxes fixed raises welfare to 15.0 percent — an increase of 11.5 percentage points — because improved skills in one generation create higher-skilled, higher-income parents who invest more in the next generation. Introducing housing market price adjustments (rents rise by 3.9 percent in n=2) reduces welfare by only 0.6 percentage points. Allowing neighborhood quality to adjust (quality in n=2 falls by 4 percent as lower-income families move in) reduces welfare by an additional 0.8 percentage points. Adding full taxation to balance the government budget reduces welfare by 10.2 percentage points, from 13.6 to 3.4 percent. The four channels nearly cancel, leaving the long-run GE steady-state gain close to the short-run single-generation gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why does the optimal voucher program require targeting to families with children, and what happens without this restriction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: When the voucher is extended to all households regardless of children (Column 6 of Table 4), nearly 82.6 percent of the population receives a subsidy, pushing almost everyone to n=2. Rents in n=2 rise by 5.3 percent. To finance this much broader program, the average marginal tax rate must increase by 44 percent, far exceeding the 15.7 percent required for the children-targeted program. The large tax increase suppresses labor supply and income, which reduces neighborhood quality in n=2 by 11.6 percent. The net effect is a welfare loss of 5.0 percent. The intuition is that the benefit of the voucher program flows primarily through child skill development, so subsidizing adults without children is fiscally expensive without producing the intergenerational gains that justify the cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What drives the difference in long-run welfare gains between vouchers (3.4 percent) and place-based wage subsidies (0.7 percent)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The primary channel is labor productivity. The optimal voucher program raises labor productivity by 1.1 percent by increasing the average neighborhood quality to which children are exposed by 1.2 percent. The wage subsidy raises productivity by only 0.2 percent because it induces resorting toward the disadvantaged neighborhood, meaning children&amp;rsquo;s average neighborhood quality actually decreases by 0.2 percent despite large improvements in n=1&amp;rsquo;s quality (up 19.7 percent), since fewer children reside in n=1 after the subsidy draws their parents there. Inequality reduction is not the source of the gap: the wage subsidy actually reduces inequality more (8.7–8.9 percent) than the voucher (6.3 percent), but this inequality effect does not translate into larger aggregate welfare because productivity effects dominate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the wage subsidy produce positive long-run welfare when it generates negative welfare in the short run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: In the short run, the wage subsidy draws parents into the disadvantaged neighborhood to exploit higher wages, which reduces the share of children in the advantaged neighborhood n=2 and lowers children&amp;rsquo;s late-life productivity (welfare of −1.0 percent for treated children in the single-generation scenario). Two long-run channels flip the sign. First, the subsidy is permanent, so children themselves receive it as adults, providing a direct wage income benefit. Second, the sustained presence of higher-income workers in n=1 raises neighborhood quality there durably (by 19.7 percent at the steady state), which benefits the children who reside in n=1. Together these intergenerational effects add 2.5 percentage points to welfare, while taxation costs reduce it by only 1.4 percentage points, yielding a net gain of 0.7 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What determines the political economy divide between the two policies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: For the housing voucher, welfare gains are concentrated among younger incumbent adults (ages 16–43), particularly those who are about to have or already have children, while older adults tend to lose because they face higher taxes without benefiting from improved neighborhood quality for their (now independent) children. This concentration implies only 33 percent of incumbent adults would support the voucher under the model&amp;rsquo;s welfare metric. For the place-based wage subsidy, average welfare gains are positive for every age cohort alive at introduction (though larger for younger cohorts), because the wage subsidy raises incomes for workers in n=1 immediately and benefits from equilibrium rent declines in n=1 that allow all residents to benefit. Over 63 percent of adults would support the wage subsidy. The paper notes that if the government could borrow to initially finance the voucher program and pay for it later (as in Daruich 2020 for early childhood programs), majority support for the voucher could potentially be achieved.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How sensitive are the welfare results to the key calibrated parameters?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: The sensitivity analysis (Table 9, following Andrews et al. 2017) shows that individual parameters would need to change substantially to overturn the conclusion that vouchers generate larger steady-state welfare gains than wage subsidies. For example, the altruism parameter β̃ would need to increase by 22 percent to eliminate the voucher welfare gain, which would require average parental transfers to rise to 198 percent of income — far from the empirical target of 125.4 percent. Using the more conservative tract-level housing supply elasticity from Baum-Snow and Han (2021) of 0.3–0.4 (about 80 percent below the baseline Saiz 2010 estimate of 1.75) would reduce the voucher welfare gain from 3.37 to approximately 2.57 percent, not reversing the qualitative conclusion. The parameters with the largest influence on welfare gains are the labor disutility parameter µ and the altruism parameter β̃; the housing supply elasticity matters more for the voucher than the wage subsidy because easier housing supply accommodates growth in n=2 without displacement under the voucher.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the transition path of the voucher program look like, and why do welfare gains initially dip before recovering?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: When the voucher is unexpectedly introduced, the first newborn cohort gains approximately 4 percent welfare, but gains for subsequent cohorts initially dip to around 3 percent before stabilizing at 3.4 percent by the 20th post-introduction cohort. The dip occurs because moving costs slow resorting: immediately after introduction, rents in n=2 begin rising and neighborhood quality there begins falling as low-income families move in, but the capital stock adjustment (which would counteract these effects by raising GDP) lags the resorting. The rebound comes as capital accumulates in n=2 over time and as intergenerational productivity gains build through successive cohorts of better-skilled parents. Labor productivity jumps noticeably for the first cohort born to parents who received the voucher (approximately 28 years after introduction) and again for the first cohort born to grandparents who received it, visibly demonstrating the intergenerational mechanism. In contrast, the wage subsidy&amp;rsquo;s welfare gains are approximately constant at 0.7 percent across all cohorts because the key channels (neighborhood quality improvement in n=1 and wage gains) materialize rapidly and remain stable throughout the transition.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Neighborhood quality (sn):&lt;/strong&gt; In this paper, neighborhood quality is not school quality or amenities in a generic sense but is explicitly defined as total income per capita — the sum of labor income and capital income — for all residents of neighborhood n, including non-workers. This endogenous measure rises when higher-income or more productive residents move in and falls when lower-income residents or additional children arrive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intergenerational borrowing constraint:&lt;/strong&gt; The inability of parents to borrow against their child&amp;rsquo;s future income, modeled as a non-negativity constraint on the monetary transfer from parent to child (transfer ≥ 0). This is the paper&amp;rsquo;s first key market friction: without it, a poor parent who moved to a better neighborhood would smooth consumption across generations by having the high-earning child compensate the parent. The constraint prevents this, reducing parental investment below the socially efficient level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalence (veil of ignorance):&lt;/strong&gt; The welfare metric used throughout the policy analysis. It is defined as the percentage change in consumption that would make a newborn individual indifferent between the pre-policy and post-policy steady states, computed before knowing their position in the skill or income distribution. This is the paper&amp;rsquo;s measure of long-run steady-state welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Parental investment aggregator (CES):&lt;/strong&gt; A nested constant-elasticity-of-substitution function that determines how parental time τ and neighborhood quality sn combine to form the effective investment input I into child skill development: I = Ā[αI f(sn)^γ + (1 − αI)τ^γ]^(1/γ). The elasticity parameter 1/(1 − γ), estimated at 0.41, governs the degree of complementarity between time and neighborhood quality; a lower elasticity (γ = −1.43) implies the two inputs are complements, so parents with children in better neighborhoods also spend more time with them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Place-based wage subsidy:&lt;/strong&gt; A neighborhood-specific wage premium (denoted w̃s) paid to all workers who both live and work in the disadvantaged neighborhood n=1, raising their effective wage to w1 = (1 + w̃s)w2. This policy targets the neighborhood externality by increasing the income of residents in n=1, which raises neighborhood quality and provides an incentive for higher-skilled workers to relocate to (or remain in) the disadvantaged area.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Upward mobility:&lt;/strong&gt; Measured in this paper as the probability that a child born to parents in the bottom 20 percent of the income distribution reaches the top 20 percent of the income distribution during the working stage of their own life. This is distinct from mean income rank measures; it specifically tracks cross-quintile transitions in the model&amp;rsquo;s stationary distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equilibrium decomposition:&lt;/strong&gt; A simulation-based method in which GE channels are progressively activated. Starting from a short-run, partial-equilibrium, single-generation baseline (analogous to an RCT), the authors sequentially allow: (i) long-run intergenerational dynamics while holding prices fixed; (ii) housing market price adjustments; (iii) neighborhood quality adjustments; (iv) tax and production-price adjustments. Each step&amp;rsquo;s change in outcomes identifies the quantitative contribution of that specific channel.&lt;/p&gt;</description></item><item><title>Anatomy of the Phillips Curve: Micro Evidence and Macro Implications</title><link>https://macropaperwarehouse.com/papers/anatomy-of-the-phillips-curve-micro-evidence-and-macro-implications/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/anatomy-of-the-phillips-curve-micro-evidence-and-macro-implications/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper addresses a fundamental puzzle in macroeconomics: why do estimates of the New Keynesian Phillips curve (NKPC) slope differ sharply depending on whether real marginal cost or the output gap is used as the real activity variable? The conventional, output gap-based NKPC yields very flat slope estimates (e.g., 0.006 to 0.024 in Hazell et al. 2022 and Rotemberg and Woodford 1997), which has led to the widespread view that the Phillips curve is &amp;ldquo;flat,&amp;rdquo; at least during the pre-pandemic period. The authors argue that this view conflates two distinct structural relationships: the elasticity of inflation with respect to real marginal cost, and the elasticity of marginal cost with respect to the output gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors assemble a unique quarterly micro-level dataset covering 4,598 manufacturing firms in Belgium over 84 quarters (1999:Q1–2019:Q4), totaling 132,915 observations. The dataset combines product-level domestic prices and quantities from the PRODCOM administrative database, customs data on foreign competitors&amp;rsquo; prices, and firms&amp;rsquo; variable production costs (labor costs from social security declarations plus intermediate input costs from VAT declarations). Intermediate inputs account for approximately 75 percent of total variable costs on average and are the most volatile cost component (within-firm coefficient of variation 1.77, versus 0.77 for labor costs).&lt;/p&gt;
&lt;p&gt;Their estimation strategy follows a &amp;ldquo;bottom-up&amp;rdquo; approach. Starting from a theoretical framework with heterogeneous firms subject to Calvo (1983) nominal rigidities and strategic complementarities in price setting (imperfect competition including dynamic oligopoly and Kimball demand), they derive a forward-looking dynamic pass-through regression linking a firm&amp;rsquo;s current price to discounted present values of its own marginal costs and competitors&amp;rsquo; prices, plus a lagged price level that serves as an error-correction term. This is Model A; robustness variants include Model B (absorbing competitor prices via industry-by-time fixed effects), Model C (imposing an AR(1) process for marginal cost), and Model A-U (unrestricted lagged-price coefficient).&lt;/p&gt;
&lt;p&gt;The structural parameters governing the NKPC slope — the degree of nominal rigidity (θ) and the strength of strategic complementarities (Ω) — are estimated jointly via GMM. Instruments for marginal cost are four-quarter-lagged firm-level total factor productivity (TFPQ), and instruments for competitors&amp;rsquo; prices exploit variation in EU-area export prices to third-country destinations and bilateral exchange rates between non-EU competitor currencies and the Euro. Sector-by-time fixed effects and firm fixed effects absorb confounding trends, shifting trend inflation, and permanent markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline estimate (Model A) yields θ = 0.711 (SE 0.014), implying that prices remain fixed for approximately three to four quarters on average, consistent with Nakamura and Steinsson (2008) Belgian PPI data (0.72). The strategic complementarity parameter is Ω = 0.570 (SE 0.059), indicating that competitor price dynamics reduce the pass-through of own marginal cost shocks by approximately half relative to the no-complementarities benchmark.&lt;/p&gt;
&lt;p&gt;These structural estimates imply a slope of the marginal cost-based NKPC of λ = 0.052 (SE 0.007), tightly estimated and robust across specifications: λ = 0.077 in Model B, λ = 0.069 in Model C, and λ = 0.056 in the unrestricted Model A-U. This slope is two to ten times larger than existing estimates of the conventional output gap-based NKPC slope (κ ≈ 0.024, Rotemberg and Woodford 1997; κ ≈ 0.006, Hazell et al. 2022).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reconciling the High Cost-Based Slope with the Flat Output-Based Slope&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper shows that the output-based slope κ equals the product of the cost-based slope λ and the output elasticity of marginal cost σ_y: κ = λ · σ_y. Using Bartik-style instruments based on high-frequency ECB monetary policy surprises interacted with industry-level sensitivities, the authors estimate σ_y using two models. Model D yields σ_y = 0.406 and κ = 0.021; Model E (directly regressing changes in marginal cost on changes in output) yields σ_y = 0.112 and κ = 0.006. These estimates are consistent with, and overlap with, Rotemberg and Woodford (1997) and Hazell et al. (2022) during the pre-pandemic sample period. The low elasticity of marginal cost to output is attributed to near-constant short-run returns to scale at the firm level and wage rigidity that mutes general equilibrium effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Inflation Dynamics&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Feeding an aggregate marginal cost index (constructed as a Törnqvist-weighted average of firm-level marginal costs) into the model-implied inflation expression produces a series that tracks Belgian manufacturing PPI inflation well: marginal cost fluctuations alone account for approximately 70 percent of inflation variation (R² = 0.68, correlation 0.8), without appealing to unobservable cost-push shocks or inflation lags.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Validation via Supply Shocks&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A validation exercise using identified oil shocks (Känzig 2021 — measured as unexpected OPEC-day movements in oil futures prices) confirms the model. A one-standard-deviation shock to oil prices (a 15.7 percent increase in Brent crude) raises firms&amp;rsquo; real marginal costs by approximately 1.5 to 3 percent within the first three quarters, before reverting. The price response peaks at approximately 3 percent after six quarters, consistent with nominal rigidities generating a delayed but persistent response. Impulse-response matching yields λ_IRF = 0.042 (SE 0.005), within the confidence bands of the micro-level estimate λ = 0.052, validating the bottom-up approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All estimates are drawn from Belgian manufacturing firms over 1999–2019, a period of moderate inflation during which Calvo pricing provides a good approximation of firm behavior. The authors note that the elasticity of marginal cost to output may be time-varying and nonlinear, and that during large aggregate shocks (such as the post-pandemic inflation surge), both the frequency of price adjustment and the sensitivity of marginal cost to output can rise substantially, requiring state-dependent pricing models (addressed in a companion paper, Gagliardone et al. 2025).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the primitive formulation of the NKPC, and how does it differ from the conventional formulation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The primitive NKPC features real marginal cost (in log-deviation from its steady state) as the real activity variable: π_t = λ·mc_t + β·E_t{π_{t+1}} + u_t, where λ is the slope depending on nominal rigidities and strategic complementarities. The conventional formulation uses the output gap (or unemployment gap) as a proxy for marginal cost, which is valid only under specific conditions including perfectly flexible wages. When those conditions fail, the output gap is a poor proxy for marginal cost, typically leading to downward bias in slope estimates. Even when a proportionality holds, the output-based slope κ equals λ multiplied by σ_y (the output elasticity of marginal cost), so the two slopes carry different economic content.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What structural parameters govern the slope of the cost-based NKPC, and what is the formula?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: The slope is λ = &lt;a href="1%e2%88%92%ce%a9"&gt;(1−θ)(1−βθ)/θ&lt;/a&gt;, where θ is the Calvo probability of price non-adjustment (capturing nominal rigidity) and Ω = Γ/(1+Γ) is the strategic complementarities parameter derived from the markup elasticity Γ with respect to relative prices. High nominal rigidity (high θ) flattens the slope by making individual price adjustments less frequent; strong strategic complementarities (high Ω) flatten it further because firms mute their price response to marginal cost in order to avoid deviating from competitors. The discount factor β is calibrated at 0.99 for quarterly data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the dynamic pass-through regression differ from the static (long-run) pass-through regressions used in prior literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: The dynamic pass-through regression (Model A) includes the firm&amp;rsquo;s lagged price as a regressor, which functions as an error-correction term controlling for persistent deviations between the price and the optimal reset price. Failing to include this term with quarterly data leads to omitted variable bias of magnitude −θ·Var(Δp_ft), since the cointegration error is autocorrelated with coefficient θ. Static pass-through regressions (as in Amiti, Itskhoki and Konings 2019 using annual data) are appropriate only when nominal rigidities can be ignored (θ ≈ 0); with quarterly data and θ ≈ 0.711, the orthogonality condition of the static model fails and the dynamic framework is necessary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the baseline estimates of the structural parameters, and how robust are they?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The baseline Model A yields θ = 0.711 (SE 0.014) and Ω = 0.570 (SE 0.059), implying prices fixed for approximately three to four quarters and competitor-price influence roughly equal to own marginal cost influence. The implied NKPC slope is λ = 0.052 (SE 0.007). Robustness checks across six specifications (Models B, C, A-U, variable SR-RTS controls, Translog TFPQ, eight-quarter-lagged instrument) yield λ in the range 0.044 to 0.077, with all estimates statistically significant and within each other&amp;rsquo;s confidence bands. The unrestricted model (A-U) cannot reject the restriction Ϛ = θ on the lagged-price coefficient (p-value 0.90).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the short-run elasticity of a firm&amp;rsquo;s own price to a permanent marginal cost shock, and how do nominal rigidities and strategic complementarities each contribute?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The short-run pass-through elasticity is (1−Ω)(1−θ) ≈ (1−0.570)(1−0.711) ≈ 0.125. This is substantially below one because both forces dampen price adjustment: nominal rigidity (1−θ ≈ 0.289) means most firms cannot adjust in any given quarter, and strategic complementarities (1−Ω ≈ 0.430) mean that adjusting firms reduce their pass-through to avoid deviating from competitors&amp;rsquo; prices. Without strategic complementarities (Ω = 0), the elasticity would be roughly 0.289; without nominal rigidities (θ = 0), it would be roughly 0.430; both together produce the observed 0.125.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is marginal cost measured in the data, and why is the inclusion of intermediate input costs important?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: Marginal cost is proxied by average variable cost per unit of output: the log-nominal marginal cost equals ln(TVC_ft/Y_ft) + ln(1+ν_ft), where TVC is the sum of intermediate input costs (from VAT declarations) and labor costs (wage bill from social security declarations), and Y_ft is a quantity index. Intermediate inputs account for approximately 75 percent of total variable costs on average and are the most volatile component (within-firm coefficient of variation 1.77 vs 0.77 for labor). The authors note that DSGE models typically feature only labor as a variable input, but accounting for intermediates is pivotal because intermediate goods price shocks were among the most important drivers of the post-pandemic inflation surge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What instruments are used for marginal cost and competitors&amp;rsquo; prices, and what are the identifying assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: The instrument for marginal cost is the four-quarter lagged firm-level TFPQ (physical total factor productivity), estimated as the residual from a gross-output production function. Its relevance depends on TFP persistence (confirmed); the exclusion restriction requires that persistent TFP variation is orthogonal to current and future demand shocks after removing permanent demand components (via firm fixed effects) and industry trends (via sector-by-time fixed effects). Two instruments for competitors&amp;rsquo; prices exploit international trade variation: (i) sales-weighted average export prices of EU-area competitors to non-Belgium, non-EU destinations (orthogonal to Belgian demand shocks by construction), and (ii) bilateral exchange rate movements between non-EU competitor currencies and the Euro. All instruments pass the Cragg-Donald and Kleibergen-Paap F-statistics (strongly rejecting weak instruments) and Hansen-Sargan over-identification tests (failing to reject validity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What evidence supports the validity of the TFPQ instrument against capacity utilization concerns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The authors run two empirical tests. First, regressing marginal cost on four-quarter-lagged capacity utilization yields a small, statistically insignificant elasticity (0.011, SE 0.052), suggesting the TFPQ instrument&amp;rsquo;s predictive power does not reflect capacity utilization variation. Second, re-estimating with &amp;ldquo;purified&amp;rdquo; TFPQ instruments adjusted for capital utilization (Column 4) and for both capital and labor utilization (Column 5) produces parameter estimates and NKPC slopes essentially unchanged from baseline. Additionally, regression residuals show only weak and short-lived autocorrelation (−0.09 at one-quarter lag, p=0.09; −0.01 at two-quarter lag, p=0.69), indicating demand shocks are highly transitory after conditioning on fixed effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the model track aggregate Belgian manufacturing PPI inflation, and what does this imply for cost-push shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: Using the reduced-form expression π_t = λ̃(mc_t^n − p_{t-1}) + α + θu_t, where the reduced-form slope λ̃ = 0.22 is evaluated at baseline structural estimates, the model produces a model-implied inflation series that accounts for approximately 70 percent of variation in manufacturing PPI inflation (R² = 0.68, correlation 0.8), without including inflation lags or cost-push shocks. The model captures the inflation drop during the 2008 financial crisis, the run-up in 2016, and the subsequent decline. This contrasts with the quantitative DSGE literature in which cost-push shocks (variation in desired price and wage markups) account for approximately 70 percent of inflation volatility (e.g., Primiceri, Schaumburg and Tambalotti 2006).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the authors estimate the output elasticity of marginal cost σ_y, and what do they find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: They use two approaches. Model D is a pricing equation directly relating firm-level prices and nominal output (value added), estimated via GMM, instrumented with Bartik-style shifters based on high-frequency ECB monetary policy surprises (Altavilla et al. 2019) interacted with industry-level sensitivities. Model E directly regresses changes in nominal marginal cost on changes in nominal output, also instrumented. Model D yields σ_y = 0.406 (SE 0.099) and implied κ = 0.021 (SE 0.005); Model E yields σ_y = 0.112 (SE 0.026) and κ = 0.006 (SE 0.001). The low σ_y is consistent with near-constant short-run returns to scale at the firm level and wage rigidity muting general equilibrium labor-market feedback, at least during the moderate-inflation pre-pandemic period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the oil shock validation exercise confirm the cost-based NKPC slope estimate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: Following Känzig (2021), the authors identify oil shocks as unexpected movements in Brent crude oil futures around OPEC meeting days, normalizing to a one-standard-deviation shock (15.7 percent Brent increase). Local linear projection IRFs show that firms&amp;rsquo; real marginal costs rise 1.5 to 3 percent within three quarters and then revert, while prices peak at approximately 3 percent increase after six quarters (consistent with nominal rigidity delaying the price response). Impulse-response matching — minimizing the weighted distance between empirical and model-implied price IRFs — yields λ_IRF = 0.042 (SE 0.005), which is close to and within the confidence bands of the micro-level estimate λ = 0.052, validating the bottom-up estimation approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What do the estimates imply about why the conventional NKPC appears flat in normal times?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The flat conventional NKPC slope (κ ≈ 0.006–0.024) does not reflect limited transmission of marginal cost fluctuations to inflation — that transmission is high (λ ≈ 0.052–0.077). Rather, flatness reflects a weak link between the output gap and marginal cost during the pre-pandemic period (σ_y ≈ 0.112–0.406), attributable to near-constant short-run returns to scale in production and wage rigidity. This decomposition matters for policy: supply shocks that directly raise marginal cost will pass through strongly to inflation even when output does not move much, whereas demand shocks that operate through the output-cost channel face attenuated transmission.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Under what conditions does the cost-based Phillips curve decompose cleanly into a product of the two elasticities?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A13: The decomposition κ = λ · σ_y requires assuming that real wages are flexible and determined in general equilibrium at the industry level, with real wages increasing in industry output with elasticity σ_w; that the natural level of output is defined as the equilibrium under flexible prices and constant desired markups; and that the firm&amp;rsquo;s marginal product of labor depends on productivity and output with a common short-run returns-to-scale parameter ν (homogeneous across firms and time-invariant). Under these assumptions (which parallel those used to derive the conventional NKPC in the standard NK model), the output elasticity of marginal cost is σ_y = σ_w + ν, and the theoretical restriction κ = λ · σ_y holds exactly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: How do macroeconomic complementarities from aggregate decreasing returns to scale affect the NKPC slope?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A14: If aggregate SR-RTS fall below unity, the NKPC slope formula gains an additional term Θ = 1/(1+γν(1−Ω)) &amp;lt; 1, where ν is inversely related to average SR-RTS and γ is the within-industry elasticity of substitution. However, empirical estimates of sectoral SR-RTS range from 0.93 to 0.98, with an aggregate estimate of approximately 0.965 (implying ν ≈ 0.036). Given this and calibrating γ = 4, Θ ≈ 0.941, so macroeconomic complementarities would reduce the NKPC slope by only about 6 percent — well within the confidence bounds of the baseline estimates. The authors conclude that the constant-returns assumption in their main framework is a good approximation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Primitive (cost-based) NKPC slope (λ):&lt;/strong&gt; The coefficient linking inflation to real marginal cost in the underlying New Keynesian pricing equation, defined as λ = &lt;a href="1%e2%88%92%ce%a9"&gt;(1−θ)(1−βθ)/θ&lt;/a&gt;. It captures how strongly firms&amp;rsquo; aggregate price setting responds to movements in real marginal cost per unit of output, holding the discount factor, nominal rigidity, and strategic complementarities fixed. Estimated at 0.052 (tightly, range 0.044–0.077 across specifications) for Belgian manufacturing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calvo probability of price non-adjustment (θ):&lt;/strong&gt; The parameter from Calvo (1983) staggered price setting capturing the share of firms that cannot change their price in a given period, equal to one minus the per-period probability of price adjustment. In this paper, θ is estimated directly from the dynamic pass-through regression coefficient on lagged prices, yielding θ ≈ 0.711, implying prices fixed approximately three to four quarters on average.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Strategic complementarities parameter (Ω):&lt;/strong&gt; Defined as Ω = Γ/(1+Γ), where Γ is the elasticity of a firm&amp;rsquo;s desired markup with respect to its own relative price. Captures the extent to which a firm weights competitors&amp;rsquo; prices (rather than its own marginal cost) when resetting its price. High Ω means firms strongly mute price responses to own cost changes to avoid relative price deviations from competitors. Estimated at Ω ≈ 0.570, implying competitor prices and own marginal cost enter the reset price with roughly equal weight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic pass-through regression:&lt;/strong&gt; A forward-looking pricing equation (Model A) relating observed firm prices to the discounted present values of own marginal costs and competitors&amp;rsquo; prices, plus lagged own price as an error-correction term. The structural parameters θ and Ω are identified jointly from the regression coefficients, using GMM with instruments for the present values. The dynamic specification is necessary at quarterly frequency because the error-correction term (omitted in static pass-through models) is non-negligible when θ &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Output elasticity of marginal cost (σ_y):&lt;/strong&gt; The elasticity of firm-level real marginal cost with respect to the firm-level output gap, defined under the assumptions that real wages are flexible and industry-level, equal to σ_y = σ_w + ν (wage elasticity with respect to industry output plus the short-run returns-to-scale parameter). This parameter bridges the cost-based and output-based Phillips curve slopes via κ = λ · σ_y. Estimated from micro data using monetary policy shock instruments at σ_y ≈ 0.112–0.406 in the pre-pandemic period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-run returns to scale (SR-RTS):&lt;/strong&gt; The extent to which a firm&amp;rsquo;s marginal cost rises with output scale in the short run, parameterized by ν in the cost function MC^n_ft = C_{it} · A_{ft} · Y_ft^ν. If ν = 0, marginal cost is independent of output scale (constant returns), which the authors assume in their baseline. Firm- and sector-level estimates from Translog production functions yield SR-RTS ≈ 0.93–0.98 across sectors (aggregate ≈ 0.965), broadly consistent with the constant-returns assumption and implying modest macroeconomic complementarities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reduced-form aggregate pass-through slope (λ̃):&lt;/strong&gt; A composite parameter capturing the contemporaneous pass-through of aggregate real marginal cost (defined as nominal marginal cost relative to the lagged price level) into quarterly inflation under the assumption that nominal marginal cost follows a random walk. Evaluated at θ ≈ 0.70 and Ω ≈ 0.52 (median across models), λ̃ = 0.22. This is distinct from the structural NKPC slope λ because it also captures the persistence of cost shocks.&lt;/p&gt;</description></item><item><title>Are Inflationary Shocks Regressive? A Feasible Set Approach</title><link>https://macropaperwarehouse.com/papers/are-inflationary-shocks-regressive-a-feasible-set-approach/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/are-inflationary-shocks-regressive-a-feasible-set-approach/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; The paper asks whether inflationary shocks are regressive, and demonstrates that the answer depends critically on the &lt;em&gt;source&lt;/em&gt; of the shock. A single aggregate inflation statistic conceals radically different distributional consequences depending on whether inflation is driven by an oil supply contraction or by expansionary monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework.&lt;/strong&gt; The authors develop a &amp;ldquo;feasible set approach&amp;rdquo; grounded in the envelope theorem. They show that the first-order money-metric welfare effect of any macroeconomic shock on a household is summarized by the present discounted value of changes to five components of the household&amp;rsquo;s budget constraint: (1) consumption prices, (2) wage income, (3) asset dividends, (4) asset prices, and (5) government transfers. Because the envelope theorem implies that endogenous substitution responses are not welfare-relevant to a first order, no assumption about the utility function&amp;rsquo;s form or the economy&amp;rsquo;s general equilibrium structure is required. The framework is valid for generic stationary shocks that do not directly shift household preferences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy.&lt;/strong&gt; The welfare formula requires two inputs: (i) impulse response functions (IRFs) for all prices, dividends, wages, and unemployment, estimated using internal-instrument SVAR methods applied to two identified shocks — the Kanzig (2021) oil supply news shock (instrumented by oil futures surprises around OPEC announcements) and the Gertler-Karadi (2015) monetary policy shock (instrumented by fed funds futures surprises in 30-minute windows around FOMC announcements) — and (ii) cross-sectional data on consumption bundles, labor income, and asset portfolios from the CEX, CPS, SCF, and SIPP for three education groups (high school or less, some college, college-educated) across the full lifecycle. The baseline cross-section uses 2019 data. Shocks are normalized to produce comparable aggregate inflation responses: a 10% WTI oil price increase and a 25 basis point decline in the one-year Treasury yield each generate roughly 15–16 basis points of CPI-U inflation on impact, rising to approximately 34–35 basis points after two quarters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; Oil supply contractions are regressive and monetary expansions are progressive, and this divergence is primarily driven by the asset price channel, not the consumption price or labor income channels.&lt;/p&gt;
&lt;p&gt;For the 10% oil supply shock: middle-aged households with high school education or less must be paid approximately $870 (around 2% of annual consumption) to be made whole relative to their pre-shock utility; college-educated middle-aged households, by contrast, gain the equivalent of approximately $833 (1.1% of annual consumption). Younger college-educated households (still net equity accumulators) gain around $572.&lt;/p&gt;
&lt;p&gt;For the 25 basis point monetary rate cut: low-education households approximately break even (net welfare effect near $23), while middle-aged college-educated households must be paid approximately $4,051 (around 5.5% of annual consumption) to restore their pre-shock utility. Older college-educated households must be paid approximately $851.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why asset prices dominate.&lt;/strong&gt; Oil supply contractions reduce equity prices (S&amp;amp;P500 falls approximately 2% one year post-shock) and depress dividends (approximately 82 basis points), while leaving house prices and bond prices largely unaffected. Because middle-aged college-educated households are the primary accumulators of equities, they benefit from the price decline (cheaper future accumulation), making oil shocks progressive through this channel — but regressive overall once the consumption and labor income channels (both mildly regressive) are included. Monetary expansions do the opposite: equity prices rise approximately 3 percentage points on impact, house prices rise approximately 1.5% after three years, and dividends increase. These asset price increases hurt those in the accumulation phase — disproportionately middle-aged college-educated households — creating a progressive distributional pattern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption and labor income channels.&lt;/strong&gt; Both shocks generate disproportionate inflation in motor fuel and fuel and utilities, and low-education households spend a larger share of their budget on these goods, making the consumption channel mildly regressive for both shocks. The labor income channel differs sharply: oil shocks raise unemployment (approximately 0.15 log points for low-education households two years post-shock) and reduce weekly earnings by 0.2–0.6 log points, mildly harming low-education workers; monetary expansions reduce unemployment (approximately 0.83 log points for low-education workers one year post-shock) and similarly benefit low-education households through the labor market, pushing toward progressivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results apply to short-run first-order welfare effects of identified stationary macroeconomic shocks (four-year horizon). The framework does not incorporate uncertainty shocks, preference shocks, or the role of hedging motives in portfolio choice. Results concern policy shocks rather than policy rules.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; Qualitative conclusions hold across six alternative specifications: incorporating borrowing constraints (with or without empirical death rates), adjusting for unemployment insurance replacement rates (approximately 6% true average replacement rate), allowing for log-linear trends in no-shock choices, and dropping aggregate CPI controls from IRF estimation.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the &amp;ldquo;feasible set approach&amp;rdquo; and how does it differ from prior work on inflation incidence?&lt;/strong&gt;
A: The feasible set approach measures welfare effects through changes in the household&amp;rsquo;s entire budget constraint — consumption prices, wage income, asset dividends, asset prices, and government transfers — rather than focusing on any single channel. Prior work either examined the Fisher channel (net nominal positions), or consumption price heterogeneity, or labor income responses in isolation. The key insight is that the envelope theorem implies substitution responses are not welfare-relevant to a first order, so the money-metric welfare change is simply the discounted sum of changes in the five budget constraint components evaluated at pre-shock choices, without requiring knowledge of the utility function&amp;rsquo;s form or the economy&amp;rsquo;s general equilibrium structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why is the asset price channel — rather than consumption prices — the dominant channel in both shocks?&lt;/strong&gt;
A: Asset holdings are large relative to annual consumption (net worth averages $1.5 million for college-educated and $260,000 for high-school-educated households in 2019), so even modest percentage movements in asset prices generate large dollar welfare effects. By contrast, the budget shares on the goods most responsive to both shocks (motor fuel, fuel and utilities) are relatively modest, so the consumption channel, while mildly regressive, is quantitatively small relative to the portfolio channel. The portfolio channel accounts for roughly 0.5% of consumption gains for middle-aged college-educated households under the oil shock, while the consumption channel produces losses of only about 0.1% for college-educated and 0.25% for low-education households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the direction of the equity price response differ between oil and monetary shocks, and why does this create opposite distributional effects?&lt;/strong&gt;
A: An oil supply contraction reduces equity prices (approximately 2% decline one year post-shock) and dividends (approximately 82 basis points decline), while a monetary expansion raises equity prices (approximately 3 percentage points on impact, approximately 4% higher after four quarters) and increases dividends. The welfare effect of asset price changes falls on those who &lt;em&gt;trade&lt;/em&gt; the asset, not those who merely hold it at a constant level: middle-aged college-educated households are the primary net &lt;em&gt;accumulators&lt;/em&gt; of equity, so falling prices benefit them (they can buy more cheaply) while rising prices hurt them. This is the principal reason oil shocks appear progressive through the portfolio channel — but regressive overall — while monetary expansions are regressive through the portfolio channel and progressive overall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the precise welfare numbers for oil supply shocks by education group (baseline, ages 22–65)?&lt;/strong&gt;
A: From Table 3 (baseline row, lifecycle-weighted averages for ages 25–65): households with high school or less experience a welfare loss of approximately $798; those with some college experience a loss of approximately $816; and college-educated households experience a welfare &lt;em&gt;gain&lt;/em&gt; of approximately $494. These numbers reflect the sum of the consumption, labor income, portfolio, and transfer channels over a 16-quarter horizon, discounted at the one-year Treasury yield.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the precise welfare numbers for monetary policy shocks by education group (baseline, ages 25–65)?&lt;/strong&gt;
A: From Table 3 (baseline row): households with high school or less experience a small welfare &lt;em&gt;gain&lt;/em&gt; of approximately $23; those with some college experience a welfare loss of approximately $1,278; and college-educated households experience a welfare loss of approximately $3,055. These losses for college-educated households are driven overwhelmingly by rising equity and house prices that raise the cost of planned asset accumulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the life cycle interact with the distributional incidence of both shocks?&lt;/strong&gt;
A: There is substantial heterogeneity within education groups across the life cycle because asset accumulation and decumulation patterns are age-dependent. Under oil shocks, younger college-educated households (who are net equity accumulators) gain approximately $572, middle-aged college-educated households gain approximately $833, while older college-educated households lose approximately $69 (because they hold large equity positions and lose dividend income). Under monetary shocks, middle-aged college-educated households lose the most (approximately $4,051) because they are simultaneously accumulating equities and housing, both of which become more expensive. Older college-educated households lose less (approximately $851) because rising dividends on existing holdings partially offset the asset price cost. Low-education households are approximately flat across the life cycle under monetary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the consumption channel compare across education groups and across the two shocks?&lt;/strong&gt;
A: The consumption channel is mildly regressive for both shocks, but of similar absolute magnitude across the two shocks because both generate similar inflation in motor fuel and fuel and utilities — the goods with the largest price response. Low-education households spend a larger share on motor fuel and fuel and utilities; as a result, they lose approximately 0.25% of consumption from the consumption channel under the oil shock, compared with less than 0.1% for college-educated households. For monetary shocks, the consumption channel affects all household types roughly equally in proportional terms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the labor income channel differ between oil and monetary shocks across education groups?&lt;/strong&gt;
A: Oil shocks raise unemployment disproportionately for low-education workers (approximately 0.15 log point increase after two years, roughly 0.68 standard deviations, compared with near-zero response for college-educated workers) and reduce weekly earnings by 0.2–0.6 log points across groups. Monetary expansions reverse this: a 25 basis point rate cut reduces log unemployment by approximately 0.83 log points for low-education workers and approximately 1.96 log points for college-educated workers after one year, with limited response in conditional wages. Thus the labor income channel pushes toward regressive incidence for oil shocks and toward progressive incidence for monetary expansions, though in both cases it is quantitatively smaller than the portfolio channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the role of housing in the portfolio channel?&lt;/strong&gt;
A: Housing behaves simultaneously as a durable consumption good and a financial asset. A house price increase raises welfare for households planning to &lt;em&gt;decumulate&lt;/em&gt; (sell) housing (primarily older households) through the portfolio channel, but also raises the implicit rental cost for those who &lt;em&gt;use&lt;/em&gt; housing — a negative consumption-side effect. Monetary expansions raise house prices by approximately 1.5% after three years. College-educated households accumulate housing at a faster rate and earlier in the life cycle than low-education households, making them more exposed to the cost of rising house prices during the accumulation phase. This amplifies the progressive pattern of monetary shocks through the portfolio channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the paper handle the dual role of durable goods (vehicles and housing)?&lt;/strong&gt;
A: Durable goods are treated as both a consumption good and a financial asset. The utility-relevant consumption price of a durable is proportional to the price times the depreciation rate per unit of use, capturing the &amp;ldquo;implicit rent&amp;rdquo; of ownership. On the asset side, the durable enters the portfolio channel like a zero-dividend financial asset. This allows the framework to correctly attribute, for example, that a rise in house prices hurts net accumulators (through the portfolio channel) while also raising the implicit cost of housing services (through the consumption channel), rather than treating house price appreciation as an unambiguous welfare gain for homeowners.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What happens to the main conclusions when borrowing constraints are introduced?&lt;/strong&gt;
A: Incorporating net worth constraints (with either constant or empirical death rates) dampens the portfolio channel for young and middle-aged college-educated households, because rising asset prices relax borrowing constraints for these households, partially offsetting the welfare cost of more expensive accumulation. Under constant death rates with borrowing constraints, college-educated households&amp;rsquo; oil shock welfare gain falls from +$494 to +$76; under empirical death rates, it becomes a loss of -$394. For monetary shocks, the college-educated loss falls from -$3,055 to -$1,718 (constant death rate) or -$1,036 (empirical death rates). Despite these quantitative changes, the qualitative conclusion — oil shocks are regressive, monetary expansions are progressive — holds across all specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the implication of these findings for the policy interaction between oil shocks and monetary tightening?&lt;/strong&gt;
A: If the monetary authority responds to oil-price-induced inflation with unexpected interest rate increases, it may exacerbate the distributional consequences of the initial oil shock. An oil supply contraction is already regressive (harming low-education households through consumption prices and labor market effects); a disinflationary monetary tightening would additionally harm low-education households through the labor income channel (higher unemployment, lower wages) while partially benefiting college-educated households through lower asset prices. The paper notes this policy interaction as noteworthy, while cautioning that the results concern identified policy &lt;em&gt;shocks&lt;/em&gt; rather than policy &lt;em&gt;rules&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How are the two shocks calibrated to be comparable?&lt;/strong&gt;
A: The oil shock is normalized to a 10% increase in WTI crude oil prices (approximately one standard deviation of monthly oil price growth). The monetary shock is normalized to a 25 basis point decline in the one-year Treasury yield — chosen because it generates approximately the same aggregate CPI-U inflation response as the oil shock (approximately 15–16 basis points on impact, rising to approximately 34–35 basis points after two quarters). This normalization allows the paper to attribute the different distributional outcomes to the &lt;em&gt;source&lt;/em&gt; of inflation rather than to differences in the aggregate inflation magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What role does the transfer channel play, and for whom?&lt;/strong&gt;
A: The transfer channel is small relative to the other three channels for the vast majority of working-age households, because transfer income is less than $100 per month for most households under age 65. Social Security payments — the bulk of transfer income — are explicitly indexed to the CPI; the paper models them as moving with CPI with a one-year lag. The transfer channel exclusively benefits older households (those receiving Social Security), and its quantitative effect is modest even there. Transfer income is more than 20 times smaller than labor and asset income for prime-age households of all education groups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Feasible set approach.&lt;/strong&gt; The paper&amp;rsquo;s organizing framework, in which the first-order welfare impact of a macroeconomic shock is measured by how the shock changes the household&amp;rsquo;s budget constraint (consumption prices, wage income, asset dividends, asset prices, and government transfers) evaluated at the household&amp;rsquo;s pre-shock choices. Substitution responses are not welfare-relevant to a first order by the envelope theorem.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Money-metric welfare gain.&lt;/strong&gt; The willingness-to-pay measure used throughout: the welfare change from a shock divided by the household&amp;rsquo;s marginal utility of consumption at time zero, expressed in time-zero dollars. Interpreted as an equivalent variation — the amount the household must be paid or would give up to be indifferent to receiving the shock. Used because it places households with very different utility functions on a common dollar scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Portfolio channel.&lt;/strong&gt; The component of the welfare formula capturing the effect of asset price and dividend changes on household welfare. Asset price changes are welfare-relevant only for households that &lt;em&gt;trade&lt;/em&gt; (accumulate or decumulate) the asset: rising prices benefit sellers and harm buyers; falling prices benefit buyers and harm sellers. This is distinct from the &amp;ldquo;Fisher channel&amp;rdquo; in prior literature, which focuses on net nominal positions rather than on which households are in the accumulation versus decumulation phase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal instrument SVAR.&lt;/strong&gt; The time-series estimation procedure used throughout: the pre-estimated identified shock series (oil supply news or monetary policy surprise) is included as a variable ordered first in a recursive structural VAR for each outcome variable. This separates shock identification (using the published instruments and controls from Kanzig 2021 and Gertler-Karadi 2015) from IRF estimation for each outcome variable, allowing the use of the full available sample for each outcome series.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Oil supply news shock (Kanzig 2021).&lt;/strong&gt; An identified supply shock to oil markets, constructed from changes in oil price futures in tight windows around OPEC production announcements. Used to capture exogenous cost-push inflation driven by supply constraints rather than demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monetary policy shock (Gertler-Karadi 2015).&lt;/strong&gt; An identified demand-side shock, constructed from federal funds rate futures surprises in 30-minute windows around FOMC announcements, instrumented into a monetary SVAR. Captures exogenous interest rate cuts that generate aggregate demand expansion and inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Borrowing constraint wedge.&lt;/strong&gt; An additional term that appears in the welfare formula when households face net worth constraints. Proportional to the Lagrange multiplier on the net worth constraint, it discounts future periods more heavily when constraints bind, and adds a term for the welfare value of relaxed constraints when asset prices rise. Identified from deviations from perfect consumption smoothing using CEX lifecycle consumption data.&lt;/p&gt;</description></item><item><title>Artificial intelligence and technological unemployment</title><link>https://macropaperwarehouse.com/papers/artificial-intelligence-and-technological-unemployment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/artificial-intelligence-and-technological-unemployment/</guid><description>&lt;p&gt;Wang and Wong develop a continuous-time labor-search model to assess the dynamic effects of generative AI (GenAI) on labor productivity and unemployment. The paper is motivated by conflicting empirical evidence: micro studies find productivity gains of 14% (Brynjolfsson, Li, and Raymond 2025) and 55.8% faster coding (Peng et al. 2023), while macro estimates suggest modest TFP gains of at most 0.064% annually (Acemoglu 2024), and occupation-level evidence shows a 13% relative employment decline in AI-exposed jobs (Brynjolfsson, Chandar, and Chen 2025).&lt;/p&gt;
&lt;p&gt;The model distinguishes GenAI from earlier automation technologies by its learning-by-using mechanism: AI capability grows at rate µ per employed worker (law of motion dAt/At = µHt − δ), raises employed workers&amp;rsquo; productivity, and creates a displacement threat through renegotiation. When renegotiation fails, AI replaces the worker, generating technological unemployment. Firms renegotiate wages at a rate ρµAt proportional to AI&amp;rsquo;s learning rate and the job&amp;rsquo;s exposure ρ. The joint surplus condition governs whether replacement occurs: AI replaces a worker if and only if πA (AI&amp;rsquo;s net present value per output) exceeds the post-renegotiation joint surplus St.&lt;/p&gt;
&lt;p&gt;The model admits three steady states: (i) a some-AI steady state with finite AI capability, persistent AI adoption (It = 1), expanded job creation but declining employment at H∞ = δ/µ; (ii) an unbounded-AI equilibrium with sustained endogenous growth, no displacement (It = 0), and employment at H∞ = α/(α+σ); and (iii) a no-AI equilibrium reverting to the Mortensen-Pissarides benchmark. In the benchmark model (exogenous job-finding rate, AI-augmented productivity), multiple steady states can coexist—global indeterminacy—when condition (28) holds. In the full model (endogenous job creation via free entry), both global and local indeterminacy are possible, and a continuum of oscillatory transition paths converge to the some-AI steady state.&lt;/p&gt;
&lt;p&gt;Calibrated to U.S. data, targeting a pre-AI unemployment rate of 5%, AI elasticity of productivity εy = 1.069 (from Czarnitzki et al. 2023), initial AI productivity boost of 14% (Brynjolfsson et al. 2025), worker exposure ρ = 0.618 (Brynjolfsson et al. 2018&amp;rsquo;s machine learning suitability index), AI replacement cost ϕ = 0.0043 (from U.S. business GenAI spending), AI learning rate µ = 0.632, and AI error rate δ = 0.462 (Moore&amp;rsquo;s law half-life of 1.5 years), the model converges to a some-AI steady state. The long-run results are: a 23% employment loss (H∞ = 0.732 vs. H0 = 0.95), AI capability improvement of 321%, and labor productivity gain of 366%. Approximately half of the employment loss—11.5 percentage points—occurs within the first five years, alongside a 49.3% output gain and 45.5% AI capability improvement over that period.&lt;/p&gt;
&lt;p&gt;Untargeted moments are validated: the model implies 7.08% labor productivity growth over the first 10 years (consistent with Briggs and Kodnani 2023) and an AI elasticity of vacancies averaging 0.16 over the first five years (consistent with Acemoglu et al. 2022).&lt;/p&gt;
&lt;p&gt;On welfare, equilibria are inefficient even when the Hosios condition holds. AI introduces four externalities beyond standard matching frictions: job destruction via displacement, productivity enhancement for employed workers, feedback from AI learning depending on employment, and direct effects on matching surpluses. A constrained-optimal subsidy to jobs at risk of AI displacement is 26.6% in the short run and exceeds 50% in the long run. In the full model, the Hosios condition requires fixing firm bargaining power θ to the vacancy elasticity of matching ξ, but an additional per-output transfer T = µApωA to firm-worker matches is necessary to correct AI adoption inefficiency.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism by which AI generates unemployment in this model?
A: AI capability grows through a learning-by-using process (dAt/At = µHt − δ), improving as it observes employed workers. As capability rises, firms gain a displacement option that arrives at rate ρµAt per matched pair. When renegotiation over wages fails—i.e., when the AI&amp;rsquo;s NPV πA exceeds the joint surplus—firms replace workers with AI, causing unemployment. This creates a feedback loop: higher employment accelerates AI learning, which increases displacement pressure and reduces employment.&lt;/p&gt;
&lt;p&gt;Q: What are the three steady states and what distinguishes them?
A: The some-AI steady state features finite AI capability, persistent displacement (It = 1), and long-run employment H∞ = δ/µ; it involves technological unemployment. The unbounded-AI steady state features infinite AI capability, no displacement (It = 0), endogenous productivity growth, and employment H∞ = α/(α+σ) as in the standard Mortensen-Pissarides model. The no-AI steady state has A∞ = 0 with the same H∞ = α/(α+σ) but no AI contribution. Employment is higher in the unbounded-AI equilibrium than in the some-AI equilibrium.&lt;/p&gt;
&lt;p&gt;Q: What does the calibration imply for long-run employment and productivity?
A: The calibrated full model converges to a some-AI steady state with a 23% employment loss (H∞ = 0.732), a 321% improvement in AI capability, and a 366% gain in labor productivity. The parameters yield a unique equilibrium under the baseline calibration (πA = 1.949 &amp;gt; sAI = 0.8735 confirms some-AI existence). These results reflect a large worker replacement effect under the calibrated AI learning and error rates, while the job creation effect is relatively modest.&lt;/p&gt;
&lt;p&gt;Q: How fast does technological unemployment materialize?
A: Approximately half of the total 23% employment loss occurs within the first five years; specifically, employment falls by 11.5 percentage points over that period. Over the same five years, AI capability improves by 45.5% and output rises by 49.3%. Over the first 10 years, AI capability improvement accumulates to 94.0% and output gain to 103% (approximately double the five-year output gain).&lt;/p&gt;
&lt;p&gt;Q: How does the full model differ from the benchmark model in transition dynamics?
A: In the full model, job-finding rates are endogenous: firms post vacancies until a free-entry condition (κyt = ftΠt) is satisfied, tying job-finding rate αt to the surplus ratio st via αt = α(st). This endogeneity implies that as AI raises labor productivity, firms create more vacancies, slowing the employment decline relative to the benchmark model with a fixed job-finding rate. At the same time, AI capability grows faster in the full model because higher employment accelerates AI learning.&lt;/p&gt;
&lt;p&gt;Q: What is global indeterminacy and when does it arise?
A: Global indeterminacy occurs when both the some-AI and unbounded-AI steady states coexist, so the long-run outcome depends on initial conditions or expectations. In the benchmark model this requires condition (28): 0 &amp;lt; r + σ + α(1−θ) − (1−b)/πA ≤ εy(µα/(α+σ) − δ). In the full model, global indeterminacy is plausible when firm bargaining power rises to θ = 0.95 given the baseline AI replacement cost ϕ = 0.0043. The region of global indeterminacy is larger when firm bargaining power is higher.&lt;/p&gt;
&lt;p&gt;Q: What is local indeterminacy and what does it imply for transition paths?
A: Local indeterminacy means there is a continuum of equilibrium paths converging to the some-AI steady state in the neighborhood of that steady state, rather than a unique saddle path. In the full model, under alternative parameters (θ = 1, ξ = 0.765, εy = 6), the eigenvalues feature a negative real root and two complex roots with negative real parts, yielding oscillatory local dynamics in employment and AI capability. This implies short-run cycles in productivity and unemployment, consistent with the wide range of empirical findings on AI&amp;rsquo;s labor-market effects.&lt;/p&gt;
&lt;p&gt;Q: Why does the Hosios condition fail to deliver efficiency in this model?
A: The Hosios condition eliminates the standard matching externality by setting firm bargaining power to the vacancy elasticity of matching. But AI introduces four additional externalities: (i) job destruction through displacement, (ii) productivity enhancement for employed workers, (iii) feedback from AI learning that depends on aggregate employment, and (iv) direct effects on matching surpluses and job-finding rates. These externalities mean the standard Hosios rule alone is insufficient; additional instruments are required.&lt;/p&gt;
&lt;p&gt;Q: What is the constrained-optimal policy response?
A: In the simple model, the constrained optimal AI adoption threshold differs from the equilibrium threshold because firm bargaining power θ distorts adoption decisions: AI is over-adopted when πA &amp;gt; (1−b)/(r+σ+α(1−θ)) and under-adopted when (1−b)/(r+σ+α) &amp;lt; πA ≤ (1−b)/(r+σ+α(1−θ)). In the full model, constrained optimality requires setting θ = ξ (Hosios) plus a per-output subsidy T = µApωA to firm-worker matches exposed to AI displacement. This targeted subsidy is 26.6% in the short run and exceeds 50% in the long run.&lt;/p&gt;
&lt;p&gt;Q: How does AI compare to computers in this model&amp;rsquo;s counterfactual?
A: The paper reports that exogenous productivity growth from computers reduced unemployment only modestly—by 0.16 percentage points. By contrast, AI&amp;rsquo;s learning-by-using and displacement features imply a nearly 20% long-run employment loss in a comparable counterfactual. The key distinction is that computers lack the self-learning improvement and associated renegotiation-triggered displacement that characterize GenAI in this model.&lt;/p&gt;
&lt;p&gt;Q: How is AI exposure parameterized and what does it capture?
A: The exposure parameter ρ captures the degree to which a job is subject to AI-driven replacement risk. It is calibrated using Brynjolfsson et al. (2018)&amp;rsquo;s suitability for machine learning (SML) index: on a 1–5 scale, SML averages 3.47 across 964 O*NET occupations, translating to (3.47−1)/(5−1) = 61.8%, so ρ = 0.618. The effective exposure measure is ρµ, which is higher when facing a faster-learning AI.&lt;/p&gt;
&lt;p&gt;Q: What is the predator-prey analogy in the model&amp;rsquo;s dynamics?
A: The dynamical system for AI capability (At) and employment (Ht) in the simple model resembles the Lotka-Volterra predator-prey system. Employment (prey) feeds AI learning; as AI capability (predator) grows, it displaces workers faster, reducing employment; lower employment then slows AI learning, causing capability to decay; and the cycle repeats with diminishing magnitude until the steady state is reached. This mechanism operates only when the AI learning rate µ is neither too high nor too low, with the convergence path being a spiral when µα &amp;lt; 4δ²(1 − δ(α+σ)/(µα)).&lt;/p&gt;
&lt;p&gt;Q: What is the labor-share implication of the unbounded-AI equilibrium?
A: In the unbounded-AI steady state, employment is higher than in the some-AI steady state (H^AJJ &amp;gt; H^AI) and labor productivity grows without bound. However, the labor share is lower in the unbounded-AI equilibrium if the firm&amp;rsquo;s bargaining power θ is sufficiently low. This implies that while workers are not fully displaced and rising AI-augmented productivity sustains employment, workers&amp;rsquo; income share may still decline even in the more favorable unbounded scenario.&lt;/p&gt;
&lt;p&gt;Technological unemployment: A phenomenon in which AI adoption raises labor productivity and expands job creation, yet still causes sizable employment losses because the worker displacement effect (driven by renegotiation failure when AI&amp;rsquo;s NPV πA exceeds the joint surplus) dominates the job-creation effect. In the calibrated model this amounts to a 23% employment loss despite a 366% productivity gain.&lt;/p&gt;
&lt;p&gt;Learning-by-using AI: The model&amp;rsquo;s representation of GenAI as a technology whose capability At grows through reinforced learning from employed workers at rate µ per worker, so aggregate AI growth is µHt, offset by deterioration at rate δ. This distinguishes GenAI from earlier automation technologies (computers, robotics) that do not self-improve through usage.&lt;/p&gt;
&lt;p&gt;Some-AI steady state: A long-run equilibrium with finite AI capability (gA∞ = 0), persistent AI adoption (It = 1), and employment pinned at H∞ = δ/µ—the ratio of AI&amp;rsquo;s error rate to its learning rate. Characterized by expanded job creation but lower employment than the no-AI benchmark, constituting the model&amp;rsquo;s primary calibrated outcome.&lt;/p&gt;
&lt;p&gt;Unbounded-AI steady state: A long-run equilibrium with infinite AI capability (A∞ = ∞), no displacement (It = 0), and endogenous growth at rate gA = µH^AJJ − δ. Employment equals the Mortensen-Pissarides level H∞ = α/(α+σ), and labor productivity grows without bound, complementing Aghion, Jones, and Jones (2019)&amp;rsquo;s idea production framework.&lt;/p&gt;
&lt;p&gt;Global indeterminacy: Coexistence of multiple steady states (some-AI and unbounded-AI) such that the long-run equilibrium depends on initial conditions or expectations rather than being uniquely determined. Arises in the benchmark model when condition (28) holds and becomes more likely with higher firm bargaining power θ.&lt;/p&gt;
&lt;p&gt;Local indeterminacy: A continuum of equilibrium transition paths converging to a single steady state from nearby initial conditions, rather than a unique saddle path. Arises in the full model under certain parameter configurations (e.g., θ = 1, ξ = 0.765, εy = 6), implying oscillatory short-run dynamics in employment and AI capability.&lt;/p&gt;
&lt;p&gt;AI exposure (ρ): A firm-level parameter capturing the degree to which a job-match is subject to AI-driven displacement risk. The displacement option arrives at rate ρµAt per matched pair; ρ is calibrated at 0.618 using the average suitability-for-machine-learning score across O*NET occupations. The effective exposure measure is the product ρµ.&lt;/p&gt;
&lt;p&gt;Renegotiation-proof displacement: Proposition 1&amp;rsquo;s result that the joint surplus Snt is independent of the renegotiation round n, so the AI adoption decision It is also round-invariant. This simplifies the model to a single indicator function: AI replaces the worker if and only if πA exceeds the joint surplus St, regardless of how many renegotiation rounds have occurred.&lt;/p&gt;</description></item><item><title>Auctions with Frictions: Recruitment, Entry, and Limited Commitment</title><link>https://macropaperwarehouse.com/papers/auctions-with-frictions-recruitment-entry-and-limited-commitment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/auctions-with-frictions-recruitment-entry-and-limited-commitment/</guid><description>&lt;p&gt;This paper develops an auction model that jointly incorporates three frictions pervading informal price-formation processes: (1) costly recruitment by the seller, (2) costly participation by bidders, and (3) the seller&amp;rsquo;s inability to commit to a recruitment level or reserve price. The authors argue these frictions are especially prevalent in markets for idiosyncratic assets such as mergers and acquisitions, real estate, and home repair contracting, where auction houses like Christie&amp;rsquo;s and Sotheby&amp;rsquo;s command fees of 20–30% of revenues precisely because they reduce the underlying inefficiencies.&lt;/p&gt;
&lt;p&gt;The model features a single seller who exerts recruitment effort gamma at cost gamma*s, generating a Poisson-distributed number of contacted bidders with mean gamma. Each contacted bidder independently decides whether to pay entry cost c &amp;gt; 0 to learn their private value and participate in a first-price auction (FPA). The seller cannot commit to gamma (which is unobservable to bidders) or to a reserve price. Two scenarios are analyzed: PO (participation-observable, where bidders observe the number of entrants before bidding) and PU (participation-unobservable).&lt;/p&gt;
&lt;p&gt;The central tension is between the seller&amp;rsquo;s incentive to recruit more bidders to intensify competition and raise revenue, and bidders&amp;rsquo; rational concern that excessive recruitment makes entry unprofitable. Because the seller cannot commit, this tension generates several novel inefficiency results.&lt;/p&gt;
&lt;p&gt;In the PO scenario, the seller&amp;rsquo;s marginal revenue from recruitment Ro&amp;rsquo;(lambda) is single-peaked, meaning there is a minimum profitable participation scale lambda_o below which the seller will never recruit. Combined with a maximum participation level lambda-bar_c above which bidders will not enter (defined by U(lambda-bar_c) = c, where U is the bidder&amp;rsquo;s expected payoff), no-trade equilibrium is the unique outcome whenever lambda-bar_c &amp;lt; lambda_o — even for arbitrarily small recruitment cost s. This result holds because with unobservable effort, bidders correctly anticipate the seller will target participation above lambda_o, making entry unprofitable. When lambda-bar_c &amp;gt; lambda_o, three regimes arise: (i) no trade if s exceeds a threshold s-bar_o; (ii) an interior equilibrium with full entry (q* = 1) and lambda* = lambda_o(s) for intermediate s; and (iii) for small s, an equilibrium with lambda* = lambda-bar_c and partial entry q* = Ro&amp;rsquo;(lambda-bar_c)/s &amp;lt; 1. In regime (iii), total recruitment cost lambda*(s/q*) equals the constant lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c) regardless of s — so even as s approaches zero, wasteful recruitment costs do not vanish, because they are determined by incentive constraints rather than by technology.&lt;/p&gt;
&lt;p&gt;In the PU scenario, a no-trade equilibrium always exists for all parameter values, because the seller cannot credibly disclose participation, creating self-reinforcing expectations of zero competition. The seller&amp;rsquo;s recruitment incentive xi(lambda) is strictly weaker than Ro&amp;rsquo;(lambda) in the PO scenario (proven via revenue equivalence: Ro&amp;rsquo;(lambda) = xi(lambda) + a positive term reflecting how greater participation induces more aggressive bidding). This yields ranking reversals: for intermediate s and small c, the PO scenario dominates PU; but for small s or large c, the PU scenario&amp;rsquo;s weaker recruitment incentive reduces wasteful over-recruitment, making PU preferable. These comparisons translate directly to a comparison of FPA and SPA with unobservable participation: the two formats are not equivalent in the presence of recruitment and entry frictions because they generate different recruitment incentives.&lt;/p&gt;
&lt;p&gt;A sampling-curse mechanism drives near-complete market unraveling when sellers have privately known recruitment costs drawn from a continuous uniform distribution on [0, s_o]. Because low-cost sellers recruit more, a contacted bidder believes the seller is more likely to have low costs — and hence to have recruited many other bidders — making entry unprofitable. Proposition 3 establishes a threshold c-hat such that for c in (c-hat, c-bar), as the lower bound of the cost distribution approaches zero, the fraction of seller types that remain inactive approaches one — near-complete unraveling — even though each type would be active if its cost were commonly known.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s main modeling innovation relative to the existing literature?
A: The paper&amp;rsquo;s central novelty is combining all three frictions — costly recruitment by the seller, costly participation by bidders, and limited seller commitment — in one model. The existing literature had studied entry and recruitment separately; Szech (2011) examined costly recruitment with costless entry; McAfee and McMillan (1987) and Levin and Smith (1994) studied costly entry with an exogenously given number of potential bidders; Milgrom (1987) and McAfee and Vincent (1997) studied limited commitment to a reserve price with a fixed bidder set. None combine all three.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;minimum profitable scale&amp;rdquo; result and why does it arise?
A: Because the seller cannot commit to a reserve price, the first few bidders are complementary — they stimulate competitive bidding, causing the seller&amp;rsquo;s marginal revenue Ro&amp;rsquo;(lambda) to be initially increasing, then decreasing (single-peaked). This means the seller&amp;rsquo;s profit Pi_o(lambda, q) is maximized either at zero or at a participation level above a minimum scale lambda_o, defined by Ro&amp;rsquo;(lambda_o) = s-bar_o. The seller will never choose a participation level between 0 and lambda_o.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does the market completely shut down in the PO scenario?
A: No-trade is the unique equilibrium outcome whenever lambda-bar_c &amp;lt; lambda_o, where lambda-bar_c is defined by U(lambda-bar_c) = c (the participation break-even level) and lambda_o is the seller&amp;rsquo;s minimum profitable scale. This condition arises when entry costs c are large enough relative to the competitive dynamics. Importantly, no trade occurs for every recruitment cost s &amp;gt; 0, including arbitrarily small s — commitment failure alone can cause complete market breakdown even when recruiting bidders is nearly costless.&lt;/p&gt;
&lt;p&gt;Q: What is the inefficiency in regime (iii) of Proposition 2 (small s, PO scenario)?
A: When s &amp;lt; Ro&amp;rsquo;(lambda-bar_c), equilibrium has lambda* = lambda-bar_c and q* = Ro&amp;rsquo;(lambda-bar_c)/s &amp;lt; 1. The total recruitment cost is lambda* * (s/q*) = lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c), a strictly positive constant independent of s. As s approaches zero, total recruitment effort and its cost do not vanish — they are pinned by incentive constraints. This waste could be avoided if the seller could commit to an effort level below lambda-bar_c, illustrating that commitment failure creates persistent inefficiency even when the technology of recruitment is inexpensive.&lt;/p&gt;
&lt;p&gt;Q: Why does a no-trade equilibrium always exist in the PU scenario but not always in the PO scenario?
A: In the PU scenario, if bidders expect zero participation, they bid zero conditional on being contacted; the seller then has no incentive to recruit, validating the expectation. This equilibrium is self-sustaining for all parameter values (Claim 2). In the PO scenario, the equilibrium refinement (requiring that off-path beliefs not support negative seller payoff at lambda = 0 when trade equilibria exist) rules out no-trade equilibria when lambda-bar_c &amp;gt; lambda_o and s is not too large; specifically, Proposition 2 shows that no-trade equilibrium is unique only when s &amp;gt; s-bar_o or lambda-bar_c &amp;lt; lambda_o.&lt;/p&gt;
&lt;p&gt;Q: What drives the ranking reversal between PO and PU scenarios?
A: The core result is Claim 3: Ro&amp;rsquo;(lambda) &amp;gt; xi(lambda) for all lambda &amp;gt; 0, meaning the marginal incentive to recruit is strictly stronger under PO than PU. This follows from revenue equivalence: Ro&amp;rsquo;(lambda) = xi(lambda) + (d/d lambda-hat) Ru(lambda, beta_{lambda-hat})|_{lambda-hat=lambda}, and the second term is strictly positive because greater expected participation induces more aggressive bidding. For intermediate s and small c, stronger PO recruitment incentives support higher participation and revenue. For small s or large c, those same stronger incentives generate wasteful over-recruitment in PO, making PU preferable.&lt;/p&gt;
&lt;p&gt;Q: How does the paper connect its PO/PU comparison to a comparison of first- and second-price auctions?
A: In any standard auction where the highest-value bidder wins, payoff and revenue equivalence imply that the bidder payoff function U(lambda) and seller revenue Ro(lambda) are identical. In particular, the dominant-strategy equilibrium of the SPA (where bidders bid their true values regardless of participation) generates the same outcomes as the PO equilibrium, because with truthful bidding the observability of participation is irrelevant. Therefore, comparing PO and PU with an FPA is equivalent to comparing the SPA and FPA with unobservable participation. The two formats are not revenue-equivalent when recruitment and entry frictions are present: their ranking depends on s and c in exactly the way described for PO vs. PU.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;sampling curse&amp;rdquo; and how does it cause market unraveling?
A: The sampling curse arises when sellers have privately known recruitment costs. Because a lower-cost seller optimally recruits more bidders, the probability of any given bidder being contacted is higher when the seller has a lower cost. Conditional on being contacted, a bidder therefore believes the seller more likely has a low cost and thus has recruited many competitors, reducing the value of entry. In the binary-type case (Claim 8), if sL is sufficiently small relative to sH, the low-cost seller must recruit so many bidders that entry becomes unattractive; the resulting low q* makes the marginal recruitment cost sH/q* prohibitively high for the high-cost type, driving it out (lambda*_H = 0).&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 3 establish about near-complete unraveling with a continuum of seller types?
A: With seller costs uniformly distributed on [s-bar, s_o], Proposition 3 establishes a threshold c-hat strictly between 0 and c-bar such that: (i) for c in (c-hat, c-bar), as s-bar approaches zero, the fraction of seller types with zero recruitment approaches one — near-complete market unraveling; (ii) for c &amp;lt; c-hat, all seller types remain active regardless of how small s-bar is. This is striking because for any commonly known s in (0, s_o), the PO scenario supports trade for all c &amp;lt; c-bar; unraveling arises purely from the interaction of private cost information and the sampling curse, not from any type&amp;rsquo;s cost being intrinsically too high.&lt;/p&gt;
&lt;p&gt;Q: What does the welfare analysis say about equilibrium efficiency?
A: The welfare-maximizing participation level lambda_w satisfies U(lambda_w) = c + s (equating the marginal bidder&amp;rsquo;s surplus to the full social cost of one more participant), with full entry q_w = 1. In equilibrium under PO, q* &amp;lt; 1 in some cases (wasted recruitment) and lambda* differs from lambda_w for almost all (s, c) pairs — both excessive participation (lambda* &amp;gt; lambda_w) and deficient participation (lambda* &amp;lt; lambda_w) can arise. Full efficiency requires Ro&amp;rsquo;(lambda*) = s and U(lambda*) = s + c simultaneously, but since both U and Ro&amp;rsquo; are independent of s and c as parameters, these equalities generically fail.&lt;/p&gt;
&lt;p&gt;Q: Does the seller benefit from being able to commit to recruitment effort?
A: Claim 10 shows that with observable effort in the PO scenario, the seller commits to gamma-hat = min{lambda-bar_c, lambda_o(s)} when lambda-bar_c &amp;gt;= lambda_o, and to lambda-bar_c (if profitable) when lambda-bar_c &amp;lt; lambda_o. Commitment strictly improves the seller&amp;rsquo;s profit whenever gamma-hat = lambda-bar_c: it enables positive trade when lambda-bar_c &amp;lt; lambda_o and Ro(lambda-bar_c) &amp;gt; lambda-bar_c * s (otherwise impossible without commitment), and it saves recruitment costs when lambda-bar_c &amp;gt; lambda_o and Ro&amp;rsquo;(lambda-bar_c) &amp;gt; s. However, the commitment outcome is always welfare-inefficient: lambda-bar_c &amp;gt; lambda_w whenever s &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;Q: What anecdotal evidence do the authors cite for the model&amp;rsquo;s relevance?
A: Subramanian (2010) and Boone and Mulherin (2004, 2009) show that the majority of merger and acquisition auctions are &amp;ldquo;informal&amp;rdquo; — mixtures of auctions and negotiations rather than structured processes with rules laid out in advance — and that sellers are typically unable to credibly commit to participation levels. Milgrom (2003) states from consulting experience that marketing an auction is often more critical than clever mechanism design. Fees of 20–30% of revenues paid to intermediaries like Christie&amp;rsquo;s and Sotheby&amp;rsquo;s are offered as quantitative evidence of the magnitude of the inefficiencies that such intermediaries reduce. Home repair contracting is cited as a familiar informal-auction setting where both recruitment and entry costs are material.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Recruitment effort (gamma):&lt;/strong&gt; The seller&amp;rsquo;s costly action of contacting potential bidders, modeled as a Poisson process with mean gamma at cost gamma*s; unobservable to bidders in the baseline model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Participation-observable (PO) vs. participation-unobservable (PU) scenarios:&lt;/strong&gt; The two variants of the model; in PO, bidders observe the total number of entrants n before bidding; in PU, they do not observe n and the seller cannot credibly disclose it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minimum profitable scale (lambda_o):&lt;/strong&gt; The smallest positive participation level the seller will ever choose in equilibrium, defined as the value where Ro&amp;rsquo;(lambda_o) equals the peak of the average revenue curve s-bar_o. The seller always recruits either zero bidders or at least lambda_o, due to the initial complementarity of bidders (they stimulate each other&amp;rsquo;s bids) under no-commitment-to-reserve-price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Break-even participation level (lambda-bar_c):&lt;/strong&gt; The maximum participation level at which a bidder&amp;rsquo;s expected gross payoff U(lambda) equals the entry cost c; bidders will not enter if they expect participation above lambda-bar_c.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sampling curse:&lt;/strong&gt; The adverse-selection mechanism arising when sellers have privately known recruitment costs: because low-cost sellers recruit more, a contacted bidder infers the seller is more likely to have a low cost and thus to have recruited many competitors, making entry less attractive and potentially driving higher-cost seller types out of the market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;xi(lambda):&lt;/strong&gt; The seller&amp;rsquo;s marginal revenue with respect to recruitment in the PU scenario, defined as the total derivative of Ru(lambda, beta_{lambda-hat}) evaluated where actual and expected participation coincide (lambda-hat = lambda). Strictly less than Ro&amp;rsquo;(lambda) for all lambda &amp;gt; 0, reflecting that in PU the seller loses the ability to leverage bidder aggression via observable competition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wasteful recruitment:&lt;/strong&gt; The equilibrium phenomenon in which total recruitment cost lambda*(s/q*) remains at the positive constant lambda-bar_c * Ro&amp;rsquo;(lambda-bar_c) even as s approaches zero, because incentive constraints — not technology — pin the equilibrium effort level.&lt;/p&gt;</description></item><item><title>Automated credit limit increases and consumer welfare</title><link>https://macropaperwarehouse.com/papers/automated-credit-limit-increases-and-consumer-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/automated-credit-limit-increases-and-consumer-welfare/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Should regulators restrict banks from proactively raising credit card limits using machine-learning algorithms, and if so, how? The paper asks: to what extent are bank-initiated credit limit increases directed toward revolving borrowers (those who carry interest-accruing balances month-to-month), and what are the welfare consequences of policies that constrain such increases?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis uses the Federal Reserve&amp;rsquo;s Capital Assessments and Stress Testing (Y-14M) regulatory data, January 2014 to December 2024, covering monthly account-level records for all credit cards issued by large stress-tested banks (assets &amp;gt; $100B). The 26 banks in the sample collectively represent more than 70% of U.S. credit card balances. A 0.5% sample yields more than 150 million observations across more than 3.6 million unique active credit cards. A key advantage of Y-14 over credit bureau data is that it identifies whether each limit change was bank-initiated or consumer-initiated — a distinction not available in other datasets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stylized Facts.&lt;/strong&gt; Credit limit increases are an important and understudied source of consumer credit. During the post-pandemic period, limit increases generate more than $40 billion of additional available credit per quarter, roughly 60% of the approximately $70 billion coming from new card originations; prior to the pandemic the figure was about $30 billion, or roughly half of new issuance. The number of accounts undergoing a limit increase each quarter is on average 30% higher than the number of new cards issued. Consistent with &amp;ldquo;low-and-grow&amp;rdquo; lending strategies, limit increases are disproportionately important for lower credit-score borrowers: average subprime credit limits rise from $700 at origination to $2,700 by five years after origination (a 285% increase) and to nearly $5,000 by eight years, while average superprime limits rise only from approximately $12,000 to $15,000 (a 25% increase). About 30% of total revolving balances are made possible by limit increases, with the share reaching 60% for subprime borrowers but only 12% for superprime borrowers. Approximately 75–80% of all limit increases — both by dollar amount and by number of cards — are bank-initiated rather than consumer-initiated. Banks that more frequently reference &amp;ldquo;artificial intelligence&amp;rdquo; or &amp;ldquo;machine learning&amp;rdquo; in their 10-K filings support a larger share of revolving balances through limit increases. Bank-initiated increases are roughly 1.5–2 times more prevalent among accounts that have revolved in the prior three months, whereas consumer-initiated increases show essentially no differential by revolving status.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Analysis.&lt;/strong&gt; Using a linear probability model with card-portfolio-group fixed effects, month fixed effects, and controls for credit score, income, prior limit changes, and other account characteristics, the authors show that the probability of a bank-initiated limit increase follows an inverse-U shape in revolving utilization: accounts with revolving utilization in the moderate range (roughly 0.2–0.7) are most likely to receive an increase, while those near zero or near 1.0 are not. An account with revolving utilization in the (0.2, 0.3] bin is approximately as likely to receive a limit increase as an account whose credit score just rose by 66 points. Transacting utilization, by contrast, follows a logistic growth pattern: the probability rises monotonically until about a utilization of 0.3 and is flat above that. An event study shows that after a bank-initiated limit increase, revolving utilization rebounds to its pre-increase level within approximately 8 months; on average, revolving balances increase by about 40% of the limit increase, with approximately 30% of the limit increase going toward revolving balances. This rebound occurs even for accounts with revolving utilization below the pre-increase mean of 0.28, indicating that the effect is not confined to liquidity-constrained borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a life-cycle consumption–saving model with credit card borrowing, uninsurable income and employment risk, potential default (Chapter 7 style), and heterogeneous preferences following Nakajima (2017) and Gul–Pesendorfer (2001, 2004). Two household types coexist: 60% with standard exponential-discounting preferences (calibrated β = 0.92) and 40% with temptation preferences (calibrated β = 0.96, temptation parameter λ = 0.28 from Kovacs et al., 2021). The credit limit increase function is calibrated using Y-14M data via a latent-variable formulation, replicating the empirical inverted-U relationship between revolving utilization and limit increase probability. The four internally calibrated targets are: share of households with revolving credit card debt (data: 45%, model: 41.8%); utilization rate conditional on debt (data: 35%, model: 28.9%); default probability (data: 0.94%, model: 0.94%); debt-to-income ratio (data: 8.6%, model: 6.8%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Baseline.&lt;/strong&gt; Through the model, tempted agents are disproportionately likely to receive credit limit increases because they are more likely to revolve. For customers with utilization above 50%, the majority of credit limit increases are detrimental from the borrower&amp;rsquo;s own perspective. Standard agents almost always benefit from higher credit limits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual 1 — UK-style (prohibit limit increases for revolving borrowers).&lt;/strong&gt; This policy reduces the annual probability of limit increases from roughly 5.5% to approximately 1.0%. The default probability falls from about 0.9% to near zero. The debt-to-income ratio declines by roughly 2 percentage points. Aggregate welfare improves by 1.12% in consumption equivalent variation (CEV) when the social planner internalizes the psychological cost of temptation (0.98% without). Standard households incur a modest welfare loss of 0.21% from reduced consumption-smoothing flexibility, while tempted households gain approximately 3.12% in CEV.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual 2 — Canada/EU-style (require consumer consent).&lt;/strong&gt; This policy reduces the annual limit-increase probability from 5.5% to approximately 1.9%. Aggregate welfare improves by 1.16% in CEV (1.04% without psychological costs). Standard households lose 0.19%, while tempted households gain approximately 3.19%. Under the baseline assumption of sophisticated tempted households, results are nearly identical to the UK-style policy. However, when the fraction of naïve tempted households is large, the consent-based policy becomes ineffective (naïve consumers accept limit increases they will regret), whereas the UK-style revolving-borrower ban remains welfare-improving regardless of the naïve share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; When the firm is allowed to re-optimize its credit limit increase policy, it endogenously reallocates more limit increases toward standard consumers. Welfare gains remain positive but are attenuated: the UK-style policy yields 0.21% CEV (vs. 1.12% in the baseline calibration) and the consent-based policy yields 0.27% CEV.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Implications.&lt;/strong&gt; The U.S. lacks regulation of bank-initiated proactive credit limit increases (existing rules under ECOA and ability-to-pay provisions are largely non-binding for this purpose). The authors conclude that banks&amp;rsquo; revealed preference for targeting revolvers constitutes an implicit targeting of consumers with self-control issues, and that if a meaningful share of households have self-control issues, there are strong consumer protection grounds for regulating algorithmic credit limit increases.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why do the authors use Y-14M data rather than credit bureau data, and what does this data uniquely enable?&lt;/strong&gt;
A: The Y-14M dataset allows the authors to distinguish between bank-initiated and consumer-initiated credit limit changes — a distinction not observable in credit bureau data. It also contains actual payment information enabling identification of revolvers (those carrying interest-accruing balances) rather than just total balances. The sample covers more than 70% of U.S. credit card balances and more than 150 million monthly observations over the January 2014 to December 2024 period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How large are credit limit increases relative to new card originations in the U.S. credit card market?&lt;/strong&gt;
A: During the post-pandemic period, limit increases produce more than $40 billion of additional available credit per quarter, roughly 60% of the approximately $70 billion created by new card originations. Prior to the pandemic the figure was approximately $30 billion, or about half of new issuance. On a count basis, the number of cards undergoing a limit increase each quarter is on average 30% higher than the number of new cards issued.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the &amp;ldquo;low-and-grow&amp;rdquo; strategy, and how large is the subsequent credit expansion?&lt;/strong&gt;
A: The low-and-grow strategy involves originating higher-risk borrowers at low initial credit limits and then expanding limits based on observed borrowing behavior. For the average subprime credit card, the initial limit of $700 grows to $2,700 by five years after origination (a 285% increase) and to nearly $5,000 by eight years. For superprime borrowers, the initial limit of approximately $12,000 grows only to $15,000 (a 25% increase) by five years and then is approximately unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does a borrower&amp;rsquo;s revolving status affect the probability of receiving a bank-initiated limit increase?&lt;/strong&gt;
A: Bank-initiated increases are approximately 1.5–2 times more prevalent among accounts that have revolved at least once in the prior three months, compared to non-revolving accounts. By contrast, consumer-initiated increases show essentially no differential between revolvers and non-revolvers. This reveals a bank-side revealed preference for targeting revolvers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the shape of the relationship between revolving utilization and the probability of a bank-initiated limit increase, and how large is its economic magnitude?&lt;/strong&gt;
A: The relationship follows an inverted-U shape. Accounts with revolving utilization in bins between approximately 0.2 and 0.7 have the highest probability of receiving an increase; accounts near zero or near full utilization are as unlikely to receive an increase as zero-utilization accounts. The effect of being in the (0.2, 0.3] revolving utilization bin has approximately the same positive effect on the probability of receiving a limit increase as a 66-point increase in credit score, making it economically large relative to standard risk signals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does transacting utilization relate to bank-initiated limit increases, and how does this differ from revolving utilization?&lt;/strong&gt;
A: Transacting utilization follows a logistic growth pattern rather than an inverted-U. The probability of receiving a limit increase rises monotonically with transacting utilization until about a utilization of 0.3, above which the probability does not vary with utilization. This contrasts with revolving utilization, where very high utilization (above 0.9) is actually no more predictive than zero utilization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the event study show about borrowing behavior following credit limit increases?&lt;/strong&gt;
A: After a bank-initiated limit increase, revolving utilization (as a share of the credit limit) drops mechanically but then rebounds to pre-increase levels within approximately 8 months. On average, revolving balances increase by about 40% of the amount of the limit increase, with approximately 30% of each dollar of new credit limit going toward revolving balances. These magnitudes are somewhat larger than the 13% (Gross and Souleles, 2002) and 18% (Aydin, 2022) found in prior work, which the authors attribute to the non-causal nature of their event study, higher average utilization in their sample, and their focus on revolving rather than total utilization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Is the post-increase borrowing rebound driven by liquidity-constrained borrowers?&lt;/strong&gt;
A: No. The authors show that limiting the sample to accounts with revolving utilization below the pre-increase mean of 0.28 — accounts that are unlikely to be liquidity constrained — yields very similar results. This finding is consistent with the presence of self-control issues rather than binding credit constraints.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the key modeling assumptions about household types, and how were the share parameters calibrated?&lt;/strong&gt;
A: The model features two types: 60% with standard exponential-discounting preferences (estimated discount factor β = 0.92) and 40% with temptation preferences (β = 0.96, temptation parameter λ = 0.28 set from Kovacs et al., 2021). The 40% tempted share is internally estimated via the Method of Simulated Moments targeting four aggregate moments: share with revolving credit card debt (45% in data, 41.8% in model), utilization rate conditional on debt (35% vs. 28.9%), default probability (0.94% vs. 0.94%), and debt-to-income ratio (8.6% vs. 6.8%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do tempted and standard households differ in their credit card usage within the model?&lt;/strong&gt;
A: In the model, 76% of tempted agents carry revolving credit card debt, with an average utilization rate of 73.6%, a debt-to-income ratio of 15.4%, and a default probability of 2.22%. Standard agents carry debt only 18.9% of the time, with average utilization of 4.1%, a debt-to-income ratio of 1.1%, and a default probability of 0.08%. Tempted agents also pay a substantially higher share of income on credit card interest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the model capture the mechanism by which credit limit increases harm tempted households?&lt;/strong&gt;
A: The Gul–Pesendorfer temptation utility function makes household welfare depend on both actual consumption and the most tempting consumption alternative available (the budget-set maximum). When credit limits rise, the most tempting alternative ˜c_t increases, which raises the utility cost of self-restraint even for households that do not succumb to temptation. This mechanism is distinct from hyperbolic discounting: temptation imposes a psychic cost even on those who ultimately choose not to over-borrow.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the quantitative welfare effects of the UK-style policy prohibiting limit increases for revolving borrowers?&lt;/strong&gt;
A: The policy yields an overall welfare gain of 1.12% in consumption equivalent variation (CEV) when the social planner internalizes the psychological cost of temptation (0.98% without). Standard households suffer a modest welfare loss of 0.21% from reduced consumption-smoothing flexibility. Tempted households gain approximately 3.12% in CEV, because the benefit from reduced temptation and lower interest expenditure outweighs the cost of reduced credit access.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What are the quantitative welfare effects of the Canada/EU-style consent-required policy?&lt;/strong&gt;
A: The consent-based policy yields an overall welfare gain of 1.16% in CEV (1.04% without psychological costs). Standard households lose 0.19%, and tempted households gain approximately 3.19%. Under the baseline assumption of fully sophisticated tempted households, results are nearly identical to the UK-style ban.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: How sensitive are the two policy counterfactuals to the share of naïve (unaware of their self-control issues) tempted households?&lt;/strong&gt;
A: The UK-style ban on limit increases for revolving borrowers remains welfare-improving regardless of whether tempted households are sophisticated or naïve — the welfare impact is approximately flat as the naïve fraction rises from zero to one. The consent-based policy, by contrast, exhibits a negative linear relationship between the naïve fraction and welfare impact, with welfare gains disappearing as the naïve fraction approaches one. Naïve consumers accept limit increases they would regret, so the policy&amp;rsquo;s effectiveness depends on households accurately recognizing their own self-control issues.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What happens when the firm is allowed to re-optimize its credit limit increase policy in response to regulation?&lt;/strong&gt;
A: With firm re-optimization, both counterfactual policies continue to improve welfare but the magnitudes are attenuated. The UK-style policy yields 0.21% CEV overall (tempted: 0.89%) and the consent-based policy yields 0.27% overall (tempted: 0.98%), compared to 1.12% and 1.16% without re-optimization. The re-optimizing firm reallocates more limit increases toward standard consumers, which reduces the number directed at tempted households but also limits the welfare gains from regulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q16: What do lenders&amp;rsquo; 10-K filings reveal about the role of AI/ML in targeting revolvers for limit increases?&lt;/strong&gt;
A: Banks that mention &amp;ldquo;artificial intelligence&amp;rdquo; or &amp;ldquo;machine learning&amp;rdquo; above the median number of times in their 2024 10-K filings support a higher share of revolving balances through credit limit increases, for all credit score groups. This difference is not driven by differences in credit limits at origination between higher-AI and lower-AI lenders, suggesting that AI/ML adoption affects the targeting of limit increases toward revolvers rather than the initial credit allocation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Revolving utilization.&lt;/strong&gt; In this paper, revolving utilization is defined as the portion of overall credit card utilization attributable to balances that the borrower carries from one month to the next without full repayment, thereby accruing interest. It is measured as revolving balances divided by credit limit, averaged over the prior three months. This is distinct from transacting utilization (new purchases as a share of limit) and is the primary signal banks use — implicitly, via their algorithms — to select accounts for proactive limit increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank-initiated vs. consumer-initiated credit limit increase.&lt;/strong&gt; A bank-initiated limit increase is one in which the lender proactively raises a borrower&amp;rsquo;s credit limit without a request from the borrower. A consumer-initiated increase is one explicitly requested by the borrower. The Y-14M data uniquely identify the source of each change. The paper documents that approximately 75–80% of all limit increases are bank-initiated, and that bank-initiated increases are strongly correlated with revolving utilization whereas consumer-initiated increases are not.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Low-and-grow strategy.&lt;/strong&gt; The practice of originating higher-risk borrowers at low initial credit limits and then expanding those limits over time based on observed borrowing behavior. In the paper this is a documented empirical pattern, not an assumption: subprime accounts start at an average $700 limit at origination and reach nearly $5,000 by eight years, a 285% increase versus only 25% for superprime accounts over the same horizon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Temptation preferences (Gul–Pesendorfer).&lt;/strong&gt; A utility framework in which household welfare depends not only on actual consumption but also on the most tempting consumption alternative within the budget set. The disutility from temptation arises even when the household does not succumb — it reflects the psychological cost of self-restraint. In the paper, λ (set to 0.28) parameterizes the weight of this temptation cost relative to standard utility. Temptation preferences are time-consistent, which facilitates welfare analysis, and are preferred to hyperbolic discounting in this setting because they predict that individuals may pay to have tempting options removed even without acting on them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revealed preference for targeting revolvers.&lt;/strong&gt; The paper&amp;rsquo;s characterization of banks&amp;rsquo; credit limit increase behavior as reflecting a systematic preference for giving increases to revolving borrowers, inferred from the empirical pattern in the Y-14M data (the inverted-U shape between revolving utilization and limit increase probability). Because banks&amp;rsquo; algorithms are proprietary and unobserved, the paper interprets the observed allocation of limit increases as a revealed preference, consistent with banks&amp;rsquo; profit motive since revolvers generate the majority of credit card interest income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption equivalent variation (CEV).&lt;/strong&gt; The welfare metric used throughout the paper&amp;rsquo;s counterfactual analysis. CEV is defined as the percentage change in consumption in every period and state that would make households indifferent between the baseline policy regime and the counterfactual policy. A positive CEV indicates that the counterfactual policy improves welfare; a negative CEV indicates harm. The paper considers two versions: one in which the social planner internalizes the psychological cost of temptation (consistent with tempted households&amp;rsquo; actual preferences), and one in which the planner ignores that cost (λ = 0 for the planner) but households still face temptation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Persistent revolving debt (UK regulatory definition).&lt;/strong&gt; In the UK Financial Conduct Authority&amp;rsquo;s framework, a borrower is considered in &amp;ldquo;persistent revolving debt&amp;rdquo; when the cumulative amount paid toward interest and fees exceeds the cumulative amount of principal repaid over a 12-month period. The UK rule prohibits lenders from increasing credit limits for borrowers meeting this definition. The paper models a stylized version: any account currently carrying a revolving balance is ineligible for a bank-initiated limit increase in the UK-style counterfactual.&lt;/p&gt;</description></item><item><title>Automation and Rent Dissipation</title><link>https://macropaperwarehouse.com/papers/automation-and-rent-dissipation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/automation-and-rent-dissipation/</guid><description>&lt;p&gt;Acemoglu and Restrepo examine the effects of automation in economies where labor market distortions cause some workers to earn rents—wages above their opportunity cost or outside option. The central question is how the interplay between automation and these distortions shapes wages, inequality, and productivity. The paper makes three contributions: a theoretical framework identifying a rent dissipation mechanism, reduced-form empirical evidence using US data from 1980 to 2016, and a general equilibrium quantification of automation&amp;rsquo;s aggregate effects.&lt;/p&gt;
&lt;p&gt;The theoretical framework extends the task model of Acemoglu and Restrepo (2022) to incorporate task-specific wage wedges. In this setup, a firm employing labor of type g in task x pays a wage equal to the base wage multiplied by an exogenous wedge capturing rents from efficiency wages, bargaining, licensing, regulations, or norms. Because these wedges artificially inflate labor costs in high-rent tasks, firms have a stronger incentive to automate precisely those tasks—automation saves more in labor costs where rents are highest. Proposition 3 establishes that endogenous adoption decisions are tilted toward high-rent tasks: the rent distribution in automated tasks first-order stochastically dominates the rent distribution across all tasks. This targeting generates the rent dissipation mechanism. The equilibrium is inefficient on both the intensive margin (too little employment in high-rent tasks) and the extensive margin (excessive automation of high-rent tasks that a social planner would prefer to keep labor-intensive).&lt;/p&gt;
&lt;p&gt;The rent dissipation mechanism has three consequences identified theoretically. First, it amplifies average wage losses for exposed groups beyond what displacement alone would produce, pushing displaced workers toward lower-paying jobs. Second, it compresses within-group wage dispersion by concentrating losses at higher percentiles of the within-group distribution, generating a U-shaped pattern of wage changes: workers at low percentiles earn no rents and experience only base-wage adjustments, while workers between the 70th and 95th percentiles face the steepest declines due to loss of high-rent jobs. Third, it is inefficient: because the tasks targeted by automation are not those where wages reflect scarcity or skill but rather distortionary rents, a planner would have preferred more labor allocated to these tasks, and rent dissipation offsets part or all of the cost-saving productivity gains from automation.&lt;/p&gt;
&lt;p&gt;The empirical analysis covers 500 detailed demographic groups defined by education (five levels), gender, five age groups, five race/ethnicity groups, and nativity. Task displacement is measured as a weighted sum of industry-level automation exposure using three proxies: adjusted industrial robot penetration, specialized software services, and dedicated machinery in value added. Workers in the middle and lower-middle of the wage distribution lost 15–20% of their tasks to automation between 1980 and 2016, while post-college workers saw few tasks automated.&lt;/p&gt;
&lt;p&gt;A 10 percentage point increase in task displacement is associated with a 24% decline in group-level relative wages (β = −2.36, s.e. = 0.13), falling to 19% after controlling for gender, education, sectoral demand, and rent shifters (β = −1.90, s.e. = 0.29). The U-shaped pattern in within-group wage changes is clearly visible: wages decline by 25–30% per 10 percentage point task displacement at the 70th–90th percentiles, compared to only 16% at the 5th–40th percentiles. Decomposing the average wage effect, the base-wage component is β = −1.53 (s.e. = 0.33) and the rent-dissipation component is β = −0.37 (s.e. = 0.11), implying a rent dissipation rate of approximately 37%. Across multiple proxies for rents—inter-industry/occupation wage differentials, wage losses after job displacement, and quit rates—the average estimated rent dissipation rate is approximately 35%. Rent dissipation accounts for one-fifth of the overall relative wage decline experienced by groups exposed to automation.&lt;/p&gt;
&lt;p&gt;In the general equilibrium quantification (with elasticity of substitution λ = 0.5, average cost savings π = 30%, and average rent in automated tasks of 35%), automation accounts for 52% of the rise in between-group wage inequality since 1980: 42 percentage points via baseline displacement effects on labor demand, and 10 percentage points via rent dissipation. Cost savings from automation increased TFP by approximately 3% between 1980 and 2016, but inefficient rent dissipation offsets 60–90% of these gains, leaving net TFP gains of only 0.3–1.3% and net aggregate consumption gains of only 0.45–1.95% over the 36-year period.&lt;/p&gt;
&lt;p&gt;Q: What is the rent dissipation mechanism, and why does it arise?
A: Rent dissipation arises because labor market wedges make high-rent tasks artificially costly to staff with workers, giving firms a stronger incentive to automate precisely those tasks. When automation displaces workers from high-rent jobs, workers lose the premium above their opportunity cost that those jobs paid, amplifying wage losses beyond what displacement alone would cause. The mechanism is endogenous: firms do not randomly automate tasks but disproportionately target tasks where rents are highest, since doing so saves the most in labor costs. Proposition 3 formalizes this as first-order stochastic dominance of the rent distribution in automated tasks over the rent distribution in all tasks.&lt;/p&gt;
&lt;p&gt;Q: Why is rent dissipation inefficient?
A: In a distorted economy, high-rent tasks already feature too little employment at the equilibrium—firms under-hire in these tasks because the wage wedge makes labor artificially expensive. A social planner would want to allocate more labor to these tasks, not less. When automation further removes labor from high-rent tasks, it moves the economy further from the efficient allocation, dissipating rents that reflect distortions rather than true scarcity. The TFP formula shows that this inefficient targeting offsets part or all of the cost-saving gains from automation, and can even reduce aggregate productivity if the cost savings are small relative to the rent losses.&lt;/p&gt;
&lt;p&gt;Q: What is the U-shaped pattern of within-group wage changes, and what does it indicate?
A: The U-shaped pattern means that wage declines due to automation are smallest at the bottom percentiles of a group&amp;rsquo;s within-group wage distribution, largest in the 70th–95th percentile range, and then smaller again at the very top. Workers at low percentiles earn no rents, so they experience only the base-wage adjustment from reduced labor demand. Workers in the middle-upper range of the distribution hold the high-rent jobs that are disproportionately automated, so they lose both the base-wage component and the rent component of their wages. This pattern is directly visible in US data 1980–2016, with declines of 25–30% per 10 percentage point task displacement at the 70th–90th percentiles versus 16% at the 5th–40th percentiles.&lt;/p&gt;
&lt;p&gt;Q: How is task displacement measured, and which groups are most exposed?
A: Task displacement is measured as a weighted sum of industry-level automation exposure, accounting for each demographic group&amp;rsquo;s specialization in routine tasks within industries. Three proxies are used: the adjusted penetration of industrial robots, the increase in specialized software services, and the increase in dedicated machinery in value added. Workers in the middle and lower-middle of the wage distribution—broadly corresponding to non-college workers—lost 15–20% of their tasks to automation between 1980 and 2016. Post-college degree workers saw few tasks automated.&lt;/p&gt;
&lt;p&gt;Q: How large is the rent dissipation rate, and how robust is this estimate?
A: The baseline estimate from the U-shaped within-group wage change decomposition implies a rent dissipation rate (μ_Ag/μ_g − 1) of approximately 37% (β = −0.37, s.e. = 0.11). Using inter-industry and occupation wage differentials as a proxy for rents, the estimate is 39% (β = −0.39, s.e. = 0.11). Using wage losses after job displacement, the estimate is 20% (β = −0.20, s.e. = 0.04). After purging compensating differentials from the wage differential proxy the estimate remains 37%; after purging from the displacement-loss proxy it falls to 19%. Quit-rate evidence is consistent with rent dissipation: automation shifts workers toward higher-quit-rate jobs, which are lower-rent jobs. The average across proxies is approximately 35%.&lt;/p&gt;
&lt;p&gt;Q: How much of between-group wage inequality since 1980 does automation explain, and what share is due to rent dissipation specifically?
A: Automation accounts for 52% of the rise in between-group wage inequality in the US since 1980. Of this 52 percentage points, 42 percentage points are attributable to the baseline displacement effect working through reduced labor demand for exposed groups. The remaining 10 percentage points are attributable to rent dissipation—automation pushing exposed groups away from high-rent tasks into lower-paying employment. Rent dissipation thus accounts for roughly one-fifth (10/52) of automation&amp;rsquo;s total contribution to between-group inequality.&lt;/p&gt;
&lt;p&gt;Q: How large are the productivity gains from automation, and how much does rent dissipation offset them?
A: Cost savings from automation increased TFP by approximately 3% between 1980 and 2016. However, inefficient rent dissipation offsets 60–90% of these gains, because automation disproportionately targets high-rent tasks rather than tasks where the efficiency case is strongest. The net TFP increase attributable to automation is only 0.3–1.3% over the 36-year period, and the corresponding net increase in aggregate consumption is only 0.45–1.95%.&lt;/p&gt;
&lt;p&gt;Q: How does automation affect within-group versus between-group inequality, and why is this notable?
A: Automation increases between-group inequality by reducing relative wages of exposed groups (largely non-college workers) relative to unexposed groups, accounting for 52% of the rise in between-group inequality since 1980. At the same time, automation reduces within-group wage dispersion for exposed groups by compressing wages at higher percentiles. This contrasts with the standard view that inequality is fractal—rising at all levels of aggregation due to skill-biased demand—and helps explain why within-group inequality has risen steadily for college workers since the 1980s while remaining flat and then declining for non-college workers since the 1990s.&lt;/p&gt;
&lt;p&gt;Q: What do the propagation matrix and rent-impact matrix represent in the general equilibrium analysis?
A: The propagation matrix encodes how task reallocation due to automation in one demographic group creates competition for marginal tasks across other groups, transmitting the wage effects of automation to groups not directly displaced. The rent-impact matrix encodes how this task reallocation changes the rent composition of employment across groups. Both matrices are estimated from US data on task shares and group-level wage elasticities and are used to translate partial-equilibrium estimates of task displacement and rent dissipation into general equilibrium effects on wages and productivity for all demographic groups simultaneously.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of inefficient rent dissipation?
A: Because rent dissipation is inefficient, the social value of automation is lower than what firms and consumers are willing to pay—firms capture all the labor cost savings but do not internalize the welfare cost of destroying high-rent jobs that the distorted equilibrium already under-supplies. Second-best interventions should address the underlying distortions generating rents rather than trying to slow automation directly. The paper suggests that strengthening labor market institutions supporting worker rents in non-automatable tasks could partially counteract the adverse distributional consequences of automation.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to Bound and Johnson (1992) and Borjas and Ramey (1995)?
A: Bound and Johnson (1992) decompose changes in the US wage structure between 1979 and 1988 into technology, supply, and rent components (modeled as exogenous industry wedges), finding that 10–20% of between-group wage changes reflect rent losses. Borjas and Ramey (1995) estimate that trade increased the college premium by 1.3–2.6 log points between 1976 and 1990, with 15–33% due to loss of rents from trade-exposed jobs. Both are comparable to this paper&amp;rsquo;s finding that rent dissipation accounts for one-fifth of the wage effect of automation, though Bound and Johnson&amp;rsquo;s estimates include all factors affecting rents while this paper isolates automation specifically.&lt;/p&gt;
&lt;p&gt;Worker rents: Wages above a worker&amp;rsquo;s opportunity cost or outside option, arising from efficiency wages, bargaining, licensing, regulations, or norms. Modeled as task-specific multiplicative wedges (μ_gx ≥ 1) that force firms to pay more than the base wage for labor in particular tasks. Explicitly excludes compensating differentials and skill premia.&lt;/p&gt;
&lt;p&gt;Rent dissipation: The loss of above-opportunity-cost wages experienced by workers displaced from high-rent tasks into lower-paying employment. Occurs because automation endogenously targets high-rent tasks where labor is most expensive, and pushes workers into tasks where rents are lower. Quantified as the ratio of average rents in automated tasks to average rents across all tasks, minus one (approximately 35% in US data 1980–2016).&lt;/p&gt;
&lt;p&gt;Task displacement: The share of tasks performed by a demographic group that are automated away, measured as a weighted sum of industry-level automation exposure accounting for the group&amp;rsquo;s specialization in routine tasks. Distinct from employment loss because it captures reallocation of tasks from labor to capital within the production function.&lt;/p&gt;
&lt;p&gt;U-shaped within-group wage change profile: The pattern whereby automation generates the largest wage declines at intermediate-to-upper percentiles (70th–95th) of an exposed group&amp;rsquo;s within-group wage distribution, with smaller declines at the bottom, because high-percentile workers disproportionately hold high-rent jobs targeted by automation. Predicted theoretically and confirmed empirically in US data 1980–2016.&lt;/p&gt;
&lt;p&gt;Propagation matrix: A matrix estimated from US data on task shares and group-level wage elasticities that encodes how automation of tasks performed by one demographic group creates competition for marginal tasks with other groups, transmitting wage effects across the demographic distribution in general equilibrium.&lt;/p&gt;
&lt;p&gt;Inefficient automation targeting: The mechanism by which labor market distortions cause firms to automate high-rent tasks that a social planner would prefer to keep labor-intensive, since the distorted equilibrium already features too little employment in those tasks. Results in rent dissipation offsetting 60–90% of automation&amp;rsquo;s direct TFP gains from cost savings.&lt;/p&gt;
&lt;p&gt;Rent-impact matrix: A matrix that encodes how task reallocation due to automation changes the rent composition of employment across demographic groups, used alongside the propagation matrix to compute general equilibrium effects of automation on wages and productivity accounting for distortions.&lt;/p&gt;</description></item><item><title>Bank Information Production Over the Business Cycle</title><link>https://macropaperwarehouse.com/papers/bank-information-production-over-the-business-cycle/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bank-information-production-over-the-business-cycle/</guid><description>&lt;h2 id="bank-information-production-over-the-business-cycle"&gt;Bank Information Production Over the Business Cycle&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Banks produce private information about borrowers that is inherently unobservable to outside researchers. Howes and Weitzner ask whether the quality of this private information is countercyclical — that is, whether banks invest more in learning about borrowers when local economic conditions deteriorate — and whether any such cyclicality reflects endogenous information production incentives rather than exogenous changes in the information environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses the Federal Reserve&amp;rsquo;s Y-14Q Schedule H.1 confidential regulatory data, which covers commercial and industrial (C&amp;amp;I) loans exceeding $1 million originated by bank holding companies with $50 billion or more in total assets. This universe covers 85.9% of all banking sector assets and approximately 70% of all C&amp;amp;I loan volume (as documented by Bidder, Krainer, and Shapiro (2020)). A distinctive feature is that qualifying banks must report their internal probability of default (PD) estimates for each loan to the Federal Reserve. The sample is restricted to newly originated loans from 2014Q4 through 2019Q1 — the window over which PD data are well populated — with at least one year of subsequent observation to allow defaults to materialize. The outcome variable is a binary default indicator equal to one if the borrower defaults within two years of origination (0.41% of firms in the sample).&lt;/p&gt;
&lt;p&gt;The measure of information quality is defined as the OLS coefficient on PD when regressing realized default on the bank&amp;rsquo;s internal PD estimate. A larger coefficient indicates that the bank&amp;rsquo;s private risk assessment carries more predictive content for realized default outcomes, above and beyond observable firm and loan characteristics. The authors identify cyclical effects by exploiting cross-sectional variation in county-level unemployment rates across the US at each point in time, controlling for bank-by-quarter fixed effects (to absorb supply-side bank-level factors), industry-by-quarter fixed effects, and bank-by-county fixed effects. The key interaction is between PD and the local unemployment rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper establishes three main results:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Banks&amp;rsquo; PDs predict default and contain private information.&lt;/strong&gt; Even after controlling for firm size, leverage, profitability, tangibility, log loan size, loan maturity, loss given default (LGD), loan type fixed effects, bank-quarter fixed effects, and industry-quarter fixed effects, PD remains a statistically and economically significant predictor of realized default. A one-percentage-point increase in PD increases the probability of default by approximately 25 basis points (coefficient of 0.245).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Information quality is countercyclical.&lt;/strong&gt; A one-percentage-point increase in the local county unemployment rate increases the sensitivity of realized default to PD by approximately 8 basis points — roughly one-third of the average unconditional PD coefficient. When the unemployment rate is above a county&amp;rsquo;s median, the PD coefficient is approximately three times as large as during low-unemployment periods. Correspondingly, during high-unemployment periods, the total R-squared of a regression predicting default from observable firm and loan characteristics falls (from 0.311 to 0.264 — an 18% decline), while the marginal contribution of PD to the R-squared increases. This pattern is consistent with observable characteristics doing a worse job at predicting default in bad times, which in turn incentivizes banks to invest more in their internal risk assessments.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The cyclicality is driven by newly originated loans and more information-sensitive loans.&lt;/strong&gt; The triple interaction between PD, the new-loan indicator, and the unemployment rate is positive and statistically significant across all specifications; the interaction between PD and unemployment for previously issued (non-new) loans is consistently less than half the size of the triple interaction term. The cyclical sensitivity also decreases by more than 0.1 (against a base of 0.08) in the year after origination and continues to fall over the loan&amp;rsquo;s life. Additionally, a one-standard-deviation increase in log loan size (approximately 1.29) increases the sensitivity of realized default to PD by about 0.085 — roughly one-quarter of the unconditional effect — and a one-standard-deviation increase in LGD (0.158) increases the PD coefficient by 0.098, or about one-third of the unconditional effect. Both the loan-size and LGD interactions are amplified when the local unemployment rate is high, consistent with Dang, Gorton, and Holmstrom (2012). The cyclical sensitivity of information quality is statistically significant only for firms in nontradeable industries (e.g., utilities, construction, retail, professional services), not for tradeable-sector firms.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are conditional on: large US bank holding companies ($50bn+ in assets) lending to non-financial, non-public domestic corporate borrowers with at least $100k in reported assets; a sample period from 2014Q4 to 2019Q1, covering a predominantly expansionary phase of the US business cycle; and county-level rather than aggregate time-series variation in economic conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Implications&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Countercyclical information production implies that bank lending stimulus policies — including interest rate cuts, liquidity facilities, and asset purchase programs — may be less effective in recessions because banks simultaneously increase screening intensity. The marginal borrowers who gain access to credit from stimulus will differ across states of the cycle: in downturns, banks grant credit to fewer but higher-quality firms, so the incremental impact of expanding the credit supply on the number and type of firms funded may be attenuated. The authors connect this mechanism to prior empirical evidence that monetary policy is less effective in recessions (Tenreyro and Thwaites (2016)) and to LTRO and QE program evidence showing no increase in lending to riskier firms.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the precise definition of &amp;ldquo;bank information quality&amp;rdquo; used in this paper, and why is this measure preferred over alternatives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Information quality is defined as the OLS coefficient β on the bank&amp;rsquo;s internal PD estimate when predicting realized two-year default in a regression that also includes firm and loan characteristics and a rich set of fixed effects. A higher coefficient indicates that the bank&amp;rsquo;s private risk assessment contains more predictive content for actual default beyond what is captured by observable firm and loan characteristics. This approach is preferred because it directly quantifies the marginal information content of the bank&amp;rsquo;s private assessment and can be estimated at the loan level using the cross-sectional variation in county-level economic conditions, rather than relying on aggregate time-series variation that would confound bank supply-side factors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors establish that the PD estimates contain genuine private information rather than merely reflecting publicly observable characteristics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Column (1) of Table 3 shows a PD coefficient of 0.245 in a regression predicting default without controls. Columns (2) and (3) add firm and loan characteristics (size, leverage, profitability, tangibility, log loan size, maturity, LGD, and loan type fixed effects) plus bank-quarter, industry-quarter, and bank-county fixed effects, and also add the interest rate as an additional control; the PD coefficient remains statistically and economically significant across all specifications. This demonstrates that PD retains predictive power for realized default even after absorbing all variation captured by observable firm-level fundamentals and pricing signals, implying the PD estimate contains private information not contained in observables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the baseline magnitude of the cyclicality finding, and how is it identified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A one-percentage-point increase in the county-level unemployment rate increases the PD coefficient by approximately 8 basis points (Table 5, Column 1). This represents about one-third of the average unconditional PD coefficient estimated in Section 3.1. Identification uses bank-by-quarter fixed effects so that the effect is estimated by comparing two loans made by the same bank at the same time to borrowers in counties with different unemployment rates, ruling out bank-level supply-side confounders such as changes in a bank&amp;rsquo;s cost of capital or risk appetite.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the split-sample analysis (above/below county-median unemployment) further characterize the cyclicality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Columns (3) and (4) of Table 4 show that, when predicting default with PD alone (no controls), the PD coefficient is approximately three times as large during high-unemployment periods as during low-unemployment periods, and the R-squared is substantially higher for high-unemployment observations. The R-squared from a regression of default on observable controls alone is 17.8% higher when unemployment is low (0.311 versus 0.264), while the marginal contribution of PD to the R-squared is higher when unemployment is high (going from 0.264 to 0.267, versus 0.311 to 0.313). This pattern — observables explain less but PD explains more in bad times — is consistent with information frictions being more severe in downturns, which in turn raises banks&amp;rsquo; incentives to invest in private information production.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How do the authors distinguish endogenous information production from a purely exogenous improvement in information quality during downturns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three tests are designed to be difficult to rationalize under a purely exogenous information channel. First, the cyclicality is concentrated in newly originated loans: the triple interaction term (PD × unemployment × new-loan indicator) is positive and statistically significant, while the PD × unemployment interaction for previously originated loans is less than half the size of the triple interaction. If information quality improved exogenously during downturns, there is no clear reason why this improvement would be far larger for loans where the bank is making a new capital commitment. Second, the cyclicality declines by more than 0.1 (relative to a base of 0.08) in the year after origination and continues to fall — simultaneously, the unconditional predictive power of PD increases over the loan life. This divergence is inconsistent with a purely exogenous mechanism. Third, the cyclical sensitivity is concentrated in loans that theory (Dang, Gorton, and Holmstrom (2012)) predicts to have higher information production incentives: larger loans, higher-LGD loans, and loans to nontradeable-sector borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do loan characteristics (size and LGD) relate to information quality, and how does this relationship evolve over the business cycle?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 7 shows that a one-standard-deviation increase in log loan size (approximately 1.29) increases the sensitivity of realized default to PD by about 0.085, or roughly one-quarter of the unconditional PD coefficient. A one-standard-deviation increase in LGD (0.158) increases the PD coefficient by 0.098, or about one-third of the unconditional effect. Table 8 shows that both of these interaction coefficients have the same sign and are amplified during periods of high unemployment, consistent with Dang, Gorton, and Holmstrom (2012)&amp;rsquo;s prediction that information production decisions become more sensitive to loan features following negative aggregate shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the tradeable versus nontradeable industry test contribute?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because nontradeable-sector firms (utilities, construction, retail, transportation, accommodation, food services, information and communication, professional services) are more likely to depend on local demand, the same change in the county-level unemployment rate will have a larger impact on their default probability. Table 9 shows that the cyclical sensitivity of PD&amp;rsquo;s predictive power — the PD × unemployment interaction — is statistically significant only for nontradeable-sector firms, not for firms in tradeable industries. This provides additional evidence that the mechanism operates through local economic conditions affecting borrower riskiness in a way that raises information production incentives, rather than through some aggregate or bank-level mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Do composition effects (changes in the pool of borrowers) account for the main findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 11 shows that observable loan characteristics — average loan size, interest rate, LGD, and maturity — do not vary meaningfully with the local unemployment rate. Realized default rates increase slightly with unemployment but the effect is not statistically significant. The PD itself increases by only about 3 basis points for a one-percentage-point increase in unemployment (significant only at the 10% level). Loan volume declines: a one-standard-deviation increase in the unemployment rate (1.3 percentage points) leads to a 1.6% decrease in loan volume and a 5.46% decrease in the number of loans. The minimal variation in the risk profile of loans actually granted suggests that composition effects in the pool of approved borrowers are unlikely to explain the main result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the implications of countercyclical information production for monetary policy transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When unemployment is high, banks screen potential borrowers more intensively, which changes the composition of firms that gain access to credit. Policies designed to expand credit supply — interest rate cuts, liquidity facilities, asset purchase programs — face a more heavily screened pool of potential recipients during downturns. This means the marginal firms that receive additional credit following a stimulus in a recession will be of higher quality than the marginal recipients in an expansion, implying the credit transmission of monetary policy reaches a different — and potentially smaller — set of firms in recessions. The authors connect this to Tenreyro and Thwaites (2016)&amp;rsquo;s finding that monetary policy is less effective in recessions, and to evidence from the Eurosystem&amp;rsquo;s LTRO program that aggregate lending rose but lending to riskier firms did not, and to UK QE evidence finding no stimulation of bank lending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does this paper differ from the most closely related prior study (Becker, Bos, and Roszbach (2020))?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Becker, Bos, and Roszbach (2020) also find that bank credit ratings predict default better in bad economic times, using data from a single Swedish bank and relying on aggregate time-series variation. The present paper differs in three ways. First, it uses cross-sectional variation across US counties within each time period, exploiting bank-by-quarter fixed effects to rule out bank supply-side confounders. Second, it uses loan-level rather than firm-level data, enabling the analysis of how loan characteristics (size and LGD) interact with information quality and cyclicality. Third, Becker, Bos, and Roszbach interpret the cyclicality as exogenous; Howes and Weitzner provide evidence against this interpretation — specifically, the concentration in newly originated loans and in loans with characteristics that theoretical models predict should generate higher endogenous information production.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Bank Information Quality (as used in this paper)&lt;/strong&gt;
The size of the OLS coefficient on a bank&amp;rsquo;s internal probability of default (PD) estimate in a regression predicting realized loan default. A larger coefficient means the bank&amp;rsquo;s private risk assessment carries more predictive content for actual default beyond observable firm and loan characteristics. It is a measure of how much private information the PD encodes about borrower risk, not a measure of accuracy in an absolute sense.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Probability of Default (PD) — Y-14Q Internal Estimate&lt;/strong&gt;
Banks&amp;rsquo; own model-based estimate of each corporate borrower&amp;rsquo;s likelihood of defaulting, reported confidentially to the Federal Reserve under Y-14Q Schedule H.1 filings. In the paper, PD is used as the observable proxy for the bank&amp;rsquo;s private risk assessment; its predictive power for realized default is the object being studied, not the PD level itself.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Countercyclical Information Production&lt;/strong&gt;
The property that banks&amp;rsquo; incentives to invest in learning about borrower quality increase as economic conditions deteriorate. In the theoretical literature the paper tests empirically, the returns to distinguishing between borrower types rise in downturns (because the distribution of borrower quality widens and the consequences of adverse selection increase), inducing banks to produce more private information at loan origination. The paper uses &amp;ldquo;information quality is countercyclical&amp;rdquo; to mean that the predictive content of PD for realized default is higher when the local unemployment rate is higher.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Information Sensitivity (of a loan)&lt;/strong&gt;
The degree to which the value of a loan depends on information that is privately held by potential borrowers. Following Dang, Gorton, and Holmstrom (2012), loans are more information-sensitive when they are larger (larger potential loss from adverse selection) or when they have higher loss given default (lower expected recovery value). The paper uses loan size and LGD as proxies for information sensitivity and tests whether banks invest more in information about higher-information-sensitivity loans.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Loss Given Default (LGD)&lt;/strong&gt;
The bank&amp;rsquo;s estimate of the fraction of the loan&amp;rsquo;s value that would be lost if the borrower defaults, reflecting the expected recovery value of collateral and other loan features. In the paper, higher LGD (lower recovery) is a proxy for higher information sensitivity, since the consequences of lending to a bad borrower are larger when recovery is low.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank-by-Quarter Fixed Effects&lt;/strong&gt;
A set of fixed effects that absorbs all variation in outcomes attributable to a particular bank at a particular point in time. In the context of this paper, including bank-by-quarter fixed effects means the cyclicality results are identified from variation across counties for loans made by the same bank in the same quarter, ruling out supply-side explanations such as changes in a bank&amp;rsquo;s cost of capital, risk appetite, or credit standards that affect all of its loans uniformly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous versus Exogenous Information Quality&lt;/strong&gt;
A core distinction in the paper. Exogenous information quality would mean banks passively receive more precise signals about borrowers during downturns regardless of their investment in screening. Endogenous information quality means banks actively choose to invest more in information production during downturns because the returns to distinguishing borrower types are higher. The paper argues its results — especially the concentration of cyclical effects in newly originated loans and in loans with characteristics that theory predicts should generate higher screening incentives — are consistent with the endogenous channel and are difficult to rationalize under a purely exogenous mechanism.&lt;/p&gt;</description></item><item><title>Bargaining and Inequality in the Labor Market</title><link>https://macropaperwarehouse.com/papers/bargaining-and-inequality-in-the-labor-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bargaining-and-inequality-in-the-labor-market/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How prevalent is individual wage bargaining in the labor market, what determines firms&amp;rsquo; bargaining strategies, how do bargaining encounters unfold for workers, and does heterogeneity in bargaining behavior translate into wage inequality—including the gender wage gap?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting.&lt;/strong&gt; The paper develops and validates novel linked survey data for Germany. A firm survey was fielded by the ifo Institute to senior HR professionals and managers in two waves (September 2021 and January 2022), yielding 772 complete responses across all major sectors and regions. These responses were linked—with consent obtained from 72% of firms—to German Social Security records (the Integrated Employment Biographies, IEB) covering 416,821 full-time employees at matched firms in 2020, and to Orbis balance sheet data for firm productivity proxies. A separate worker survey was fielded by the IAB to 135,000 full-time German workers, with 9,756 completing it; nearly 10,000 responses were used for analysis, with 7,079 workers employed at surveyed firms. The worker survey elicited detailed bargaining histories for workers who had received an outside offer in the prior six months, bargaining at the start of current employment (for workers with tenure of three years or less), and responses to a hypothetical salary expectation scenario.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Definition of Individual Bargaining.&lt;/strong&gt; The authors define a firm as having a &amp;ldquo;bargaining strategy&amp;rdquo; if it differentiates pay between workers in the same position it perceives to have similar productivity—encompassing both variation in initial offers (which may reflect firms using information on workers&amp;rsquo; salary expectations) and back-and-forth negotiation. Elicitation distinguishes four employee groups (recent labor market entrants, experienced non-managers, managers, and bottleneck-occupation workers) and two contexts (new external hires and incumbent workers who receive an outside offer).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Prevalence of Bargaining.&lt;/strong&gt; Approximately 50% of surveyed firms are willing to differentiate base wages for recent labor market entrants, more than 80% for experienced non-managers and managers, and nearly all for workers in bottleneck occupations they are struggling to fill. For incumbent workers facing outside offers, 57% of firms would increase pay for recent entrants, and more than 80% for experienced incumbents, managers, and bottleneck workers. In total, 80% of workers in the sample are in positions where individual bargaining is possible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Magnitude of Wage Differentiation.&lt;/strong&gt; For new external hires, the typical firm expects a gap between the highest and lowest offers of 3% for recent entrants, 5% for experienced non-managers, and 10% for managers (conditional on a gap: 6%, 10%, and 12% respectively). For incumbent workers responding to outside offers, the typical firm will adjust pay by 3% for recent entrants, 6% for experienced non-managers, and 10% for managers (conditional on responding: 6%, 7%, and 14% respectively). Forty-four percent of firms report that variation in initial offers is at least as important as back-and-forth negotiation in determining workers&amp;rsquo; final pay.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Predictors of Firm Bargaining Strategies.&lt;/strong&gt; Contrary to models predicting more productive firms are more likely to bargain (Doniger 2015; Postel-Vinay and Robin 2004; Flinn and Mullins 2021), firms that bargain are not more productive—as proxied by firm age, size, or assets per employee—nor do they pay higher mean wages. A variance decomposition shows that employee-group dummies alone explain 33% of variation in bargaining strategies for new hires, comparable to more than 500 firm dummies. Labor market factors—particularly whether a position is hard to fill—are systematically associated with bargaining willingness. Collective bargaining agreement (CBA) coverage and East German location are negatively correlated with bargaining flexibility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;How Bargaining Unfolds.&lt;/strong&gt; In 57% of worker-firm interactions, the worker provides salary expectations before the firm makes its initial offer; 29% of firms require this information. About one-third of applicants ask for more after the initial offer, requesting on average a 3% increase; conditional on asking, about half of firms raise the offer, but fewer than one-third match what was requested, with the typical worker improving the offer by 1.5%. The majority of outside offers are rejected: only 9% of workers who received an outside offer in the prior six months chose to move to a new firm. Of the 91% who remained at their incumbent firm, 13% successfully renegotiated their pay. Back-and-forth dynamics—where offers are accepted or rejected only after multiple rounds—are consistent with models of two-sided incomplete information.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Worker Heterogeneity and Wage Inequality.&lt;/strong&gt; Workers with better self-assessed outside options are 9 percentage points more likely to ask for an increase after the initial offer and 7 percentage points more likely to successfully negotiate a raise, relative to same-occupation coworkers with worse outside options. Women are 6 percentage points less likely to successfully negotiate their pay upward and show lower salary expectation provision rates, including in a hypothetical scenario in which pay range information is equalized. These gender differences in bargaining are not explained by women negotiating more over non-wage amenities; controlling for outside options and risk tolerance shrinks the female coefficient by at most 15%. Among surveyed workers, after controlling for occupation-establishment fixed effects, there is no gender wage gap at firms that do not bargain, but a 4–5 percentage point gender wage gap at firms that do bargain. Across specifications, firms that engage in individual bargaining have a 3 percentage point higher gender wage gap. A simple decomposition suggests that at surveyed firms, 44% of the residual gender pay gap can be attributed to bargaining. For workers at bargaining firms, a 10 percentage point higher pay premium at the prior firm is associated with 0.5 percent higher pay at the current firm, conditional on occupation-establishment fixed effects; this relationship is statistically insignificant for workers at non-bargaining firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply to full-time private-sector workers in Germany between ages 25 and 50, with the firm sample over-representing medium and large firms (median size 50–249 employees). CBA coverage in the sample (41%) reflects Germany&amp;rsquo;s institutional context where firms retain the right to pay above CBA floors. Results are robust to re-weighting to match the overall distribution of German firm size and sector.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. How do the authors define &amp;ldquo;individual bargaining&amp;rdquo; and why is this definition broader than standard labor economics usage?&lt;/strong&gt;
The authors define a firm as having a bargaining strategy if it differentiates pay between workers in the same position it perceives to have similar productivity, covering both tailoring of initial offers and back-and-forth negotiation. Standard labor economics definitions typically condition on wages being set ex post once outside options are revealed, and focus on back-and-forth negotiation alone. The authors&amp;rsquo; definition is most analogous to standard definitions of price discrimination. Empirically, the vast majority of firms that differentiate initial offers (93%) are also willing to engage in back-and-forth negotiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How was the firm survey designed to elicit bargaining strategies reliably, and what is the &amp;ldquo;protocol question&amp;rdquo;?&lt;/strong&gt;
The protocol question asked: &amp;ldquo;How much more could a person maximally receive compared to the fixed compensation you would have offered based on the person&amp;rsquo;s qualification/fit for the position alone?&amp;rdquo; with options ranging from &amp;ldquo;0%/no adjustments possible&amp;rdquo; to &amp;ldquo;more than 40%.&amp;rdquo; Wording was developed through over 100 conversations with HR professionals; &amp;ldquo;qualifications and fit&amp;rdquo; was the phrase most closely aligned with HR professionals&amp;rsquo; concept of productivity. The survey was fielded by the ifo Institute—an organization with decades of experience surveying this population—with a 51% response rate, 83% completion rate, and median response time of 11 minutes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What validation exercises support the reliability of the elicited firm bargaining measures?&lt;/strong&gt;
Four exercises are reported. First, intra-respondent reliability: the cross-tabulations between the protocol and incidence questions show most mass on or below the diagonal (incidence-implied spread no greater than the protocol-implied flexibility). Second, inter-respondent reliability: among 37 firms with multiple respondents, there is significant overlap in independently provided answers. Third, external validity using publicly available data: for 90% of firms reporting no CBA, no CBA evidence is found; for 99% reporting no pay information in job ads, none is found in online postings; for 82% reporting no salary expectation elicitation, no evidence of it appears in online application forms. Fourth, the elicited firm strategies are highly correlated with the matching workers&amp;rsquo; survey responses—e.g., workers at firms stating they elicit salary expectations are significantly more likely to report having provided these expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. Is firm productivity associated with whether a firm engages in individual bargaining?&lt;/strong&gt;
No. Firms that bargain and those that do not are similar with respect to firm size, firm age, and total assets per employee, and they also do not differ significantly in their AKM wage premium. These findings are inconsistent with theoretical models predicting that more productive firms are more likely to set pay via bargaining (Doniger 2015; Postel-Vinay and Robin 2004; Flinn and Mullins 2021). The result holds for both binary and continuous measures of bargaining, and is not overturned by machine learning prediction attempts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What firm characteristics other than productivity predict bargaining strategies?&lt;/strong&gt;
CBA coverage is negatively correlated with wage flexibility—CBA-covered firms report less flexibility even for managers who are typically exempt from CBAs and for groups not covered by CBAs, suggesting institutional norms or culture matter. Firms headquartered in East Germany are less likely to bargain with workers in all groups. Publicly traded firms (stock-based corporations) are more likely to set wages flexibly. These correlations are consistent with the view that managerial style and firm culture (rather than productivity) shape wage-setting strategies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What does the variance decomposition say about the relative importance of firm versus market factors in predicting bargaining strategies?&lt;/strong&gt;
Employee-group dummies alone explain 33% of the variation in bargaining strategies for new hires. After adjusting for the number of fixed effects used, four employee-group dummies explain as much variation as more than 500 firm dummies. Adding firm characteristics or coarse industry dummies does not significantly improve the adjusted R-squared relative to a model containing only group dummies. This supports models emphasizing market-level factors (worker replaceability, labor market tightness) over firm-level factors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How common is it for workers to provide salary expectations before receiving an initial offer, and what do firms do with this information?&lt;/strong&gt;
In 57% of worker-firm interactions, the worker provides salary expectations before the firm makes its initial offer. Twenty-nine percent of firms require this information; most ask for it. Forty-four percent of firms report that variation in initial offers is at least as important as subsequent back-and-forth negotiations in determining workers&amp;rsquo; final pay. HR professionals and prior research indicate firms interpret variation in stated expectations as reflecting outside options rather than productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What fraction of outside offers are rejected, and what happens when workers stay at the incumbent firm?&lt;/strong&gt;
Only 9% of workers who received one or more outside offers in the prior six months chose to move to a new firm. Of the 91% who remained at the incumbent firm, 13% successfully renegotiated their pay at the incumbent. A follow-up survey fielded in spring 2024 corroborates this finding, showing approximately 80% of workers who received an outside offer remained at the incumbent firm; even recoding all job-to-job transitions as accepted offers implies no more than 26% of offers lead to a transition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What do the back-and-forth dynamics imply for appropriate theoretical models of wage bargaining?&lt;/strong&gt;
That many offers are accepted or rejected only after multiple rounds of negotiation is difficult to rationalize with models assuming either firms or workers have perfect information, which typically predict immediate acceptance or rejection. The patterns are consistent with models of two-sided incomplete information (Perry 1986; Chatterjee and Samuelson 1983). Sixty-nine percent of HR professionals in the survey report that decision-makers at their firm only have market-level information on wages, not specific information on what competitors pay.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. How do outside options predict worker bargaining behavior and outcomes, controlling for occupation-establishment fixed effects?&lt;/strong&gt;
Workers who rated it &amp;ldquo;easy&amp;rdquo; or &amp;ldquo;very easy&amp;rdquo; to obtain a better outside offer are 9 percentage points more likely to ask for an increase after the initial offer and 7 percentage points more likely to successfully negotiate a raise relative to same-occupation-establishment coworkers who rated it &amp;ldquo;difficult&amp;rdquo; or &amp;ldquo;very difficult.&amp;rdquo; The same pattern persists during the employment spell: workers with better outside options are 9 percentage points more likely to initiate and 8 percentage points more likely to succeed in renegotiation. These workers are not more likely to receive raises without asking.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. How does risk tolerance predict bargaining, and how does it compare to outside options?&lt;/strong&gt;
Workers with greater risk tolerance (those rating themselves 7 or above on a 10-point scale) are more likely to engage in wage negotiations and more likely to succeed both at the start of and during employment spells. Gaps in successful negotiations are somewhat larger than gaps in attempted negotiations, suggesting risk-tolerant workers also negotiate more effectively. However, outside options explain more of the between-worker variation in bargaining behavior than risk tolerance does.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. What are the gender differences in bargaining behavior, and can they be explained by differences in outside options or risk tolerance?&lt;/strong&gt;
Women are less likely to engage in back-and-forth negotiations and are 6 percentage points less likely to successfully negotiate pay upward during an employment spell. Women are also less likely to provide salary expectations and provide lower expectations as a fraction of their current salary in the hypothetical scenario, including when the salary range is provided—women are 6 percentage points less likely to provide expectations above the top of the stated range. Controlling for outside options and risk tolerance shrinks the female coefficient by at most 15%. There is no evidence that women substitute toward negotiating for non-wage amenities. The pattern is most consistent with women finding negotiation uncomfortable, not with a belief that it will not pay off or fear of backlash.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13. What is the estimated gender wage gap attributable to individual bargaining?&lt;/strong&gt;
Among surveyed workers, after controlling for occupation-establishment fixed effects, there is no gender wage gap at firms without individual bargaining (coefficient closes to zero), while a 4–5 percentage point gender wage gap persists at firms with individual bargaining. This difference is robust across measures of pay (total daily pay, base pay, pay conditioning on hours worked), alternative fixed effect specifications, and to including non-surveyed workers at surveyed firms. A simple decomposition suggests 44% of the residual gender pay gap at surveyed firms can be attributed to bargaining. Across the interaction specifications, bargaining firms have a 3 percentage point higher gender wage gap and—in one key specification—a 6 percentage point difference between the gender gaps at bargaining and non-bargaining firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14. How does a worker&amp;rsquo;s prior firm wage premium affect current wages, and does bargaining status matter?&lt;/strong&gt;
In a regression of log current wages on the AKM wage premium of the prior firm (conditional on occupation-establishment fixed effects), a 10 percentage point higher pay premium at the prior firm is associated with 0.5 percent higher pay at the new firm for workers at bargaining firms. For workers whose pay is not set via individual bargaining, the relationship between the prior firm&amp;rsquo;s pay premium and current pay is statistically insignificant. The result is consistent with the idea that during negotiations with a new firm, workers use their prior firm&amp;rsquo;s pay policy as an outside option.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15. How do AKM person effects relate to bargaining behavior?&lt;/strong&gt;
Higher-person-effect individuals are more likely to have provided salary expectations when applying to their current firm and ask for a larger fraction of their current salary in the hypothetical scenario (conditional on their wage). These differences persist when controlling for occupation-establishment fixed effects and age and experience. Higher-person-effect workers are not more likely to receive raises without asking. These results are inconsistent with AKM person effects reflecting only productivity differences and instead suggest that fixed differences in individual bargaining behavior contribute to the variance in person effects—which Card, Heining, and Kline (2013) estimated explains a large share (40%) of the growth in German wage inequality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q16. Are the bargaining patterns found at surveyed firms representative of bargaining more broadly?&lt;/strong&gt;
Two robustness exercises support broader representativeness. First, similar bargaining dynamics are found when including a random sample of German workers employed at non-surveyed firms. Second, re-weighting the sample to match the overall distribution of firm size and sector in Germany yields similar results. Because medium and large firms are over-represented in the firm sample, and because small firms hire infrequently and are less likely to have formal bargaining strategies, the true prevalence of individual bargaining among all German firms may be somewhat lower.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Individual Bargaining Strategy (firm-level).&lt;/strong&gt; A firm has an individual bargaining strategy if it differentiates pay between workers in the same position that it perceives to have similar productivity. This definition encompasses both tailoring of initial offers (based on, e.g., workers&amp;rsquo; stated salary expectations) and back-and-forth negotiation. It is analogous to price discrimination rather than to the standard labor economics distinction between wage posting and Nash bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Protocol Question.&lt;/strong&gt; The main survey measure of firm bargaining strategies: firms are asked the maximum percentage by which pay could be increased for a new hire above the fixed compensation the firm would have offered based on qualifications and fit alone, with response bins from &amp;ldquo;0%/no adjustments&amp;rdquo; to &amp;ldquo;more than 40%.&amp;rdquo; A zero response is used to classify a firm as not bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incidence Question.&lt;/strong&gt; A supplementary survey measure eliciting the expected spread (between highest and lowest offers) that the firm would make to ten candidates with identical qualifications and fit but differing stated salary expectations and competing offers. Used to validate the protocol question and to quantify the importance of initial-offer differentiation relative to back-and-forth negotiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bottleneck Occupation.&lt;/strong&gt; A firm-defined category of workers in positions that are particularly difficult to fill, drawing on an official German Federal Employment Agency designation. In the paper, bargaining willingness is systematically higher for workers in these positions than for other workers at the same firm, providing evidence that labor market tightness drives bargaining strategies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outside Offer Renegotiation.&lt;/strong&gt; Wage renegotiation at the incumbent firm triggered by a worker receiving an outside offer, without a change in job tasks. The paper documents this is empirically more common than actual job-to-job transitions: of workers receiving outside offers, 91% remain at the incumbent firm, and 13% of those who remain successfully renegotiate their pay.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;AKM Person Effect.&lt;/strong&gt; A worker fixed effect estimated from a two-way fixed effects regression of log wages on worker and firm fixed effects (following Abowd, Kramarz, and Margolis 1999). In this paper, AKM person effects are taken from Bellmann et al. (2020), estimated over 2010–2017 German population data. The paper provides evidence that these effects capture, in part, fixed differences in individual bargaining behavior rather than solely differences in productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;AKM Firm Effect (Wage Premium).&lt;/strong&gt; The firm fixed effect from the same two-way fixed effects regression, representing the pay premium a firm pays relative to what would be expected given its workforce composition. The paper uses the prior firm&amp;rsquo;s AKM effect as a measure of a worker&amp;rsquo;s outside option quality when testing whether prior-firm pay policy influences current pay under individual bargaining.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Salary Expectations (Gehaltsvorstellungen).&lt;/strong&gt; The wage figure a worker provides to a prospective employer, typically before the firm&amp;rsquo;s initial offer. Legally, German firms (like most US states) cannot ask for salary history but can ask for salary expectations. In the paper, 57% of worker-firm interactions begin with the worker providing expectations; firms report using these to tailor initial offers, interpreting variation in stated expectations as reflecting outside options rather than productivity.&lt;/p&gt;</description></item><item><title>Barriers to Global Capital Allocation</title><link>https://macropaperwarehouse.com/papers/barriers-to-global-capital-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/barriers-to-global-capital-allocation/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Why do observed international investment positions and cross-country differences in rates of return to capital fail to conform to a frictionless capital-market benchmark? The paper asks how large the efficiency and distributional costs of barriers to global capital allocation are, and which frictions — capital income taxes, political risk, and geographic/cultural/linguistic distances — matter most.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a multi-country dynamic spatial general equilibrium model in which the entire network of bilateral cross-border investment positions is endogenously determined. Production in each country i follows a three-factor Cobb-Douglas function in reproducible capital, labor, and natural resources, with country-varying income shares. Capital is the only mobile factor. A logit asset demand system governs portfolio shares: the share of country j&amp;rsquo;s savings invested in country i is proportional to the risk-adjusted expected return on capital in i, scaled by the capital stock of i, and inversely proportional to a bilateral portfolio wedge ∆ij. These wedges can be microfounded via either rational inattention (where wedges reflect the precision of prior beliefs about returns) or extreme-value-distributed transaction costs. The model admits multiple microfoundations but yields the same functional form and the same counterfactual welfare calculations regardless of interpretation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Frictions measured.&lt;/strong&gt; Three categories of frictions enter the empirical implementation: (a) bilateral capital income tax rates — a new dataset covering 225 countries (50,625 country pairs), constructed from corporate income tax rates and treaty-adjusted withholding tax rates on dividends and interest, further adjusted for effective tax rates accounting for tax-haven routing; (b) political risk, proxied by an ICRG composite index (excluding socioeconomic conditions) following Alfaro, Kalemli-Ozcan, and Volosovych (2008); (c) geo-political distance, comprising geographic distance, cultural distance (based on 496 World Values Survey questions across 116 countries), and linguistic distance (based on a language-family tree covering 6,737 languages and 242 countries). These distance measures are publicly available at geopoliticaldistance.org. The model covers 96 countries (9,216 dyads), representing 92% of world GDP in 2017.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gravity Estimation.&lt;/strong&gt; Bilateral investment data (restated for tax havens using the nationality-basis methodology of Coppola et al. 2020 and Damgaard et al. 2019) are regressed on cultural, geographic, and linguistic distance with origin and destination fixed effects. In OLS, a one-standard-deviation increase in cultural distance (0.023 units) is associated with a 24.0% decrease in foreign assets; geographic distance (0.977 units in logs) with a 78.6% decrease; linguistic distance (0.174 units) with a 51.5% decrease. These magnitudes are robust across OLS, PPML, and IV (using religious distance as an instrument for cultural distance). Under IV, the standardized effect of cultural distance on log foreign assets rises to −76.5%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tax haven analysis.&lt;/strong&gt; A Tobit regression of the share of bilateral investment routed through tax havens on the estimated tax saving from routing through havens yields coefficients of 0.413–0.999 for equity and 1.001–1.777 for debt (across specifications with varying fixed effects), confirming that tax incentives are a primary driver of the discrepancy between residency-based and nationality-based bilateral positions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model fit (untargeted moments).&lt;/strong&gt; The calibrated baseline model produces: (i) a correlation of 0.658 between model-implied and empirical rates of return to capital (vs. 0.325 for the frictionless benchmark), with a standard deviation of 0.417 (vs. 0.091 frictionless; data: 0.496); (ii) a correlation of 0.947 between model-implied and empirical capital per employee (vs. 0.918 frictionless); (iii) a correlation of 0.94 between model-implied and empirical home bias; the model reproduces the mean home bias of 3.973 vs. 4.006 in data and standard deviation of 1.065 vs. 1.224, while the frictionless benchmark produces exactly zero home bias for all countries. Portfolio-share MSE: 1.16 (baseline) vs. 1.86 (frictionless).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual findings.&lt;/strong&gt; Removing all measured barriers raises world GDP by 6.8% relative to the observed equilibrium (equivalent to stating that the distorted equilibrium is 6.8% below the frictionless benchmark). Geo-political distance alone accounts for most of this: when only distance frictions are retained, world GDP is 5.2% below the frictionless level. Capital taxes alone reduce world GDP by 2.6% below frictionless; political risk alone by 0.4%. The standard deviation of log capital per employee is 51.5% higher than it would be without barriers; the standard deviation of log output per employee is 22.5% higher. In the frictionless equilibrium, capital flows from rich to poor countries (the correlation between net foreign assets and development doubles in absolute value), accounting for the Lucas (1990) puzzle. In short-term (one-period) counterfactuals holding wealth fixed, the GDP gain from full barrier removal is 3.6%; the inequality effect remains similar (standard deviation of log capital per employee 48.4% higher with barriers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; The model focuses on steady-state outcomes; dynamic transition effects are analyzed in extensions but are smaller. Quantitative conclusions are conditioned on: (i) the model sample of 96 countries covering 92% of world GDP in 2017; (ii) the conservative OLS coefficient estimates used for baseline calibration (IV estimates are larger and would amplify results); (iii) the assumption that the logit demand system captures frictions regardless of their microfoundation; (iv) omission of goods-trade frictions from the baseline (when included, the world GDP effect falls to 3.7% and the capital inequality effect to 23.3%).&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the core theoretical prediction about cross-country rates of return when investment barriers exist?&lt;/strong&gt;
A: In the model&amp;rsquo;s frictionless benchmark (Propositions 1 and 2), all origin countries hold identical portfolios and risk-adjusted expected returns are equalized across destinations. When bilateral frictions are introduced, countries that are more &amp;ldquo;peripheral&amp;rdquo; (harder to access for foreign investors due to high geo-political distance or political risk) receive less inward capital and therefore command higher physical rates of return to capital. Countries that are easily accessible (&amp;ldquo;central&amp;rdquo;) attract more capital and exhibit lower rates of return. The Dual Efficiency Theorem establishes that capital is efficiently allocated if and only if marginal products of capital are equalized across countries, which requires that taxes are uniform and that portfolio wedges satisfy a specific cancellation condition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How are portfolio wedges measured, and what is the identifying strategy?&lt;/strong&gt;
A: Portfolio wedges ∆ij are decomposed into a geo-political distance component and a political risk component. The geo-political distance component is specified as a log-linear function of geographic distance, cultural distance, and linguistic distance, with coefficients (β_g, β_c, β_l) estimated from a gravity regression of log bilateral investment on these distances, controlling for origin and destination fixed effects. Because political risk varies only by destination country, it cannot be separately identified from destination fixed effects in the bilateral regression; its elasticity is therefore taken from Alfaro, Kalemli-Ozcan, and Volosovych (2008). The key identification advantage of bilateral data is that origin and destination fixed effects absorb all country-level confounders, so the distance coefficients are identified purely from within-origin, within-destination variation across country pairs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What do the OLS gravity regressions find, and are the coefficients stable across specifications?&lt;/strong&gt;
A: In the baseline OLS specification (Table 2, column 1), the estimated coefficients on cultural distance, geographic distance, and linguistic distance are −11.944, −1.579, and −4.162 respectively (all significant at the 1% level). In standardized terms, a one-standard-deviation increase in cultural distance reduces foreign assets by 24.0%, geographic distance by 78.6%, and linguistic distance by 51.5%. Adding a rich set of control variables (colonial ties, legal origin, currency pegs, trade agreements, effective tax rates) leaves these magnitudes broadly similar: standardized effects on foreign assets are −26.4%, −80.1%, and −47.6%, respectively. Results are also robust across OLS and PPML specifications and across years 2013–2017. Effects are quantitatively similar for foreign equity and foreign debt, though linguistic distance has a somewhat smaller effect on debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. How does the instrumental variable strategy address reverse causality in cultural distance, and what does it find?&lt;/strong&gt;
A: The authors instrument cultural distance with religious distance (based on historical trees of religious affiliation), assuming religious history affects international investment only through its contemporary effect on differences in values and beliefs as captured by the World Values Survey. The instrument is a strong predictor of cultural distance (passes weak-instrument tests comfortably). Under IV, the standardized effect of a one-standard-deviation increase in cultural distance on log foreign assets rises from −24.0% (OLS) to −76.5% (IV). The authors use conservative OLS estimates for their baseline calibration, so the IV results imply the headline counterfactual effects are likely understated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. How does the model predict home bias, and how well does it match the data?&lt;/strong&gt;
A: Home bias is defined as the log difference between the domestic portfolio share and the country&amp;rsquo;s share in the world capital stock. In the frictionless model, Proposition 1 implies that all countries hold identical foreign portfolios, so the model produces exactly zero home bias for every country. The baseline model, by incorporating bilateral frictions, generates home bias endogenously without targeting it. The model-implied home bias correlates with the empirically measured home bias at 0.94 across countries and matches both the mean (3.973 model vs. 4.006 data) and standard deviation (1.065 vs. 1.224) closely. The model also predicts, consistent with Lau, Ng, and Zhang (2010), that home bias and rates of return on capital are positively correlated (model-implied ρ = 0.55), and that rates of return on capital correlate negatively with the log of GDP per employee (model-implied ρ = −0.70).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What is the quantitative decomposition of the world GDP loss by type of barrier?&lt;/strong&gt;
A: World GDP in the observed (distorted) equilibrium is measured at $112.9 trillion (PPP), which is 6.8% below the frictionless counterfactual. When all barriers are present except geo-political distance, world GDP is 5.2% below frictionless — meaning distance frictions account for the largest share. When all barriers are present except political risk, world GDP is only 0.4% below frictionless. When all barriers are present except taxes, world GDP is 2.6% below frictionless. These are not exactly additive because the distortions interact; the results confirm that geo-political distance (cultural, linguistic, and geographic) constitutes the dominant source of global capital misallocation among the three measured frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How do barriers affect the cross-country distribution of capital and income?&lt;/strong&gt;
A: The standard deviation of log capital per employee is 51.5% higher in the distorted equilibrium than in the frictionless counterfactual; the standard deviation of log output per employee is 22.5% higher. When only geo-political distance distortions are maintained, dispersion in log capital per employee is 38.2% higher and in log output per employee 15.9% higher. Maintaining only taxes raises the dispersion in log capital per employee by 12.9% and log output per employee by 6.0%; maintaining only political risk raises them by 7.3% and 3.8%, respectively. In the frictionless equilibrium, the poorest countries gain the most: some of the poorest countries see capital per employee increase by an order of magnitude and income per employee double.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. Does the model account for the Lucas puzzle (capital not flowing from rich to poor countries)?&lt;/strong&gt;
A: Yes. In the observed distorted equilibrium, net foreign asset positions correlate only weakly with the level of development, consistent with Lucas&amp;rsquo;s (1990) observation that capital fails to flow from rich to poor countries. In the frictionless counterfactual, the absolute value of the correlation between net foreign asset positions and log GDP per employee doubles, and capital indeed flows from rich to poor countries as neoclassical theory predicts. The distortions from taxes, political risk, and geo-political distance thus account for the absence of a strong correlation between net positions and development in the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. How do extensions incorporating goods-trade frictions, capital controls, and currency hedging costs affect the headline findings?&lt;/strong&gt;
A: Adding goods-trade frictions (country-specific prices for output and capital installation following Monge-Naranjo et al. 2019) reduces the world GDP effect to 3.7% (from 6.8% baseline) and the dispersion of log capital per employee to 23.3% higher (from 51.5%), but the overall pattern of results is preserved. Replacing political risk with capital controls (using Jahan and Wang 2016 de-jure capital account openness) yields a comparable world GDP loss of 6.6% and a geo-political distance effect of 6.2%, very close to the 6.8% and 5.2% in the baseline. Adding currency hedging costs leaves world GDP loss and inequality effects essentially unchanged relative to baseline. None of these extensions materially alters the headline conclusions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. How do the authors validate the model against nationality-based versus residency-based bilateral investment data?&lt;/strong&gt;
A: The model is calibrated to nationality-based positions (restated for tax havens). The MSE for fitting nationality-based external portfolio shares is 1.16, while the MSE for residency-based positions is 1.22. The model was not explicitly designed to distinguish between the two, yet it naturally produces better predictions for nationality-based positions because its frictions incorporate the incentives for indirect investment routing through tax havens. This cross-validation supports the methodological approach of using nationality-restated data and confirms the internal consistency of the model&amp;rsquo;s treatment of tax-haven routing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. What are the implications for global tax policy coordination?&lt;/strong&gt;
A: In the presence of information frictions, simple harmonization of capital tax rates across countries does not improve capital allocation efficiency and could worsen it. The Dual Efficiency Theorem implies that efficient capital allocation in a world with information frictions requires that taxes, risk premia, and information frictions satisfy a joint cancellation condition. From a normative perspective, a global social planner maximizing world GDP should impose lower capital tax rates in countries that are &amp;ldquo;peripheral&amp;rdquo; in the network of informational distances, in order to offset the disadvantage created by information frictions for those countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. How is the elasticity parameter η calibrated, and how sensitive are the results?&lt;/strong&gt;
A: The elasticity of substitution among countries&amp;rsquo; assets, η, is calibrated at 18.5 based on Koijen and Yogo (2020)&amp;rsquo;s demand-price elasticities for long-term debt (3.1, converted to a gross-return elasticity of approximately 30), short-term debt (25.2, converted to approximately 24.3), and equity (1.3, converted to approximately 14.8), with weights reflecting the composition of global portfolios. The baseline gravity coefficients are calibrated from OLS with controls (cultural: −13.129, geographic: −1.645, linguistic: −3.850), chosen as conservative estimates relative to IV or PPML. Sensitivity analysis using PPML or IV estimates of β yields broadly similar steady-state GDP losses (around 6%), confirming robustness.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Portfolio wedge (∆ij):&lt;/strong&gt; A bilateral distortionary term in the logit asset demand system that captures all frictions reducing the ability of investors from country j to invest in country i. Decomposed empirically into a geo-political distance component and a political risk component. A wedge of 1 means no friction; larger values reduce the share of investment flowing from j to i. Can be interpreted either as prior-belief imprecision under rational inattention or as systematic transaction costs under the extreme-value microfoundation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geo-political distance:&lt;/strong&gt; A composite of geographic distance (population-weighted geodesic distance), cultural distance (expected disagreement in World Values Survey responses between randomly drawn individuals from two countries, constructed with the &amp;ldquo;flex&amp;rdquo; method using up to 496 questions), and linguistic distance (normalized tree distance in the Ethnologue language family graph, covering 6,737 languages). Distinct from simple physical distance: it captures the informational and transactional barriers that arise from societal dissimilarity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual Efficiency Theorem:&lt;/strong&gt; A theoretical result (Theorem in Section 2.8) establishing that capital efficient allocation, equalization of marginal products of capital across countries, and uniform taxes combined with a specific cancellation condition on portfolio wedges are mutually equivalent statements in steady-state equilibrium. This is not a restatement of the First Welfare Theorem; it is a statement about GDP (not welfare) and does not require risk premia to be equalized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective bilateral tax rate (τij):&lt;/strong&gt; The composite bilateral tax rate on capital after accounting for tax-haven routing. Firms in the destination country optimally choose the share of capital issued through tax havens (solving a quadratic cost optimization), trading off the lower tax rate available through havens against an increasing quadratic routing cost. The effective rate is therefore lower than the statutory (de jure) rate when the tax-haven rate is lower than the statutory rate, with the gap depending on the estimated βth coefficient from the Tobit regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Logit asset demand system:&lt;/strong&gt; A portfolio allocation rule in which the share of country j&amp;rsquo;s savings invested in destination country i is proportional to the risk-adjusted expected return raised to the power η (the elasticity of substitution) times the destination capital stock, divided by the portfolio wedge and summed over all destinations. Microfounded either by rational inattention (Matejka and McKay 2015; Pellegrino 2023) or by extreme-value-distributed transaction costs. Produces portfolio gravity analogous to trade gravity when combined with the market clearing conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Home bias:&lt;/strong&gt; Defined as the log difference between a country&amp;rsquo;s domestic portfolio share (πii, the share of domestic savings invested at home) and that country&amp;rsquo;s share of world capital stock (ki/K). In the frictionless benchmark, home bias is exactly zero for all countries by Proposition 1. The baseline model generates home bias endogenously as a consequence of portfolio wedges and reproduces both the level and cross-sectional distribution of empirically observed home bias without targeting these moments directly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core-periphery structure:&lt;/strong&gt; An emergent property of international capital markets under investment barriers: countries that are easily accessible to international investors (low geo-political distance, low political risk, favorable tax treatment) are &amp;ldquo;central&amp;rdquo; and attract capital inflows, driving their rates of return to capital lower; &amp;ldquo;peripheral&amp;rdquo; countries that are less accessible have smaller capital stocks and higher rates of return, compensating investors for overcoming barriers. This structure generates persistent capital misallocation and cross-country income inequality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nationality-based vs. residency-based bilateral investment positions:&lt;/strong&gt; Residency-based data (e.g., raw IMF CPIS) attributes investment to the immediate counterparty country, including tax-haven shell companies. Nationality-based data (Coppola et al. 2020; Damgaard et al. 2019; Beck et al. 2024) reattributes investment to the country of the ultimate investor and ultimate issuer, bypassing offshore centers. The model fits nationality-based positions better (MSE 1.16 vs. 1.22 for residency-based) because it incorporates frictions that generate incentives for indirect routing, which is what nationality restatement is designed to undo.&lt;/p&gt;</description></item><item><title>Biased expectations and labor market outcomes: Evidence from German survey data and implications for the East–West wage gap</title><link>https://macropaperwarehouse.com/papers/biased-expectations-and-labor-market-outcomes-evidence-from-german-survey-data-and-implications-for-the-eastwest-wage-gap/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/biased-expectations-and-labor-market-outcomes-evidence-from-german-survey-data-and-implications-for-the-eastwest-wage-gap/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks two questions: (1) How do workers&amp;rsquo; biased expectations about job finding and job separation shape the labor market equilibrium and wages? (2) Are differences in expectation biases across workers a quantitatively important driver of wage differentials, specifically the East–West German wage gap?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis uses the German Socio-Economic Panel (SOEP), a nationally representative longitudinal survey of approximately 30,000 participants per wave. The working-age sample (ages 25–65) covers nine biennial survey waves from 1999 to 2015, yielding 67,772 observations for job separation expectations and 6,423 for job finding expectations. Perceived transition probabilities are reported on a 0–100 scale in steps of 10 percentage points. Actual (statistical) transition probabilities are constructed by estimating probit models that predict realized transitions within 24 months using a rich set of individual, job, and employer characteristics, and are rounded to the nearest decile for consistency with the survey scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main empirical findings.&lt;/strong&gt; Employed workers in Germany overestimate their job separation probability by 6.4 percentage points on average (perceived: 19.8%; actual: 13.3%), a pessimistic bias significant at the 1% level. Unemployed workers overestimate their job finding probability by 8.2 percentage points on average (perceived: 57.0%; actual: 48.8%), an optimistic bias also significant at the 1% level. The East–West divergence is striking. East German workers exhibit a pessimistic job separation bias of 12.1 percentage points, compared to only 4.7 percentage points in the West, despite broadly similar actual separation rates (15.1% vs. 12.8%). For job finding, West Germans overestimate their probability by 12.9 percentage points, while East Germans overestimate by only 2.0 percentage points — meaning East Germans are also substantially less optimistic about re-employment. These East–West differences survive controls for compositional differences and alternative definitions of job separation (dismissals only; selected reasons; spell-based) and job finding (including those out of the labor force). The biases are stable over the 1999–2015 sample period with no discernible trend. A cohort analysis shows that the excess pessimism in East Germany is concentrated among cohorts who were already in the labor market at the time of German reunification (born in the 1950s and 1960s), consistent with persistent effects of the communist GDR experience. Individuals do not systematically learn over time: mean changes in individual-level absolute deviations between consecutive waves are close to zero. Individual deviations between perceived and actual rates have statistically significant but quantitatively negligible predictive power for subsequent transitions (a 1 pp higher perceived job separation is associated with only a 0.001 pp higher realized separation rate), ruling out private information as a first-order explanation for the biases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors extend the Diamond–Mortensen–Pissarides (DMP) frictional labor market framework by (i) allowing workers to hold biased perceived transition rates (λw for job finding, σw for job separation) while firms have rational expectations, and (ii) introducing wage contracts of explicit length T periods after which parties re-bargain. Common knowledge of each party&amp;rsquo;s perceived values is assumed, and generalized Nash bargaining is applied. The contract length T is a key parameter: there exists a critical threshold T* such that a pessimistic job separation bias raises the equilibrium wage for T &amp;lt; T* (the continuation-value effect dominates) and lowers it for T ≥ T* (the within-contract discounting effect dominates). An optimistic job finding bias unambiguously raises the equilibrium wage by inflating the perceived value of unemployment and hence the reservation wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative results.&lt;/strong&gt; The model is calibrated to East Germany. The job separation bias (∆σ = 0.0194) and job finding bias (∆λ = 0.0044) are set to SOEP-based estimates. The critical threshold implied by calibrated parameter values is T* = 10 quarters. The baseline contract length, constructed from the share of permanent (88%) and temporary (12%) contracts in SOEP and average remaining tenure until retirement, is T = 67 quarters (a lower bound). This exceeds T*, so the pessimistic separation bias depresses wages in the baseline. A counterfactual experiment assigns West German bias levels to East German workers, while holding all other parameters fixed. For the preferred calibration range (γ ∈ {0.35, 0.50}, T ∈ {67, 106, 159}), East German wages rise by 1.07 to 2.36 percent. This corresponds to a reduction in the conditional East–West German wage gap (23 percent) of 4.6 to 10.6 percent, and a reduction in the unconditional gap (30 percent) of 3.6 to 7.9 percent. Although wages rise, equilibrium unemployment increases by 0.70 to 1.01 percentage points, widening the already large East–West unemployment gap (approximately 7 percentage points). Net of the unemployment effect, expected lifetime income (computed at actual, unbiased transition rates) rises by 0.7 to 1.88 percent for East German workers under West German biases, implying an unambiguous welfare gain. Under a biennial calibration (robustness), wages increase by up to 3.3 percent and expected lifetime income rises by up to 2.23 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results apply to a stationary environment (no aggregate fluctuations). Firms are assumed to have rational expectations; an extension shows results hold provided firm bias is smaller than worker bias. Workers are assumed homogeneous in their bias levels; learning is abstracted from. The quantitative magnitudes are sensitive to the workers&amp;rsquo; bargaining power γ and the contract length T, both of which are subject to uncertainty in calibration.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How are actual (statistical) transition probabilities constructed, and why are probit-predicted probabilities preferred over realized sample means?&lt;/strong&gt;
A: Realized transition rates in the sample mix transitions for various idiosyncratic reasons that vary substantially across population groups, so raw sample means do not reflect the probability a given individual faces at interview time. The authors estimate probit models separately for job separation (employed sample) and job finding (unemployed sample), including a rich set of covariates — age, gender, education, tenure, firm size, unemployment experience, industry, survey year, and East Germany indicator, among others — and predict individual-level probabilities at the time of the interview. For consistency with the survey&amp;rsquo;s discrete response format, probit-predicted probabilities are rounded to the nearest decile (0%, 10%, &amp;hellip;, 100%). The bias is computed as the individual-level difference between perceived and probit-predicted actual probabilities, averaged over the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the magnitude and direction of the aggregate expectation biases in Germany?&lt;/strong&gt;
A: Employed workers overestimate job separation by 6.4 percentage points on average (perceived 19.8% vs. actual 13.3%), a pessimistic bias significant at the 1% level. Unemployed workers overestimate job finding by 8.2 percentage points (perceived 57.0% vs. actual 48.8%), an optimistic bias also significant at the 1% level. Both directions are statistically robust across alternative definitions of separation and finding, as well as to trimming extreme responses (0% and 100% answers) and adjusting for directional rounding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How large are the East–West differences in expectation biases, and do they survive controls for compositional differences?&lt;/strong&gt;
A: East German workers exhibit a pessimistic job separation bias of 12.1 percentage points, more than 2.5 times the West German level of 4.7 percentage points, despite actual separation rates being broadly comparable (15.1% vs. 12.8%). For job finding, West Germans are optimistic by 12.9 percentage points while East Germans are optimistic by only 2.0 percentage points, a difference of 10.9 percentage points. The paper states these differences persist after accounting for compositional differences between regions, and are robust across all alternative definitions of job separation (Dismissals, Selected, Spell) and job finding (out of U or O). The table of robustness results (Table 2) confirms that in all specifications, the pessimistic separation bias is substantially larger in the East and the optimistic finding bias is substantially smaller.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What cohort analysis is conducted to explore the origins of greater East German pessimism?&lt;/strong&gt;
A: The authors conduct a regression of the individual-level bias on birth-cohort indicators, controlling for age, demographic, and economic characteristics. They find that the pessimistic job separation bias is most pronounced among cohorts born in the 1950s and 1960s — those who experienced adult working life in the communist GDR and lived through reunification — and is smaller for cohorts born before 1950 and substantially smaller for cohorts born after 1970. For job finding, the optimistic bias is comparably low among cohorts born in the 1960s and earlier, but rises significantly for later-born East German cohorts. This cohort pattern is consistent with a long-lasting &amp;ldquo;experience effect&amp;rdquo; of communist institutions and the reunification shock on beliefs, analogous to findings in the broader literature on the persistent effects of communism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Is there evidence that individuals update their biased expectations over time?&lt;/strong&gt;
A: To assess learning, the authors use the panel dimension and compute for each individual in two consecutive survey waves the absolute value of the deviation between perceived and actual transition probabilities, then examine the change in this absolute deviation between waves. The histograms of individual-level changes show substantial dispersion but means close to zero in all four sub-groups (East/West, job separation/finding), indicating no systematic convergence of beliefs toward actual rates. Biases are also stable in the time-series dimension, with perceived and actual rates moving largely in parallel across survey waves from 1999 to 2015, leaving the aggregate bias level roughly constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the model rule out private information as an alternative explanation for the biases?&lt;/strong&gt;
A: If biases reflected private information about idiosyncratic risk not captured by observable characteristics, individual-level deviations between perceived and actual rates should predict subsequent realized transitions. The authors add the individual-level deviation as an additional regressor in the probit transition models. The estimated coefficients are statistically significant and positive, but quantitatively negligible: a 1 percentage point higher expected job separation probability is associated with only a 0.001 percentage point higher realized separation probability, and a 1 percentage point higher expected job finding probability with a 0.002 percentage point higher realized finding probability. These magnitudes are too small to materially alter the interpretation of the biases as reflecting systematic expectation errors rather than private information.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Q7: What is the role of contract length T in the model, and what is the critical threshold T&lt;/em&gt;?&lt;/em&gt;*
A: The wage contract length T determines which of two opposing effects of pessimistic job separation expectations dominates in bargaining. The first (negative wage) effect: a pessimistic worker discounts future wages within the current contract more heavily than the firm does, so the worker values the contract less and accepts a lower wage. The second (positive wage) effect: a pessimistic worker also discounts the continuation value of future contracts more heavily, making it less attractive to remain in the match, so the firm must offer a higher wage to retain the worker. For short contract lengths (T &amp;lt; T*), the second (positive) effect dominates, so the pessimistic bias raises wages. For long contracts (T ≥ T*), the first (negative) effect dominates, so the pessimistic bias depresses wages. The critical threshold T* is the smallest positive integer such that T*/λw(θ) &amp;lt; β times a weighted sum involving σw and T*. Using calibrated parameter values for East Germany, T* = 10 quarters (2.5 years). The baseline contract length is T = 67 quarters (approximately 16.8 years), well above T*, placing the economy in the regime where pessimism depresses wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the optimistic job finding bias affect equilibrium wages and unemployment?&lt;/strong&gt;
A: An optimistic job finding bias (λw &amp;gt; p(θ)) raises the perceived value of unemployment U because workers expect to escape unemployment sooner. A higher value of unemployment raises the worker&amp;rsquo;s outside option in bargaining, increases the reservation wage, and thereby pushes up the bargained wage. In general equilibrium, the job creation condition (which is unaffected by worker expectations) is unchanged, so the upward rotation of the wage curve reduces labor market tightness θ, raises equilibrium unemployment, and extends average unemployment duration. This comparative static holds unambiguously for any contract length T.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the quantitative results of the counterfactual experiment assigning West German biases to East German workers?&lt;/strong&gt;
A: The counterfactual assigns West German bias levels (smaller pessimistic separation bias, larger optimistic finding bias) to East German workers while holding all other parameters at East German calibrated values. For the preferred calibration with γ ∈ {0.35, 0.50} and T ∈ {67, 106, 159}, wages in East Germany rise by 1.07 to 2.36 percent. This implies a reduction in the conditional East–West wage gap (23 percent) of 4.6 to 10.6 percent and a reduction in the unconditional gap (30 percent) of 3.6 to 7.9 percent. Equilibrium unemployment in East Germany rises by 0.70 to 1.01 percentage points as a side effect. Net of the unemployment effect, ex-ante unbiased expected lifetime income rises by 0.7 to 1.88 percent, confirming a positive welfare effect of reducing East German pessimism to West German levels. Under the biennial calibration robustness check, wage increases reach up to 3.3 percent, the conditional wage gap narrows by up to 11 percent, and lifetime income rises by up to 2.23 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How is the bargaining power parameter γ calibrated and why does it matter for the results?&lt;/strong&gt;
A: The paper considers a range γ ∈ {0.35, 0.50, 0.65}, rather than a single calibrated value, because γ plays a crucial role in the sensitivity of wages to expectation biases. Lower bargaining power reduces the equilibrium wage directly; however, because lower wages spur job creation, the model requires a higher vacancy cost κ to match the empirical job finding rate, which in turn increases the elasticity of wages with respect to the bias (see the wage equation, which shows that the bias effect scales with κθ/p(θ)). The paper argues that γ = 0.65 is inconsistent with the empirical wage–bias relationship estimated in SOEP data (which is negative and about twice as negative in East Germany as in the West), while γ ∈ {0.35, 0.50} is consistent. Lower bargaining power is also argued to be realistic for East Germany given weaker union representation there relative to the West.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the empirical relationship between the job separation bias and wages serve as a model validation target?&lt;/strong&gt;
A: Using SOEP data, the authors regress log hourly wages on the individual-level difference between perceived and actual job separation rates, controlling for individual fixed effects and other covariates, and allow the slope to differ between East and West Germany. They find a statistically significant and negative relationship in both regions, with the effect approximately twice as large in East Germany as in the West. The estimate implies that if East German workers&amp;rsquo; job separation pessimism were reduced to West German levels, hourly wages in the East would be about 1 percent higher. This empirical gradient is used as an external validation check — not a calibration target — to assess which combinations of (γ, T) in the model are quantitatively plausible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What does the model predict about the general equilibrium effects on unemployment from reducing East German pessimism?&lt;/strong&gt;
A: Reducing East German pessimism — both the pessimistic separation bias and the low optimistic finding bias — shifts the wage curve upward in equilibrium. Because the job creation condition is unaffected by worker beliefs (firms have rational expectations), higher wages reduce the firm&amp;rsquo;s incentive to post vacancies, lowering labor market tightness θ. This leads to higher equilibrium unemployment and longer average unemployment duration. The counterfactual with West German biases implies that East German unemployment would rise by 0.70 to 1.01 percentage points, further widening the approximately 7 percentage point East–West unemployment gap. The authors note this is a welfare-relevant trade-off, but show that the wage gain dominates the unemployment cost in terms of expected lifetime income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What robustness checks are performed on the quantitative results?&lt;/strong&gt;
A: The paper considers (i) a narrower definition of job separation (dismissals only) to match the most likely interpretation of the survey question; (ii) targeting the officially reported East German unemployment rate (14.5% average from the Federal Employment Agency) rather than the SOEP-implied rate of 8.6% as a calibration target; (iii) a biennial calibration frequency instead of quarterly. The main results — wage increases and narrowing of the wage gap — are quantitatively similar across these alternatives, with one exception: the biennial calibration yields substantially larger wage increases (up to 3.3%), a larger reduction in the conditional wage gap (up to 11%), and larger lifetime income gains (up to 2.23%).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Expectation bias (job separation / job finding).&lt;/strong&gt; In this paper, a bias in expectations is defined as a systematic average difference between an individual&amp;rsquo;s perceived transition probability and the actual (statistically predicted) transition probability for their demographic and job group. A pessimistic job separation bias means workers overestimate the probability of losing their job (σw &amp;gt; σ); an optimistic job finding bias means unemployed workers overestimate the probability of re-employment (λw &amp;gt; p(θ)). Biases are not attributed to private information but to systematic expectation errors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Actual (statistical) transition probability.&lt;/strong&gt; The paper defines actual transition probabilities not as raw sample transition rates but as individual-level predicted probabilities from probit models estimated on realized transitions within 24 months, conditional on a comprehensive set of individual, job, and employer characteristics observed at interview time. These are rounded to the nearest decile for comparability with the survey&amp;rsquo;s discrete response format.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage contract length (T).&lt;/strong&gt; The contract length T is the number of periods for which a bargained wage is fixed before the match parties re-bargain. A job match consists of a sequence of consecutive wage contracts of length T. The paper departs from the standard DMP assumption of period-by-period bargaining (T = 1) and shows that T is central to how job separation expectations feed into the bargained wage. A permanent job approximates T → ∞.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Critical contract length (T&lt;/em&gt;).&lt;/em&gt;* A theoretically derived threshold: the pessimistic job separation bias raises equilibrium wages for contract lengths T &amp;lt; T* and depresses wages for T ≥ T*. Specifically, T* is the smallest positive integer such that T*/λw(θ) &amp;lt; β times a weighted sum involving β, σw, and T*. In the East German calibration, T* = 10 quarters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Generalized Nash bargaining with common knowledge / agree to disagree.&lt;/strong&gt; The model assumes that both the worker and the firm know each other&amp;rsquo;s perceived values of the job match and outside options and accept them as the basis for bargaining, even though they differ. Workers use their biased perceived transition rates to value employment and unemployment; firms use actual rates. There is no private information. The paper refers to this as workers and firms &amp;ldquo;agreeing to disagree.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex-ante unbiased expected lifetime income (EI_{W,U}).&lt;/strong&gt; A welfare measure defined as the present discounted value of income for an individual entering the economy, computed at actual (unbiased) job separation and job finding probabilities rather than at workers&amp;rsquo; perceived (biased) rates. This measure captures the net welfare effect of changing expectation biases because it correctly accounts for actual employment transitions, even though the behavioral responses in equilibrium are driven by biased perceptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective discount factor (β(1 − σw)).&lt;/strong&gt; When a worker holds pessimistic job separation expectations, future payoffs within the current contract are discounted not at the pure time discount factor β but at β(1 − σw), which is smaller when σw is larger. A more pessimistic worker therefore effectively discounts future wage payments more steeply, and this differential discounting relative to the firm (which uses β(1 − σ)) is the key mechanism generating the contract-length dependence of the wage effect.&lt;/p&gt;</description></item><item><title>Borrowing and Spending in the Money: Debt Substitution and the Cash-Out Refinance Channel of Monetary Policy</title><link>https://macropaperwarehouse.com/papers/borrowing-and-spending-in-the-money-debt-substitution-and-the-cash-out-refinance-channel-of-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/borrowing-and-spending-in-the-money-debt-substitution-and-the-cash-out-refinance-channel-of-monetary-policy/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does monetary policy stimulate household borrowing and consumption by enabling cash-out mortgage refinancing (&amp;ldquo;the cash-out refinance channel&amp;rdquo;), or does it primarily induce substitution across borrowing products without meaningfully changing total new household borrowing?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation.&lt;/strong&gt; Prior work (Eichenbaum, Rebelo and Wong 2022; Berger et al. 2021) interprets the strong positive correlation between a borrower&amp;rsquo;s refinance incentive and cash-out refinancing as evidence of a potent, path-dependent monetary policy transmission channel: when rates fall below a borrower&amp;rsquo;s outstanding mortgage rate (&amp;ldquo;in-the-money&amp;rdquo;), the incentive to refinance generates large cash-out activity and consumption. This interpretation presumes that mortgages are effectively the only household borrowing product and that cash-out refinancing reflects a stimulated demand for new borrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Alternative Hypothesis.&lt;/strong&gt; The authors argue instead that households have inelastic, exogenous liquidity needs (for consumption smoothing, housing repairs, health shocks, etc.) and satisfy those needs using whichever borrowing product is cheapest given the rate environment. When mortgage rates fall below a borrower&amp;rsquo;s outstanding rate, cash-out refinancing becomes the least-cost vehicle, so borrowers shift from credit cards, HELOCs, personal loans, and second liens (closed-end seconds) toward cash-out refinancing—substituting borrowing products rather than expanding total borrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The authors use the Equifax Credit Risk Insight Servicing McDash (CRISM) dataset, which anonymously matches credit bureau records to mortgage servicing data (McDash). The main sample is a 16.5% draw of fixed-rate, first-lien mortgage loans observed at monthly frequency during 2013, yielding approximately 35 million loan-month observations. For the long time-series analysis, the full 2006–2021 sample is used. Borrowing events are identified across five credit instruments: cash-out refinance, HELOC, closed-end second (CES), credit card, and personal loan, each requiring at least $5,000 in new credit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy.&lt;/strong&gt; The paper uses two complementary approaches to address the endogeneity of mortgage rates and borrower refinance incentives.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Taper Tantrum quasi-experiment (main):&lt;/em&gt; In late spring 2013, two FOMC communication events triggered an approximately 80 basis-point increase in the 30-year fixed mortgage rate over the course of one month. Critically, because the shock arose from changes in long-term rate expectations (LSAPs), short-term rates—and thus HELOC and consumer credit rates—were largely unchanged. The authors exploit cross-sectional variation in pre-Taper &amp;ldquo;rate gaps&amp;rdquo; (outstanding mortgage rate minus estimated current market rate) using a difference-in-differences design (equation 6) to compare how cash-out and alternative borrowing change after the shock for borrowers with different pre-existing refinance incentives.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Monetary policy surprise IV (2006–2021):&lt;/em&gt; Following Berger et al. (2021), the authors instrument for the aggregate share of borrowers with rate gaps between 0 and 2 percentage points using the Bu, Rogers and Wu (2021) (BRW) unified measure of Fed monetary policy shocks, which spans both conventional and unconventional policy. This approach tests whether substitution persists when both long and short rates move together.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Extensive margin (probability of borrowing):&lt;/em&gt; After the Taper Tantrum, the monthly probability of cash-out refinancing declines for all rate gap bins, most strongly for borrowers pushed out of the money by the rate increase (a roughly 0.0012 percentage-point monthly probability decline—more than 85 percent below baseline—for borrowers with pre-Taper rate gaps of approximately 1 percent). Simultaneously, the probability of other borrowing (HELOCs, credit cards, personal loans, CES) rises in a near-mirror image, especially for borrowers at intermediate rate gaps. The combined effect on total borrowing probability is negligible and shows little variation with rate gap.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Intensive margin (amount borrowed conditional on borrowing):&lt;/em&gt; Conditional on a cash-out refinance occurring after the Taper, the average extraction amount &lt;em&gt;increases&lt;/em&gt;, consistent with a borrower-selection effect: low-liquidity-need borrowers, who face the highest effective borrowing cost increase when they move out of the money, disproportionately exit cash-out refinancing, leaving behind a pool of high-liquidity-need borrowers. For borrowers with pre-Taper rate gaps of around 1 percent, the conditional cash-out amount rises about 20 percent after the Taper.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Aggregate borrowing elasticity:&lt;/em&gt; Combining extensive and intensive margin estimates via a hurdle model, a 1 percentage-point increase in mortgage rates reduces total new household borrowing by between 0 and 8 percent (the aggregate borrowing elasticity is not statistically significantly different from zero at the preferred estimate, with a lower-bound of −8 percent), compared with a cash-out probability elasticity of approximately −45 percent in absolute terms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Debt paydown:&lt;/em&gt; About 10–12 percent of new mortgage debt from cash-out refinances is used to pay down other outstanding debt, and this share is constant across rate gap groups and is not affected by the Taper, implying the MPC from cash-out borrowing does not vary with the rate environment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Conventional monetary policy:&lt;/em&gt; Using the BRW IV over 2006–2021, the IV first stage yields an F-statistic of approximately 11. The cash-out extensive margin responds positively to the in-the-money share (elasticity 3.5 in IV), while other borrowing responds negatively (elasticity −0.87 in IV), and the all-borrowing elasticity is 0.09 and statistically insignificant. The intensive margin results are directionally consistent: conditional cash-out amounts fall as more borrowers are in the money, while total borrowing amounts respond positively (but insignificantly). Substitution thus holds even when both long and short rates move together.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Implications for Path Dependence.&lt;/strong&gt; Because out-of-the-money borrowers substitute toward non-cash-out products, the non-linear dependence of cash-out refinancing on the distribution of outstanding mortgage rates does not translate into a correspondingly path-dependent total borrowing response. A back-of-the-envelope calculation using standard MPC assumptions (100 percent for cash-out, 80 percent for rate-term savings) and empirical refinancing frequencies and amounts (average first-lien equity extraction of $40,000 vs. average annual payment savings of $3,000 from rate-term refinancing, with rate-term frequency about 1.5x higher and semi-elasticity about 2x larger) implies that the potential near-term consumption stimulus from cash-out refinancing is approximately 5.5 times larger than from rate-term refinancing—making cash-out the dominant channel in principle. But because debt substitution substantially offsets the interest-rate sensitivity of cash-out refinancing, and because the path dependence of cash-out refinancing is largely eliminated by borrower substitution, the paper concludes that the overall path dependence of monetary policy is weaker than suggested by Berger et al. (2021) and Eichenbaum, Rebelo and Wong (2022).&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the &amp;ldquo;rate gap&amp;rdquo; and why does it capture the cash-out refinance incentive?&lt;/strong&gt;
The rate gap is defined as a borrower&amp;rsquo;s outstanding fixed mortgage rate minus an estimate of the 30-year fixed mortgage rate currently available to that borrower if they were to refinance (estimated from a regression of origination-period rates on LTV, credit score, loan type, investor type, and month fixed effects). A positive rate gap means the borrower is &amp;ldquo;in the money&amp;rdquo; for a rate-term refinance: they can reset their existing mortgage at a lower rate. The rate gap captures the degree of refinance incentive because resets the interest cost on the entire outstanding balance. Cash-out refinancing is especially attractive when the rate gap is positive because the rate reduction on the existing balance partially subsidizes the new borrowing, lowering its effective cost relative to alternative products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the conceptual model of debt substitution the authors propose?&lt;/strong&gt;
The authors model a homeowner with an inelastic liquidity need l that arrives with probability λ. The borrower can satisfy this need through a cash-out refinance at mortgage rate r_m (resetting their entire mortgage at r_m, which implies an interest cost on the existing balance) or through an alternative product at rate r_a &amp;gt; r_m. The key trade-off is that a cash-out refinance saves on the rate for the liquidity need itself but incurs a cost or benefit depending on whether r_m exceeds or falls below the outstanding rate r_0. When the rate gap is negative (r_0 &amp;lt; r_m), the cash-out refinance penalizes the borrower on the existing balance; when the gap is positive (r_0 &amp;gt; r_m), it saves on the existing balance, further lowering the effective cost of the liquidity need. The model predicts that: (i) the probability of cash-out refinancing is nonlinear and step-like in the rate gap; (ii) the probability of alternative borrowing has the opposite pattern; (iii) higher mortgage rates raise the conditional cash-out amount through selection (low-l borrowers exit cash-out); and (iv) total borrowing is relatively insensitive to mortgage rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the Taper Tantrum provide exogenous variation, and what are its limitations?&lt;/strong&gt;
The Taper Tantrum began in late spring 2013 when two FOMC communication events—Chairman Bernanke&amp;rsquo;s congressional testimony and the subsequent FOMC meeting—shifted market expectations about the pace of tapering large-scale asset purchases (LSAPs). The 30-year fixed mortgage rate rose approximately 80 basis points within one month, driven by changes in long-term rate expectations. Because the shock was unanticipated and FOMC did not announce any concrete policy change, the scope for a &amp;ldquo;Fed information effect&amp;rdquo; biasing results is limited. The critical limitation is that the Taper Tantrum affected primarily long-term rates: HELOC rates and consumer credit rates (tied to the federal funds rate and bank prime rate, which were unchanged) were little affected. This means the estimated substitution elasticity holds when the rate spread between mortgage and alternative products widens, which is more directly applicable to unconventional monetary policy (LSAPs) than to conventional policy that moves rates across the full yield curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What do the Taper Tantrum extensive margin results show, and what pattern confirms substitution?&lt;/strong&gt;
Figure 4 plots the difference-in-differences coefficient β₂ + β₃ by pre-Taper rate gap bin for three outcome variables. The cash-out refinancing probability (blue line) declines for all rate gap bins, most sharply for intermediate rate gap values (borrowers pushed out of the money by the Taper). Borrowers with pre-Taper rate gaps of ~1 percent experience a decline in monthly refinancing probability of about 0.0012, or more than 85 percent below their baseline rate. Other borrowing (black line) shows an almost exact mirror-image pattern: it rises after the Taper, most strongly for the same intermediate rate gap borrowers. The total borrowing probability (red line) shows essentially no response and little variation across rate gap groups, implying substitution nearly completely offsets the cash-out decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How do the intensive margin results for cash-out refinancing compare to the extensive margin, and what explains the difference?&lt;/strong&gt;
After the Taper, the conditional cash-out amount &lt;em&gt;rises&lt;/em&gt; (the intensive margin effect is positive), while the cash-out probability falls (the extensive margin effect is negative). These opposite signs are consistent with borrower selection: borrowers with small liquidity needs face the steepest increase in effective borrowing cost when they move out of the money and so disproportionately exit cash-out refinancing, raising the average extraction amount among those who remain. For borrowers with pre-Taper rate gaps of ~1 percent, the conditional cash-out amount rises approximately 20 percent after the Taper. Figure 6 corroborates this by showing the increase in average extraction is driven by a sharp decline in small extraction amounts (relative to outstanding balance).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the aggregate borrowing elasticity computed and what does it imply about monetary policy transmission?&lt;/strong&gt;
The authors combine extensive and intensive margin estimates using a two-tiered (hurdle) model that allows the decision to borrow and the decision of how much to borrow to respond differently to covariates. The total expected borrowing amount is the product of the estimated borrowing probability and the expected conditional borrowing amount. Pre- and post-Taper aggregate predicted borrowing is calculated for each rate gap group, and the percentage change is divided by the 80 basis-point rate increase to produce a semi-elasticity. The aggregate borrowing elasticity is not statistically significantly different from zero at the main estimate, and the lower-bound estimate (which avoids reliance on the Post dummy for aggregate borrowing) is at most −8 percent per percentage-point increase in rates. This compares with a cash-out probability elasticity of approximately −45 percent, illustrating that substitution accounts for the overwhelming majority of the observed cash-out response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why is the BRW monetary policy shock IV important for generalizing the Taper Tantrum findings?&lt;/strong&gt;
The Taper Tantrum moved only long rates, whereas conventional monetary policy moves both long and short rates. When short rates rise, the alternative borrowing products (HELOCs, credit cards, personal loans) become more expensive, which could dampen substitution in two ways: (a) the rate spread between mortgage and alternative products narrows, reducing the range of borrower-amount combinations for which substitution makes financial sense; and (b) higher absolute borrowing costs on alternative products may reduce total borrowing among borrowers who would otherwise substitute. The BRW IV, which spans 2006–2021 and reflects shocks to the full yield curve (conventional and unconventional), addresses whether substitution holds when both rate types move. The IV results in Table II (F-statistic ~11) confirm that the cash-out probability elasticity is 3.5 (IV), the other-borrowing elasticity is −0.87 (IV), and the all-borrowing elasticity is 0.09 and statistically insignificant, broadly consistent with the Taper Tantrum findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Does the share of cash-out proceeds used for debt paydown vary with the rate environment, and why does this matter?&lt;/strong&gt;
An event study finds that total household debt increases by about 88 percent of the increase in mortgage balance in the first two months after a cash-out refinance, implying approximately 12 percent debt paydown; by six months out, the net paydown stabilizes at around 8 percent. Crucially, this share is constant across rate gap groups and does not change after the Taper Tantrum. This constancy implies that the marginal propensity to consume (MPC) out of cash-out refinances does not vary with the rate environment, and therefore the path-dependence of the cash-out channel cannot be attributed to compositional changes in how borrowers use extracted funds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why does the paper argue cash-out refinancing has far greater near-term consumption potential than rate-term refinancing, and what are the implications for path dependence?&lt;/strong&gt;
A back-of-the-envelope calculation uses: (1) empirical frequencies (rate-term refinance probability is ~1.5x higher than cash-out); (2) near-term liquidity per event (average first-lien cash-out extraction ~$40,000 vs. annual payment savings ~$3,000 from rate-term); (3) semi-elasticities (rate-term has ~2x higher semi-elasticity to rates than cash-out per the IV estimates); and (4) standard MPC assumptions (100% for cash-out, 80% for rate-term savings). The calculation implies the consumption stimulus potential from cash-out refinancing is approximately 5.5 times that of rate-term refinancing per percentage-point change in rates. Because the paper shows the path-dependence of cash-out refinancing is largely offset by substitution, and because cash-out is the dominant near-term channel, the overall path-dependence of monetary policy is weaker than prior models predict.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the key robustness checks and how do they address potential confounds?&lt;/strong&gt;
Three main robustness exercises are reported. First, a QE1 robustness (Appendix) uses the large decline in mortgage rates after the first LSAP announcement in 2008 as an alternative shock, finding consistent substitution patterns (households shift into cash-out refinancing from other borrowing when pushed into the money). Second, a placebo test shifts the sample back six months and estimates the same specification over the twelve months preceding the Taper; Figure 8 shows no differential substitution by rate gap during this stable-rate period, supporting the interpretation that the Taper Tantrum rate increase drives the cross-sectional substitution pattern. The placebo does reveal a negative Post dummy for other borrowing, consistent with a possible pre-trend in other borrowing, which motivates the lower-bound elasticity calculation that avoids reliance on this coefficient. Third, the authors show that results are little changed when adjustable-rate mortgages (~10 percent of outstanding mortgages in 2013) are included in the sample.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Rate Gap:&lt;/strong&gt; The difference between a borrower&amp;rsquo;s outstanding fixed mortgage rate and the estimated current 30-year fixed mortgage rate available to that borrower if they were to refinance (adjusting for borrower-specific LTV and credit score). A positive rate gap means the borrower is &amp;ldquo;in the money&amp;rdquo; for a rate-term refinance. This is the paper&amp;rsquo;s central measure of refinance incentive, determining whether cash-out refinancing or an alternative borrowing product is the cost-minimizing option for satisfying a given liquidity need.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt Substitution:&lt;/strong&gt; The paper&amp;rsquo;s core mechanism: households shift their new borrowing across products (cash-out refinance, HELOC, CES, credit card, personal loan) in response to changes in relative borrowing costs, without proportionally changing total new borrowing. When the rate gap is positive, cash-out refinancing is the cheapest way to borrow (it lowers the rate on the existing balance while providing liquidity), so borrowers substitute from alternative products into cash-out. When the rate gap is negative or mortgage rates rise, borrowers substitute in the opposite direction, keeping their original mortgage rate intact by using alternative products.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash-Out Refinance Channel of Monetary Policy:&lt;/strong&gt; The theoretical transmission mechanism by which monetary easing lowers mortgage rates, incentivizes in-the-money borrowers to refinance and extract home equity at reduced cost, and thereby stimulates consumption. Prior literature (Eichenbaum, Rebelo and Wong 2022) treats this channel as path-dependent and quantitatively important because it depends on the distribution of outstanding mortgage rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Path Dependence of Monetary Policy:&lt;/strong&gt; The property by which the same monetary policy shock generates different aggregate borrowing or consumption responses depending on the historical distribution of outstanding fixed mortgage rates, which reflects prior monetary policy. A large share of in-the-money borrowers (due to a prior rate-cutting cycle) amplifies the cash-out refinance channel; a large share of out-of-the-money borrowers weakens it. The paper shows this path dependence is substantially attenuated by debt substitution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;In-the-Money Borrower:&lt;/strong&gt; A borrower whose outstanding mortgage rate exceeds the current market mortgage rate (positive rate gap), creating a financial incentive to refinance. In-the-money status interacts with borrowing product choice because a cash-out refinance resets the interest cost on the entire existing balance, generating implicit savings that partially subsidize new liquidity extraction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hurdle (Two-Tiered) Model:&lt;/strong&gt; An estimation approach that allows the decision to borrow (extensive margin) and the amount borrowed conditional on borrowing (intensive margin) to respond differently to covariates. The authors use this model to combine extensive and intensive margin estimates into a single aggregate borrowing elasticity, avoiding the distortion that arises from using dollar volume as a dependent variable when intensive and extensive margins have opposite responses to the rate gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taper Tantrum (2013):&lt;/strong&gt; A quasi-experimental shock used as the paper&amp;rsquo;s main source of exogenous variation. In late spring 2013, Federal Reserve communications about tapering large-scale asset purchases (LSAPs) caused the 30-year fixed mortgage rate to increase approximately 80 basis points within one month. Because the shock operated through long-term rate expectations, it moved mortgage rates without significantly affecting HELOC or consumer credit rates (tied to the unchanged federal funds and bank prime rates), enabling the authors to estimate substitution holding alternative product rates approximately fixed.&lt;/p&gt;</description></item><item><title>Bottom-Up Markup Fluctuations</title><link>https://macropaperwarehouse.com/papers/bottom-up-markup-fluctuations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bottom-up-markup-fluctuations/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper asks how firm-level, sector-level, and aggregate markups comove with output at different levels of aggregation, and whether a single structural model can reconcile seemingly contradictory empirical findings about markup cyclicality that arise when researchers use different aggregation schemes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a granular macroeconomic model featuring oligopolistic competition with a nested constant-elasticity-of-substitution (CES) demand structure following Atkeson and Burstein (2008). The economy contains N sectors, each with a discrete number of firms competing under Cournot oligopoly with flexible prices. Firm-level markups are endogenously increasing in within-sector market shares: under Cournot, the sectoral markup is a simple function of the sector&amp;rsquo;s Herfindahl-Hirschman index (HHI), and the aggregate markup is a function of the expenditure-share-weighted average of sectoral HHIs. Firm-level productivity follows a discretized random growth (Gibrat&amp;rsquo;s law) process as in Carvalho and Grassi (2019), generating fat-tailed firm-size distributions and granular aggregate fluctuations. The baseline calibration features only idiosyncratic firm-level productivity shocks and abstracts from aggregate shocks, because—in the model—aggregate shocks that move all firms proportionately do not affect relative market shares and hence do not affect markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses French administrative firm-level data from the FICUS-FARE datasets covering the universe of French firms from 1994 to 2019, yielding approximately 9.38 million firm-year observations across 26 years, 22 two-digit sectors, and 275 five-digit NAF sectors. Firm-level markups are estimated following De Loecker and Warzynski (2012) using a translog production function estimated by GMM (following De Ridder et al. 2024) on a subsample of approximately 220,733 firm-year observations where physical output quantity is available from the Enquete Annuelle de Production survey (2009-2019). Using quantity rather than revenue as the output measure avoids the measurement biases documented in Bond et al. (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings and Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Markup-market-share relationship (firm level):&lt;/strong&gt; Regressions of the change in the inverse firm markup on the change in firm market share yield a negative and significant coefficient of approximately -0.268 to -0.293 (depending on fixed-effect specification), consistent with the model prediction that markups rise with market share. Sector-level analogues yield a slope of the change in inverse sector markup on the change in sector HHI of approximately -0.37, which is simultaneously a calibration target (implying sigma = 1.8 given epsilon = 5) and an empirical moment the model closely matches (model counterpart: -0.36).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Within-between decomposition of sector markup changes:&lt;/strong&gt; In the model under Cournot competition, changes in firm-level markups (the &amp;ldquo;within&amp;rdquo; term) account for exactly 50% of changes in sector-level markups, with between-firm reallocation accounting for the other 50%. In the French data, for the median sector, the within term accounts for 59% of changes in sector markups (interquartile range across sectors: 34%-81%).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Firm-level markup cyclicality with sector output (heterogeneous by size):&lt;/strong&gt; The average firm&amp;rsquo;s markup is countercyclical with respect to own-sector output (beta_1 approximately -0.073 in levels specification), but this relationship reverses for large firms: firms with market shares roughly above 10% (top 0.1% of the market-share distribution) have procyclical markups (interaction coefficient beta_2 approximately 0.574 in levels). The model qualitatively and roughly quantitatively reproduces this heterogeneity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sector-level markup cyclicality with sector output (procyclical):&lt;/strong&gt; Following Nekarda and Ramey (2013), sector markup changes comove positively and significantly with sector output changes: estimated coefficient of 0.160 (standard error 0.040) in first-differences. The calibrated model yields a median coefficient of 0.139 (std dev 0.057 across 5,000 simulated 25-year samples), close to the data. Consistently, sector concentration (HHI) is also procyclical with sector output (estimated coefficient 0.332, std error 0.067 in first-differences).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sector-level markup cyclicality with aggregate output (acyclical to weakly countercyclical):&lt;/strong&gt; Following Bils et al. (2018), the comovement between sector markups and aggregate output is fragile in sign and significance: the French data yields a point estimate of -0.239 (std error 0.116) in first-differences, marginally significant (t-stat 2.06) and with sign sensitive to detrending method. The model without aggregate shocks predicts positive comovement (median coefficient 0.165) that is not statistically different from zero across samples. Adding aggregate productivity shocks (calibrated to match French aggregate output volatility) brings the model-implied coefficient close to zero (median 0.008), with 20-30% of 25-year simulated samples displaying countercyclical sectoral markups relative to GDP—consistent with the ambiguity in the data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Aggregate output volatility:&lt;/strong&gt; The baseline calibration with only granular firm-level shocks generates a standard deviation of detrended aggregate output of 0.83%, equal to 26% of the 3.16% observed in the French data. (The comparable granular ratio from Carvalho and Grassi 2019 for a perfectly competitive US model is 30%.) Variable markups dampen granular aggregate volatility: the standard deviation of aggregate output under variable markups is 0.87 times that under heterogeneous-but-constant markups (95% CI: 0.82-0.97), because incomplete pass-through reduces the effective weight of large firms in the price index.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Aggregate markup volatility:&lt;/strong&gt; In the data, the relative standard deviation of aggregate markup to aggregate output is 0.40-0.50 (depending on detrending). The model generates a relative volatility of 0.36 (median across samples). The correlation between aggregate markup and output in the data is at most 0.06; the model without aggregate shocks implies a counterfactually large median correlation of 0.91, which falls to 0.27 when aggregate TFP shocks are superimposed (with 16% of 25-year samples displaying countercyclical aggregate markups).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to French private-sector firms (including formerly government-owned firms, most of which privatized during the sample period) across manufacturing and some non-manufacturing sectors at the national-market level. The analysis abstracts from import competition (market shares are computed relative to all French firms in the sector), local geographic markets (relevant for non-tradeable goods where national-level shares understate local concentration), and multi-product firm structure. Findings are for a flexible-price model driven by idiosyncratic productivity shocks; the paper explicitly discusses how nominal rigidities would further strengthen procyclicality at the sector level.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central mechanism by which granular firm-level shocks generate markup cyclicality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because markups are endogenously increasing in within-sector market shares under oligopolistic competition, a firm that receives a positive productivity shock gains market share and therefore raises its markup, while its competitors lose market share and lower their markups. The net effect on the sectoral markup depends on the shocked firm&amp;rsquo;s initial size: a positive shock to a sufficiently large firm (above a threshold market share) raises the sectoral markup, while a positive shock to a small firm lowers it. Since sectoral expansions in a granular economy are disproportionately driven by large firms, sector output and sector markup tend to comove positively in the medium run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the sign of markup cyclicality differ depending on the level of aggregation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Sector-level markups react only to within-sector idiosyncratic shocks, so sectors that happen to be driven by large-firm booms display positive comovement between sector markup and sector output. However, a given sector&amp;rsquo;s markup is uncorrelated with aggregate output movements coming from other sectors. In small samples (such as 25-year windows), whether a sector&amp;rsquo;s markup comoves positively or negatively with aggregate output depends on whether the sector happens to lead or lag the aggregate cycle. Over sufficiently long samples, the model implies positive comovement of sector markups with aggregate output, but in finite samples the relationship is indeterminate. This asymmetry across aggregation levels explains why researchers using different reduced-form specifications in the same dataset can reach opposing conclusions about procyclicality versus countercyclicality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the within-between decomposition of sectoral markup changes and what does it imply quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Changes in the inverse sectoral markup can be decomposed into (i) a within term—changes in firm-level markups holding market shares fixed—and (ii) a between term—changes in market shares holding firm-level markups fixed. Under Cournot competition, the within and between terms are analytically equal in every period, so each accounts for exactly 50% of the change in sectoral markups; this 50-50 split holds globally (not only to first order). In the French data, for the median sector, within-firm markup changes account for 59% of sector markup changes (interquartile range across sectors: 34%-81%), close to but slightly above the model&amp;rsquo;s 50% prediction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do variable markups affect granular aggregate output volatility relative to a model with constant markups?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Variable markups (endogenous pass-through that is decreasing in firm size) reduce granular aggregate output volatility relative to a model where markups are heterogeneous but fixed. The intuition is that larger firms have lower pass-through rates, so their productivity shocks translate into smaller price changes and therefore smaller output responses than they would under constant markups—effectively reducing the weight of large firms in the aggregate price index in a way similar to a decline in market concentration. Quantitatively, using first-order approximations around equilibrium distributions from the calibrated model, the standard deviation of aggregate output under variable markups is 0.87 times that under heterogeneous-but-constant markups (95% confidence interval: 0.82-0.97). The overall standard deviation under variable and heterogeneous markups is only 1.02 times that under homogeneous and constant markups (95% CI: 0.99-1.14), meaning markup heterogeneity and variability together have limited net effects on aggregate output volatility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the model predict for firm-level markup cyclicality, and how heterogeneous is this across firm size?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 4 states that, in the asymptotic limit, firm-level markups comove positively with own-sector output for firms with market shares above a threshold, and negatively for firms below it. This occurs because large firms have a disproportionate impact on sector-level price and output (when the product of market share and pass-through rate is increasing in size), so large-firm shocks simultaneously drive sector expansions and raise large-firm markups while compressing small-firm markups. In the French data, the average firm&amp;rsquo;s markup is countercyclical with respect to sector output (beta_1 approximately -0.073 in log-levels with firm and year fixed effects), but firms with market shares above roughly 10% (top 0.1% of the distribution, since the average market share is only 0.07%) display procyclical markups (interaction coefficient beta_2 approximately 0.574). The model reproduces this qualitative pattern and the order of magnitude of these estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper calibrate the key demand elasticities, and what are the resulting pass-through implications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The within-sector substitution elasticity is set to epsilon = 5, a standard value. The cross-sector substitution elasticity sigma is calibrated to match the slope of the inverse sector markup on sector HHI in first-differences. The empirical slope is -0.37; under the model, the slope equals -(epsilon/sigma - 1)/(epsilon - 1), and given epsilon = 5, sigma = 1.8 delivers a model counterpart of -0.36. These parameter values imply own-cost pass-through rates that are decreasing in firm size; for large firms (with market share &amp;gt;= 57%, approximately the top 0.004% of the distribution), the implied pass-through rate is 0.63, within the confidence intervals reported in Amiti, Itskhoki, and Konings (2019) for large Belgian firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why do aggregate productivity shocks not affect markups in the model, and what are the implications for aggregate markup cyclicality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the model, firm-level markups are functions of within-sector market shares, not the level of productivity. An aggregate shock that shifts all firms&amp;rsquo; productivity proportionately leaves relative market shares unchanged and therefore leaves all markups unchanged. This means aggregate shocks increase aggregate output volatility but leave markup volatility unchanged, reducing the correlation between aggregate markup and aggregate output. When aggregate TFP shocks are added to match French aggregate output volatility, the model-implied median correlation between aggregate markup and output falls from 0.91 (without aggregate shocks) to 0.27 (with aggregate shocks), while 16% of 25-year simulated samples display countercyclical aggregate markups—more consistent with the weak and fragile empirical relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the paper address the potential measurement-error bias in the negative correlation between markups and marginal costs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Since marginal cost is computed as price divided by estimated markup, regressing market shares or markups on marginal costs risks spurious correlation via measurement error in the markup (which appears in both sides). The authors address this concern by constructing an instrumental variable for marginal cost based on firm-specific energy intensity interacted with energy price changes, following Ganapati, Shapiro, and Walker (2020). Table A10 confirms that instrumenting for marginal cost yields negative effects on both markup and market share with larger point estimates than the OLS specifications in Table 4, validating the baseline findings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Is the 50-50 within-between decomposition of sectoral markup changes robust to the choice of competition mode?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. The exact 50-50 split of within and between terms in sectoral markup changes is a specific property of Cournot competition and holds globally (not just as a first-order approximation). Under Bertrand competition, the within and between terms are generally not equal to each other. The paper derives analytic results under both competition modes and focuses on Cournot for quantitative work because it generates more markup variation and better matches the estimated pass-through rates and markup-size relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What do model simulations imply for the magnitude and cyclicality of aggregate markups versus the data, and what is the role of variable versus constant markups?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the data (detrended), the standard deviation of aggregate markup is 1.27% with a relative volatility (to output) of 0.40 and a correlation with output of 0.03. The baseline model with only granular shocks yields a median markup standard deviation of 0.30%, relative volatility of 0.36, and correlation with output of 0.91. The model with aggregate shocks added yields median markup standard deviation of 0.30%, relative volatility of 0.09, and correlation of 0.27. Counterfactually fixing markups at their initial heterogeneous levels while keeping the same market shares and shock variance yields aggregate markup standard deviation approximately 0.93 times the variable-markup value (standard deviation of markups under variable markups is 1.08 times that under constant markups, with a 95% CI of 1.00-1.18), and a correlation with output of 0.92 versus 0.87 under variable markups. Overall, the magnitude and cyclicality of aggregate markups are not substantially different between variable and constant-markup specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper reconcile its findings with prior literature on markup cyclicality (Bils et al. 2018 vs. Nekarda and Ramey 2013)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Nekarda and Ramey (2013) find procyclical sector markups with respect to sector output in US data—a result replicated in French data (beta approximately 0.160). Bils, Klenow, and Malin (2018) find countercyclical sector markups with respect to aggregate output in US data. Both results can be generated simultaneously in the model: sector markups are positively correlated with own-sector output because granular booms in a sector are driven by large-firm expansions that raise sector markups; however, a given sector&amp;rsquo;s markup is weakly and ambiguously correlated with aggregate output because aggregate fluctuations reflect shocks across many sectors, only some of which are in the same sector. The model can therefore simultaneously predict procyclicality with respect to sector output and an acyclical-to-weakly-countercyclical relationship with aggregate output—explaining why both empirical findings can be correct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the data limitations and how do they affect the interpretation of results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Three limitations are noted. First, market shares are computed relative to total revenue of all French firms in the sector without accounting for imports, so foreign competition is ignored and domestic concentration may be overestimated. Second, revenues are reported at the national level, so for non-tradeable goods (whose relevant market is local) the paper underestimates true local market concentration, attenuating the markup-concentration relationship in those sectors. Third, the model abstracts from entry and exit (the number of firms per sector is held fixed at sector-year averages), though Appendix D demonstrates robustness of main empirical results to restricting the sample to continuing firms.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Granular macroeconomic model:&lt;/strong&gt; A model in which the economy consists of a finite (large but discrete) number of firms, so that idiosyncratic firm-level shocks to large firms do not average out and instead generate aggregate fluctuations. In the paper&amp;rsquo;s usage, granularity means that sectoral and aggregate business-cycle fluctuations are driven primarily by shocks to the largest firms, which also have the highest markups and market shares.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nested CES demand structure (Atkeson-Burstein):&lt;/strong&gt; A two-level constant-elasticity-of-substitution aggregation where the final good aggregates N sectors with cross-sector elasticity sigma, and each sector aggregates the output of its Nk firms with within-sector elasticity epsilon &amp;gt; sigma. This structure generates firm-level markups that are endogenously increasing in within-sector market shares (under both Cournot and Bertrand competition) and yields closed-form expressions for sector-level markups as a function of sector HHI and aggregate markups as a function of the expenditure-weighted average of sector HHIs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markup elasticity with respect to market share (Gamma_ki):&lt;/strong&gt; Under Cournot competition, the semi-elasticity of firm i&amp;rsquo;s log markup with respect to its log market share, equal to (epsilon/sigma - 1)s_ki / (epsilon/(epsilon-1) - (epsilon/sigma - 1)s_ki). This is strictly positive for epsilon &amp;gt; sigma and increasing in market share, implying that larger firms have markups that are more responsive to changes in their competitive position.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pass-through rate (alpha_ki):&lt;/strong&gt; The fraction of an idiosyncratic cost shock that is passed into the firm&amp;rsquo;s price relative to the sectoral price index, given by 1/(1 + (epsilon-1)Gamma_ki). Pass-through is decreasing in market share (larger firms have lower pass-through), which dampens their price response to own shocks and mutes the impact of large-firm shocks on aggregate price volatility—acting like a reduction in market concentration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Within-between decomposition of sector markup changes:&lt;/strong&gt; The change in inverse sector markup decomposed into (i) a within term measuring changes in firm-level markups holding market shares fixed, and (ii) a between term measuring reallocation of market shares across firms with heterogeneous markups. Under Cournot competition, these two terms are exactly equal (each 50%) for any firm-level shocks—a result that holds globally (not merely as a first-order approximation)—because the forces that increase the within term (higher markup sensitivity) also raise heterogeneity between firms (increasing the between term).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sectoral markup (mu_kt):&lt;/strong&gt; Defined as the ratio of sectoral revenues to total wage payments in the sector, equal to the harmonic mean of firm-level markups weighted by market shares. Under Cournot competition, this is a simple increasing function of the sector&amp;rsquo;s HHI: mu_kt = (epsilon/(epsilon-1))[1 - (epsilon/sigma - 1)/(epsilon-1) x HHI_kt]^(-1). This mapping between concentration and the markup price-cost wedge gives the central empirical prediction tested at the sector level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markup cyclicality (at different aggregation levels):&lt;/strong&gt; The comovement between markups and output, which the paper distinguishes sharply across three levels: (i) firm markup vs. own-sector output—countercyclical for small firms, procyclical for large firms; (ii) sector markup vs. own-sector output—procyclical (positive covariance) under conditions proven in Proposition 3; (iii) sector markup vs. aggregate output—theoretically positive over long samples but ambiguous and close to zero in short samples, because aggregate output also reflects shocks to other sectors whose markups are uncorrelated with the focal sector&amp;rsquo;s markups. The paper&amp;rsquo;s central insight is that the same underlying model generates all three empirical patterns simultaneously.&lt;/p&gt;</description></item><item><title>Bridges</title><link>https://macropaperwarehouse.com/papers/bridges/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bridges/</guid><description>&lt;p&gt;This paper measures the causal effects of land transport infrastructure on economic activity, exploiting quasi-experimental variation in bridge construction over the Mississippi and Ohio Rivers in the United States. The central empirical puzzle motivating the study is a hump-shaped relationship between per capita income and distance to major land transport routes in contemporary U.S. data: income peaks around 5 km from a transport route, with an elasticity of 0.072 closer than 4.1 km and -0.096 at greater distances, so that 85% of Americans live where local income increases with distance to transport routes rather than decreasing. The question is whether this pattern reflects causal effects of infrastructure, selection, or sorting.&lt;/p&gt;
&lt;p&gt;The paper develops two complementary identification strategies. The first exploits tributary confluences — where smaller rivers join larger rivers, sharply raising downstream flow rates and bridge construction costs — to generate quasi-random variation in bridge location. Because bridge construction costs increase convexly with river flow (maximum bending moment scales with span length squared), bridges are disproportionately built just upstream of confluences. The median upstream census tract lies 0.7 km from a bridge versus 2.3 km for the median downstream tract, making upstream tracts on average 60% closer to bridges and 27% closer to the nearest major land transport route. This asymmetry dates to at least 1880 and persists to 2010. Despite this persistent connectivity advantage, by 2010 upstream tracts have 13% lower per capita incomes and 63% higher population densities than downstream neighbours. The implied elasticity of per capita income with respect to distance to land transport, scaling the income effect by the distance-to-transport effect, is approximately 0.44. Income density (income per unit area) is higher upstream, though the difference is not statistically significant. Historical placebo tests using pre-bridge-construction data show no asymmetry in land values or population upstream versus downstream, supporting the identification assumption.&lt;/p&gt;
&lt;p&gt;The second strategy exploits variation in the timing of bridge construction. Because major bridge projects involve decades of planning, financing, design, and construction — the Wheeling Suspension Bridge was chartered in 1816 but opened in 1849 — the precise opening date is argued to be exogenous to short-run deviations from local growth trends. Using a county-level panel from 1860 to 2010 (432 counties, 14–19 states), the paper estimates event-study regressions around the first time a county experiences a 50% reduction in distance to a bridge. After such a reduction, farm land values (the best available consistent proxy for total economic activity in historical data) rise immediately and cumulatively by approximately 9% over 30 years. Population rises by approximately 5% over the same period. The proportionally larger rise in land values than population implies higher per capita economic activity in better-connected counties after 30 years.&lt;/p&gt;
&lt;p&gt;These two sets of results are reconciled through a narrative account of development. Better bridge access drives industrialization — manufacturing employment shares rise in counties experiencing improved connectivity — and urbanization. Cities form around historical transport routes and expand. Richer households then sort away from historical city centres into lower-density suburban areas, while lower-income households remain near or selectively migrate to the historical transport corridors. This within-city sorting produces the observed cross-sectional gradient: areas nearest transport routes end up with higher population density but lower per capita incomes. The negative local income effect of proximity to transport routes is larger in more urbanized areas and areas with higher income inequality, and is concentrated among non-white and low-education populations.&lt;/p&gt;
&lt;p&gt;The paper also contributes a new dataset covering every road and rail bridge (237 total) ever constructed over the Mississippi and Ohio Rivers from 1849 to 2010, assembled from the National Bridge Inventory and extensively cross-checked with satellite imagery and historical sources.&lt;/p&gt;
&lt;p&gt;Q: What is the motivating empirical puzzle about transport infrastructure and income?&lt;/p&gt;
&lt;p&gt;A: In contemporary U.S. census data, per capita income does not monotonically increase with proximity to land transport routes. Instead, the relationship is hump-shaped: income peaks around 5 km from a major transport route, with a positive elasticity of 0.072 within 4.1 km and a negative elasticity of -0.096 beyond that distance. Population density, by contrast, falls monotonically with distance to transport routes. As a result, 85% of Americans live in places where local mean income increases with distance to transport infrastructure rather than decreasing.&lt;/p&gt;
&lt;p&gt;Q: How does the tributary confluence identification strategy work?&lt;/p&gt;
&lt;p&gt;A: Tributary confluences — where smaller rivers join the main river — cause sharp, localized increases in river flow rates and thus in bridge construction costs, because cost scales convexly with required span length. This makes bridges systematically more likely to be built just upstream of confluences than just downstream. The strategy compares census tracts located upstream versus downstream of the 27 major tributary confluences identified on the Mississippi and Ohio Rivers, controlling for nearest-tributary fixed effects and distance to the confluence.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the connectivity difference between upstream and downstream census tracts?&lt;/p&gt;
&lt;p&gt;A: Upstream census tracts are approximately 60% closer to a bridge than downstream tracts (coefficient of 0.91 in log distance to bridge, p &amp;lt; 0.01), and consequently 27% closer to the nearest major land transport route (coefficient of 0.32, p &amp;lt; 0.10). This asymmetry is established by 1880 and persists through 2010. The advantage arises approximately equally from proximity to railroads and primary roads.&lt;/p&gt;
&lt;p&gt;Q: What are the causal effects of this connectivity advantage on per capita income and population density?&lt;/p&gt;
&lt;p&gt;A: Despite being better connected, upstream census tracts have 13% lower per capita incomes (coefficient 0.14 on the downstream indicator in log per capita income, p &amp;lt; 0.05) and 63% higher population densities (coefficient -0.49 on the downstream indicator in log population density, p &amp;lt; 0.05) in 2010. Income density is higher upstream, but the difference is not statistically distinguishable from zero. Scaling the income effect by the effect on distance to land transport implies an elasticity of approximately 0.44.&lt;/p&gt;
&lt;p&gt;Q: What pre-bridge-era placebo tests support the identifying assumption for the tributary confluence strategy?&lt;/p&gt;
&lt;p&gt;A: Matching modern census tracts to county-level historical data from 1840 and 1850 (before substantive bridge construction began), the paper finds no statistically significant asymmetry in land values or population density upstream versus downstream of tributary confluences. Asymmetric patterns emerge only after bridge construction begins. Ferry crossing locations, traced through place names in the USGS Geographic Names database, also appear equally frequently upstream and downstream, suggesting ferries did not differentially locate upstream.&lt;/p&gt;
&lt;p&gt;Q: How does the timing-based identification strategy work, and what is its key assumption?&lt;/p&gt;
&lt;p&gt;A: The strategy uses a county-level panel from 1860 to 2010 and estimates event-study regressions around the first time a county experiences a 50% reduction in distance to a bridge. County fixed effects and county-specific quadratic time trends absorb all fixed differences across counties and average changes in trends. The key assumption is that the exact opening date of a bridge is exogenous to short-run deviations from local long-run growth trends — supported by the argument that major bridges involve decades-long planning processes that evolve independently of local economic fluctuations. Pre-trend tests show no significant differences in outcomes before the event.&lt;/p&gt;
&lt;p&gt;Q: What are the quantitative effects of a major improvement in bridge access on land values and population?&lt;/p&gt;
&lt;p&gt;A: After a county first experiences a 50% reduction in distance to a bridge, farm land values rise immediately and cumulatively by approximately 9% (cumulative effect on log land values of about 0.09) over 30 years, relative to counties with no such change. Population rises by approximately 5% (cumulative log effect of about 0.05) over the same period. The proportionally larger effect on land values than on population implies that per capita economic activity is higher in better-connected counties 30 years after the event. The divergence between land value and population effects grows over time, suggesting productivity advantages accumulate.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper use farm land values rather than other income measures in the historical panel?&lt;/p&gt;
&lt;p&gt;A: Farm land values — the total value of farm land and buildings — are the best consistently measured proxy for total economic activity available throughout the 1860–2010 census panel. The paper notes explicitly that as the economy industrializes and urbanizes, farm land values increasingly miss urban land values, implying that the estimated effects on farm land values are likely lower bounds on the true effects on total economic activity.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address the concern that bridge timing might reflect anticipated local growth?&lt;/p&gt;
&lt;p&gt;A: The paper shows that results hold when restricting to counties whose distance to a bridge is only affected by bridges constructed in other counties, addressing the concern that local planners might time construction in anticipation of local growth. The results are also insensitive to controlling for pre-period trends, and outcomes of interest are uncorrelated with future changes in distance to a bridge in preferred specifications.&lt;/p&gt;
&lt;p&gt;Q: How does the paper reconcile the negative local income effect (tributary confluence strategy) with the positive aggregate effect (timing strategy)?&lt;/p&gt;
&lt;p&gt;A: The reconciliation proceeds through a narrative account combining industrialization, urbanization, and within-city sorting. Better bridge access drives a shift toward manufacturing employment and attracts population, consistent with a productivity advantage enabling exploitation of economies of scale. Cities form around historical transport routes. As cities mature and expand, richer households sort into lower-density suburban areas further from the historical transport corridor, while lower-income households remain near or migrate to the city centre. This within-city sorting produces lower per capita incomes near transport routes even as aggregate economic activity is higher in better-connected areas.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the within-city sorting mechanism specifically?&lt;/p&gt;
&lt;p&gt;A: The negative income effect of proximity to transport routes is larger in more urbanized areas and in areas with higher income inequality. The effect is concentrated in areas that were more rapidly urbanizing in the 19th century, and it is stronger for non-white and low-education populations. Upstream census tracts simultaneously show higher manufacturing employment shares and higher population densities, consistent with cities having formed around transport routes, followed by residential sorting away from the core.&lt;/p&gt;
&lt;p&gt;Q: What are the two novel identification strategies and their broader applicability?&lt;/p&gt;
&lt;p&gt;A: The tributary confluence strategy exploits discontinuities in bridge construction costs generated by sharp increases in river flow rates at confluences; it requires only that bridges are more likely built upstream of confluences than downstream, an asymmetry the paper shows is detectable elsewhere in the world from satellite imagery. The timing strategy exploits the multi-decade planning and construction process for major bridges as a source of near-exogenous variation in opening dates. Both strategies can be applied in other settings where major rivers form substantial barriers to land transport networks.&lt;/p&gt;
&lt;p&gt;Q: What does the paper contribute to the debate about whether early U.S. transport infrastructure followed or led economic development?&lt;/p&gt;
&lt;p&gt;A: The results support the view that early investments in land transport infrastructure led to meaningful changes in economic geography rather than merely following pre-existing growth patterns. However, the paper finds a moderate level of responsiveness — population density responds to bridge access over several decades, not immediately — consistent with a broader literature documenting sluggish population responses to changes in economic conditions.&lt;/p&gt;
&lt;p&gt;Tributary confluence: A location where a smaller river (tributary) joins a larger river, causing a sharp, localized increase in downstream flow rates and therefore a discontinuous increase in bridge construction costs, generating the quasi-experimental variation in bridge location exploited in the paper.&lt;/p&gt;
&lt;p&gt;Within-city sorting: The process by which, as cities expand around historical transport routes, richer households differentially relocate to lower-density suburban areas further from the transport corridor while lower-income households remain near or migrate to the historical city centre, reversing the income gradient at small spatial scales.&lt;/p&gt;
&lt;p&gt;Income density: The product of population density and per capita income, corresponding to total economic activity per unit area; the paper finds income density is higher in better-connected upstream census tracts even when per capita income is lower, reflecting the dominant effect of higher population density.&lt;/p&gt;
&lt;p&gt;Farm land values: The total value of farm land and buildings, used as the best consistently available proxy for total economic activity in the 1860–2010 historical county panel; the paper treats estimated effects on farm land values as lower bounds on effects on total economic activity because farm values increasingly miss urban land as the economy industrializes.&lt;/p&gt;
&lt;p&gt;Structural transformation: The shift in the composition of employment away from agriculture and toward manufacturing, which the paper documents occurring in counties that experience improved bridge access, interpreted as evidence that transport infrastructure provides a productivity advantage attracting industrial activity.&lt;/p&gt;
&lt;p&gt;Distance to a bridge (as proxy for land transport access): In the study area along the Mississippi and Ohio Rivers, where all land has comparable water access, distance to the nearest bridge strongly predicts distance to the nearest major land transport route (rail or primary road), allowing bridge distance to serve as a consistent measure of transport connectivity throughout the entire study period.&lt;/p&gt;
&lt;p&gt;Market access: A measure of economic connectivity that captures both the state of the transport network and the size of accessible markets; the paper notes that log distance to a bridge explains 46% of the variation in market access in 1890 (from Donaldson and Hornbeck&amp;rsquo;s data) with an elasticity of approximately 0.1, and that halving distance to a bridge increases market access by approximately 7%.&lt;/p&gt;</description></item><item><title>Bridging micro and macro production functions: The fiscal multiplier of infrastructure investment</title><link>https://macropaperwarehouse.com/papers/bridging-micro-and-macro-production-functions-the-fiscal-multiplier-of-infrastructure-investment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/bridging-micro-and-macro-production-functions-the-fiscal-multiplier-of-infrastructure-investment/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates the fiscal multiplier of infrastructure investment, specifically by incorporating firm-level investment decisions — a dimension absent from prior literature. The central analytical challenge is bridging the micro (firm-level) and macro (state-level) production functions for infrastructure, given that public capital is non-rivalrous: it can be used simultaneously by all firms without being depleted. The paper demonstrates that this non-rivalry generates a systematic discrepancy between firm-level and aggregate-level estimates of the elasticity of substitution between private and public capital, and it shows how this discrepancy shapes the magnitude of the fiscal multiplier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build and estimate a heterogeneous-firm general equilibrium model. Firms operate a constant-elasticity-of-substitution (CES) production function using private capital, non-rivalrous public capital (infrastructure), and labor. Firms are subject to idiosyncratic productivity shocks and make lumpy investment decisions subject to both fixed and convex capital adjustment costs, following Cooper and Haltiwanger (2006) and Winberry (2021). The economy has two regions — one with poor infrastructure and one with good infrastructure — motivated by the near-invariant cross-state distribution of infrastructure spending observed in U.S. data.&lt;/p&gt;
&lt;p&gt;The model is estimated via an extended Simulated Method of Moments (SMM) that treats market clearing prices as additional parameters estimated simultaneously with structural parameters, reducing computational cost relative to standard GE estimation. Estimation uses a multi-block Metropolis-Hastings algorithm. Target moments include lumpy investment fraction (0.14, from Zwick and Mahon 2017), average investment-to-capital ratio (0.10), standard deviation of i/k (0.16), private-to-infrastructure capital ratio (0.75, from BEA), high-infrastructure region&amp;rsquo;s private capital share (0.83, from Census BDS), and total working hours (0.33).&lt;/p&gt;
&lt;p&gt;The identification of the key parameter — the firm-level elasticity of substitution between private and public capital (λ) — comes from the relative size of private capital stocks across the two infrastructure groups: under greater complementarity, regions with more infrastructure should hold relatively more private capital.&lt;/p&gt;
&lt;p&gt;External validation is provided by estimating the state-level elasticity from the model&amp;rsquo;s simulated data using a nonlinear least squares method following An et al. (2019), and comparing it to empirical state-level estimates from actual U.S. state data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Firm-level vs. aggregate-level elasticity gap.&lt;/strong&gt; The estimated firm-level elasticity of substitution is λ = 1.185, implying gross substitutability between private and public capital at the firm level. The state-level elasticity implied by the same model is 0.48 (or 0.35 in a decreasing-returns-to-scale specification), implying gross complementarity. The empirical state-level counterpart estimated from actual U.S. data is 0.445. The paper proves theoretically (Proposition 1) that, given non-rivalry and under mild conditions, firm-level gross substitutability implies aggregate-level gross complementarity. Proposition 2 further shows that this same mechanism micro-founds the increasing-returns-to-scale assumption in Baxter and King&amp;rsquo;s (1993) Cobb-Douglas aggregate production function.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Fiscal multiplier (baseline, 2-year horizon).&lt;/strong&gt; The aggregate output multiplier over a 2-year horizon in the heterogeneous-firm general equilibrium model is &lt;strong&gt;1.088&lt;/strong&gt; in response to a one-time unexpected infrastructure spending shock equal to 1% of steady-state GDP, financed by a lump-sum tax. The corresponding partial-equilibrium output multiplier (holding prices fixed at steady state) is 1.858; the gap reflects crowding out of private investment induced by the general equilibrium interest rate response. In the baseline, the interest rate rises by 0.39% after the shock; the investment multiplier is -0.043.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Comparison with representative-agent model.&lt;/strong&gt; When the same implied returns-to-scale parameters are used in a representative-agent model (following Baxter and King 1993), the output multiplier is 0.991 and the investment multiplier is -0.157, both substantially lower than the heterogeneous-firm baseline. The key mechanism: under convex adjustment costs, the Jensen&amp;rsquo;s inequality effect implies that heterogeneous firms face a greater average adjustment burden than the representative firm, making their investment less responsive to the general equilibrium crowding-out pressure.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sensitivity to elasticity of substitution.&lt;/strong&gt; Across the heterogeneous-firm model: at λ = 3 (high substitutability), the output multiplier falls to 0.672; at λ = 0.5 (complementarity), it rises to 1.364. The multiplier is significantly more sensitive to λ in the heterogeneous-firm model than in the representative-agent model, because non-rivalry amplifies the effect of any given elasticity value through each firm&amp;rsquo;s production function.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Cross-state distribution of gains.&lt;/strong&gt; Under the baseline spending allocation (81% to Good states, 19% to Poor states), per $1 of infrastructure spending, Good states receive $1.072 of the $1.088 total output gain, while Poor states receive only $0.016. In a counterfactual with equal spending across states, the total output multiplier falls to 0.873, Good states&amp;rsquo; output multiplier falls to 0.810, and Poor states&amp;rsquo; output multiplier rises to approximately 0.062 (about four times the baseline level of 0.016). This quantifies a sharp efficiency-equality trade-off in the allocation of infrastructure investment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Employment and earnings effects.&lt;/strong&gt; Compared to steady state, the baseline fiscal shock produces an average annual increase of 0.304% in employment and 0.389% in wages, yielding a $0.713 increase in earnings and a $0.148 increase in consumption per $1 of fiscal spending in general equilibrium. In partial equilibrium (no price changes), earnings increase by $1.294 and consumption by $0.605 per $1 spent.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are conditional on: (i) lump-sum tax financing of the fiscal shock; (ii) a one-time unexpected (MIT) shock with no persistence; (iii) a closed-economy framework with endogenous real interest rate; (iv) the estimated two-region structure calibrated to U.S. state-level infrastructure data; (v) firm-level investment dynamics calibrated to Compustat and BDS moments. The authors note that incorporating time-to-build assumptions (tested in an appendix) reduces the aggregate fiscal multiplier, consistent with Ramey (2020).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core theoretical result connecting firm-level and aggregate-level elasticities, and what is the intuition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 1 proves that, given non-rivalrous public capital and mild data conditions (at least one firm has private capital below total infrastructure, and aggregate private capital exceeds total infrastructure), if the firm-level elasticity of substitution λ ≥ 1 (gross substitutes), then the aggregate-level elasticity ξ &amp;lt; 1 (gross complements). The intuition is that a marginal increase in public capital raises the marginal product of private capital for every firm simultaneously due to non-rivalry; the sum of these MPK gains across all firms exceeds any single firm&amp;rsquo;s gain. To represent this amplified benefit within an aggregate production function, a stronger complementarity is required than what any single firm faces. Put differently, non-rivalry means aggregate private and public capital &amp;ldquo;look&amp;rdquo; more complementary than they truly are at the firm level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does non-rivalry micro-found the Baxter-King aggregate production function?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 2 shows that if firms use a CES production function with gross substitutability (λ ≥ 1) and non-rivalrous public capital, then fitting aggregate output with a Cobb-Douglas production function (as in Baxter and King 1993, H(K,N,L) = zK^α L^{1-α} N^ζ) yields ζ &amp;gt; 0, implying increasing returns to scale (IRS). This is the paper&amp;rsquo;s micro-foundation for a widely-used but previously ad hoc assumption in the macro-fiscal literature. The corollary states that both gross complementarity in the aggregate CES function and IRS in the aggregate Cobb-Douglas follow from the same non-rivalry mechanism at the firm level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why does the heterogeneous-firm model produce a higher output multiplier than the representative-agent model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Two mechanisms drive the difference. First, due to Jensen&amp;rsquo;s inequality and the convexity of adjustment costs, heterogeneous firms face a higher average adjustment burden than the representative (average) firm; this means heterogeneous firms are less responsive to interest rate changes that crowd out investment. The investment multiplier is -0.043 in the heterogeneous-agent baseline versus -0.157 in the representative-agent model. Second, the fixed adjustment cost (present in the baseline but absent from the representative-agent model) further dampens investment sensitivity via the extensive margin. Because less private investment is crowded out, more of the direct output boost from infrastructure spending survives into the aggregate multiplier, yielding 1.088 versus 0.991.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the novel estimation procedure and why is it necessary?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Standard SMM applied to GE models requires solving for market-clearing prices for every candidate parameter vector, creating a nested optimization loop that is computationally prohibitive. The authors extend SMM by treating market-clearing prices (wage w and marginal utility of consumption p) as additional parameters and appending market-clearing conditions as additional target moments — effectively requiring those moments to equal zero. A multi-block Metropolis-Hastings algorithm jointly draws from the price block and the parameter block. This approach generates posterior draws that simultaneously satisfy market clearing and fit empirical moments, without the inner loop. The resulting market-clearing accuracy is e^{-4} at the posterior mean.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How is the firm-level elasticity of substitution (λ) identified from the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: λ is identified from the cross-state difference in private capital stocks between high- and low-infrastructure regions. Under the model, if private and public capital are more complementary (lower λ), high-infrastructure regions should attract relatively more private capital. The data moment used is the Good region&amp;rsquo;s share of aggregate private capital (0.83 from Census BDS data). This identification strategy is analogous to Bartik-instrument approaches in the empirical literature, where a parameter governing cross-state sensitivity to aggregate shocks is identified from cross-sectional variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the model validated externally?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors compute the state-level elasticity from the estimated model by fixing firm-level parameters and re-estimating only the elasticity and regional productivity from the model&amp;rsquo;s simulated state-level data, using the same NLLS estimator as An et al. (2019). The model-implied state-level elasticity is 0.349 (DRS specification) or 0.482 (CRS specification). The empirical estimate from actual U.S. state-level data following the same estimator is 0.445. Both indicate gross complementarity at the state level, consistent with the theoretical prediction. This external validation is not used in the estimation itself, providing an independent check.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the roles of extensive vs. intensive investment margins in the crowding-out effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 9 decomposes the investment multiplier of -0.043 by investment margin. When only the extensive margin (the discrete decision of whether to invest) is allowed to respond, the investment multiplier is -0.032 — approximately 74% of the baseline crowding-out effect. When only the intensive margin (investment size conditional on adjusting) responds, the multiplier is -0.011 — about 25% of the total. Thus the extensive margin is the dominant channel through which higher interest rates crowd out private investment. When both margins are held fixed, the output multiplier rises to 1.139, confirming that investment crowding-out reduces the output multiplier by about 0.05.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the elasticity of substitution affect the fiscal multiplier quantitatively, and why does this matter more in the heterogeneous-firm model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the heterogeneous-firm GE model: λ = 3 gives an output multiplier of 0.672, λ = 1.185 (baseline) gives 1.088, and λ = 0.5 gives 1.364 — a range of 0.692. In the representative-agent model, the comparable range across the implied ζ values is much narrower (0.970 to 0.998). The amplification in the heterogeneous-firm model occurs because non-rivalry means each firm&amp;rsquo;s production function directly incorporates the public capital stock, so the elasticity parameter has first-order consequences for every firm&amp;rsquo;s investment incentive response to a fiscal shock. This heightened sensitivity underscores why accurately estimating λ at the firm level — rather than importing a state-level estimate — is critical for quantifying infrastructure multipliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the efficiency-equality trade-off in cross-state infrastructure allocation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under the baseline allocation (81% of infrastructure spending to Good states, 19% to Poor states), per $1 of infrastructure spending, the Good states receive $1.072 of output gains and Poor states receive only $0.016. In the equal-spending counterfactual, the total output multiplier falls from 1.088 to 0.873. The Poor states&amp;rsquo; output multiplier rises from $0.016 to $0.062 (approximately fourfold), while the Good states&amp;rsquo; falls from $1.072 to $0.810. The Poor states also see earnings multipliers more than double (from $0.017 to $0.042). This trade-off arises because Good states have both more private capital (benefiting from non-rivalry) and higher estimated TFP — so each dollar of infrastructure is more productive there. Equal allocation reduces aggregate efficiency while partially mitigating regional inequality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the paper&amp;rsquo;s multiplier estimates compare to the existing literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In partial equilibrium (no GE adjustment), the authors find an output multiplier of 1.858, consistent with Chodorow-Reich&amp;rsquo;s (2019) cross-sectional multiplier of approximately 1.8. Once the general equilibrium interest rate effect is included, the multiplier falls to 1.09, which falls within the 0.6-1.2 range from Ramey (2011). Literature using representative-agent models without non-rivalry (e.g., Ramey 2020) typically reports multipliers of 0.3 to 0.8 using returns-to-scale parameters of 0.07-0.12; the paper shows these correspond to fiscal multipliers of 0.847-0.882 in the representative-agent framework. The heterogeneous-firm model, once it incorporates the non-rivalry-corrected elasticities, yields a meaningfully higher multiplier of 1.088.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What role does time-to-build play, and how does the paper handle it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The baseline model assumes a time-to-build period s = 1 year (one-year lag before new infrastructure is productive). The paper notes in Appendix H that incorporating extended time-to-build reduces the aggregate fiscal multiplier, operating through two channels: a news effect (agents adjust behavior upon anticipating future infrastructure) and a general equilibrium effect endogenous to the news effect. This finding is consistent with Ramey (2020). The baseline results are therefore reported under the minimal one-year time-to-build assumption, with longer lags serving as a robustness check.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the role of region-specific TFP heterogeneity in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model includes two regions that differ both in infrastructure levels and in region-specific productivity (TFP) levels. The TFP of the Good region is estimated to be approximately double that of the Poor region (x = 2.064 for Good vs. 1 for Poor). This productivity difference is estimated to partially capture heterogeneous congestion effects (which are not separately modeled) and is estimated jointly with the infrastructure elasticity. The productivity differential is identified from the Good region&amp;rsquo;s share of aggregate output (0.849 in the data). The large TFP gap is also the reason why equal spending on Poor states generates a much smaller output gain than spending on Good states: not only is infrastructure utilization lower (fewer firms), but underlying productivity is also lower.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Non-rivalry of public capital&lt;/strong&gt;: The property by which infrastructure stock (Nj,t) enters each firm&amp;rsquo;s production function at the full regional level, not divided among firms. Formally, a single marginal unit of public capital raises every firm&amp;rsquo;s marginal product of private capital simultaneously, so the aggregate marginal product gain summed across firms exceeds any single firm&amp;rsquo;s gain. This is the central mechanism driving the micro-macro elasticity discrepancy in the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm-level elasticity of substitution (λ)&lt;/strong&gt;: The elasticity governing the degree of substitutability between private capital (k) and public infrastructure (N) in the firm&amp;rsquo;s CES production function. At λ = 1 the production function is Cobb-Douglas; λ &amp;gt; 1 is gross substitutability; λ &amp;lt; 1 is gross complementarity. In the paper&amp;rsquo;s estimation, λ = 1.185, meaning private and public capital are gross substitutes at the firm level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross substitutability vs. gross complementarity&lt;/strong&gt;: Two inputs are gross substitutes (complements) if an increase in the quantity of one raises (lowers) the demand for the other, holding output price fixed. In the paper&amp;rsquo;s framework, private and public capital are gross substitutes at the firm level (λ = 1.185 &amp;gt; 1) but gross complements at the state level (ξ ≈ 0.48 &amp;lt; 1), with non-rivalry explaining the inversion upon aggregation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convex adjustment cost&lt;/strong&gt;: A cost C(I,k) = (µ/2)(I/k)² · k that scales quadratically with the investment rate. In the heterogeneous-firm model, this cost plays a critical role: by Jensen&amp;rsquo;s inequality, heterogeneous firms&amp;rsquo; average adjustment burden under a convex cost exceeds that of the representative (average) firm, making aggregate investment less sensitive to interest rate changes and thereby dampening crowding out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fixed adjustment cost (ξ)&lt;/strong&gt;: A one-time overhead cost drawn from a uniform distribution [0, ξ̄], paid only when a firm makes a large-scale investment outside the &amp;ldquo;inaction band&amp;rdquo; [−νk, νk]. This cost generates lumpy investment at the firm level, with about 14% of firms making lumpy investments in any given year. It also creates an extensive margin of investment adjustment that accounts for approximately 74% of the baseline crowding-out effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fiscal multiplier (as defined in this paper)&lt;/strong&gt;: The ratio of the present value of aggregate output deviations from steady state to the present value of the fiscal spending shock, both summed over a T-year horizon. For the short run, T = 2 years; for the long run, T = 5 years. This is computed as a perfect-foresight transition path response to a one-time MIT shock equal to 1% of steady-state GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MIT shock (one-time unexpected shock)&lt;/strong&gt;: An unanticipated, non-persistent one-period deviation in infrastructure spending. The term &amp;ldquo;MIT shock&amp;rdquo; refers to a deterministic transition experiment where agents have perfect foresight about all future values after the initial shock occurs. This contrasts with persistent policy rules and allows isolating the dynamic effects of a one-time fiscal impulse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extended SMM with market-clearing moments&lt;/strong&gt;: The paper&amp;rsquo;s estimation innovation. Rather than solving for market-clearing prices at each parameter candidate (the standard costly inner loop), wages (w) and marginal utility of consumption (p) are treated as parameters with associated moments being the market-clearing conditions set to zero. A multi-block Metropolis-Hastings algorithm draws from the price block and the parameter block separately, generating posterior draws that jointly satisfy market clearing and empirical moment conditions.&lt;/p&gt;</description></item><item><title>Business, Liquidity, and Information Cycles</title><link>https://macropaperwarehouse.com/papers/business-liquidity-and-information-cycles/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/business-liquidity-and-information-cycles/</guid><description>&lt;p&gt;The paper studies how the two roles of stock markets — revealing information about firms&amp;rsquo; fundamentals (which guides capital allocation) and providing liquidity — interact, arguing that when stocks are used more intensively for liquidity, their prices reveal less information about fundamentals. The authors build a Grossman-Stiglitz-style trading model with two types of rational traders (&amp;lsquo;day&amp;rsquo; traders who value liquidity and &amp;rsquo;night&amp;rsquo; traders who value fundamentals) that generates endogenous noise in prices, derive an analytical measure of price informativeness (PI), and structurally estimate PI from firm-level panel data for 16 countries over 1984-2022, finding that PI declines in periods of insufficient funding liquidity (such as the Great Recession and the COVID-19 pandemic) and that these fluctuations are explained mostly by changes in trading activity rather than information quality. Integrating the trading module into a real business cycle model with heterogeneous firms calibrated to the United States, they simulate recessions: a stand-alone recession is &amp;lsquo;cleansing&amp;rsquo; — prices become more informative and allocation improves, mitigating output losses by 4.4% — whereas a recession coinciding with banking distress is &amp;lsquo;sullying&amp;rsquo; — agents rely more on stocks for liquidity, prices become less informative, and worsened misallocation magnifies output losses by 22%. A counterfactual with exogenous (rather than endogenous) information implies output would fall about 43% more than in the benchmark, which the authors read as evidence that endogenous information acquisition lets stock markets &amp;rsquo;lean against the wind&amp;rsquo; in recessions. All magnitudes are model-based and specific to the U.S. calibration.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-interaction-between-stock-market-roles-does-the-paper-study"&gt;Q1. What interaction between stock-market roles does the paper study?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper studies how the liquidity role of stock markets affects their information role: if stocks are used more intensively for liquidity, prices reveal less information about firms&amp;rsquo; fundamentals.&lt;/strong&gt; While the information and liquidity roles of stock markets are each well studied, their interaction is less understood; the authors ask whether using stocks for liquidity enhances or weakens their information role, how distress in other liquidity sources (such as banks) affects price informativeness, and how this contributes to the depth of recessions.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-trading-model-generate-the-information-liquidity-tradeoff"&gt;Q2. How does the trading model generate the information-liquidity tradeoff?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors extend Grossman and Stiglitz (1980) by replacing noise traders with two types of rational traders — &amp;lsquo;day&amp;rsquo; traders interested in liquidity and &amp;rsquo;night&amp;rsquo; traders interested in fundamentals — so that each type&amp;rsquo;s trades act as endogenous noise for the other.&lt;/strong&gt; In equilibrium a linear pricing function exists in which price informativeness depends on the relative weights of fundamental versus liquidity information in prices, and those weights are determined by how many day and night traders operate, their information choices, and how aggressively they trade. When funding markets malfunction, the economy relies more on stocks for liquidity, there are more day traders, and price informativeness declines.&lt;/p&gt;
&lt;h3 id="q3-what-is-price-informativeness-pi-and-how-is-it-estimated"&gt;Q3. What is Price Informativeness (PI), and how is it estimated?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Price Informativeness (PI) is defined analytically as a function of the dispersion of firm productivity, the dispersion of stock-price fluctuations, and their respective price loadings; in a high-PI market, a firm&amp;rsquo;s high relative stock price is a strong signal of positive information about its fundamentals.&lt;/strong&gt; The authors estimate PI structurally using firm-level panel data from 16 countries spanning 1984 to 2022. The linear relationship among stock prices, earnings, and stock liquidity holds independently of general-equilibrium considerations, which is what makes the structural estimation tractable.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-empirical-cyclical-properties-of-pi"&gt;Q4. What are the empirical cyclical properties of PI?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;PI exhibits cyclicality and, more importantly, declines in periods of insufficient funding liquidity, such as the Great Recession and the COVID-19 pandemic.&lt;/strong&gt; Decomposing PI into its four components, the authors show its fluctuations are mostly explained by changes in trading activity rather than by changes in information quality or the amount of information acquired.&lt;/p&gt;
&lt;h3 id="q5-how-is-the-trading-module-embedded-in-a-general-equilibrium-model-and-disciplined"&gt;Q5. How is the trading module embedded in a general-equilibrium model and disciplined?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The trading module is integrated into a real business cycle model with heterogeneous firms in which stock prices guide capital allocation, calibrated to the United States with two possibly correlated aggregate shocks — one to aggregate productivity and one to funding liquidity — to capture recessions with and without banking distress.&lt;/strong&gt; The calibrated model replicates the cyclical properties of the empirical PI measure without targeting them. The authors also discipline how much new information prices convey using price-investment correlations across firms and over time, concluding that new stock-price information is roughly as important as what decision makers already know.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-quantitative-real-effects-in-recessions"&gt;Q6. What are the quantitative real effects in recessions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In a stand-alone recession, increased uncertainty induces all traders to acquire more information, raising price informativeness and improving allocation, which mitigates output losses by 4.4% (&amp;lsquo;cleansing&amp;rsquo;); when a recession coincides with funding-market distress, heightened liquidity-driven trading makes prices less informative and worsens allocation, magnifying output losses by 22% (&amp;lsquo;sullying&amp;rsquo;).&lt;/strong&gt; The authors interpret the 22% figure as a sizable real effect of banking problems operating through a novel channel: the weakening of the information and allocative role of stock markets.&lt;/p&gt;
&lt;h3 id="q7-what-do-the-information-structure-counterfactuals-show"&gt;Q7. What do the information-structure counterfactuals show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;If information were exogenous rather than endogenously acquired, liquidity distress would reduce PI by more and output would decline about 43% more than in the benchmark — implying endogenous information acquisition lets stock markets &amp;rsquo;lean against the wind&amp;rsquo; during recessions.&lt;/strong&gt; The authors further find that halving the cost of information about fundamentals would make output declines about 5% smaller, whereas halving the cost of information about a stock&amp;rsquo;s liquidity would make declines about 2% larger, leading them to conclude that the welfare effect of transparency is nuanced — easier access to one type of information can make it harder to infer another.&lt;/p&gt;
&lt;h3 id="q8-what-are-the-main-limitations-and-scope-conditions"&gt;Q8. What are the main limitations and scope conditions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors flag two limitations: the framework assumes no feedback from the real economy back to financial markets (prices affect investment, but investment does not affect prices), and the counterfactuals focus on how the information environment affects price informativeness, abstracting from other channels through which information affects production.&lt;/strong&gt; Adding two-way feedback would sacrifice the tractability of linear pricing but could introduce additional magnification forces. All quantitative magnitudes are specific to the U.S. calibration.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;price informativeness (PI)&lt;/strong&gt; : the extent to which stock prices reveal to an outside observer the information that informed traders hold about firms&amp;rsquo; fundamentals; defined in the paper as an analytical function of productivity dispersion, price-fluctuation dispersion, and their price loadings, and estimated structurally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;day traders vs. night traders&lt;/strong&gt; : the paper&amp;rsquo;s two types of rational traders — day traders trade to satisfy liquidity needs, night traders trade on information about fundamentals — whose trades act as endogenous noise for one another, replacing the exogenous noise traders of Grossman-Stiglitz.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;funding liquidity vs. market liquidity&lt;/strong&gt; : funding liquidity is liquidity provided by intermediaries through credit; market liquidity is the ability to trade stocks to meet liquidity needs; when funding liquidity is scarce, agents substitute toward market liquidity, raising liquidity-driven trading.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;cleansing vs. sullying recessions&lt;/strong&gt; : in the paper&amp;rsquo;s usage, a cleansing recession improves allocation (here via more informative prices), while a sullying recession worsens it; a recession is cleansing without banking distress and sullying when it coincides with funding-market distress.&lt;/p&gt;</description></item><item><title>Can Trade Policy Mitigate Climate Change?</title><link>https://macropaperwarehouse.com/papers/can-trade-policy-mitigate-climate-change/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/can-trade-policy-mitigate-climate-change/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;Farrokhi and Lashkaripour (2025) study the interaction between trade policy and climate change. The central research question is whether and how countries can use trade policy — specifically import tariffs — to address carbon leakage arising from domestic carbon pricing. When a country prices carbon domestically, production and emissions can shift to countries without carbon pricing, partially offsetting domestic emissions reductions. The paper asks how optimal import tariffs should be designed to internalize this leakage, how they relate to standard terms-of-trade tariffs, and what additional gains multilateral coordination can deliver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology and Data.&lt;/strong&gt; The paper develops a multi-country, multi-sector trade model in which carbon emissions are proportional to output with sector-specific emission intensities, and countries choose trade taxes and subsidies strategically in Nash equilibrium alongside domestic carbon prices. The model is calibrated to 43 countries and 56 sectors using the 2014 baseline from the World Input-Output Database (WIOD 2016) for trade flows and input-output linkages, IEA data for sector-level carbon emissions, and GTAP for trade elasticities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; The paper&amp;rsquo;s first key result is that the optimal unilateral import tariff decomposes additively into a standard terms-of-trade component and a carbon leakage correction component. The carbon leakage correction is proportional to the emission intensity of imports from the exporting country in that sector and to the gap between the social cost of carbon and the actual domestic carbon price in the exporting country, divided by the import price. This decomposition implies that countries have incentives to impose import tariffs beyond those justified by standard terms-of-trade arguments, specifically to correct for the carbon embodied in imports from countries with insufficient carbon pricing.&lt;/p&gt;
&lt;p&gt;The paper derives a sufficient statistic for the optimal carbon tariff that depends only on observable trade elasticities and emission intensities, enabling calibration without full structural estimation beyond the model&amp;rsquo;s standard parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Magnitudes.&lt;/strong&gt; In the calibrated model, optimal unilateral carbon tariffs are on average 30% above standard optimal tariffs globally (28% above for the EU; 33% above for the US). The excess is largest in carbon-intensive sectors: petroleum products (41% above standard optimal), cement and non-metallic minerals (45% above standard optimal), basic metals (38% above standard optimal), and chemicals (32% above standard optimal). Imposing the optimal unilateral carbon tariff yields a welfare gain of +0.8% consumption equivalent for the imposing country, with trading partners losing on average 0.3%, and a net global gain of +0.4%.&lt;/p&gt;
&lt;p&gt;Multilateral coordination — a symmetric global carbon pricing agreement — eliminates the strategic motive for carbon trade wars, delivers an additional global welfare gain of +0.6% above the unilateral optimum, and eliminates 85% of the carbon leakage remaining under unilateral policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CBAM Analysis.&lt;/strong&gt; The paper evaluates the EU Carbon Border Adjustment Mechanism (CBAM) against the theoretically optimal carbon tariff. The EU CBAM as currently implemented — covering only direct emissions — captures 60% of the theoretically optimal carbon tariff. Extending coverage to indirect (supply-chain) emissions would capture 85% of optimal. The welfare gain to the EU from CBAM relative to no border adjustment is +0.4%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Robustness.&lt;/strong&gt; Results are qualitatively robust to trade elasticity assumptions but quantitatively sensitive to them. Optimal carbon tariffs are regressive with respect to developing countries; multilateral coordination mitigates this distributional effect via income transfers. General equilibrium labor market effects reduce welfare gains by approximately 20% but do not change the qualitative ranking of policies.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the formal structure of the optimal unilateral import tariff in the presence of carbon externalities?&lt;/strong&gt;
The optimal import tariff from country j in sector s is tau*_js = tau^ToT_js + tau^carbon_js, where tau^ToT is the standard terms-of-trade optimal tariff (inverse of the export supply elasticity) and tau^carbon is a carbon leakage correction equal to e_js × (lambda_j − lambda*) / P_js. Here e_js is the emission intensity of country j in sector s, lambda_j is the social cost of carbon in the importing country, lambda* is the actual domestic carbon price in the exporting country, and P_js is the import price. Countries therefore have two distinct and additive incentives to impose import tariffs: the classical terms-of-trade motive and a novel carbon leakage correction motive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the sufficient statistic result and why does it matter for implementation?&lt;/strong&gt;
The paper shows that the optimal carbon tariff can be expressed as a function of observable trade elasticities and emission intensities alone, without requiring estimation of structural parameters beyond those standard to the trade model. This sufficient statistic result matters because it means regulators can in principle calculate and implement the theoretically optimal carbon border adjustment using data that are already collected — sectoral emission intensities and trade elasticities — rather than relying on unobservable structural primitives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: By how much do optimal carbon tariffs exceed standard optimal tariffs in the aggregate and in the most carbon-intensive sectors?&lt;/strong&gt;
Globally, optimal unilateral carbon tariffs are on average 30% above standard optimal tariffs (28% above for the EU, 33% above for the US). The excess is largest in highly carbon-intensive sectors: cement and non-metallic minerals (45% above), petroleum products (41% above), basic metals (38% above), and chemicals (32% above). These are precisely the sectors where emission intensities are highest, consistent with the carbon leakage correction being proportional to emission intensity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the welfare effects of unilateral optimal carbon tariff policy?&lt;/strong&gt;
For the country imposing the optimal unilateral carbon tariff, the welfare gain is +0.8% in consumption-equivalent terms relative to no carbon tariff. Trading partners lose on average 0.3%. The net global welfare gain is +0.4%. These numbers reflect the fact that unilateral carbon tariffs are partly beggar-thy-neighbor in structure — they improve the imposing country&amp;rsquo;s terms of trade in addition to correcting leakage — which is why multilateral coordination is needed to eliminate the strategic distortion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What additional gains does multilateral coordination deliver over unilateral policy?&lt;/strong&gt;
Multilateral coordination — modeled as a symmetric global carbon pricing agreement — generates an additional global welfare gain of +0.6% above the unilateral optimum. It also eliminates 85% of the carbon leakage that persists under unilateral policy. The mechanism is that coordination removes the strategic motive for trade wars over carbon policy: under unilateral policy, each country has an incentive to impose carbon tariffs partly for terms-of-trade reasons, but under a coordinated agreement these beggar-thy-neighbor components are internalized.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How well does the EU&amp;rsquo;s CBAM as actually implemented capture the theoretically optimal carbon border adjustment?&lt;/strong&gt;
The EU CBAM as implemented — covering only direct emissions from covered sectors — captures 60% of the theoretically optimal carbon tariff. Extending the CBAM to include indirect emissions embedded in supply chains would raise this to 85% of optimal. The remaining gap (15% under the extended CBAM) reflects the difficulty of accounting for all upstream emission intensities across complex global supply chains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the welfare gain to the EU from CBAM relative to no border adjustment?&lt;/strong&gt;
The welfare gain to the EU from implementing CBAM (relative to having no carbon border adjustment at all) is +0.4% in consumption-equivalent terms. This figure corresponds to the direct CBAM as implemented, covering only direct emissions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How sensitive are the results to trade elasticity assumptions, and what are the distributional implications for developing countries?&lt;/strong&gt;
The results are qualitatively robust to trade elasticity assumptions but quantitatively sensitive — the magnitude of optimal carbon tariffs and welfare effects depends on the specific elasticities used. On distributional grounds, optimal carbon tariffs are regressive with respect to developing countries, meaning developing economies bear disproportionate costs from carbon border adjustments. Multilateral coordination partially mitigates this distributional concern through income transfers implied by the symmetric global agreement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do general equilibrium labor market effects alter the conclusions?&lt;/strong&gt;
General equilibrium labor market effects reduce the welfare gains by approximately 20% relative to the baseline estimates, but do not change the qualitative ranking of policies (unilateral carbon tariff better than no border adjustment; multilateral coordination better than unilateral). This suggests that the core policy conclusions are robust to incorporating labor market general equilibrium effects, even if the precise magnitudes are somewhat smaller.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Carbon Leakage.&lt;/strong&gt; In this paper, carbon leakage refers specifically to the shift in production and emissions to countries without domestic carbon pricing that occurs when one country implements a carbon price. It is the mechanism by which domestic carbon pricing is partially offset, motivating the use of trade policy as a complementary instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Leakage Correction (tau^carbon).&lt;/strong&gt; The component of the optimal import tariff that is distinct from the standard terms-of-trade tariff. It equals emission intensity × (social cost of carbon − domestic carbon price in exporter) / import price. It corrects for the fact that imports from countries with insufficient carbon pricing embody unpriced carbon externalities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Terms-of-Trade Tariff (tau^ToT).&lt;/strong&gt; The standard optimal import tariff arising from a large country&amp;rsquo;s ability to manipulate its terms of trade. Equal to the inverse of the export supply elasticity of the trading partner. The paper establishes that carbon tariffs add to — rather than replace — this classical component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic for Optimal Carbon Tariff.&lt;/strong&gt; A formula expressing the optimal carbon tariff as a function of observable trade elasticities and emission intensities, without requiring estimation of unobservable structural parameters beyond those standard to the trade model. The term is used in the paper&amp;rsquo;s specific sense of an empirically implementable formula that is exact within the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Emission Intensity.&lt;/strong&gt; Sector-specific carbon emissions per unit of output in a given country, denoted e_js for country j and sector s. Used as the key observable that scales the carbon leakage correction component of the optimal tariff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multilateral Coordination.&lt;/strong&gt; Modeled as a symmetric global carbon pricing agreement in which all countries simultaneously adopt optimal carbon pricing. In the paper&amp;rsquo;s framework, this eliminates the strategic motive for unilateral carbon trade wars and achieves additional welfare gains and leakage reductions beyond what any single country can achieve unilaterally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Border Adjustment Mechanism (CBAM).&lt;/strong&gt; The EU policy instrument that imposes a carbon price on imports from sectors covered by the EU Emissions Trading System, evaluated in the paper against the theoretically optimal carbon tariff. The paper distinguishes between the direct-emissions-only CBAM as implemented (capturing 60% of optimal) and a hypothetical full CBAM including indirect supply-chain emissions (capturing 85% of optimal).&lt;/p&gt;</description></item><item><title>Cap‐and‐Trade and Carbon Tax Meet Arrow–Debreu</title><link>https://macropaperwarehouse.com/papers/capandtrade-and-carbon-tax-meet-arrowdebreu/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/capandtrade-and-carbon-tax-meet-arrowdebreu/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Anderson and Duanmu (2025) ask how general equilibrium (GE) interactions — factor reallocation across sectors, capital misallocation under climate uncertainty, and the distributional incidence of damages — alter the social cost of carbon (SCC) relative to the partial equilibrium (PE) estimates embedded in standard integrated assessment models (IAMs). The paper also characterizes conditions for Pareto improvements through climate policy and derives the optimal carbon tax in second-best environments with pre-existing distortions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a dynamic Arrow-Debreu economy with L goods, K capital stocks (including climate stocks), and T periods. The climate module specifies that the carbon stock evolves as S_{t+1} = S_t + sum_j e_j(q_j) − alpha·S_t, and climate damage functions D_j(S_t) = 1 − d_j·(S_t − S_0) reduce sector-specific production possibilities sets. Firms and households take the climate trajectory as given and do not internalize their own emissions&amp;rsquo; impact, generating the externality. Under standard regularity conditions, the authors prove existence of a competitive equilibrium and establish that it is inefficient: output is too high and climate-intensive sectors are too large relative to the social optimum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General Formula for the SCC&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper derives a general SCC formula — SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)] — that decomposes into four components: (1) the standard direct productivity-loss term, (2) a GE factor-reallocation term capturing inefficient reallocation as damages shift relative prices, (3) a capital-misallocation term reflecting distortions in investment from climate uncertainty, and (4) a distribution term reflecting the welfare losses from the regressive incidence of climate damages. All three correction terms are positive under standard conditions, so the GE SCC exceeds the PE SCC. The paper shows that this formula nests existing IAM frameworks as special cases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Calibrating to three leading IAMs, the authors find that general equilibrium interactions raise the SCC by 15–40% above standard PE estimates:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;DICE-calibrated: GE correction of &lt;strong&gt;18%&lt;/strong&gt; above the PE estimate.&lt;/li&gt;
&lt;li&gt;FUND-calibrated: GE correction of &lt;strong&gt;15%&lt;/strong&gt; above the PE estimate.&lt;/li&gt;
&lt;li&gt;PAGE-calibrated: GE correction of &lt;strong&gt;40%&lt;/strong&gt; above the PE estimate, the largest correction owing to greater sector heterogeneity in that model.&lt;/li&gt;
&lt;li&gt;Median calibration: a PE SCC of &lt;strong&gt;$51/tCO₂&lt;/strong&gt; rises to a GE SCC of &lt;strong&gt;$62/tCO₂&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Decomposing the aggregate GE correction: factor reallocation across sectors accounts for &lt;strong&gt;55%&lt;/strong&gt;, capital misallocation due to climate uncertainty for &lt;strong&gt;30%&lt;/strong&gt;, and the distributional regressivity of damages for &lt;strong&gt;15%&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Policy and Uncertainty&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In environments with pre-existing distortions, the optimal carbon tax deviates from the SCC: revenue recycling through labor tax cuts generates additional welfare gains of &lt;strong&gt;10–15%&lt;/strong&gt; of carbon tax revenue; undertaxed capital implies the optimal carbon tax should be set above the SCC (double dividend); and in monopolistically competitive sectors the optimal carbon tax is below the SCC because the carbon tax amplifies monopoly distortions. Under climate uncertainty, the SCC carries a risk premium proportional to the variance of damage estimates times the coefficient of relative risk aversion, estimated at &lt;strong&gt;+$8–15/tCO₂&lt;/strong&gt; (15–25% of the base SCC).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The quantitative corrections are calibrated to DICE, FUND, and PAGE and therefore inherit those models&amp;rsquo; parameterizations of damage functions and discount rates. The GE factor-reallocation and capital-misallocation channels are larger when sectors are more heterogeneous in damage exposure — as is explicit in the PAGE result. Second-best corrections depend on the sign and magnitude of pre-existing distortions (labor taxes, capital taxes, market structure).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the core inefficiency result, and what does it imply about the competitive equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s efficiency theorem establishes that the competitive equilibrium is Pareto inefficient because firms and households take the climate trajectory as given and do not internalize the impact of their own emissions on the carbon stock. As a consequence, output is too high and climate-intensive sectors are too large relative to the social optimum. This externality is the fundamental justification for climate policy in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How does the paper&amp;rsquo;s general SCC formula extend existing approaches, and what are the novel terms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The general formula SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)] nests standard IAM SCC formulas as special cases. The novel terms relative to partial equilibrium are: (i) a GE reallocation term capturing losses from inefficient factor reallocation as climate damages change relative prices across sectors; (ii) a capital-misallocation term capturing distortions in investment arising from climate uncertainty; and (iii) a distribution term capturing welfare losses from the regressive incidence of damages. All three terms are positive under standard conditions, implying GE SCC &amp;gt; PE SCC in all calibrations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. How are the quantitative GE corrections decomposed, and which channel dominates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Of the total GE correction above the PE baseline, factor reallocation across sectors contributes 55%, capital misallocation due to climate uncertainty contributes 30%, and the distributional regressivity of damages contributes 15%. Factor reallocation is the dominant channel because, as climate damages alter relative prices, production shifts toward less-damaged sectors in ways that are distorted by the original carbon externality — generating second-order losses absent from PE damage functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. Why does the PAGE calibration produce a larger GE correction (40%) than DICE (18%) or FUND (15%)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper attributes PAGE&amp;rsquo;s larger GE correction to greater sector heterogeneity in that model&amp;rsquo;s parameterization. When damage exposure is more heterogeneous across sectors, the relative-price effects of marginal carbon are larger, amplifying the factor-reallocation channel. DICE and FUND, with more uniform sector-level damage structures, exhibit smaller reallocation corrections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What is the median-calibration implication for the SCC in dollar terms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the median calibration, a PE SCC of $51/tCO₂ rises to a GE SCC of $62/tCO₂, an increase of roughly $11/tCO₂ or approximately 22%. This figure is directly computable from observable trade elasticities and sector-level damage estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. How should the carbon tax be adjusted when pre-existing labor market distortions are present, and what is the magnitude of the welfare gain from revenue recycling?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When labor taxes create a pre-existing wedge, using carbon tax revenue to reduce labor taxes generates additional welfare gains of 10–15% of total carbon tax revenue — the double dividend in the labor market dimension. The optimal carbon tax in this case includes the SCC plus a correction term for the labor-market distortion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How do capital market distortions alter the optimal carbon tax relative to the SCC?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If capital is undertaxed (a pre-existing distortion in capital markets), the optimal carbon tax is set above the SCC. The intuition is that a higher carbon tax partially offsets the under-taxation of capital by raising the effective cost of carbon-intensive investment, capturing a double-dividend in the capital market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. How does monopolistic competition modify the optimal carbon tax?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For monopolistically competitive sectors, the optimal carbon tax is below the SCC. The reasoning is that applying a carbon tax to these sectors amplifies existing monopoly markups and associated distortions, so the social cost of the carbon tax exceeds the raw SCC in those sectors. The optimal policy trades off carbon correction against monopoly amplification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What is the risk premium in the SCC under climate uncertainty, and how is it estimated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper adds a term to the SCC proportional to the variance of damage estimates times the coefficient of relative risk aversion. Using empirical estimates of damage uncertainty, this risk premium is estimated at +$8–15/tCO₂, representing 15–25% of the base SCC. This term is absent from deterministic SCC calculations and constitutes a further reason standard PE estimates understate the true social cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. What is the paper&amp;rsquo;s claim regarding computability of the GE correction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper states that the novel GE terms are computable from observable trade elasticities and sector-level damage estimates, implying the GE correction is not merely a theoretical construct but can be implemented in quantitative policy analysis using data sources already available to researchers and policymakers.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Social Cost of Carbon (General Equilibrium Formula)&lt;/strong&gt;
Defined in the paper as SCC_t = Sum_{tau &amp;gt;= t} beta^(tau−t) · [dW/dS_tau / (dW/dY_t)], the present discounted value of the marginal welfare loss from an additional unit of carbon, expressed relative to the marginal utility of current output. The paper&amp;rsquo;s version adds GE reallocation, capital-misallocation, and distributional terms absent from standard PE formulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GE Adjustment Factor&lt;/strong&gt;
The ratio of the general equilibrium SCC to the partial equilibrium SCC, expressed as GE/PE = 1 + phi_realloc + phi_capital + phi_distribution. Under standard conditions all three phi terms are positive, so the GE SCC strictly exceeds the PE SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Climate Damage Function (Sector-Specific)&lt;/strong&gt;
Specified as D_j(S_t) = 1 − d_j·(S_t − S_0), a sector-specific multiplicative reduction in the production possibilities set as the carbon stock rises above the pre-industrial level S_0. Heterogeneity in d_j across sectors is the driver of the factor-reallocation GE correction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon Stock Evolution&lt;/strong&gt;
S_{t+1} = S_t + sum_j e_j(q_j) − alpha·S_t, where alpha is the natural decay rate of atmospheric carbon and e_j(q_j) is sectoral emissions as a function of output. Firms and households treat S_t as exogenous, generating the externality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Double Dividend&lt;/strong&gt;
In second-best environments, a carbon tax can generate two welfare gains simultaneously: correcting the carbon externality and reducing the deadweight loss from a pre-existing distortion (labor or capital tax). The paper finds revenue recycling via labor tax cuts yields 10–15% of carbon tax revenue as additional welfare gain; undertaxed capital implies the optimal carbon tax is set above the SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk Premium in the SCC&lt;/strong&gt;
An additive term in the SCC under climate uncertainty, proportional to the variance of damage estimates times the coefficient of relative risk aversion. Empirically estimated at +$8–15/tCO₂, representing 15–25% of the base SCC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Optimal Carbon Tax&lt;/strong&gt;
Written as tau*_carbon = SCC + CORRECTION, where the correction depends on the sign and magnitude of pre-existing distortions. The correction is positive under undertaxed capital (raise above SCC), negative under monopolistic competition (lower below SCC), and augmented by revenue-recycling gains when labor taxes are present.&lt;/p&gt;</description></item><item><title>Cash or card? A structural model of payment choices</title><link>https://macropaperwarehouse.com/papers/cash-or-card-a-structural-model-of-payment-choices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/cash-or-card-a-structural-model-of-payment-choices/</guid><description>&lt;p&gt;Lippi and Moracci (2026) ask how euro area households choose between cash and card payments, and whether existing theoretical models can explain observed behavior. They draw on ECB payment diary surveys (SUCH and SPACE waves I–III, 2015–2024) covering transaction-level records that include purchase size, payment method chosen, cash on hand before each transaction, and merchant acceptance of cards. This granular data allows the authors to isolate unforced payment choices — transactions in which the consumer had sufficient cash, the merchant accepted cards, and the consumer held a card — from mechanically constrained ones.&lt;/p&gt;
&lt;p&gt;The authors document three empirical patterns. First, roughly 39% of individuals in the sample violate the simple transaction-size threshold rule of Whitesell (1989): their largest unforced cash payment exceeds their smallest unforced card payment. Second, between 27% and 49% of unforced transactions are settled by card across survey waves, contradicting the &amp;ldquo;cash burns&amp;rdquo; policy of Alvarez and Lippi (2017) under which cards are used only when cash is exhausted. Third, and most novel, the probability of card use rises sharply as implied residual cash holdings (m′ = m − s) approach zero — that is, when a cash payment would nearly deplete the wallet. This suggests a precautionary motive: consumers maintain a cash buffer to cover purchases at merchants who do not accept cards.&lt;/p&gt;
&lt;p&gt;To rationalize these facts, the authors build an inventory-theoretic model with a compound Poisson expenditure flow (random arrival times and random transaction sizes drawn from a lognormal distribution), imperfect card acceptance (fraction ϕ of merchants accept cards, set at 0.89 for 2023–24), a fixed cost b per cash withdrawal, a fixed cost κ per card transaction (sign unrestricted), and a utility penalty u per missed purchase. The optimal policy takes an (s,S) form for withdrawals and a state-dependent threshold for payment choice. When 0 &amp;lt; κ &amp;lt; b, the agent uses cards for purchases large enough that paying cash would push balances below a threshold m̃, thereby avoiding a costly withdrawal or the risk of missing a future purchase. The critical transaction size above which cards are used, s(m), rises with cash on hand, generating the interaction the data reveals.&lt;/p&gt;
&lt;p&gt;The model is calibrated by minimum distance to four moments from the 2023–24 SPACE wave: average cash balances relative to daily expenditure, annual withdrawal frequency, the unforced card expenditure share, and realized purchase frequency. The estimated annual cost of managing consumption transactions for the average euro area household is approximately 15 euros — a remarkably small burden. Three counterfactual experiments quantify welfare implications. Removing card access raises the annual cost from 15 to about 50 euros, implying a card ownership value of roughly 35 euros per year. Near-universal card acceptance (ϕ = 0.99) reduces the annual cost by nearly 75%, from 15 to about 4 euros, while average cash holdings fall from 130% to about 20% of daily expenditure. A complete ban on cash would cost the average consumer approximately 60 euros per year more than the current mixed system. A cashless equilibrium requires both near-universal acceptance (ϕ above 99%) and card costs at or below zero (κ ≤ 0); neither condition alone is sufficient given the estimated magnitude of the missed-purchase cost u.&lt;/p&gt;
&lt;p&gt;Q: What is the central empirical puzzle the paper addresses?
A: Existing models predict either a pure transaction-size threshold (Whitesell 1989) or a pure cash-burns rule (Alvarez and Lippi 2017). The data shows both rules are violated: 39% of individuals with observed unforced transactions of both types violate the threshold rule, and 27–49% of unforced transactions are paid by card despite available cash. Neither model alone accounts for the novel finding that card usage spikes precisely when a cash payment would nearly exhaust the wallet.&lt;/p&gt;
&lt;p&gt;Q: What data does the paper use and what is its key advantage?
A: The authors use ECB payment diaries from four survey waves: SUCH (2015–16) and SPACE I, II, III (2019, 2021–22, 2023–24). For each transaction the diary records payment method, purchase size, and cash on hand, along with merchant acceptance of each payment method. Critically, the combined information on cash holdings and acceptance allows the authors to distinguish forced from unforced payment choices, which is essential for identifying the behavioral determinants of payment method selection.&lt;/p&gt;
&lt;p&gt;Q: What is the novel empirical fact the paper contributes?
A: The paper documents that the probability of card use increases sharply as implied residual cash (m′ = m − s) approaches zero. This pattern holds across all survey waves. It is consistent with a precautionary motive: consumers use cards to avoid depleting a cash buffer that provides insurance for encounters with merchants who do not accept cards.&lt;/p&gt;
&lt;p&gt;Q: How does the theoretical model generate the precautionary motive for cash?
A: Cards are accepted in only fraction ϕ of stores; when a merchant does not accept cards and the consumer lacks cash, the purchase is missed at utility cost u. This creates an incentive to maintain positive cash balances. Combined with a fixed withdrawal cost b and a fixed card cost κ, the agent optimally targets a cash level m* and withdraws before the wallet empties (trigger m̄ &amp;gt; 0), holding a buffer against card-rejection events.&lt;/p&gt;
&lt;p&gt;Q: What is the key proposition characterizing the optimal payment policy?
A: Proposition 1 establishes three regimes. When κ ≤ 0, the card always dominates and is used for all purchases. When κ ≥ b, cash always dominates and cards are used only for forced transactions. In the intermediate case 0 &amp;lt; κ &amp;lt; b, a threshold m̃ ∈ (m̄, m*) divides behavior: for m &amp;lt; m̃ the agent uses cash for all transactions; for m ≥ m̃ the agent uses a card for any purchase exceeding a size threshold s(m), where s(m) is increasing in m. The threshold s(m) distinguishes this policy from Whitesell (1989)&amp;rsquo;s fixed threshold.&lt;/p&gt;
&lt;p&gt;Q: How does the payment threshold s(m) vary with cash on hand, and why?
A: s(m) is the purchase size above which the value loss from paying cash — pushing the agent closer to m̄ and raising the probability of a missed purchase or costly withdrawal — exceeds the fixed card cost κ. As m rises, a larger cash payment is needed to trigger this concern, so s(m) increases. This means card use becomes less frequent as cash balances grow for most of the state space, consistent with the empirical finding that cash probability rises with cash on hand.&lt;/p&gt;
&lt;p&gt;Q: What are the calibrated parameter values and what do they imply?
A: The withdrawal cost b is estimated at 0.003 EUR — very small. The per-transaction card cost κ is about 60% of b, meaning cards are cheaper to use per transaction than visiting an ATM. The cost of a missed purchase u is approximately 1 EUR. The arrival rate λ is calibrated so that about 2% of purchase opportunities are missed under the estimated card acceptance rate of 0.89. These values imply that the payment system imposes a small but non-trivial welfare burden, concentrated in the precautionary costs of maintaining cash.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated annual cost of managing consumption transactions?
A: Under the optimal policy for 2023–24 parameters, the annual cost C is approximately 15 euros per household. This decomposes into opportunity costs of holding cash (RM), withdrawal costs (bn), card usage costs, and the disutility from missed purchases. The authors characterize this as &amp;ldquo;remarkably small,&amp;rdquo; suggesting the current payment system is relatively efficient from the household&amp;rsquo;s perspective.&lt;/p&gt;
&lt;p&gt;Q: How does this cost compare across demographic groups and over time?
A: Until 2019 the estimated annual cost was around 20 euros; it stabilized around 15 euros from 2021–22 onward, with the decline driven primarily by households holding less cash in the post-pandemic period. Across age groups, education levels, income brackets, and gender, each subgroup faces a very similar cost as a proportion of their expenditure, indicating limited distributional variation in payment system costs.&lt;/p&gt;
&lt;p&gt;Q: What is the welfare value of owning a payment card?
A: Setting ϕ = 0 (cash-only economy), the annual cost rises from 15 to approximately 50 euros. The value of card ownership is therefore approximately 35 euros per year. The savings come primarily from lower opportunity costs of holding cash (since card access reduces the precautionary motive) and lower disutility from missed purchases; withdrawal cost reductions play a negligible role.&lt;/p&gt;
&lt;p&gt;Q: What happens under near-universal card acceptance (ϕ = 0.99)?
A: Average cash holdings fall from about 130% of daily expenditure to about 20% of daily expenditure, a reduction of approximately 110 percentage points. The unconditional card expenditure share rises by 17 percentage points to about 93%, mostly through an increase in forced card transactions (agents more often lack cash). Unforced card expenditure falls by about 10 percentage points because the precautionary motive for using cards — preserving a cash buffer — weakens when acceptance is near-universal. The annual management cost falls by nearly 75%, from 15 to approximately 4 euros.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does a cashless economy emerge?
A: The model identifies two jointly necessary conditions: card acceptance near universal (ϕ above 99%) and card costs at or below zero (κ ≤ 0). Raising ϕ alone from the estimated 0.89 to 0.99 reduces cash use substantially but does not eliminate it, because the estimated cost of missed purchases u is large enough that consumers still maintain a small cash buffer. For κ ≤ 0, cash holdings M/e are insensitive to κ and depend only on ϕ. With current card usage costs, even near-universal acceptance would not produce a cashless economy.&lt;/p&gt;
&lt;p&gt;Q: What is the cost of a complete cash ban?
A: Under a cashless policy, the annual cost is approximately 75 euros — about 5 times the 15-euro baseline and about 25 euros more than the cash-only cost of 50 euros. A complete ban on cash would increase transaction management costs by approximately 60 euros per year for the average consumer. This is because at ϕ = 0.89, nearly 11% of purchase encounters would result in missed transactions.&lt;/p&gt;
&lt;p&gt;Q: How does card acceptance affect cash management in the model and data?
A: As ϕ falls, the precautionary motive for holding cash strengthens: the withdrawal trigger m̄ rises, average cash holdings increase, and withdrawals occur when the wallet is still substantially full. This prediction is qualitatively consistent with the empirical finding that in areas with lower card acceptance, individuals hold higher cash balances and withdraw at higher residual cash levels.&lt;/p&gt;
&lt;p&gt;Q: What are the main limitations the authors acknowledge?
A: Three caveats are identified. First, the model has no exogenous cash inflows (wage payments, gifts); incorporating Miller-Orr-style inflows could affect cash resilience estimates. Second, the card cost κ is fixed and independent of transaction size s; allowing κ(s) = κ₀ + κₛ·s would better capture reward-program economies relevant for the US. Third, merchant card acceptance is treated as exogenous; endogenizing it as a game between merchants would allow a joint welfare evaluation of acceptance decisions, payment choices, and cash management.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Unforced transactions: Transactions in which both cash and card payments are feasible — specifically, cash holdings exceed the purchase size, the merchant accepts cards, and the consumer holds a card. Isolating unforced transactions is necessary to identify behavioral determinants of payment choice, stripping out mechanical constraints imposed by cash insufficiency or merchant non-acceptance.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Precautionary cash buffer: A positive cash balance maintained above the withdrawal trigger (m̄ &amp;gt; 0) to insure against purchases at merchants who do not accept cards. In the model, this buffer arises because card non-acceptance combined with insufficient cash results in a missed purchase at utility cost u; the precautionary motive is stronger when ϕ is lower.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Transaction-size threshold s(m): The purchase size above which a consumer with cash holdings m optimally pays by card (when cards are available and 0 &amp;lt; κ &amp;lt; b). Unlike the fixed threshold of Whitesell (1989), s(m) is increasing in m, generating a novel interaction between cash on hand and payment method choice that the ECB diary data confirms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Cash burns policy: The policy of Alvarez and Lippi (2017) in which cards are used only when cash is fully exhausted (m = 0). The paper documents that 27–49% of unforced transactions are settled by card across survey waves, constituting a systematic violation of this rule that the model resolves by introducing transaction-size heterogeneity and a precautionary motive.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Imperfect card acceptance (ϕ): The exogenous fraction of merchants willing to accept card payments, set at 0.89 for 2023–24 in the calibration. Imperfect acceptance is the primary driver of the precautionary demand for cash; it also determines the frequency of missed purchases under a cashless policy and is the key parameter governing whether a cashless economy can emerge.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Annual transaction management cost (C): The total yearly household cost of operating within the payment system, defined as C = RM + bn + κ·(number of card purchases) + u·(number of missed purchases). Estimated at approximately 15 euros for the average euro area household in 2023–24, decomposed across opportunity costs of cash holdings, withdrawal costs, card usage costs, and missed-purchase disutility.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Ss withdrawal policy: The optimal cash replenishment rule characterized by a trigger level m̄ and a target level m*. The agent withdraws whenever cash falls to m̄, resetting balances to m*. A strictly positive trigger (m̄ &amp;gt; 0) reflects the precautionary motive: the agent refills before cash is exhausted in order to maintain insurance against card non-acceptance events.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Catastrophes, Delays, and Learning</title><link>https://macropaperwarehouse.com/papers/catastrophes-delays-and-learning/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/catastrophes-delays-and-learning/</guid><description>&lt;p&gt;This paper develops a general model of experimentation under catastrophe risk in which the catastrophe is triggered when a stock variable exceeds an unknown threshold, but occurs only after a stochastic delay. The central contribution is the concept of the &amp;ldquo;legacy of the past&amp;rdquo;: at any planning date, past experiments may have already triggered a catastrophe that has not yet materialized, and the planner cannot observe whether triggering has occurred. The legacy is formally defined as the probability, conditional on survival, that a catastrophe was triggered in the past.&lt;/p&gt;
&lt;p&gt;The model unifies two canonical but previously incompatible approaches in the literature. In the hazard-rate approach, the catastrophe is bound to happen and the planner manages its timing and severity. In the unknown-threshold approach, learning is instantaneous and the catastrophe is certainly avoided if the stock has not yet exceeded the threshold. Neither approach captures the intermediate case where the planner remains uncertain about whether the catastrophe is already underway. By introducing a delay governed by an exponential distribution with parameter α, the authors show that both approaches are limiting special cases: as α → ∞ (no delay), the legacy vanishes and the unknown-threshold approach is recovered; when the legacy is set permanently to one (catastrophe triggered with certainty), the hazard-rate approach is recovered.&lt;/p&gt;
&lt;p&gt;Three benchmark stock levels anchor the analysis. QN is the long-run target absent any catastrophe risk. QD (&amp;ldquo;Damages&amp;rdquo;) is the optimal stabilization target when the planner knows a catastrophe was triggered in the past — it lies weakly below QN because the planner trades off current gains against the discounted marginal damage from raising the stock at the moment of eventual catastrophe occurrence. QE (&amp;ldquo;Experimentation&amp;rdquo;) is the stock level below which stabilization is suboptimal when the planner is certain no triggering has occurred — it also lies weakly below QN.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s two main theorems are distinguished by the ranking of QD and QE, which reflects whether mitigation strategies are effective.&lt;/p&gt;
&lt;p&gt;Theorem 1 (QE &amp;lt; QD): When damage is not highly sensitive to the stock level at catastrophe time — so mitigation is relatively ineffective — optimal paths are monotonically increasing and converge to a long-run stock level Q∞ ∈ [QE, QD]. The stopping condition equates the marginal benefit of experimentation to a weighted average of the expected cost under the unknown-threshold approach (weight 1 − π) and the marginal damage under the hazard-rate approach (weight π), where π is the legacy at stopping time. A higher legacy at the stopping time is associated with a higher long-run stock level. A higher initial legacy induces fatalism: since the catastrophe is more likely already triggered, the planner shifts priority toward current consumption rather than caution, leading to more total experimentation.&lt;/p&gt;
&lt;p&gt;Theorem 2 (QD &amp;lt; QE): When damage is highly sensitive to the stock level — so mitigation is valuable — the long-run target is uniquely QE regardless of the initial legacy. However, the short-run path is non-monotonic: for a sufficiently high initial legacy, the planner first reduces the stock sharply (lockdown, emissions cut) to mitigate pending catastrophe damages, then, as the legacy declines because no catastrophe occurs, gradually allows the stock to rise back toward QE. The direction of caution reverses relative to Theorem 1: a higher legacy now induces more caution, not less.&lt;/p&gt;
&lt;p&gt;Applications include pandemic management (stock = infected population, catastrophe = health system collapse) and climate change (stock = cumulative CO2 emissions or atmospheric pollution stock). In the disease control application, whether a planner prioritizes economic production or mortality reduction determines which theorem governs, with the key ratio being production losses relative to mortality increases. For pandemic policy, Theorem 2 produces a formal learning-based rationale for non-monotonic &amp;ldquo;hammer-and-dance&amp;rdquo; policies (strict early lockdown followed by relaxation) that differs from prior explanations in the literature. In the carbon budget application, Proposition 5 formally proves that higher initial legacy raises the optimal carbon budget under Theorem 1 conditions, and can imply unbounded consumption (certainty of catastrophe) above a critical legacy threshold π*. Under Theorem 2 conditions (Proposition 6), the optimal policy can involve first reducing then expanding the stock before stabilizing, with both transition dates increasing in the initial legacy.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;legacy of the past&amp;rdquo; and how is it computed?
A: The legacy πt is defined as the probability, conditional on survival to date t, that a catastrophe was already triggered by past experiments. Formally, πt = 1 − [1 − F(Qt)] / pt, where Qt is the highest stock level ever reached, F is the prior distribution over the threshold, and pt is the survival probability. A past experiment at time t&amp;rsquo; contributes to the current legacy with weight exp[−α(t − t&amp;rsquo;)], so recent experiments matter more than distant ones. As time passes without catastrophe, the legacy of any fixed past experiment declines geometrically at rate α.&lt;/p&gt;
&lt;p&gt;Q: How do the three benchmark stock levels QN, QD, and QE relate to each other?
A: QN is the optimal long-run stock without any catastrophe. QD is defined by the condition where the marginal net benefit of increasing the stock — ν(Q) − [α/(α+δ)]D&amp;rsquo;(Q) — equals zero, and satisfies QD ≤ QN. QE is defined by ν(Q) − [α/(α+δ)]ρ(Q)D(Q) = zero, and also satisfies QE ≤ QN. The ranking between QD and QE depends on whether damage is more sensitive to the marginal increase in stock at catastrophe time (which pushes QD below QE) or to the level of the stock at triggering (which pulls QD above QE).&lt;/p&gt;
&lt;p&gt;Q: What is the key optimality condition in Theorem 1 and how does it unify prior approaches?
A: The stopping condition (equation 15) states: ν(QT) = [α/(α+δ)] × [(1 − πT)ρ(QT)D(QT) + πT D&amp;rsquo;(QT)]. When πT = 0 (no legacy, unknown-threshold limit), this reduces to the experimentation stopping condition of Tsur and Zemel, governed by the hazard rate ρ(QT) times expected loss D(QT). When πT = 1 (full legacy, hazard-rate limit), it reduces to the damage-mitigation condition governed by marginal damage D&amp;rsquo;(QT). The legacy at stopping time thus serves as the mixing weight between the two canonical approaches, embedding both as special cases.&lt;/p&gt;
&lt;p&gt;Q: How does the initial legacy affect total experimentation under Theorem 1 versus Theorem 2?
A: Under Theorem 1 (QE &amp;lt; QD), a higher initial legacy π0 leads to more total experimentation (higher Q∞), because the planner becomes fatalistic — since the catastrophe is more likely already triggered and mitigation is relatively ineffective, current consumption is prioritized. Proposition 5 formally proves this for the carbon budget application: the optimal stopping date T and optimal budget QT are nondecreasing in π0. Under Theorem 2 (QD &amp;lt; QE), a higher legacy triggers more caution in the short run (larger reduction in the stock during the mitigation phase), but the long-run target QE remains the same regardless of π0.&lt;/p&gt;
&lt;p&gt;Q: What generates non-monotonic policies in Theorem 2, and what does this look like in the pandemic application?
A: Non-monotonicity arises because the optimal response to a high legacy is first to reduce the stock sharply to limit catastrophe damages (since damage is sensitive to the stock level), and then, as time passes without catastrophe and the legacy declines, to allow the stock to recover. In the disease control application with high mortality weight, a complete lockdown is optimal in the first phase whenever the legacy is strictly positive. As the legacy declines, the lockdown is gradually relaxed, and eventually the infection level returns to its pre-lockdown level. Figures 3 and 4 show that a higher initial legacy (π0 = 0.1, 0.5, or 0.9) leads to a longer lockdown and slower recovery, though all paths converge to the same long-run infection level.&lt;/p&gt;
&lt;p&gt;Q: How does the model&amp;rsquo;s disease control application determine which theorem governs?
A: Lemma 2 states that if 1 / [1 + (Y(r+d) − Y*) / (wµ&lt;em&gt;dI^D)] &amp;lt; ρ(I^D), then I^E &amp;lt; I^D and Theorem 1 applies; otherwise I^E &amp;gt; I^D and Theorem 2 applies. The key ratio is (Y(r+d) − Y&lt;/em&gt;) / (wµ*d), the production loss relative to mortality increase. A planner who weights economic activity heavily (large production loss ratio) falls under Theorem 1 and tolerates rising infections; a planner who weights mortality heavily falls under Theorem 2 and imposes an initial lockdown.&lt;/p&gt;
&lt;p&gt;Q: What is the carbon budget result under Theorem 1 (Proposition 5)?
A: Under the condition u1 &amp;gt; [α/(α+δ)]v0 (marginal consumption value exceeds discounted marginal damage), Theorem 1 applies and there exists a critical legacy threshold π* such that: below π*, the planner consumes maximally (qt = q-bar) until a finite date T and then stops, with QE &amp;lt; QT &amp;lt; QD; above π*, the planner consumes maximally forever, triggering the catastrophe with certainty. The stopping date T and the optimal budget QT are nondecreasing functions of initial legacy π0, formally proving that higher past emissions (captured through legacy) justify higher future carbon budgets in this model.&lt;/p&gt;
&lt;p&gt;Q: What is the carbon budget result under Theorem 2 (Proposition 6)?
A: Under condition u1 &amp;lt; [α/(α+δ)]v0, QD &amp;lt; QE and Theorem 2 applies. Starting from Q0 above QE, if π0 is small enough (specifically u1 &amp;gt; π0[α/(α+δ)]v0), the optimal policy is to stabilize the stock forever at Q0. Otherwise, there exist two finite dates t1 &amp;lt; t2, both increasing in π0, such that the planner first reduces the stock at maximum rate (qt = q-bar-negative) for t &amp;lt; t1, then expands at maximum rate for t1 &amp;lt; t &amp;lt; t2, then stabilizes at Q0 forever. The optimal carbon budget is Q0 in all cases, showing that the long-run target is independent of legacy under Theorem 2.&lt;/p&gt;
&lt;p&gt;Q: How does the model relate to the hazard-rate literature formally?
A: Papers such as Nordhaus and others that use an exogenous hazard rate h(Qt) for catastrophe — yielding survival probability pt = p0 exp(−∫h(Qτ)dτ) — are shown to be equivalent to the special case where the catastrophe was triggered in the past (legacy = 1 permanently). Their formulation corresponds to assuming α is constant and the legacy is identically one, which reduces the law of motion for pt to pt = p0 exp(−αt). The key difference is that in the hazard-rate approach the planner can reduce the arrival rate by lowering the stock (h is increasing in Q), whereas in the authors&amp;rsquo; model the delay parameter α is constant and policy affects only damages.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the exponential delay distribution assumption?
A: The assumption that the delay τ follows an exponential distribution with parameter α is made for tractability. Under this assumption, the entire past trajectory of the stock (Qt)t≤0 can be summarized by just two state variables — the highest stock on record Q0-bar and the initial legacy π0 — because the exponential &amp;ldquo;memoryless&amp;rdquo; property means that the additional expected waiting time until catastrophe occurrence does not depend on how long the triggering has already been in effect. Without this assumption, the full chronicle of past experiments would be required as a state variable, making the problem intractable.&lt;/p&gt;
&lt;p&gt;Q: What happens when the delay parameter α approaches zero or infinity?
A: When α → ∞ (instantaneous catastrophe upon triggering), pt = 1 − F(Qt) and the legacy is identically zero, recovering the Tsur-Zemel unknown-threshold approach (Proposition 3). The optimal path converges to QE0 from below or stabilizes if already above QE0. When α → 0 (infinite delay, effectively no catastrophe), QE = QD = QN and the problem reduces to the simple stock-flow problem (Proposition 1), with the optimal path converging monotonically to QN.&lt;/p&gt;
&lt;p&gt;Q: Does the model allow for damage mitigation after triggering but before occurrence?
A: Yes, this is a key feature. The continuation payoff after catastrophe occurrence is V(QT) where QT is the stock level at the time of occurrence T, not at triggering time T(S). This means the planner can reduce the stock after triggering to lower damages — analogous to a skater turning back toward shore after the ice first cracks. The assumption that V depends on the stock at occurrence rather than at triggering or at the maximum historical level is what allows this mitigation channel and is explicitly noted as a modeling choice.&lt;/p&gt;
&lt;p&gt;Legacy of the past (πt): The probability, conditional on survival to date t, that past experiments have already triggered a catastrophe. Formally πt = 1 − [1 − F(Qt)] / pt. Recent experiments contribute more to the legacy than distant ones, with contribution decaying at rate α. The legacy is zero when α → ∞ and is the central state variable bridging the paper&amp;rsquo;s two canonical extremes.&lt;/p&gt;
&lt;p&gt;QE (&amp;ldquo;Experimentation&amp;rdquo; threshold): The stock level at which the net marginal gain from further experimentation, defined as ν(Q) − [α/(α+δ)]ρ(Q)D(Q), equals zero, under the assumption that no catastrophe has been triggered. Below QE, stabilization is suboptimal; above QE, the planner does not experiment further when the legacy is zero.&lt;/p&gt;
&lt;p&gt;QD (&amp;ldquo;Damages&amp;rdquo; threshold): The stock level at which the net marginal benefit from holding the stock, defined as ν(Q) − [α/(α+δ)]D&amp;rsquo;(Q), equals zero, under the assumption that the catastrophe is known to have been triggered. QD ≤ QN and represents the optimal long-run target when the hazard-rate approach applies.&lt;/p&gt;
&lt;p&gt;Marginal payoff ν(Q): Defined as uq(0, Q) + (1/δ)uQ(0, Q), it measures the net gain from marginally increasing the flow when the stock is stabilized at Q. It is strictly decreasing in Q under Assumption 1 and equals zero at QN.&lt;/p&gt;
&lt;p&gt;Damage function D(Q): Defined as (1/δ)u(0, Q) − V(Q), it measures the welfare loss from catastrophe occurrence when the stock is Q at occurrence time, relative to permanent stabilization at Q. Assumed weakly positive and weakly increasing in Q.&lt;/p&gt;
&lt;p&gt;Survival probability (pt): The probability, computed from prior beliefs F at the beginning of times, that the catastrophe has not yet occurred by date t. Its law of motion is ṗt = α[1 − F(Qt) − pt], driven solely by the catastrophe parameter α and the current maximum stock Qt.&lt;/p&gt;
&lt;p&gt;Fatalism (under Theorem 1): The policy implication that a higher legacy — meaning a higher probability the catastrophe is already triggered — leads the planner to increase the stock further and accept more experimentation, because mitigation is relatively ineffective (QE &amp;lt; QD) and current consumption must be enjoyed before the catastrophe arrives.&lt;/p&gt;</description></item><item><title>Central bank communication by ??? The economics of monetary policy leaks</title><link>https://macropaperwarehouse.com/papers/central-bank-communication-by-the-economics-of-monetary-policy-leaks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/central-bank-communication-by-the-economics-of-monetary-policy-leaks/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates the economics of monetary policy leaks — anonymous disclosures of confidential information by insiders to the media — focusing on three central questions: (1) Are leaks random accidents, strategic individual disclosures, or institutionally authorized &amp;ldquo;plants&amp;rdquo;? (2) Do leaks shape public (financial market) views, and by how much? (3) Can attributed (named) communication by central bank officials mitigate the effects of leaks?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors study the Eurosystem (ECB and euro area National Central Banks) over January 2002 to December 2021. Their primary data source is a novel database of 368 unique policy-relevant leaks — assembled by manually filtering and classifying more than a million news items from Reuters, Bloomberg, and Market News International archives — with precise minute-level timestamps. Topics covered include: policy rates (178 leaks), unconventional monetary policy/UMP (207 leaks), economic growth (47), inflation (41), and euro exchange rate (36); individual leaks may cover multiple topics. They complement this with a dataset of 7,883 attributable public statements by ECB Governing Council members, identified via keyword filtering and machine learning classification of the Reuters News Archive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper employs four main empirical strategies. First, high-frequency event studies using asymmetric windows (5 minutes before to 30 minutes after an event) compare absolute market reactions in OIS rates across the full term structure (3M to 10Y) and in the EURO STOXX 50 across leaks, 5,000 randomly sampled placebo events, and attributable statements. Second, Poisson regression models relate the number of leaks per policy meeting to proxies for Governing Council disagreement (Italian-German sovereign yield spread, inter-quartile range of national inflation rates, number of attributable statements per meeting) and a dummy for quarterly macroeconomic projection releases. Third, a regression framework tests whether leaks move market expectations toward the subsequent policy outcome — identifying whether leaks are informative about the direction of policy. Fourth, an augmented version of the Tillmann (2021) model relates end-of-day changes in longer-term OIS rates to high-frequency monetary policy surprises, interacted with dummies for post-announcement leaks and attributable statements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Incidence and timing.&lt;/em&gt; The number of Eurosystem leaks peaked at 36 in 2019 (more than four per policy meeting on average) before declining by more than one third following the start of Christine Lagarde&amp;rsquo;s presidency in November 2019. Leaks cluster around policy meetings and, since 2015, have shifted notably from before meetings to after meetings, a shift driven by leaks related to UMP. Leaks occur even during the ECB&amp;rsquo;s quiet period, when policy-makers are formally restricted from public statements on policy-sensitive topics.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Leaks are not accidents.&lt;/em&gt; Poisson regressions reveal that the number of leaks per meeting is significantly and positively associated with proxies for Governing Council disagreement: every additional percentage point in the Italian-German sovereign yield spread is associated with approximately half an additional leak per meeting. The propensity of a policy change increases by four to six percentage points with each additional pre-meeting leak (statistically significant at the 5% or 10% level). The specification explains around 15% of the variation in leak counts.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Market impact.&lt;/em&gt; Market movements around leaks are up to 85% larger than those around placebo events. Leaks trigger market reactions that are consistently larger than those of attributable statements by individual Governing Council members across the entire OIS term structure and in equities — a result robust to controlling for distance to policy meetings. Rate leaks mainly move the short and medium end of the yield curve; UMP leaks affect the long end and equities. Leaks about general economic conditions (growth, inflation, exchange rate) produce little statistically significant market response.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Leaks are uninformative about policy direction.&lt;/em&gt; Conditional on a pre-meeting leak occurring, the average leak does not move market rates closer to the levels prevailing directly after the subsequent policy announcement. By contrast, attributable statements systematically do reduce this distance. This asymmetry implies that leaks predominantly reflect minority opinions within the Governing Council. Consistent with this, leaks counteract prevailing trends in market expectations at the short end of the yield curve (as established by a negative coefficient on the interaction between the prevailing seven-day pre-leak trend and the leak dummy).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Leaks are not plants; attributed communication mitigates their effects.&lt;/em&gt; Post-announcement leaks dampen the transmission of monetary policy surprises to longer-term rates (negative and significant interaction coefficient in the augmented Tillmann framework). Attributed statements by ECB Executive Board members, by contrast, systematically move in the direction opposite to the preceding leak across most of the yield curve, partially reversing leak-induced market moves. More intense pre-leak attributable communication is also associated with lower market impact of the subsequent leak, across most maturities. These results jointly indicate that most Eurosystem leaks originate from individual insiders with minority opinions rather than constituting institutional plants.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to the Eurosystem committee setting, where decision-making is broadly consensus-based and voting records are not published; they may not fully generalize to institutions with concentrated decision-making power. The study measures effects on financial markets, not broader public opinion.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How is a &amp;ldquo;leak&amp;rdquo; defined in this paper, and how are Eurosystem leaks identified empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A leak is defined as a disclosure of confidential information by an insider to the media with an expectation of anonymity. Eurosystem leaks are identified from Reuters, Bloomberg, and Market News International archives (2002–2021) using keyword-driven pre-filtering followed by manual classification of &amp;ldquo;candidate&amp;rdquo; items. The resulting database contains 1,253 news items that aggregate to 368 unique policy-relevant leaks with minute-level timestamps. Policy-relevant leaks touch on: policy rates, unconventional monetary policy tools, economic growth, inflation, or the euro exchange rate; leaks about local economic conditions, banking regulation, or managerial appointments are excluded.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the broad trends in the number and topic composition of Eurosystem leaks over 2002–2021?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The number of leaks rose sharply in the second half of the sample, peaking at 36 in 2019 (more than four per meeting on average). Since Christine Lagarde took over the ECB presidency in November 2019, leaks fell by more than one third from that peak. The topic composition shifted substantially over time: policy-rate leaks predominated in the earlier period, while leaks related to UMP came to dominate in the 2015–2021 sub-period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the timing of leaks within the policy meeting cycle change across sub-periods?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the full sample, leaks cluster in the run-up to policy meetings and immediately following announcement days (both on the announcement day itself and the following Friday). Since 2015, a notable shift occurs from pre-meeting to post-meeting timing, driven specifically by leaks related to UMP. The authors attribute this shift to the expectation-management role of UMP: post-meeting leaks allow dissenting insiders to reshape market expectations that are otherwise guided by official press releases and press conferences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What regression evidence supports the view that leaks are not random accidents?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Poisson regressions of the number of leaks per meeting on disagreement proxies find significant positive coefficients on: the lagged Italian-German sovereign yield spread (about half a leak more per meeting for each additional percentage point of spread), the inter-quartile range of national inflation rates, and the number of attributable statements per meeting. Meetings coinciding with the release of quarterly macroeconomic projections also attract significantly more leaks. These results are robust to replacing the disagreement proxies with a binary dissent index based on Q&amp;amp;A sessions at ECB press conferences (Tillmann, 2021), even after excluding disagreement-related leaks from the dependent variable to address endogeneity. The model explains about 15% of the variation in leak counts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Does the number of pre-meeting leaks predict policy changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The propensity of a monetary policy change increases by four to six percentage points with each additional pre-meeting leak (significant at the 5% or 10% level). This signal about the propensity of change (not the direction) is hard to square with the random accidents hypothesis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How large are the financial market reactions to leaks relative to placebo events and to attributable statements?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Market movements around leaks are up to 85% larger than the average size of market reactions to 5,000 randomly sampled placebo events. When leaks are compared directly to attributable statements (with leaks as the baseline and fixed effects for year, month, weekday, and hour), average absolute market moves around leaks are consistently larger across the entire term structure of OIS rates and for the EURO STOXX 50. This result is robust to differences in distance to policy meetings, with size differences across the full term structure persisting for periods far from meetings; near meetings, differences narrow but the average market reaction to leaks never falls below that to attributable statements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Do the market effects of leaks differ by topic?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Leaks about policy rates primarily move the short and medium end of the yield curve. Leaks about UMP tools affect the long end of the curve and equities. Leaks about general economic conditions (growth, inflation, euro exchange rate) do not produce statistically significant market reactions, consistent with the interpretation that economic condition leaks require more interpretation before their implications for the policy path become apparent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Do leaks move market expectations in the direction of the subsequent policy outcome?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The average pre-meeting leak does not reduce the absolute distance of market rates to post-announcement levels. This result holds across maturities from 3M to 10Y and is robust to separating leaks inside and outside the ECB&amp;rsquo;s quiet period. Attributable statements, by contrast, systematically reduce this distance (Table 7). The failure of leaks to align expectations with outcomes is interpreted as evidence that leaks predominantly reflect minority views within the Governing Council rather than information held by the decisive voter.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Do leaks counteract or reinforce prevailing trends in market expectations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Leaks counteract prevailing trends. The regression of market reactions to leaks and placebo events on the seven-day pre-event trend reveals a significantly negative interaction between the trend and the leak dummy at the short end of the yield curve. This result is driven specifically by leaks about policy rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Do post-announcement leaks dampen the transmission of monetary policy surprises to longer-term rates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. In the augmented Tillmann (2021) framework, the interaction of the high-frequency 2Y monetary policy surprise with a dummy for post-announcement leaks is negative and significant for 2Y, 5Y, and 10Y OIS rates. In contrast, the interaction with a dummy for post-announcement attributable statements is positive and significant across maturities, indicating that attributed communication reinforces the official policy signal. These two results jointly show that leaks weaken official policy announcements while attributed communication strengthens them.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Does more intense pre-leak attributable communication reduce the market impact of subsequent leaks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Using an intensity measure that weights each attributable statement by the inverse of its distance in hours to the subsequent leak (covering a window from 36 hours to 30 minutes before the leak), the paper finds a significant negative relationship between pre-leak communication intensity and the absolute market reaction to the leak, controlling for year, month, weekday, and hour fixed effects. This holds across most maturities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Does the market impact evidence support the &amp;ldquo;plant&amp;rdquo; hypothesis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. If leaks were institutional plants intended to prepare markets for new policy, one would expect the ECB Executive Board — which controls official communication — to subsequently reinforce the signal from leaks. Instead, attributable statements by ECB-affiliated Governing Council members are systematically negatively correlated with the market direction of the preceding leak across the yield curve, with significant coefficients at medium maturities. NCB Governor statements show weaker and more ambiguous effects, potentially because their statements generate smaller average market movements rather than reflecting a lack of willingness to counteract leaks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Why do markets react to leaks even though leaks are generally uninformative about policy outcomes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper offers three candidate explanations: (1) automated trading algorithms that do not distinguish between attributed and anonymous communication; (2) leaks serve as a coordination device in the spirit of Morris and Shin (2002), amplifying even noisy signals; (3) media-reporting models such as Nimark (2014) and Chahrour et al. (2021) predict that &amp;ldquo;man-bites-dog&amp;rdquo; news — unusual events such as revelations of committee disagreement — shift beliefs beyond their true information content. Leaks are unusual both in frequency (far less common than attributed statements) and in content (they reveal disagreement that rarely surfaces in official communication).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What are the implications for the measurement of monetary policy shocks from high-frequency identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper notes that Eurosystem leaks frequently occur shortly before or after official policy announcements. Pre-announcement leaks can shift market expectations before the start of standard event windows, reducing the measured surprise component of official announcements. Post-meeting leaks dampen the end-of-day effects of announcements. In both cases, standard high-frequency surprise instruments extracted from official announcements alone may miss the full extent of new information available to market participants, suggesting that accounting for leaks could improve the relevance of high-frequency instruments used in monetary policy identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What are the implications for the design of central bank quiet periods?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The ECB&amp;rsquo;s quiet period ends with the policy announcement, whereas the Federal Reserve&amp;rsquo;s extends to the day after the meeting. Based on the finding that post-announcement leaks dampen policy announcement effects while post-announcement attributed statements reinforce them, the paper suggests that permitting attributed communication shortly after policy decisions may help mitigate the market impact of post-announcement leaks.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Monetary policy leak (&amp;ldquo;sources story&amp;rdquo;):&lt;/strong&gt; In this paper, a leak is defined as a disclosure of confidential information emanating from an insider within the Eurosystem (ECB or NCB staff or policy-makers) that is transmitted to financial media with an expectation of anonymity for the source. The paper excludes whistle-blower cases and focuses on leaks where anonymity keeps attention on the content rather than the identity of the source. Leaks are distinct from &amp;ldquo;plants&amp;rdquo; (formally authorized institutional disclosures intended to advance the institution&amp;rsquo;s goals) and from &amp;ldquo;pleaks&amp;rdquo; (the middle ground).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Plant:&lt;/strong&gt; An authorized or semi-authorized anonymous disclosure of confidential information made for the purpose of advancing the public institution&amp;rsquo;s own goals and interests, as distinct from a leak that originates from an individual insider&amp;rsquo;s personal agenda. The paper tests and rejects the plant hypothesis for most Eurosystem leaks on the basis that ECB Executive Board members&amp;rsquo; attributed statements systematically counteract the market impact of leaks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Single voice principle:&lt;/strong&gt; The ECB&amp;rsquo;s communication norm requiring that Governing Council members discuss and resolve disagreements internally while publicly representing the official policy stance. This principle creates a setting where individual members with minority views may resort to anonymous communication as a way to express dissent &amp;ldquo;off-protocol.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quiet period (purdah):&lt;/strong&gt; The ECB&amp;rsquo;s rule requiring policy-makers to refrain from public statements on policy-related topics in the seven days before each Governing Council monetary policy meeting. Leaks cluster during this period despite the restriction, supporting the non-random interpretation of leaks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attributable (named) statement:&lt;/strong&gt; A public statement clearly attributed to a specific, named member of the ECB Governing Council, reported as a breaking-news headline. Attributable statements serve both as a comparison benchmark for measuring the market impact of leaks and as a mitigation instrument when they counteract leak-induced market moves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pre-leak communication intensity (lambda):&lt;/strong&gt; The paper&amp;rsquo;s measure of the intensity of attributable communication in the 36-hour window before a given leak, defined as the sum of inverse time distances (in hours) from each attributable statement to the leak. A higher value means more recent and/or more numerous attributed statements precede the leak.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;High-frequency event study window:&lt;/strong&gt; The paper uses an asymmetric window starting 5 minutes before and ending 30 minutes after a leak&amp;rsquo;s timestamp. Market reactions are measured as the change in the median OIS quote during the 10 minutes after the window versus the 10 minutes before, matching methodology used for both leaks and attributable statements to ensure comparability across communication types.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post-announcement leak dummy:&lt;/strong&gt; An indicator taking the value of one if at least one leak occurs between the end of the official ECB monetary policy announcement window (15:50 CET) and end of trading hours on the announcement day. Used in the augmented Tillmann (2021) regression to measure whether leaks dampen the transmission of monetary policy surprises to longer-term rates.&lt;/p&gt;</description></item><item><title>Central Bank Digital Currency with Collateral-Constrained Banks</title><link>https://macropaperwarehouse.com/papers/central-bank-digital-currency-with-collateral-constrained-banks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/central-bank-digital-currency-with-collateral-constrained-banks/</guid><description>&lt;p&gt;The paper analyzes the implications of introducing a retail central bank digital currency (CBDC) that competes with commercial bank deposits for household liquidity, in a model where banks must post government bonds as collateral to access central bank lending. The authors revisit Niepelt&amp;rsquo;s (2022) &amp;ldquo;equivalence of payment systems&amp;rdquo; result and find that equivalence survives even under a collateral constraint: the central bank can still offer loans to banks that replicate the no-CBDC equilibrium allocation, but at a lending rate lower than Niepelt&amp;rsquo;s unconstrained rate, because tighter terms are needed to incentivize sufficient loan uptake when banks must redirect portfolio holdings toward government bonds to qualify. A structural cost remains: banks must hold government bonds as collateral at the expense of extending credit to firms, so equivalence in allocation does not imply full neutrality — banks&amp;rsquo; business models and the government&amp;rsquo;s intermediation role change even when aggregate output and prices are unchanged. In the dynamic extension where the central bank does not sterilize the CBDC introduction, banks respond by narrowing deposit spreads to attract inflows, with the result that a CBDC ramp-up to 5 percent of steady-state output expands rather than contracts bank credit to firms.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-equivalence-of-payment-systems-result-and-how-does-the-collateral-constraint-change-it"&gt;Q1. What is the equivalence of payment systems result and how does the collateral constraint change it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Brunnermeier and Niepelt (2019) and Niepelt (2022) established that the central bank can neutralize the real effects of CBDC introduction by lending to banks at an appropriate rate to replace lost deposit funding, a result the present paper revisits by adding a collateral requirement on central bank lending — specifically, that banks must hold eligible government bonds up to a fraction θb of their central bank loan value.&lt;/strong&gt; Under this constraint, Proposition 1 shows that equivalence survives: there exists a central bank lending rate that replicates the no-CBDC equilibrium allocation and price system. However, this lending rate is lower than Niepelt&amp;rsquo;s unconstrained rate by a factor increasing in the restrictiveness of the constraint (lower θb requires a lower lending rate), because when banks are collateral-constrained, cheaper terms are needed to induce them to borrow enough from the central bank to offset deposit outflows.&lt;/p&gt;
&lt;h3 id="q2-what-is-corollary-1-and-why-does-full-neutrality-fail"&gt;Q2. What is Corollary 1 and why does &amp;ldquo;full neutrality&amp;rdquo; fail?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Corollary 1 states that even when the central bank achieves allocation equivalence by setting the appropriate lending rate, banks must redirect portfolio holdings from firm loans to government bonds to meet the collateral requirement — crowding out bank credit to firms by an amount equal to the bond uptake, with the crowding-out diminishing as the collateral constraint becomes less restrictive (higher θb).&lt;/strong&gt; This is the sense in which &amp;ldquo;full neutrality&amp;rdquo; fails under the collateral constraint: aggregate output and prices are unchanged, but the composition of credit changes — banks extend less to firms and hold more government bonds — and the government or household sector must absorb the gap in firm financing. In the limiting case where CBDC and deposits are equally valuable to households (λ = 1), the government alone compensates for the reduction in bank loans, effectively expanding its own intermediation role.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-dynamic-extension-show-about-bank-disintermediation"&gt;Q3. What does the dynamic extension show about bank disintermediation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Simulating a gradual and near-permanent increase in CBDC to 5 percent of steady-state output without central bank sterilization, the paper finds that banks respond by narrowing their deposit interest spread to attract deposit inflows, such that total deposits do not fall and bank loans to firms expand rather than contract — the opposite of the disintermediation hypothesis.&lt;/strong&gt; The mechanism relies on the assumption that banks have market power in their regional deposit markets (each bank is a monopsonist): in response to CBDC competition, the bank voluntarily reduces the rent it extracts on deposits (the spread between the risk-free rate and the deposit rate), attracting more deposit inflows. This deposit inflow, combined with central bank loan uptake, expands the bank&amp;rsquo;s balance sheet and increases credit extension to firms. The result stands in contrast to models with competitive deposit markets, where banks cannot respond to CBDC competition through deposit pricing.&lt;/p&gt;
&lt;h3 id="q4-what-changes-even-if-credit-is-not-reduced"&gt;Q4. What changes even if credit is not reduced?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Even when the dynamic model shows credit expansion rather than contraction, the paper establishes that CBDC introduction alters banks&amp;rsquo; balance sheet composition and business model: banks shift toward holding more government bonds and away from firm loans, the government assumes a larger credit intermediation role, and the aggregate distribution of capital ownership changes — constituting the form of non-neutrality that survives even when total credit is unchanged.&lt;/strong&gt; This is what Corollary 1 calls the failure of &amp;ldquo;full neutrality&amp;rdquo;: the real allocation equivalence holds at the aggregate level, but the sectoral distribution of who provides credit to firms shifts from the banking sector toward the public sector. The paper interprets this as a structural consequence of the collateral requirement on central bank lending that is absent in the frictionless equivalence benchmark.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;equivalence of payment systems&lt;/strong&gt; : the theoretical result (from Brunnermeier-Niepelt 2019 and Niepelt 2022) that the central bank can ensure the same equilibrium allocation whether or not CBDC exists, by adjusting its lending terms to banks; this paper revisits and extends the result to environments with a collateral constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;collateral constraint (θb)&lt;/strong&gt; : the requirement in this model that banks hold eligible government bonds as a fraction of the central bank loans they take on; adding this friction to Niepelt&amp;rsquo;s framework preserves equivalence in allocation but requires a lower central bank lending rate and crowds out bank loans to firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;disintermediation&lt;/strong&gt; : the concern that CBDC adoption would cause households to shift en masse from bank deposits to CBDC, reducing bank funding and contracting bank credit; the paper finds this does not occur in either the equivalence analysis or the dynamic extension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;monopsony in deposits&lt;/strong&gt; : the market structure assumption that each regional bank is the sole deposit provider in its region, giving it pricing power over deposit rates; this is what enables banks in the dynamic model to narrow the deposit spread in response to CBDC competition, generating deposit inflows rather than outflows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;full neutrality&lt;/strong&gt; : a stronger invariance result requiring that not only the equilibrium allocation but also banks&amp;rsquo; balance sheet composition and business model are unchanged by CBDC introduction; the paper shows this fails under the collateral constraint even when allocation equivalence holds.&lt;/p&gt;</description></item><item><title>Central bank reputation with noise</title><link>https://macropaperwarehouse.com/papers/central-bank-reputation-with-noise/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/central-bank-reputation-with-noise/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How does noise in the mapping from central bank actions to realized inflation affect the existence and character of reputational equilibria in monetary policy? Specifically, can a central bank that faces uncertainty about whether it is perceived as &amp;ldquo;hawkish&amp;rdquo; or &amp;ldquo;dovish&amp;rdquo; sustain a pure strategy separating equilibrium, and how should each type behave as a function of its current reputation?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology.&lt;/strong&gt; Amador and Phelan build on the monopolistic-competition, cash-in-advance framework of Chari, Christiano, and Eichenbaum (1998) and extend it to allow for (i) two central bank types — hawkish (type 1, high penalty γ₁ for inflationary actions) and dovish (type 2, lower penalty γ₂ &amp;lt; γ₁) — whose identity is private information; (ii) type switching governed by a Markov process, with probability δ that a hawkish bank is replaced by a dovish one and probability ε that a dovish bank is replaced by a hawkish one; and (iii) noise between the central bank&amp;rsquo;s chosen action μᵢ and realized money growth μₐ, which is drawn from a density f(μₐ|μᵢ) with full support. The equilibrium concept is pure symmetric Markov perfect equilibrium, in which all strategies are functions only of the public Bayesian posterior ρ that the current central bank is hawkish. The paper proceeds analytically to characterize no-pooling results and then computationally to demonstrate existence of separating equilibria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;No pooling equilibria exist (analytical).&lt;/strong&gt; Propositions 2 and 3 establish that no pure symmetric Markov equilibrium can have both types choosing the same positive action for any reputation ρ, as long as γ₁ ≠ γ₂ and Assumption 1 (pricing distortion sufficiently severe) holds. The intuition: if both types pool, realized inflation is uninformative, reputation does not change, and there are no dynamic incentives — but different static incentives (γ₁ ≠ γ₂) then imply different optimal actions, a contradiction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Without sufficient noise, separating equilibria also fail to exist.&lt;/strong&gt; In the no-noise limit, Bayesian updating forces the dovish bank&amp;rsquo;s reputation to jump to its maximum after one period of mimicking the hawkish action, making mimicry cheap when the discount factor β is high or the type-persistence probability ε is low. This makes the incentive-compatibility constraint for the dovish bank very difficult to satisfy, potentially precluding existence of a separating equilibrium.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;With sufficient noise, pure strategy separating equilibria exist and have appealing properties (computational).&lt;/strong&gt; The benchmark parameterization sets α = 1, σ = 5, β = 0.99, h(μ) = 0.5μ², ε = δ = 0.02, and the noise distribution such that the hawkish type&amp;rsquo;s unconstrained target would deliver mean inflation of 2% and the dovish type&amp;rsquo;s 3%. Under these parameters:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In the full-information (known-type) world: price P = 1.313 for the hawkish type and P = 1.338 for the dovish type, with E[log(c) − αc] = −1.0297 and −1.0320 respectively, versus the efficient benchmark of −1.&lt;/li&gt;
&lt;li&gt;In the reputational equilibrium, both types choose lower inflationary actions than they would absent reputation considerations — because reputation is valuable (higher ρ lowers household prices and thus improves welfare for both types).&lt;/li&gt;
&lt;li&gt;Both types&amp;rsquo; optimal actions are U-shaped in reputation ρ: they are most restrained — choosing the lowest inflationary actions — when ρ is middling (interior), because Bayesian updating is most sensitive (and thus the reputation cost of inflating is greatest) at interior beliefs, while it is difficult to move extreme beliefs.&lt;/li&gt;
&lt;li&gt;Average equilibrium inflation is 2.1%, which lies below the weighted average of unconstrained type targets (2.5% given equal switching probabilities), demonstrating that reputation concerns compress inflation outcomes.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Ergodic distribution of reputation remains interior.&lt;/strong&gt; Starting from ρ = 0.5, expected reputation conditional on being hawkish stays below 0.63 and conditional on being dovish stays above 0.38, reflecting that noise and type switching prevent reputation from collapsing to its extremes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Welfare implications.&lt;/strong&gt; The hawkish type is made worse off by ongoing household uncertainty (relative to the reference game in which type is immediately revealed), while the dovish type is made better off. Households are better off under continuing uncertainty than under immediate revelation, unless reputation is near its maximum — because uncertainty suppresses inflationary temptations for both types.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply within a monopolistic-competition, cash-in-advance economy with discrete time, infinite horizon, and Markov strategies. The no-pooling result requires Assumption 1 (the pricing distortion is sufficiently severe that the central bank has a positive incentive to inflate from μ = 0). The no-noise existence failure is an informal argument holding fixed discount and type-switching parameters. Computational results are specific to the benchmark parameterization but are verified to be robust to variation in β, σ, γ₁, γ₂, ε, and δ.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the fundamental time-inconsistency problem in the underlying Chari et al. (1998) economy, and how does the paper extend it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: In the Chari et al. (1998) monopolistic-competition cash-in-advance economy, households exploit market power when setting prices, and the cash-in-advance constraint depresses consumption efficiency; this creates an ex-post temptation for the central bank to inflate and partially offset these distortions, even though in equilibrium such inflation is anticipated and only worsens inefficiencies. Equilibrium consumption equals (1/α) × ((σ−1)/σ) × (β/(1+μ)), compounding a monopoly distortion (σ−1)/σ &amp;lt; 1 and a cash-in-advance distortion β/(1+μ) &amp;lt; 1 below the efficient level 1/α. Amador and Phelan add household uncertainty about the central bank&amp;rsquo;s type — captured by the Bayesian posterior ρ that the bank is hawkish — allowing reputation to be endogenously determined and to feed back into equilibrium pricing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does reputation matter only through differences in inflation costs γᵢ and not through differences in effective discount factors alone?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Proposition 1 establishes that if γ₁ = γ₂ (equal inflation penalties), then even if the two types have different effective discount factors β₁ = β(1−δ) ≠ β₂ = β(1−ε), there exists a pooling Markov equilibrium in which both types choose the same action μ* and reputation plays no role. When both types have identical static incentives, they will always choose the same action given that reputation doesn&amp;rsquo;t affect payoffs in such an equilibrium. Hence the relevant dimension of heterogeneity for reputation to matter is the inflation cost parameter γᵢ, not patience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the formal argument that no pooling equilibrium can exist when γ₁ ≠ γ₂?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: Propositions 2 and 3 provide the formal argument. If both types pool at any reputation ρ with a common positive action μ, Bayesian updating implies that ρ⁺ is independent of the money growth realization μₐ. The first-order condition for type i then reduces to the static condition (∂E[log(c) − αc|μ]/∂μ) = γᵢh&amp;rsquo;(μ), which cannot hold simultaneously for types 1 and 2 since γ₁ ≠ γ₂ and h&amp;rsquo;(μ) &amp;gt; 0 for μ &amp;gt; 0. This logic rules out pooling at the stationary reputation ρ* = ε/(δ+ε) in Proposition 2 and at any reputation where μ &amp;gt; 0 in Proposition 3.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Why does noise facilitate the existence of separating equilibria?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: Without noise, if types separate, observing the hawkish action reveals the bank is hawkish with certainty, pushing reputation to its maximum (1−δ) in a single period. This makes mimicry extremely cheap for the dovish type when β₂ is large or ε is small: the incentive compatibility condition requires that the dovish type&amp;rsquo;s static gain from choosing its own action exceeds the value gain from jumping to the best possible reputation, which is a very stringent requirement. With noise, mimicry generates only a probabilistic shift in beliefs rather than a discrete jump to the extreme, so the dovish type must maintain the hawkish action repeatedly to achieve a reputational gain — making mimicry costly enough that the incentive compatibility condition can be satisfied.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the &amp;ldquo;reference game&amp;rdquo; and what analytical purpose does it serve?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The reference game is a variant in which the central bank&amp;rsquo;s type is fixed and is revealed to households immediately after they set prices at date t = 0. From t = 1 onward, the game reduces to the full-information, single-type game of Section 4. This allows the authors to isolate the &amp;ldquo;direct&amp;rdquo; effect of reputation — the fact that expected type affects equilibrium prices today — from the &amp;ldquo;indirect&amp;rdquo; or strategic effect of the central bank actively managing its reputation. In the numerical example, the reference-game prices form the upper dashed line in Figure 1, while the actual game&amp;rsquo;s prices form the lower solid line, with the gap between them attributable to the central bank&amp;rsquo;s incentive to restrain inflation in order to protect reputation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the equilibrium price and welfare levels in the benchmark numerical example, and how do they compare to efficient and full-information benchmarks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The efficient benchmark delivers log(c) − αc = −1 with consumption c* = 1/α = 1. Under full information with only the hawkish type present, P = 1.313 and E[log(c) − αc] = −1.0297; under only the dovish type, P = 1.338 and E[log(c) − αc] = −1.0320. In the reputational equilibrium, prices lie below the full-information mixed benchmark for any given ρ (the solid line in Figure 1 lies below the dashed reference-game line), reflecting that the central banks&amp;rsquo; desire to maintain reputation leads both types to restrain inflation beyond what the direct price effect alone would induce.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the U-shape of optimal central bank actions in reputation arise, and what does it imply for policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: The U-shape arises because Bayesian updating is most powerful at interior beliefs: for extreme reputations (near ε or 1−δ), any given realization of money growth moves the posterior relatively little, so the reputational cost of inflating is small. For interior (middling) reputations, the same action shifts the posterior substantially, making reputation more sensitive to inflation choices and thus increasing the marginal cost of inflating. Both types therefore choose their minimum inflationary actions at middling reputations. The policy implication is that a hawkish central bank with a very low reputation (following a run of high realized inflation outcomes) should not dramatically tighten, because further contraction does relatively little for its reputation until nature delivers enough favorable realizations to move it to a more interior range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What happens to the ergodic distribution of reputation and inflation, and what does this imply about the persistence of reputational dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: Starting from ρ = 0.5, expected reputation remains in the interior: above 0.38 for the dovish type and below 0.63 for the hawkish type. The ergodic distribution of ρ (Figure 5) concentrates at interior values rather than the poles, showing that noise and type switching prevent reputation from stabilizing at extremes. The ergodic inflation distribution (Figure 6) has an average of 2.1%, compared to 2% under an all-hawkish world and 3% under an all-dovish world. Because ε = δ (types are equally likely in the long run), the unconstrained-type-weighted average would be 2.5%, so reputational incentives reduce equilibrium average inflation by approximately 0.4 percentage points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Who gains and who loses from ongoing type uncertainty relative to immediate revelation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: The hawkish type&amp;rsquo;s value function (Figure 3a) lies below the reference-game dashed line for intermediate reputations, indicating that the hawkish type is made worse off by uncertainty — it must bear the cost of restraining inflation beyond what is statically optimal in order to signal its type, but the households partially &amp;ldquo;blame&amp;rdquo; it for high realized inflation regardless. The dovish type (Figure 3b) is made better off under continuing uncertainty because its reputation benefits from households&amp;rsquo; inability to perfectly distinguish types. Households (Figure 3c) are better off under uncertainty unless reputation is very high, because uncertainty suppresses inflation temptations for both types and keeps prices lower.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What happens to equilibrium behavior under robustness checks on key parameters?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: When the discount factor β or the elasticity of substitution σ decreases, both types inflate more and prices rise. When the hawkish type&amp;rsquo;s penalty γ₁ decreases (becomes less hawkish), both types inflate more and prices rise. When the dovish type&amp;rsquo;s penalty γ₂ decreases (becomes more dovish), the dovish type inflates more and, somewhat counterintuitively, the hawkish type inflates less, leaving prices roughly unchanged but slightly higher. When switching probabilities ε or δ increase, prices rise and both types inflate more, analogously to a decrease in β. Across all robustness exercises, the dovish type never inflates less than the hawkish type — consistent with Proposition 1&amp;rsquo;s implication that the inflation-cost difference γ₁ − γ₂ is the fundamental driver of separation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Hawkish type (type 1):&lt;/strong&gt; A central bank that receives a relatively large negative payoff γ₁h(μᵢ) for taking inflationary actions, where γ₁ &amp;gt; γ₂. In the paper&amp;rsquo;s own sense, this type is not behavioral — it optimizes fully and can choose any action — but has a strong intrinsic cost to inflation, making it prefer lower money growth rates ceteris paribus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dovish type (type 2):&lt;/strong&gt; A central bank with a lower penalty parameter γ₂ &amp;lt; γ₁ for inflationary actions. Like the hawkish type, it is fully strategic and optimizing, differing only in the magnitude of its intrinsic inflation cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reputation (ρ):&lt;/strong&gt; The Bayesian posterior probability that households assign to the current central bank being the hawkish type. It is the single payoff-relevant state variable in the Markov equilibrium, evolving through Bayes&amp;rsquo; rule applied to realized money growth and type-switching probabilities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pure symmetric Markov perfect equilibrium:&lt;/strong&gt; An equilibrium in which all households set the same price and consume the same amount (symmetry), and all strategies — prices P(ρ), central bank actions μ₁(ρ) and μ₂(ρ), and household consumption c(μₐ, ρ) — depend on history only through the current reputation ρ (Markov). The paper focuses exclusively on pure (non-mixed) strategy equilibria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pooling equilibrium:&lt;/strong&gt; An equilibrium in which both types choose the same action μ₁(ρ) = μ₂(ρ) at some reputation ρ. The paper proves analytically that no pooling equilibrium can exist when γ₁ ≠ γ₂ and the pricing distortion is sufficiently severe (Assumption 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separating equilibrium:&lt;/strong&gt; An equilibrium in which μ₁(ρ) ≠ μ₂(ρ) for all ρ, so that realized money growth outcomes are informative about type and reputation evolves non-trivially. The paper argues that sufficient noise is necessary for such equilibria to exist.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective discount factor (βᵢ):&lt;/strong&gt; The discount factor net of type-switching: β₁ = β(1−δ) for the hawkish type (which survives as hawkish with probability 1−δ) and β₂ = β(1−ε) for the dovish type. Central banks care only about payoffs while they are active, so effective discounting captures both time preference and expected tenure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Noise (disconnection between actions and outcomes):&lt;/strong&gt; The stochastic wedge between the central bank&amp;rsquo;s chosen action μᵢ and realized money growth μₐ, governed by a density f(μₐ|μᵢ) with full support. In the paper&amp;rsquo;s framework, noise is not merely a nuisance but a structural feature that makes reputational equilibria possible by preventing single-period complete revelation of type.&lt;/p&gt;</description></item><item><title>Changing Opportunity: Sociological Mechanisms Underlying Growing Class Gaps</title><link>https://macropaperwarehouse.com/papers/changing-opportunity-sociological-mechanisms-underlying-growing-class-gaps/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/changing-opportunity-sociological-mechanisms-underlying-growing-class-gaps/</guid><description>&lt;p&gt;This paper documents sharp divergent trends in intergenerational economic mobility by race and class in the United States across the 1978 to 1992 birth cohorts, and investigates the causal mechanisms driving those changes. The core empirical facts are two: between 1978 and 1992 birth cohorts, the earnings gap between white children from high-income versus low-income families grew by approximately 28–30% (the &amp;ldquo;white class gap&amp;rdquo;), while the earnings gap between white and Black children from low-income families shrank by approximately 27–30% (the &amp;ldquo;white-Black race gap&amp;rdquo;). These twin trends — growing class gaps and shrinking race gaps — appear consistently across earnings, employment rates, educational attainment, SAT/ACT scores, incarceration, marriage, and mortality, and they hold in nearly every region of the country.&lt;/p&gt;
&lt;p&gt;The data are drawn from de-identified federal income tax returns linked to decennial census records and the Numident database, covering 57 million children born between 1978 and 1992, with information on parental and child incomes, employment, marital status, mortality, and residential location, supplemented by ACS educational attainment and linked SAT/ACT records covering 24.8 million students. Children&amp;rsquo;s outcomes are measured primarily as household income percentile ranks at age 27.&lt;/p&gt;
&lt;p&gt;In dollar terms, the white class gap (mean income difference between children raised at the 25th vs. 75th parental income percentile) grew from $17,720 to $20,950 in real 2023 dollars, while the white-Black race gap for low-income families fell from $20,810 to $14,910. The intergenerational rank-rank slope for white children increased from 0.23 to 0.29. The racial gap in intergenerational persistence of poverty — the probability of a child born to the bottom income quintile remaining there — shrank from 14.7 percentage points to 4.1 percentage points (a 72% reduction), driven roughly equally by improvement in Black children&amp;rsquo;s chances of escaping poverty and deterioration in low-income white children&amp;rsquo;s chances. The white class gap in early-adulthood mortality more than doubled, while the white-Black race gap in mortality fell by 77%.&lt;/p&gt;
&lt;p&gt;The paper systematically rules out three alternative explanations. Observable family characteristics (parental education, wealth, occupation, and marital status) explain only 7% of the growing white class gap and none of the shrinking white-Black race gap. Neighborhood-level common shocks, tested by including childhood county or Census tract-by-cohort fixed effects, similarly explain only 7% of the class gap and none of the race gap. The divergent trends persist even among children raised in the same Census tract, pointing to forces that operate differentially across race and class groups within the same neighborhood.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central finding is that changes in children&amp;rsquo;s outcomes across cohorts are strongly and positively correlated (r = 0.91 across subgroups) with changes in parental employment rates within the child&amp;rsquo;s social community, defined as families sharing the same race, class, and childhood county. Low-income white communities experienced sharp relative declines in parental employment rates; low-income Black communities experienced relative improvements. These community-level parental employment changes account for nearly all of the divergent trends.&lt;/p&gt;
&lt;p&gt;To establish causation, the paper exploits variation in the age at which children move to counties with changing parental employment rates. Children who moved at younger ages (before age 8) to counties where parental employment was increasing experienced larger improvements in earnings than those who moved at older ages (after age 13), consistent with a causal exposure effect with greater impact for longer durations of exposure. Sibling comparisons — comparing outcomes of younger versus older siblings who moved together — confirm that the age gradient reflects causal exposure rather than family-level selection.&lt;/p&gt;
&lt;p&gt;The social interaction mechanism is supported by two sources of variation: children&amp;rsquo;s outcomes are more strongly related to parental employment rates of their own birth cohort than adjacent cohorts (cohort specificity unlikely to be explained by resources), and outcomes are primarily driven by the employment rates of same-race, same-class community members, with cross-racial influence appearing only in counties where cross-racial interaction is greater (counties with small Black population shares or higher interracial marriage rates). The unified explanation the paper proposes is that children&amp;rsquo;s outcomes mimic those of the adults in their social communities, following Borjas (1992).&lt;/p&gt;
&lt;p&gt;Q: What are the precise magnitudes of the growing white class gap and shrinking white-Black race gap in income percentile ranks?
A: The white class gap — the difference in mean household income ranks between white children raised at the 25th versus 75th parental income percentiles — increased from 11.1 to 14.1 percentile ranks between the 1978 and 1992 birth cohorts, a 28% increase. The white-Black race gap for children from low-income families fell from 14.9 to 10.9 percentile ranks, a 27% decrease. The intergenerational rank-rank slope for white children increased from 0.23 to 0.29 (a 28% rise in persistence).&lt;/p&gt;
&lt;p&gt;Q: How did the trends in poverty persistence versus upward mobility differ?
A: The convergence in white-Black outcomes was driven almost entirely by changes in poverty persistence rather than upward mobility. The racial gap in the probability of remaining in the bottom income quintile shrank from 14.7 percentage points to 4.1 percentage points (a 72% reduction), with roughly half from Black children being less likely to remain at the bottom and half from white children being more likely to remain. By contrast, the white-Black gap in the probability of rising from the bottom quintile to the top quintile fell by only 1.9 percentage points (17%).&lt;/p&gt;
&lt;p&gt;Q: How widespread geographically were the divergent trends?
A: Outcomes declined for low-income white families in nearly every county, but the largest declines occurred in historically high-mobility areas such as the Great Plains and the coasts. For low-income Black families, outcomes improved in most areas, with the largest gains in historically low-mobility regions including the Southeast and the industrial Midwest. The correlation between county-level changes for low-income white versus low-income Black children is a positive 0.58, meaning the areas where Black families improved most tended to be areas where white families declined least, not most.&lt;/p&gt;
&lt;p&gt;Q: Do the trends persist when using non-rank, inflation-adjusted dollar outcomes?
A: Yes. The white class gap in mean household income grew from $17,720 to $20,950 in real 2023 dollars, and the white-Black race gap for low-income families narrowed from $20,810 to $14,910. The paper also reports similar patterns for individual earnings (as opposed to household income), ruling out changes in household composition as a driver.&lt;/p&gt;
&lt;p&gt;Q: What do the pre-labor-market outcomes show?
A: The divergent trends emerge before children enter the labor market. The white class gap in educational attainment grew by 20%, driven by growing gaps in four-year college completion. The white-Black race gap in educational attainment disappeared by the 1992 cohort, driven by narrowing gaps in high school graduation. The white class gap in the share of students taking the SAT/ACT increased by 12.1 percentage points between the 1980 and 1991 birth cohorts, while the white-Black race gap in SAT/ACT-taking decreased by 20.3 percentage points. The white class gap in mean SAT/ACT scores grew by 62% between the 1980 and 1997 birth cohorts among test-takers.&lt;/p&gt;
&lt;p&gt;Q: How large is the mortality dimension of these trends?
A: The white class gap in early-adulthood mortality (ages 24–27) more than doubled between the 1978 and 1992 birth cohorts, while the white-Black race gap in early-adulthood mortality decreased by 77%. These non-monetary outcomes are invariant to inflation and income measurement choices, confirming the robustness of the broader trends.&lt;/p&gt;
&lt;p&gt;Q: How much do family-level characteristics explain?
A: Controlling jointly for parental education, wealth, occupation, and marital status reduces the estimated growth in the white class gap by only 7% (from 3.37 to 3.13 percentile ranks). The same controls do not explain the shrinking white-Black race gap — the estimated reduction in the race gap actually becomes slightly larger (4.56 rather than 4.16 percentiles) after controlling for family characteristics, indicating that observable family factors work against the observed convergence.&lt;/p&gt;
&lt;p&gt;Q: How much do neighborhood-level common shocks explain?
A: Including childhood county fixed effects interacted with birth cohort explains only 7% of the growing white class gap and none of the shrinking white-Black race gap. Including Census tract fixed effects yields essentially identical results. The divergent trends persist among children growing up in the same Census tract, ruling out explanations based on differential exposure to neighborhood-level economic shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the community-level parental employment correlation, and what does it explain?
A: Changes in children&amp;rsquo;s earnings, SAT/ACT scores, and educational attainment across cohorts are strongly positively correlated with changes in parental employment rates within the child&amp;rsquo;s community (same race, same class, same county), controlling for the employment status of the child&amp;rsquo;s own parents. The correlation between changes in children&amp;rsquo;s outcomes and changes in community parental employment rates across all race and class subgroups is 0.91. This single community-level factor — as proxied by parental employment rates — accounts for nearly all of the divergent trends by race and class.&lt;/p&gt;
&lt;p&gt;Q: What is the quasi-experimental design for estimating causal effects, and what does it assume?
A: The paper compares outcomes of children who moved to counties with increasing parental employment rates at younger versus older ages, across earlier versus later birth cohorts. The identification assumption is &amp;ldquo;constant selection by age&amp;rdquo;: any selection of families into moving to a given county in years when parental employment is higher may differ across cohorts, but those selection differences must not themselves vary systematically with the age at which children move. The paper treats this as a &amp;ldquo;constant selection by age&amp;rdquo; assumption standard in the neighborhood effects literature.&lt;/p&gt;
&lt;p&gt;Q: What do the causal exposure results show?
A: Children who moved before age 8 to communities where parental employment was increasing show systematically higher earnings in later birth cohorts, while children who made the same move after age 13 show little difference in earnings across cohorts. This pattern — larger effects at younger ages — is consistent with a causal exposure effect of growing up in an improving community, with effects proportional to the duration of exposure.&lt;/p&gt;
&lt;p&gt;Q: How do sibling comparisons validate the identification assumption?
A: When siblings move together to a community with increasing parental employment rates, the younger sibling — who receives more years of exposure to the higher-employment environment — earns significantly more than the older sibling. The earnings difference is proportional to the age gap between siblings. This rules out explanations based on fixed unobserved family characteristics and supports the constant-selection-by-age assumption.&lt;/p&gt;
&lt;p&gt;Q: What evidence distinguishes social interaction mechanisms from economic resource mechanisms?
A: Two sources of variation are used. First, children&amp;rsquo;s outcomes are much more strongly related to the parental employment rates of peers in their own birth cohort than peers in adjacent cohorts — a cohort-specificity that is implausible for economic resource channels (school budgets, local tax bases) which would not vary sharply across adjacent cohorts. Second, outcomes of low-income white children are driven primarily by the employment rates of low-income white parents, not by low-income Black or high-income white parents&amp;rsquo; employment rates, and vice versa for low-income Black children — consistent with interaction patterns being stratified by race and class.&lt;/p&gt;
&lt;p&gt;Q: What role does cross-racial interaction play?
A: In counties where Black children constitute a small share of the population (making cross-racial interaction more likely), Black children&amp;rsquo;s outcomes are also related to low-income white parental employment rates. Similarly, in counties with higher interracial marriage rates (a proxy for cross-racial interaction), Black children&amp;rsquo;s outcomes are related to white parental employment rates even after controlling for racial composition. This cross-sectional variation supports the interpretation that the influence channel is social interaction rather than parallel economic shocks.&lt;/p&gt;
&lt;p&gt;Q: How do the findings for Hispanic, Asian, and AIAN children compare?
A: Changes in economic mobility for Hispanic, Asian, and AIAN children between 1978 and 1992 birth cohorts were much more modest than for white and Black children. For children from low-income families, mean household income ranks were essentially unchanged for Asian children and rose by only about 0.5 percentiles for Hispanic and AIAN children. However, the same community-level parental employment rate mechanism explains the (smaller) changes for these groups as well; the correlation between changes in children&amp;rsquo;s outcomes and changes in community parental employment rates is 0.91 across all subgroups.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s unified theoretical account of all the divergent trends?
A: The paper concludes that a parsimonious theory — that children&amp;rsquo;s outcomes mimic those of the parents in their social communities, following Borjas (1992) — explains the divergent trends by race and class. Because social interaction is stratified by race and class even within neighborhoods, changes in parental outcomes in the parent generation propagate differentially to white versus Black and high-income versus low-income children, producing growing class gaps and shrinking race gaps through the same underlying mechanism.&lt;/p&gt;
&lt;p&gt;Q: What does the paper imply about the malleability of economic mobility disparities?
A: Because the causal exposure effects of community environments on children&amp;rsquo;s outcomes can be detected within a 14-year span (1978 to 1992 birth cohorts), the paper implies that differences in economic mobility by race and class may be malleable in policy-relevant timeframes. This is despite the fact that long-standing disparities partly trace back to historical factors such as slavery, Jim Crow laws, redlining, and the Great Migration.&lt;/p&gt;
&lt;p&gt;White class gap: The difference in mean household income ranks in adulthood for white children born to families at the 25th versus 75th percentiles of the national parental income distribution; increased from 11.1 to 14.1 percentile ranks (28%) between the 1978 and 1992 birth cohorts.&lt;/p&gt;
&lt;p&gt;White-Black race gap: The difference in mean household income ranks in adulthood for white versus Black children born to families at the 25th percentile of the national parental income distribution; decreased from 14.9 to 10.9 percentile ranks (27%) between the 1978 and 1992 birth cohorts.&lt;/p&gt;
&lt;p&gt;Social community: In this paper&amp;rsquo;s usage, other families who share the same race, class category, and childhood county as a given child; the unit within which community-level parental employment rates are measured and found to be predictive of children&amp;rsquo;s outcomes.&lt;/p&gt;
&lt;p&gt;Causal exposure effect: The effect on a child&amp;rsquo;s adult outcomes of an additional year spent growing up in a community with higher parental employment rates, estimated quasi-experimentally by comparing children who moved to counties with changing parental employment rates at younger versus older ages; larger effects at younger ages imply a causal, duration-sensitive exposure channel.&lt;/p&gt;
&lt;p&gt;Constant selection by age: The identification assumption underlying the quasi-experimental design; requires that any systematic differences in the types of families who move to a county when parental employment is high versus low do not themselves vary with the age at which children move to that county.&lt;/p&gt;
&lt;p&gt;Intergenerational rank-rank slope: The OLS slope coefficient from regressing child income percentile rank on parental income percentile rank; for white children, increased from 0.23 in the 1978 birth cohort to 0.29 in the 1992 birth cohort, indicating greater persistence of economic status.&lt;/p&gt;
&lt;p&gt;Cohort-specificity of community effects: The empirical pattern that children&amp;rsquo;s outcomes are more strongly related to the parental employment rates of peers in their own birth cohort than those of adjacent cohorts, used in the paper as evidence favoring social interaction over economic resource channels as the mediating mechanism.&lt;/p&gt;</description></item><item><title>Choice and Opportunity Costs</title><link>https://macropaperwarehouse.com/papers/choice-and-opportunity-costs/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/choice-and-opportunity-costs/</guid><description>&lt;p&gt;&lt;strong&gt;Layer 1 — Overview&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper develops a unified choice-theoretic framework in which agents evaluate alternatives not in isolation but relative to their opportunity costs — the alternatives they forgo. The central departure from classical theory is the relaxation of additive separability between benefits and costs. In the standard additive model, accounting for opportunity costs is behaviourally equivalent to simple utility maximisation: a decision maker who correctly perceives the feasible set and maximises an additively separable utility will make identical choices whether or not opportunity costs are explicitly considered (the paper calls this the irrelevance of opportunity costs under additivity, formally establishing it as a general result). Once additive separability is relaxed, however, opportunity costs become non-trivial and generate a genuinely distinct theory of choice.&lt;/p&gt;
&lt;p&gt;The primitive of the model is a net preference — an asymmetric binary relation on pairs (x, y) of distinct alternatives, where (x, y) ≻ (w, z) means the agent strictly prefers obtaining x while forgoing y over obtaining w while forgoing z. Because the opportunity cost of a chosen alternative depends on what else the agent would choose, and vice versa, choice emerges from an intrapersonal equilibrium rather than from direct maximisation.&lt;/p&gt;
&lt;p&gt;The paper defines and axiomatically characterises two nested models. The Recursive Opportunity Model (ROM) adopts a behavioural definition of opportunity costs: the cost of the chosen alternative x in menu A is c(A \ x), the alternative that would actually be chosen were x unavailable; the cost of every unchosen alternative is x itself. This recursive structure is completely characterised by a single observable condition — Weak Path Independence (WPI): if x is chosen when added to a menu A, then x must also be chosen in a pairwise comparison against c(A). WPI is shown to imply Always Chosen (AC) — that a Condorcet winner is always selected — but it permits pairwise cycles of choice (failures of No Binary Cycles). Rationality within the ROM requires additionally that the net preference be a strict order satisfying Congruence, an acyclicity condition on the gross preference induced by the net preference. Even then, the utility function being maximised need not coincide with the gross preference naturally implied by the underlying psychological net preference, raising a welfare identification problem.&lt;/p&gt;
&lt;p&gt;The Opportunity Model (OM) generalises the ROM by allowing the opportunity cost of the chosen alternative to be any unchosen alternative rather than the recursively determined one. This relaxation permits both pairwise cycles and menu effects (Condorcet violations). The OM is completely characterised by Never Chosen (NC): an alternative that loses every pairwise comparison within a menu (a Condorcet loser) cannot be chosen. Imposing a strict order and Congruence on the net preference of an OM rules out only pairwise cycles, leaving menu effects intact. Full rationality within the OM is restored only with the additional assumption that opportunity costs are non-decreasing in the induced gross preference as the feasible set expands (the Increasing Opportunity Model).&lt;/p&gt;
&lt;p&gt;Extensions characterise multivalued versions of both models (M-ROM and M-OM) via adapted axioms on choice correspondences, and show that several known behavioural models in the literature — including list-rationalizable choice and game-tree rationalizable choice — satisfy WPI and thus are instances of ROM. Applications demonstrate that OMs can represent the attraction effect and the multiple decoy effect, providing a preference-maximisation account without appealing to bounded cognition, and that ROMs can represent intransitive pairwise choices via smooth parametric net preferences, avoiding the discontinuities of lexicographic semiorder models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the paper&amp;rsquo;s foundational definition of opportunity cost, and how does it differ from the standard textbook definition?&lt;/strong&gt;
A: The paper defines the opportunity cost of the chosen alternative x in menu A as the alternative that would actually be chosen from A \ {x} — that is, c(A \ {x}). The opportunity cost of any unchosen alternative y is the actual choice x. The standard textbook definition — &amp;ldquo;the next-best feasible alternative&amp;rdquo; — presupposes context-independent, additively separable preferences, precisely the assumption the paper relaxes. The behavioural definition is grounded directly in the agent&amp;rsquo;s own choice function, making it consistent with non-separable evaluations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Under what conditions do opportunity costs become irrelevant, and why?&lt;/strong&gt;
A: If preferences admit an additively separable utility representation u, then for any finite menu A and any two alternatives x and y, u(x) ≥ u(y) if and only if u(x) − max_{a ∈ A{x}} u(a) ≥ u(y) − max_{a ∈ A{y}} u(a). Net utility maximisation and gross utility maximisation rank alternatives identically. Opportunity costs become non-trivial only when additive separability is relaxed — at that point, the agent&amp;rsquo;s comparative evaluation of (alternative, cost) pairs can produce choices that no gross utility function rationalises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the Recursive Opportunity Model (ROM) and what single axiom characterises it?&lt;/strong&gt;
A: A choice function c is a ROM if there exists a net preference ≻ such that for every menu A and every unchosen alternative x, the chosen alternative evaluated at its opportunity cost is preferred to x evaluated at c(A). This is equivalent to the choice function satisfying Weak Path Independence (WPI): if x ∉ A and x = c(A ∪ {x}), then x = c({x, c(A)}). WPI is necessary and sufficient for a ROM (Theorem 1). It is not sufficient for full rationality, as it permits pairwise cycles while ruling out menu effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What kinds of irrationality can a ROM exhibit, and what kinds does it preclude?&lt;/strong&gt;
A: The paper establishes (Corollary 1) that WPI implies Always Chosen — a ROM always selects the Condorcet winner when one exists. Therefore, the only admissible form of irrational behaviour in a ROM is pairwise cycles (failures of No Binary Cycles). Condorcet violations (menu effects) are precluded. A ROM becomes fully rational if and only if it additionally satisfies No Binary Cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What additional condition on the net preference guarantees that a ROM is rational?&lt;/strong&gt;
A: Theorem 2 establishes that a choice function is rational if and only if it is a ROM generated by a net preference that is a strict order (complete, asymmetric, transitive) satisfying Congruence. Congruence requires that the induced binary relation P≻ on alternatives — defined by xP≻y whenever there exists z such that (x, z) ≻ (y, z) or (z, y) ≻ (z, x) — is acyclic. For a (u, v)-additive net preference, Congruence holds if and only if u and v are ordinally equivalent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Can rational behaviour generated by a ROM be welfare-analysed using revealed preference in the standard sense?&lt;/strong&gt;
A: No — and this is a key warning in the paper. Even when a ROM with a strict order and Congruence produces fully rational behaviour, the utility function being maximised need not coincide with the gross preference P≻ naturally induced by the underlying net preference. The paper provides an explicit example (Remark 1, equation 10) in which the choice-rationalising order P is xPyPz while the induced preference is xP≻zP≻y. The utility &amp;ldquo;revealed&amp;rdquo; by choice may diverge from the psychological primitive driving that choice, undermining the normative authority of standard revealed preference welfare analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the Opportunity Model (OM) and how does it extend the ROM?&lt;/strong&gt;
A: The OM relaxes the recursive assumption by allowing the opportunity cost of the chosen alternative to be any unchosen element of the menu rather than specifically c(A \ c(A)). This breaks the recursive structure while preserving the intrapersonal equilibrium character (the choice still affects the net value of alternatives). The OM is completely characterised by Never Chosen (NC): no Condorcet loser can be chosen (Theorem 3). Unlike the ROM, an OM may fail to select the Condorcet winner, permitting both pairwise cycles and Condorcet violations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the Increasing Opportunity Model and when does it restore full rationality?&lt;/strong&gt;
A: An IOM is an OM in which the opportunity function o is monotone in the sense that if A ⊃ B and o(A) ≠ o(B), then o(A) is ranked higher than o(B) in the induced gross preference P≻. Intuitively, opportunity costs do not decrease as the feasible set expands. Theorem 5 establishes that a choice function is rational if and only if it is an IOM generated by a net preference that is a strict order satisfying Congruence. Full rationality within the OM thus requires both the internal consistency of the net preference (strict order, Congruence) and this monotonicity of opportunity costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper explain the attraction effect using the OM?&lt;/strong&gt;
A: In the canonical formulation, c({x,y}) = x, c({y,d}) = y, c({x,d}) = x, and c({x,y,d}) = y, where d is a decoy. This pattern is incompatible with gross preference maximisation. The paper represents it as an OM with opportunity function o({x,y,d}) = d and a strict net preference order yd ≻ xy ≻ yx ≻ xd ≻ dx ≻ dy. The psychological interpretation is that the introduction of the decoy shifts the comparator for y from x to d; y looks more favourably comparable to d than x does, so the equilibrium where y is chosen is selected. No bounded cognition or imperfect attention is assumed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the framework account for multiple decoys?&lt;/strong&gt;
A: With decoys dx and dy specific to x and y respectively, the observed pattern c({x,y}) = x and c({x,y,dy}) = y and c({x,y,dx,dy}) = y can be represented as an OM with a transitive net preference satisfying xdx ≻ ydy ≻ xy ≻ yx ≻ dyy ≻ dxx and opportunity function o({x,y,dx,dy}) = dx, o({x,y,dy}) = dy. The paper notes this net preference can be extended to a strict order while preserving the choice pattern. This accommodates a phenomenon that poses a challenge to standard theoretical choice literature (per Masatlioglu, Nakajima and Ozbay [25]).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the ROM explain intransitive choices more smoothly than lexicographic semiorder models?&lt;/strong&gt;
A: The paper shows that the Tversky (1969) cyclical pattern c({x,y}) = x, c({y,z}) = y, c({x,z}) = z with x=(115,7), y=(117,3), z=(120,0) can be generated by net preferences that admit smooth parametric representations. Specifically, for any two alternatives w=(a,b) and z=(c,d), the paper proposes (w,z) ≻ (z,w) iff (max{a−c, b−d})² &amp;gt; k(min{a−c, b−d})², where k is a relative sensitivity parameter. For k=1/2 this yields the required cycle. Lexicographic models require sharp discontinuities in preference and systematic avoidance of trade-offs, which are often viewed as implausible within the standard economic paradigm; the smooth parametric form avoids these features.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the relationship between ROMs and previously studied choice models in the literature?&lt;/strong&gt;
A: Several known models satisfy WPI and are therefore, by Theorem 1, instances of ROMs: specifically, Rationalizability by Game Trees (Xu and Zhou) and List-Rationalizable Choice (Yildiz) are shown to satisfy WPI. The two-stage choice model of Bajraj and Ulku satisfies NC but not WPI, making it an OM but not a ROM. The net preference being maximised in each case can in principle be recovered using the explicit construction in the proof of Theorem 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the ROM relate to Koszegi-Rabin personal equilibrium?&lt;/strong&gt;
A: Both models involve preferences that depend on a variable determined endogenously by choice, requiring an intrapersonal equilibrium concept in which the agent&amp;rsquo;s conjectures about their own behaviour must be internally consistent. The key difference is that in Koszegi-Rabin the psychological primitive is a set of reference-dependent preferences ≻&lt;em&gt;r on alternatives in X (where r is the reference point), and equilibrium requires c(A) ≻&lt;/em&gt;{c(A)} y for all y ∈ A \ c(A). In the ROM, the primitive is a preference on pairs of distinct alternatives, and the opportunity cost differs for each alternative being compared (the chosen alternative has one opportunity cost, each unchosen alternative has a different one, namely c(A) itself).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Net preference:&lt;/strong&gt; An asymmetric binary relation on pairs (x, y) of distinct alternatives, where (x, y) ≻ (w, z) means the agent strictly prefers to be in a situation where they choose x while forgoing y over a situation where they choose w while forgoing z. The primitive is defined on X = {(x, y) ∈ X × X : x ≠ y}, without imposing additive separability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Recursive Opportunity Model (ROM):&lt;/strong&gt; A choice function c is a ROM if there exists a net preference ≻ such that for every menu A and every unchosen x, the pair (c(A), c(A \ c(A))) ≻ (x, c(A)). The opportunity cost of the chosen alternative is defined recursively as c(A \ c(A)); choice results from intrapersonal equilibrium rather than simple maximisation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Opportunity Model (OM):&lt;/strong&gt; A generalisation of the ROM in which the opportunity cost of the chosen alternative can be any unchosen alternative in the menu (not necessarily the recursively determined one). Characterised by Never Chosen: no Condorcet loser can be chosen. Permits both pairwise cycles and Condorcet violations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Weak Path Independence (WPI):&lt;/strong&gt; The axiom characterising ROMs: if x ∉ A and x = c(A ∪ {x}), then x = c({x, c(A)}). Equivalently, if an alternative is chosen upon being added to a menu, it must also win in a pairwise comparison with what was previously chosen from the original menu.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Congruence:&lt;/strong&gt; A consistency condition on net preferences requiring that the induced binary relation P≻ — defined by xP≻y whenever there exists z such that (x,z) ≻ (y,z) or (z,y) ≻ (z,x) — is acyclic. For a (u,v)-additive net preference, Congruence holds if and only if u and v are ordinally equivalent. Together with a strict net preference order, Congruence in a ROM is equivalent to rational choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intrapersonal equilibrium:&lt;/strong&gt; The concept underlying both models: an agent is in equilibrium when selecting x from A if they correctly anticipate their own contingent behaviour across hypothetical scenarios (i.e., they use the actual choice function c to evaluate what they would choose from A \ {x}), and the chosen alternative is net-preference-maximal given those consistent conjectures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Never Chosen (NC):&lt;/strong&gt; The axiom characterising OMs: an alternative that is a Condorcet loser — losing in every pairwise comparison within a menu — cannot be chosen from that menu. NC is weaker than WPI (which implies both Always Chosen and Never Chosen) and is the precise behavioural content of the OM.&lt;/p&gt;</description></item><item><title>Civil War–Induced Displacement and Human Capital</title><link>https://macropaperwarehouse.com/papers/civil-warinduced-displacement-and-human-capital/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/civil-warinduced-displacement-and-human-capital/</guid><description>&lt;p&gt;This paper examines the impact of conflict-driven forced displacement on human capital accumulation using the Mozambican civil war (1977–1992) as the empirical setting. During this war, over four million civilians — roughly a third of the population — fled to rural areas, cities, neighboring countries, or UN-managed refugee camps. The study advances on prior work in three dimensions: it uses the full post-war population census (12 million individuals) rather than a small survey; it studies multiple displacement trajectories in a single framework; and it separately identifies place-based exposure effects from a general uprootedness effect.&lt;/p&gt;
&lt;p&gt;The primary data source is the 1997 Mozambican census, which records each individual&amp;rsquo;s place of birth, residence in 1992 (the war&amp;rsquo;s end), and residence in 1997. Key outcomes are educational attainment and sectoral employment (agricultural versus services). The authors supplement the census with digitized colonial road and school maps, georeferenced conflict events, and landmine contamination data.&lt;/p&gt;
&lt;p&gt;The main identification strategy compares approximately 135,000 siblings (from 45,000 families) separated during the war, using the sibling who stayed behind as a within-family counterfactual. This design controls for household-level characteristics including religious and ethnic background, aspirations, and exposure to violence.&lt;/p&gt;
&lt;p&gt;The key findings are as follows. First, rural-born IDPs displaced to cities have a 7.3 percentage point higher likelihood of attending primary school and 0.53 more years of schooling compared to their siblings who stayed behind — roughly one-third of the non-displaced mean. Rural-born IDPs displaced to other rural areas also show gains, with a 3 percentage point higher likelihood of attending school and 0.24 additional years, supporting the uprootedness hypothesis even for displacements that did not reach urban centers. Urban-born IDPs forcibly relocated to the countryside — primarily through FRELIMO&amp;rsquo;s villagization scheme — experienced 9 percentage point lower primary school attendance and approximately 0.5 fewer years of schooling relative to siblings who remained in cities.&lt;/p&gt;
&lt;p&gt;External displacement (to camps in Malawi or Zimbabwe) generated no significant schooling gains relative to staying siblings, despite UN-built schools in camps, likely because scarce employment opportunities reduced perceived returns to education.&lt;/p&gt;
&lt;p&gt;Second, the paper jointly estimates place-based and uprootedness effects in a single within-family framework. Place effects are statistically significant: displacement to a district one standard deviation more developed than one&amp;rsquo;s birthplace raises schooling likelihood by approximately 3 percentage points (OLS) to 5 percentage points (2SLS reduced form). Crucially, a residual uprootedness effect of approximately 2–4 percentage points persists even after controlling fully for destination-origin differences in development and conflict intensity. This uprootedness effect is quantitatively comparable to being displaced to a district one standard deviation more developed than one&amp;rsquo;s birthplace.&lt;/p&gt;
&lt;p&gt;Third, a primary survey of 208 Nampula residents conducted in early 2020 — three decades after the war — confirms lasting educational gains. IDPs displaced to Nampula have a 10 percentage point higher likelihood of completing primary school relative to their siblings who stayed in the countryside, and their educational attainment converged to levels of urban-born, never-displaced residents despite large urban-rural education gaps. However, IDPs report significantly lower social capital, civic participation, and community trust than urban-born respondents, and score significantly worse on mental health indicators, including depression, loneliness, and pessimism. These psychosocial costs persist three decades after the war&amp;rsquo;s end.&lt;/p&gt;
&lt;p&gt;The findings apply to a low-income, post-colonial African setting characterized by widespread illiteracy (over 60%) and subsistence agriculture (over 85% of employment) at the war&amp;rsquo;s close. The results are robust to alternative age restrictions, extended family comparisons, dropping the oldest sibling, same-sex sibling pairs, and narrowing the age gap between sibling pairs to as few as two years.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy and why is it preferred over cross-sectional estimates?
A: The authors compare siblings within the same household who experienced different displacement trajectories during the war. Because siblings share household-level characteristics — parental preferences for education, ethnic and religious background, wealth, and local conflict exposure — the within-family design controls for confounders that would bias cross-sectional estimates. The within-family estimates are systematically smaller than cross-sectional ones (e.g., 7.3 pps vs. 24–30 pps for rural-to-urban displacement in primary school attendance), confirming that sorting was present even in the unpredictable civil war setting.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for rural-born IDPs displaced to urban centers?
A: Within the sibling-pair framework, rural-born IDPs displaced to cities and towns have a 7.3 percentage point higher likelihood of attending primary school and 0.53 more years of schooling compared to their siblings who stayed in rural birthplaces, against a non-displaced sibling mean of approximately 20% primary school access and one year of formal schooling. These IDPs also show a 4 percentage point higher likelihood of non-agricultural employment five years after the war&amp;rsquo;s end.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for rural-born IDPs displaced to other rural areas?
A: Even displacement to a different rural district — not a city — generates modest but statistically significant gains: a 3 percentage point higher likelihood of attending school and 0.24 additional years of schooling relative to siblings staying in their birthplace rural district. The authors interpret this as evidence for the uprootedness hypothesis, since rural Mozambique at the time was among the most impoverished and insecure environments in the world, meaning destination quality alone cannot explain the gain.&lt;/p&gt;
&lt;p&gt;Q: What do the results show for externally displaced refugees?
A: Refugees displaced to camps and settlements in Malawi, Zimbabwe, Tanzania, Zambia, and Swaziland show schooling levels statistically similar to their siblings who remained in their rural birthplaces, despite UN-built primary schools in camps. The authors attribute the absence of gains to low perceived returns to education stemming from scarce employment opportunities at displacement destinations. Externally displaced individuals do show a 5 percentage point lower likelihood of agricultural employment relative to staying siblings.&lt;/p&gt;
&lt;p&gt;Q: What are the consequences of urban-to-rural forced displacement?
A: Urban-born individuals forcibly relocated to the countryside — primarily through FRELIMO&amp;rsquo;s villagization and food production programs — have approximately 9 percentage point lower likelihood of attending primary school and 0.5 fewer years of schooling compared to siblings who remained in urban areas. These results indicate that FRELIMO&amp;rsquo;s coercive relocation policies imposed material human capital costs on the displaced.&lt;/p&gt;
&lt;p&gt;Q: How are place-based and uprootedness effects separated empirically?
A: The authors construct principal component indices for destination-origin differences in regional development (aggregating population density, Portuguese-speaking share, offspring mortality, road density, colonial market density, and school density) and conflict intensity (conflict events per capita and landmine contamination per capita). They then include these continuous exposure measures alongside a binary displacement indicator in within-family regressions. The coefficient on the binary displacement indicator — conditional on destination-origin development and conflict differences — isolates the uprootedness effect for individuals displaced to districts with identical characteristics to their birthplace.&lt;/p&gt;
&lt;p&gt;Q: What are the magnitudes of the place-based and uprootedness effects?
A: Under OLS, displacement to a district one standard deviation more developed than one&amp;rsquo;s birthplace raises schooling likelihood by approximately 3 percentage points. The residual uprootedness effect — displacement per se, controlling for destination quality — raises schooling likelihood by approximately 2 percentage points. Under 2SLS (instrumenting destination-origin development differences with the development of districts within 100 km of birthplace), the place-based effect rises to approximately 5 percentage points in the reduced form, and the uprootedness effect remains significant at approximately 4 percentage points. Both the uprootedness and place-based effects are of comparable magnitude.&lt;/p&gt;
&lt;p&gt;Q: What instrument is used in the 2SLS specifications and what is its first-stage strength?
A: The instrument exploits the fact that Mozambique&amp;rsquo;s heavily mined and rudimentary transportation network constrained civilian movement — the median displaced sibling ended up roughly 97 kilometers from birthplace. The authors instrument actual destination-origin development and conflict differences with the predicted differences based on the characteristics of districts within 100 km of the birthplace. The first-stage elasticity between actual and proximity-predicted differences in development is 0.86, and for conflict is 0.88, both precisely estimated.&lt;/p&gt;
&lt;p&gt;Q: What do the long-run survey results from Nampula show about educational persistence?
A: In a 2020 survey of 208 Nampula residents aged over 35, IDPs who fled to Nampula during the war have a 10 percentage point higher likelihood of completing primary school relative to their siblings who stayed in the countryside. Their educational attainment converges to the level of urban-born, never-displaced Nampula residents, despite large historical and contemporary urban-rural education gaps in northern Mozambique. The majority of IDPs (73%) report that extended relatives or friends advised them to attend school upon arriving in the city, and most believed education was necessary for urban employment.&lt;/p&gt;
&lt;p&gt;Q: What are the long-run psychosocial costs documented in the Nampula survey?
A: Even three decades after the war&amp;rsquo;s end, IDPs in Nampula report significantly lower social capital, civic participation, and community trust compared to urban-born never-displaced residents. IDPs also score significantly worse on mental health indicators including depression, loneliness, and pessimism. These findings suggest that forced displacement imposes persistent psychosocial costs that are not remediated by economic or educational convergence.&lt;/p&gt;
&lt;p&gt;Q: What drives displacement in the data, and does selection threaten identification?
A: Linear probability and multinomial logit models show that conflict intensity and geographic proximity (distance to the border for external displacement; distance to cities for urban displacement) are the primary correlates of displacement type, while differences in destination development are uncorrelated with displacement. Nevertheless, the overall explanatory power of these models is low, confirming many idiosyncratic and unpredictable features of the war. The within-family design addresses residual selection on household characteristics, and the 2SLS design addresses selection on destination-specific characteristics.&lt;/p&gt;
&lt;p&gt;Q: How do educational gains translate into sectoral employment outcomes?
A: Across specifications, gains in schooling move in tandem with a shift out of agriculture into services. Rural-to-urban IDPs have a 4 percentage point higher likelihood of non-agricultural employment five years after the war, while externally displaced show a 5 percentage point lower likelihood of agricultural employment. Urban-born IDPs displaced to the countryside are more likely to work in agriculture after the war. The authors interpret this co-movement as suggesting that conflict-driven human capital accumulation may contribute to structural transformation away from subsistence agriculture.&lt;/p&gt;
&lt;p&gt;Q: How robust are the within-family estimates?
A: The authors conduct six sensitivity checks: adding family fixed effects to cross-sectional regressions, restricting to individuals aged 12–18 in 1997 to address co-habitation concerns, extending comparisons to cousins and other relatives, dropping the oldest male sibling to minimize favoritism concerns, restricting to same-sex sibling pairs, and narrowing the age gap to two years. Across all permutations, the qualitative ordering is preserved: refugees show no significant schooling gains, rural-to-urban IDPs show gains of 5–6 percentage points in primary attendance and 0.35–0.5 extra years, rural-to-rural IDPs show small positive gains, and urban-to-rural IDPs show losses.&lt;/p&gt;
&lt;p&gt;Uprootedness hypothesis: The idea, traced in the paper to Stigler and Becker (1977) and earlier scholars, that forced displacement incentivizes human capital investment precisely because education is a mobile asset that cannot be expropriated — distinct from place-based effects of destination quality.&lt;/p&gt;
&lt;p&gt;Place-based (exposure) effects: The impact on human capital outcomes attributable to differences between the development level and conflict intensity of the displacement destination and the individual&amp;rsquo;s birthplace, measured as destination-origin differences in a principal component index of regional development.&lt;/p&gt;
&lt;p&gt;Separated siblings design: An identification strategy that compares siblings from the same household who experienced different displacement trajectories during the war, holding constant all household-level characteristics including parental preferences, ethnicity, religion, wealth, and local conflict exposure.&lt;/p&gt;
&lt;p&gt;Internal displacement (IDP): Conflict-driven movement within national borders to either rural areas or urban centers, constituting approximately 60% of global forced displacement and the majority of displacement in the Mozambican civil war context.&lt;/p&gt;
&lt;p&gt;Source text origin: A categorization of the working paper text used for summarization — distinguishing full PDF or HTML text from abstract-only text. Abstract-only text is a hard block for summary generation in the pipeline.&lt;/p&gt;
&lt;p&gt;Structural transformation: In this paper&amp;rsquo;s usage, the shift of workers out of subsistence agriculture into services associated with human capital accumulation triggered by conflict-driven displacement, treated as a potential mechanism of post-conflict recovery.&lt;/p&gt;
&lt;p&gt;Psychosocial costs of displacement: Long-run deficits in social capital, civic engagement, community trust, and mental health (depression, loneliness, pessimism) reported by IDPs three decades after displacement, persisting despite convergence in educational attainment and employment.&lt;/p&gt;</description></item><item><title>Climate change and the macroeconomics of bank capital regulation</title><link>https://macropaperwarehouse.com/papers/climate-change-and-the-macroeconomics-of-bank-capital-regulation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/climate-change-and-the-macroeconomics-of-bank-capital-regulation/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks two related questions about the intersection of climate policy and bank capital regulation. First, can differentiated bank capital requirements — imposing higher equity charges on loans to fossil energy firms — serve as a quantitatively meaningful climate policy instrument, in particular relative to carbon taxes? Second, how should optimal bank capital requirements respond to a carbon-tax-induced clean energy transition?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a quantitative multi-sector DSGE model with two layers of default: corporate default at the firm level and bank failure at the bank level. Three intermediate goods sectors are modeled — non-energy, fossil energy, and clean energy — linked via a nested CES final-good production structure. Banks collect deposits from households (who value deposits for liquidity services) and issue defaultable loans to all three sectors. Deposit insurance, combined with limited liability for bank owners, generates an inefficiently high bank risk-taking motive, creating a role for capital regulation. The Ramsey-optimal capital requirement balances the social benefit of liquid deposit provision to households against the social cost of bank failure.&lt;/p&gt;
&lt;p&gt;The model is calibrated to quarterly data, targeting a 0.7% annualized bank failure rate, a 2% annualized corporate default rate, a 30% loan recovery rate, a deposit spread of -100 basis points, and a baseline Ramsey-optimal equity requirement of 8% (consistent with Basel III). Sectoral parameters follow Bartocci, Notarpietro, and Pisani (2022) and Fried, Novan, and Peterman (2022): the energy-to-non-energy elasticity of substitution is 0.2, the clean-to-fossil energy elasticity is 3, and full abatement occurs at carbon taxes exceeding 125 $/tonne of carbon (ToC). The clean transition experiment imposes a linear carbon tax path from zero to 10 $/ToC over 40 quarters, announced as an unanticipated but fully credible shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 1 — Fossil-penalizing capital requirements are quantitatively negligible as climate policy.&lt;/em&gt; Raising the capital requirement on fossil loans from the baseline 8% to 12% (a 150% risk-weight, consistent with current BB- treatment) reduces the fossil capital share within the energy sector by only 0.06 percentage points (from 80.00% to 79.94%) and cuts aggregate emissions by only 0.08%. A 1 $/ToC carbon tax, by contrast, achieves a 5.23% emission reduction while modestly reducing the fossil capital share to 79.80%. The difference arises because capital requirements affect only the size and financing cost of fossil firms, leaving abatement incentives unchanged; the loan-rate effect on fossil firms is small (loan rate rises from 124 bps to 128 bps), consistent with Kashyap, Stein, and Hanson (2010).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 2 — Sustainability-linked capital requirements remain insufficient.&lt;/em&gt; Conditioning the fossil capital requirement on firms&amp;rsquo; abatement effort (κ_f = 0.12 − η_t) induces an optimal abatement effort of 2.69% and an effective fossil requirement of approximately 9.5%. The implied emission reduction remains far below even a modest carbon tax: the authors state the induced emission reduction falls short by a factor of almost 100 relative to full abatement.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 3 — Ramsey-optimal capital requirements decline monotonically along the transition (in the baseline real model).&lt;/em&gt; When a carbon tax gradually rises from zero to 10 $/ToC over 40 quarters, aggregate loan demand contracts permanently because clean, fossil, and non-energy goods are imperfect substitutes and the shock is recessionary for GDP. Banks reduce balance sheets, deposit supply falls, the deposit spread widens by approximately 8 basis points in the long run, and corporate default rates across all sectors rise by almost 0.1 percentage points from the baseline of 2.05% (in steady state). To counteract the deposit scarcity and associated firm risk-taking, the Ramsey-optimal capital requirement declines symmetrically and monotonically to a lower long-run level. Bank capital regulation cannot affect impact default rates because leverage decisions are made before the transition is announced.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 4 — Nominal rigidities produce a temporary tightening before the long-run relaxation.&lt;/em&gt; When debt is denominated in nominal terms and Rotemberg price adjustment costs are added, the clean transition is inflationary in the short run (consistent with Ciccarelli and Marotta 2021). Inflation makes deposit financing more attractive, inducing firms to temporarily increase nominal loan issuance; real deposits rise briefly, the deposit spread narrows by around 2 basis points, and the optimal capital requirement tightens over the initial phase of the transition before converging to the same lenient long-run level as the baseline. The short-run tightening is followed by a permanent relaxation.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Finding 5 — Differentiated sector-specific capital requirements are only warranted when banks are not diversified across sectors.&lt;/em&gt; In the baseline, perfectly diversified banks face a symmetric aggregate loan demand contraction, so uniform adjustment suffices. When sector-specific banks are introduced (an extreme case meant to bound concentration effects), fossil banks experience a strong reduction in deposit supply while clean banks experience the opposite. The optimal response is temporarily tighter capital requirements for clean banks and relaxed requirements for fossil banks. In the long run, both converge to an aggregate risk-weight of approximately 99.85% relative to the baseline (a small but symmetric relaxation), very close to the diversified baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All results are derived within a model calibrated to match broad financial-market and macroeconomic regularities rather than a specific country. Physical risk from climate change is abstracted away throughout. The carbon tax is set exogenously (not derived from a climate policy optimum). Firms cannot switch technologies, providing a conservative lower bound on the sectoral reallocation. Results are robust to halving the deposit demand elasticity parameter (γ_D = 0.6 versus 1.5 in the baseline) and to raising the energy/non-energy substitution elasticity to 3 from 0.2.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core trade-off that determines the optimal level of bank capital requirements in this model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The optimal capital requirement balances two welfare-relevant effects of bank leverage. Tighter requirements reduce bank failure rates, limiting the resource losses (proportional to deposits under DIA management) and the inefficient risk-taking that deposit insurance induces. At the same time, tighter requirements force banks to reduce deposit-financed lending, shrinking the supply of liquid deposits that households value directly in utility. The Ramsey planner chooses the capital requirement that equates the marginal welfare benefit of lower bank failure against the marginal welfare cost of reduced deposit provision. In the baseline calibration this optimum is at 8%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does raising capital requirements on fossil loans have such a small effect on carbon emissions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Capital requirements affect the deposit-financing wedge for fossil loans — the share of loans that can be funded via cheap, deposit-financed sources — but they do not enter firms&amp;rsquo; first-order condition for abatement. Firms respond by modestly reducing leverage and investment (the loan rate for fossil energy firms rises from 124 bps to 128 bps), but the emission intensity of fossil production is unchanged. In equilibrium, the fossil capital share within the energy sector declines by only 0.06 percentage points (from 80.00% to 79.94%), reducing total emissions by 0.08%. A 1 $/ToC carbon tax produces a 5.23% emission reduction, many times larger, because carbon taxes directly alter the return to abatement and the profitability of fossil relative to clean production.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the sustainability-linked capital requirement work and why is it still insufficient?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under sustainability-linked capital requirements, the fossil loan charge is set as κ_f = κ̃ − η_t, so firms that abate more face lower capital requirements on their loans and thus lower financing costs. This creates a direct financial incentive for abatement that the simple penalizing factor lacks. With κ̃ = 0.12, the equilibrium abatement effort is 2.69% and the effective fossil requirement falls to approximately 9.5%. Despite this improvement relative to the plain fossil factor, the climate impact remains far smaller than even a modest carbon tax: the induced emission reduction falls short by a factor of almost 100 relative to full abatement. The fundamental limitation is that the feedback from abatement to financing cost is attenuated by deposit-financing wedge mechanics, making the instrument too weak to substitute for direct carbon pricing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the impact, short-run, and long-run effects of the clean transition on default rates and bank failure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: On impact, the unexpected compliance cost increase raises fossil firms&amp;rsquo; default threshold, causing a sharp but short-lived uptick in fossil firm default rates (from 2.05% to approximately 2.08% in the baseline transition) and a brief increase in bank failure. Clean firm defaults fall slightly on impact due to higher clean energy prices. In the short run, clean firms increase risk-taking (higher leverage) because the relative attractiveness of debt financing improves as deposit spreads widen; fossil firms deleverage. In the long run, aggregate corporate default rates rise by almost 0.1 percentage points from the baseline of 2.05% (equivalently 2.7% in the Appendix B long-run analysis), driven by the widening of the deposit spread (approximately 8 bps), which raises the deposit financing wedge for all firms. Bank failure rates are always tied to binding capital requirements and revert quickly to their steady-state level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why can bank capital regulation not mitigate the impact default spike when the transition is announced?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: At the moment of announcement, leverage decisions for the current period have already been made. The bank capital requirement binds on new lending decisions but cannot alter the existing capital structure of banks or firms. Therefore the regulator faces a &amp;ldquo;bygone&amp;rdquo; on impact: changing the capital requirement in the announcement period does not affect current corporate default rates or bank failure rates. The regulator&amp;rsquo;s tool only becomes effective for lending decisions going forward, implying that the transition-induced impact default surge cannot be smoothed by macroprudential policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why do Ramsey-optimal capital requirements decline along the transition rather than tighten to address higher default risk?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The key channel is that aggregate loan demand contracts permanently as imperfect substitutability across sectors makes the carbon tax recessionary. Banks shrink their balance sheets, reducing deposit supply. The resulting deposit scarcity makes deposits more valuable to households (widening the spread), which also makes deposit financing cheaper for banks, partially offsetting the loan demand decline but at the cost of higher corporate leverage. The welfare loss from reduced liquidity provision and higher firm default rates dominates, so the planner relaxes capital requirements to stimulate deposit supply. The dominant effect is the large, permanent decline in credit demand, which makes it welfare-improving to allow banks to operate at lower capital ratios to rebuild deposit provision.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the role of the deposit financing wedge in transmitting carbon tax shocks to the entire corporate sector?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The deposit financing wedge (Ξ_t) reflects the benefit for banks of funding loans through deposits rather than equity, combining the liquidity premium households pay on deposits and the deposit insurance put (expected repayment is only 1 − F(μ_{t+1}) per unit of deposits issued). When aggregate loan demand falls due to carbon taxes, deposits become scarcer relative to their steady-state level, making the wedge larger. Through the loan pricing condition, all sectors — not just fossil — face more attractive deposit-financed debt, causing clean and non-energy firms to also increase their leverage and default risk along the transition. This is the mechanism through which a sector-specific shock has symmetric aggregate effects that shape optimal bank regulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do nominal rigidities change the optimal path of capital requirements along the clean transition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: With Rotemberg price adjustment costs and nominally denominated debt, the clean transition is inflationary in the short run (consistent with empirical evidence in Ciccarelli and Marotta 2021). Inflation lowers the real value of outstanding nominal loan obligations, incentivizing firms across all sectors to temporarily increase nominal borrowing. Banks accommodate this demand by increasing deposit issuance, which briefly narrows the deposit spread by around 2 basis points. With deposit supply temporarily elevated, the regulator&amp;rsquo;s trade-off tilts toward reducing bank failure rather than stimulating deposit provision, so optimal capital requirements tighten during the inflationary phase before reverting to the lenient long-run path of the baseline model. The long-run level is unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Under what conditions are sector-specific capital requirements welfare-improving?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Sector-specific requirements are only welfare-improving when banks are not perfectly diversified across sectors, so that the transition has heterogeneous effects on sector-specific deposit supply and bank failure rates. In the baseline with perfectly diversified banks, the loan demand decline affects all banks uniformly, so a symmetric uniform adjustment is optimal. When sector-specific banks are introduced as an extreme case of carbon concentration, fossil banks experience a sharp reduction in deposit provision while clean banks see deposits temporarily increase. The planner responds by temporarily relaxing requirements for fossil banks and tightening them for clean banks. In the long run, both converge to approximately the same aggregate relaxation as the diversified baseline (aggregate risk-weight of 99.85%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the carbon tax shock experiment relate to the perfect-foresight transition analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the carbon tax shock experiment, the tax level follows an AR(1) process with persistence ρ_τ = 0.9, starting from a long-run level of 10 $/ToC, with a one-standard-deviation shock implying an additional 10 $/ToC on impact. Fossil firm default rates spike from 2% to approximately 2.8% on impact and revert relatively quickly. Emissions decline by slightly more than 10% on impact and revert as the shock dissipates. The macroeconomic dynamics — GDP, investment, loan demand, and bank failure rate responses — closely resemble the impact and short-run effects of the perfect-foresight transition. Optimal capital requirements decline temporarily in both cases, confirming that the transition-path results are not an artifact of the specific perfect-foresight assumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the &amp;ldquo;forced safety effect&amp;rdquo; and how does it interact with the model&amp;rsquo;s capital requirement trade-off?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The &amp;ldquo;forced safety effect&amp;rdquo; (following Bahaj and Malherbe 2020) refers to the positive effect of tighter capital requirements on loan supply that operates through reducing bank failure probability. When banks are less likely to fail (lower F(μ_{t+1})), the expected bank productivity conditional on not failing — (1 − G(μ_{t+1})) — rises toward one, reducing the discount applied to future loan payoffs in the bank&amp;rsquo;s stochastic discount factor. This improves the profitability of lending and expands loan supply. In the model, this effect partially offsets the direct loan-supply reduction from higher equity requirements but does not dominate, so the overall effect of tighter requirements on deposit supply is still negative, preserving the core trade-off.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What robustness checks are performed and do they materially change the main results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors consider three main robustness checks. First, reducing the deposit demand elasticity parameter from γ_D = 1.5 to γ_D = 0.6 (recalibrating ω_D = 0.012 to preserve the -100 bp deposit spread target) has almost no effect on the optimal path of capital requirements. Second, raising the energy/non-energy substitution elasticity from ε̃ = 0.2 to ε̃ = 3 (and adjusting the energy weight to maintain a 10% energy share) produces much stronger fossil investment declines and smaller clean investment responses, but aggregate loan demand and bank deposits contract only slightly less, so the relaxation in capital requirements is slightly smaller than in the baseline. Third, recalibrating to a 2% annualized bank failure rate (versus the baseline 0.7%) does not materially change results. The conclusion that capital requirements should decline along the transition is robust across all specifications.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Deposit financing wedge (Ξ_t):&lt;/strong&gt; The gain for banks from funding loans via deposits rather than equity. It comprises two components: (i) the liquidity premium — households value deposits for their liquidity services, so the deposit rate lies below the risk-free rate; and (ii) the deposit insurance put — the expected repayment obligation per unit of deposits is only 1 − F(μ_{t+1}), not one, since the DIA covers depositors in the event of bank failure. A larger wedge makes deposit-financed lending more profitable, expanding loan supply. In this paper the wedge is the central transmission mechanism through which capital requirements and aggregate loan demand interact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank failure threshold (μ_t):&lt;/strong&gt; The realization of the bank-specific idiosyncratic risk shock below which a bank cannot service depositors and transfers all assets and liabilities to the deposit insurance agency. It depends on the ratio of deposit repayment obligations to the aggregate realized loan portfolio return. In the model the threshold increases when aggregate loan payoffs fall (as in a carbon tax shock), temporarily raising bank failure rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ramsey-optimal capital requirement:&lt;/strong&gt; The sequence of sector-specific (or uniform) capital ratios chosen by a benevolent government planner to maximize household welfare, treating the capital requirement as the sole policy instrument. In this model the Ramsey problem is solved nonlinearly along the perfect-foresight transition path. The planner internalizes that tighter requirements simultaneously reduce bank failure probability and shrink deposit supply; the optimum trades off these two objectives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sustainability-linked capital requirement:&lt;/strong&gt; A capital requirement on fossil loans that explicitly depends on the abatement effort undertaken by fossil firms (κ_f = κ̃ − η_t), creating a direct financing-cost incentive for emission reduction. This contrasts with a plain fossil penalizing factor, which affects only the financing cost of fossil capital without altering abatement incentives. The paper shows that even sustainability-linked requirements are quantitatively negligible as climate policy relative to carbon taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Carbon compliance cost per unit of fossil production (ξ_t):&lt;/strong&gt; A summary statistic combining the direct carbon tax payment and the abatement cost at the optimal abatement effort. It measures the total policy-induced wedge that reduces the profitability of fossil capital and raises fossil firms&amp;rsquo; break-even default threshold. In the transition experiment, compliance costs rise from zero to approximately 4% of fossil production value as the tax increases from 0 to 10 $/ToC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asset stranding channel:&lt;/strong&gt; The mechanism through which an unanticipated tightening of carbon policy raises fossil firms&amp;rsquo; default probability on impact (by increasing compliance costs above the level priced into existing loan contracts) and subsequently reduces their loan demand permanently. The paper contrasts its treatment of this channel — where stranding affects bank regulation through aggregate deposit supply effects — against models (such as Carattini, Melkadze, and Heutel 2023) where stranding causes an inefficient credit crunch via a financial accelerator.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deposit spread (s^D_t):&lt;/strong&gt; Defined as the annualized difference between the deposit rate and the risk-free rate, expressed in basis points. Because households value deposits for liquidity services, the deposit rate lies permanently below the risk-free rate (spread is negative). In the baseline calibration the target is -100 bps. The spread widens (becomes less negative) when deposits become scarcer, which is the case along the carbon tax transition as bank balance sheets contract.&lt;/p&gt;</description></item><item><title>Closing Gender Gaps Through Workplace Diversity: The Intergenerational Effects of World War I</title><link>https://macropaperwarehouse.com/papers/closing-gender-gaps-through-workplace-diversity-the-intergenerational-effects-of-world-war-i/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/closing-gender-gaps-through-workplace-diversity-the-intergenerational-effects-of-world-war-i/</guid><description>&lt;p&gt;This paper asks whether exposure to greater female representation in the workplace can persistently reduce intergenerational gender gaps in labor market outcomes. The authors exploit the sudden, city-by-department variation in female employment within the U.S. federal government triggered by World War I mobilization. Using the Official Registers of the United States — biennial personnel rosters covering the near-universe of federal employees from 1913 to 1921 — linked to full-count decennial censuses (1900–1940), they construct a granular measure of each office&amp;rsquo;s (city × department) change in female share between 1915 and 1919, then trace labor force outcomes for the children of incumbent civil servants in the 1940 Census.&lt;/p&gt;
&lt;p&gt;WWI caused the female share of the federal civilian workforce to jump by 13 percentage points — a doubling within two years (1917–1919). These wartime female entrants were younger, more likely to be single, more educated, more geographically mobile, and less likely to have been previously employed than their male counterparts, suggesting the war mobilized a previously untapped labor pool. The increase was driven almost entirely by clerical positions: the female share of the federal clerical workforce rose from roughly 30% to nearly 70% within two years.&lt;/p&gt;
&lt;p&gt;The main finding is that a one standard deviation (SD) increase in parental exposure to female co-workers reduces the gender gap in labor force participation (LFP) among children of incumbent civil servants by 4.1–4.6 percentage points in the within-city, within-department specification — a decline in the mean gender LFP gap of approximately 8.6–9.6% by 1940. This effect is entirely driven by a higher propensity of daughters to work; sons&amp;rsquo; LFP is unaffected. The intergenerational effect operates primarily through exposed fathers, including fathers without working wives, identifying a channel beyond the mother-to-daughter vertical transmission emphasized in prior literature. Children who were teenagers at the time of parental exposure show the largest effects, consistent with formative-years malleability. A placebo test using civil servants who left the same offices before the wartime shock shows no comparable effect, ruling out time-invariant office-level selection.&lt;/p&gt;
&lt;p&gt;Parental exposure extends beyond the public sector: the private sector LFP effect is comparable in magnitude to the public sector effect. The gender earnings gap among children of exposed civil servants narrows by 12%, driven by daughters moving into higher-paying, previously male-dominated positions rather than by differences in hours or weeks worked. Marriage, fertility, and schooling differences only partially mediate the LFP effect, with a residual exposure effect remaining after controlling for these proximate determinants.&lt;/p&gt;
&lt;p&gt;At the aggregate level, a 1 SD increase in city-level exposure to female federal workers raises overall female LFP by 0.9–1.0 percentage points, with no effect on male LFP, and the effect persists through 1940. A back-of-envelope calculation implies each additional female wartime civil service entrant generated approximately 2.4 additional women entering the workforce — a multiplier effect. Neighborhood-level analysis shows LFP gains are concentrated in enumeration districts where wartime female civil servants resided, and cities with greater female federal employment exposure also saw faster women&amp;rsquo;s club membership growth after WWI.&lt;/p&gt;
&lt;p&gt;The scope conditions are important: the sample covers 70 cities and 8 federal departments with meaningful pre-war staffing; children must have been born by 1917; and the 1940 outcomes reflect adulthood labor decisions in a labor market shaped by subsequent decades of change. The design relies on within-city and within-department residual variation in female share change being conditionally exogenous, supported by lack of correlation with pre-war office characteristics.&lt;/p&gt;
&lt;p&gt;Q: What was the scale of the WWI shock to female federal employment?
A: The U.S. entry into WWI in April 1917 triggered a near-doubling of total federal civilian employment from roughly 150,000 to over 300,000 workers by 1919. Within this expansion, the share of female civil servants increased by 13 percentage points — a doubling of the female share within two years. The increase was driven almost entirely by clerical positions, where the female share rose from around 30% to nearly 70%.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure parental exposure to female co-workers?
A: Exposure is measured as the change in the share of female civil servants at the city-by-department (&amp;ldquo;office&amp;rdquo;) level between 1915 and 1919. The sample is restricted to offices with at least 20 civil servants in 1915 and cities with at least two federal departments, yielding 70 cities and 8 departments. The interquartile range of exposure across offices is approximately 10 percentage points, and cross-city and cross-department variation explains 58% of the overall variation, leaving substantial residual office-level variation for identification.&lt;/p&gt;
&lt;p&gt;Q: What is the main intergenerational finding and its magnitude?
A: A 1 SD increase in parental exposure to female co-workers increases the relative likelihood that a daughter works (compared to a son) by 2 percentage points in the baseline specification, and by 4.1–4.6 percentage points in the preferred within-city and within-department specification. Since daughters of civil servants are on average 48 percentage points less likely than sons to be in the labor force in 1940, this corresponds to closing the mean gender LFP gap by approximately 8.6–9.6%.&lt;/p&gt;
&lt;p&gt;Q: Does the effect operate through daughters or sons?
A: The effect is entirely driven by daughters. Parental exposure to female co-workers has no statistically discernible impact on the labor force participation of sons. The decline in the gender LFP gap is thus attributable to a higher propensity of daughters of exposed civil servants to work.&lt;/p&gt;
&lt;p&gt;Q: What is the key placebo test, and what does it show?
A: The authors exploit high-frequency personnel records to identify civil servants who selected into the same offices that would later be exposed but who left before the wartime shock occurred. These pre-departure leavers show no intergenerational exposure effects on their children&amp;rsquo;s LFP, ruling out the interpretation that time-invariant selection into particular offices drives the results.&lt;/p&gt;
&lt;p&gt;Q: Which parent serves as the primary channel of transmission?
A: Exposed fathers are the primary conduit. The effect for daughters is precise and sizable even when restricting the sample to fathers without working wives, suggesting the channel does not depend on children observing maternal employment. While the estimated effect through mothers is positive, it is imprecise — likely due to the small sample of female incumbent civil servants. This identifies fathers as a new channel of vertical intergenerational norm transmission, beyond the mother-to-daughter pathway emphasized in prior literature.&lt;/p&gt;
&lt;p&gt;Q: How does children&amp;rsquo;s age at the time of parental exposure moderate the effect?
A: The exposure effects are concentrated among children who were teenagers at the time of parental exposure during WWI. Children who were older and more likely to have already left the household or formed fixed beliefs show little to no detectable effect. This pattern is consistent with the formative-years hypothesis that experiences during adolescence shape lifetime economic behavior.&lt;/p&gt;
&lt;p&gt;Q: Does the intergenerational effect extend beyond the public sector?
A: Yes. The private sector LFP effect for daughters is comparable in magnitude to the public sector effect, with a 1 SD increase in parental exposure having approximately equal effects on LFP within public and private employment. There is also no measurable shift toward clerical occupations specifically, suggesting the channel is a broader change in attitudes toward women working, not transmission of information about specific government or clerical jobs.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on the gender earnings gap?
A: A 1 SD increase in parental exposure to female co-workers closes the gender earnings gap among children of civil servants by 12%. This is not driven by differences in weeks or hours worked, but rather by daughters of exposed parents selecting into higher-paying and previously male-dominated occupations.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address the possibility that the results reflect local labor market conditions rather than parental exposure per se?
A: By 1940, 67% of civil servant children lived in a city different from their parent&amp;rsquo;s WWI-era city. Even among children who moved to the same destination city — and thus face identical labor market conditions — variation in parental exposure at the origin city-by-department remains highly predictive of daughters&amp;rsquo; LFP. Comparing children moving from the same origin city to the same destination city, those with parents in higher-exposure departments still show higher LFP, pointing to cultural transmission rather than local labor market demand.&lt;/p&gt;
&lt;p&gt;Q: What do the marriage and fertility results indicate about mechanisms?
A: Daughters of more exposed civil servants are less likely to be married (a 1 SD increase in parental exposure reduces the relative likelihood of daughters being married by 3.7 percentage points) and tend to have fewer children by 1940. A mediation exercise shows these observable differences in marriage, fertility, and education only partially explain the LFP increase; a statistically significant and economically large residual exposure effect remains, consistent with parental exposure shifting broader gender norms rather than only proximate determinants of labor supply.&lt;/p&gt;
&lt;p&gt;Q: What does the spousal work decision evidence contribute?
A: A 1 SD increase in male civil servants&amp;rsquo; exposure to female co-workers increases the propensity of their subsequent wife to work by 0.5 percentage points after WWI. The effect is driven by marriages formed after the exposure and is not mechanically explained by men marrying their female co-workers. This revealed preference measure supports the interpretation that exposure changed men&amp;rsquo;s attitudes toward women&amp;rsquo;s work.&lt;/p&gt;
&lt;p&gt;Q: What do naming patterns suggest about changing attitudes?
A: Exposed parents are more likely to give daughters names that are less feminine — specifically, names with a lower share of vowels or less likely to end with a vowel — for daughters born after WWI. No comparable effect is observed for sons&amp;rsquo; names. This provides supplementary evidence of a shift in paternal attitudes following workplace exposure to female co-workers.&lt;/p&gt;
&lt;p&gt;Q: What are the aggregate city-level effects on female LFP?
A: In a difference-in-differences design using cross-city variation in female federal worker exposure before and after WWI, a 1 SD increase in city-level exposure raises aggregate female LFP by 0.9–1.0 percentage points, with no effect on male LFP. The effect is persistent through 1940 and city-level exposure is uncorrelated with female LFP prior to WWI. A back-of-envelope calculation implies each additional female wartime entrant generated approximately 2.4 additional women entering the broader workforce — a social multiplier.&lt;/p&gt;
&lt;p&gt;Q: Is there evidence of horizontal (non-family) transmission?
A: Yes. The aggregate LFP gains are concentrated almost entirely in census enumeration districts where female wartime civil servants resided; neighboring districts without female entrants do not see comparable gains. Cities with greater increases in female federal employees also experienced faster growth in women&amp;rsquo;s club memberships, with this pattern appearing only after WWI and coinciding with the rise in female LFP. Both findings are consistent with social learning operating through residential proximity and community networks.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results to potential selection bias from imperfect census linking?
A: The propensity of a civil servant&amp;rsquo;s child to be linked to the 1940 Census is — conditional on city and department fixed effects — uncorrelated with the parental exposure measure. The authors apply inverse probability weighting (IPW) to ensure the matched sample is balanced on baseline characteristics, and results remain virtually identical. Estimates are also stable across different linking strategies individually.&lt;/p&gt;
&lt;p&gt;Q: What instrumental variable strategy is used and what does it find?
A: The authors instrument for office-level female share change using the interaction of the 1915 clerical workforce share and an indicator for war-related departments — a pre-determined source of variation in the capacity and demand for female clerical workers. The IV estimates are consistent with the OLS main specification: parental exposure to female co-workers closes the children&amp;rsquo;s gender LFP gap.&lt;/p&gt;
&lt;p&gt;Q: What is the policy implication regarding public sector hiring?
A: The paper suggests that increasing gender representation within public sector employment can have labor market implications that extend well beyond the organization itself — across generations through vertical intergenerational transmission and across the broader community through horizontal social spillovers. The findings imply that public sector diversity policies can serve as a lever for broader, persistent reductions in gender gaps in the private labor market.&lt;/p&gt;
&lt;p&gt;Office-level exposure: The city-by-department measure of the change in female share of civil servants between 1915 and 1919, capturing the granular intensity of each workplace unit&amp;rsquo;s contact with wartime female entrants; the interquartile range across offices is approximately 10 percentage points.&lt;/p&gt;
&lt;p&gt;Intergenerational gender gap in LFP: The difference in labor force participation rates between daughters and sons of incumbent civil servants measured in 1940 adulthood, used as the primary outcome to capture whether parental workplace exposure transmits to children&amp;rsquo;s labor supply decisions.&lt;/p&gt;
&lt;p&gt;Vertical transmission: The intergenerational channel through which exposed parents — identified here primarily as fathers, including those without working wives — convey changed attitudes or information about female work to their children, closing the gender LFP gap.&lt;/p&gt;
&lt;p&gt;Horizontal transmission: The community-level channel through which the increased presence of female civil servants in a city spreads changed norms or information about women&amp;rsquo;s work to women who are not daughters of exposed co-workers, operating through residential proximity and social networks such as women&amp;rsquo;s clubs.&lt;/p&gt;
&lt;p&gt;Social multiplier: The amplification of the direct effect of hiring female workers through behavioral spillovers; the authors&amp;rsquo; back-of-envelope calculation estimates that each additional female wartime civil service entrant generated approximately 2.4 additional women entering the workforce.&lt;/p&gt;
&lt;p&gt;Formative years: The period of adolescence during which children are argued to be most malleable in forming preferences and beliefs; exposure effects in this paper are concentrated among children who were teenagers at the time of parental exposure, with older children showing little effect.&lt;/p&gt;
&lt;p&gt;Source text origin: The authors&amp;rsquo; classification of whether a summary is based on full working paper text (pdf or oa-html) vs. abstract only; in this workflow, abstract-only is a hard block for summary generation.&lt;/p&gt;</description></item><item><title>Coarse Bayesian Updating</title><link>https://macropaperwarehouse.com/papers/coarse-bayesian-updating/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/coarse-bayesian-updating/</guid><description>&lt;p&gt;This paper introduces and axiomatically characterizes Coarse Bayesian updating, a generalization of Bayes&amp;rsquo; rule designed to accommodate the wide empirical evidence that individuals systematically deviate from standard Bayesian belief revision. The research question is: what is the minimal, tractable, axiomatically grounded generalization of Bayes&amp;rsquo; rule that can accommodate heterogeneous non-Bayesian behaviors — including under-reaction, over-reaction, asymmetric updating, limited perception, and motivated reasoning — while remaining portable to standard economic settings?&lt;/p&gt;
&lt;p&gt;The paper takes as primitive a finite state space Omega = {1, &amp;hellip;, N} and an updating rule mu: S -&amp;gt; Delta assigning posterior beliefs to signals, where signals represent likelihood profiles from stochastic information structures. No data are used; the methodology is axiomatic decision theory combined with analysis of the model&amp;rsquo;s implications in static, dynamic, and decision-theoretic settings.&lt;/p&gt;
&lt;p&gt;A Coarse Bayesian agent is characterized by (i) a partition of the probability simplex Delta into convex cells, and (ii) a representative distribution for each cell, one of which is the prior. Upon observing a signal, the agent determines which cell contains the Bayesian posterior and adopts the representative of that cell as his posterior belief. The agent need not point-identify the Bayesian posterior; he merely approximates it by identifying which cell it belongs to.&lt;/p&gt;
&lt;p&gt;The central characterization result (Theorem 1) establishes that an updating rule has a Coarse Bayesian representation if and only if it satisfies three axioms: Homogeneity (beliefs depend only on likelihood ratios of the signal, not its scale), Cognizance (if two signals induce the same belief, then a garbled signal indicating one of them was generated also induces that belief), and Confirmation (if a signal is perfect evidence of some feasible belief, the agent adopts that belief). The representation — partition, representative points, and prior — is unique.&lt;/p&gt;
&lt;p&gt;Proposition 1 shows that, under mild regularity conditions, strengthening any of the three axioms to an if-and-only-if form forces the agent to be perfectly Bayesian. This identifies the Coarse Bayesian framework as a qualitatively small but substantively rich departure from Bayes&amp;rsquo; rule. The converse statements identify three necessary non-Bayesian behaviors exhibited by any proper Coarse Bayesian: (i) treating some signals as equivalent when a Bayesian would not; (ii) collapsing to a default belief when uncertain between two signals the agent would otherwise distinguish; (iii) false extrapolation — arriving at a belief via signals that are not perfect evidence of it.&lt;/p&gt;
&lt;p&gt;In dynamic settings, Pooled Coarse Bayesian rules (which apply the full signal history at each period) are invariant to signal ordering and pooling and converge whenever Bayesian beliefs do, though to the representative point of the cell containing the true state rather than the true state itself. Sequential Signal Distortion rules are invariant to signal ordering but not pooling, and beliefs converge almost surely — but not necessarily to the true state (Example 1 illustrates convergence to the wrong state in a two-state setting). Sequential Coarse Bayesian rules need not satisfy either form of path-independence and need not converge at all.&lt;/p&gt;
&lt;p&gt;In the decision-theoretic application (Section 4), a Coarse Bayesian&amp;rsquo;s value of information is posterior-separable and generally violates the Blackwell (1951) information ordering — more informative experiments need not be valued more highly. Two Coarse Bayesians are shown to be identical (same cells and representative points) if and only if they benefit from the same Blackwell improvements, providing a behavioral identification result. Agents with finer partitions are more sophisticated (higher ex-ante value of information), while agents with larger distortions from Bayesian posteriors are more biased (larger worst-case losses relative to a Bayesian). Neither greater sophistication nor lower bias implies being better off at all menus or signal realizations.&lt;/p&gt;
&lt;p&gt;Q: What are the three axioms that characterize Coarse Bayesian updating, and what property of Bayes&amp;rsquo; rule does each capture?
A: Homogeneity requires that beliefs depend only on likelihood ratios of the signal — if two signals are proportional (s ~ t), they induce the same posterior. Cognizance requires that if two signals induce the same belief, then a garbled signal indicating that one of them was generated also induces that belief (mu_{s+t} = mu_s when mu_s = mu_t). Confirmation requires that if a signal is perfect evidence of some feasible belief — i.e., the Bayesian posterior at that signal equals a candidate belief — then the agent adopts that belief. Each axiom is satisfied by standard Bayesian updating.&lt;/p&gt;
&lt;p&gt;Q: In what sense is Coarse Bayesian updating a &amp;ldquo;small&amp;rdquo; departure from Bayes&amp;rsquo; rule?
A: Proposition 1 establishes that strengthening any one of the three axioms to an if-and-only-if form forces the agent to be perfectly Bayesian. The converses are: (i) different likelihood ratios lead to different posteriors; (ii) if a garbled signal does not change beliefs, then the two signals must induce the same belief individually; (iii) if a signal induces the same posterior as another, then it must be perfect evidence of that posterior. Any Coarse Bayesian satisfying any one of these is in fact perfectly Bayesian, meaning the three axioms together come very close to fully characterizing Bayesian rationality.&lt;/p&gt;
&lt;p&gt;Q: What non-Bayesian behaviors does the model generate as special cases?
A: The framework generates under-reaction (representative points of cells close to the prior boundary), over-reaction (representative points at the far boundary), asymmetric updating (favoring one state, making upward revision easier than downward), limited perception (the agent retains the prior unless the Bayesian posterior is sufficiently far from the prior), extreme-belief aversion (the agent applies Bayes&amp;rsquo; rule except when posteriors are near degenerate distributions), and reactions to unexpected news (non-Bayesian behavior only when signals have low prior probability). In each case the Coarse Bayesian Representation provides an axiomatic foundation via Axioms 1–3.&lt;/p&gt;
&lt;p&gt;Q: What are the three necessary non-Bayesian behaviors exhibited by any proper (non-Bayesian) Coarse Bayesian?
A: These follow from the negations of properties (i)-(iii) in Proposition 1. First, there exist signals s and t that are not proportional yet induce the same posterior — the agent treats informationally distinct signals as equivalent. Second, there exist signals s and t such that mu_s ≠ mu_t but mu_{s+t} = mu_s — signals the agent distinguishes individually collapse to a default when the agent is uncertain which one was generated. Third, there exist signals s and t with mu_s = mu_t where t is not perfect evidence of mu_s — a form of false extrapolation. Together, these three biases account for all non-Bayesian behavior the model generates.&lt;/p&gt;
&lt;p&gt;Q: How does the model accommodate globally uniform biases like always-under-reaction, and how common does it predict such behavior to be?
A: Global under-reaction requires representative points of cells to sit on their cell boundaries (as close to the prior as possible given the partition). This is a non-generic, hairline case — representative points generically lie in the interior of their cells, so a typical Coarse Bayesian under-reacts to some signals and over-reacts to others depending on which cell the Bayesian posterior falls into. The model additionally predicts local stability: if an agent over-reacts to signal s, nearby signals typically produce the same response; if an agent is Bayesian at s, nearby signals are almost surely also Bayesian.&lt;/p&gt;
&lt;p&gt;Q: What does the model imply about dynamic updating under sequential signal-by-signal processing versus pooled processing?
A: Pooled Coarse Bayesian rules apply the full signal history at each period, are invariant to both signal ordering and signal pooling, and converge almost surely whenever Bayesian beliefs converge — but to the representative point of the cell containing the true state, not necessarily the true state itself. Sequential Signal Distortion rules are invariant to signal ordering but not signal pooling, and also yield almost-sure convergence though potentially to the wrong state (Example 1 shows this for a two-state setting). Sequential Coarse Bayesian rules need not be invariant to either form of path-dependence and need not converge at all.&lt;/p&gt;
&lt;p&gt;Q: How does the paper provide a behavioral identification of the model&amp;rsquo;s parameters?
A: Theorem 1 establishes that the partition, representative points, and prior are uniquely determined by the agent&amp;rsquo;s updating rule alone — they are identifiable from observable updating behavior without additional assumptions. In the decision-theoretic setting of Section 4, a stronger result holds: two Coarse Bayesians are identical (same cells and same representative points) if and only if they benefit from the same Blackwell improvements across all menus (decision problems). This means the model&amp;rsquo;s parameters can be uniquely identified from menu-contingent rankings of Blackwell-comparable experiments.&lt;/p&gt;
&lt;p&gt;Q: Does the Coarse Bayesian framework respect the Blackwell information ordering, and what characterizes when Blackwell improvements are beneficial?
A: Unlike Bayesians, Coarse Bayesians typically violate the Blackwell ordering — they need not assign higher ex-ante value to more informative experiments. The paper characterizes the menus (decision problems) for which a given Coarse Bayesian benefits from Blackwell improvements, and shows this characterization runs deep: the complete set of such menus fully identifies the agent&amp;rsquo;s representation.&lt;/p&gt;
&lt;p&gt;Q: How do the sophistication and bias orderings relate to welfare?
A: An agent is more sophisticated if he employs a finer partition; more-sophisticated agents have a higher ex-ante value of information. An agent is more biased if his updating rule exhibits larger distortions from Bayesian posteriors; greater bias is characterized by greater worst-case losses relative to a Bayesian. Crucially, neither greater sophistication nor lower bias implies the agent is better off at all menus or signal realizations — welfare improvements require the agent to be perfectly Bayesian on a strictly larger set of signal realizations, giving rise to a third ordering that jointly refines the other two.&lt;/p&gt;
&lt;p&gt;Q: How does the model relate to Wilson (2014) and Ortoleva (2012)?
A: Wilson (2014) studies optimal updating for a boundedly rational agent with K memory states over binary decisions: each memory state is associated with a convex set of posteriors and a representative, so the optimal protocol is a dynamic Coarse Bayesian updating procedure. However, Wilson&amp;rsquo;s parameters are endogenous (determined by signal structure, stakes, and the bound K), whereas Coarse Bayesian updating does not require optimality or a bound on the number of cells — the model can accommodate behavior (e.g., Bayesian updating except at &amp;ldquo;extreme&amp;rdquo; signals) that Wilson&amp;rsquo;s model cannot. Ortoleva&amp;rsquo;s (2012) Hypothesis Testing model applies Bayes&amp;rsquo; rule when the prior probability of a signal exceeds a threshold epsilon and otherwise uses a maximum-likelihood criterion; Coarse Bayesian updating can accommodate similar behavior, and the paper shows that Coarse Bayesian rules can be expressed as Maximum-Likelihood rules when there are only two states, but neither class subsumes the other in general — Maximum-Likelihood rules may violate Confirmation.&lt;/p&gt;
&lt;p&gt;Q: What are the main limitations of the Coarse Bayesian framework?
A: The paper identifies four. First, only likelihood ratios of the realized signal matter — sensitivity to framing and extraneous environmental features are ruled out. Second, beliefs must be probability distributions, so phenomena like the conjunction fallacy (where subjects assign higher probability to a conjunction than a component event) are outside the model&amp;rsquo;s scope. Third, the model exhibits discontinuities when signal perturbations move the Bayesian posterior across a cell boundary — a feature shared with Wilson (2014), Ortoleva (2012), and related models. Fourth, cells must be convex (driven by Cognizance); dropping Cognizance allows non-convex cells but removes the normative foundation that agents correctly forecast their own updating behavior.&lt;/p&gt;
&lt;p&gt;Coarse Bayesian Representation: A pair consisting of a partition P of the probability simplex Delta into convex cells and a profile of representative distributions (one per cell, including the prior), such that the agent&amp;rsquo;s posterior after observing signal s equals the representative of the cell containing the Bayesian posterior B(mu_e|s).&lt;/p&gt;
&lt;p&gt;Homogeneity: The axiom that if two signals are proportional (s ~ t, meaning s = lambda*t for some lambda &amp;gt; 0), they induce the same posterior belief — updating depends only on likelihood ratios, not signal scale.&lt;/p&gt;
&lt;p&gt;Cognizance: The axiom that if signals s and t induce the same posterior, then the garbled signal s+t (indicating that either s or t was generated) also induces that belief — the agent correctly forecasts his own updating behavior.&lt;/p&gt;
&lt;p&gt;Confirmation: The axiom that if a signal constitutes perfect evidence of some feasible belief (i.e., the Bayesian posterior equals a candidate belief), the agent adopts that belief — candidate beliefs are adopted when the signal confirms them exactly.&lt;/p&gt;
&lt;p&gt;Signal Distortion Representation: An equivalent representation of Coarse Bayesian behavior as a function d: S -&amp;gt; S that distorts signals before Bayesian updating is applied (mu_s = B(mu_e|d(s))), satisfying properties analogous to the three axioms; equivalent to the partition representation in static settings but distinct in dynamic settings.&lt;/p&gt;
&lt;p&gt;Blackwell Information Ordering: The partial order on experiments under which sigma is more informative than sigma&amp;rsquo; if sigma can be obtained from sigma&amp;rsquo; by a garbling; Bayesians always weakly prefer more informative experiments in this ordering, but Coarse Bayesians typically do not.&lt;/p&gt;
&lt;p&gt;Sophistication Ordering: The partial order under which one Coarse Bayesian is more sophisticated than another if he employs a finer partition; more-sophisticated agents exhibit greater responsiveness to information as measured by ex-ante value of information.&lt;/p&gt;
&lt;p&gt;Bias Ordering: The partial order under which one Coarse Bayesian is more biased than another if his updating rule exhibits larger distortions away from Bayesian posteriors; greater bias is characterized by larger worst-case losses relative to a Bayesian benchmark.&lt;/p&gt;</description></item><item><title>Collusion with Optimal Information Disclosure</title><link>https://macropaperwarehouse.com/papers/collusion-with-optimal-information-disclosure/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/collusion-with-optimal-information-disclosure/</guid><description>&lt;p&gt;This paper asks how a third-party intermediary (an &amp;ldquo;algorithm&amp;rdquo;) that observes market demand or costs superior to competing firms should optimally disclose that information to maximize the firms&amp;rsquo; collusive profit in a repeated Bertrand competition setting. The motivation is the rise of algorithmic pricing intermediaries such as RealPage in apartment rentals, A2i Systems in retail gasoline, and Rainmaker in hotel rooms, as well as offline cartel facilitators like AC-Treuhand.&lt;/p&gt;
&lt;p&gt;The model extends the canonical Rotemberg–Saloner (1986) repeated Bertrand framework with stochastic demand. The key technical assumption is that firm profit is affine in the unknown state s, so expected profit depends only on the expected state. This holds for binary states, linear demand with unknown intercept (D(p,s) = s − p), and linear demand with unknown per-unit cost. The algorithm observes s and commits to a known disclosure policy mapping s to a public signal. The solution concept is pure-strategy subgame-perfect equilibrium, and the paper solves for the disclosure policy and equilibrium that jointly maximize collusive profit.&lt;/p&gt;
&lt;p&gt;The main result (Theorem 1) is that the unique optimal disclosure policy is upper censorship: there is a cutoff ŝ such that demand states s &amp;lt; ŝ are disclosed and result in the corresponding monopoly price p^m(s), while demand states s ≥ ŝ are pooled — only the event {s ≥ ŝ} is disclosed — and result in the monopoly price for the mean concealed state, p^m(s*), where s* = E[s | s ≥ ŝ]. The reduction to a static information design problem (Lemma 1) is the key technical step: optimal collusive profit equals V*, the greatest fixed point of V = max_{G ∈ MPC(F)} E_G[min{π^m(s), δV/((1−δ)(n−1))}]. The &amp;ldquo;capped monopoly profit&amp;rdquo; min{π^m(s), π^max} is convex-then-concave in s, and classical results from the static information design literature (Kolotilin 2018; Dworczak and Martini 2019) then imply upper censorship is uniquely optimal.&lt;/p&gt;
&lt;p&gt;Two features of the optimal equilibrium are notable. First, prices are rigid (constant at p^m(s*)) whenever s ≥ ŝ — the opposite of Rotemberg–Saloner&amp;rsquo;s &amp;ldquo;price wars during booms.&amp;rdquo; The logic is that pooling high demand states with a lower average state is more profitable than cutting prices, because pooling reduces the current-period deviation gain without sacrificing as much on-path profit. Second, for demand states s ∈ (ŝ, s*), the equilibrium price p^m(s*) exceeds the monopoly price p^m(s) — supra-monopoly pricing occurs for a range of intermediate states. Monopoly pricing is attainable at each such state in isolation, but recommending the higher price p^m(s*) is necessary to make the pooling incentive-compatible at states s &amp;gt; s*.&lt;/p&gt;
&lt;p&gt;Comparing to full disclosure, Proposition 1 shows that optimal disclosure leads to strictly higher prices at every demand state, and hence unambiguously lower consumer surplus. Proposition 3 shows that improving the algorithm&amp;rsquo;s accuracy (a mean-preserving spread of F) reduces expected consumer surplus whenever consumer surplus under monopoly pricing is concave in s — a natural condition. This result is more pessimistic than prior work (Sugaya–Wolitzky 2018; Miklos-Thal–Tucker 2019), which found ambiguous effects because those papers assumed full disclosure.&lt;/p&gt;
&lt;p&gt;Comparative statics (Proposition 2): fewer firms or a higher discount factor δ increases collusive profit V* and makes prices more flexible (raises ŝ). Collusion is impossible if and only if δ &amp;lt; (n−1)/n, the same threshold as under full disclosure.&lt;/p&gt;
&lt;p&gt;Extensions maintain the core results. With Markov (persistent) demand (Section 4 / Theorem 2), upper censorship remains optimal but the cutoff ŝ(s) depends on last-period demand s: under positive serial correlation, ŝ(s) is decreasing in s, so the algorithm discloses less information following high demand. With differentiated products under a symmetric linear demand system (Section 5 / Theorem 3), the optimal policy censors an intermediate interval [ŝ_L, ŝ_H] and discloses both the lowest and highest demand states, because at high states the absence of an upper bound on equilibrium profit makes disclosure with price-cutting optimal.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question and why is it policy-relevant?
A: The paper asks how an informed intermediary should optimally disclose demand or cost information to competing firms to maximize their collusive profit. It is directly motivated by antitrust cases against RealPage (sued by the US DOJ in August 2024), A2i Systems/Kalibrate, and Rainmaker, all of which gather market data from competing firms and recommend prices. The theory also applies to offline facilitators like AC-Treuhand, prosecuted by the European Commission for disclosing competitively sensitive information.&lt;/p&gt;
&lt;p&gt;Q: What is the affinity assumption and why does it matter?
A: The paper assumes that firm profit π(p, s) is affine (linearly increasing) in the demand or cost state s for each price p. This implies that expected profit for any distribution over states equals profit evaluated at the expected state: E[π(p,s)] = π(p, E[s]). As a consequence, any disclosure policy is equivalent, from a profit standpoint, to choosing a distribution G of the firms&amp;rsquo; posterior mean beliefs over s, and G must be a mean-preserving contraction of the prior F (by Blackwell 1953). The assumption is satisfied for binary states, linear demand with unknown intercept, and linear demand with unknown cost.&lt;/p&gt;
&lt;p&gt;Q: What is the key reduction result (Lemma 1) and what does it achieve?
A: Lemma 1 reduces the problem of finding an optimal repeated-game equilibrium to a static information design problem. Optimal collusive profit equals V*, the greatest fixed point of V = max_{G ∈ MPC(F)} E_G[min{π^m(s), δV/((1−δ)(n−1))}], and this is attained by a symmetric, stationary, grim-trigger equilibrium. The reduction works because, under Bertrand competition, static deviation gains are proportional to on-path payoffs, creating a one-to-one correspondence that allows the repeated-game constraint to be folded into a single-period objective.&lt;/p&gt;
&lt;p&gt;Q: Why is upper censorship the uniquely optimal disclosure policy?
A: The static information design problem has a &amp;ldquo;capped monopoly profit&amp;rdquo; objective: min{π^m(s), π^max}, where π^max = δV*/((1−δ)(n−1)) is the maximum per-period profit that satisfies incentive constraints. Because π^m(s) is convex (as the maximum of affine functions) and the cap π^max is constant, the overall objective is convex for s below the cap and constant (then concave) above it — i.e., convex-then-concave in s. Classical results for linear information design (Kolotilin 2018; Dworczak and Martini 2019) imply that the unique optimal policy for a convex-then-concave objective is upper censorship.&lt;/p&gt;
&lt;p&gt;Q: What is the supra-monopoly pricing result and why does it arise?
A: For demand states s ∈ (ŝ, s*), the equilibrium price is p^m(s*) &amp;gt; p^m(s), meaning firms charge above the monopoly price for the current state. This arises because the pooling policy must recommend a single price for all states s ≥ ŝ, and the recommended price is p^m(s*) where s* = E[s | s ≥ ŝ]. At intermediate states s ∈ (ŝ, s*), this price exceeds the local monopoly price. The algorithm accepts lower profit at these states because it is necessary to maintain the pooled recommendation at higher states where monopoly pricing would otherwise require a price cut.&lt;/p&gt;
&lt;p&gt;Q: How does optimal disclosure compare to full disclosure in terms of consumer surplus?
A: Proposition 1 shows that collusive prices under optimal disclosure are strictly higher at every demand state compared to full disclosure (Rotemberg–Saloner). In Rotemberg–Saloner, high demand states trigger price cuts (&amp;ldquo;price wars during booms&amp;rdquo;) to deter deviation; under optimal disclosure, high states are pooled and prices are instead rigid at p^m(s*). Because prices are higher at all states, consumer surplus is unambiguously lower under optimal disclosure.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 3 say about the effect of algorithmic accuracy on consumer surplus?
A: Proposition 3 states that if consumer surplus under monopoly pricing, CS(s), is concave in s, then a mean-preserving spread of F (i.e., improved algorithmic accuracy) reduces expected consumer surplus. This result is more pessimistic than prior work by Sugaya–Wolitzky (2018) and Miklos-Thal–Tucker (2019), which found ambiguous effects. The difference is that those papers assumed full disclosure, so better accuracy tightened incentive constraints and sometimes forced price cuts. Under optimal selective disclosure, a more accurate algorithm always raises average prices because the algorithm withholds information that would have forced price cuts.&lt;/p&gt;
&lt;p&gt;Q: What are the comparative statics with respect to the number of firms and the discount factor?
A: Proposition 2 establishes that a decrease in the number of firms n or an increase in the discount factor δ increases collusive profit V* and makes collusive prices more flexible (raises ŝ). The intuition for fewer firms making prices more flexible is that with fewer firms, incentive constraints bind for a narrower range of demand states, so less pooling is needed. Collusion is impossible if and only if δ &amp;lt; (n−1)/n, the same threshold as under full disclosure.&lt;/p&gt;
&lt;p&gt;Q: How does the model generate empirically testable predictions distinct from other collusion models?
A: The model predicts: (1) the equilibrium price distribution has support on an interval [p^m(s_bar), p^m(ŝ)] plus a single mass point at the higher price p^m(s*); (2) prices are pro-cyclical overall but rigidly fixed at p^m(s*) for all but the lowest demand states; (3) the gap p^m(s) − p(s) is non-monotone — zero at low states, negative (supra-monopoly) at intermediate states, and positive at high states; (4) prices are more flexible when firms are more patient or fewer. The rigid high price combined with a flexible interval of lower prices is described as a distinctive collusive marker not present in other models.&lt;/p&gt;
&lt;p&gt;Q: How does the model relate to the empirical literature testing Green–Porter versus Rotemberg–Saloner?
A: Rotemberg–Saloner predicts counter-cyclical prices (price wars during booms), while Green–Porter predicts pro-cyclical prices. Empirical tests (e.g., Porter 1983, Ellison 1994) have typically found pro-cyclical prices, favoring Green–Porter. The present model generates pro-cyclical prices through a different mechanism — perfect monitoring plus selectively disclosed demand information — showing that pro-cyclical prices are consistent with perfect monitoring when the information intermediary optimally pools high demand states. The paper suggests that distinguishing the theories requires estimating the gap between price and monopoly price over the cycle: under Green–Porter, collusion succeeds better in high demand states; under this model, collusion succeeds better in low demand states.&lt;/p&gt;
&lt;p&gt;Q: What narrative evidence from the RealPage case corroborates the model&amp;rsquo;s predictions?
A: The US DOJ complaint against RealPage states that &amp;ldquo;in down markets… [RealPage] instills pricing discipline in landlords, curbing normal fully independent competitive reactions by substituting them with interdependent decision-making,&amp;rdquo; and that RealPage advertised that its AI helps clients &amp;ldquo;avoid the race to the bottom in down markets.&amp;rdquo; This is consistent with the model&amp;rsquo;s prediction of flexible monopoly prices at low demand states and a rigid, supra-monopolistic price in normal times. The Kumatori Contractors Cooperative case (studied by Kawai, Nakabayashi, and Ortner 2024) corroborates the censorship result: that organization took drastic steps to limit bidders&amp;rsquo; information about costs on the largest projects — exactly the states where deviation is most tempting.&lt;/p&gt;
&lt;p&gt;Q: How do results change with persistent (Markov) demand?
A: Theorem 2 shows that upper censorship remains uniquely optimal with Markov demand, but the cutoff ŝ(s) now depends on last-period demand s. Under positive serial correlation, ŝ(s) is decreasing in s: the algorithm discloses less information after high demand because firms are more optimistic and thus more tempted to deviate. Under negative serial correlation, ŝ(s) is increasing. The optimal collusive price is no longer always equal to the monopoly price for the disclosed mean demand, and the expected price conditional on last-period demand can be countercyclical (similar to Rotemberg–Saloner), even though the current-period price is always monotone in current demand.&lt;/p&gt;
&lt;p&gt;Q: How does the optimal disclosure policy change with differentiated products?
A: With a symmetric linear demand system (Section 5, Theorem 3), the optimal policy censors an intermediate interval [ŝ_L, ŝ_H] and discloses both the lowest and the highest demand states. At high demand states s &amp;gt; ŝ_H, the algorithm discloses the state and recommends a price below monopoly (to satisfy incentive constraints), because with differentiated goods there is no upper bound on equilibrium profit and profit is convex in s at high states, making disclosure with price-cutting optimal. Mathematically, the capped monopoly profit is piecewise-convex rather than convex-then-concave, so the optimal policy is intermediate-interval censorship rather than upper censorship. The Appendix A version extends to general demand systems and capacity constraints with the same qualitative logic.&lt;/p&gt;
&lt;p&gt;Q: What are the main limitations and directions for future work acknowledged by the authors?
A: The paper identifies three main limitations. First, if profit is not affine in s (i.e., expected profit depends on more than the mean state), the information design problem becomes non-linear and upper censorship is typically suboptimal, though it remains approximately optimal when the problem is close to linear. Second, the model assumes the algorithm&amp;rsquo;s objective is to maximize industry profit; if the intermediary is a profit-maximizing seller of software (as in Harrington 2022), the objective may instead be to maximize the profit differential between adopters and non-adopters. Third, the model assumes all firms use the algorithm; allowing partial adoption would require modeling firms&amp;rsquo; incentives to subscribe. The paper notes that incorporating these considerations &amp;ldquo;could be an interesting direction for future research.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Upper Censorship (disclosure policy): A disclosure policy in which demand states below a cutoff ŝ are revealed to firms (along with the corresponding monopoly price recommendation), while states above ŝ are pooled — only the event {s ≥ ŝ} is disclosed — with a single monopoly price recommendation p^m(s*) for the mean concealed state s* = E[s | s ≥ ŝ]. This is the uniquely optimal disclosure policy in the baseline model.&lt;/p&gt;
&lt;p&gt;Capped Monopoly Profit: The per-period profit objective in the reduced static information design problem: min{π^m(s), π^max}, where π^max = δV*/((1−δ)(n−1)) is the maximum industry profit attainable in a single period without violating incentive constraints. This function is convex-then-concave in s, which drives the optimality of upper censorship.&lt;/p&gt;
&lt;p&gt;Supra-Monopoly Pricing: Equilibrium prices that exceed the monopoly price for the realized demand state. In the model, this occurs for states s ∈ (ŝ, s*), where the algorithm&amp;rsquo;s pooled recommendation p^m(s*) is above the local monopoly price p^m(s). It arises because the pooled recommendation must be incentive-compatible at the highest concealed states.&lt;/p&gt;
&lt;p&gt;Price Rigidity: The feature of the optimal equilibrium in which the collusive price is constant at p^m(s*) for all demand states s ≥ ŝ. The algorithm achieves this by withholding information about high demand states, preventing the &amp;ldquo;price wars during booms&amp;rdquo; predicted by Rotemberg–Saloner (1986) under full disclosure.&lt;/p&gt;
&lt;p&gt;Algorithmic Accuracy: In the paper&amp;rsquo;s terms, the informativeness of the algorithm&amp;rsquo;s signal about s, formalized as the precision of the distribution F. Improving accuracy corresponds to a mean-preserving spread of F (Blackwell 1953). A more accurate algorithm always increases collusive profit; under the concavity condition on consumer surplus, it also reduces expected consumer surplus.&lt;/p&gt;
&lt;p&gt;Mean-Preserving Contraction (MPC(F)): The set of distributions G of firms&amp;rsquo; posterior mean beliefs over s that are consistent with Bayesian updating of the prior F. By Blackwell (1953), a disclosure policy is feasible if and only if it induces a distribution G ∈ MPC(F). This is the feasibility constraint in the static information design problem.&lt;/p&gt;
&lt;p&gt;Affinity in the state: The assumption that π(p, s) is affine (linearly increasing) in s for each price p. This implies E[π(p,s)] = π(p, E[s]), so expected profit is determined entirely by the expected state, enabling the reduction of the disclosure problem to choosing a distribution of posterior means.&lt;/p&gt;</description></item><item><title>Comment on 'Asset Bubbles and Overlapping Generations' by Tirole</title><link>https://macropaperwarehouse.com/papers/comment-on-asset-bubbles-and-overlapping-generations-by-tirole/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-asset-bubbles-and-overlapping-generations-by-tirole/</guid><description>&lt;p&gt;Tirole (1985) studied an overlapping generations model with capital accumulation and showed that the emergence of asset bubbles can resolve the capital over-accumulation problem when the economy is dynamically inefficient. His Proposition 1(c) claims that a bubble can emerge if and only if the dividend growth rate exceeds the bubbleless steady-state interest rate. This comment identifies an error in that proposition: the stated condition is necessary but not sufficient for bubble existence. The paper constructs an explicit counterexample in which the dividend growth rate exceeds the bubbleless interest rate but no bubble equilibrium exists, and separately constructs a case in which a bubble exists even when the condition in Proposition 1(c) fails. Corrected necessary and sufficient conditions for bubble existence are derived, and the implications of the correction for the welfare results and the relationship between dynamic inefficiency and bubbles are characterized.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-error-in-tiroles-proposition-1c"&gt;Q1. What is the error in Tirole&amp;rsquo;s Proposition 1(c)?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The error is in the sufficiency direction: Tirole argued that whenever the dividend growth rate exceeds the bubbleless interest rate, a bubble equilibrium exists; Pham and Toda construct a parameter configuration satisfying this condition where no bubble equilibrium exists, because the continuity argument used in Tirole&amp;rsquo;s proof fails at boundary parameter values.&lt;/strong&gt; The necessity direction — that bubble existence requires this rate comparison — is not challenged.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-corrected-conditions-change-the-interpretation-of-dynamic-inefficiency"&gt;Q2. How do the corrected conditions change the interpretation of dynamic inefficiency?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Tirole&amp;rsquo;s original result linked bubbles tightly to dynamic inefficiency (r &amp;lt; g), providing a clean condition for when bubbles are both feasible and welfare-improving by absorbing excess saving. The correction weakens this link: bubble existence requires additional structural conditions beyond the rate comparison, meaning dynamic inefficiency is a necessary but not sufficient condition for bubbles in the Tirole framework.&lt;/strong&gt; Policy prescriptions based on the r &amp;lt; g condition for bubble welfare analysis need qualification.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;dynamic inefficiency&lt;/strong&gt; : the OLG condition in which the interest rate falls below the growth rate, making intergenerational transfers from young to old welfare-improving; related to but not sufficient for bubble existence under the corrected Tirole conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;bubble existence condition&lt;/strong&gt; : the necessary and sufficient conditions under which an asset bubble can emerge and persist in the Tirole OLG model; the corrected version requires more than the dividend-growth-rate-exceeds-interest-rate comparison of the original Proposition 1(c).&lt;/p&gt;</description></item><item><title>Comment on "Artificial Intelligence and Technological Unemployment" by Wang and Wong</title><link>https://macropaperwarehouse.com/papers/comment-on-artificial-intelligence-and-technological-unemployment-by-wang-and-wong/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-artificial-intelligence-and-technological-unemployment-by-wang-and-wong/</guid><description>&lt;p&gt;This comment, written by J. Carter Braxton (University of Wisconsin), discusses the paper &amp;ldquo;Artificial Intelligence and Technological Unemployment&amp;rdquo; by Wang and Wong (2025), which develops and quantifies an equilibrium labor search model to evaluate the employment effects of spreading AI. Wang and Wong&amp;rsquo;s central finding is that improvements in AI quality will increase productivity by a factor of three while reducing employment by 23%, with approximately half of the employment decline occurring within the next five years. Braxton&amp;rsquo;s comment serves two purposes: first, to clarify the model&amp;rsquo;s structural channels through which AI affects employment; and second, to bring empirical evidence from the spread of computers in the 1980s–2000s to bear on the relative magnitude of those channels.&lt;/p&gt;
&lt;p&gt;Braxton identifies two competing forces within Wang and Wong&amp;rsquo;s framework. The &lt;strong&gt;job destruction channel&lt;/strong&gt; arises from endogenous separations: as AI quality improves, firms increasingly replace matched workers with AI, raising outflows from employment. The &lt;strong&gt;job creation channel&lt;/strong&gt; arises from the free-entry condition: rising AI quality increases firm profits on all matches, inducing firms to post more vacancies, which raises workers&amp;rsquo; job-finding rates and employment inflows. Whether aggregate employment rises or falls depends on which channel dominates — a quantitative question the authors resolve through calibration, finding the job destruction channel dominant. Braxton notes that three modeling choices (learning-by-using, the requirement that firms must be matched with a worker to adopt AI, and disembodied technological change) each push &lt;em&gt;against&lt;/em&gt; the job-destruction result, making the authors&amp;rsquo; findings more striking.&lt;/p&gt;
&lt;p&gt;Braxton then evaluates the relative strength of these channels using the historical spread of personal computers. Drawing on Bick, Blandin, and Deming (2024), he notes that workplace AI adoption in 2024 follows nearly the same time trend and income-distribution profile as computer adoption in 1984, making computers a plausible historical analog. Using the CPS Computer Supplement (1984–2003), Braxton measures the change in computer usage by occupation and regresses it against the change in employment-to-unemployment (EU) transition rates by occupation. The estimated coefficient is 0.0146 (robust SE 0.0064), indicating that occupations with higher computer adoption rates saw higher flows into unemployment — confirming that a job destruction channel was active during the computer era. However, regressing the change in log occupation-level employment (1980–2000 Census) on the change in computer usage yields a coefficient of 0.7761 (robust SE 0.2658), with a positive slope indicating that occupations more exposed to computers saw &lt;em&gt;higher&lt;/em&gt; employment growth. For the computer episode, therefore, the job creation channel dominated the job destruction channel — the opposite of Wang and Wong&amp;rsquo;s AI projection.&lt;/p&gt;
&lt;p&gt;Braxton also cites his own prior work showing that even when job creation and destruction balance in aggregate, workers displaced by technological change face lasting earnings losses and elevated permanent income risk, raising the question of how to optimally insure these workers.&lt;/p&gt;
&lt;p&gt;The comment concludes by identifying avenues for future research: introducing occupational heterogeneity (with some occupations more exposed to AI than others) and worker heterogeneity (skills that are complements versus substitutes to AI). The central open question is whether AI is qualitatively different from prior episodes of technological change, and if so, why.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two central channels through which AI quality affects employment in Wang and Wong&amp;rsquo;s model, and how do they operate?&lt;/strong&gt;
The job destruction channel operates through endogenous separations: as AI quality (At) improves, firms that are matched with workers are more likely to replace them with AI at rate ρ, adding the term ρµAt Ht It to outflows from employment in the law of motion for employment. The job creation channel operates through the free-entry condition: higher AI quality raises firm profits on all existing matches (because technological change is disembodied, benefiting matches formed today with future AI gains), inducing firms to post more vacancies, which via free entry reduces the firm&amp;rsquo;s matching probability but raises the worker&amp;rsquo;s job-finding rate αt and thereby increases employment inflows. The net employment effect depends on which channel quantitatively dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is Wang and Wong&amp;rsquo;s quantitative finding about the aggregate employment and productivity effects of AI?&lt;/strong&gt;
Using a calibrated equilibrium labor search model, Wang and Wong find that the spread of AI will increase productivity by a factor of three while reducing employment by 23%. Approximately half of the employment decline is projected to occur within the next five years. A version of the model holding job-finding rates fixed yields a similar result, indicating that through the lens of their model the job creation channel is quantitatively small and the job destruction channel dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What three modeling choices push against Wang and Wong&amp;rsquo;s job-destruction result, and why does Braxton view this as making the finding more striking?&lt;/strong&gt;
First, AI improves through &amp;ldquo;learning by using&amp;rdquo; — it learns from all output being produced — which creates an incentive for employment to remain elevated to accelerate AI learning, dampening job destruction. Second, firms can only adopt AI if currently matched with a worker, which creates an incentive for vacancy posting and pushes in favor of job creation. Third, AI improvements are disembodied (raising productivity in all matches, including those formed before the improvement), which increases the value of forming new matches today and strengthens job creation. Because each of these assumptions pushes against the job destruction result, Braxton argues that finding job destruction dominant despite these model features makes the result more striking.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does Braxton use the historical spread of computers to assess the job destruction and job creation channels?&lt;/strong&gt;
Braxton measures occupation-level computer adoption as the change in the share of CPS Computer Supplement respondents who reported using a computer at work between 1984 and 2003 (denoted ΔCPUo,84–03), using occupation codes from Autor and Dorn (2013). He then regresses the occupation-level change in EU transition rates (ΔEUo,84–03, from monthly CPS micro data) on ΔCPUo,84–03 to measure the job destruction channel, and separately regresses the change in log occupation-level employment (Δlog Eo,80–00, from the 1980 and 2000 Census IPUMS) on ΔCPUo,84–03 to assess the net employment effect. A positive coefficient on the employment regression indicates job creation dominates; a negative coefficient indicates job destruction dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What do the regression results show about the job destruction and job creation channels during the computer era?&lt;/strong&gt;
The job destruction regression yields a coefficient of β = 0.0146 (robust SE = 0.0064, R² = 0.0178), indicating that occupations with higher computer adoption rates did see higher employment-to-unemployment transition rates — the job destruction channel was present. However, the employment-level regression yields a coefficient of β = 0.7761 (robust SE = 0.2658, R² = 0.0348), with a positive slope indicating that occupations more exposed to computers experienced &lt;em&gt;higher&lt;/em&gt; employment growth between 1980 and 2000. Thus, for the computer episode, the job creation channel dominated the job destruction channel — the opposite of what Wang and Wong project for AI.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the basis for treating the computer episode as a relevant analog to the spread of AI?&lt;/strong&gt;
Braxton cites Bick, Blandin, and Deming (2024), who show that AI adoption in the workplace in 2024 is following nearly the same aggregate time trend as the spread of personal computers in the early 1980s. Moreover, the distribution of AI usage across the income distribution in 2024 is nearly identical to computer usage across the income distribution in 1984: for both technologies, workplace usage peaks between the 80th and 90th percentiles of the income distribution before declining modestly at the top. Bick et al. (2024) also show the similarities hold by education level and age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Even if job creation and destruction balance in aggregate, what does prior work suggest about the distributional consequences for workers?&lt;/strong&gt;
Braxton and Taska (2023) show that workers in occupations more exposed to technological change (measured by changes in computer and software task requirements) suffered larger earnings losses following displacement. Braxton, Herkenhoff, Rothbaum, and Schmidt (2024, forthcoming AER) show that workers in occupations more exposed to technological change experienced larger increases in permanent income risk between the 1980s and 2010s. These findings imply that even if AI does not reduce aggregate employment, workers who are displaced will face deteriorating labor market prospects, raising the question of how to optimally provide insurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What policy implication does Braxton draw from the distributional consequences of technological change?&lt;/strong&gt;
Braxton and Taska (2025, forthcoming Review of Economic Dynamics) show that technological change expands the motive for governments to provide retraining subsidies. Braxton argues that if AI represents an acceleration of technological change, even larger retraining subsidies — and potentially other forms of insurance — may be needed for displaced workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the main avenues for future research identified in the comment?&lt;/strong&gt;
Braxton identifies two principal directions. First, introducing occupational heterogeneity into the Wang-Wong framework, so that some occupations are more exposed to AI displacement than others, would allow the model to generate richer distributional implications. Second, allowing worker heterogeneity in skills — distinguishing skill dimensions that are complements to AI from those that are substitutes — would permit the model to capture differential effects across the workforce. The overarching research question is whether AI is qualitatively different from prior technological change episodes, and if so, to identify the precise mechanisms that make it different.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job destruction channel&lt;/strong&gt;: In the Wang-Wong model, the increase in endogenous separations driven by firms replacing matched workers with AI as AI quality improves. Formally, this is the term ρµAt Ht It in the law of motion for employment, representing separations that occur when a firm adopts AI and the worker and firm cannot renegotiate a mutually acceptable wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job creation channel&lt;/strong&gt;: The increase in vacancy posting and worker job-finding rates induced by rising AI quality. Because higher AI quality raises firm profits on all matches (via disembodied technological change), the free-entry condition implies firms post more vacancies, lowering the firm&amp;rsquo;s matching probability but raising the worker&amp;rsquo;s job-finding rate αt, increasing employment inflows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Free-entry condition&lt;/strong&gt;: The equilibrium condition equating the cost of posting a vacancy (κt) to the expected benefit (the probability of matching ft times the firm&amp;rsquo;s match surplus Πt). This condition pins down the job-finding rate for workers: when firms find it more profitable to post vacancies, αt rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Disembodied technological change&lt;/strong&gt;: The modeling assumption that AI quality improvements raise productivity in all existing matches, not just those formed after the improvement. This means future AI gains benefit matches formed today, increasing the incentive to create new matches and pushing in favor of the job creation channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Learning by using&lt;/strong&gt;: The mechanism in Wang-Wong whereby AI quality (At) improves as a function of current aggregate employment (Ht) and the learning rate µ. Because AI learns from all output being produced, maintaining higher employment accelerates AI improvement, creating a motive that partially offsets the job destruction channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employment-to-unemployment (EU) transition rate&lt;/strong&gt;: The rate at which employed workers flow into unemployment in a given occupation, used by Braxton as the empirical measure of the job destruction channel during the computer episode. Measured from monthly CPS micro data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capitalization effect&lt;/strong&gt;: The tendency for firms to post more vacancies today in anticipation of future productivity improvements, because the cost of posting is paid upfront while the benefits of a future-better-AI accrue to the match going forward. Referenced by Braxton as relevant to understanding the job creation channel in Wang-Wong&amp;rsquo;s framework (citing Pissarides (2000), Chapter 3).&lt;/p&gt;</description></item><item><title>Comment on: Is it AI or data that drives market power?</title><link>https://macropaperwarehouse.com/papers/comment-on-is-it-ai-or-data-that-drives-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/comment-on-is-it-ai-or-data-that-drives-market-power/</guid><description>&lt;p&gt;This paper is a published comment by Miao Ben Zhang (USC Marshall School of Business) on Mihet, Rishabh, and Gomes (2025), &amp;ldquo;Is It AI or Data That Drives Market Power?&amp;rdquo; Zhang identifies three contributions of the commented paper and benchmarks each against the existing literature, offering targeted suggestions for strengthening the analysis.&lt;/p&gt;
&lt;p&gt;The first contribution Zhang discusses is the commented paper&amp;rsquo;s distinction between raw data, AI capability, and processed data. Raw data is modeled as a by-product of production linearly related to firm size; processed data is modeled as the abundance of signals improving the precision of firms&amp;rsquo; next-period productivity predictions. The commented paper&amp;rsquo;s key modeling innovation is a formula linking raw data (n_{i,t}), firm-level AI capability (z_i), and processed data (n_{i,t}-tilde): processed data equals a weighted sum of an information entropy effect — e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) — and an AI capability effect — (1 - e^(-z_i)) * n_{i,t} * e^(n_{i,t}). Zhang notes this formula implies that the marginal value of raw data can turn negative for firms with low AI capability, consistent with information-theoretic constraints from the rational inattention literature (Sims, 2003). Zhang requests more empirical support for this equation, specifically asking whether low-AI firms exhibit lower TFP than high-AI firms at similar data-intensity levels, and encouraging discussion of existing measures of data-processing ability such as human capital in data engineering and ML pipeline automation.&lt;/p&gt;
&lt;p&gt;The second contribution is the commented paper&amp;rsquo;s modeling of a secondary market for trading processed data among firms. Zhang notes that facilitating processed data markets — for example via APIs or structured knowledge sharing — can, per the commented paper&amp;rsquo;s simulation and empirical analysis, democratize innovation and reduce market concentration, enabling even low-AI firms to compete. Zhang flags that the paper is silent on firm acquisition as an alternative channel for accessing processed data, arguing this omission is significant given that processed data, unlike ideas or technologies, is less portable and cannot be obtained simply by poaching skilled employees.&lt;/p&gt;
&lt;p&gt;The third contribution is the commented paper&amp;rsquo;s empirical strategy. The commented paper constructs firm-level proxies for AI intensity and data intensity, then exploits two exogenous technological shocks — the advent of AWS cloud computing and transformer-based architectures — to identify causal effects of improvements in compute and processed data accessibility. The evidence shows that compute improvements disproportionately benefit data-rich firms, while processed data access disproportionately benefits low-AI firms. The central empirical message is that access to raw data tends to foster market concentration, whereas access to processed data tends to reduce market concentration. Zhang raises a measurement concern: the commented paper relies on firm-level Herfindahl-Hirschman Index (HHI) calculations based on time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang argues a positive effect on this HHI could reflect either genuine firm growth relative to competitors or reclassification of the firm into different, possibly more concentrated, sectors — making the HHI measure alone insufficient to support claims about product market concentration. Zhang recommends complementing this with industry-level concentration measures anchored to fixed baseline industry codes (FIC codes from Hoberg and Phillips, 2016), constructed at the FIC-year level, following the approach of Gutierrez and Philippon (2017) on industries&amp;rsquo; growth and median Q.&lt;/p&gt;
&lt;p&gt;No quantitative magnitudes from regressions or calibrations are reported in the comment itself, as this is a discussion piece rather than an original empirical paper. All claims above are drawn directly from the text.&lt;/p&gt;
&lt;p&gt;Q: What are the three contributions of Mihet, Rishabh, and Gomes (2025) that Zhang identifies?
A: First, the paper explicitly models the distinct roles of raw data, AI capability, and processed data, linking the information entropy literature to firm production. Second, it models a secondary market for trading processed data among firms, relevant for policy on data sharing platforms. Third, it empirically tests the model&amp;rsquo;s predictions using firm-level proxies and two exogenous technological shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the core formula linking raw data, AI capability, and processed data in the commented paper?
A: Processed data (n_{i,t}-tilde) equals e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) plus (1 - e^(-z_i)) * n_{i,t} * e^(n_{i,t}), where z_i is firm-level AI capability and n_{i,t} is raw data. The first term captures the information entropy effect (which can reduce or negate the value of raw data for low-AI firms) and the second captures the AI capability effect (where AI turns raw data into abundant useful signals).&lt;/p&gt;
&lt;p&gt;Q: Why can the marginal value of raw data turn negative, according to the framework?
A: Information-theoretic constraints — long studied through concepts like Shannon entropy and Sims&amp;rsquo;s rational inattention — imply that unprocessed raw data may harm rather than help firms that lack adequate processing capabilities. Zhang situates this in the broader macro-finance literature on information choice (Sims, 2003; Veldkamp, 2011).&lt;/p&gt;
&lt;p&gt;Q: What empirical suggestion does Zhang make regarding the raw data versus AI capability distinction?
A: Zhang asks whether, in the commented paper&amp;rsquo;s sample of publicly-traded firms with measures of data intensity and AI intensity, low-AI firms exhibit lower TFP (following Imrohoroglu and Tuzel, 2012) than high-AI firms when controlling for similar levels of data intensity. Zhang also encourages discussion of anecdotal evidence for negative information entropy effects and of existing measures of data processing ability such as human capital in data engineering, annotation, cleaning, or ML pipeline automation (Abis and Veldkamp, 2024).&lt;/p&gt;
&lt;p&gt;Q: What is the policy relevance of the secondary market for processed data?
A: The commented paper&amp;rsquo;s simulation and empirical analysis shows that facilitating processed data markets (e.g., via APIs or structured knowledge sharing) can democratize innovation and reduce market concentration, enabling even low-AI firms to compete. This aligns with recent literature on secondary markets for structured data and foundation model outputs (Gans, 2018, 2024; Conti et al., 2023, 2024; Athey, 2019). Platforms may have incentives to restrict processed data access, potentially reinforcing incumbent power (Carballa Smichowski et al., 2023).&lt;/p&gt;
&lt;p&gt;Q: What channel does Zhang argue the commented paper neglects in its analysis of market concentration?
A: Zhang argues the paper is silent on firm acquisition as an alternative means by which firms access processed data, noting that processed data is less portable than ideas or technologies — it cannot be obtained simply by poaching a skilled employee. Zhang contends this acquisition channel appears central to the paper&amp;rsquo;s focus on market concentration and encourages the authors to include a discussion of it.&lt;/p&gt;
&lt;p&gt;Q: What is the central empirical finding of the commented paper regarding raw versus processed data and market concentration?
A: Access to raw data tends to foster market concentration, while access to processed data tends to reduce market concentration. The evidence shows that compute improvements (proxied by the AWS shock) disproportionately benefit data-rich firms, while processed data accessibility (proxied by the transformer architecture shock) disproportionately benefits low-AI firms, consistent with theoretical predictions.&lt;/p&gt;
&lt;p&gt;Q: What is Zhang&amp;rsquo;s specific concern about the HHI measure used in the commented paper?
A: The commented paper constructs firm-level HHI using time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang argues a positive effect on this HHI is ambiguous: it could reflect genuine firm growth relative to competitors or reclassification of the firm into different, possibly more concentrated, sectors. Zhang concludes that the HHI measure alone is not strong enough to support claims about product market concentration.&lt;/p&gt;
&lt;p&gt;Q: What robustness check does Zhang recommend for the empirical analysis?
A: Zhang recommends constructing industry-level concentration measures at the FIC-year level using fixed baseline FIC codes from Hoberg and Phillips (2016), available at the Hoberg-Phillips Data Library. The authors could then analyze how industries with high versus low average or median AI intensity and data intensity respond to the two technological shocks in terms of concentration. Zhang cites Gutierrez and Philippon (2017) as an example of this approach and notes it would help distinguish within-industry dynamics from shifts in firm business focus, aligning with best practices from De Loecker, Eeckhout, and Unger (2020) on persistent market power.&lt;/p&gt;
&lt;p&gt;Raw data: A by-product of firms&amp;rsquo; production, modeled as linearly related to firm size; represents unprocessed observations that have not yet been transformed into useful signals. Distinguished from processed data, which is what actually improves productivity predictions.&lt;/p&gt;
&lt;p&gt;Processed data: Modeled as the abundance of signals that improves the precision of firms&amp;rsquo; predictions of their next-period productivity (following Farboodi and Veldkamp, 2022). Unlike ideas or technologies, processed data is less portable and cannot easily be transferred by poaching skilled employees.&lt;/p&gt;
&lt;p&gt;AI capability (z_i): Firm-level ability to transform raw data into processed data. Firms with low AI capability may receive negative marginal value from additional raw data due to information entropy effects; firms with high AI capability extract large gains from the same raw data.&lt;/p&gt;
&lt;p&gt;Information entropy effect: The component of the raw-to-processed-data transformation — e^(-z_i) * (-n_{i,t} * ln(n_{i,t})) — that captures the information-theoretic cost of possessing raw data without adequate processing capability. At low AI capability, this effect can reduce or negate the precision of signals.&lt;/p&gt;
&lt;p&gt;Secondary market for processed data: A market in which firms trade processed data, modeled in the commented paper as a platform or API-based exchange. The commented paper&amp;rsquo;s analysis shows this market can democratize innovation and reduce market concentration by enabling low-AI firms to access processed data they cannot produce internally.&lt;/p&gt;
&lt;p&gt;Firm-level HHI (text-based): Herfindahl-Hirschman Index calculated using time-varying, text-based industry definitions (Hoberg and Phillips, 2016). Zhang identifies a measurement ambiguity: a positive effect on this measure could reflect genuine competitive gains or reclassification into more concentrated sectors.&lt;/p&gt;</description></item><item><title>Community Engagement and Public Safety: Evidence from Crime Enforcement Targeting Immigrants</title><link>https://macropaperwarehouse.com/papers/community-engagement-and-public-safety-evidence-from-crime-enforcement-targeting-immigrants/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/community-engagement-and-public-safety-evidence-from-crime-enforcement-targeting-immigrants/</guid><description>&lt;p&gt;This paper studies how immigration enforcement affects public safety, asking two questions: (1) what is the effect of increased enforcement on criminal victimization, and (2) how does increased enforcement affect victims&amp;rsquo; willingness to report crimes to police? The authors exploit the staggered rollout of the U.S. Secure Communities (SC) program — the largest expansion of interior immigration enforcement in U.S. history — across counties between 2008 and 2013. SC expanded information sharing between local police and federal immigration authorities, causing ICE honored detainer requests to increase by over 50% following program activation.&lt;/p&gt;
&lt;p&gt;The primary data source is the restricted-access National Crime Victimization Survey (NCVS), which measures victimizations independently of whether they were reported to police and includes respondent ethnicity. This allows the authors to separately estimate effects on underlying crime incidence and on reporting behavior for Hispanic and non-Hispanic individuals. The empirical strategy uses a staggered difference-in-differences design following Sun and Abraham (2021), comparing earlier-treated counties to the last 25% of counties to activate SC, with estimates run separately by ethnicity.&lt;/p&gt;
&lt;p&gt;The main findings run contrary to the stated policy goal of improving public safety. Among Hispanic individuals, SC caused a statistically significant 0.15 percentage point increase in monthly victimization — a 16% increase relative to the pre-period baseline of 0.9 percentage points — implying approximately 1.3 million additional crimes against Hispanics in the two years following program activation. The increase is concentrated primarily in property crimes (a statistically significant 15% increase), with a similarly sized but imprecisely estimated 15% increase in violent crime victimizations. The victimization increase is larger for Hispanic females (0.23 percentage points, or 25%) and in counties with higher shares of non-citizen Hispanic residents.&lt;/p&gt;
&lt;p&gt;Simultaneously, SC caused a 9.5 percentage point decline in the likelihood that Hispanic victims report incidents to police — a 30% decline relative to the pre-period mean reporting rate of 33 percentage points. This reporting decline is primarily driven by a 34% decline in the reporting of property offenses. No changes in victimization or reporting are found for non-Hispanic individuals in the aggregate, though non-Hispanic individuals in neighborhoods with high Hispanic population shares do experience higher victimization rates after SC.&lt;/p&gt;
&lt;p&gt;Critically, reported crime rates (the product of victimization and reporting) are unchanged for both Hispanic and non-Hispanic individuals, explaining why prior studies using administrative reported-crime data found null effects of SC. The null effect on reported crime masks two large, opposing causal forces.&lt;/p&gt;
&lt;p&gt;The authors provide evidence that the decline in crime reporting is the primary driver of the increase in victimization. Cohorts with larger reporting declines experienced larger victimization increases, and a decomposition exercise shows the reporting decline is substantially more important than concurrent SC-induced changes in unemployment, wages, female-headed household shares, and the male immigrant share. Supporting data from 75 police departments confirm no change in 911 call volumes or total arrest volumes, while showing a decline in the Hispanic share of arrestees in both Hispanic and non-Hispanic neighborhoods — consistent with reduced reporting leading to reduced apprehension of offenders, with offending shifting toward non-Hispanic individuals.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are estimated for the population residing in counties exceeding 100,000 residents (representing 61% of total U.S. population and 69% of the Hispanic population), excluding southern border counties and states that actively resisted SC implementation (Illinois, Massachusetts, New York). Effects apply to all Hispanic respondents — citizens and non-citizens — consistent with prior evidence that citizen Hispanics respond to immigration enforcement out of concern for non-citizen contacts.&lt;/p&gt;
&lt;p&gt;Q: What was the Secure Communities program and how was it implemented?
A: SC was a federal program launched in 2008 that required fingerprints of individuals booked into local jails to be forwarded not only to the FBI but also to the Department of Homeland Security, enabling automatic screening for immigration violations. Local authorities could not prevent federal officials from learning of an arrestee&amp;rsquo;s immigration status. The program rolled out county-by-county between October 2008 and January 2013 due to technological constraints and resource bottlenecks, generating the staggered variation used for identification.&lt;/p&gt;
&lt;p&gt;Q: How large was the first-stage effect on actual immigration enforcement?
A: County-level honored ICE detainer requests increased by over 50% following SC activation, with a similar 40% increase in all detainer requests. The number of honored detainers nationwide doubled between 2008 and 2012. Over 90% of detainers and removals in any given month were for individuals of Hispanic ethnicity.&lt;/p&gt;
&lt;p&gt;Q: What is the main finding on Hispanic victimization?
A: SC caused a 0.15 percentage point increase in monthly Hispanic victimization rates, a 16% increase relative to the pre-period baseline of 0.9 percentage points. This translates to approximately 1.3 million additional crimes against Hispanics over two years following program activation, calculated by multiplying the monthly effect by 24 months and the 35.3 million Hispanics in the sample counties.&lt;/p&gt;
&lt;p&gt;Q: What is the main finding on Hispanic crime reporting?
A: SC caused a 9.5 percentage point decline in the likelihood that Hispanic victims report incidents to police, a 30% decline relative to the pre-period mean reporting rate of 33 percentage points. This decline occurred relatively quickly after activation and was concentrated in property offenses, where reporting fell by 34%.&lt;/p&gt;
&lt;p&gt;Q: Why do reported crime rates show no change despite large shifts in victimization and reporting?
A: Reported crime rates — the probability of being victimized and reporting the crime — are unchanged because the 16% increase in victimization and the 30% decline in reporting are approximately offsetting in magnitude. This explains why prior work using administrative police data (Miles and Cox 2014; Treyger et al. 2014; Hines and Peri 2019) found null effects of SC on reported crime: those data sources cannot separately identify the two underlying changes.&lt;/p&gt;
&lt;p&gt;Q: Does SC affect non-Hispanic individuals?
A: In the aggregate, SC has no statistically significant effect on non-Hispanic victimization or reporting. However, non-Hispanic individuals living in neighborhoods with high Hispanic population shares do experience victimization increases, and in those neighborhoods their reporting rates also decline slightly. Re-weighting non-Hispanic respondents to match the county composition of Hispanic respondents yields an 8% increase in non-Hispanic victimization, suggesting spillover effects in Hispanic-dense areas.&lt;/p&gt;
&lt;p&gt;Q: What mechanism links the reporting decline to the victimization increase?
A: The authors argue that reduced victim reporting lowers the probability that offenders are apprehended, thereby reducing the cost of committing crimes. They demonstrate this through two analyses: first, cohorts of counties with larger reporting declines experienced larger victimization increases; second, a decomposition shows the reporting channel is substantially more important than concurrent SC-induced changes in unemployment, wages, female-headed household shares, and the male immigrant share of the population.&lt;/p&gt;
&lt;p&gt;Q: What do the police administrative data show about offender composition?
A: Data from 75 police departments show no change in 911 call volumes or total arrest volumes following SC — consistent with the NCVS finding of unchanged reported crime rates. However, the Hispanic share of arrestees declined after SC, with a 1.5 percentage point drop in Hispanic neighborhoods (off a base of 54%), suggesting the rise in offending was more concentrated among non-Hispanic offenders as reduced reporting lowered expected punishment probabilities.&lt;/p&gt;
&lt;p&gt;Q: How does the victimization effect vary by gender?
A: The victimization point estimate for Hispanic males is 0.085 percentage points and imprecisely estimated (SE = 0.088). For Hispanic females, the effect is over 2.5 times larger at 0.23 percentage points, a 25% increase. The decline in reporting is comparable in magnitude across male and female Hispanic victims, suggesting fear of enforcement is similar by gender but that females disproportionately bear the crime burden.&lt;/p&gt;
&lt;p&gt;Q: How does the victimization effect vary by neighborhood non-citizen Hispanic share?
A: Victimization effects for Hispanics are relatively constant across neighborhood types but are higher — around 25% — in neighborhoods with the highest shares of non-citizen Hispanics. Counties with higher non-citizen Hispanic shares also exhibit higher ICE removal rates, indicating greater total enforcement, and these counties have higher victimization effects. Reporting declines among Hispanics appear relatively uniform across neighborhood types.&lt;/p&gt;
&lt;p&gt;Q: Could survey attrition or compositional changes explain the results?
A: The authors rule this out through several tests. First, SC has no statistically significant effect on household survey response rates, even in Census tracts above the 90th percentile of Hispanic share. A worst-case bias calculation implies attrition could account for at most 26% of the victimization effect. Second, re-estimating using predicted victimization (based on pre-SC demographics) yields precise null effects, indicating the increase is not driven by compositional change. Third, results are stable when restricting to respondents present at all survey waves or using individual fixed effects.&lt;/p&gt;
&lt;p&gt;Q: Could the reporting decline be mechanical — reflecting a change in the types of crimes committed rather than behavioral change?
A: The authors test this by constructing predicted reporting rates using pre-SC incident characteristics. The largest alternative estimate is -1.45 percentage points, over six times smaller than the estimated main reporting effect of 9.5 percentage points, ruling out crime composition change as the primary explanation. Results also hold when focusing on always-respondents and using individual fixed effects, ruling out entry of low-reporting individuals into the survey.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results to alternative empirical strategies?
A: Results are robust to including states that resisted SC (with somewhat smaller magnitudes as expected), alternative population cutoffs, TWFE specifications, the Borusyak et al. (2021) and Callaway and Sant&amp;rsquo;Anna (2021) estimators (which yield larger point estimates), a triple-differences specification using non-Hispanics as an additional control group, and the inclusion of time-varying unemployment rates. The dynamic event-study plots show parallel pre-trends across all specifications.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the null effect on aggregate victimization?
A: The authors estimate that the policy ruled out declines in aggregate victimization larger than 3.3%, indicating SC did not generate meaningful improvements in aggregate public safety. This contradicts the stated mission of immigration enforcement agencies. The findings imply that policies targeting immigrant communities can generate public safety costs through trust erosion that outweigh any deterrence or incapacitation benefits.&lt;/p&gt;
&lt;p&gt;Secure Communities (SC): A federal program launched in 2008 requiring automatic sharing of fingerprints from local jail bookings with the Department of Homeland Security, enabling identification of unauthorized immigrants among local arrestees and triggering ICE detainer requests; the largest expansion of interior immigration enforcement in U.S. history.&lt;/p&gt;
&lt;p&gt;Chilling effect: The mechanism by which immigration enforcement raises the perceived cost of contacting law enforcement for immigrant victims and witnesses — through fear that they, a family member, or neighbor will be detained or deported — thereby reducing willingness to report crimes independently of any change in underlying criminality.&lt;/p&gt;
&lt;p&gt;Victimization rate: The likelihood that an individual is the victim of a crime in a given period, measured via the NCVS independently of whether the crime was reported to police; the paper&amp;rsquo;s primary measure of public safety.&lt;/p&gt;
&lt;p&gt;Reporting rate: The likelihood that a criminal victimization is reported by the victim to the police, measured as a share of all crime incidents; distinct from victimization rate and central to the paper&amp;rsquo;s decomposition of reported crime into its two components.&lt;/p&gt;
&lt;p&gt;Reported crime rate: The joint probability of being victimized and reporting the crime, analogous to measures available in administrative police data such as the FBI UCR; this outcome masks the opposing effects of SC on victimization and reporting.&lt;/p&gt;
&lt;p&gt;Honored detainer: An ICE detainer request that results in a transfer of the arrested individual to ICE custody; the paper&amp;rsquo;s preferred measure of immigration enforcement intensity because it is available both before and after SC activation and is more directly linked to deportation actions than all detainer requests.&lt;/p&gt;
&lt;p&gt;Decomposition of victimization increase: The paper&amp;rsquo;s procedure for quantifying the relative importance of the reporting-channel (reduced probability of apprehension) versus other SC-induced social and economic changes (unemployment, wages, female-headed households, male immigrant share) in explaining the rise in Hispanic victimization.&lt;/p&gt;</description></item><item><title>Competing under Information Heterogeneity: Evidence from Auto Insurance</title><link>https://macropaperwarehouse.com/papers/competing-under-information-heterogeneity-evidence-from-auto-insurance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/competing-under-information-heterogeneity-evidence-from-auto-insurance/</guid><description>&lt;p&gt;This paper studies imperfect competition in selection markets where competing firms have heterogeneous information about consumers — a layer of asymmetry distinct from the classic buyer-seller information gap. The central questions are: how do inter-firm information asymmetries shape equilibrium pricing, consumer sorting, and market efficiency; and whether a centralized bureau that aggregates and equalizes firms&amp;rsquo; risk information can promote competition and improve welfare.&lt;/p&gt;
&lt;p&gt;The empirical setting is the Italian mandatory motor vehicle liability insurance market (Responsabilità Civile Auto). The authors use the IPER dataset from IVASS, a nationally representative panel of matched insurer-insuree contracts covering 124,428 liability insurance contracts for new customers in the province of Rome from 2013 to 2021. The panel tracks consumers across insurer switches, enabling construction of individual-specific risk estimates from ex-post claim records using Poisson regressions for claim frequency and log-normal regressions for claim severity. The analysis focuses on the top 10 largest firms plus a composite fringe firm.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s empirical strategy proceeds in three stages. First, individual risk types are estimated from multi-year claim panels. Second, demand parameters — price sensitivity and firm-level unobserved product attributes — are recovered using a novel fixed-point algorithm (extending Berry et al. 1995) that infers the full offered-price distribution from observed transaction prices alone, without parametric restrictions on price distributions across firms. Third, supply-side parameters — pricing coefficients, signal variances, and cost parameters — are identified by exploiting the monotone mapping between offered prices and private signals, borrowing from the nonparametric auction literature.&lt;/p&gt;
&lt;p&gt;The model features firms that each draw a private Gaussian signal about a consumer&amp;rsquo;s true risk type theta, with firm-specific signal standard deviation sigma_j. Lower sigma_j means higher information precision. Firms set prices as a linear function of their posterior risk rating: p_j = alpha_j + beta_j * E(theta | theta_j, D=j). Firms simultaneously choose pricing coefficients to maximize expected profits.&lt;/p&gt;
&lt;p&gt;Key empirical findings: (1) Firms differ substantially in how sensitively their premiums respond to realized consumer risk — a reduced-form measure of information precision — with Figure 2 showing wide cross-firm variation in premium-to-risk coefficients. (2) Structural estimation confirms substantial heterogeneity in signal standard deviations sigma_j across all 11 firms. Firms with less accurate risk-rating algorithms (higher sigma_j) tend to have more efficient cost structures (lower claim-processing cost parameter k_j), generating distinct comparative advantages. (3) Baseline pricing coefficients alpha_j and risk-sensitivity coefficients beta_j vary dramatically across firms. (4) Senior drivers are less price sensitive; urban drivers are more price sensitive. Lower-risk consumers show stronger preferences for Firms 3 and 5, while higher-risk consumers disproportionately choose Firm 8.&lt;/p&gt;
&lt;p&gt;Counterfactual simulations assess three information policies relative to the baseline. Under a centralized risk bureau — which collects each firm&amp;rsquo;s signal, aggregates them weighted by precision, and distributes the combined signal equally — average premiums fall by 21.6% and consumer surplus rises by 15.7%. The efficiency benchmark (firms observe true risk perfectly) yields a 25.7% premium reduction and a 16.9% consumer surplus gain, so the bureau recovers almost all the efficiency gap. The privacy benchmark (all firms restricted to the coarsest signal in the market) raises surplus for high-risk consumers by 6.9% but harms low-risk consumers.&lt;/p&gt;
&lt;p&gt;The bureau&amp;rsquo;s price reduction operates through two channels: it eliminates the market power that accrues to firms with superior private information, and it aligns firms&amp;rsquo; risk evaluations, enabling sharper undercutting. The bureau also reduces average costs by 12 euros per contract by enabling more efficient insurer-insuree matching — cost-efficient claim processors can better target the consumer types they have a comparative advantage in serving.&lt;/p&gt;
&lt;p&gt;The analysis is confined to new customers in Rome&amp;rsquo;s provincial market to avoid complications from dynamic pricing and consumer-firm learning. The model abstracts away from optional contract clauses (treated as observable characteristics) and does not model the specific mechanisms generating information heterogeneity.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s core research question?
A: The paper asks how information asymmetries between competing firms (not just between buyers and sellers) shape equilibrium pricing strategies, consumer sorting, and market efficiency in a selection market, and whether a centralized bureau that equalizes firms&amp;rsquo; access to aggregated risk information can improve competition and welfare. This extends the classic Akerlof-Rothschild-Stiglitz framework by introducing a second layer of asymmetry — across sellers themselves.&lt;/p&gt;
&lt;p&gt;Q: Why is the Italian auto insurance market well suited for this study?
A: Italy mandates liability insurance for all drivers and prohibits rejections, so the analysis focuses entirely on how consumers sort across insurers rather than on participation margins. The IPER dataset from IVASS is a nationally representative panel tracking policyholders even across insurer switches, providing both premium and ex-post claim records needed to construct individual risk types. The market has roughly 50 competing firms using demonstrably heterogeneous pricing algorithms, documented through a survey of major insurers and reduced-form regressions.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure firm-level information precision in the reduced-form analysis?
A: They estimate individual-specific risk types from a panel of claim records using Poisson regressions (claim frequency) and log-normal regressions (claim severity), then regress each firm&amp;rsquo;s premiums on those estimated risk measures. Firms whose premiums respond more sensitively to realized risk are inferred to have higher information precision. Figure 2 shows that these premium-to-risk coefficients vary significantly across firms — for example, Firm 7&amp;rsquo;s premiums are considerably more sensitive to risk than Firm 8&amp;rsquo;s — providing reduced-form evidence of heterogeneous information precision before any structural estimation.&lt;/p&gt;
&lt;p&gt;Q: What is the structural model&amp;rsquo;s signal structure?
A: Each firm j draws a private signal theta_j ~ N(theta, sigma_j^2) about a consumer&amp;rsquo;s true risk type theta, where sigma_j is the firm-specific signal standard deviation. A smaller sigma_j means higher precision. Signals are independent across firms conditional on theta, analogous to common-value auctions where firms receive noisy estimates of a shared unknown value (expected claim payouts). The parameter sigma_j is the key structural object the paper identifies and estimates.&lt;/p&gt;
&lt;p&gt;Q: What is novel about the demand estimation strategy?
A: Standard demand estimation assumes the same price is offered to all consumers or that the full price menu is observed. Here, only transaction prices are observed — the prices of unchosen insurers are not in the data. The authors apply the Wu and Xin (2024) fixed-point algorithm, which jointly estimates consumers&amp;rsquo; sorting probabilities, offered price distributions, and demand parameters by adding an outer loop over sorting propensities to the Berry (1994) contraction mapping. No parametric restrictions are imposed on the offered price distributions, and they are allowed to vary fully across firms.&lt;/p&gt;
&lt;p&gt;Q: How are firms&amp;rsquo; signal variances identified separately from pricing coefficients?
A: There is a one-to-one mapping between a firm&amp;rsquo;s offered price and its signal (prices increase monotonically in the signal, analogous to bids in auctions). After recovering the offered price distribution from the demand step, the authors observe price dispersion at a fixed risk level. By focusing on average prices conditional on each risk level, signal noise averages out, identifying the pricing coefficients beta_j. The residual price dispersion at fixed risk then identifies signal variance sigma_j^2.&lt;/p&gt;
&lt;p&gt;Q: What does structural estimation reveal about the relationship between information precision and cost efficiency?
A: Firms with higher signal standard deviations (less precise risk evaluation) tend to have lower claim-processing cost parameters k_j — they are more efficient at handling claims. This creates distinct comparative advantages: some firms excel at risk identification but face higher processing costs, while others process claims cheaply but evaluate risk less precisely. This heterogeneity means information-equalizing policies have differentiated firm-level impacts.&lt;/p&gt;
&lt;p&gt;Q: What are the quantitative effects of the centralized risk bureau on premiums and consumer surplus?
A: The bureau reduces average premiums by 21.6% relative to baseline and increases consumer surplus by 15.7%. The efficiency benchmark — where firms observe consumers&amp;rsquo; true risk perfectly — produces a 25.7% premium reduction and a 16.9% consumer surplus gain. The bureau therefore closes nearly all of the gap to the first-best allocation in surplus terms (15.7% vs. 16.9%).&lt;/p&gt;
&lt;p&gt;Q: Through what mechanisms does the bureau reduce prices?
A: Two distinct channels are identified. First, equalizing information precision eliminates the informational market power held by firms with superior signals, compelling them to compete more aggressively on price. Second, when all firms share the same risk evaluation of a consumer, they can undercut each other more precisely, which intensifies price competition further. Both channels operate simultaneously under the bureau.&lt;/p&gt;
&lt;p&gt;Q: How does the bureau affect consumer surplus distribution across risk types?
A: The bureau primarily benefits low-risk consumers because improved information allows firms to price discriminate more accurately on risk type, lowering prices for those who are low risk. High-risk consumers see smaller benefits and may face relatively higher premiums. This contrasts with the privacy benchmark, where restricting all firms to the coarsest signal in the market raises high-risk consumers&amp;rsquo; surplus by 6.9% — because it becomes harder for firms to distinguish them from low-risk consumers.&lt;/p&gt;
&lt;p&gt;Q: What is the cost efficiency effect of the bureau?
A: Under the centralized risk bureau, average costs per contract fall by 12 euros. This reflects more efficient insurer-insuree matching: when firms have equal and better information, those with cost advantages in claims processing can better identify and attract the consumer types they are relatively best equipped to serve. The authors note that given the scale of the Italian auto insurance market (approximately 31 million contracts annually), this per-contract saving implies a substantial aggregate impact.&lt;/p&gt;
&lt;p&gt;Q: What happens to firm profits under the bureau, and is the impact uniform?
A: Average profits decline overall due to lower prices. However, the impact is heterogeneous across firms. Firms that rely most heavily on superior information precision — often smaller, more specialized firms — experience greater profit losses, since the bureau most directly erodes their competitive advantage.&lt;/p&gt;
&lt;p&gt;Q: How does the privacy benchmark differ from the bureau scenario?
A: The privacy benchmark simulates a regulation that restricts all firms to using only basic consumer information, setting signal variance to the highest level observed in the market. Unlike the bureau (which improves and equalizes information), this benchmark degrades information uniformly. It produces opposite distributional effects: high-risk consumers gain 6.9% in surplus as cross-subsidization from low-risk to high-risk consumers increases, while low-risk consumers are worse off.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper focus on new customers only?
A: Focusing on new customers avoids complications from dynamic pricing, where insurers update premiums based on accumulated claim history with a specific consumer, and from consumer-firm learning dynamics. This follows standard practice in the empirical asymmetric information literature, as cited in Chiappori and Salanie (2000) and Crawford et al. (2018).&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to and extend prior work on selection markets?
A: Prior empirical work on imperfect competition in selection markets — including Einav et al. (2010), Crawford et al. (2018), and related studies — assumes that competing firms have symmetric information about consumers. This paper is described as introducing the first tractable empirical framework for analyzing selection markets where firms have heterogeneous information. It also incorporates multidimensional cost heterogeneity on the supply side, adding to work by Salanié (2017) and Nelson (2025).&lt;/p&gt;
&lt;p&gt;Q: What do the reduced-form regressions reveal about pricing heterogeneity across insurers?
A: Firm-level regressions of premiums on observable risk factors show R-squared values ranging from 0.39 to 0.59. Estimated coefficients on key risk factors vary dramatically: being one year older reduces premiums by 0.25 to 1.68 euros depending on the firm; a higher bonus-malus class increases premiums by 12 to 32 euros; one additional accident in the previous five years raises premiums by 74 to 181 euros. These ranges reflect genuine differences in actuarial algorithms, not just sampling variation.&lt;/p&gt;
&lt;p&gt;Q: What is the bonus-malus system and why does its saturation matter for the paper&amp;rsquo;s setting?
A: Italy&amp;rsquo;s bonus-malus (BM) system assigns drivers to one of 18 risk classes based on accident history. Because approximately 80% of policyholders are in the best class (BM class 1), the public BM system provides limited granularity for risk evaluation. This saturation creates strong incentives for firms to develop proprietary risk-rating algorithms, which is the institutional basis for the substantial information heterogeneity that the paper documents and models.&lt;/p&gt;
&lt;p&gt;Information Precision (sigma_j): In the paper&amp;rsquo;s model, the firm-specific parameter measuring the dispersion of a firm&amp;rsquo;s private signal about a consumer&amp;rsquo;s true risk type. Firm j draws signal theta_j ~ N(theta, sigma_j^2); 1/sigma_j is information precision. A smaller sigma_j means the firm more accurately identifies consumer risk. This is not merely a theoretical construct — the paper identifies and estimates sigma_j structurally for each of the 11 firms.&lt;/p&gt;
&lt;p&gt;Heterogeneous Information: The condition where competing firms hold signals of different precision about the same consumer&amp;rsquo;s unobserved risk type, introducing asymmetry not just between buyers and sellers (as in Akerlof 1970) but among sellers themselves. This is the paper&amp;rsquo;s central departure from prior literature on selection markets, which assumed symmetric information among firms.&lt;/p&gt;
&lt;p&gt;Centralized Risk Bureau: A policy institution that collects each firm&amp;rsquo;s analyzed risk signal, aggregates them weighted by each firm&amp;rsquo;s information precision (producing a combined signal more precise than any individual firm&amp;rsquo;s signal), and makes the aggregated information equally accessible to all firms. The bureau is the paper&amp;rsquo;s primary policy counterfactual, and it is modeled as equalizing both the level and heterogeneity of information precision across competitors.&lt;/p&gt;
&lt;p&gt;Offered vs. Accepted Price Distribution: A distinction central to the paper&amp;rsquo;s identification strategy. The accepted price distribution is what is observed in transaction data — prices conditional on the consumer having chosen that firm. The offered price distribution is the full set of prices the firm would charge across all consumers, including those who did not select it. The paper recovers the offered distribution from the accepted distribution using a fixed-point algorithm, without imposing parametric restrictions.&lt;/p&gt;
&lt;p&gt;Selection Loop: The paper&amp;rsquo;s methodological extension of the Berry (1994) BLP contraction mapping for mean utilities. An outer loop iterates over consumers&amp;rsquo; sorting propensities to jointly recover offered price distributions, sorting probabilities, and demand parameters when only transaction prices are observed. This technique handles the endogeneity of which prices are accepted.&lt;/p&gt;
&lt;p&gt;Risk Rating: The firm&amp;rsquo;s posterior assessment of a consumer&amp;rsquo;s expected cost, computed as the posterior mean E(theta | theta_j, D=j) — the expected true risk type conditional on the firm&amp;rsquo;s private signal and the consumer selecting that firm. Firms set prices as a linear function of their risk rating: p_j = alpha_j + beta_j * E(theta | theta_j, D=j).&lt;/p&gt;
&lt;p&gt;Comparative Advantage (information vs. cost): The paper&amp;rsquo;s finding that firms with lower information precision (higher sigma_j) tend to have more efficient cost structures (lower k_j), and vice versa. This cross-sectional negative correlation between information advantage and cost advantage means that policy interventions that equalize information precision shift the basis of competition from information asymmetry to cost specialization.&lt;/p&gt;</description></item><item><title>Competition and the Phillips curve</title><link>https://macropaperwarehouse.com/papers/competition-and-the-phillips-curve/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/competition-and-the-phillips-curve/</guid><description>&lt;p&gt;Fujiwara and Matsuyama ask whether the well-documented flattening of the New Keynesian Phillips curve (NKPC) and the concurrent rise in market concentration and markup rates are causally linked or merely coincidental. Under the canonical New Keynesian model with CES demand, competition is irrelevant to the Phillips curve regardless of whether entry is endogenous — concentration neither changes its slope nor affects inflation directly. This paper overturns that irrelevance result by extending the canonical model in two directions: (1) incorporating endogenous firm entry and exit following Bilbiie, Ghironi, and Melitz (2008) and Bilbiie, Fujiwara, and Ghironi (2014), and (2) replacing CES with the Homothetic Single Aggregator (HSA) demand system (Matsuyama and Ushchev 2017, 2020b), a flexible, tractable class of homothetic demand systems that nests CES and Translog as special cases.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s theoretical results depend on two of Marshall&amp;rsquo;s laws of demand. The Second law states that the price elasticity of demand rises with the firm&amp;rsquo;s own price; the Third law states that the rate of increase in that elasticity falls with price. Together these conditions imply that the markup rate and pass-through rate are endogenous to the competitive environment.&lt;/p&gt;
&lt;p&gt;The main findings, delivered under both Rotemberg (1982) and Calvo (1983) pricing, are that higher entry costs — leading to market concentration — cause Phillips curve flattening through two distinct, complementary channels:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Structural (steady-state) effect.&lt;/strong&gt; Under Rotemberg pricing, the slope of the NKPC is proportional to the price elasticity zeta(z); market concentration reduces z, hence reduces zeta(z) under the Second law, directly flattening the curve. Under Calvo pricing, the slope is proportional to the pass-through rate rho(z); the Third law implies that concentration reduces rho(z), again flattening the curve. The Calvo–Rotemberg equivalence, which holds under CES to first order (Roberts 1995), breaks down under HSA: each pricing mechanism highlights a different channel.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Observational (omitted variable bias) effect.&lt;/strong&gt; Endogenous entry generates an endogenous cost-push shock through strategic complementarity in price setting. Because the number of firms N_t is omitted from a naive regression of inflation on real marginal cost, and because N_t is positively correlated with the marginal cost under the Second law, the omitted variable bias is negative — the estimated slope is biased downward. This bias is amplified with greater concentration under the Third law (Rotemberg case) and under both the Second and Third laws (Calvo case).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Quantitatively, the paper simulates under three parametric HSA families — CES, Translog, and Co-PaTh (Constant Pass-Through). De Loecker, Eeckhout, and Unger (2020) document that aggregate markups rose from 21% above marginal cost to 61% — a rise of approximately 40 percentage points. The authors&amp;rsquo; simulations imply this increase corresponds to an entry cost roughly 3.5 times higher under Translog and roughly 2.5 times higher under Co-PaTh with pass-through rate rho = 0.5. Under these parameterizations, the accompanying market concentration can halve the slope of the NKPC. Impulse responses confirm that the responses of inflation to both technology shocks and monetary policy shocks become smaller as market concentration deepens.&lt;/p&gt;
&lt;p&gt;Scope conditions: results require departure from CES (the Second and/or Third law must hold); endogenous entry is necessary for the dynamic cost-push channel; the structural flattening requires only the Second law under Rotemberg but additionally the Third law under Calvo; the omitted variable bias requires the Second law under Rotemberg and both laws under Calvo. The model is closed-economy, with symmetric monopolistic competition and Rotemberg or Calvo price adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the irrelevance result the paper overturns, and why does CES produce it?&lt;/strong&gt;
Under CES, the market share function takes the form s(z) = gamma * z^(1-theta), yielding a constant price elasticity zeta = theta and a pass-through rate rho = 1, regardless of the number of firms or entry costs. As a result, concentration neither alters the slope of the NKPC nor generates any endogenous cost-push shock; competition is simply irrelevant to inflation dynamics. This irrelevance holds even with endogenous entry under CES.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the Homothetic Single Aggregator (HSA) and why is it used?&lt;/strong&gt;
HSA is a class of homothetic demand systems, originally proposed by Matsuyama and Ushchev (2017), in which the market share of each intermediate input variety depends solely on its own price normalized by a single price aggregator A_t. This single aggregator serves as a sufficient statistic summarizing all competitive pressure effects on pricing behavior, including the markup rate and pass-through rate. HSA nests CES and Translog as special cases, is analytically tractable (equilibrium existence and uniqueness are straightforward to ensure with endogenous entry), and is flexible enough to accommodate both the Second and Third laws of demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are Marshall&amp;rsquo;s Second and Third laws as defined in the paper?&lt;/strong&gt;
The Second law states that the price elasticity of demand zeta(z) is increasing in the normalized price z (equivalently, increasing in the single price aggregator A_t, which rises with fewer firms). The Third law, as defined by Matsuyama and Ushchev (2023b), states that the rate of increase in the price elasticity is decreasing in z. Together they ensure that both markup rates and pass-through rates respond systematically to changes in competitive pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does market concentration structurally flatten the NKPC under Rotemberg pricing?&lt;/strong&gt;
Under Rotemberg pricing, the slope of the NKPC equals (zeta(z) - 1) / chi, where chi is the Rotemberg price adjustment cost parameter. Higher entry costs reduce the equilibrium number of firms, which reduces competitive pressure and lowers z. Under the Second law, lower z reduces zeta(z), directly shrinking the slope coefficient. This is the steady-state effect of concentration: the structural slope of the curve declines because the price elasticity falls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does market concentration structurally flatten the NKPC under Calvo pricing?&lt;/strong&gt;
Under Calvo pricing, the slope of the NKPC is positively related to the pass-through rate rho(z) rather than the price elasticity. The Third law implies that lower z (more concentration) reduces rho(z). Market concentration therefore causes structural flattening through the pass-through channel under Calvo. This is why the Calvo–Rotemberg equivalence — which holds to first order under CES — breaks down under HSA: Rotemberg highlights the Second law / price elasticity channel and Calvo highlights the Third law / pass-through channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the endogenous cost-push shock and how does it arise?&lt;/strong&gt;
When the number of operating firms N_t changes endogenously, it alters the single price aggregator A_t and therefore the competitive environment facing each firm. Under the Second law, firms exhibit strategic complementarity in price setting: a firm reduces its markup when other firms lower their prices (A_t falls with more entry). Consequently, movements in N_t directly enter the NKPC as an additional term — (1/chi) * (1 - rho(z)) / rho(z) * N_hat_t — acting as an endogenous cost-push shock. This channel is absent under CES because rho = 1 makes the coefficient zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the endogenous cost-push shock create a negative omitted variable bias?&lt;/strong&gt;
A naive regression of inflation on real marginal cost omits the N_hat_t term. Under the Second law, N_t is positively correlated with the marginal cost (more entry drives markups down, consistent with marginal cost movements), so the omitted variable N_hat_t is positively correlated with the included regressor. Because the true coefficient on N_hat_t in the NKPC is negative, omitting it biases the estimated slope on marginal cost downward (negative omitted variable bias). The estimated relationship between inflation and marginal cost is therefore weaker than the true structural relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How is the omitted variable bias amplified by concentration?&lt;/strong&gt;
Under the Third law (Rotemberg case) and under both the Second and Third laws (Calvo case), greater market concentration amplifies the magnitude of this negative bias. The intuition is that higher concentration makes the pass-through rate rho(z) smaller, which increases the coefficient on N_hat_t in the NKPC and thereby raises the magnitude of the bias when N_hat_t is omitted. Greater concentration thus generates both more structural flattening and more observational flattening simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the quantitative magnitudes of Phillips curve flattening in the simulations?&lt;/strong&gt;
De Loecker, Eeckhout, and Unger (2020) document that aggregate markups rose from 21% above marginal cost to 61% — approximately 40 percentage points. The paper&amp;rsquo;s simulations imply this corresponds to an entry cost increase of roughly 3.5 times under Translog and roughly 2.5 times under Co-PaTh with rho = 0.5. According to Figure 2, the accompanying market concentration can halve the slope of the NKPC. The slope declines more steeply for demand systems with smaller pass-through rates (rho further from 1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do impulse responses change with market concentration?&lt;/strong&gt;
As entry costs rise (deeper concentration), the responses of the inflation rate to both technology shocks and monetary policy shocks become smaller in magnitude. Under the Second law, a positive technology shock increases the number of firms through a wealth effect, but strategic complementarity in price setting reduces markups, muting the inflation response relative to CES. The dynamic effect of endogenous entry thus weakens the transmission of real economic shocks to inflation — a supply side effect of monetary policy that parallels Baqaee, Farhi, and Sangani (2021) but operates through firm entry rather than the misallocation channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the cyclicality of the markup rate under HSA, and why is it ambiguous?&lt;/strong&gt;
Under CES with flexible prices, the markup is constant. Under CES with sticky prices, the markup is procyclical (marginal cost falls with a positive technology shock but the price is rigid in the short run). Under the Second law with flexible prices, a positive technology shock increases firm entry, which reduces markups, making the markup countercyclical. In a sticky price equilibrium under the Second and Third laws, the cyclicality is therefore ambiguous: it depends on the tension between nominal rigidities (pushing toward procyclicality) and the pass-through rate (pushing toward countercyclicality).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Why do the three price indices in the model differ, and which is used for the NKPC?&lt;/strong&gt;
The model features three aggregate price measures: the final goods price (CPI) P_t, which captures productivity effects of entry; the single price aggregator A_t, which captures competitive effects of entry and is the reference price for firms; and the average price index (PPI) p_t, which is not affected by entry effects and is the measured price index. Because entry effects shift P_t and A_t in ways that are not directly observed, the paper evaluates NKPC responsiveness in terms of p_t (PPI inflation), the measurable index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How does this paper relate to Wang and Werning (2022) and Baqaee, Farhi, and Sangani (2021)?&lt;/strong&gt;
Wang and Werning (2022) use a dynamic oligopoly model with exogenous entry and CES/Kimball demand, showing that higher concentration amplifies real effects of monetary policy and generates inflation persistence and endogenous cost-push shocks. Baqaee, Farhi, and Sangani (2021) use monopolistic competition with exogenous entry and Kimball demand under Calvo pricing, showing flattening through real rigidities and a misallocation channel (supply side effects of monetary policy). This paper uses monopolistic competition with endogenous entry and HSA under both Rotemberg and Calvo pricing; it produces supply side effects through firm entry rather than misallocation, and uses HSA rather than Kimball because HSA more readily guarantees equilibrium uniqueness with endogenous entry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What parametric families of HSA are used in simulations and what are their properties?&lt;/strong&gt;
Three families are used: CES (constant price elasticity theta, pass-through rho = 1, benchmark); Translog (satisfies the Second law, variable markups and pass-through); and Co-PaTh or Constant Pass-Through (proposed by Matsuyama and Ushchev 2020a, constant pass-through rate rho in (0,1) under flexible prices, containing CES as a limit as rho approaches 1). For Calvo pricing, a fourth family — PEM (Power Elasticity of Markup, proposed by Matsuyama and Ushchev 2023b) — is used; PEM satisfies the Third law in its strong form and contains Co-PaTh as a limit case. Translog is noted to behave similarly to Co-PaTh with rho = 0.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What are the policy implications for central banks?&lt;/strong&gt;
Rising market concentration, by flattening the NKPC both structurally and observationally, reduces the effectiveness of monetary policy in achieving price stability through real economic activity — consistent with the concerns expressed by Federal Reserve officials (Clarida, Daly, Williams) quoted in the paper. The results suggest that empirical estimates of the NKPC slope that omit endogenous entry dynamics will be systematically biased downward, potentially leading central banks to underestimate the true structural responsiveness of inflation to demand conditions. Competition policy and barriers to entry thus have macroeconomic consequences beyond standard allocative efficiency considerations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homothetic Single Aggregator (HSA):&lt;/strong&gt; A class of homothetic demand systems in which the market share of each input variety depends solely on its own price normalized by a single price aggregator A_t, which serves as a sufficient statistic for all competitive pressure effects on firm pricing behavior including the markup rate and pass-through rate. Nests CES and Translog as special cases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marshall&amp;rsquo;s Second Law of Demand (as used in the paper):&lt;/strong&gt; The condition that the price elasticity of demand zeta(z) is strictly increasing in the firm&amp;rsquo;s normalized price z. Under this condition, markup rates and pass-through rates vary endogenously with competitive pressure, and strategic complementarity in price setting arises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marshall&amp;rsquo;s Third Law of Demand (as used in the paper):&lt;/strong&gt; The condition, defined by Matsuyama and Ushchev (2023b), that the rate of increase in the price elasticity is decreasing in z. This law determines how the pass-through rate responds to concentration changes and is the relevant condition for structural flattening under Calvo pricing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pass-through rate rho(z):&lt;/strong&gt; The fraction of a cost change that a monopolistically competitive firm passes through to its price under flexible pricing, defined as rho(z) = [1 - d&lt;em&gt;ln(zeta/(zeta-1))/d&lt;/em&gt;ln(z)]^(-1). Under CES, rho = 1 (complete pass-through); under the Second law, rho &amp;lt; 1 (incomplete pass-through); it declines with concentration under the Third law.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous cost-push shock:&lt;/strong&gt; The direct effect of changes in the endogenous number of firms N_t on inflation in the NKPC, arising from strategic complementarity in price setting under HSA. This term is absent under CES (where the coefficient is zero) and generates an omitted variable bias in naive regressions of inflation on marginal cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Steady-state (structural) flattening:&lt;/strong&gt; The reduction in the true structural slope of the NKPC caused by market concentration operating through lower price elasticity (Rotemberg channel) or lower pass-through rate (Calvo channel). This is the first of the paper&amp;rsquo;s two reasons for observed Phillips curve flattening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Observational (omitted variable bias) flattening:&lt;/strong&gt; The downward bias in empirically estimated NKPC slopes arising because naive regressions omit the endogenous cost-push shock term. The bias is negative and is amplified by greater market concentration under the Third law and/or Second law depending on the pricing mechanism.&lt;/p&gt;</description></item><item><title>Competition in a Spatially-Differentiated Product Market with Negotiated Prices</title><link>https://macropaperwarehouse.com/papers/competition-in-a-spatially-differentiated-product-market-with-negotiated-prices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/competition-in-a-spatially-differentiated-product-market-with-negotiated-prices/</guid><description>&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;How does individually negotiated pricing — where buyers make discrete choices among differentiated products and negotiate transaction-specific prices — affect market power and merger effects in oligopoly markets, and how do these effects differ from the uniform-pricing benchmark?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper estimates the model using 13,788 transactions between the four main UK brick manufacturers and national house-building firms over 2003–2006. For each transaction (defined as a unique buyer-variety-destination-year combination), the data record the chosen product, negotiated price, production and delivery locations, volume, transport costs, and brick characteristics. The market is highly concentrated: four manufacturers held an 85% share of brick sales, with a two-firm concentration ratio of 0.60 and an HHI of 2,113. Spatial differentiation is a central feature — transport costs vary substantially by project location, and prices for the same brick product vary across the different projects of the same buyer depending on local competitive conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops an empirical model that adapts the Berry, Levinsohn, and Pakes (1995) differentiated-products framework to individually negotiated pricing. In the model, each buyer negotiates simultaneously and bilaterally with the sellers of the first-best and runner-up products (defined by surplus — value minus cost). The equilibrium first-best markup equals the minimum of (i) the unconstrained Nash bargaining solution, bj(wj(1) − w0), and (ii) the first-best seller&amp;rsquo;s surplus advantage over the runner-up, (wj(1) − wj(2)). Runner-up and lower-ranked sellers earn zero markups in equilibrium. This outcome is shown to be consistent with a range of non-cooperative bargaining models (Binmore 1985, Bolton and Whinston 1993, Manea 2018) and lies in the core of the associated coalition game. The TIOLI posted-price model is nested as the special case where seller bargaining skill equals one. A tractable likelihood for the joint probability of observed product choice and negotiated price is derived under the assumption that idiosyncratic taste terms follow a Generalized Extreme Value (GEV) distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The estimated mean seller bargaining skill is b̄ = 0.41 (s.e. 0.03), and a likelihood ratio test rejects the TIOLI restriction with a chi-squared statistic of 847 (p &amp;lt; 0.001), confirming that buyer bargaining power is economically and statistically significant. The model-implied price-cost margins (Lerner index) are low on average — mean of 0.08 — but vary widely across transactions (coefficient of variation of 0.78). Project location matters: sellers extract higher margins from buyers that are relatively close, taking advantage of their transport-cost proximity. Multi-product ownership also affects markups, but its relevance varies by project.&lt;/p&gt;
&lt;p&gt;Switching from negotiated to uniform pricing raises average markups by 34% at the observed market structure. However, effects are heterogeneous: approximately 15% of transactions see markup decreases. Buyers who benefit from uniform pricing are those with relatively little runner-up competition — precisely the buyers who face weak bargaining positions under negotiated pricing, and for whom the seller&amp;rsquo;s ability to use that position is constrained under a uniform rule.&lt;/p&gt;
&lt;p&gt;Under negotiated pricing, a merger affects a transaction&amp;rsquo;s markup only if it brings the first-best and runner-up products for that transaction under joint ownership. A demerger to single-product manufacturers reduces total manufacturer surplus by 25%. The merger of the two largest firms increases total manufacturer surplus by 19%, but with highly unequal transaction-level effects. Comparing the same mergers across pricing regimes, negotiated pricing abates average markup-increasing merger effects but worsens them for a minority of transactions — those where the merger creates a first-best/runner-up pairing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model applies to complete-information settings where prices are negotiated transaction-by-transaction, buyers single-source for each discrete purchase occasion, and sellers have multiple spatially differentiated products. It is most directly applicable to business-to-business markets where individual transaction values are large enough to justify project-level negotiation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the fundamental difference between negotiated pricing in this paper and the standard Nash-in-Nash (NiN) bargaining framework?&lt;/strong&gt;
A: In standard NiN (Horn and Wolinsky 1988), a buyer negotiates one price per product and trades positive quantities of all products with negotiated prices, so all negotiated prices are observed in transaction data. In this paper, buyers make discrete single-sourcing choices — each project uses exactly one product — so only the chosen product&amp;rsquo;s price appears in data; the runner-up product and its counterfactual price are unobserved. Additionally, under NiN, prices are set at the buyer level and apply uniformly to all the buyer&amp;rsquo;s needs, whereas here prices are negotiated separately for each project, generating intra-buyer cross-project price variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the equilibrium markup formula, and what determines whether the Nash bargaining solution or the TIOLI constraint binds?&lt;/strong&gt;
A: The equilibrium first-best markup is ρ*j(1) = min[bj(1)(wj(1) − w0), (wj(1) − wj(2))], the minimum of the unconstrained Nash bargaining solution and the first-best seller&amp;rsquo;s surplus advantage over the runner-up. The TIOLI constraint (surplus advantage) binds when the seller&amp;rsquo;s bargaining skill is sufficiently high that the unconstrained NBS would exceed the surplus advantage — that is, when bj(1)(wj(1) − w0) &amp;gt; (wj(1) − wj(2)). Runner-up and all lower-ranked sellers earn zero markups in equilibrium because competition from the first-best drives their outside-option constraint to bind.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why do third-best and lower-ranked sellers have no effect on equilibrium outcomes?&lt;/strong&gt;
A: Because the most attractive offer any seller below the runner-up could make is a zero markup, and the runner-up already offers a zero markup due to competition from the first-best. Since the runner-up at zero markup already offers the buyer at least as much utility as any third-best product, the third-best cannot improve the buyer&amp;rsquo;s position. Proposition 1 (part iii) shows that the equilibrium markup and choice are invariant to N for N in {2, &amp;hellip;, N̄}.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper address the econometric challenge that the runner-up product and its price are unobserved?&lt;/strong&gt;
A: The paper derives a tractable closed-form likelihood for the joint probability of the observed product choice and the observed negotiated price, integrating out the unobserved idiosyncratic taste terms along with their implications for the identity and surplus of the unobserved runner-up product. The GEV distributional assumption on taste terms is crucial: it ensures that (1) choice probabilities have a closed form, (2) the surplus advantage can be expressed in terms of observed surpluses and GEV terms, and (3) the probability that the NBS is constrained has a closed form. This reduces the full problem to a lower-dimensional numerical integral over the normally distributed random effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What empirical evidence motivates the negotiated pricing model over simpler alternatives?&lt;/strong&gt;
A: Four data patterns motivate the model. First, prices vary across projects even after controlling for product identity and buyer identity — intra-buyer cross-project variation that is inconsistent with standard NiN where prices are set at the buyer level. Second, prices are lower, other things equal, when there is greater local competition from manufacturers not chosen for a project — inconsistent with standard NiN where excluded products play no competitive role. Third, buyers have many projects and make a discrete single-sourcing choice for each. Fourth, sellers are multi-product firms with products differentiated spatially and in other dimensions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the price regressions reveal about price determinants?&lt;/strong&gt;
A: Adding year effects to a simple regression explains only a small share of price variation (R² rises from 0.000 to 0.118 for the full sample). Adding variety-year effects raises R² to 0.775 and adding buyer-variety-year effects to 0.918, but still leaves substantial unexplained variation. Panel B regressions show that prices decrease with quantity, increase with input prices (gas price coefficient 27.2, wage coefficient 8.3), decrease with buyer-to-seller size ratio (coefficient −2.51), and decrease with greater local competition (a distance advantage indicator raises price by about 0.48–2.20 and N(DST) count reduces price by about 1.49–1.53 depending on specification).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the parameter estimates imply about spatial differentiation and buyer preferences?&lt;/strong&gt;
A: Transport costs have a strongly negative effect on value (coefficient on distance is −1.27, s.e. 0.04), and the interaction of distance with fuel costs is also negative and significant. The nesting parameter σJ is estimated at 0.47, indicating substantial within-group taste correlation across products from the same firm. Product characteristics matter: red and wire-cut bricks are preferred, and there are significant interactions between weather conditions and technical brick characteristics (frost positively interacts with strength; rainfall negatively interacts with absorption), indicating that buyers value bricks whose technical performance is suited to their project&amp;rsquo;s climate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How is the mean seller bargaining skill estimated, and how is the TIOLI model rejected?&lt;/strong&gt;
A: The mean seller bargaining skill b̄ is estimated at 0.41 (s.e. 0.03), substantially below one. The TIOLI restriction corresponds to b̄ = 1 (all markup determined by surplus advantage). A likelihood ratio test rejects this restriction with a chi-squared statistic of 847 (p &amp;lt; 0.001), providing strong statistical evidence that buyer bargaining power — not just competitive pressure — constrains markups below the TIOLI level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the main findings regarding the distribution of price-cost margins?&lt;/strong&gt;
A: Price-cost margins (Lerner index form) are low on average, with a mean of 0.08, but vary widely across transactions, with a coefficient of variation of 0.78. Sellers set higher margins to buyers located relatively close to them (lower transport costs make the seller more attractive to the buyer, strengthening the seller&amp;rsquo;s position). Multi-product manufacturer portfolios also affect markups, but the relevance of multi-product ownership varies across projects depending on whether different products from the same firm compete as first-best and runner-up for a given project.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the uniform pricing counterfactual show, and how does it differ from the Hotelling benchmark?&lt;/strong&gt;
A: Switching from individually negotiated to uniform pricing raises average markups by 34% at the observed market structure. However, effects are heterogeneous: approximately 15% of transactions see markup decreases. Buyers who benefit from the switch are those in transactions with relatively weak runner-up competition — who had weak bargaining positions under negotiated pricing — and who gain because uniform pricing prevents sellers from exploiting that weakness. This contrasts with the result from the simple Hotelling linear city model (Thisse and Vives 1988), where switching to uniform pricing raises all markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the demerger counterfactual quantify multi-product effects?&lt;/strong&gt;
A: Decomposing the observed market to single-product manufacturers reduces total manufacturer surplus by 25%. This large reduction reflects the role of multi-product ownership in determining who the runner-up is for each transaction: when a manufacturer owns multiple products, it can avoid internal competition between its own first-best and runner-up products, preserving its surplus advantage. The impact is highly unequal across individual transactions, however, because the relevance of multi-product effects depends on whether any of a manufacturer&amp;rsquo;s other products would have been the runner-up for a given project.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the merger of the two largest firms imply for markups and surplus?&lt;/strong&gt;
A: The merger of the two largest firms (by market share) increases total manufacturer surplus in the industry by 19%. Markup increases are very unequal across transactions: the merger affects only those transactions for which the merging firms jointly become the first-best and runner-up, which is the mechanism highlighted in the 2010 US Merger Guidelines for negotiated pricing markets. The heterogeneity of effects means that aggregate market-level concentration measures (such as HHI changes) can be poor proxies for merger effects in these markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the pricing regime interact with merger effects?&lt;/strong&gt;
A: Comparing the same mergers under negotiated versus uniform pricing, negotiated pricing abates the average markup-increasing effects of mergers. However, for a minority of transactions — specifically those where the merger creates a first-best/runner-up pairing that did not exist pre-merger — negotiated pricing makes the merger&amp;rsquo;s markup effect worse than it would be under uniform pricing. This implies that the direction of the pricing-regime effect on merger harm is not uniform across buyers, and that transaction-level analysis is required for accurate antitrust assessment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper relate to the Competition Commission&amp;rsquo;s 2007 assessment of the Wienerberger/Baggeridge merger?&lt;/strong&gt;
A: The CC (2007) found the market highly concentrated (HHI 2,113, implied HHI increase of 390 from the merger, both exceeding guideline thresholds) but approved the merger, judging profitability to be at or below average for comparable industries and competition to be more intense than the concentration level alone would suggest. This paper&amp;rsquo;s model provides formal underpinning for that assessment: with negotiated pricing and buyer bargaining power, markups are constrained by the runner-up competitive threat at the transaction level, not by market-wide concentration, and the low mean Lerner index of 0.08 is consistent with the CC&amp;rsquo;s profitability finding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What external validity evidence supports the model&amp;rsquo;s cost specification?&lt;/strong&gt;
A: The paper compares the marginal costs implied by the estimated model to plant-month level production cost data that were not used in estimation. A good match between the two provides external validation of the cost specification and supports the model&amp;rsquo;s structural interpretation of the markup decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;First-best and runner-up products&lt;/strong&gt;: Defined at the project level in terms of surplus (value minus cost). The first-best product j(i,1) is the inside good yielding the highest surplus for project i; the runner-up j(i,2) is the highest-surplus inside good not sold by the first-best seller. These two products — and only these two — determine the equilibrium markup and buyer choice; third-best and lower-ranked products are irrelevant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Surplus advantage&lt;/strong&gt;: The difference wj(i,1) − wj(i,2) ≥ 0 between the first-best product&amp;rsquo;s surplus and the runner-up&amp;rsquo;s surplus for a given project. This is the competitive constraint on the first-best seller&amp;rsquo;s markup under TIOLI pricing and the binding ceiling on the negotiated markup whenever the unconstrained Nash bargaining solution would exceed it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negotiated pricing&lt;/strong&gt;: A pricing arrangement in which buyers negotiate prices specific to the individual purchase occasion (here, each construction project), as opposed to uniform pricing where the pre-transport price is the same for all buyers. Prices are determined bilaterally between buyer and competing sellers, with the buyer&amp;rsquo;s outside option — buying the runner-up at its anticipated negotiated price — serving as the competitive constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outside option principle (Binmore et al. 1989)&lt;/strong&gt;: The principle that a rival offer (outside option) has no effect on a bilateral Nash bargaining problem unless it would leave the receiving party better off than the Nash bargaining solution — i.e., it constrains rather than shifts the disagreement point. In the paper&amp;rsquo;s model, the runner-up seller&amp;rsquo;s zero-markup offer serves as the first-best seller&amp;rsquo;s constraining outside option when seller bargaining skill is high.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;GEV (Generalized Extreme Value) taste distribution&lt;/strong&gt;: The distributional assumption on project-product idiosyncratic match terms that makes the joint likelihood of observed product choice and negotiated price tractable. The GEV structure yields closed-form choice probabilities (nested logit) and allows the surplus advantage — which depends on unobserved runner-up surplus — to be expressed analytically, enabling joint estimation from transaction-level data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price-cost margin (Lerner index)&lt;/strong&gt;: Markup (price minus cost) divided by price, used here at the transaction level. The estimated mean Lerner index is 0.08 with a coefficient of variation of 0.78, reflecting wide dispersion driven by spatial variation in local competition and first-best surplus advantage across transactions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash-in-Nash (NiN) vs. single-sourcing bargaining&lt;/strong&gt;: NiN (Horn and Wolinsky 1988) applies when a buyer trades positive quantities of all products with negotiated prices (multi-sourcing); the paper&amp;rsquo;s model applies when a buyer makes a discrete single-sourcing choice per occasion, so only the chosen product&amp;rsquo;s price is observed. The distinction generates different data observability and different competitive mechanisms — in NiN, excluded products play no role; in this paper, the runner-up&amp;rsquo;s potential zero-markup offer disciplines the first-best seller&amp;rsquo;s markup.&lt;/p&gt;</description></item><item><title>Competitive Advertising and Pricing</title><link>https://macropaperwarehouse.com/papers/competitive-advertising-and-pricing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/competitive-advertising-and-pricing/</guid><description>&lt;p&gt;Hwang, Kim, and Boleslavsky study how firms in an oligopoly simultaneously choose prices and advertising strategies, where advertising is modeled as the choice of how much product information to disclose to consumers. The paper extends the canonical Perloff-Salop (1985) random-utility discrete-choice framework — in which n firms engage in Bertrand competition for a consumer whose value for each product is independently drawn from a common distribution F — by endogenizing the information environment: each firm may choose any mean-preserving contraction (MPC) of F as its advertising strategy, with no structural restriction on feasible content. This full flexibility, drawn from the information design literature, allows each firm to choose the consumer&amp;rsquo;s effective value distribution, ranging from full information (choosing F itself) to complete concealment (a degenerate distribution at the mean). The model is silent on advertising costs, which are assumed to be zero throughout.&lt;/p&gt;
&lt;p&gt;The central result is that intense competition forces firms to provide precise product information. Formally, the full information equilibrium — in which every firm chooses F — exists in the advertising game (the subgame in which prices are fixed symmetrically) if and only if F^(n-1) is convex over its support. Because F^(n-1) represents the distribution of the consumer&amp;rsquo;s best outside option, convexity means the consumer likely faces an attractive alternative, incentivizing each firm to maximize the chance of offering the highest possible value. Crucially, this convexity condition is guaranteed to hold when n is sufficiently large, regardless of the shape of F, because the power function x^(n-1) becomes more convex as n rises. This establishes that under sufficiently intense competition, full information disclosure is the unique symmetric equilibrium.&lt;/p&gt;
&lt;p&gt;The general equilibrium advertising strategy G* — which governs cases where full information is not an equilibrium — satisfies two necessary and sufficient conditions: (i) (G*)^(n-1) is convex over the support of G*, and (ii) for almost all values in the support, G* either coincides with F (where the MPC constraint binds, preventing further dispersion) or (G*)^(n-1) is locally linear (where the firm is locally risk-neutral and has no incentive to alter its distribution). The paper proves existence and uniqueness of G* for any F satisfying the stated regularity conditions (density positive, continuously differentiable, bounded, with finitely many peaks). When F has log-concave density, a unique symmetric pure-price equilibrium (p*, G*) exists in the full game.&lt;/p&gt;
&lt;p&gt;The paper demonstrates that strategic advertising has ambiguous implications for prices and consumer welfare. Strategic advertising necessarily reduces social surplus through information loss, since consumers select suboptimal products with positive probability when G* differs from F. However, it compresses the support of the value distribution relative to F, which — by a new result (Proposition 3) — tends to lower the equilibrium price. Offsetting this, strategic advertising also redistributes marginal consumers in ways that may raise or lower the price. In the duopoly case with power distributions F(v) = v^alpha on [0,1], strategic advertising lowers the market price if and only if alpha &amp;gt; 1/sqrt(2) (approximately 0.7071), and raises consumer surplus if and only if alpha &amp;gt; 0.7928.&lt;/p&gt;
&lt;p&gt;The paper examines three extensions: (1) a binding consumer outside option, (2) multi-unit (k-out-of-n) demand, and (3) asymmetric firms with two types. In all three cases, full information cannot be a strict equilibrium for any finite n under the relevant structural condition, yet the equilibrium distribution G* converges pointwise to F as n tends to infinity, preserving the paper&amp;rsquo;s core asymptotic insight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the main research question?&lt;/strong&gt;
A: The paper asks how much product information firms will voluntarily disclose when they compete both on price and advertising content in an oligopoly. Unlike the monopoly literature, the oligopoly context creates strategic interdependencies — each firm&amp;rsquo;s optimal disclosure depends on rivals&amp;rsquo; disclosure choices — that the paper characterizes fully.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How is advertising modeled, and why use mean-preserving contractions?&lt;/strong&gt;
A: Each firm&amp;rsquo;s advertising strategy is modeled as a choice of any mean-preserving contraction (MPC) of the true value distribution F. An MPC preserves the expected value but reduces dispersion, capturing the idea that a firm can selectively conceal information (moving toward a degenerate distribution) but cannot fabricate value dispersion beyond what F allows. Because consumers are risk-neutral and buy based on expected values net of prices, this MPC formulation captures full flexibility in information design without loss of generality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the precise necessary and sufficient condition for the full information equilibrium in the advertising game?&lt;/strong&gt;
A: The full information equilibrium — in which every firm chooses F — exists if and only if F^(n-1) is convex over its support [v, v̄]. The &amp;ldquo;only if&amp;rdquo; direction follows from Lemma 1: in any equilibrium, (G*)^(n-1) must be convex, so if F^(n-1) is not convex, F is not an equilibrium. The &amp;ldquo;if&amp;rdquo; direction follows because a convex F^(n-1) makes each firm locally risk-loving, so no MPC of F yields a higher payoff than F itself.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does sufficiently intense competition force full information disclosure?&lt;/strong&gt;
A: For any distribution F with positive, continuously differentiable, bounded density f with bounded derivative f&amp;rsquo;, the second derivative of F^(n-1) satisfies F(v)^(n-1)&amp;rsquo;&amp;rsquo; &amp;gt;= (n-1)F(v)^(n-3)[(n-2)epsilon^2 - M], where epsilon = min f(v)^2 &amp;gt; 0 and M = max |f&amp;rsquo;(v)| &amp;lt; infinity. This expression is strictly positive for n sufficiently large, so F^(n-1) is convex and the full information equilibrium exists. Economically, with many competitors each firm wins the consumer only when it offers the highest possible value, so providing full information is optimal.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Q: What are the two necessary and sufficient properties characterizing the general equilibrium advertising strategy G&lt;/em&gt;?&lt;/em&gt;*
A: First (Lemma 1), (G*)^(n-1) must be convex over the support of G* — this prevents any firm from profitably concentrating mass to reduce dispersion. Second (Lemma 2), for almost all values in the support, either G* = F locally (the MPC constraint binds, preventing further dispersion) or (G*)^(n-1) is locally linear (the firm is locally risk-neutral and indifferent over distributions with the same local mean). Theorem 1 proves these two conditions are both necessary and sufficient, and that G* is unique for any F satisfying the stated regularity conditions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Q: What structure does G&lt;/em&gt; take when F^(n-1) has strictly quasi-concave density?&lt;/em&gt;*
A: By Corollary 2(1), there exists a cutoff v* in [v, v̄] such that G*(v) = F(v) for v &amp;lt;= v* (full information below the cutoff) and (G*)^(n-1) is linear above v*. As n increases, v* rises, meaning the region of full disclosure expands, and G* increases in convex order — so consumers receive strictly more information. One immediate implication is that consumer surplus strictly increases in n: consumers benefit both from more options and from more accurate information about each product.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens when F^(n-1) is concave?&lt;/strong&gt;
A: By Corollary 3, when F^(n-1) is concave, (G*)^(n-1) is linear over the entire support, with lower bound v. In the illustrative Example 1 (truncated exponential with n=2), this yields G* = U[0, 2*mu_F] — a uniform distribution on an interval whose upper bound is twice the mean of F.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does strategic advertising raise or lower equilibrium prices, and consumer surplus?&lt;/strong&gt;
A: Both effects are ambiguous and depend on the shape of F. Strategic advertising compresses the support of the value distribution (since G* is an MPC of F), which by Proposition 3(1) tends to lower equilibrium prices. But it also reshapes the distribution of marginal consumers, which may raise or lower prices. In the power distribution example (n=2, F(v) = v^alpha on [0,1]), strategic advertising lowers the market price if and only if alpha &amp;gt; 1/sqrt(2) ≈ 0.7071, and raises consumer surplus if and only if alpha &amp;gt; 0.7928. Thus even with deadweight loss from information suppression, consumers can be better off under strategic advertising than under forced full disclosure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does Proposition 3 contribute about equilibrium prices in the Perloff-Salop model?&lt;/strong&gt;
A: Proposition 3 delivers two results about how the distribution of marginal consumers (integral (F^(n-1))&amp;rsquo; dF) determines equilibrium prices. First, the measure of marginal consumers decreases if F is proportionally stretched over a larger support, confirming that longer support raises equilibrium prices. Second — presented as novel — among all distributions with support in [v, v̄], the power distribution F(v) = ((v-v)/(v̄-v))^(2/n) minimizes the measure of marginal consumers, corresponding to the maximum equilibrium price. The key property is that marginal consumers are uniformly distributed under this power distribution, and any deviation from uniformity allows a &amp;ldquo;flattening&amp;rdquo; adjustment that increases the measure of marginal consumers and lowers the price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Under what condition does the full game (price plus advertising) have a unique symmetric pure-price equilibrium?&lt;/strong&gt;
A: Theorem 2 states that log-concavity of the density f is sufficient for existence and uniqueness of a symmetric pure-price equilibrium (p*, G*) as characterized in Theorems 1 and 2. Log-concavity ensures that the equilibrium distribution G* has a convex-linear structure (as in Corollary 2), which preserves log-concavity of each firm&amp;rsquo;s profit function even under compound deviations (simultaneous changes to both price and advertising strategy), making the first-order conditions sufficient for global optimality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Can strategic advertising create or destroy pure-price equilibria relative to the Perloff-Salop benchmark?&lt;/strong&gt;
A: Yes, both directions are possible. When F^(n-1) is convex (so G* = F), equilibrium existence in the Perloff-Salop (PS) model is necessary but not sufficient for existence in the full model, because compound deviations (changing both price and advertising) may be profitable even when pure price deviations are not. Conversely, when G* differs from F, the changed distribution of marginal consumers can sustain an equilibrium in the full model even when none exists in PS. Appendix E of the paper provides a specific example of the latter phenomenon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens with a binding consumer outside option?&lt;/strong&gt;
A: Proposition 4 shows that a full information equilibrium never exists in the advertising game when the consumer has a binding outside option (p* in (v, v̄)). The firm&amp;rsquo;s value function acquires a discrete jump at p* due to the indicator 1_{v &amp;gt;= p*}, making it optimal to pool mass around p* rather than disclose fully. Nevertheless, Proposition 5 proves that G* converges pointwise to F as n tends to infinity, because the jump of size F(p*)^(n-1) vanishes exponentially fast as n grows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does the full information result survive multi-unit demand?&lt;/strong&gt;
A: No. Proposition 6 shows that with k &amp;gt; 1 units demanded (out of n products), the full information equilibrium never exists for any finite n or F. The reason is that phi&amp;rsquo;(v; F) — the firm&amp;rsquo;s marginal value of offering value v — is zero at v̄ when k &amp;gt; 1, so the firm can profitably pool values near the top of the support. However, Proposition 7 shows that G* converges pointwise to F as n tends to infinity (with k fixed), preserving the asymptotic full information result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens with asymmetric firms differing in their value distribution supports?&lt;/strong&gt;
A: Proposition 8 shows a sharp dichotomy. If both firm types share the same upper bound of their value supports (v̄_1 = v̄_2), the full information equilibrium exists whenever both F_1^(n1-1) and F_2^(n2-1) are convex. If the supports have different upper bounds (v̄_1 &amp;lt; v̄_2), the full information equilibrium never exists regardless of n_1 and n_2, because type-2 firms face a downward kink in their winning probability at v̄_1 and always have an incentive to pool mass there. The authors conjecture that G*_1 and G*_2 still converge to F_1 and F_2 asymptotically but do not prove this due to technical complexity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does this paper relate to Ivanov (2013)?&lt;/strong&gt;
A: Ivanov (2013) also uses the Perloff-Salop framework and shows that full information is an equilibrium when n is sufficiently large, but restricts advertising to rotation-ordered strategies (in the sense of Johnson and Myatt, 2006). The present paper imposes no structural restriction and strengthens Ivanov&amp;rsquo;s result by: (a) providing a necessary and sufficient condition for the full information equilibrium (not just a sufficient condition for large n); (b) fully characterizing G* when full information is not an equilibrium; and (c) demonstrating robustness across multiple model variants.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What policy implication does the ambiguity result carry?&lt;/strong&gt;
A: The paper warns against assuming that mandating full information disclosure is unambiguously consumer-beneficial. While strategic advertising creates deadweight loss through information suppression, it can simultaneously compress support and alter the marginal consumer distribution in ways that lower equilibrium prices significantly. The power distribution example (alpha &amp;gt; 0.7928) shows consumers can be strictly better off under strategic advertising than under forced full disclosure. This ambiguity is a cautionary tale for disclosure regulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mean-Preserving Contraction (MPC):&lt;/strong&gt; A distribution G_i is an MPC of F if it has the same mean as F but less dispersion (in the sense of second-order stochastic dominance). In the paper, each firm&amp;rsquo;s feasible advertising strategies are exactly the set MPC(F) — this captures all informationally feasible disclosures without structural restriction on content.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Advertising Game:&lt;/strong&gt; A restricted subgame of the full market game in which firms choose their advertising strategies G_i taking the symmetric price as given. An equilibrium in the advertising game is a necessary condition for equilibrium in the full game. The advertising game&amp;rsquo;s equilibrium uniquely pins down G* independently of the price level (under the baseline model without binding outside option).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Full Information Equilibrium:&lt;/strong&gt; An equilibrium of the advertising game in which every firm chooses the true underlying distribution F as its advertising strategy. This corresponds to complete, unobstructed product disclosure. The paper&amp;rsquo;s central result is that this equilibrium exists if and only if F^(n-1) is convex over its support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convexity of F^(n-1):&lt;/strong&gt; The key distributional condition governing advertising equilibria. F^(n-1) is the distribution of the consumer&amp;rsquo;s best alternative among (n-1) rivals&amp;rsquo; products. Convexity of F^(n-1) means its density is increasing, signaling a likely attractive outside option, which makes each firm risk-loving and induces full disclosure. This convexity is guaranteed for n sufficiently large.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Locally Linear (G&lt;/em&gt;)^(n-1):&lt;/em&gt;* A region of the equilibrium distribution where (G*)^(n-1) has constant slope, making the firm locally risk-neutral. Over such a region, the firm is indifferent among all distributions with the same local mean, and the equilibrium G* need not coincide with F — it is only required to be an MPC of F on that interval. This alternating structure (coinciding with F on strictly convex regions; linear elsewhere) fully characterizes G*.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal Consumers:&lt;/strong&gt; In the Perloff-Salop pricing formula, the equilibrium price p* = (1/n) / integral [(G*(v)^(n-1))&amp;rsquo; dG*(v)]. The integrand (G*(v)^(n-1))&amp;rsquo; * g*(v) is the density of consumers who are indifferent between a given firm&amp;rsquo;s product and their best alternative at value v. A larger measure of marginal consumers implies lower equilibrium prices through greater competitive pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Compound Deviation:&lt;/strong&gt; In the full game, a deviation by a firm that changes both its price p_i and its advertising strategy G_i simultaneously, rather than varying only one dimension. The possibility of compound deviations is what distinguishes equilibrium existence conditions in the full model from those in the standard Perloff-Salop model, even when G* = F.&lt;/p&gt;</description></item><item><title>Consistent Evidence on Duration Dependence of Price Changes</title><link>https://macropaperwarehouse.com/papers/consistent-evidence-on-duration-dependence-of-price-changes/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/consistent-evidence-on-duration-dependence-of-price-changes/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; This paper asks two related questions. First, can one develop a robust, distribution-free estimator for the discrete-time mixed proportional hazard (MPH) model of duration with unobserved heterogeneity? Second, what does that estimator reveal about the shape of the hazard of price changes, the role of heterogeneity in shaping aggregate price dynamics, and the distinction between regular price changes and sales?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors develop a linear generalized method of moments (GMM) estimator for the discrete-time MPH model, building on identification results in Honoré (1993). The model specifies that the probability a price spell ends at duration t, conditional on surviving to t, equals the product of a product-specific frailty parameter θ (unobserved, fixed over time) and a common baseline hazard bt. The estimator exploits repeated price spells per product via moment conditions that are linear in bt, making estimation and inference straightforward. It accommodates right- and left-censored data, competing risks, and spell-specific observable characteristics, without requiring any parametric assumption on the frailty distribution. The estimator is consistent as the number of products grows, even with a short time dimension. A Hansen-Sargan J-test of overidentifying restrictions and a test of the monotone-average-type prediction are also developed.&lt;/p&gt;
&lt;p&gt;The estimator is applied to two datasets: (1) IRI weekly store data (2001–2011), covering 30 product categories and more than 21 million products, yielding 684,919,778 pairs of durations; and (2) Online Micro Price data from Cavallo (2018), comprising approximately 250,000 products at daily frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Baseline hazard and heterogeneity.&lt;/em&gt; In the pooled IRI data, the Kaplan-Meier hazard is steeply declining throughout the entire range from 2 to 60 weeks. In contrast, the estimated baseline hazard is roughly constant until week 4 and then declines only modestly, with a noticeable spike at week 52. The ratio of the Kaplan-Meier hazard to the baseline hazard — the average type, E[θ|t] — drops by approximately 60 percent within the first 20 weeks, and continues to decline, reaching roughly 0.3 of its initial value after one year. This decomposition reveals substantial unobserved heterogeneity that accounts for a large fraction of the observed decline in the Kaplan-Meier hazard.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Implications for structural models.&lt;/em&gt; The finding of a decreasing baseline hazard is inconsistent with canonical state-dependent pricing models (Golosov and Lucas, 2007), which predict an increasing hazard, conditional on a given firm&amp;rsquo;s type. The decreasing baseline hazard is instead broadly consistent with time-dependent pricing models, though not with a constant-hazard (Calvo, 1983) specification.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Monetary policy impulse response.&lt;/em&gt; In a calibrated time-dependent pricing model with strategic complementarity (α = 0, 0.5, 0.95), the aggregate price level dynamics in the estimated heterogeneous-firm MPH economy are close to those of a homogeneous-firm economy that uses the Kaplan-Meier hazard as the common price-change hazard. The homogeneous-firm approximation is substantially closer to the MPH economy than a Taylor (1979, 1980) staggered-contract economy with the same Kaplan-Meier hazard, particularly when strategic complementarity is strong (α = 0.95). The Calvo economy provides a poor approximation due to its exponential (constant-speed) price convergence structure.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Regular versus temporary price changes.&lt;/em&gt; Using the competing-risks extension with spell-specific observables — classifying spells by whether they start and end with a price increase (+) or decrease (−) — the authors separately estimate four baseline hazards. The baseline hazard for consecutive price increases (b++t) is relatively flat, especially for the first 6 weeks, then flat until week 45, with a spike near one year, consistent with price-plan models. The baseline hazard for reversals (particularly b−+t, price decreases followed by price increases, associated with sales) is steeply declining. The J-test statistics are substantially lower for price trends (J++ = 3,920; J−− = 3,401) than for reversals (J+− = 8,737; J−+ = 7,910), and markedly lower than the pooled-model J = 10,498, indicating that the MPH structure fits regular price changes considerably better than sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results are conditional on weekly store-level price data for mostly packaged consumer goods (30 IRI product categories). The analysis focuses on price spells of at least 2 weeks to avoid spurious duration-one spells from mid-week price changes. The maximum duration examined is 60 weeks. The comparison of estimation methods relies on the IRI data only; the Online Micro Price data confirm weekly decision-making through a spike in the daily hazard every 7 days. Comparisons with maximum likelihood estimates show that GMM recovers more heterogeneity (average type declines to 0.37 at 6 months by GMM versus 0.48 by continuous-time MLE), and that time aggregation explains most of the discrepancy between the two methods.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the mixed proportional hazard (MPH) model as used in this paper, and what does the estimator identify?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1. The MPH model specifies that the hazard that a price spell ends at duration t, conditional on surviving to t, equals θ·bt, where θ is a product-specific frailty parameter drawn from an unknown distribution G and bt is a baseline hazard common to all products. The estimator, which is linear in bt, identifies the baseline hazard up to a multiplicative constant using moment conditions derived from repeated spell data, without restricting the shape of the frailty distribution. Identification relies on comparing the joint survival probabilities of two consecutive spells for the same product and exploits the symmetry implied by the MPH structure across spells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How does the Kaplan-Meier hazard relate to the baseline hazard, and what does this relationship imply about heterogeneity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2. The paper proves that the Kaplan-Meier hazard Ht equals bt times E[θ|t], the mean frailty among spells surviving to duration t. Because higher-type products (those with a higher propensity to change prices) exit the pool of surviving spells earlier, E[θ|t] is strictly decreasing in t — a form of dynamic selection. The ratio Ht/bt, normalized to 1 at the start, falls to approximately 0.4 by week 20 in the pooled IRI data and to approximately 0.3 after one year, documenting that a large share of the decline in the Kaplan-Meier hazard reflects heterogeneity rather than structural negative duration dependence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What does the estimated baseline hazard imply about structural models of price setting?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3. A decreasing baseline hazard is inconsistent with the canonical state-dependent model of Golosov and Lucas (2007), in which a firm&amp;rsquo;s hazard of price change is increasing in the time since the last change, because larger deviations from the desired price accumulate with duration. The decreasing baseline hazard is instead consistent with time-dependent pricing models and with price-plan models where within-plan switches are costless. The mild spike at week 52 in the baseline hazard is consistent with Taylor-type annual pricing rules.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. What is the approximate aggregation result for monetary policy, and how quantitatively accurate is it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4. In the time-dependent pricing model without strategic complementarity (α = 0), the impulse response of the aggregate price level to a monetary shock in a heterogeneous-firm economy is exactly the same as in a homogeneous-firm economy whose single firm uses the Kaplan-Meier survival function. This extends Carvalho and Schwartzman (2015) to an approximation in the case with strategic complementarity (α = 0.5 and α = 0.95). Numerically, the path of aggregate prices in the estimated MPH economy is close to that in the homogeneous-firm Kaplan-Meier economy, and substantially closer to it than to the Taylor-contract economy — the difference is most pronounced at horizons beyond about half a year when α = 0.95, where the Taylor economy shows notably slower initial convergence and faster later convergence relative to the MPH and homogeneous economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. How do the paper&amp;rsquo;s results differ from those obtained using maximum likelihood estimation of the continuous-time MPH model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5. The GMM estimator recovers substantially more heterogeneity than maximum likelihood (MLE) applied to the continuous-time model with continuous records (assumed gamma frailty). The average type falls from 1 to 0.37 at six months under GMM, versus only 0.48 under MLE. The authors investigate two sources of this discrepancy: the assumed frailty distribution family (gamma) and time aggregation. They conclude that time aggregation is quantitatively more important in the IRI weekly data — that is, the continuous-time MLE approach fails to properly account for the discrete nature of the data-generating process, leading it to understate heterogeneity and recover a steeper baseline hazard.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. How does the paper distinguish regular price changes from sales without directly observing a sales flag?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6. The competing-risks extension classifies each spell by whether it starts with a price increase or decrease (observable characteristic χ ∈ {+, −}) and by whether it ends with a price increase or decrease (competing risk ρ ∈ {+, −}). Price trends — spells where the direction is the same at both the start and end (++ or −−) — are interpreted as regular price changes; price reversals (especially −+, i.e., price decrease followed by increase) are associated with sales. This approach is consistent with the statistical model used for estimation, avoids the bias from simply dropping suspected sales spells before estimation, and allows the MPH structure to hold only for the risks of interest even if it fails for others.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How well does the MPH model fit regular price changes versus sales?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7. The J-test of overidentifying restrictions yields test statistics of J++ = 3,920 for consecutive price increases and J−− = 3,401 for consecutive price decreases, compared with J = 10,498 for the pooled model and J+− = 8,737 and J−+ = 7,910 for the reversal hazards. All rejections are at conventional significance levels (critical value 1,749 at 5%), but the rejection is substantially milder for price trends than for price reversals. For individual product categories, the model cannot be rejected for 8 categories (out of 30) for b++ and 21 categories for b−−, suggesting the MPH structure is a much better description of regular price changes than of sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What role do one-week price spells play in the data, and why are they excluded?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8. In the IRI data, prices are measured as the ratio of weekly revenue to quantity, so a price change occurring mid-week generates a spurious price spell of duration one week. If all spells including one-week spells are retained, the autocorrelation of spell durations is only 0.029 in levels and even negative (−0.042) in logs, which is inconsistent with a mixture model. Once one-week spells are excluded, the autocorrelation rises to 0.235 in levels and 0.233 in logs, and is stable when two-week spells are also excluded (0.248 and 0.256). The paper therefore sets the lower duration bound at T̲ = 2 weeks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What does the daily Online Micro Price data add relative to the weekly IRI data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9. The daily data reveal a sharp spike in the price-change hazard every seven days, suggesting that even when prices are observed daily, the decision to change prices is made at the weekly frequency. This justifies the use of a discrete-time model with a one-week period. The estimates from daily and weekly aggregations of the same data are broadly similar, though weekly data recovers somewhat less heterogeneity than daily data. Aggregating IRI weekly data to monthly frequency understates heterogeneity even more, confirming that frequency matters for measuring heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. What are the computational advantages of the GMM estimator relative to maximum likelihood?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10. Because the moment conditions are linear in the baseline hazard bt, the GMM estimator is obtained in closed form, making estimation fast and inference straightforward. On the pooled IRI sample, GMM estimation (including standard errors) required 70 minutes on a machine with 60 GB memory, whereas the maximum likelihood estimator required 15 hours on a machine with 256 GB memory and failed entirely on the 60 GB machine. The GMM approach also avoids the need to specify the frailty distribution family and guarantees a global solution (proved by the identification result), whereas the likelihood function is non-linear in bt and may have multiple local maxima.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. What is the shape of the b++ baseline hazard for regular price increases, and what models does it support?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11. The baseline hazard for spells starting and ending with a price increase (b++) is decreasing during the first 6 weeks — dropping by almost 50% — and then flat until approximately week 45, with a pronounced spike at around one year. This shape is consistent with price-plan models (Eichenbaum, Jaimovich, and Rebelo, 2011) with Calvo-type switching between plans, where within-plan changes are costless and the hazard of between-plan switching is approximately constant. The annual spike is consistent with Taylor-type pricing. Approximately 76.8% of complete spells starting after a price increase last at most 6 weeks.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Baseline hazard (bt).&lt;/strong&gt; The component of the MPH hazard that is common to all products and may vary arbitrarily with elapsed duration t. It represents structural duration dependence — the tendency for a given product to be more or less likely to change price as a function of how long its current spell has lasted — net of heterogeneity. It is identified only up to a multiplicative constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Frailty parameter (θ) / frailty distribution (G).&lt;/strong&gt; The product-specific scaling factor in the MPH model, fixed over all spells for a given product, that captures permanent unobserved differences in price-change frequency across products. The paper treats G as a nuisance parameter and does not require a parametric assumption on its shape. A higher θ means the product has a higher baseline propensity to change its price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average type (E[θ|t]).&lt;/strong&gt; The mean frailty parameter among spells that have survived to at least duration t. Because high-type products change price earlier and exit the pool of surviving spells first, the average type is provably strictly decreasing in t under the MPH model. It is measured as the ratio of the Kaplan-Meier hazard to the baseline hazard, and its rate of decline measures the importance of dynamic selection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kaplan-Meier hazard (Ht).&lt;/strong&gt; The probability that a randomly drawn spell ends at duration t, conditional on having lasted at least t periods. It mixes together structural duration dependence (captured by bt) and dynamic selection (captured by changes in the average type). It can be estimated without imposing the MPH structure, requiring only stationarity of the duration process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competing risks.&lt;/strong&gt; The framework in which a price spell can end for multiple distinct reasons — here, ending with a price increase or a price decrease — each with its own hazard function. The paper&amp;rsquo;s GMM approach allows the MPH structure to hold for only a subset of risks and observables, without imposing any structure on the remaining risks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price trends vs. price reversals.&lt;/strong&gt; A classification of spells based on the direction of the surrounding price changes. Price trends are spells where the direction of the price change at the start and end of the spell is the same (++ or −−), interpreted as regular price changes. Price reversals are spells where the direction switches (e.g., −+, a price decrease followed by a price increase), associated with sales and other temporary price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Strategic complementarity in pricing (α).&lt;/strong&gt; The degree to which a firm&amp;rsquo;s target price responds to the average price set by other firms. Parameterized by α ∈ [0, 1), where α = 0 yields the exact aggregation result (only the Kaplan-Meier hazard matters) and higher α increases aggregate price stickiness by making firms reluctant to deviate from the average price when few others are adjusting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic selection.&lt;/strong&gt; The mechanism by which the composition of the pool of surviving price spells shifts toward lower-type (more price-sticky) products as duration increases, because higher-type products change price sooner and exit the pool. This is the source of the gap between the steeply declining Kaplan-Meier hazard and the more modestly declining baseline hazard.&lt;/p&gt;</description></item><item><title>Consumer Credit and the Incidence of Tariffs: Evidence from the Auto Industry</title><link>https://macropaperwarehouse.com/papers/consumer-credit-and-the-incidence-of-tariffs-evidence-from-the-auto-industry/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/consumer-credit-and-the-incidence-of-tariffs-evidence-from-the-auto-industry/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Do import tariffs affect consumer credit terms, and does focusing solely on goods prices understate tariff pass-through to consumers? The paper also asks whether vertical integration &amp;ndash; specifically, the ownership of a captive finance subsidiary &amp;ndash; expands the channels through which manufacturers can pass on cost shocks, and whether tariff incidence falls disproportionately on consumers with less elastic credit demand or in areas with lower credit market competition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting.&lt;/strong&gt; The Trump administration&amp;rsquo;s 2018 metal tariffs &amp;ndash; a 25 percent tariff on steel and a 10 percent tariff on aluminum &amp;ndash; created a large and largely unanticipated cost shock for US auto manufacturers who are heavy consumers of both metals across their supply chains. Crucially, auto manufacturers own captive finance subsidiaries (e.g., Ford Credit, GM Financial, Honda Finance) that originate consumer auto loans alongside independent noncaptive lenders (banks, credit unions, independent finance companies). Because noncaptive lenders had no direct exposure to the metal tariffs, they serve as a natural control group in a difference-in-differences design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The primary data source is Regulation AB II, which requires issuers of public auto loan asset-backed securities to report loan-level information monthly to the SEC. The final sample covers 1,973,639 auto loans originated between January 2017 and December 2018 across 14 lenders (8 captive, 6 noncaptive). Vehicle invoice price data come from Regulation AB II; consumer sales price data come from the Texas Department of Motor Vehicles (covering approximately 3.9 million vehicle transactions in 2017-2018). Population credit bureau data from Equifax are used for representativeness checks and HHI construction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy.&lt;/strong&gt; The baseline difference-in-differences compares captive auto loans to otherwise-identical noncaptive auto loans originated in the same state, the same quarter, for the same vehicle make-model-condition, and to borrowers in similar income and credit score bins. Parallel pre-trends tests confirm no economically meaningful differential pre-trends across captive and noncaptive lenders for any outcome variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Interest Rate Pass-Through.&lt;/strong&gt; Relative to noncaptive lenders, captive lenders increased average interest rates by 26 basis points following the tariff announcement, representing a 10 percent increase relative to the pretreatment captive mean of 252 basis points. This corresponds to an average present value increase in total loan payments of $179 per loan (discounted at 5 percent for an average $26,914 principal with 66-month maturity). By the fourth quarter of 2018, the dynamic estimate reaches 48 basis points &amp;ndash; nearly double the pooled average &amp;ndash; as metal prices continued to rise. The increase is concentrated among more-exposed captive lenders (those whose manufacturers operate two or more domestic production plants), not less-exposed captive lenders (primarily BMW, Mercedes-Benz, Volkswagen), ruling out captive-specific omitted variables.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Non-Price Loan Terms.&lt;/strong&gt; There is no economically significant change in captive loan amounts, maturities, or loan-to-value ratios following the tariffs. Captive lenders responded to the tariff shock exclusively by raising interest rates, consistent with prior evidence that auto loan demand is less sensitive to interest rates than to non-price terms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Vehicle Prices.&lt;/strong&gt; Invoice prices for makes with greater domestic production rose by approximately 1.0 percent (relative to makes with less domestic production), and consumer sales prices rose by approximately 0.7 percent ($225 average increase relative to a pretreatment mean of $32,206) for these same makes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Relative Magnitude of Pass-Through Channels.&lt;/strong&gt; After accounting for estimated spillover effects on noncaptive lenders of 7 basis points, the spillover-adjusted estimate implies captive interest rates rose by 33 basis points on average, corresponding to $227 per loan in present value terms. Interest rate pass-through is estimated to be almost two-thirds as large as vehicle price pass-through, meaning that focusing solely on vehicle prices would underestimate tariff incidence on consumers by approximately 37 percent. The population-weighted average cost increase per vehicle is $146 &amp;ndash; roughly equally split between higher vehicle prices ($74) and higher financing costs ($72).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Intensive vs. Extensive Margin.&lt;/strong&gt; The composition of captive borrowers did not deteriorate following the tariffs: average household incomes of captive borrowers increased slightly (economically small), credit scores were unchanged, and future default rates showed no significant change. This confirms that the interest rate increase reflects tariff pass-through to inframarginal borrowers along the intensive margin, not a shift in borrower composition.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Credit Demand Elasticity.&lt;/strong&gt; Pass-through via interest rates was higher for borrowers with lower incomes (33 basis points vs. 20 basis points for higher-income consumers), lower credit scores (36 basis points vs. 15 basis points), and smaller loan amounts (36 basis points vs. 12 basis points). These groups are proxies for less elastic credit demand, consistent with theoretical predictions that cost pass-through is larger where demand is less price sensitive.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Market Competition.&lt;/strong&gt; Tariff pass-through via interest rates was higher in states with lower credit market competition (as measured by state-level Herfindahl-Hirschman Index). Consumers in the lowest competition decile experienced an average captive interest rate increase of 41 basis points, compared to 24 basis points for consumers in the highest competition decile. This 17 basis point differential implies that interest rate pass-through was approximately 88 percent as large as vehicle price pass-through in less competitive markets, versus 57 percent in more competitive markets.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is a captive finance subsidiary, and why does it create a novel channel for tariff pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A captive finance subsidiary is a wholly owned lending unit of an auto manufacturer (e.g., Ford Credit, GM Financial, American Honda Finance) whose primary purpose is to finance the sale of the manufacturer&amp;rsquo;s vehicles. Because the captive lender and the manufacturing unit share a parent company, a cost shock to the manufacturing side &amp;ndash; such as higher steel and aluminum prices from the tariffs &amp;ndash; can be passed on to consumers not only through higher vehicle prices but also through worse financing terms offered by the captive. Prior studies documented tariff pass-through to goods prices but found limited evidence of pass-through to consumer prices; this paper shows that the bundling of a product with captive financing creates a second, previously unmeasured channel. The institutional structure also facilitates &amp;ldquo;price shrouding&amp;rdquo;: because consumers are less attentive to financing costs than vehicle sticker prices, captive lenders can exploit this inattention to pass on cost shocks along the financing margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why is the auto loan market a particularly suitable setting for studying this question?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The auto loan market provides three key advantages. First, both captive lenders (directly exposed to metal tariffs via manufacturing) and noncaptive lenders (with no direct tariff exposure) compete for the same borrowers on the same vehicle purchases, creating a clean within-vehicle, within-period control group. Second, the Regulation AB II data contain vehicle make-model-condition information, allowing the authors to hold vehicle choice fixed and isolate tariff pass-through to loan terms separately from any vehicle switching by consumers. Third, the indirect dealer-intermediated financing process means that consumers typically do not observe the full set of lender bids, weakening their ability to actively arbitrage between captive and noncaptive loan offers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the Regulation AB II data, and how representative is it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Regulation AB II (effective November 2016), issuers of publicly offered auto loan asset-backed securities must report monthly loan-level data to the SEC, including interest rates, loan amounts, maturities, vehicle characteristics, borrower credit scores and incomes, and loan performance. The final sample covers approximately 8 percent of all open auto loans in the United States and around 30 percent of the total auto loan portfolios of the 14 sampled lenders. Average loan characteristics in the Regulation AB II data closely match population credit bureau data from Equifax, indicating that securitization selection is not a major concern. Average credit scores and incomes are slightly higher in Regulation AB II than in the population, primarily because small banks and credit unions that serve riskier borrowers do not access public securitization markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the baseline empirical specification and what identifying variation does it use?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline is a difference-in-differences regression comparing captive loans (treated) to noncaptive loans (control) before and after January 2018 (the date of the Department of Commerce&amp;rsquo;s initial tariff recommendation, chosen conservatively). The regression includes lender fixed effects, vehicle make-model-condition x origination quarter fixed effects, state x origination quarter fixed effects, $25,000 income bin x origination quarter fixed effects, and 10-point credit score bin x origination quarter fixed effects. The coefficient of interest is estimated using within-lender variation after netting out common vehicle-level shocks, state-level shocks, and shocks common across income and credit score cells. This granular fixed effect structure ensures that the estimate compares captive and noncaptive loans for exactly the same vehicle, in the same state, in the same quarter, to borrowers with similar incomes and credit scores.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the main coefficient estimates on interest rates, and how do they evolve dynamically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the full sample, the pooled difference-in-differences estimate is 26 basis points (t = 2.75), representing a 10 percent increase relative to the pretreatment captive mean of 252 basis points. Excluding subvented (subsidized) loans, the estimate is 29 basis points (t = 2.85). Dynamically, captive interest rates started rising within one quarter of the treatment date and continued increasing alongside metal prices, reaching a terminal coefficient of 48 basis points in the fourth quarter of 2018 &amp;ndash; nearly double the pooled average. Consistent with the parallel trends assumption, there is no economically significant evidence of differential pre-trends across captive and noncaptive loans in the pretreatment period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do the authors validate that noncaptive lenders constitute a valid counterfactual?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Four alternative specifications are presented. First, when splitting captive lenders by tariff exposure (more exposed: Ford, GM-AmeriCredit, Honda, Toyota; less exposed: BMW, Mercedes-Benz, Volkswagen), only more-exposed captive lenders show a significant increase in interest rates (30 basis points; t = 3.37), while less-exposed captive lenders show no significant increase (-18 basis points; t = -1.33). This rules out captive-specific correlated omitted variables. Second, the authors add interactions of the treatment indicator with changes in the Fed Funds rate and 1-, 5-, and 10-year Treasury yields; results are unchanged in magnitude, ruling out differential sensitivity to the rising interest rate environment of 2018. Third, using CarMax (a noncaptive that also sells and finances vehicles but does not participate in DealerTrack) as the sole control group yields similar results. Fourth, lender-specific borrowing cost controls do not attenuate the estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Did captive lenders adjust any non-price loan terms in response to the tariffs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. Columns 2-4 of Table 3 document that loan amounts, maturities, and loan-to-value ratios showed no economically significant changes for captive lenders relative to noncaptive lenders following the tariffs. Some coefficient estimates in the full sample are statistically significant but economically small, and they lose significance or flip signs once subvented loans are excluded. The event study plots confirm no meaningful pre-trends and no meaningful post-treatment changes in non-price terms. The authors note that this is consistent with prior evidence that auto loan demand is less sensitive to interest rates than to maturity, making interest rates the optimal margin along which to pass through costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the authors rule out that the increase in captive interest rates reflects a change in borrower composition rather than intensive-margin pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors estimate a separate regression (equation 4) with log household income, log credit score, and future default rate as outcomes. Relative to noncaptive borrowers, captive borrowers experienced a small but positive increase in average household income (Gamma = 0.012, t = 3.25), no significant change in credit scores (Gamma = 0.001, t = 1.13), and no significant change in 12-month or 24-month default rates. The income increase is of the wrong sign and too small in magnitude to explain the observed interest rate increase from a risk-based pricing perspective. Additionally, captive loan origination volumes declined 6.7 percent after the tariffs, inconsistent with a demand surge driving the interest rate increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the authors rule out alternative explanations including demand surges, borrowing cost increases, securitization changes, and dealer markup changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For demand surges: vehicle sales volumes showed no noticeable increase following the tariff announcement, and captive loan originations actually declined. For differential borrowing costs: controlling for lender-specific CDS spreads and other borrowing cost measures does not attenuate the main estimate. For securitization changes: combining Regulation AB II and credit bureau data, the authors find no significant change in captive lenders&amp;rsquo; securitization rates, the ratio of securitized to total loan amounts, maturities, or monthly payments. For dealer markup changes: noncaptive loans are also subject to dealer markups, so common changes are absorbed in the DiD; additionally, subvented loans (which dealers cannot mark up) also show higher captive interest rates post-tariff, ruling out differential markup changes. For interest rate sensitivity differentials: controlling for changes in risk-free rates does not alter results. For prepayment responses: 12-month and 24-month prepayment rates show no significant change for captive loans.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the authors measure vehicle price pass-through, and what data do they use?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To measure invoice price pass-through, the authors use Regulation AB II data (which contains the invoice price for new vehicles) and estimate a regression comparing the change in log invoice prices for makes with a higher proportion of US-assembled vehicles versus those with lower domestic production, controlling for vehicle make-model fixed effects and price bin x quarter fixed effects. Invoice prices rose approximately 1.0 percent for more-exposed makes. For consumer sales price pass-through, the authors use Texas DMV data (1,819,498 new and 2,105,938 used vehicle transactions in 2017-2018) with the same identification strategy. Sales prices rose approximately 0.7 percent ($225 average increase) for more-exposed makes. Both effects are robust to defining exposure at either the make level or the make-model level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How is the overall pass-through rate decomposed between the interest rate and vehicle price channels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors define total tariff pass-through as the sum of interest rate pass-through (change in aggregate captive financing costs divided by aggregate production cost increase) and vehicle price pass-through (change in aggregate new vehicle sales revenue divided by aggregate production cost increase). Taking the ratio of these two components allows them to estimate the relative importance of each channel without needing to directly measure production costs. With a captive loan penetration rate (M) of 0.59, a per-loan present value financing cost increase of $179 (unadjusted) or $227 (adjusted for 7 basis point spillover effect on noncaptives), and a $225 average vehicle price increase, the spillover-adjusted estimate implies interest rate pass-through is almost two-thirds as large as vehicle price pass-through. Focusing solely on vehicle prices would underestimate tariff incidence on consumers by approximately 37 percent. The population-weighted average total cost increase is $146 per vehicle, roughly equally split between vehicle prices ($74) and financing costs ($72).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How large is the estimated aggregate impact of the tariffs on consumer financing costs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using population data of approximately 50 million vehicles sold annually in the United States and a population-weighted average financing cost increase of $72 per vehicle, the authors estimate that the tariffs resulted in approximately $3.6 billion (= 50,000,000 x $72) in additional present value financing costs each year. For reference, Flaaen, Hortacsu, and Tintelnot (2020) estimated that the 2018 tariffs on washing machines led to $1.5 billion in additional annual consumer costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Which borrowers bore a disproportionate share of the interest rate pass-through, and by how much?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The triple-differences results show monotonically higher pass-through for borrowers with less elastic credit demand. Lower-income borrowers (below median) experienced an average captive interest rate increase of 33 basis points versus 20 basis points for higher-income borrowers. Lower-credit-score borrowers experienced an increase of 36 basis points versus 15 basis points for higher-credit-score borrowers. Borrowers with smaller loan amounts (below median) experienced an increase of 36 basis points versus 12 basis points for larger loan amounts. Within income quartiles, consumers in the lowest income quartile experienced a 37 basis point increase compared to 17 basis points in the highest quartile. These patterns are not driven by changes in borrower composition, as default rates show no significant change across any of these subgroups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: How does credit market competition affect tariff pass-through via interest rates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;States with lower credit market competition (higher Herfindahl-Hirschman Index, constructed from pretreatment lender market shares) experienced higher interest rate pass-through. Comparing above- versus below-median HHI states, the difference is 5 basis points (28 vs. 23 basis points), statistically significant at the 10 percent level. When restricting to the tails of the competition distribution, the difference is substantially larger: consumers in the lowest competition decile experienced an average increase of 41 basis points versus 24 basis points for consumers in the highest competition decile &amp;ndash; a 17 basis point differential. This implies interest rate pass-through was 88 percent as large as vehicle price pass-through in less competitive markets versus 57 percent in more competitive markets, consistent with theoretical predictions that firm-specific cost shocks generate higher pass-through when competition is weaker.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: Why do captive lenders spread interest rate increases broadly across vehicle types rather than targeting directly tariff-exposed new vehicle models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors find that captive interest rates increased for both new and used vehicles, and that within more-exposed captive lenders, interest rate increases were not concentrated in domestically produced vehicle models. This is consistent with the hypothesis that firms spread cost shocks across multiple goods and business segments (as documented in the industrial organization literature for multiproduct firms). The authors argue this occurs because vehicles of different makes and models are substitutes for each other (making vehicle-specific price increases costlier in terms of demand loss), whereas auto loans are complementary to vehicle purchases and are offered as an add-on to the sales transaction. This bundled structure, combined with consumer inattention to financing terms, makes it optimal to spread the cost shock across the loan book rather than concentrating it in specific vehicle models.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Captive Finance Subsidiary&lt;/strong&gt;: A wholly owned lending unit of a manufacturer (e.g., Ford Credit, GM Financial) whose primary purpose is to originate loans and leases to finance the sale of the manufacturer&amp;rsquo;s own products. Unlike independent noncaptive lenders, captive lenders are vertically integrated with the manufacturing unit and can, in principle, use financing terms as an additional margin to pass through manufacturing-side cost shocks to consumers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tariff Pass-Through (Interest Rate Channel)&lt;/strong&gt;: The extent to which an input cost increase caused by an import tariff is transmitted to consumers via higher interest rates charged by captive lenders, rather than (or in addition to) higher goods prices. The paper defines interest rate pass-through as the ratio of the aggregate present value increase in captive financing costs to the aggregate increase in manufacturing production costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive vs. Extensive Lending Margin&lt;/strong&gt;: The distinction between raising loan prices charged to existing (inframarginal) borrowers (intensive margin) versus changing the pool of borrowers served or lending standards (extensive margin). The paper argues that the observed increase in captive interest rates reflects intensive-margin pass-through because borrower incomes, credit scores, and future default rates did not change significantly after the tariffs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price Shrouding&lt;/strong&gt;: The practice of making price increases less salient to consumers by embedding them in a less-scrutinized component of a bundled transaction. In the auto market, because consumers are documented to be less sensitive to increases in financing costs than to vehicle sticker prices, captive lenders can pass on cost shocks through interest rates with less demand response than if they raised vehicle prices by an equivalent amount. The paper treats this as a key mechanism enabling the financing pass-through channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Subvented (Subsidized) Loan&lt;/strong&gt;: A promotional auto loan offered at a below-market interest rate, often tied to specific vehicle models or sales events (e.g., &amp;ldquo;1.99 percent APR for well-qualified borrowers&amp;rdquo;). Subvented loans are typically fixed by the manufacturer and cannot be marked up by dealers. The paper uses the subsample of non-subvented loans as a robustness check and to isolate tariff pass-through from seasonal variation in promotional financing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Captive Loan Penetration Rate (M)&lt;/strong&gt;: The ratio of captive auto loans originated to new vehicles produced and sold, used in the paper&amp;rsquo;s decomposition of total tariff pass-through into the interest rate and vehicle price channels. Estimated at approximately 0.59 from population data, this parameter determines how the aggregate present value financing cost increase scales relative to the aggregate vehicle sales price increase when computing the relative importance of the two pass-through channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Herfindahl-Hirschman Index (HHI) as Market Competition Measure&lt;/strong&gt;: The paper constructs state-level HHIs based on pretreatment lender market shares in each state using population credit bureau data, as an inverse measure of credit market competition. Local (direct) auto lending markets exhibit meaningful geographic variation in HHI, in contrast to the largely national scope of indirect (dealer-arranged) lending. The paper uses this variation to test whether pass-through is higher in less competitive credit markets, consistent with theoretical predictions for firm-specific cost shocks.&lt;/p&gt;</description></item><item><title>Consumer durables and monetary policy according to HANK</title><link>https://macropaperwarehouse.com/papers/consumer-durables-and-monetary-policy-according-to-hank/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/consumer-durables-and-monetary-policy-according-to-hank/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;Consumer durables account for a disproportionately large share of household expenditure fluctuations despite their small share of total private consumption. Two stylized facts motivate the paper: (1) durable expenditure is far more interest-rate sensitive than nondurable expenditure following monetary policy shocks, and (2) durable and nondurable expenditures comove positively and persistently—both reaching trough in the same quarter. Standard two-sector New Keynesian models struggle to generate this positive conditional comovement because asymmetric sectoral price rigidity induces large relative-price movements that push the two sectors in opposite directions. This paper asks what model features are necessary and sufficient to reproduce both the sectoral comovement pattern and the hump-shaped aggregate dynamics observed in the data, and how the answer changes across households sorted by liquid asset holdings.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Empirical identification.&lt;/strong&gt; The authors employ a local projection instrumental variables (LP-IV) strategy using Romer-Romer monetary policy shocks updated by Wieland and Yang (2020), over the sample 1969:Q1–2007:Q3. Impulse response functions (IRFs) are normalized to a cumulative 100 basis-point increase in the Federal Funds Rate over five years. Household-level evidence is drawn from the Consumer Expenditure Survey (CEX) and the Survey of Consumer Finances (SCF); households are classified as liquidity-constrained if liquid assets are below $1,000.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a two-sector Heterogeneous Agent New Keynesian (HANK) model in which households maximize utility over nondurable consumption and a durable stock (Cobb-Douglas aggregation), face convex adjustment costs on durable purchases, and update expectations infrequently in the Mankiw-Reis sense (probability of not updating: Xi = 0.918 per period). The general equilibrium version features asymmetric Rotemberg price stickiness (Calvo probability 0.671 for nondurables, 0.797 for durables), nominal wage stickiness (Calvo 0.802), and a Taylor rule with inflation coefficient 1.105, output coefficient 1.440, and smoothing 0.988.&lt;/p&gt;
&lt;h3 id="main-findings-and-quantitative-magnitudes"&gt;Main Findings and Quantitative Magnitudes&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sectoral magnitude gap.&lt;/strong&gt; At trough (approximately 8 quarters after the shock), the durable expenditure response to monetary tightening is an order of magnitude larger than the nondurable response—a fact the calibrated HANK model is designed to match.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Positive comovement.&lt;/strong&gt; Both durable and nondurable expenditures contract and reach trough in the same quarter, contradicting TANK models (Monacelli 2009) in which savers shift portfolios toward durables and generate negative comovement for that group.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Relative-price dynamics.&lt;/strong&gt; The relative price of durables rises following monetary tightening (nondurables deflate more), but the rise is modest and cannot overturn the positive comovement result.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Role of the direct interest-rate effect.&lt;/strong&gt; Across liquid-asset groups, the direct effect accounts for 73–87% of the cumulated durable expenditure response and 37–91% of the cumulated nondurable expenditure response. This direct channel—operating through intertemporal substitution—is quantitatively first-order for durables in a way it is not in standard single-sector HANK models where income effects dominate.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Role of sticky information.&lt;/strong&gt; A full-information HANK variant produces a counterfactually high durable elasticity (35.24 times the baseline) and no hump-shaped dynamics. Infrequent information updating (Xi = 0.918) is essential to match the hump-shaped propagation of both sectoral and aggregate expenditures.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Income effects and fiscal policy.&lt;/strong&gt; For a fiscal subsidy specifically targeting durable purchases, intertemporal substitution incentives generate a large shift toward durables and, without income effects, a counterfactual crowding-out of nondurable spending. Income effects are essential to protect nondurable spending, and the aggregate consumption effect of such a policy is at best modest—consistent with Mian and Sufi&amp;rsquo;s (2012) evidence on cash-for-clunkers.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="scope-conditions"&gt;Scope Conditions&lt;/h3&gt;
&lt;p&gt;All empirical results are conditional on the LP-IV sample 1969:Q1–2007:Q3 and Romer-Romer shocks as instrumented by Wieland-Yang. The household-level comovement result is established for both liquidity-constrained (liquid assets below $1,000) and unconstrained savers using CEX/SCF data. Model quantitative results are specific to the calibration targeting moments from Fagereng et al. (2021) marginal propensities and BEA depreciation data (delta = 0.054).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core empirical puzzle the paper addresses, and why do standard models fail?&lt;/strong&gt;
Standard two-sector New Keynesian models predict that asymmetric sectoral price stickiness generates large relative-price movements between durables and nondurables following a monetary shock. These relative-price shifts tend to produce negative conditional comovement—when durables contract, nondurables expand—contradicting the data. The authors document that both categories exhibit positive and persistent comovement, both reaching their trough at approximately 8 quarters, which standard models cannot replicate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the key empirical facts established via LP-IV?&lt;/strong&gt;
Using Romer-Romer shocks over 1969:Q1–2007:Q3, normalized to a cumulative 100bp Federal Funds Rate increase, the authors find: (1) aggregate expenditure follows a hump-shaped contraction with trough at roughly 8 quarters; (2) the durable expenditure response is an order of magnitude larger than the nondurable response at trough; (3) both categories reach their trough in the same quarter; and (4) the relative price of durables rises modestly after monetary tightening (nondurables deflate more), but not enough to reverse comovement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How is the partial equilibrium model calibrated, and which moments does it target?&lt;/strong&gt;
Key calibrated parameters include CRRA sigma = 2.640, Cobb-Douglas weight on nondurables theta = 0.607 (implying durable expenditure share 0.193), adjustment cost alpha = 8.299, information stickiness Xi = 0.918, depreciation rate delta = 0.054, steady-state real rate r = 0.03/4, discount factor beta = 0.915 (matching a 30% share of liquidity-constrained households with liquid assets-to-income ratio of 0.26), and borrowing wedge kappa = 0.05. Moments matched include quarterly MPC on nondurables (22.94%), quarterly MPX on durables (24.15%), interest-rate elasticity of durable expenditure (3.35, within the empirical range of 1.1–5.0), price elasticity of durable demand (29.59), and durable stock skewness relative to nondurable consumption (0.695, consistent with Bertola et al. 2005).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the paper decompose monetary policy transmission?&lt;/strong&gt;
The paper decomposes transmission into three channels: (1) the direct effect of real interest rate changes, which operates through intertemporal substitution and accounts for the quantitatively largest share of the durable response; (2) the relative-price effect, which is modest and redistributive but cannot overturn positive comovement; and (3) pure income effects, which are key for persistence of the nondurable response but not for the sign of comovement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What do counterfactual models reveal about the role of each model ingredient?&lt;/strong&gt;
A sticky-information RANK produces positive comovement but the dynamics are front-loaded and less inertial than in the data. A sticky-information TANK delivers results similar to RANK—income effects do not qualitatively change the story. A full-information HANK produces a counterfactually high durable interest-rate elasticity (35.24 times the baseline) and no hump-shaped dynamics, demonstrating that sticky information is the ingredient generating realistic propagation, not heterogeneity per se.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What does the household-level evidence from CEX and SCF show about comovement across the wealth distribution?&lt;/strong&gt;
Classifying households as liquidity-constrained if liquid assets are below $1,000, the LP-IV estimates show positive comovement between durables and nondurables for both constrained and unconstrained savers. This contradicts TANK models (Monacelli 2009), in which savers shift portfolios toward durables following a monetary shock, generating negative comovement for the saver group. After controlling for income and relative prices, the direct interest-rate effect operates uniformly across financial status groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the direct effect vary across liquid asset groups quantitatively?&lt;/strong&gt;
Decomposing across four liquid asset groups (below $1k, $1k–$10k, $10k–$20k, above $20k), the direct effect accounts for 73–87% of the cumulated durable expenditure response and 37–91% of the cumulated nondurable expenditure response. Income effects are more important for nondurable spending prolongation among liquidity-constrained households, but the direct channel dominates durable expenditure for all groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the general equilibrium two-sector HANK model differ from the partial equilibrium setup?&lt;/strong&gt;
The GE model adds asymmetric sectoral price stickiness (Calvo probabilities 0.671 for nondurables and 0.797 for durables), nominal wage stickiness (Calvo 0.802), a Taylor rule (inflation coefficient 1.105, output coefficient 1.440, smoothing 0.988), and fiscal lump-sum taxes responding to debt (coefficient 0.191). These features generate the relative-price dynamics observed in the data while preserving the positive comovement result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the fiscal policy application reveal about the role of income effects?&lt;/strong&gt;
A fiscal subsidy targeting durable purchases generates a much larger shift in the relative price of durables than monetary policy does. Without income effects, intertemporal substitution dominates and nondurable spending falls—a counterfactual result inconsistent with the data. With income effects present, nondurable spending is protected. The aggregate consumption effect of such a durable-targeted fiscal policy is at best modest, consistent with Mian and Sufi&amp;rsquo;s (2012) evidence from the cash-for-clunkers program.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the broader implication for the literature on HANK versus RANK transmission?&lt;/strong&gt;
In standard single-sector HANK models, income effects (the indirect channel) typically dominate monetary transmission. The presence of consumer durables restores a quantitatively important role for the direct interest-rate channel, which operates through intertemporal substitution in durable purchases. This rebalances the direct-versus-indirect decomposition relative to the conventional HANK wisdom and shows that the durable goods sector is essential to understanding the full transmission mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Sectoral comovement (conditional on monetary policy shocks)&lt;/strong&gt;
The empirical regularity that durable and nondurable expenditures both contract following monetary tightening and reach their respective troughs in the same quarter. In this paper, comovement is defined conditional on identified monetary policy shocks (LP-IV with Romer-Romer instruments), not unconditionally. Standard two-sector NK models predict negative conditional comovement due to relative-price effects; replicating positive comovement is the central discipline imposed on the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Direct effect (of real interest rate changes)&lt;/strong&gt;
The component of monetary transmission that operates through the intertemporal substitution incentive induced by changes in the real interest rate, holding income and relative prices fixed. Distinct from the income effect (indirect channel) and the relative-price effect. In this paper&amp;rsquo;s decomposition, the direct effect accounts for 73–87% of the cumulated durable expenditure response across liquid-asset groups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sticky information (Mankiw-Reis)&lt;/strong&gt;
Households update their information sets infrequently, with probability (1 - Xi) per period; Xi = 0.918 means only about 8.2% of households update each quarter. This mechanism is essential in the model for generating the hump-shaped, inertial impulse response dynamics observed in the data. Without it (full-information HANK), the durable elasticity is counterfactually large (35.24 times baseline) and dynamics are front-loaded.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MPX (Marginal Propensity to Expend on durables)&lt;/strong&gt;
Analogous to the MPC for nondurables, the MPX measures the additional durable expenditure flow induced by an income windfall. Calibrated to 24.15% quarterly, matching estimates from Fagereng et al. (2021). Distinct from the MPC because durable purchases represent investment in a stock, not immediate consumption flow.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Liquidity-constrained households&lt;/strong&gt;
Households with liquid assets below $1,000, identified in the CEX and SCF. In the model, the 30% share of such households is targeted by the discount factor (beta = 0.915) and the borrowing wedge (kappa = 0.05). The paper&amp;rsquo;s key finding is that positive comovement holds for both constrained and unconstrained households, contradicting TANK predictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HANK (Heterogeneous Agent New Keynesian model)&lt;/strong&gt;
A New Keynesian general equilibrium model in which households are heterogeneous in their liquid asset holdings (and thus face binding borrowing constraints), so that the distribution of assets matters for aggregate dynamics. Distinguished from RANK (Representative Agent NK) and TANK (Two-Agent NK, which approximates heterogeneity with one unconstrained and one hand-to-mouth agent). In this paper, HANK is extended to a two-sector setting with durables and nondurables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convex adjustment costs on durable purchases&lt;/strong&gt;
A cost of adjusting the durable stock that is convex in the size of the adjustment (calibrated parameter alpha = 8.299). This smooths the durable expenditure response and prevents counterfactually sharp jumps in durable purchases following interest rate changes, contributing to realistic propagation dynamics alongside sticky information.&lt;/p&gt;</description></item><item><title>Contextually Private Mechanisms</title><link>https://macropaperwarehouse.com/papers/contextually-private-mechanisms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/contextually-private-mechanisms/</guid><description>&lt;p&gt;Haupt and Hitzig introduce a framework for comparing the privacy properties of different mechanism protocols. The core research question is: when a designer commits to implementing a social choice rule, how much superfluous private information must they inevitably learn about agents, and how should they design the elicitation protocol to minimize that exposure?&lt;/p&gt;
&lt;p&gt;The setting is a finite-player extensive-form game in which a designer elicits agents&amp;rsquo; private types through a dynamic protocol to compute a social choice function. The authors explicitly exclude cryptographic tools and trusted mediators, working under the minimal assumption that the designer learns information if and only if an agent discloses it. This assumption is motivated by the historical prevalence of live dynamic auction formats — ascending formats at Sotheby&amp;rsquo;s, descending formats at Aalsmeer, oral ascending formats used by the U.S. Forest Service for timber, multi-round clock auctions for radio-spectrum allocation — and by settings where mediating technology is unavailable or costly.&lt;/p&gt;
&lt;p&gt;The central object is the contextual privacy violation. A protocol produces a contextual privacy violation for agent i at type profile θ if the designer can distinguish θ_i from some alternative type θ&amp;rsquo;_i while holding other agents&amp;rsquo; types fixed, yet the social choice rule assigns the same outcome at both profiles. Violations are defined at the level of individual agent–state pairs, not aggregated ex ante. A protocol is fully contextually private if it produces no violations; it is maximally contextually private if its set of violations is inclusion-minimal among all protocols that implement the same rule.&lt;/p&gt;
&lt;p&gt;The main characterization result (Theorem 1) connects privacy to pivotality: a social choice function admits a fully contextually private protocol if and only if, on every product subset of the type space where agents are collectively pivotal, at least one agent is individually pivotal. The contrapositive is what drives the paper&amp;rsquo;s impossibility results: whenever a rule contains a region where no single agent&amp;rsquo;s report changes the outcome but a group&amp;rsquo;s joint report does, any implementing protocol must produce contextual privacy violations.&lt;/p&gt;
&lt;p&gt;Using this characterization, the authors establish that the first-price auction rule (Proposition 2) and serial dictatorship (Proposition 3) admit fully contextually private protocols. Conversely, k-item Vickrey auction rules (Proposition 4) and any stable school-choice rule (Proposition 5) do not admit fully contextually private protocols, because these rules contain type-space regions where agents are only collectively — not individually — pivotal.&lt;/p&gt;
&lt;p&gt;For k-item Vickrey auctions, the authors study maximally contextually private protocols. They establish (Proposition 6) that, for a class of social choice rules on totally ordered type spaces that contains k-item Vickrey auctions, it is without loss to consider only protocols consisting of threshold queries that are monotonically increasing or decreasing after an initial guess. This reduction identifies two key design dimensions: the initial query posed to each agent, and the order in which agents are queried.&lt;/p&gt;
&lt;p&gt;The main constructive result (Theorem 2) proves that an ascending-join protocol is maximally contextually private for the k-item Vickrey auction. Proposition 7 formalizes the sense in which this protocol protects privacy by delaying queries to certain bidders — it repeatedly asks agents whether they can rule out a particular outcome, and postpones questioning agents whose privacy it is protecting.&lt;/p&gt;
&lt;p&gt;The authors also show (Proposition 19) that the ascending-join protocol is minimally relatively informative among protocols that are maximally contextually private. Extensions cover group contextual privacy (Proposition 11) and individual contextual privacy (Proposition 8), showing that individual contextual privacy violations equal the union of contextual privacy violations and nonbossiness violations.&lt;/p&gt;
&lt;p&gt;Q: What is a contextual privacy violation, precisely?
A: A protocol produces a contextual privacy violation for agent i at type profile θ if the designer can distinguish θ_i from some alternative type θ&amp;rsquo;_i — holding all other agents&amp;rsquo; types fixed — yet the social choice rule assigns the same outcome at both profiles. The violation is defined at the level of individual agent–state pairs. A single additional superfluous distinction at the same (i, θ) pair does not register as a second violation; the framework records whether any unnecessary disclosure occurs for that agent at that state, not the degree of overexposure.&lt;/p&gt;
&lt;p&gt;Q: How does contextual privacy differ from relative informativeness?
A: Relative informativeness compares two protocols by whether one distinguishes every pair of type profiles the other does, treating all disclosures as equally undesirable. Contextual privacy conditions the notion of a &amp;ldquo;violation&amp;rdquo; on the social choice rule: a distinction between θ_i and θ&amp;rsquo;_i counts as a violation only when the rule assigns the same outcome at both profiles. Relative informativeness thus penalizes the designer for learning information that is necessary to implement the rule, whereas contextual privacy imposes no penalty for learning pivotal information.&lt;/p&gt;
&lt;p&gt;Q: What is the pivotality characterization (Theorem 1)?
A: A social choice function admits a fully contextually private protocol if and only if, on every product subset of the type space where agents are collectively pivotal, at least one agent is individually pivotal. The necessity direction shows that if a collectively pivotal set exists where no agent is individually pivotal, any implementing iterative partition must contain an earliest node that distinguishes two type profiles leading to the same outcome. The sufficiency direction constructs a contextually private protocol inductively by always querying an individually pivotal agent, ensuring every distinction implies a different outcome.&lt;/p&gt;
&lt;p&gt;Q: Which social choice rules admit fully contextually private protocols?
A: The first-price auction rule (Proposition 2) and serial dictatorship (Proposition 3) admit fully contextually private protocols. The authors use Theorem 1 to show this: in both rules, any collectively pivotal region contains an individually pivotal agent. By contrast, k-item Vickrey auction rules (Proposition 4), any stable school-choice rule (Proposition 5), efficient allocations in housing assignment, and generalized median voting rules (Section B) do not admit fully contextually private protocols.&lt;/p&gt;
&lt;p&gt;Q: Why do k-item Vickrey auctions fail full contextual privacy?
A: Proposition 4 shows that k-item Vickrey auctions for k ≥ 1 do not admit fully contextually private protocols. The argument uses the necessary conditions from Theorem 1 (Corollaries 1 and 2): the Vickrey payment rule creates type-space regions where multiple agents together determine the price but no single agent is individually pivotal over the price, so any protocol implementing the Vickrey rule must produce violations for at least some agents at some type profiles.&lt;/p&gt;
&lt;p&gt;Q: What is the ascending-join protocol and what does Theorem 2 establish?
A: The ascending-join protocol is a specific dynamic elicitation protocol for k-item Vickrey auctions that repeatedly asks agents whether they can rule out a particular outcome, structured as threshold queries ascending from an initial guess. Theorem 2 proves that the ascending-join protocol is maximally contextually private for the k-item Vickrey auction. Proposition 7 formalizes the protection mechanism: the protocol delays queries to the bidders whose privacy it is protecting, querying them only when their responses become necessary for determining the outcome.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 6 establish about the structure of maximally contextually private protocols?
A: For a class of social choice rules on totally ordered type spaces that contains k-item Vickrey auctions, Proposition 6 shows it is without loss of generality to consider only protocols consisting of threshold queries that are monotonically increasing or decreasing in the threshold after an initial guess. This result serves as a theoretical reduction (enabling proofs that certain protocols are maximally private) and as a practical design principle (identifying the initial query and the ordering of agents as the two key design dimensions).&lt;/p&gt;
&lt;p&gt;Q: How does contextual privacy relate to obviously dominant strategies?
A: The paper treats privacy properties and incentive properties as largely orthogonal questions, to be analyzed separately. For the ascending-join protocol specifically, the authors verify obvious dominance — the most demanding incentive notion they consider — which requires that at every history, the worst-case payoff from the equilibrium action exceeds the best-case payoff from any deviation. This analysis proceeds after the contextual privacy properties of the protocol are established.&lt;/p&gt;
&lt;p&gt;Q: What is group contextual privacy and why do the authors focus on individual-level violations instead?
A: Group contextual privacy requires that whenever the designer learns any property of the joint type profile, that property must affect the outcome. The authors show (Proposition 11) that a protocol is fully group contextually private if and only if every query rules out at least one outcome. They argue this standard is extremely demanding and produces a very coarse partial order: improving in the group privacy order requires restructuring the entire protocol tree rather than making agent- or state-specific improvements. They also note that normative accounts of privacy, including Nissenbaum&amp;rsquo;s contextual integrity theory, center on individual rather than group information.&lt;/p&gt;
&lt;p&gt;Q: How does individual contextual privacy relate to nonbossiness?
A: Individual contextual privacy (Proposition 8) requires that if two type profiles differing only in agent i&amp;rsquo;s type are distinguished, they must lead to different allocations for agent i — presuming a private allocation domain. The paper shows that the set of individual contextual privacy violations equals the union of contextual privacy violations and nonbossiness violations: individual contextual privacy is violated precisely when either (a) agent i&amp;rsquo;s superfluous type information is revealed, or (b) agent i is &amp;ldquo;bossy&amp;rdquo; — able to change others&amp;rsquo; outcomes without changing their own.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship between the ascending-join protocol and minimal relative informativeness?
A: Proposition 19 shows that the ascending-join protocol is not only maximally contextually private but also minimally relatively informative among protocols that are maximally contextually private. That is, among all maximally contextually private protocols, the ascending-join protocol reveals the smallest total amount of information about the type profile in the relative informativeness order. This establishes relative informativeness as a useful refinement for selecting among contextually privacy-equivalent protocols.&lt;/p&gt;
&lt;p&gt;Q: What motivates the exclusion of cryptographic tools and trusted mediators from the framework?
A: The authors work under the minimal assumption that the designer learns information if and only if an agent directly discloses it — no commitment to forget, anonymize, or cryptographically conceal. They motivate this on two grounds: first, many real-world auction formats are live and dynamic with no mediating technology; second, advanced cryptography is often costly in time, money, or computation, and studying the no-mediator benchmark can explain the historical prevalence of dynamic protocols and inform auction design in environments where cryptography may become unavailable (for example, due to quantum computing). The authors cite a Danish sugar-beet auction as a case where designers themselves questioned whether full multiparty computation was necessary.&lt;/p&gt;
&lt;p&gt;Contextual privacy violation: A protocol produces a contextual privacy violation for agent i at type profile θ if the designer can distinguish θ_i from some alternative type θ&amp;rsquo;_i — holding other agents&amp;rsquo; types fixed — yet the social choice rule assigns the same outcome at both profiles. The violation is assigned at the level of individual agent–state pairs.&lt;/p&gt;
&lt;p&gt;Maximally contextually private protocol: A protocol whose set of contextual privacy violations is inclusion-minimal among all protocols that implement the same social choice rule — equivalently, a protocol that lies on the Pareto frontier of implementation and contextual privacy, such that no other implementing protocol weakly reduces every violation and strictly reduces at least one.&lt;/p&gt;
&lt;p&gt;Iterative partition: A directed rooted tree whose nodes are subsets of the type space, where each non-leaf node is split into children by partitioning on a single agent&amp;rsquo;s type. Any protocol is equivalent (in terms of what the designer learns) to a partitional protocol induced by an iterative partition (Proposition 1).&lt;/p&gt;
&lt;p&gt;Individual pivotality: On a product set of type profiles, agent i is individually pivotal if there exist two subsets of agent i&amp;rsquo;s types such that every type profile from one subset leads to a different outcome than every type profile from the other subset, holding others&amp;rsquo; types fixed.&lt;/p&gt;
&lt;p&gt;Collective pivotality: Agents are collectively pivotal on a product set if there exist two type profiles in that set with different outcomes. Collective pivotality without any agent being individually pivotal is precisely the condition that forces contextual privacy violations (Theorem 1).&lt;/p&gt;
&lt;p&gt;Ascending-join protocol: A specific dynamic protocol for k-item Vickrey auctions that poses threshold queries in ascending order after an initial guess, repeatedly asking agents whether they can rule out a particular outcome. It is maximally contextually private (Theorem 2) and minimally relatively informative among maximally contextually private protocols (Proposition 19), and it achieves privacy protection by delaying queries to agents whose privacy it protects (Proposition 7).&lt;/p&gt;
&lt;p&gt;Relative informativeness: A partial order on protocols defined by: protocol P is less relatively informative than P&amp;rsquo; if every pair of type profiles P distinguishes is also distinguished by P&amp;rsquo;. Unlike contextual privacy, relative informativeness treats all disclosures as equally undesirable and does not condition on the social choice rule. The paper positions it as a useful refinement for selecting among contextually privacy-equivalent protocols.&lt;/p&gt;</description></item><item><title>Contract Terms, Employment Shocks, and Default in Credit Cards</title><link>https://macropaperwarehouse.com/papers/contract-terms-employment-shocks-and-default-in-credit-cards/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/contract-terms-employment-shocks-and-default-in-credit-cards/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks two related questions bearing on financial inclusion policy in developing countries: (1) How effective are credit card contract term changes — specifically interest rate reductions and minimum payment increases — in limiting default among new borrowers? (2) How large is the effect of formal-sector job loss on default relative to these contract term interventions, and can the difference in magnitudes be explained by differential cash flow impacts?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study is set in Mexico during 2007–2009 and exploits a large nationwide stratified randomized controlled trial implemented by a major commercial bank (&amp;ldquo;Bank A&amp;rdquo;) on its financial-inclusion credit card — a product that accounted for approximately 15% of all first-time formal-sector loans in Mexico as of 2010. The study card was targeted at borrowers with limited or no formal credit history (the bank&amp;rsquo;s &amp;ldquo;C, C- and D&amp;rdquo; customer segments); 47% of the experimental sample held it as their first formal loan product. A sample of 144,000 pre-existing cardholders was stratified into nine cells based on bank tenure (6–11 months, 12–23 months, 24+ months) and past repayment behavior, then randomly allocated to eight treatment arms combining two minimum payment levels (5% or 10% of the outstanding balance) and four annual interest rates (15%, 25%, 35%, 45%), for 26 months (March 2007 to May 2009). The study sample is representative of the bank&amp;rsquo;s national portfolio of approximately 1.3 million study card customers. Card-level data run through December 2014 — five years after the experiment ended — allowing examination of both short- and long-run effects. The experimental sample is matched to Mexico&amp;rsquo;s Social Security database (IMSS), providing monthly formal employment histories from January 2004 to December 2012 for 59% of the sample; and to credit bureau data, allowing observation of defaults across all formal financial institutions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Result 1 — Interest rate effects are modest in aggregate.&lt;/em&gt; A 30 percentage point (pp) decrease in the annual interest rate (from 45% to 15%, a 67% reduction relative to the baseline rate) decreased cumulative default by 2.5 pp over the 26-month experiment, for a default elasticity of +0.20. Over the same 18-month horizon used for unemployment comparisons, the implied effect is 1.03 pp. These magnitudes are substantially smaller than predictions elicited from Mexican central bank regulators (mean predicted decrease: 8.6 pp) and from participants on the Social Science Prediction Platform (mean predicted decrease: 5 pp). Default continued to decline in the lower-rate arm for approximately three years after the experiment ended, reaching −1 pp by March 2012, after which effects became statistically indistinguishable from zero.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Result 2 — No effect on the newest borrowers.&lt;/em&gt; For the newest borrowers (those with 6–11 months of tenure when the experiment began — the group with a 36% cumulative default rate over 26 months versus 18% for those with 24+ months of tenure), the interest rate reduction has no effect on default over the 26-month period, with point estimates consistently small and statistically indistinguishable from zero. This is in contrast to older borrowers, who are meaningfully responsive.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Result 3 — Minimum payment increases increase short-run default but reduce long-run default.&lt;/em&gt; Doubling the minimum payment from 5% to 10% of outstanding balance increased cumulative default by 0.8 pp by the end of the experiment (26-month elasticity: +0.04; p = 0.016), driven primarily by defaults occurring within the first year. The short-run increase is concentrated among the most liquidity-constrained borrowers — those with the highest baseline debt utilization and those in the minimum-payer stratum (baseline debt utilization rate of 85%). After the experiment ended and all arms were returned to the same 4% minimum payment, the previously higher-minimum-payment arm exhibited persistently lower default, reaching a 1 pp decline by the end of the sample (p = 0.054 at end of study period), relative to a base default rate of 41% at that point.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Result 4 — Job displacement effects are seven times larger than contract term effects.&lt;/em&gt; Formal-sector job displacement (identified using mass layoff events at firms with 50+ employees, defined as year-on-year employment contractions exceeding 30% of prior-year average employment) increased cumulative default by 4.8 pp after 12 months and 7.6 pp after 18 months. This is seven times larger than the effect of a 30 pp interest rate decrease (1.03 pp over 18 months) and nine times larger than the effect of doubling minimum payments (0.8 pp). Formal job loss alone can explain approximately 14% of total study card default during the experiment (calculation: 19.8% of formally employed study card borrowers lose their job at least once in the first 18 months; multiplied by the 7.6 pp default increase per spell, this yields 1.5 pp of the 10.8% base default rate at 18 months). Results are corroborated using a nationally representative matched credit bureau–IMSS sample of 600,339 borrowers, which yields 8,723 mass layoff events and similar estimates.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Per-peso normalization.&lt;/em&gt; A back-of-the-envelope calculation normalizes all three shocks by their respective cash flow impacts. The interest rate decrease reduces cumulative required minimum payments due by 2,917 MXN pesos over 18 months; the minimum payment doubling increases them by 1,325 MXN pesos; formal job loss reduces total labor earnings by an estimated 21,328 MXN pesos (adjusting formal-sector earnings losses of 77,555 MXN pesos downward by 72.5% to reflect that 82% of workers who lose formal employment transition to informal employment in the following quarter, with total earnings falling only 27.5%). The per-peso default effects are: 0.36 pp per 1,000 MXN pesos for the interest rate intervention; 0.51 pp for the minimum payment intervention; and 0.36 pp for job displacement. The null hypothesis that all three per-peso effects are equal cannot be rejected (p = 0.78).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Interpretation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors present a simple two-period optimizing model emphasizing the role of previously accumulated debt and liquidity constraints. The model generates four testable predictions consistent with the data: (1) lower interest rates decrease default via reduced debt burden; (2) higher minimum payments increase short-run default by tightening liquidity constraints; (3) &amp;ldquo;surprise&amp;rdquo; minimum payment increases (where borrowers anticipated they would continue) reduce post-experiment default via debt reduction; (4) negative income shocks (modeled as first-order stochastic dominance deterioration in period-2 income) increase default. The per-peso normalization supports the interpretation that cash flow impacts — not differential per-peso susceptibility to shocks — drive the relative magnitudes of the three effects.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why is the interest rate elasticity of default (0.20) so much lower than prior estimates in the literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper contrasts its 26-month elasticity of +0.20 with estimates from Karlan and Zinman (2019) (1.8) and Adams et al. (2009) (2.2), and notes it falls in the same range as Karlan and Zinman (2009) (0.27) and DeFusco et al. (2021) (0.01). The paper proposes that variation in borrower tenure may partly explain cross-study differences, as default elasticities appear to be increasing in bank tenure. The newest borrowers — the most policy-relevant subgroup — show zero elasticity, pulling the overall estimate down. The paper also argues that in this context, interest-rate-driven moral hazard (all channels: debt burden, concurrent, and dynamic) is collectively small.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What mechanism explains why newer borrowers are entirely unresponsive to interest rate changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper hypothesizes that newer borrowers place a higher continuation value on the card (captured by parameter v in the model) because they have fewer formal credit alternatives; at baseline, only 64% of the 6–11 month stratum held a card with another bank versus 78% of the 24+ month stratum. A higher continuation value implies more muted responses to interest rate changes (formally derived in Appendix E.3). Newer borrowers also respond more strongly to credit limit increases, consistent with tighter liquidity constraints. A regression controlling for age, gender, baseline card ownership, debt utilization, labor force attachment, and earnings cannot explain away the differential treatment effect between new and old borrowers (differential remains significant at p = 0.05), suggesting the tenure gradient in responsiveness is not simply a composition effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why does increasing minimum payments raise short-run default but reduce long-run default?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the short run, the doubling of minimum payments tightens liquidity constraints for already-constrained borrowers. The increase in default is concentrated among borrowers in the highest baseline debt-utilization tercile and among minimum-payers (baseline debt utilization of 85%), and is preceded by a sharp rise in delinquencies in months 3–5 (which trigger 350 MXN peso fees per occurrence, further worsening the repayment burden). In the long run, borrowers who anticipated continuing higher minimum payments (the experiment ended without advance notice, so borrowers expected the new terms to persist) chose lower debt levels during the experiment. Since all arms were returned to the same low minimum payment when the experiment ended, the lower-debt borrowers in the higher-minimum-payment arm were better positioned to weather subsequent shocks, producing the 1 pp post-experiment decline in default. The hypothesis that this is driven by habit formation in payment behavior is ruled out by the absence of any effect of past higher minimum payments on post-experimental payment levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How is the mass-layoff identification strategy designed and validated?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper uses the universe of IMSS formal employment records to define a mass layoff at a firm (50+ employees) as the first month in which year-on-year employment declines by more than 30% of average employment in the prior 12 months. An individual is &amp;ldquo;displaced&amp;rdquo; if they lost their job in the same quarter as their employer&amp;rsquo;s mass layoff event. The identification assumption is that, conditional on individual and time fixed effects, the exact timing of the mass layoff is uncorrelated with workers&amp;rsquo; potential default outcomes. This is supported by: (1) mass layoffs occurring in every period, making coincidence with credit market shocks unlikely; (2) time fixed effects absorbing common trends; and (3) the absence of statistically distinguishable pre-trends in default between displaced and non-displaced workers. The paper implements both standard two-way fixed effects and the staggered DiD estimator of de Chaisemartin and D&amp;rsquo;Haultfoeuille (2024), which remains valid under heterogeneous and dynamic effects, and the results are similar across methods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the paper account for informal employment when estimating the cash flow impact of job loss?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Formal-sector earnings losses over 18 months post-displacement are estimated at 77,555 MXN pesos using IMSS wage data in an event-study design paralleling the default equation. However, since more than 4/5 of workers who lose formal employment are informally employed in the following quarter (based on Mexico&amp;rsquo;s ENOE labor force survey panel), and total labor earnings fall by only an estimated 27.5% over the three post-displacement quarters, the paper scales the formal earnings loss down to 21,328 MXN pesos (≈ 0.275 × 77,555). This brings the estimated earnings loss closer to prior developed-country estimates of displacement costs and is treated as a lower bound relative to the raw formal-earnings loss figure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Does the cost of default deter borrowers from defaulting, and what is the cost?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper argues that defaulters face substantial consequences. Using an instrumental variables strategy (treatment assignment as instrument for default on the study card), the probability of having a new loan one year after default is estimated to be 65 pp lower relative to the non-default counterfactual (p = 0.03). A selection-on-observables approach also shows that study card default is associated with the complete absence of any subsequent credit card for at least four years. These costs should provide strong incentives to remain current, making the high observed default rates primarily attributable to cash flow shocks rather than strategic default. The value of formal credit is further confirmed by the finding that a 100 MXN peso increase in the study card&amp;rsquo;s credit limit translates into 32 MXN pesos of additional debt (instrumental variable estimates are more than twice as large as OLS), and by the comparison of informal loan terms (annual rates averaging 291%, loan amounts of 3,658 MXN pesos, durations of 0.52 years) with formal loan terms (94 pp lower rates, 9,842 MXN peso average amounts, 1.07 year durations).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Are the default treatment effects different across the interest rate and minimum payment interventions, or do they interact?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper tests for and cannot reject separability between the two interventions at standard significance levels. At the end of the experiment (May 2009), the p-value for the null that the minimum payment effect is constant across interest rate arms is 0.44; five years later it is 0.65. The null that the interest rate effect is constant across both minimum payment arms yields p = 0.08 at end of experiment and p = 0.411 five years later. The fully saturated specification yields results indistinguishable from the parsimonious linear-separable specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Are there spillover effects from the contract term changes onto other loans held by study participants?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No spillover effects on default on other loans are found, either during the experiment or after it ended, based on credit bureau data covering all formal-sector loans held by the experimental sample. There is also no evidence of crowd-out or crowd-in from other lenders in terms of new loans or loan closures. The only minor exception is a small decrease in default (3%, or approximately 2 pp out of a 61 pp base) on other Bank A loans in the high minimum payment arm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why does the effect of unemployment on default exceed the model&amp;rsquo;s predictions from cash flow alone?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper&amp;rsquo;s back-of-the-envelope normalization finds that the per-peso effects of all three shocks on default are statistically indistinguishable (p = 0.78 for the null that all three λ estimates are equal), with point estimates of λ_IR = 0.36, λ_MP = 0.51, and λ_U = 0.36 pp per 1,000 MXN pesos. This implies that job loss does not have a larger per-peso effect on default than contract term changes; the larger absolute effect of displacement arises entirely from its larger cash flow impact. Additional consequences of job loss beyond cash flow (health, mental health) do not appear to generate additional default beyond what can be attributed to income loss.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the experimental results compare to what experts predicted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Expert predictions were systematically too large. Mexican central bank regulators predicted a mean decrease of 8.6 pp from a 30 pp interest rate reduction at the 18-month horizon, versus the actual estimated effect of 1.03 pp. Social Science Prediction Platform respondents predicted a mean decrease of 5 pp. For minimum payments, regulators on average predicted a 0.4 pp decrease in default from doubling the minimum payment, whereas the actual effect was a 0.8 pp increase. Three-quarters of SSPP respondents correctly predicted the sign of the minimum payment effect (an increase in default), but the predicted mean increase was 6.4 pp, far larger than the estimated 0.8 pp.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Do the job displacement results generalize beyond the experimental sample?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Yes. The paper repeats the displacement event study on the intersection of the nationally representative credit bureau sample (approximately 600,339 individuals with both credit information and employment histories) with the universe of IMSS data for October 2011–March 2014, yielding 8,723 mass layoff events. This sample is representative of the population of Mexican borrowers with formal employment histories, and the estimated effects on default for any loan in the credit bureau are similar in magnitude to the experimental-sample results, providing a measure of external validity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What do the debt dynamics during the experiment reveal about the mechanisms for interest rate effects on default?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The data show that purchases (net of payments) increase in response to interest rate decreases, consistent with downward-sloping demand for credit; yet total debt declines in lower-rate arms. This is consistent with the model&amp;rsquo;s prediction that the mechanical compounding effect (lower rate applied to previously accumulated debt) exceeds the behavioral new-purchase response. Confirmed empirically: the debt elasticity to the interest rate is estimated to be positive, with preferred estimates in the range [+0.18, +0.54]. The decline in default is further concentrated among borrowers with the highest baseline debt utilization rates, those for whom the debt compounding effect is strongest — consistent with the debt channel as the primary mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Cumulative Default Measure:&lt;/strong&gt; Default is defined as three consecutive monthly payments each below the required minimum payment due, at which point Bank A automatically revokes the card. The outcome variable is coded as Yit = 1 if borrower i has defaulted in any month s ≤ t and 0 otherwise, making it a cumulative (absorbing) measure. This allows estimation on an unchanging sample, avoiding attrition biases that would arise from conditioning on not having defaulted in the prior period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minimum Payment Due (mpd):&lt;/strong&gt; The paper uses the required minimum payment due to avoid delinquency as its central cash-flow normalization variable. This is a comprehensive measure that incorporates not only the contractually specified fraction of outstanding balance but also interest charges, fees, and endogenous borrower responses (changes in debt and purchases). It serves as the common denominator for benchmarking the cash flow impacts of the two contract term interventions and formal job loss against one another.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Free Cash Flow / Per-Peso Normalization (λ):&lt;/strong&gt; The paper defines per-peso default effects (λ^IR, λ^MP, λ^U) by dividing each intervention&amp;rsquo;s average treatment effect on cumulative default (in percentage points) by the cumulative change in the minimum payment due (or equivalent cash flow impact) induced by that intervention over 18 months. The resulting ratio is expressed as percentage points of default per 1,000 MXN pesos of cash flow change. This normalization is explicitly not treated as an instrumental variable estimate; it is a descriptive back-of-the-envelope calculation intended to equate the scale of the three shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mass Layoff / Displacement:&lt;/strong&gt; A mass layoff at the firm level is defined as the first month in which year-on-year firm employment declines by more than 30% of average employment in the prior 12 months, restricted to firms with 50+ employees. An individual worker is classified as displaced if they lost formal-sector employment in the same calendar quarter as their employer&amp;rsquo;s mass layoff event. This definition follows Jacobson et al. (1993) and subsequent literature and is used to isolate plausibly involuntary (exogenous) separations from voluntary quits or individually driven terminations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Continuation Value (v):&lt;/strong&gt; In the paper&amp;rsquo;s two-period optimizing model, v is the reduced-form utility parameter capturing future flow of card benefits, warm glow from card ownership, or the option value of retaining access to formal credit, experienced only if the card is not in default. The paper uses v to rationalize the zero interest-rate response of newer borrowers: ceteris paribus, higher v implies that borrowers will remain current on the card even when interest rates are high, because they value continued access. Higher v thus implies more muted responses to interest rate changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank Tenure Strata:&lt;/strong&gt; Borrowers are stratified into three groups based on length of relationship with the study card: &amp;ldquo;new customers&amp;rdquo; (6–11 months), medium-term (12–23 months), and long-term (24+ months). Tenure is used both as a stratification variable for the experiment and as a primary dimension of heterogeneity in treatment effects, reflecting differing default rates (36% vs. 18% at 26 months), labor market vulnerability (1.34× higher job loss probability for new vs. long-term), and interest rate responsiveness (zero for new, significantly positive for long-term borrowers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt Burden Channel vs. Concurrent Moral Hazard:&lt;/strong&gt; The paper distinguishes three channels through which interest rate changes can affect default: (a) the debt burden channel — higher rates mechanically increase the stock of interest-accruing debt, making repayment harder; (b) concurrent moral hazard — higher current interest rates alter the incentive to default on existing obligations, holding debt constant; and (c) dynamic moral hazard — higher future interest rates reduce the benefit of remaining current. The paper&amp;rsquo;s finding of a modest total effect (elasticity 0.20) implies that the sum of all three channels is small in this context, with the debt burden channel being the primary driver of what effect does exist.&lt;/p&gt;</description></item><item><title>Costly Multidimensional Screening</title><link>https://macropaperwarehouse.com/papers/costly-multidimensional-screening/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/costly-multidimensional-screening/</guid><description>&lt;p&gt;This paper studies when a principal can improve upon simple one-dimensional mechanisms by also deploying costly nonprice screening instruments — actions that are socially wasteful yet potentially informative about the agent&amp;rsquo;s private type.&lt;/p&gt;
&lt;p&gt;The model features a principal and an agent with quasilinear, additively separable preferences across two components: (i) a productive component, where allocations lie in a one-dimensional compact space X and generate genuine surplus, and (ii) a costly component, where any allocation y in an arbitrary measurable space Y satisfies sB(y, θB) ≤ 0 — it destroys or at best does not create social surplus. The agent&amp;rsquo;s private type is multidimensional, θ = (θA, θB), drawn from a commonly known distribution. Both components allow for nonlinear valuations and, on the principal&amp;rsquo;s side, interdependent preferences.&lt;/p&gt;
&lt;p&gt;The central result (Theorem 1) establishes that if the agent&amp;rsquo;s preferences between the productive and costly components are positively correlated — meaning that a higher θA implies a stochastically higher θB — then there exists an optimal mechanism that involves no costly screening. Moreover, if instruments are strictly costly, every optimal mechanism involves no costly screening almost everywhere. Positive correlation is defined in terms of stochastic dominance: θB | θA is stochastically nondecreasing in θA. A sufficient but not necessary condition is affiliation in the sense of Milgrom and Weber (1982).&lt;/p&gt;
&lt;p&gt;The intuition centers on two observations. First, under positive correlation, costly instruments can only help relax upward incentive constraints (deterring lower types from mimicking higher types). Second, under the surplus condition — a single-crossing condition on the surplus function sA(x, θA) requiring that if x generates more surplus than x&amp;rsquo; at some type, it continues to do so at all higher types — the principal can safely ignore upward incentive constraints at the optimum. The Downward Sufficiency Theorem (Theorem 2) formalizes the second observation: in any one-dimensional screening problem satisfying the surplus condition, there exists an optimal solution to the relaxed program (with only downward IC constraints) that also satisfies all upward IC constraints. Because monetary transfers fully substitute for costly instruments in relaxing downward constraints without destroying surplus, the costly instruments add no value under positive correlation.&lt;/p&gt;
&lt;p&gt;The proof proceeds via a monotone path decomposition of the multidimensional type space, exploiting a measurable monotone coupling (Lemma 1) to write θ = (θA, h(θA; ε)) where ε is independent of θA and h is nondecreasing. This reduces the problem to a family of one-dimensional paths, on each of which the Reconstruction Lemma (Lemma 2) shows that any costly mechanism can be weakly improved upon by one with no costly screening that satisfies all downward IC constraints.&lt;/p&gt;
&lt;p&gt;A partial converse (Proposition 1) shows that under negative correlation — when some dimension of θB is stochastically nonincreasing in θA — there exist utility functions satisfying the surplus condition for which any mechanism screening only the productive component is strictly dominated.&lt;/p&gt;
&lt;p&gt;The paper derives three applications. In monopoly pricing with costly signals (waiting in line, climbing stairs, collecting coupons), profit-maximizing mechanisms require no costly signals when higher-willingness-to-pay consumers also face weakly lower signal costs (Proposition 2). In monopsonistic labor market screening, the firm need not make offers contingent on costly credentials when higher-ability workers find credentialing easier — in contrast to the competitive Spence (1973) model where all screening must occur through costly effort because wages are pinned down by expected output (Proposition 3). In multiproduct pricing, the paper reinterprets bundle components as costly instruments for screening grand-bundle values, recovering Haghpanah and Hartline&amp;rsquo;s (2021) pure bundling optimality result and extending it to nested bundling (Proposition 4), under conditions that the incremental value of adding items to nested bundles is strictly increasing in type while the value of any non-nested bundle is nonincreasing relative to some nested superset.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks whether a principal can improve upon simple one-dimensional mechanisms by also deploying costly nonprice screening instruments when the agent has multidimensional private information. The goal is to characterize conditions under which augmenting a standard price menu with surplus-destroying actions — such as waiting in line, climbing stairs, or obtaining credentials — is or is not beneficial for the principal.&lt;/p&gt;
&lt;p&gt;Q: What does &amp;ldquo;positively correlated preferences&amp;rdquo; mean precisely in this model?
A: Positive correlation means that θB is stochastically nondecreasing in θA: for any θA &amp;lt; θ̂A, the conditional distribution of θB given θA first-order stochastically dominates that given θ̂A — i.e., θB | θA ≤_st θB | θ̂A. Observing a high θA conveys good news about θB in the stochastic dominance sense. A sufficient but not necessary condition is affiliation in the sense of Milgrom and Weber (1982). The condition is asymmetric and does not require full independence or monotone dependence in a deterministic sense.&lt;/p&gt;
&lt;p&gt;Q: What is the surplus condition and why does it matter?
A: The surplus condition is a single-crossing condition on the productive surplus function: for any x &amp;lt; x̂ and θA &amp;lt; θ̂A, if sA(x̂, θA) &amp;gt; sA(x, θA) then sA(x̂, θ̂A) &amp;gt; sA(x, θ̂A). It says that if a higher allocation generates more total surplus at some type, it continues to do so at all higher types. This condition ensures the existence of a monotone efficient allocation rule, and it is the key enabling condition for the Downward Sufficiency Theorem. It is automatically satisfied when the principal has no interdependent preferences and the agent satisfies increasing differences, and also when sA is strictly increasing in x or has nonnegative cross partial derivative.&lt;/p&gt;
&lt;p&gt;Q: What is the Downward Sufficiency Theorem and why is it the key technical result?
A: Theorem 2 states that in any one-dimensional screening problem satisfying the surplus condition, there exists an optimal solution to the relaxed program — which ignores all upward IC constraints — that also satisfies all upward IC constraints. This means the principal can solve the easier downward-IC-only problem and the solution is fully incentive compatible. The result is novel and uncovers a general property of one-dimensional screening problems beyond the standard monotone allocation rule setting. It is key because, combined with the observation that costly instruments under positive correlation can only relax upward constraints, it implies there is no benefit to using costly screening.&lt;/p&gt;
&lt;p&gt;Q: How does the proof handle the case of multidimensional types?
A: The proof uses a monotone path decomposition. By Lemma 1 (measurable monotone coupling), under positive correlation there exists a random variable ε independent of θA and a nondecreasing measurable function h such that θ =^d (θA, h(θA; ε)). This writes the joint type distribution as a family of monotone paths indexed by ε. On each path ε = e, the types are ordered by θA alone, reducing the problem to a one-dimensional screening problem. The Reconstruction Lemma (Lemma 2) then shows that on each such path, any mechanism involving costly screening can be replaced by one without costly screening that weakly improves principal payoff and satisfies all downward IC constraints.&lt;/p&gt;
&lt;p&gt;Q: What does the partial converse (Proposition 1) establish?
A: Proposition 1 shows that when some dimension i of the costly component satisfies that θi is stochastically nonincreasing in θA (negative correlation), and the type distribution has a density with |X| &amp;gt; 1 and |Y| &amp;gt; 1, then there exist utility functions satisfying the surplus condition for which any mechanism screening only the productive component is strictly dominated by one involving costly screening. This is not a full converse — it establishes existence of cases where costly screening is strictly beneficial, not that it is always beneficial under negative correlation.&lt;/p&gt;
&lt;p&gt;Q: How does the insurance example illustrate the two correlation cases?
A: In Example 1 (negative correlation), a low-risk type (θA = 0) values insurance at 2, a high-risk type (θA = 1) values it at 3; costs are 0 and 5/2 respectively; and the high-risk type also has higher disutility for the costly action. Without costly screening, the optimal mechanism sells full insurance at price 2 to both types for a profit of 3/4. With costly screening (e.g., requiring the agent to climb stairs to get full insurance), only the low-risk type purchases, yielding profit of 1 &amp;gt; 3/4. In Example 2 (positive correlation), the high-risk type has lower disutility for the costly action; any mechanism using the costly instrument is strictly dominated by simply selling full insurance at price 2 to both types.&lt;/p&gt;
&lt;p&gt;Q: How does the labor market application differ from Spence (1973)?
A: In Spence (1973), wages are competitive and pinned down by expected output, leaving no room to screen workers via monetary payments, so all screening must occur through costly credentials. In Yang&amp;rsquo;s model, the monopsonistic firm sets wages and all types face the same outside option, so monetary transfers can screen types. Proposition 3 says that when θB is stochastically nondecreasing in θA — higher-ability workers find credentials easier — no credential is needed in the optimal mechanism. The paper thus shows that costly screening is a feature of competitive, not monopsonistic, labor markets, under positive correlation of preferences.&lt;/p&gt;
&lt;p&gt;Q: What is the bundling application and what new results does it yield?
A: The paper reinterprets the multiproduct pricing problem by treating the grand bundle as the productive component and sub-bundles as costly instruments (since selling a sub-bundle instead of the grand bundle destroys social surplus relative to selling the grand bundle). Proposition 4 (nested bundling) establishes that a nested menu B of bundles is optimal among deterministic mechanisms if: (i) the incremental value of adding items to move from bundle b to b&amp;rsquo; ⊃ b in B is strictly increasing in θ, and (ii) for any bundle b not in B, there exists a nested superset b&amp;rsquo; ∈ B such that the value of b relative to b&amp;rsquo; is nonincreasing in θ. This extends and complements Haghpanah and Hartline (2021), which is recovered as the special case of pure bundling (Proposition 5).&lt;/p&gt;
&lt;p&gt;Q: What are the key scope conditions that delimit when Theorem 1 applies?
A: Theorem 1 requires: (i) additive separability of preferences across productive and costly components; (ii) the surplus condition on sA (single-crossing of total surplus in the productive component); (iii) the positive correlation condition (stochastic monotonicity of θB in θA); and (iv) the costly instruments satisfy sB(y, θB) ≤ 0 for all y, θB. The productive allocation space X must be compact and one-dimensional; Y can be any measurable space. The agent&amp;rsquo;s type space can be multidimensional. The result holds for both private values and interdependent valuations on the principal&amp;rsquo;s side.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does costly screening arise in practice, according to the model?
A: The model predicts that if costly screening instruments are observed in practice, the consumers or agents with higher willingness to pay (or ability) for the productive good must tend to face higher costs for the screening action. For instance, higher-willingness-to-pay consumers who find waiting in line more costly (positively correlated preferences) would not be subjected to waiting as a screening device. If a firm uses waiting in line, it must be because higher-willingness-to-pay consumers find waiting less costly — consistent with negative correlation.&lt;/p&gt;
&lt;p&gt;Costly Instruments: Allocations in the space Y such that the ex post social surplus sB(y, θB) = uB(y, θB) + vB(y, θB) ≤ 0 for all y and all θB. These include actions like waiting in line, collecting coupons, or obtaining credentials that destroy social surplus but may convey private information useful for screening.&lt;/p&gt;
&lt;p&gt;Productive Component: The one-dimensional allocation dimension X in which both principal and agent derive non-negative surplus, representing the intrinsically valuable output of the mechanism (e.g., insurance coverage, job placement, bundle of goods).&lt;/p&gt;
&lt;p&gt;Positive Correlation (Stochastic Monotonicity): The condition that θB is stochastically nondecreasing in θA: for any θA &amp;lt; θ̂A, the conditional distribution of θB given θA first-order stochastically dominates that given θ̂A. Equivalently, observing a higher θA conveys good news about θB. A sufficient condition is affiliation (Milgrom-Weber), but positive correlation is strictly weaker.&lt;/p&gt;
&lt;p&gt;Surplus Condition: A single-crossing condition on the total surplus function sA(x, θA) for the productive component: for any x &amp;lt; x̂ and θA &amp;lt; θ̂A, if x̂ generates strictly more surplus than x at type θA, it continues to do so at θ̂A. This ensures a monotone efficient allocation rule exists and is the enabling condition for the Downward Sufficiency Theorem.&lt;/p&gt;
&lt;p&gt;Downward Sufficiency Theorem (Theorem 2): The result that in any one-dimensional screening problem satisfying the surplus condition, there exists an optimal solution to the relaxed program (which ignores upward IC constraints) that also satisfies all upward IC constraints. This implies the principal need only enforce downward incentive constraints at the optimum.&lt;/p&gt;
&lt;p&gt;Monotone Path Decomposition: A proof technique that writes the multidimensional type distribution as θ =^d (θA, h(θA; ε)) where ε ⊥ θA and h is nondecreasing in θA. Borrowed from dynamic mechanism design (Eso-Szentes, Pavan-Segal-Toikka), it reduces multidimensional IC problems to families of one-dimensional paths indexed by the independent residual ε.&lt;/p&gt;
&lt;p&gt;Nested Bundling: A menu B of product bundles that can be totally ordered by set inclusion (b1 ⊂ b2 ⊂ &amp;hellip; ⊂ bK). The paper shows that nested bundling is optimal under conditions that the incremental value of nesting is strictly increasing in type for bundles within B, and nonincreasing relative to any nested superset for bundles outside B.&lt;/p&gt;</description></item><item><title>Costs of Financing U.S. Federal Debt Under a Gold Standard: 1791-1933</title><link>https://macropaperwarehouse.com/papers/costs-of-financing-u.s.-federal-debt-under-a-gold-standard-1791-1933/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/costs-of-financing-u.s.-federal-debt-under-a-gold-standard-1791-1933/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper constructs a new dataset of US federal bond prices and uses it to estimate the full term structure of yields on gold-denominated US federal debt from 1791 to 1933 — the entire gold standard era. The core research question is how the costs of financing US federal debt evolved over this period and what monetary, fiscal, and financial policy changes drove that evolution, with the ultimate aim of understanding how the US built fiscal capacity and transformed its debt from a &amp;ldquo;junk bond&amp;rdquo; into a global &amp;ldquo;safe asset.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The authors compile monthly prices, quantities, and descriptions of all US Treasury securities from 1776 to 1960 (the Hall et al. 2018 dataset). Bonds with less than one year to maturity are excluded from the main estimation due to liquidity premia. The primary estimation uses a Dynamic Nelson-Siegel (DNS) model with stochastic volatility (Diebold and Li 2006; Hautsch and Yang 2012), estimated by Bayesian MCMC. A key methodological innovation is the addition of bond-specific idiosyncratic pricing errors (Assumption 3), which allows the authors to include bonds with heterogeneous contract features — call options, indefinite maturities, conversion features — that characterize 19th-century US debt without either dropping them from the sample or having their idiosyncrasies distort the common yield curve. The data are &amp;ldquo;big&amp;rdquo; in the time-series dimension but sparse in the maturity (cross-sectional) dimension, frequently offering fewer than five price observations per month; the DNS framework pools information across time to address this sparsity.&lt;/p&gt;
&lt;p&gt;For the greenback period (1862–1878), the authors extend the approach by modeling the greenback yield curve as a function of the gold yield curve and a time-varying VAR model of exchange rate expectations (Assumptions 4–5). Only nine greenback-denominated bonds exist in the sample, most of them short-term; the VAR is estimated jointly using exchange rate data and the relative prices of greenback and gold bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Long-run decline in yields.&lt;/strong&gt; The 10-year gold-denominated zero-coupon yield fell from approximately 8% in 1800 to approximately 2% in 1900, consistent with global secular decline trends, but the trajectory stabilized near 2% after 1900 — suggesting US debt began to play a distinctive &amp;ldquo;safe-asset&amp;rdquo; role from the turn of the 20th century.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;War spikes were much larger than previously understood.&lt;/strong&gt; The paper&amp;rsquo;s estimate of the 10-year gold yield reaches a peak of approximately 16% near the end of the Civil War. This is substantially higher than the Homer and Sylla (2004) peak of 6% at the start of the war. The discrepancy arises because Homer and Sylla used bonds trading at par — which did not exist during the Civil War — while this paper uses the full universe of bonds at monthly frequency.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Yield curve slope switched sign.&lt;/strong&gt; The term spread (10-year minus 2-year gold yield) was typically negative before the Civil War (inverted yield curve) and turned persistently positive afterward. The authors link this switch to a change in long-run inflation predictability: inflation was relatively hard to forecast before the Civil War and easier to forecast after, consistent with a negative inflation-risk premium in the pre-war period.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Default risk premium disappeared around 1905.&lt;/strong&gt; Comparing hypothetical gold-denominated US consols to UK consols (the 19th-century benchmark safe asset), US yields were persistently above UK yields until approximately 1905, when US yields fell below UK yields. This indicates that US federal debt acquired safe-asset characteristics well before World War I, foreshadowing the shift in global reserve asset status during and after Bretton Woods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Nominal anchor during the Civil War.&lt;/strong&gt; Despite a 60% depreciation of the greenback against gold during the Civil War (100 greenback dollars could be purchased for as few as 40 gold dollars in summer 1864), investors expected greenbacks to eventually return to gold parity. Estimated long-run exchange rate expectations remained anchored at one-for-one parity throughout the period. This kept greenback-denominated bond yields flat at approximately 6% — bonds traded around par — explaining the &amp;ldquo;Civil War yield puzzle&amp;rdquo; noted by Friedman and Schwartz (1963).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Short-rate disconnect.&lt;/strong&gt; Short-maturity government bonds (less than one year) traded with a premium of approximately 0.25 to 0.5 percentage points relative to model-implied yields throughout most of the 19th century, reflecting scarcity of money-like assets. This premium effectively disappeared from the 1880s until World War I — coinciding with the National Banking Era — and then reappeared in the 1920s after the Federal Reserve created a secondary market for Certificates of Indebtedness.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why does the paper restrict estimation to bonds with maturity greater than one year?&lt;/strong&gt;
Short-maturity Treasury notes exhibited particularly large estimated bond-specific pricing errors in preliminary analysis, which the authors attribute to a liquidity premium: short-term government debt was used for transactions and thus commanded a money-like premium that a common discount function cannot accommodate. To keep this liquidity premium from distorting estimates of the longer end of the curve, these bonds are excluded from the main estimation. Short-maturity bonds are then studied separately as an &amp;ldquo;out-of-sample&amp;rdquo; exercise (the short-rate disconnect).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the Dynamic Nelson-Siegel model with stochastic volatility solve the cross-sectional sparsity problem?&lt;/strong&gt;
The DNS model parameterizes the entire yield curve at each date using only three latent factors — level (L), slope (S), and curvature (C) — which follow a driftless random walk. The stochastic volatility component, captured in the covariance matrix Σt, governs how much information is pooled across adjacent time periods. When Σt → 0, the yield curve is assumed constant (full pooling); when Σt → ∞, estimates are date-by-date (no pooling). By allowing Σt to vary, the model pools more heavily in sparse periods and less during wars when yields change rapidly. The companion paper (Payne et al. 2023a) confirms via information criteria that stochastic volatility and correlated shocks improve fit without overfitting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the bond-specific pricing error and why is it essential for historical data?&lt;/strong&gt;
Assumption 3 adds to each bond i a Gaussian pricing error with mean zero and bond-specific standard deviation σ(i)_m (scaled by Macaulay duration to approximate yield-space errors). This allows bonds with idiosyncratic contract features — call options, conversion clauses, ambiguous payment currency — to inform the common yield curve without unduly distorting it. Bonds with larger σ(i)_m receive less weight in estimation. In modern datasets, researchers pre-select homogeneous bonds and use time-specific pricing errors; the historical sparsity prevents that approach here.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How large were Civil War yields compared to prior estimates, and why does the discrepancy arise?&lt;/strong&gt;
The paper&amp;rsquo;s posterior median for the 10-year gold zero-coupon yield peaks at approximately 16% near the end of the Civil War. Homer and Sylla (2004) report a peak of 6% at the start of the war. The discrepancy arises because Homer and Sylla used bonds trading close to par, but during the Civil War no federal bonds traded at gold-price par (Lincoln&amp;rsquo;s re-election was uncertain in summer 1864; 100 greenback dollars could be purchased for 40 gold dollars, implying 6% coupon bonds were priced at 40% of par, implying yields in excess of 15%). This paper uses the full universe of Treasury bonds at monthly frequency and allows all bonds — regardless of trading price — to inform the yield curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: When did US debt cease to carry a default risk premium relative to UK debt, and how is this measured?&lt;/strong&gt;
The authors compare yields-to-maturity on gold-denominated UK consols to those on hypothetical gold-denominated US consols promising the same coupon flows. Because both countries were on a gold standard for most of the period and UK consols were the 19th-century safe asset, the spread is interpreted as a risk premium on US debt. US yields fell below UK yields persistently after approximately 1905, indicating that US debt was priced as a safe asset well before World War I. US yields were temporarily close to UK yields in the 1820s but the spread re-widened after the Jacksonian era, state defaults in the 1840s, and the Civil War. The spread closed only after Civil War disruptions resolved, the National Banking System matured, and gold-greenback parity was restored in 1879.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the &amp;ldquo;nominal anchor&amp;rdquo; finding during the greenback era, and what econometric method uncovers it?&lt;/strong&gt;
During 1862–1878, the federal government issued non-convertible greenback dollars alongside gold bonds. The greenback depreciated substantially (to 40 cents per gold dollar in 1864), yet greenback-paying bonds traded near par, implying greenback yields near 6%. The authors model the greenback yield curve as a product of the gold discount function and a &amp;ldquo;multiplier&amp;rdquo; z(j)_t capturing the expected future gold-to-greenback exchange rate at each horizon j (Assumption 4). The exchange rate expectations are estimated via a time-varying VAR(2) model of the gold-to-greenback and gold-to-goods exchange rates (Assumption 5), jointly constrained by the prices of greenback bonds via an interest-rate parity condition. The resulting estimates show that throughout the greenback era — even during large wartime depreciations — investors&amp;rsquo; long-run expectations of the exchange rate remained anchored near gold parity, consistent with anticipated eventual resumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How did political events affect exchange rate expectations during and after the Civil War?&lt;/strong&gt;
The time-varying VAR captures shifts in exchange rate expectations associated with identifiable political events. Grant&amp;rsquo;s victory in 1869 (which resolved uncertainty about whether debts would be honored in gold) coincided with an increase in the price of greenbacks, a decrease in expected greenback appreciation, and a closing of the gap between greenback and gold 10-year yields. In the early 1870s, following the Panic of 1873 and uncertainty about resumption, investors came to expect that gold-greenback discrepancies would persist almost indefinitely, causing gold and greenback yields to converge. The Resumption Act of January 1875 then shifted 2-year and 10-year expectations back toward parity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the short-rate disconnect and what does it reveal about the National Banking Era?&lt;/strong&gt;
The short-rate disconnect is the difference between observed yields-to-maturity for bonds with less than one year to maturity and the yields-to-maturity implied by the model estimated on bonds with more than one year maturity. A positive disconnect means short-maturity bonds yielded less than long-maturity bonds conditional on the model — indicating a liquidity premium on short-term debt. The authors find a persistent premium of 0.25 to 0.5 percentage points through most of the 19th century, reflecting scarcity of money-like assets when state bank notes circulated at variable discounts. The premium disappeared from approximately the 1880s to World War I, coinciding with the mature National Banking Era after greenback-gold parity was restored in January 1879. The authors interpret this as evidence that the National Banking Acts (1862–1866), which allowed National Banks to issue standardized bank notes backed by long-term US government bonds, ultimately succeeded in supplying liquid assets and equalizing the pricing of short- and long-term federal debt — but only after the currency risk from the greenback period had been resolved.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the composite long-term yield series (Officer-Williamson / Homer-Sylla) distort historical narratives?&lt;/strong&gt;
The composite series combines Homer and Sylla US federal yields (1798–1861), New England Municipal bond yields (1862–1899), and corporate bond yields (1900–1940). The paper shows that this composite series substantially underestimates the increase in US federal borrowing costs during Civil War deficits (peak of 6% vs. this paper&amp;rsquo;s 16%) and overstates post-Civil War borrowing costs by mixing in riskier private obligations. The authors argue that earlier findings of no strong association between 19th-century interest costs and deficits (Evans 1985, 1987) may reflect the composite series&amp;rsquo; failure to accurately capture federal borrowing costs during large deficit episodes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How did the yield curve slope change after the Civil War and what explains it?&lt;/strong&gt;
The term spread (10-year minus 2-year gold yield) was typically negative before the Civil War and positive after the late 1870s. Major wars caused sharp temporary decreases (inversions). The authors connect the sign switch to a change in long-run inflation dynamics documented in a companion paper (Payne et al. 2023b): long-run inflation was hard to predict before the Civil War and easier to predict after, suggesting gold bonds provided a better inflation hedge in the pre-war period (negative inflation-risk premium), which is consistent with asset pricing theory producing a downward-sloping yield curve. After the Civil War, as inflation became more predictable, the inflation-risk premium became positive and the yield curve turned upward-sloping.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What did the National Banking Acts seek to do and was the puzzle of bank note under-issuance resolved?&lt;/strong&gt;
The National Banking Acts (1862, 1863, 1865, 1866) authorized federally chartered banks to issue bank notes up to 90% of the par or market value of eligible US Treasury bonds deposited as collateral, subject to a 1% annual tax on notes outstanding (0.5% after 1900), compared to a 10% tax on state bank notes. The intended goals were to increase the supply of short-term liquid assets and to increase bank demand for long-term federal debt, thereby lowering long-term yields and eliminating the short-rate disconnect. A long-standing puzzle (Friedman-Schwartz, Cagan, Champ, Calomiris-Mason) held that yields on eligible Treasuries did not fall enough to equal the note tax rate, implying under-issuance. The paper&amp;rsquo;s analysis of the short-rate disconnect offers a resolution: if one focuses on the disconnect rather than the yield-tax spread, the National Banking Acts appear to have largely achieved their goals by the 1880s — but only after greenback-gold parity was restored, suggesting that currency devaluation risk had initially restrained bank note issuance, as hypothesized by Cagan (1965).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Dynamic Nelson-Siegel (DNS) model with stochastic volatility:&lt;/strong&gt; A parametric yield curve model (Diebold-Li 2006) parameterizing zero-coupon yields at each date as a function of three latent factors — level (L), slope (S), curvature (C) — following a driftless random walk. The paper extends this with time-varying shock volatilities (stochastic volatility) to allow the degree of information pooling across time periods to vary with institutional and wartime disruptions. Used here to handle cross-sectional sparsity in historical bond data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond-specific pricing error:&lt;/strong&gt; A Gaussian pricing error with bond-specific standard deviation σ(i)_m (scaled by Macaulay duration) added to each bond&amp;rsquo;s observed price. Allows bonds with heterogeneous and idiosyncratic contract features (call options, conversion clauses) to inform a common discount function without distorting it, by automatically down-weighting &amp;ldquo;peculiar&amp;rdquo; bonds through higher estimated σ(i)_m.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-rate disconnect (liquidity premium):&lt;/strong&gt; The systematic difference between observed yields-to-maturity on bonds with less than one year to maturity and yields implied by a pricing kernel fitted on bonds with more than one year to maturity. Interpreted as a money-like convenience yield (liquidity premium) on short-term debt: when money-like assets are scarce, short-term bonds are overpriced (lower yields) relative to the term structure implied by longer maturities. Measured here as an out-of-sample fit residual from the DNS model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Denomination risk:&lt;/strong&gt; The risk that the unit of account in which bond payments are promised may change in value relative to gold. During the greenback era (1862–1878), bonds denominated in greenbacks carried denomination risk because greenbacks could depreciate against gold. The paper distinguishes denomination risk from default risk by estimating separate gold and greenback yield curves and modeling exchange rate expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nominal anchor:&lt;/strong&gt; The phenomenon in which long-run market expectations of the gold-to-greenback exchange rate remained anchored near gold parity (one-for-one) even during large short-run depreciations during the Civil War. Inferred from the observation that greenback-denominated bonds traded near par (yield ~6%) while the spot greenback depreciated by up to 60% against gold, implying investors anticipated eventual full appreciation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default risk premium (US-UK yield spread):&lt;/strong&gt; The difference between yields on hypothetical gold-denominated US consols and yields on UK consols. Since both were on a gold standard (so inflation expectations are similar), and UK consols were the 19th-century benchmark safe asset, the spread is interpreted as the compensation investors demanded for the risk that the US might default or alter payment terms. Persistently positive until approximately 1905, then became negative.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Convenience yield:&lt;/strong&gt; An implicit yield that accrues to holders of money-like or safe assets because of their use in transactions or as collateral. In this paper, it emerges as the spread between yields on US federal bonds and other low-risk bonds in the late 19th century, reflecting increased demand for Treasuries as reserves under the National Banking System. Historically identified via the short-rate disconnect disappearing in the National Banking Era.&lt;/p&gt;</description></item><item><title>Counterfactual Analysis for Structural Dynamic Discrete Choice Models</title><link>https://macropaperwarehouse.com/papers/counterfactual-analysis-for-structural-dynamic-discrete-choice-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/counterfactual-analysis-for-structural-dynamic-discrete-choice-models/</guid><description>&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Discrete choice data identify only &lt;em&gt;differences&lt;/em&gt; in agents&amp;rsquo; utilities, not utility levels. In dynamic discrete choice (DDC) models this means many policy-relevant counterfactuals — those requiring knowledge of utility in levels — are not point-identified. Kalouptsidi, Kitamura, Lima, and Souza-Rodrigues ask: how much can researchers learn about counterfactual outcomes under mild, verifiable restrictions, without imposing the strong normalizations that are standard in applied work but often hard to justify and potentially sign-reversing in their effects?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Methodology.&lt;/strong&gt; The paper works within a canonical infinite-horizon DDC framework where an agent chooses among a finite action set each period, with additively separable per-period payoffs and i.i.d. unobservables. The econometrician observes conditional choice probabilities (CCPs) and state transition functions from panel data, but the payoff vector is underidentified by X free parameters (one per state), which is the source of non-identification of many counterfactuals. The authors characterize the &lt;em&gt;sharp identified set&lt;/em&gt; for counterfactual CCPs, for low-dimensional outcomes such as average welfare, and develop both identification theory and a feasible inference procedure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Identification Results.&lt;/strong&gt; The sharp identified set for counterfactual CCPs is a smooth, connected manifold whose dimension equals the rank of a specific matrix (CJ*QJ) that the econometrician can compute directly from the data. This rank is at most X minus the number of linearly independent equality restrictions imposed. Two classes of commonly used restrictions reduce the dimension further without requiring full point identification: (i) &lt;em&gt;local counterfactuals&lt;/em&gt; — experiments affecting only a subset of the state-action space — reduce the dimension to at most the number of eigenvalues of the relevant transformation matrix that differ from one; (ii) &lt;em&gt;parametric payoffs&lt;/em&gt; with ηγ free parameters reduce the dimension to at most ηγ. Combining both achieves the tightest bound. Point identification is the special case where the rank equals zero.&lt;/p&gt;
&lt;p&gt;For scalar low-dimensional outcomes (e.g., average welfare), the identified set is a compact interval whose endpoints are obtained by solving constrained optimization programs implementable in standard nonlinear solvers (e.g., Knitro), feasible even when the state space is large.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Illustration.&lt;/strong&gt; In the firm entry/exit Monte Carlo with state space X = 4 and a counterfactual entry subsidy removal: under Restriction 1 alone (outside option = 0, non-negative costs, known variable profits), the identified set for the change in the long-run probability of being active is [-0.1235, 0.0000], correctly signed and containing the true value of -0.0638. Adding shape restrictions (Restrictions 1–2) tightens the upper bound to -0.0341; adding the scrap-value exclusion restriction (Restrictions 1–3) tightens it to -0.0421. Analogous patterns hold for consumer surplus (true: -0.0875; bounds narrowing from [-0.1735, 0.0000] to [-0.1735, -0.0573]) and firm value (true: 0.9513; bounds from [0.0000, 1.8229] to [0.6388, 1.8229]). Critically, the authors show that setting scrap values to zero — the standard identifying assumption — is &lt;em&gt;rejected by the data&lt;/em&gt; under Restrictions 1 and 2, because that payoff vector does not lie in the identified set.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Application.&lt;/strong&gt; Revisiting Das, Roberts, and Tybout (2007) on Colombian exporters, the paper re-examines the horserace among export revenue, fixed cost, and entry cost subsidies. The DRT ranking (revenue subsidies dominate, entry cost subsidies rank last) survives under weaker restrictions than originally imposed, but hinges on the assumption that scrap values do not vary across states. Without that restriction, entry cost subsidies can potentially outperform the other types, reversing the original conclusion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inference.&lt;/strong&gt; The paper develops a subsampling-based inference procedure that is asymptotically uniformly valid (bootstrap fails here due to non-regularity of the set boundary). The confidence set is constructed by inverting a quadratic-form distance test statistic. The critical practical recommendation is subsample size hN = N^{2/3}. The procedure remains feasible in binary choice models with state spaces up to X = 240 (dimension of the optimization problem: 720), where standard moment-inequality approaches are computationally infeasible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why are many counterfactuals not point-identified in DDC models, even after the model is estimated?&lt;/strong&gt;
A: Choice data identify only differences in value functions across actions, not utility levels. The identifying matrix M has rank AX, leaving X free payoff parameters undetermined. Counterfactuals that depend on utility levels — such as the welfare impact of an entry subsidy when scrap values are unknown — therefore cannot be recovered uniquely from the data, even with a fully estimated model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the key object the paper characterizes, and what does it look like geometrically?&lt;/strong&gt;
A: The paper characterizes the sharp identified set for the counterfactual CCP vector p̃. Proposition 1 establishes that this set is a smooth, connected manifold with boundary, whose interior dimension equals rank(CJ*QJ). Connectedness is important because it means the set has no gaps and boundary tracing is sufficient to characterize it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the dimension of the identified set depend on the type of model restrictions imposed?&lt;/strong&gt;
A: Equality restrictions (d of them) reduce the maximum possible dimension from X to X–d. Local counterfactuals (affecting L state-action pairs) reduce the dimension further to at most the number of eigenvalues of the payoff transformation H(L) that differ from one, which is at most L. Parametric payoffs with ηγ free parameters cap the dimension at ηγ. Combining local counterfactuals with parametric payoffs gives the tightest bound: at most the number of eigenvalues of a related matrix D that differ from one, which is at most min(L, ηγ).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Under what conditions does the identified set for counterfactual behavior collapse to a point?&lt;/strong&gt;
A: When rank(CJ*QJ) = 0, every payoff vector in the identified set PI maps to the same counterfactual CCP — that is, p̃ is point-identified even though the structural payoff π may not be. This can occur through a combination of equality restrictions and specific structure of the counterfactual experiment, without requiring full identification of all model parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What properties does the identified set for a scalar low-dimensional outcome have, and how is it computed?&lt;/strong&gt;
A: Under continuity of the outcome function φ and boundedness of the payoff identified set, the identified set for a scalar outcome θ is a compact interval [θL, θU]. The endpoints are computed as the minimum and maximum of a constrained optimization program over the joint space of counterfactual CCPs and payoff vectors, subject to the model&amp;rsquo;s Bellman equations, model restrictions, and equality constraints linking observed to counterfactual behavior. These programs can be solved with standard nonlinear solvers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the Monte Carlo results show about the informativeness of the bounds?&lt;/strong&gt;
A: In the firm entry/exit example with X = 4, the identified sets under only mild restrictions (non-negative costs, known variable profits, zero outside option) are already informative and correctly signed. For the change in the probability of being active (true value: -0.0638), the set under Restriction 1 alone is [-0.1235, 0.0000], establishing that the probability does not increase. Adding shape restrictions and exclusion restrictions progressively tightens the interval. All intervals contain the true parameter value, confirming sharpness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the paper show about the assumption of zero scrap values, which is standard in the entry cost literature?&lt;/strong&gt;
A: The paper shows that setting scrap values to zero can be rejected by the data: in the firm entry/exit example, the payoff vector with s = 0 does not belong to the identified set PI under Restrictions 1 and 2. This is empirically important because Kalouptsidi, Scott, and Souza-Rodrigues (2021) had previously shown that mistakenly setting scrap values to zero not only biases estimated entry costs downward but can also reverse the sign of a subsidy&amp;rsquo;s predicted effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the main finding of the empirical application to export subsidies?&lt;/strong&gt;
A: Revisiting Das, Roberts, and Tybout (2007), the paper finds that the DRT ranking — export revenue subsidies dominate, entry cost subsidies rank last — can be confirmed under restrictions weaker than those DRT originally imposed. However, the ranking is not robust to allowing scrap values to vary across states: under that generalization, entry cost subsidies can potentially outperform the other subsidy types, reversing the original policy conclusion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does the bootstrap fail for inference in this setting, and why does subsampling work?&lt;/strong&gt;
A: The test statistic ĴN(θ0) involves the minimum of a quadratic form over a non-regular (kinked), random, and possibly nonconvex set. Bootstrap critical values are not asymptotically uniformly valid in this non-regular setting. Subsampling with subsample size hN → ∞, hN/N → 0 (the paper recommends hN = N^{2/3}) delivers asymptotically uniformly valid critical values under weak conditions, because it does not require regularity of the constraint set boundary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the inference approach handle the high dimensionality of DDC settings?&lt;/strong&gt;
A: The paper develops a computational algorithm specifically tailored to the structure of DDC models, exploiting the linear Bellman equation constraints to reduce the effective dimensionality of the optimization problem. In a binary choice model with X = 90, the joint optimization is over a 270-dimensional space; with X = 240 (as in Blundell, Gowrisankaran, and Langer, 2020), the dimension is 720. Standard moment-inequality inference methods (Kaido, Molinari, Stoye, 2019; Bugni, Canay, Shi, 2017) are computationally infeasible at these scales; the authors&amp;rsquo; algorithm remains tractable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper relate to Norets and Tang (2014), the closest alternative approach?&lt;/strong&gt;
A: Norets and Tang (2014) partially identify structural parameters and high-dimensional counterfactual CCPs by relaxing the assumed distribution of idiosyncratic shocks, focusing on binary choice models and using a pointwise-valid Bayesian approach. The present paper instead targets low-dimensional policy outcomes (nonlinear functions of payoffs and counterfactual CCPs), accommodates multinomial choice, provides asymptotically uniformly valid frequentist inference via subsampling, and restricts the source of underidentification to the payoff function rather than the error distribution. The two contributions are non-nested and complementary.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the practical workflow the paper enables for applied researchers?&lt;/strong&gt;
A: A researcher can (i) select any combination of model restrictions (equality or inequality, parametric or shape), (ii) specify any counterfactual experiment via an affine payoff transformation (H, g), and (iii) define any low-dimensional outcome of interest φ, then directly compute the identified set and a valid confidence interval by solving two constrained optimization programs — without deriving new analytical identification results for each specification. The rank condition for checking the dimension of the identified set is computable from the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic Discrete Choice (DDC) Model.&lt;/strong&gt; A discrete-time infinite-horizon model where agents choose among a finite action set each period, with per-period utilities additively separable into an observed payoff function π and an i.i.d. unobservable shock, and agents maximize expected discounted lifetime utility. The model is parameterized by payoffs π, transition function F, discount factor β, and shock distribution G.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conditional Choice Probability (CCP).&lt;/strong&gt; The probability that an agent selects a given action in a given state, integrating out the unobservable shocks. CCPs and state transitions are directly identifiable from panel data and serve as the sufficient statistics for the identified set, in place of the unidentified payoff vector.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sharp Identified Set for Counterfactual CCPs.&lt;/strong&gt; The set PĨ(p, F) of all counterfactual CCP vectors p̃ that are consistent with the observed data (p, F) and the imposed model restrictions, given the specified counterfactual transformation. Characterized as a smooth connected manifold with dimension equal to rank(CJ*QJ).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Counterfactual.&lt;/strong&gt; A counterfactual experiment in which the payoff transformation H modifies only a subset L of the state-action pairs, leaving the rest unchanged. Local counterfactuals reduce the dimension of the identified set relative to global experiments, because only the payoffs in the affected subset matter for the unidentified component of the counterfactual response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partial Identification / Identified Set for Outcomes.&lt;/strong&gt; Rather than seeking a unique estimate of a counterfactual outcome θ, partial identification recovers the set ΘI of all values of θ consistent with the data and restrictions. For scalar outcomes this is a compact interval [θL, θU] whose endpoints solve constrained optimization problems over payoff and counterfactual CCP spaces.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Subsampling Inference.&lt;/strong&gt; A procedure for constructing asymptotically uniformly valid confidence sets by repeatedly computing the test statistic on subsamples of size hN &amp;lt; N, approximating the sampling distribution of ĴN(θ0) without requiring regularity (smoothness) of the boundary of the constraint set — a requirement that fails here due to the kinked, nonconvex nature of the identified set.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rank Condition for Dimension.&lt;/strong&gt; The dimension of the identified set for counterfactual CCPs is determined by the rank of the matrix CJ*QJ, which depends on the counterfactual transformation H, the model restrictions, and the observed data. The econometrician can compute this rank from observables to assess, before imposing any strong assumptions, how many dimensions of freedom remain in the identified set.&lt;/p&gt;</description></item><item><title>Credit Easing versus Quantitative Easing: Evidence from Corporate and Government Bond Purchase Programs</title><link>https://macropaperwarehouse.com/papers/credit-easing-versus-quantitative-easing-evidence-from-corporate-and-government-bond-purchase-programs/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/credit-easing-versus-quantitative-easing-evidence-from-corporate-and-government-bond-purchase-programs/</guid><description>&lt;p&gt;Using security-level data on individual corporate bond prices and the Bank of England&amp;rsquo;s published purchase quantities across its gilt purchase programs (QE1: £200bn, QE2: £125bn, QE3: £50bn, QE4: £60bn) and Corporate Bond Purchase Scheme (CBPS: £10bn of investment-grade sterling corporate bonds), this paper estimates supply effects of QE and CE on UK corporate bond prices, credit spreads, and new issuance separately, exploiting cross-sectional variation in quantities purchased as identifying variation via an instrumental variables approach. In the case of QE alone, supply effects on corporate bond prices are significant at announcement and larger over the full stock-effect horizon, but pass-through to credit spreads is found to be limited to the default-free component of corporate yields under normal market conditions — an exception is QE1 during the financial crisis, when QE&amp;rsquo;s cross-asset supply effects also significantly lowered credit spreads in the longer run. CE via the CBPS is found to be more effective than QE in reducing credit spreads for higher-rated investment-grade bonds even under normal conditions, and is the only program that generates a statistically significant increase in sterling corporate bond issuance. The results are consistent with QE and CE working through partially distinct channels — QE primarily affecting the default-free component of corporate yields, CE additionally compressing the credit-spread component — and complementing each other for higher-rated bonds.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-empirical-strategy-and-why-use-a-security-level-approach"&gt;Q1. What is the empirical strategy and why use a security-level approach?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper uses a two-stage instrumental variables (IV) approach at the individual corporate bond level, with pre-program bond characteristics — maturity, yield-curve fitting errors, the BoE&amp;rsquo;s prior ownership share in the gilt bucket — serving as instruments for the expected distribution of purchases across bonds, allowing isolation of the supply channel from signaling and duration channels.&lt;/strong&gt; The security-level approach offers three advantages over aggregate or event-study methods: it enables construction of &amp;ldquo;substitute buckets&amp;rdquo; (bonds whose maturity is close to the purchased bonds&amp;rsquo;) to estimate cross-asset supply effects; it permits direct comparison of the price elasticity with respect to gilt purchases (cross-asset effect) versus corporate bond purchases (within-asset effect); and it allows estimation of both the announcement-day effect and the stock effect — the cumulative price and spread change over the life of each program — which captures the longer-run portfolio-rebalancing contribution separately from the initial market reaction.&lt;/p&gt;
&lt;h3 id="q2-what-are-qes-effects-on-corporate-bond-prices-and-credit-spreads"&gt;Q2. What are QE&amp;rsquo;s effects on corporate bond prices and credit spreads?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;For QE alone (QE1–3), the instrumented gilt substitute purchases have positive and statistically significant effects on corporate bond prices at announcement across all three programs — in the case of QE1, the average 30 basis-point decline in corporate yields on the announcement day is attributed in full to QE supply effects in the paper&amp;rsquo;s regression.&lt;/strong&gt; The stock effect — estimated over the full life of each program — is significantly larger than the announcement-day effect, consistent with gradual portfolio rebalancing as predicted by Greenwood, Hanson, and Liao (2018). However, except for QE1, the supply effects do not carry through to credit spreads in either the short run or the longer run, which the paper interprets as consistent with QE working primarily through the default-free component of the corporate yield: corporate yields fell in line with gilt yields, but spreads over gilts were unchanged.&lt;/p&gt;
&lt;h3 id="q3-when-does-qe-affect-credit-spreads"&gt;Q3. When does QE affect credit spreads?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;QE1&amp;rsquo;s cross-asset supply effects significantly lowered credit spreads in the longer run, even though QE2 and QE3 do not generate significant credit spread compression in either the short or long run, suggesting that the supply channel interacts with the liquidity channel specifically under conditions of financial market distress.&lt;/strong&gt; The paper interprets the QE1 exception as reflecting the severe disruption during the 2008–09 financial crisis: when capital mobility across markets is constrained and liquidity premia are elevated, central bank purchases of safe assets may also improve trading conditions in indirectly targeted, less liquid markets such as the corporate bond market, reducing the liquidity component of corporate spreads. This interaction does not appear to be operative in the more normal market conditions of QE2 and QE3.&lt;/p&gt;
&lt;h3 id="q4-how-does-ce-compare-to-qe-in-reducing-credit-spreads-and-stimulating-issuance"&gt;Q4. How does CE compare to QE in reducing credit spreads and stimulating issuance?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;CE via the CBPS is found to be more effective than QE in reducing credit spreads for higher-rated investment-grade bonds even under normal financial market conditions, and a corporate bond&amp;rsquo;s price sensitivity to its own CBPS purchases is substantially higher than its price sensitivity to gilt substitute purchases; CE is also the only program with a statistically significant positive effect on new sterling corporate bond issuance.&lt;/strong&gt; Across QE1–3, there is no statistically significant impact of gilt purchases on sterling corporate issuance, while CBPS purchases have positive and statistically significant effects on new sterling corporate bond issuance. The paper characterizes CE and QE as complementary for higher-rated bonds: CE&amp;rsquo;s credit-spread reduction layers on top of QE&amp;rsquo;s default-free component effect, making the total stock effect larger than either program alone.&lt;/p&gt;
&lt;h3 id="q5-what-happens-for-lower-rated-investment-grade-bonds"&gt;Q5. What happens for lower-rated investment-grade bonds?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;For lower-rated investment-grade bonds, the evidence for both cross-asset QE supply effects and within-asset CE supply effects is weaker, and the paper suggests that CE&amp;rsquo;s stimulation of new bond issuance may have counterbalanced its positive price effects for these bonds through the dilutive effect of new supply.&lt;/strong&gt; The mechanism is that CE&amp;rsquo;s reduction in the cost of corporate bond issuance for lower-rated firms induced enough new bond issuance to partially offset the price increase from CBPS purchases, consistent with the issuance channel being most active for the market segment where CBPS created the largest pricing improvement. This dilution effect implies that the net price benefit of CE for lower-rated bonds is smaller than the gross supply-effect estimate.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;stock effect&lt;/strong&gt; : the cumulative effect of the total quantity of bonds purchased under a program on bond prices and spreads, estimated over the full life of the program; in this paper the stock effect is significantly larger than the announcement-day effect, consistent with gradual portfolio rebalancing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;cross-asset supply effect&lt;/strong&gt; : the pass-through of government bond (gilt) purchase supply shocks to the prices of corporate bonds — an asset class not directly targeted by QE; the paper provides the first estimates of this cross-market supply channel at the security level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;credit spread&lt;/strong&gt; : the difference between the yield on a corporate bond and the yield on a risk-free government bond of the same maturity; the paper finds QE pass-through is generally limited to the default-free component of corporate yields rather than the credit spread.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;default-free component&lt;/strong&gt; : the part of a corporate bond&amp;rsquo;s yield attributable to the risk-free interest rate rather than credit risk; the paper finds that QE supply shocks affect this component but generally leave the credit spread unchanged in normal market conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;within-asset substitution effect&lt;/strong&gt; : the price effect of CE purchases on the bonds directly purchased and their corporate bond substitutes, as distinct from cross-asset effects; the paper finds this effect is substantially larger in magnitude than the cross-asset QE effect on corporate bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;issuance channel&lt;/strong&gt; : the mechanism by which lower corporate borrowing costs induced by CE stimulate new corporate bond issuance; the paper finds this channel operates under CE (CBPS) but not under QE (gilt purchases).&lt;/p&gt;</description></item><item><title>Customer accumulation, returns to scale, and secular trends</title><link>https://macropaperwarehouse.com/papers/customer-accumulation-returns-to-scale-and-secular-trends/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/customer-accumulation-returns-to-scale-and-secular-trends/</guid><description>&lt;p&gt;This paper asks how rising returns to scale in production contributed to three concurrent U.S. secular trends since 1980: declining business dynamism, rising markups, and growing firm expenditures on customer acquisition. The author constructs a firm dynamics model in the Hopenhayn (1992) tradition with endogenous entry and exit, heterogeneous markups, and customer accumulation grounded in directed search in the product market. Firms compete for customers through both prices and selling activities; larger firms gain a competitive edge when returns to scale rise because their marginal costs fall more than those of smaller firms—even though the technological shift is uniform across firms. This demand-based channel triggers winners-and-losers dynamics and the rise of superstar firms.&lt;/p&gt;
&lt;p&gt;The empirical foundation rests on Compustat data for U.S. publicly traded firms (1977–2014) and Business Dynamics Statistics (BDS) for aggregate and sector-level dynamism measures. Production-function estimation using Ackerberg, Caves, and Frazer (2015) augmented with sales-share controls documents that aggregate returns to scale rose from approximately 1.0 in 1980 to approximately 1.05 by 2014—a within-sector increase, not a reallocation effect. Over the same period, the cost-weighted markup rose by 42%, the firm entry rate fell by 33%, the excess reallocation rate fell by 29%, and selling costs relative to production costs rose by 60%–90% depending on the measure used.&lt;/p&gt;
&lt;p&gt;The model is calibrated to 1980 steady-state moments (firm life-cycle patterns, markups, entry and reallocation rates). A 5% increase in returns to scale—matching the empirical estimate—accounts for: a +15 percentage point rise in the average cost-weighted markup (vs. +42% in the data); a 33% decline in the entry rate (exactly matching the data); a 21% decline in the reallocation rate (vs. 29% in the data); and a 23% increase in selling costs relative to production costs (vs. 60%–90% in the data). The model also generates a 53% rise in the share of firms aged 11 years or older (vs. 50% in the data) and a 58% decline in the employment share of firms aged 5 years or younger (vs. 56% in the data), closely tracking the aging of the U.S. firm population. Firm-level responsiveness to productivity shocks declines by 0.08 in the model, versus about 0.01 in Compustat and 0.09 in Decker et al. (2020).&lt;/p&gt;
&lt;p&gt;Sector-level panel regressions with sector fixed effects confirm the model&amp;rsquo;s directional predictions: within-sector increases in returns to scale are associated with lower entry rates (coefficient −2.89, significant at 1%), lower reallocation rates (−1.16, significant at 1%), higher markups (+3.15, significant at 1%), and higher selling costs relative to production costs (+1.85 for the advertising-based measure; +8.52 for adjusted SG&amp;amp;A).&lt;/p&gt;
&lt;p&gt;A key scope condition is that the model yields a constrained-efficient allocation: directed search and full internalization of returns to scale imply decentralized equilibrium efficiency, making the paper a laboratory for assessing how far efficient firm responses to technological change can explain the secular trends without invoking market failures. The model fits the post-2000 transition dynamics better than the 1980s–1990s period, and explains a substantial but incomplete share of the trends, suggesting complementary—possibly inefficient—forces also contributed.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism through which rising returns to scale generate winners-and-losers dynamics?&lt;/p&gt;
&lt;p&gt;A: The marginal cost of production under increasing returns to scale (alpha &amp;gt; 1) is MC(z,n) = l(n,z)^(1−alpha) × (1/alpha) × (W/e^z), which depends on firm size l(n,z). A uniform rise in alpha rotates the marginal cost schedule clockwise by firm size: larger firms see a proportionally larger cost reduction than smaller firms, even though the technological change is identical across all firms. Because firms compete for the same pool of customers, this asymmetric cost advantage allows large firms to offer lower prices while sustaining higher margins, attracting customers away from small firms. The result is a demand-based channel that generates winners-and-losers dynamics and increases market concentration.&lt;/p&gt;
&lt;p&gt;Q: How does the model capture customer accumulation, and why is it central to the paper&amp;rsquo;s argument?&lt;/p&gt;
&lt;p&gt;A: The model introduces directed search in the product market, where firms post advertisements and customers—including those already matched with a firm—choose which submarket to enter by trading off offered utility against matching probability. A constant-returns-to-scale matching function governs match creation; in submarket with tightness theta, customers match with probability m(theta) = theta(1+theta)^(−1) and firms attract customers with probability q(theta) = (1+theta)^(−1). The customer accumulation motive creates an investment-harvest trade-off: firms can either post high promised utility (low prices) to grow their customer base or extract surplus through high prices. Rising returns to scale amplify large firms&amp;rsquo; ability to resolve this trade-off favorably, linking the technological change directly to markup dynamics, entry incentives, and selling expenditures.&lt;/p&gt;
&lt;p&gt;Q: What is the directed search framework&amp;rsquo;s role in ensuring equilibrium uniqueness and efficiency?&lt;/p&gt;
&lt;p&gt;A: The author introduces firm-side commitment contracts—specifying price, separation probability, and continuation utility contingent on productivity realizations—combined with directed search. Because search is directed on both sides and firms fully internalize returns to scale, the decentralized equilibrium is constrained-efficient. This delivers uniquely determined heterogeneous prices in equilibrium (solving the indeterminacy problem common in customer-market models) and establishes the paper&amp;rsquo;s efficient-mechanism benchmark: it tests how far profit-maximizing firm responses to technological change—without any market failure—can account for the secular trends.&lt;/p&gt;
&lt;p&gt;Q: How are prices structured in the model, and what life-cycle pattern do they generate?&lt;/p&gt;
&lt;p&gt;A: Each firm charges two distinct prices in each period: one to incumbent customers (the same for all incumbents, since they are identical conditional on being attached to the same firm) and one to newly acquired customers (which varies based on the promised utility in the submarket searched). Firms that are expanding their customer base offer greater promised utility and therefore charge lower prices to attract customers; firms harvesting their existing base charge higher prices. Because firms enter small and grow, this dynamic generates a price life cycle: young firms invest via low prices and mature firms harvest through higher prices, which the model reproduces as a rising markup pattern over the firm life cycle—an untargeted moment the model fits well.&lt;/p&gt;
&lt;p&gt;Q: What does the calibration target and what untargeted moments does the model reproduce?&lt;/p&gt;
&lt;p&gt;A: The model is calibrated to 1980 using: the number of employees of entrant firms (pinning entry customer base n_e), employees of age-5 firms (pinning convex cost chi_1), share of firms aged 11+ years (pinning chi_2), average firm size (operating cost f), entry rate (entry cost kappa), excess reallocation rate (exit shock delta), and average cost-weighted markup (linear cost c). Untargeted moments reproduced include: a sales-weighted markup of 0.28 (vs. 0.25 in De Loecker et al. 2020), endogenous customer turnover of approximately 9% (vs. 15% in Gourio and Rudanko 2014), and an elasticity of customer base shrinkage to price of 0.08 (within the 0.01–0.16 range from Paciello et al. 2019). The model also matches markup and selling-cost life-cycle patterns that are typically overlooked.&lt;/p&gt;
&lt;p&gt;Q: How large is the quantitative contribution of the 5% rise in returns to scale to each secular trend?&lt;/p&gt;
&lt;p&gt;A: Comparing the 1980 steady state (alpha = 1) to the 2014 steady state (alpha = 1.05): the average cost-weighted markup rises by 15% in the model versus 42% in the data; the entry rate declines by 33% in the model, exactly matching the data; the reallocation rate declines by 21% in the model versus 29% in the data; and selling costs relative to production costs rise by 23% in the model versus 60%–90% in the data. The model thus explains a substantial share of each trend while leaving a residual requiring additional mechanisms.&lt;/p&gt;
&lt;p&gt;Q: How does the model explain the aging of U.S. firms, and how well does it match the data?&lt;/p&gt;
&lt;p&gt;A: The winners-and-losers mechanism shifts activity toward larger, older firms, which mechanically ages the firm population. The model generates a 53% increase in the share of firms aged 11 years or older (vs. 50% in the data) and a 58% decline in the employment share of firms aged 5 years or younger (vs. 56% in the data). This aging arises because rising returns to scale increase the cost of customer acquisition, acting as a barrier to entry that disproportionately hurts new, small firms while allowing large incumbents to remain viable at lower productivity thresholds.&lt;/p&gt;
&lt;p&gt;Q: What is the channel through which rising returns to scale reduce business dynamism specifically?&lt;/p&gt;
&lt;p&gt;A: The unequal reduction in marginal costs intensifies competition for customers and raises customer acquisition costs. This operates through two simultaneous effects on the exit threshold: (i) lower marginal costs allow large firms to remain viable at lower productivity levels despite higher customer acquisition costs; and (ii) heightened competition forces smaller firms to require higher productivity to survive in a market that has become increasingly costly to operate in. Higher customer acquisition costs therefore function as an endogenous barrier to entry, reducing the entry rate and the reallocation of resources across firms.&lt;/p&gt;
&lt;p&gt;Q: Does the model attribute the secular trends entirely to efficient firm behavior, and what does it conclude about residual explanations?&lt;/p&gt;
&lt;p&gt;A: No. The model is explicitly designed as a constrained-efficient benchmark, and the paper finds that while rising returns to scale account for a substantial share of the trends—particularly in magnitude—the transition dynamics show a less accurate fit before the 2000s. The author concludes that complementary mechanisms, likely involving inefficiencies (such as market power from horizontal product differentiation or barriers to entry beyond those captured by the model), played a significant role in the earlier evolution of these trends and in the portion of the trends not explained by the efficient channel.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the rising returns to scale finding, and what are its limitations?&lt;/p&gt;
&lt;p&gt;A: Production-function estimation using the Ackerberg-Caves-Frazer method with sales-share controls on Compustat data shows returns to scale rising from approximately 1.0 in 1980 to approximately 1.05 by 2014, driven primarily by within-sector increases rather than reallocation toward high-returns sectors. A translog production function finds limited evidence of heterogeneous increases across firm sizes within Compustat. However, Compustat predominantly covers large publicly traded firms; smaller firms outside the sample may have experienced minimal or no increase in returns to scale. If technology adoption involves fixed costs, the aggregate impact could be larger than estimated, meaning the quantitative exercises likely represent a conservative lower bound.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to and extend the directed search literature in product markets?&lt;/p&gt;
&lt;p&gt;A: The paper builds on Gourio and Rudanko (2014) and Roldan-Blanco and Gilbukh (2020), where customers are locked in once matched, by introducing labor-search tools from Schaal (2017) to allow: (i) incumbent customer switching between firms at rates of 10%–25% annually (Gourio and Rudanko 2014), and (ii) a non-zero price sensitivity of incumbent customers (Paciello et al. 2019). It also allows firms to invest in demand through selling expenditures, which prior directed search models in product markets typically abstracted from, making it possible to study how technological changes affect customer reallocation and firms&amp;rsquo; cost structures jointly.&lt;/p&gt;
&lt;p&gt;Customer capital: The stock of customers a firm has accumulated through prior selling and pricing decisions; treated as a state variable that firms invest in (by offering low prices and spending on advertisements) or harvest from (by charging high markups), with a customer turnover rate estimated at 10%–25% annually in the literature.&lt;/p&gt;
&lt;p&gt;Directed search in the product market: A market structure in which both firms and customers choose which submarket (indexed by the promised utility level) to enter, trading off match probability against terms; delivers constrained-efficient equilibrium and uniquely determined heterogeneous prices.&lt;/p&gt;
&lt;p&gt;Investment-harvest trade-off: The firm&amp;rsquo;s dynamic choice between offering high promised utility (low prices, low current markups) to grow the customer base versus extracting surplus through high prices from an existing customer base; shaped by the firm&amp;rsquo;s current size, productivity, and the cost structure implied by returns to scale.&lt;/p&gt;
&lt;p&gt;Returns to scale (alpha): The curvature of the production function y = e^z × l^alpha; equals 1.0 under constant returns and approximately 1.05 by 2014 in the empirical estimates; the paper&amp;rsquo;s central technological change parameter, whose rise disproportionately reduces marginal costs for larger firms.&lt;/p&gt;
&lt;p&gt;Winners-and-losers dynamics: The reallocation of customers and market share from small to large firms triggered by the asymmetric cost advantage large firms obtain when returns to scale rise; the demand-based channel through which superstar firms emerge.&lt;/p&gt;
&lt;p&gt;Cost-weighted markup: The average markup aggregated using each firm&amp;rsquo;s costs as weights, as opposed to sales-weighted markup; the primary measure of market power used in the paper, rising by 42% in the data between 1980 and 2014.&lt;/p&gt;
&lt;p&gt;Constrained-efficient allocation: An equilibrium outcome in which, given the frictions present (search-and-matching in the product market), no social planner operating under the same constraints could improve welfare; the paper uses this as a benchmark to assess how far efficient firm responses explain secular trends without invoking market failures.&lt;/p&gt;
&lt;p&gt;Selling costs relative to production costs: The ratio of customer acquisition expenditures (advertising or adjusted SG&amp;amp;A) to cost of goods sold; rose by 60%–90% in the data between 1980 and 2014 and by 23% in the model&amp;rsquo;s steady-state comparison.&lt;/p&gt;</description></item><item><title>Customer Acquisition, Business Dynamism and Aggregate Growth</title><link>https://macropaperwarehouse.com/papers/customer-acquisition-business-dynamism-and-aggregate-growth/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/customer-acquisition-business-dynamism-and-aggregate-growth/</guid><description>&lt;p&gt;This paper asks whether firm-level customer acquisition — distinct from productivity differences — is a quantitatively important driver of aggregate economic growth, and whether ignoring it distorts predictions about growth policy efficacy. The authors build a novel endogenous growth model in which innovating firms must first accumulate customers to sell their products, with two channels of customer acquisition operating simultaneously: costly sales-and-marketing expenditure and below-static-markup pricing (sales-driven accumulation). The model is estimated using indirect inference against a combination of aggregate data (U.S. real GDP per worker growth of 1.43% annually, 1979–2019), Business Dynamics Statistics (BDS) life-cycle profiles, and firm-level data from Compustat matched to Capital IQ&amp;rsquo;s sales-and-marketing expense records covering 1997–2019.&lt;/p&gt;
&lt;p&gt;The benchmark model yields four closed-form propositions. First, a &amp;ldquo;firm-level market size effect&amp;rdquo;: higher customer retention raises a firm&amp;rsquo;s future profit base, strengthening incentives to conduct R&amp;amp;D. Second, an endogenous feedback loop: more productive firms invest more in customer acquisition, which expands their customer base and further strengthens R&amp;amp;D incentives. Third, customer base accumulation raises aggregate growth, but only indirectly — by boosting firm-level innovation rates — since aggregate productivity is a customer-weighted average of firm productivity levels. Fourth, the sensitivity of innovation to R&amp;amp;D subsidies increases with customer base growth, because firms with faster-growing customer bases discount future profits less steeply.&lt;/p&gt;
&lt;p&gt;In the quantitatively estimated full model — which relaxes the benchmark&amp;rsquo;s perfect-scaling restrictions and endogenizes firm entry and exit — the authors conduct two decomposition exercises. In a counterfactual scenario where expected customer retention is reduced to make average customer base growth zero among continuing businesses, firm-level innovation rates fall by approximately 40% relative to the full model. Of this 40% decline, only about 6 percentage points are attributable to the direct firm-level market size effect alone; the vast majority is driven by the endogenous feedback loop between innovation and customer acquisition. In a second decomposition focused on aggregate growth, the firm-level market size effect and a reallocation effect — whereby the feedback loop concentrates customers among high-productivity firms — together account for 44% of aggregate growth in the full model.&lt;/p&gt;
&lt;p&gt;On policy, the authors compare R&amp;amp;D subsidies and operational subsidies in the full model against an otherwise identical model that ignores customer accumulation. R&amp;amp;D subsidies are approximately twice as effective at boosting aggregate growth in the full model as in the model without customer accumulation. Conversely, operational subsidies produce a stronger decline in aggregate growth in the full model than in the benchmark-without-customer-accumulation, because aggregate growth in the full model is a customer-weighted average of firms&amp;rsquo; productivity growth rates, making the joint distribution of productivity and customer bases the relevant object of study.&lt;/p&gt;
&lt;p&gt;Firm-level data support three empirical predictions. Marketing expenditure, R&amp;amp;D intensity, and markups co-move in model-consistent directions both contemporaneously and over the life cycle. The estimated relative weight of marketing versus pricing as channels of customer accumulation is γ = 0.745, indicating marketing is the dominant channel. A model-consistent proxy for the severity of customer-base frictions, estimated in the cross-section of industries, shows that stronger frictions correlate with lower R&amp;amp;D investment, as predicted. The customer-base depreciation rate is estimated at ζ = 0.375, R&amp;amp;D cost scaling at σx = 1.264, and marketing cost scaling at σa = 1.405.&lt;/p&gt;
&lt;p&gt;Q: What is the firm-level market size effect and why does it arise?
A: When a firm retains more customers, successful innovations apply to a larger market, raising the profitability of each unit reduction in production costs. This increases the marginal benefit of R&amp;amp;D investment. In the benchmark model, Proposition 2(a) shows formally that firm-level innovation increases with customer base growth: ∂x/∂(1−ζ) &amp;gt; 0, where ζ is the customer separation rate.&lt;/p&gt;
&lt;p&gt;Q: What is the endogenous feedback loop between innovation and customer accumulation?
A: More productive firms have lower production costs and can therefore afford greater investment in marketing and can set lower markups, both of which attract more customers. A larger customer base raises firm value and strengthens R&amp;amp;D incentives further (Proposition 2(b)). This bidirectional feedback means that productivity growth and customer accumulation are jointly determined in equilibrium, not independent processes.&lt;/p&gt;
&lt;p&gt;Q: How large is the quantitative effect of customer accumulation on firm-level innovation?
A: In the counterfactual where expected customer retention is reduced so that average customer base growth among continuing firms is zero, firm-level innovation rates are approximately 40% lower than in the full model. Of this, only about 6% (of the total drop) is attributable to the direct market size effect in isolation; the feedback loop accounts for the remaining roughly 34 percentage points.&lt;/p&gt;
&lt;p&gt;Q: How much of aggregate growth do customer-acquisition channels explain?
A: The firm-level market size effect and a customer reallocation effect together account for 44% of aggregate growth in the full model. The firm-level market size effect alone reduces aggregate growth by about one-fifth (20%) in the relevant counterfactual. The reallocation effect — by which productive firms accumulate disproportionate market share — contributes the remainder of the 44%.&lt;/p&gt;
&lt;p&gt;Q: What is the reallocation channel for aggregate growth?
A: Because highly productive firms can invest more in customer acquisition, the feedback loop endogenously concentrates customers (market shares) among high-productivity firms. Since aggregate productivity in the model is a customer-weighted average of firm productivity levels (equation 16), this reallocation raises aggregate productivity growth beyond what the firm-level R&amp;amp;D incentive effect alone would produce.&lt;/p&gt;
&lt;p&gt;Q: How does customer accumulation change the efficacy of R&amp;amp;D subsidies?
A: R&amp;amp;D subsidies are approximately twice as effective at raising aggregate growth in the full model (with customer accumulation) as in an otherwise identical model that ignores customer accumulation. The mechanism is Proposition 4(b): faster customer base growth makes firms weight future profits more heavily, increasing their sensitivity to any change in R&amp;amp;D costs, including that brought about by a government subsidy.&lt;/p&gt;
&lt;p&gt;Q: What happens to aggregate growth under operational subsidies in the two models?
A: Operational subsidies lead to a stronger decline in aggregate growth in the full model than in the model without customer accumulation. The reason is that aggregate growth in the full model depends on the joint distribution of firm productivity and customer bases; operational subsidies alter this distribution in ways that reduce the customer-weighted average of productivity growth rates, an effect absent when customer accumulation is ignored.&lt;/p&gt;
&lt;p&gt;Q: How are the two customer-acquisition channels (marketing and pricing) measured empirically?
A: Marketing is measured using sales-and-marketing expenses from Capital IQ, available for 48% of the Compustat sample (34% report directly; an additional 14% report advertising or marketing sub-components). Markups are measured following De Loecker et al. (2020) as the inverse share of variable costs in sales multiplied by the cost-output elasticity, with variation across firms identified from balance sheet data under the assumption that cost-output elasticities are constant within industry-year cells.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated relative strength of marketing versus pricing in customer accumulation?
A: The relative weight on marketing is γ = 0.745, estimated by targeting the coefficient βµ = 0.04 (standard error 0.01) from a reduced-form regression of firm-level sales growth on changes in markups (equation 29). This implies that marketing is the dominant channel, consistent with evidence in Afrouzi et al. (2021) and Fitzgerald et al. (forthcoming).&lt;/p&gt;
&lt;p&gt;Q: What is the estimated customer-base depreciation rate and how is it disciplined?
A: The depreciation rate ζ is estimated at 0.375, targeted to match average firm-level employment growth from the BDS. This falls toward the lower end of existing estimates, which range from about 0.3 to 0.7 across studies.&lt;/p&gt;
&lt;p&gt;Q: How do R&amp;amp;D costs scale with firm size in the estimated model?
A: The R&amp;amp;D cost scaling parameter is σx = 1.264, estimated by targeting the reduced-form coefficient of −0.01 from a regression of log R&amp;amp;D intensity on log sales with industry-time fixed effects (equation 28). This is close to the estimate in Akcigit and Kerr (2018).&lt;/p&gt;
&lt;p&gt;Q: How do marketing costs scale with firm size?
A: The marketing cost scaling parameter is σa = 1.405, estimated by targeting a reduced-form coefficient of −0.01 from a regression of log sales-and-marketing intensity on log sales with industry-time fixed effects (equation 30).&lt;/p&gt;
&lt;p&gt;Q: What empirical co-movement evidence supports the model&amp;rsquo;s predictions?
A: In the cross-section of firms, marketing expenditure, R&amp;amp;D intensity, and markups all co-move in model-predicted directions, for both static (contemporaneous) relationships and dynamic (life-cycle) patterns. Additionally, a model-consistent industry-level proxy for the severity of customer-base frictions shows that stronger frictions are associated with lower R&amp;amp;D investment, as the model predicts.&lt;/p&gt;
&lt;p&gt;Q: How does endogenous firm exit work in the full model and why does it differ from standard models?
A: Firms pay a stochastic per-period operational cost and exit when that cost exceeds a threshold κ*_j = v(q_j, b_j)/W. Unlike standard growth models where exit depends only on productivity, here the exit threshold depends on both productivity and accumulated customers, so customer loss can trigger exit even for relatively productive firms.&lt;/p&gt;
&lt;p&gt;Q: What data sources are used and what are their key limitations?
A: The three primary firm-level sources are the Census Bureau&amp;rsquo;s BDS (broad coverage, employment-focused), Compustat (rich financial data but limited to publicly traded firms and lacking direct customer-acquisition measures), and Capital IQ (sales-and-marketing expenses available from 1997, matched to 91% of the Compustat sample). To address Compustat&amp;rsquo;s non-representativeness, employment-based weights aligning Compustat and BDS firm-size distributions are applied when computing model moments against Compustat targets.&lt;/p&gt;
&lt;p&gt;Firm-level market size effect: The mechanism by which higher customer retention raises a firm&amp;rsquo;s future profit base — because lower production costs from successful innovation apply to a larger market — thereby strengthening incentives to conduct R&amp;amp;D. This is the primary channel linking customer accumulation to innovation.&lt;/p&gt;
&lt;p&gt;Customer base (b_j): The mass of household members consuming a firm&amp;rsquo;s product variety, which varies endogenously across firms. It enters demand directly (equation 4) and serves as a state variable in the firm&amp;rsquo;s value function alongside productivity.&lt;/p&gt;
&lt;p&gt;Endogenous feedback loop: The bidirectional reinforcement between productivity growth and customer accumulation. More productive firms invest more in customers; a larger customer base raises the value of innovation; higher innovation raises productivity further.&lt;/p&gt;
&lt;p&gt;Reallocation effect: The concentration of customers (market shares) toward high-productivity firms that arises endogenously from the feedback loop, contributing to aggregate growth because aggregate productivity is a customer-weighted average of firm-level productivity.&lt;/p&gt;
&lt;p&gt;Customer-base depreciation rate (ζ): The exogenous rate at which a firm loses its existing customers each period, estimated at 0.375 in the paper&amp;rsquo;s calibration. It governs the baseline speed of customer attrition and is the key parameter for the firm-level market size effect.&lt;/p&gt;
&lt;p&gt;Sales-and-marketing expenses: Expenditures on sales force, brand development, customer service, advertising, and customer data acquisition — measured from Capital IQ — that directly drive marketing-based customer accumulation (the dominant channel with estimated weight γ = 0.745).&lt;/p&gt;
&lt;p&gt;Perfect scaling (Assumption 1): The benchmark restriction that R&amp;amp;D and marketing costs, and the sales-driven customer accumulation benefit, all scale one-for-one with a composite of firm productivity and customer base. This assumption enables closed-form solutions and is relaxed in the full model using estimated scaling parameters.&lt;/p&gt;</description></item><item><title>De Gustibus and Disputes about Reference Dependence</title><link>https://macropaperwarehouse.com/papers/de-gustibus-and-disputes-about-reference-dependence/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/de-gustibus-and-disputes-about-reference-dependence/</guid><description>&lt;p&gt;This paper examines whether heterogeneity in individual gain-loss attitudes — the degree to which people weigh losses more or less severely than equivalent gains — contaminates prior tests of expectations-based reference dependence (EBRD). The central question is: do prior experiments that appear to yield mixed or null evidence against EBRD actually reflect a failure of the expectations-based reference point, or instead reflect a methodological flaw — the implicit assumption that all individuals are uniformly loss averse?&lt;/p&gt;
&lt;p&gt;All prior tests of EBRD models (e.g., Kőszegi and Rabin 2006, 2007) have proceeded under what the authors call &amp;ldquo;universal loss aversion,&amp;rdquo; the assumption that every individual weighs losses more heavily than commensurate gains (λ &amp;gt; 1). The authors argue that this assumption — a form of the classic De Gustibus conjecture — is empirically incorrect and theoretically distorting: within EBRD designs, loss-averse and gain-seeking subjects are predicted to respond in opposite directions to expectations manipulations, so aggregating across them suppresses or reverses treatment effects.&lt;/p&gt;
&lt;p&gt;The authors run two pre-registered laboratory experiments totaling 1,524 subjects. The labor supply experiment (N = 500, UC San Diego) uses a two-stage design. Stage 1 elicits each subject&amp;rsquo;s gain-loss attitude parameter λ_i from their effort responses to fixed versus uncertain piece rates in a real-effort transcription task, exploiting the prediction that loss-averse workers reduce effort under wage uncertainty while gain-seeking workers increase it. Stage 2 manipulates expectations by varying the probability of a high outside payment (p = 0.05 in Condition Low vs. p = 0.45 in Condition High), holding the piece-rate probability constant at 50%; under EBRD, this shifts the reference point and should change effort in a direction governed by λ_i.&lt;/p&gt;
&lt;p&gt;The exchange experiment (N = 1,024, University of Bonn, with a pre-registered 2018 replication of N = 417) uses Stage 1 preference statements over randomly endowed objects to estimate λ_i, and Stage 2 manipulates expectations via a 0% vs. 50% probability of forced exchange. Under EBRD, loss-averse subjects should become more willing to exchange in the High condition; gain-seeking subjects should become less willing.&lt;/p&gt;
&lt;p&gt;Both experiments document substantial heterogeneity in gain-loss attitudes. In the labor supply study, approximately 70.6% of subjects exhibit loss aversion (λ̂ &amp;gt; 1) and 29.4% exhibit gain-seeking (λ̂ &amp;lt; 1), with an average structural estimate of λ̂ = 1.65 and median 1.66. In the exchange study, 76% are loss averse and 24% are gain-seeking, with mean λ̂ = 1.49 and median 1.34. Lottery-based elicitation in the labor supply experiment yields 28% gain-seeking, consistent with prior literature estimates of roughly 22% gain-seeking from Chapman et al. (2018).&lt;/p&gt;
&lt;p&gt;Crucially, Stage 1 gain-loss attitudes are strongly predictive of Stage 2 treatment effects in both experiments. In the labor supply study, the aggregate treatment effect of approximately 26% greater effort in Condition High — reproducing Abeler et al. (2011) — masks strongly heterogeneous responses: higher λ̂ predicts larger positive treatment effects (raw correlation ρ = 0.18, p &amp;lt; 0.01), and controlling for heterogeneous gain-loss attitudes raises R² by more than a factor of 10. In the exchange study, the aggregate treatment effect is precisely zero (coefficient = 0.00, clustered s.e. = 0.03), a result that prior literature would interpret as contradicting EBRD; but once gain-loss heterogeneity is accounted for, treatment effects are strongly positive for loss-averse subjects and negative for gain-seeking subjects, again raising R² by more than a factor of 10.&lt;/p&gt;
&lt;p&gt;Gain-seeking subjects exhibit negative treatment effects in the exchange study, consistent with EBRD predictions, but in the labor supply study the average treatment effect for gain-seeking subjects remains slightly positive, representing a partial deviation from the model&amp;rsquo;s quantitative predictions. The authors interpret this as evidence that expectations-based reference points are an important but likely incomplete determinant of behavior, with attention-based, status-quo-based, or anchoring-based reference points potentially playing supplementary roles.&lt;/p&gt;
&lt;p&gt;Q: What is the central methodological problem with prior tests of expectations-based reference dependence?&lt;/p&gt;
&lt;p&gt;A: All prior tests assumed universal loss aversion — that every individual has λ &amp;gt; 1, i.e., weighs losses more severely than equivalent gains. The authors show this is both empirically wrong (roughly 24–29% of subjects are gain-seeking across both studies) and theoretically distorting: within EBRD designs, gain-seeking individuals are predicted to respond in the opposite direction from loss-averse individuals, so averaging across heterogeneous types can suppress, zero out, or even reverse the true treatment effect. This makes standard aggregate tests of EBRD unreliable.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure gain-loss attitudes in the labor supply experiment?&lt;/p&gt;
&lt;p&gt;A: In Stage 1, subjects make 30 effort decisions across fixed piece rates and uncertain piece rates with the same mean. Under the Kőszegi-Rabin CPE model, a loss-averse individual reduces effort when the wage is uncertain (because outcomes can fall below the reference point), while a gain-seeking individual increases effort under uncertainty. The authors estimate individual-level parameters by regressing log(e_i + 10) on log(w) and Δw/w in a random-coefficients framework; the coefficient l̂_i on Δw/w is the reduced-form measure of gain-loss attitudes, with λ̂_i = 1 + 4·(l̂_i/ĝ_i) as the structural estimate. The correlation between the two measures is ρ = 0.85 (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure gain-loss attitudes in the exchange experiment?&lt;/p&gt;
&lt;p&gt;A: In Stage 1, subjects are randomly endowed with one of two objects and provide three unincentivized preference statements (relative liking, relative wanting, and hypothetical choice) before any possibility of exchange is introduced. Under CPE, an individual endowed with object X will prefer X to the extent that (1 + λ_i) − 2(Y/X) &amp;gt; 0, so subjects with higher λ_i should more strongly favor their endowment. A principal components analysis reduces the three statements to one factor (capturing ~70% of variation), and residuals from regressing that factor on object assignment constitute the reduced-form measure l̂_i. The structural estimate λ̂_i is obtained via a mixed logit using a log-normal distribution for λ_i; the reduced form and structural measures are correlated at r = 0.95 (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Q: What does the distribution of gain-loss attitudes look like across the two experiments?&lt;/p&gt;
&lt;p&gt;A: In the labor supply experiment (N = 453 estimable subjects), 70.6% are loss averse and 29.4% are gain-seeking, with mean λ̂ = 1.65 and median λ̂ = 1.66. In the exchange experiment (N = 1,024), 76% are loss averse and 24% are gain-seeking, with mean λ̂ = 1.49 and median λ̂ = 1.34. A separate lottery-based elicitation in the labor supply study finds 28% gain-seeking subjects. These proportions are consistent with the weighted average of 22% gain-seeking found by Chapman et al. (2018) across seven prior lottery-choice studies.&lt;/p&gt;
&lt;p&gt;Q: What is the aggregate treatment effect in the labor supply experiment, and what does it look like once heterogeneity is accounted for?&lt;/p&gt;
&lt;p&gt;A: Without accounting for gain-loss heterogeneity, Condition High is associated with roughly a 26% increase in effort relative to Condition Low (individual-clustered s.e. = 0.03, p &amp;lt; 0.01), reproducing the Abeler et al. (2011) result and consistent with EBRD under universal loss aversion. However, R² = 0.03. Once interactions of Condition High with l̂_i and λ̂_i are included, R² rises to 0.40 and 0.39 respectively — more than a tenfold increase. Higher λ̂_i predicts larger positive treatment effects (raw correlation ρ = 0.18, p &amp;lt; 0.01), and the interaction of Condition High with λ̂_i is highly significant (F(1,452) = 49.14, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Q: What is the aggregate treatment effect in the exchange experiment, and what does it look like once heterogeneity is accounted for?&lt;/p&gt;
&lt;p&gt;A: Without heterogeneity, the treatment effect of Condition High on the probability of exchanging is precisely 0.00 (clustered s.e. = 0.03), which prior literature would read as a failure of EBRD. Once heterogeneity is introduced via interactions with l̂_i and λ̂_i, the pattern changes markedly: loss-averse subjects show positive treatment effects (greater willingness to exchange in High), while gain-seeking subjects show negative treatment effects (less willingness to exchange in High), consistent with Predictions 4–6. R² again rises by more than a factor of 10. In Condition Low, 38% of subjects exchange, reflecting a significant endowment effect (F(1,1022) = 25.66, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Q: Why does the aggregate treatment effect in the exchange experiment equal zero?&lt;/p&gt;
&lt;p&gt;A: The authors show in Appendix B.4 that the relationship between λ_i and exchange probability treatment effects can be concave — negative effects for gain-seeking subjects can be of greater absolute magnitude than positive effects for loss-averse subjects. With roughly 24% gain-seeking and 76% loss-averse subjects, aggregation can yield a near-zero average even when heterogeneous effects are substantial and directionally consistent with EBRD. This aggregation problem, not a failure of the expectations-based reference point mechanism, explains the null aggregate result.&lt;/p&gt;
&lt;p&gt;Q: Do gain-loss attitudes measured in one domain predict behavior in another domain?&lt;/p&gt;
&lt;p&gt;A: The lottery-based measure of gain-loss attitudes (from Multiple Price Lists administered after the real-effort task in the labor supply experiment) has mean λ̂ = 1.48 and median 1.42, with 28% gain-seeking subjects — proportions similar to the labor supply estimates. However, the correlation between the lottery-based and labor-supply-based structural estimates of λ̂ is only Pearson&amp;rsquo;s r = 0.091 (p = 0.03) and Spearman&amp;rsquo;s ρ = 0.084 (p = 0.075). Furthermore, the lottery measure has no predictive power for Stage 2 treatment effects. This suggests that while the prevalence of gain-seeking is similar across domains, gain-loss attitudes at the individual level are more domain-specific than prior work has appreciated.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address the &amp;ldquo;generated regressor problem&amp;rdquo; when using estimated λ̂_i as a regressor?&lt;/p&gt;
&lt;p&gt;A: Since λ̂_i is itself estimated from Stage 1 data, using it directly as a regressor in Stage 2 regressions treats imprecise preference estimates as ideal data, which can distort inference (the Murphy-Topel problem). The authors address this by bootstrapping the entire pipeline — re-estimating gain-loss attitudes from Stage 1 in each of 500 bootstrap iterations and re-running the Stage 2 regressions — then reporting the average bootstrap coefficient and its standard deviation. The bootstrapped conclusions are qualitatively identical to the original regression results in both experiments.&lt;/p&gt;
&lt;p&gt;Q: What limitations do the authors acknowledge in the EBRD model&amp;rsquo;s fit?&lt;/p&gt;
&lt;p&gt;A: Even after accounting for heterogeneity, the EBRD model does not provide a complete quantitative account of behavior. In the labor supply experiment, gain-seeking subjects exhibit slightly positive average treatment effects (not negative as predicted), and loss-averse subjects&amp;rsquo; empirical treatment effects fall short of theoretical predictions, despite a significant correlation between predicted and empirical treatment effects (ρ = 0.25, p &amp;lt; 0.01). The authors attribute these deviations to potential measurement error (which would attenuate estimated relationships), and to the possibility that reference points have multiple determinants — including status quo-based, attention-based, and anchoring-based factors — beyond expectations alone.&lt;/p&gt;
&lt;p&gt;Q: What are the broader implications for other applications of gain-loss attitudes?&lt;/p&gt;
&lt;p&gt;A: The paper&amp;rsquo;s findings have implications for any application that relies on universal loss aversion as a maintained assumption, including Rabin&amp;rsquo;s (2000) calibration argument for risk aversion at small and large stakes, insurance demand for small losses (Slovic et al., 1977), and preferences for bunched resolution of uncertainty (Kőszegi and Rabin, 2009). Admitting heterogeneity in gain-loss attitudes will require more nuanced predictions in each of these settings. The paper provides a methodology — measuring individual-level gain-loss attitudes within the experimental context of interest — for investigating and controlling for such heterogeneity.&lt;/p&gt;
&lt;p&gt;Q: What design features prevent confounds between Stage 1 measurement and Stage 2 treatment in the exchange experiment?&lt;/p&gt;
&lt;p&gt;A: Stage 1 uses a different pair of objects (USB stick and pens) than Stage 2 (picnic mat and thermos), or vice versa — each subject encounters each pair exactly once, with counterbalancing at the session level. Stage 1 preference statements are unincentivized and made before any possibility of exchange is introduced, so they do not contaminate the Stage 2 expectations manipulation. The random reassignment of objects at the end of Stage 1 generates exogenous variation in endowments, preventing mechanical confounds. The authors also verify that interpreting Stage 1 variation as reflecting heterogeneity in object valuations (rather than gain-loss attitudes) would predict zero heterogeneous treatment effects in Stage 2 — a prediction rejected by the data.&lt;/p&gt;
&lt;p&gt;Expectations-Based Reference Dependence (EBRD): The formulation, due to Kőszegi and Rabin (2006, 2007), in which an individual&amp;rsquo;s reference point is the entire distribution of outcomes they rationally expected, rather than a fixed status quo. Behavior is governed by a Choice-Acclimating Personal Equilibrium (CPE) in which the chosen action is optimal given that the expectation of that action serves as the reference.&lt;/p&gt;
&lt;p&gt;Gain-Loss Attitudes (λ_i): The individual-specific parameter governing how outcomes above versus below the reference point affect utility. Under piecewise-linear gain-loss utility, an outcome that falls short of the reference by z reduces utility by η·λ_i·z, while an outcome above it raises utility by η·z. Loss aversion is λ_i &amp;gt; 1; gain-seeking is λ_i &amp;lt; 1; loss neutrality is λ_i = 1. In this paper, λ_i is treated as heterogeneous across individuals rather than assumed uniform.&lt;/p&gt;
&lt;p&gt;Universal Loss Aversion: The implicit homogeneity assumption maintained in all prior tests of EBRD — that every individual has λ &amp;gt; 1. The authors characterize this as a form of the De Gustibus Non Est Disputandum conjecture applied to gain-loss attitudes, and document that it fails empirically in both experimental settings.&lt;/p&gt;
&lt;p&gt;Choice-Acclimating Personal Equilibrium (CPE): The rational expectations equilibrium concept from Kőszegi and Rabin (2006, 2007) used throughout the paper to derive comparative statics. A choice is a CPE if its expected utility given its own expectation as the reference exceeds the expected utility of any alternative given that alternative&amp;rsquo;s expectation as the reference.&lt;/p&gt;
&lt;p&gt;Reduced-Form Gain-Loss Measure (l̂_i): In the labor supply context, the individual-level OLS coefficient on Δw/w in a log-effort regression — capturing how strongly a subject reduces (or increases) effort under wage uncertainty relative to a fixed wage of equal mean. A positive l̂_i identifies loss aversion; negative identifies gain-seeking. In the exchange context, the analogous measure is the residual from regressing the first principal component of Stage 1 preference statements on object assignment.&lt;/p&gt;
&lt;p&gt;Aggregation Problem: The paper&amp;rsquo;s central methodological contribution — when gain-loss attitudes are heterogeneous and the EBRD treatment effect is non-linear in λ_i, the average treatment effect across a heterogeneous population need not equal the treatment effect at the average λ. In the exchange experiment, the aggregate treatment effect is precisely zero even though loss-averse and gain-seeking subjects each respond in the theoretically predicted (opposite) direction, because the concave relationship between λ_i and the exchange probability treatment effect causes negative gain-seeking effects to dominate in the aggregate.&lt;/p&gt;</description></item><item><title>Debasements and Small Coins: An Untold Story of Commodity Money</title><link>https://macropaperwarehouse.com/papers/debasements-and-small-coins-an-untold-story-of-commodity-money/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/debasements-and-small-coins-an-untold-story-of-commodity-money/</guid><description>&lt;p&gt;This paper applies a multiple-denomination commodity money model — building on Lee, Wallace, and Zhu (2005) — to coinage episodes in late medieval England, and derives two main findings. Shortages of small coins are severely inconvenient because halfpennies and farthings serve not merely as small change but as consumption-smoothing instruments: parameterized to 15th-century England (per-capita silver approximately 35 grams, penny approximately 1 gram), the model shows that adding a halfpenny is highly welfare-improving for poor agents even at infrequent expenditure, and welfare-improving for all agents when monetary transactions occur at least twice weekly. Debasing the penny by 50 percent has approximately the same welfare effect as introducing a halfpenny and replicates the three stylized facts of the debasement puzzle — large minting volumes, cocirculation of old and new coins, and no additional mint inducement — as equilibrium outcomes rather than paradoxes. However, full-bodiedness creates a commitment device against over-issuance that cannot be replicated by sufficiently small coins, since precious metals have a practical lower bound on coin content, so debasement relieves but does not solve the structural small-coin problem, pointing to the historical necessity of a transition to fiat money.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-debasement-puzzle-and-how-does-the-paper-resolve-it"&gt;Q1. What is the debasement puzzle and how does the paper resolve it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The debasement puzzle, documented by Rolnick, Velde, and Weber, consists of three facts: following a debasement, minting volumes rose sharply, old and new coins cocirculated sometimes by weight, and yet people still paid minting fees rather than receiving inducements — all of which are puzzling because the absence of an inducement suggests no straightforward arbitrage.&lt;/strong&gt; The paper resolves the puzzle by modeling a debasement as equivalent to introducing a new denomination: it draws agents to the mint because it supplies the welfare-improving small denomination that agents wanted, not because of a price arbitrage. Cocirculation by weight emerges naturally along the equilibrium path because agents hold both old and new coins in optimal portfolios, and the counterfactual welfare calculation shows the welfare gain from eliminating the shortage is large, explaining why agents willingly pay minting fees to obtain the new coins.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-paper-measure-the-inconvenience-of-a-coin-shortage"&gt;Q2. How does the paper measure the inconvenience of a coin shortage?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper measures inconvenience as the welfare difference between the shortage equilibrium and a hypothetical scenario in which the mint suddenly eliminates the shortage — an unanticipated shock that adds the missing denomination to the coinage structure.&lt;/strong&gt; This counterfactual is tractably computable in the model and directly mirrors the intuition of a historical agent who compares their constrained experience to the imagined experience of having access to the missing coins. Applied to the penny, the model shows that adding a halfpenny (debasing the penny by 50 percent) yields a welfare gain equivalent to the full shortage inconvenience; the result is large for poor agents even at once-monthly expenditure and extends to all agents when transactions are at least twice weekly.&lt;/p&gt;
&lt;h3 id="q3-why-can-debasement-not-permanently-solve-the-small-coin-problem"&gt;Q3. Why can debasement not permanently solve the small-coin problem?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Full-bodied coinage — coins whose face value equals their precious-metal content — constrains the minimum viable coin size: very small coins are practically too easy to counterfeit and too difficult to handle, so debasement merely pushes the lower denomination boundary down without eliminating it.&lt;/strong&gt; The model uses this practical indivisibility of precious metals as the structural constraint that prevents an infinite regress of smaller and smaller coins. This constraint points to why fiat money — which severs the link between value and metallic content — ultimately emerged as the only way to provide arbitrarily small denominations at negligible production cost. The paper frames this as the resolution to the historical &amp;ldquo;big problem of small change.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;debasement puzzle&lt;/strong&gt; : the simultaneous occurrence of unusually large minting volumes and cocirculation of old and new coins following a debasement, without any additional mint inducement; resolved in this paper as the equilibrium response to supplying a welfare-improving small denomination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;full-bodiedness&lt;/strong&gt; : the property of commodity coins whose face value equals their precious-metal content; acts as a commitment device against over-issuance in the model but creates a practical indivisibility constraint on the minimum coin size.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;multiple-denomination model&lt;/strong&gt; : the Lee-Wallace-Zhu framework extended in this paper; explains the social demand for multiple coin denominations via wide transaction-value heterogeneity and the burden of carrying many coins.&lt;/p&gt;</description></item><item><title>Debiasing and T-Tests for Synthetic Control Inference on Average Causal Effects</title><link>https://macropaperwarehouse.com/papers/debiasing-and-t-tests-for-synthetic-control-inference-on-average-causal-effects/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/debiasing-and-t-tests-for-synthetic-control-inference-on-average-causal-effects/</guid><description>&lt;p&gt;Chernozhukov, Wüthrich, and Zhu propose a debiased synthetic control (SC) estimator and an accompanying self-normalized t-test for making inferences on the average treatment effect on the treated (ATT) in aggregate panel data settings with one treated unit. The inferential target is the time-averaged treatment effect τ = (1/T1) Σ_{t=T0+1}^{T} (Y0t(1) − Y0t(0)), a one-number summary of the overall causal impact that admits standard-form confidence intervals, in contrast to per-period effects (which cannot be consistently estimated with one treated unit) and sharp null hypotheses (which do not inform effect magnitude).&lt;/p&gt;
&lt;p&gt;The method addresses two structural challenges in SC inference. First, the canonical SC estimator τ_SC is biased because the weights are estimated from high-dimensional pre-treatment data, and the bias can be substantial under misspecification. Second, even if true weights were known, constructing standard errors requires estimating the long-run variance (LRV), for which classical estimators such as Newey-West are unreliable in the small samples typical of SC applications.&lt;/p&gt;
&lt;p&gt;The debiasing procedure is a K-fold cross-fitting scheme applied to the pre-treatment period. The pre-treatment sample is split into K consecutive blocks. For each fold k, SC weights w_(k) are estimated on the leave-one-block-out pre-treatment data H_{(-k)}, and a component estimator τ_k is formed as the difference between the post-treatment SC residual (using w_(k)) and the in-block pre-treatment SC residual. The latter serves as an estimator of the bias, which under the model assumptions is stable across the pre- and post-treatment periods. The final estimator τ_hat is the average of τ_k across folds. A self-normalized t-statistic T_K = sqrt(K)(τ_hat − τ)/σ_τ is constructed using the cross-fold variance; its asymptotic distribution is t_{K-1}, so no LRV estimation is required and (1−α) confidence intervals take the textbook form τ_hat ± t_{K-1}(1−α/2) × σ_τ/sqrt(K).&lt;/p&gt;
&lt;p&gt;The t-test is proven valid with both stationary and non-stationary data. With stationary data (Theorem 2), it is valid under arbitrary misspecification. With non-stationary data, validity holds either when all units share a common nonstationarity (Theorem 3, also misspecification-robust) or when units deviate from a common nonstationarity under restrictions on the magnitude and heterogeneity of deviations but SC is correctly specified (Theorem 4). The latter covers heterogeneous deterministic time trends and certain cointegration structures. Researchers therefore need not pre-test for unit roots and select inference procedures accordingly.&lt;/p&gt;
&lt;p&gt;A formal efficiency result (Section 3.3) shows that the asymptotic variance of the debiased SC estimator is no larger than that of difference-in-differences (DID), because SC minimizes prediction error and w* dominates the equal-weight DID vector. The relative asymptotic efficiency (RAE) of the t-test versus DID rises with K: K=3 yields RAE of 63.56%; K=5 yields 82.08%; K=10 yields 92.25%.&lt;/p&gt;
&lt;p&gt;Simulations calibrated to Andersson&amp;rsquo;s (2019) Swedish carbon tax application — T0=30, T1=16, N=14, Gaussian AR(1) errors — show that the t-test at K=3 achieves coverage close to the nominal 90% level across correct-specification and misspecification DGPs, while Newey-West standard errors produce substantial undercoverage (coverage = 0.72–0.84) at moderate to high AR(1) coefficients. The method performs comparably to or better than subsampling (Li, 2020) and synthetic DID (Arkhangelsky et al., 2021), and avoids bandwidth selection.&lt;/p&gt;
&lt;p&gt;In the empirical application, the debiased SC t-test (K=3) applied to annual CO2 emissions from transport across Sweden (treated, 1990) and 14 OECD control countries over 1960–2005 yields a negative and statistically significant ATT, with a 90% confidence interval lying entirely below zero, implying approximately an 11% average reduction in per capita CO2 emissions from transport attributable to the Swedish carbon tax over 1990–2005. The pre-treatment AR(1) coefficient of SC residuals is approximately 0.31, supporting K=3 as appropriate. These findings corroborate and extend Andersson&amp;rsquo;s (2019) permutation-based results by providing a confidence interval for the magnitude of the average effect. The method is implemented in the R package scinference.&lt;/p&gt;
&lt;p&gt;Q: What is the primary inferential target and why is it preferred over per-period effects or sharp nulls?
A: The target is the ATT τ = (1/T1) Σ_{t=T0+1}^{T} (Y0t(1)−Y0t(0)), the time-averaged treatment effect on the treated unit over the post-treatment period. Per-period effects cannot be consistently estimated when there is only one treated unit, yielding wide and uninformative confidence intervals. Sharp nulls (e.g., of no effect whatsoever) are useful starting points but do not inform policy decisions about effect magnitude. The ATT provides an interpretable one-number summary and admits standard-form confidence intervals.&lt;/p&gt;
&lt;p&gt;Q: What are the two main inferential challenges that the paper addresses?
A: First, the canonical SC estimator τ_SC is biased due to estimation error in the high-dimensional weights, even under correct specification, and the bias can be substantial under misspecification. Second, even with known true weights, standard error estimation requires the long-run variance (LRV), for which classical estimators such as Newey-West (1987) and Andrews (1991) are not sufficiently accurate in the small samples typical of SC applications.&lt;/p&gt;
&lt;p&gt;Q: How does the K-fold cross-fitting procedure debias the SC estimator?
A: The pre-treatment period is divided into K consecutive blocks H1,&amp;hellip;,HK. For each fold k, SC weights w_(k) are estimated using leave-one-block-out pre-treatment data H_{(-k)}. The component estimator τ_k subtracts the in-block pre-treatment SC residual (an estimator of the bias in period Hk) from the post-treatment SC residual (using w_(k)). Because the bias is assumed stable across pre- and post-treatment periods, this subtraction removes it. The final estimator τ_hat averages τ_k across k=1,&amp;hellip;,K.&lt;/p&gt;
&lt;p&gt;Q: How does the self-normalized t-statistic avoid LRV estimation?
A: The statistic T_K = sqrt(K)(τ_hat − τ)/σ_τ uses σ_τ = sqrt(1 + Kr/T1) × sqrt[(1/(K−1)) Σ_k (τ_k − τ_hat)^2], which is the cross-fold standard deviation of the component estimators scaled by a factor reflecting the ratio of pre- to post-treatment block lengths. Under the asymptotic theory, T_K converges to a t_{K-1} distribution, which is pivotal and requires no bandwidth or kernel choice. The cross-fold structure acts as a self-normalizer analogous to the fixed-b approach in the LRV literature.&lt;/p&gt;
&lt;p&gt;Q: What does the paper prove about validity with non-stationary data?
A: Theorem 3 establishes that when all units share a common nonstationarity (Assumption 4: Yt(0) = Vt(0)+θt and Xt = Zt+1_N·θt where {Vt(0),Zt} is stationary and θt is unrestricted), T_K → t_{K-1} under arbitrary misspecification. Theorem 4 establishes validity when units deviate from common nonstationarity (Assumption 5) under restrictions on the magnitude and heterogeneity of deviations, but requires SC to be correctly specified. These results jointly imply that researchers need not pre-test for unit roots before applying the t-test.&lt;/p&gt;
&lt;p&gt;Q: How does the paper formally show that debiased SC is more efficient than DID?
A: The pseudo-true SC weights w* minimize mean squared prediction error over W_SC, so the residual variance σ^2_* = E(Yt(0)−Xt&amp;rsquo;w*)^2 ≤ E(Yt(0)−Xt&amp;rsquo;w_DID)^2 = σ^2_DID, where w_DID = (1/N,&amp;hellip;,1/N)&amp;rsquo; is the equal-weight DID vector. This inequality holds regardless of whether SC is correctly specified or not, so the efficiency gain over DID is unconditional. The t-test is also valid when the parallel trends assumption underlying DID is violated, making it more robust.&lt;/p&gt;
&lt;p&gt;Q: What is the trade-off in choosing K, and what does the paper recommend?
A: A larger K produces shorter confidence intervals (higher RAE: 63.56% at K=3 versus 92.25% at K=10) but may reduce coverage accuracy in finite samples because the t_{K-1} approximation improves with K while each block becomes smaller. The paper recommends K=3 as a starting point for typical SC applications where T0 is small, based on simulation evidence showing excellent 90% coverage at K=3. When T0 is moderate or large, K can be increased without loss of coverage accuracy.&lt;/p&gt;
&lt;p&gt;Q: What do the simulations show about the performance of Newey-West standard errors versus the t-test?
A: In simulations calibrated to the Swedish carbon tax application (T0=30, T1=16, N=14, AR(1) errors), the t-test at K=3 achieves coverage close to the nominal 90% level across both correct-specification and misspecification DGPs. Newey-West standard errors produce coverage of only 0.72–0.84 when the AR(1) coefficient of the error process is moderate to high. DID achieves nominal coverage when parallel trends hold but is biased and has poor coverage under violations of parallel trends.&lt;/p&gt;
&lt;p&gt;Q: How does the method compare with Li (2020) subsampling and synthetic DID (Arkhangelsky et al., 2021)?
A: Compared with Li (2020), the t-test allows N to grow with (T0,T1) rather than treating N as fixed, directly corrects for SC estimation bias via cross-fitting, avoids the need to pre-process data for stationarity, and does not require a subsampling bandwidth choice. Compared with SDID (Arkhangelsky et al., 2021), the t-test is simpler, does not require homoskedasticity across units as SDID&amp;rsquo;s placebo variance estimator does, and is developed under a linear prediction model rather than a factor model. Simulations show the t-test performs comparably to or better than both alternatives in the application-calibrated DGP.&lt;/p&gt;
&lt;p&gt;Q: What are the empirical findings for the Swedish carbon tax application?
A: Using annual CO2 emissions from transport for Sweden and 14 OECD control countries over 1960–2005, with T0=30 (1960–1989) and T1=16 (1990–2005), the debiased SC t-test at K=3 yields a negative and statistically significant ATT. The 90% confidence interval lies entirely below zero. The estimated average effect is approximately an 11% reduction in per capita CO2 emissions from transport attributable to the carbon tax over 1990–2005. The pre-treatment SC residuals show an estimated AR(1) coefficient of approximately 0.31, confirming moderate persistence and supporting the use of K=3.&lt;/p&gt;
&lt;p&gt;Q: When does the paper recommend against using the t-test?
A: The paper advises against the t-test when T1 is very small (T1 &amp;lt; 8–10), as asymptotic approximations may be inaccurate; when there are structural breaks shortly after T0 (making the ATT ill-defined); and when SC fit is poor because the treated unit is very different from controls. The method requires T0, T1, N → ∞ for asymptotic validity, and T1 ≥ 10–15 is suggested for reliable finite-sample performance.&lt;/p&gt;
&lt;p&gt;Q: How does the paper cover higher-order improvements in finite samples?
A: Appendix D formally establishes that the coverage error of the confidence interval I_K(1−α) is O(1/T) rather than O(1/sqrt(T)), analogous to the fixed-b approach in the LRV literature. This provides a formal justification for the excellent finite-sample coverage observed in the simulations and distinguishes the t-test from Gaussian approximations whose coverage error is of larger order.&lt;/p&gt;
&lt;p&gt;K-fold cross-fitting debiasing: A procedure that splits the pre-treatment period into K consecutive blocks, estimates SC weights on the leave-one-block-out pre-treatment data for each fold, and subtracts the in-block pre-treatment prediction error as an estimator of the bias. Under the model, the bias is assumed stable across pre- and post-treatment periods, so this subtraction removes it from the final estimator.&lt;/p&gt;
&lt;p&gt;Self-normalized t-statistic: A scale-free test statistic T_K = sqrt(K)(τ_hat − τ)/σ_τ whose denominator is the cross-fold standard deviation of the K component estimators, scaled to account for the ratio of pre-treatment block length to post-treatment period length. The statistic converges to a t_{K-1} distribution without requiring any LRV estimation.&lt;/p&gt;
&lt;p&gt;Average treatment effect on the treated (ATT): The target parameter τ = (1/T1) Σ_{t=T0+1}^{T} (Y0t(1)−Y0t(0)), representing the time-averaged causal effect of the treatment on the treated unit over the post-treatment period. It provides an interpretable one-number summary that admits standard-form confidence intervals, in contrast to per-period effects (not consistently estimable with one unit) and sharp null hypotheses (informative about presence but not magnitude of effect).&lt;/p&gt;
&lt;p&gt;Common nonstationarity: The condition (Assumption 4) that all units share the same nonstationary component θt — formally, Yt(0) = Vt(0)+θt and Xt = Zt+1_N·θt with {Vt(0),Zt} stationary and θt unrestricted. Under this condition, the t-test is valid under arbitrary misspecification of SC weights, without requiring the researcher to specify or pre-test the type of nonstationarity.&lt;/p&gt;
&lt;p&gt;Relative asymptotic efficiency (RAE): The ratio of the asymptotic expected confidence interval length of the debiased SC t-test to a benchmark (taken as K→∞), quantifying the cost in interval length from using a finite K. At K=3, RAE = 63.56%; at K=5, RAE = 82.08%; at K=10, RAE = 92.25%.&lt;/p&gt;
&lt;p&gt;Long-run variance (LRV): The quantity that governs the asymptotic variance of time-averaged quantities in settings with serially correlated data. The paper argues that classical LRV estimators (Newey-West, Andrews) are insufficiently accurate in the small samples typical of SC applications, motivating the self-normalization approach that avoids LRV estimation entirely.&lt;/p&gt;
&lt;p&gt;Pseudo-true SC weights: The population minimizer w* = argmin_{w ∈ W_SC} E(Yt(0)−Xt&amp;rsquo;w)^2, defined as the best linear predictor of the treated unit&amp;rsquo;s counterfactual outcome within the SC simplex constraint. These weights exist and satisfy the efficiency bound even under model misspecification, providing the foundation for the efficiency comparison with DID.&lt;/p&gt;</description></item><item><title>Decision Theory for Treatment Choice Problems with Partial Identification</title><link>https://macropaperwarehouse.com/papers/decision-theory-for-treatment-choice-problems-with-partial-identification/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/decision-theory-for-treatment-choice-problems-with-partial-identification/</guid><description>&lt;p&gt;This paper applies classical statistical decision theory (Wald 1950) to treatment choice problems where the data only partially identify payoff-relevant parameters. The policy maker chooses an action a in [0,1] — interpreted as the share of the population assigned to a new policy — to maximize welfare that is linear in the action. The data are Gaussian, and the key departure from prior literature is that the mean function mapping parameters to data need not be injective, so even infinite data may not reveal the optimal action.&lt;/p&gt;
&lt;p&gt;The paper evaluates decision rules under three classical criteria: admissibility, maximin welfare, and minimax regret (MMR).&lt;/p&gt;
&lt;p&gt;Admissibility result (Theorem 1): Under nontrivial partial identification, every decision rule — however exotic — is welfare-admissible. No rule is dominated. This is a sharp reversal from point-identified settings, where admissibility meaningfully restricts the rule class: in the scalar point-identified case (n=1, m(theta)=theta), Karlin and Rubin&amp;rsquo;s (1956) result implies that any non-threshold rule is dominated. The proof exploits completeness of the Gaussian statistical model: if a dominating rule d&amp;rsquo; existed, it would have to agree almost everywhere with d, yielding a contradiction. Theorem 5 generalizes this result beyond Gaussian likelihoods, tying it to bounded completeness of the statistical model.&lt;/p&gt;
&lt;p&gt;Maximin welfare result (Theorem 2): The maximin criterion selects the no-data rule d(y) = 0 — preserve the status quo regardless of data — whenever the status quo welfare is the infimum over states with non-positive welfare contrast. In the running example, maximin welfare equals zero and is achieved by never assigning the new policy. This echoes critiques from Savage (1951) and Manski (2004) about ultra-pessimism.&lt;/p&gt;
&lt;p&gt;Minimax regret result (Theorem 3): In point-identified problems, the MMR rule is essentially unique and nonrandomized (Canner 1970; Stoye 2009a; Tetenov 2012). Under partial identification, when the identified set is large enough — formally, when I(0) is large enough and there exists mu in the identified set with I(mu) &amp;gt; I(0) — there are infinitely many MMR optimal rules, and any symmetric, weakly increasing MMR rule depending only on the sufficient statistic (w*)^T Y must randomize for some data realizations. Moreover, if I(mu) is differentiable at zero, no linear threshold rule is MMR optimal.&lt;/p&gt;
&lt;p&gt;Least randomizing MMR rule (Theorem 4): Because policy randomization is difficult to implement in practice, the authors uniquely characterize the MMR optimal rule that randomizes least frequently. Among all symmetric, weakly increasing, unimodal MMR optimal rules depending on (w*)^T Y, the rule d*_linear has the smallest randomization region — every other distinct such rule has a strictly wider randomization region. This rule can be profiled-regret dominant over the Stoye (2012a)/Yata (2023) MMR rule (Proposition 2), and the uniformly randomizing rule is inadmissible under profiled regret (Proposition 3). Under some conditions, d*_linear can also be obtained as the MMR rule within a class that penalizes randomized assignments equally (Proposition 4).&lt;/p&gt;
&lt;p&gt;Three applications ground the theory. First, in Ishihara and Kitagawa&amp;rsquo;s (2021) evidence aggregation framework — extrapolating treatment effects from n source countries to a target country — the least randomizing rule randomizes only when estimated bounds on the target treatment effect straddle zero, linking decision rules directly to identified-set estimators. Second, in LATE extrapolation (Mogstad et al. 2018), all decision rules are admissible and IV-based threshold rules are not dominated. Third, in the omitted-variable-bias setting of Diegert et al. (2022), the decision-theoretic breakdown point — the largest confounding magnitude under which the seemingly better policy should be adopted without hedging — tolerates strictly more confounding than Diegert et al.&amp;rsquo;s breakdown point, where the threshold is k = sqrt(pi/2) * sigma.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question?
A: The paper asks how classical statistical decision theory — admissibility, maximin welfare, minimax regret — applies when the data only partially identify the payoff-relevant parameters governing a binary treatment choice. Prior literature had developed these criteria for point-identified settings; this paper characterizes how partial identification fundamentally changes the answers.&lt;/p&gt;
&lt;p&gt;Q: What is the formal framework?
A: The policy maker chooses a in [0,1] (population share assigned to the new policy) with welfare W(a,theta) = a*W(1,theta) + (1-a)*W(0,theta), linear in a. The data are Y ~ N(m(theta), Sigma) with known m and Sigma. Partial identification arises when m is not injective, so distinct parameter values theta and theta&amp;rsquo; with opposite-sign welfare contrasts U(theta) = W(1,theta) - W(0,theta) can produce the same data distribution.&lt;/p&gt;
&lt;p&gt;Q: Why does admissibility lose all refinement power under partial identification?
A: Theorem 1 shows that every decision rule is admissible when there is nontrivial partial identification. The mechanism is Gaussian completeness: if a dominating rule d&amp;rsquo; existed, then for every data distribution in the model, d and d&amp;rsquo; would have equal expected values, which by completeness implies d = d&amp;rsquo; almost everywhere — a contradiction. This relies on the fact that nontrivial partial identification ensures that each data distribution is compatible with both positive and negative welfare contrasts, preventing the construction of a uniformly dominating rule.&lt;/p&gt;
&lt;p&gt;Q: What is the contrast with point-identified settings?
A: In the scalar point-identified case (n=1, m(theta)=theta, W(1,theta)=theta, W(0,theta)=0), Karlin and Rubin&amp;rsquo;s (1956) theorem implies any non-threshold rule is dominated; admissibility restricts attention to threshold rules. Partial identification completely eliminates this refinement: even randomized or otherwise arbitrary rules are admissible.&lt;/p&gt;
&lt;p&gt;Q: What does the maximin welfare criterion recommend?
A: Theorem 2 shows that when the status quo welfare equals the infimum of welfare over states with non-positive welfare contrast, the maximin optimal rule is d(y) = 0 for all y — preserve the status quo regardless of the data. In the running evidence-aggregation example, maximin welfare equals zero and is achieved by never assigning the new policy. The criterion ignores all data because the worst case is always achieved at states where the new policy performs no better than the status quo.&lt;/p&gt;
&lt;p&gt;Q: What is the minimax regret criterion and why is it preferred?
A: Expected regret at state theta is R(d,theta) = U(theta)*{1{U(theta)&amp;gt;=0} - E[d(Y)]} — the expected welfare loss relative to the oracle who knows theta. A rule is MMR optimal if it minimizes worst-case expected regret. Unlike maximin welfare, MMR uses data and balances risks across states. In point-identified settings it yields essentially unique, nonrandomized rules.&lt;/p&gt;
&lt;p&gt;Q: How does partial identification change the MMR solution set?
A: Theorem 3 shows that when the identified set is large enough — I(0) is sufficiently large and there exists mu with I(mu) &amp;gt; I(0) — there are infinitely many MMR optimal rules, and every symmetric, weakly increasing MMR rule depending on the sufficient statistic (w*)^T Y must randomize for some data realizations. If I(mu) is differentiable at zero, no linear threshold rule is MMR optimal. Different MMR rules can recommend different policies for the same data, creating a nontrivial multiplicity problem.&lt;/p&gt;
&lt;p&gt;Q: How is the least randomizing MMR rule characterized?
A: Theorem 4 shows that among all symmetric, weakly increasing, unimodal MMR optimal rules that depend on data only through (w*)^T Y, the rule d*_linear has the smallest randomization region: every other distinct rule in this class has a strictly wider randomization region, V(d*_linear) ⊆ V(F∘w*) with strict inclusion when F ≠ d*_linear. This characterization is essentially unique and provides a pragmatic refinement of the MMR solution set.&lt;/p&gt;
&lt;p&gt;Q: What is profiled regret and why is it used?
A: Profiled regret reports worst-case expected regret at each fixed value of the point-identified parameters, rather than worst-case over all parameters jointly. Proposition 2 shows that the least randomizing rule d*_linear can profiled-regret dominate the Stoye (2012a)/Yata (2023) MMR rule in the running example. Proposition 3 shows that the uniformly randomizing rule is profiled-regret inadmissible when profiling over point-identified parameters. This concept provides an additional selection criterion within the MMR solution set.&lt;/p&gt;
&lt;p&gt;Q: Can the least randomizing rule be derived from an explicit welfare penalty?
A: Proposition 4 shows that, under some conditions, d*_linear is minimax regret optimal within the class of rules that penalize all randomized assignments equally. This connects the least randomizing criterion to a modified welfare function that treats randomization itself as costly, providing an interpretation for the refinement beyond mere pragmatics.&lt;/p&gt;
&lt;p&gt;Q: What does the evidence aggregation application show?
A: In the Ishihara-Kitagawa (2021) framework — extrapolating effects from n source countries to a target country using Lipschitz smoothness — the least randomizing rule randomizes only (though not always) when the estimated bounds on the target treatment effect contain both positive and negative values. When bounds are entirely positive or entirely negative, the rule recommends a deterministic action. This shows how identified-set estimators directly enter decision-theoretically optimal rules.&lt;/p&gt;
&lt;p&gt;Q: What does the LATE extrapolation application show?
A: In the Mogstad et al. (2018) setting with a binary instrument and no covariates, where the payoff-relevant parameter is a policy-relevant treatment effect corresponding to expanding the complier subpopulation, Theorem 1 applies: all decision rules are admissible. In particular, the IV threshold rule — implement the policy for large IV estimates — is not dominated, providing decision-theoretic grounding for a common empirical practice.&lt;/p&gt;
&lt;p&gt;Q: What does the omitted variable bias application show?
A: In the Diegert et al. (2022) setting where the identified set for the long regression coefficient given the medium regression coefficient is [beta_med - k, beta_med + k], the least randomizing MMR rule is d*_linear(beta_hat_med) when k &amp;gt; sqrt(pi/2) * sigma. The decision-theoretic breakdown point — the largest k under which the seemingly better policy should be adopted without randomization — is strictly larger than Diegert et al.&amp;rsquo;s sensitivity breakdown point, meaning the decision-theoretic approach tolerates more confounding before recommending hedging.&lt;/p&gt;
&lt;p&gt;Q: How does Theorem 5 generalize Theorem 1 beyond Gaussian likelihoods?
A: Theorem 5 extends the admissibility result by connecting it to bounded completeness of the statistical model rather than Gaussian-specific completeness. This shows that the collapse of admissibility&amp;rsquo;s refinement power is not an artifact of normality but a general consequence of partial identification combined with a sufficiently rich statistical model.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s broader implication for empirical practice?
A: The results show that under partial identification, two of the three classical decision-theoretic criteria (admissibility and maximin welfare) provide no useful guidance — the former because everything passes, the latter because it ignores data entirely. MMR remains the operative criterion but yields infinitely many rules, all requiring some randomization. The least randomizing refinement provides a unique, practically implementable rule that connects to estimated identified sets and tolerates more ambiguity than purely statistical sensitivity analyses.&lt;/p&gt;
&lt;p&gt;Partial identification: A setting where even infinite data cannot uniquely determine payoff-relevant parameters, because the mean function m mapping parameters to data distributions is not injective. Distinct parameter values with opposite-sign welfare contrasts may be observationally equivalent.&lt;/p&gt;
&lt;p&gt;Welfare contrast U(theta): The difference W(1,theta) - W(0,theta) between the welfare under the new policy and under the status quo at parameter theta. The oracle optimal action is 1{U(theta) &amp;gt;= 0}.&lt;/p&gt;
&lt;p&gt;Admissibility (welfare): A rule d is admissible if no rule d&amp;rsquo; weakly dominates it in expected welfare at every theta with strict improvement at some theta. Under partial identification with Gaussian likelihood, every rule is admissible — admissibility has no refinement power.&lt;/p&gt;
&lt;p&gt;Maximin welfare optimality: A rule is maximin optimal if it attains the highest worst-case expected welfare. Under partial identification, this criterion selects the no-data rule (always preserve status quo) whenever the status quo welfare equals the infimum over states with non-positive welfare contrast.&lt;/p&gt;
&lt;p&gt;Minimax regret (MMR) optimality: A rule minimizes the worst-case expected welfare loss relative to the oracle action. Under severe enough partial identification, MMR optimal rules are non-unique and all require randomizing policy recommendations for some data realizations.&lt;/p&gt;
&lt;p&gt;Least randomizing MMR rule (d*_linear): The unique MMR optimal rule with the smallest randomization region among all symmetric, weakly increasing, unimodal MMR rules depending on the sufficient statistic. Characterized in Theorem 4; randomizes only when estimated identified set bounds straddle zero in the running example.&lt;/p&gt;
&lt;p&gt;Profiled regret: The worst-case expected regret at each fixed value of the point-identified parameters, treating them as a parameter of interest and profiling out the partially identified parameters. Provides a finer ranking within the MMR solution set and renders the uniformly randomizing rule inadmissible.&lt;/p&gt;</description></item><item><title>Default Options and Retirement Saving Dynamics</title><link>https://macropaperwarehouse.com/papers/default-options-and-retirement-saving-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/default-options-and-retirement-saving-dynamics/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Does automatic enrollment (auto-enrollment) in retirement savings plans increase lifetime wealth accumulation and welfare? The prior literature established large short-run participation effects but had not traced the policy&amp;rsquo;s consequences over a full working life.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper draws on two primary sources. First, a proprietary panel of 401(k) administrative records from nearly 600 U.S. firms, covering roughly 159,216 first-year employees across 86 firms (for the &amp;ldquo;increasing default&amp;rdquo; fact) and 6,415 employees across 34 firms (for structural estimation), observed between December 2006 and December 2017. Second, 12 successive waves (2006–2017) of the U.K. Annual Survey of Hours and Earnings (ASHE), a 1% nationally representative panel of approximately 200,000 private-sector employees per year, including 37,120 job-switchers, used to exploit the phased rollout of the U.K. Pension Act of 2008.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in three steps. (1) Three empirical stylized facts are documented using quasi-experimental variation (comparing employees hired before versus after changes in the default contribution rate within the same firm, and exploiting the staggered employer-size-based rollout of U.K. auto-enrollment). (2) A structural lifecycle model is estimated via the Method of Simulated Moments, using three preference parameters—intertemporal discount factor (δ), elasticity of intertemporal substitution (σ), and opt-out cost (k)—identified from the within-firm default variation in 34 U.S. firms. (3) The estimated model is used for out-of-sample validation and counterfactual welfare analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three stylized facts.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact I — Increasing the default reduces participation.&lt;/em&gt; Among 159,216 first-year employees in 86 auto-enrollment firms, each percentage-point increase in the default contribution rate reduces 401(k) participation by approximately 1 percentage point and increases contributions strictly below the new default by 1 percentage point. When the default rose from 3% to 6%, workers were 3.2 percentage points more likely to contribute at 1% or 2% of salary. This &amp;ldquo;drop-out&amp;rdquo; pattern is consistent with an opt-out cost model but is inconsistent with loss-aversion and psychological-anchoring theories, both of which predict that raising the default should weakly increase low-end contributions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact II — Non-autoenrolled workers catch up within three years.&lt;/em&gt; In the estimation sample of 34 U.S. firms offering a 50% match up to 6% and an auto-enrollment default of 3%, median cumulative employee 401(k) contributions of non-autoenrolled workers equal those of autoenrolled workers after three years of tenure. Because non-autoenrolled workers compensate for initial non-participation by contributing more later—earning similar cumulative employer match and tax benefits over the full three-year horizon—a modest opt-out cost suffices to explain the observed inertia. Previous studies (which examined only the first year of tenure and did not allow future contribution adjustment) inferred opt-out costs of $1,000–$2,200 or more; the dynamic model implies a cost of only approximately $250.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fact III — Prior auto-enrollment reduces saving in the next job.&lt;/em&gt; Using the phased U.K. policy rollout, workers who were auto-enrolled in their previous job and then move to a new employer that has not yet implemented auto-enrollment participate 12.8 percentage points less and contribute 0.55% of salary less in the new plan relative to otherwise similar job-switchers from non-auto-enrollment employers. When the new employer also has auto-enrollment, no statistically significant difference is observed. Placebo rollout tests confirm the effect is not a pre-existing selection pattern. This negative spillover contradicts a &amp;ldquo;savings habit&amp;rdquo; hypothesis and suggests that auto-enrollment&amp;rsquo;s short-run boost overstates lifetime savings effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural estimation results.&lt;/strong&gt; The estimated quarterly discount factor is δ = 0.987 (approximately 0.949 annually), and the elasticity of intertemporal substitution is σ = 0.435, both standard in lifecycle models. The opt-out cost is estimated at &lt;strong&gt;$254&lt;/strong&gt; per contribution-rate change (standard error $11). Sensitivity exercises show that combining a short observation window (first year only), sticky contributions (no intra-job adjustment), no income uncertainty, immediate vesting, and penalty-free DC withdrawals yields an opt-out cost of $3,004—broadly matching the range in previous studies. The low baseline estimate is thus driven by the dynamic nature of decisions (ability to compensate later), the illiquidity of retirement accounts (which reduces their perceived value), and income uncertainty (which expands the inaction range).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-run wealth effects.&lt;/strong&gt; Simulating a universal 3% auto-enrollment policy, the model predicts that &lt;strong&gt;wealth at retirement changes by less than 2% for the top 7 income deciles&lt;/strong&gt;. For individuals in the top two deciles, total wealth at age 65 is actually reduced by less than 1% because many high earners who would voluntarily contribute above 3% are pulled down to the default. At the &lt;strong&gt;bottom decile&lt;/strong&gt;, however, auto-enrollment raises total retirement wealth by more than &lt;strong&gt;12%&lt;/strong&gt;; savings increases are concentrated in the first 20 years of working life and peak around age 45, where bottom-quintile workers hold an additional 20% of average annual lifetime earnings. Even at the bottom, approximately one-third of the early savings gains are offset by lower contributions after age 45, as the wealth effect dominates. Crowd-out of liquid savings is limited: for bottom-quintile individuals, &lt;strong&gt;89%&lt;/strong&gt; of the increase in retirement savings at age 65 passes through to total wealth; for middle-quintile individuals, &lt;strong&gt;62%&lt;/strong&gt; passes through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Out-of-sample validation.&lt;/strong&gt; The U.S.-estimated model is not rejected (at the 10% level) in 8 of 11 response moments in the 86-firm sample where defaults were raised between two positive rates, covering over 85% of workers. Recalibrated to U.K. institutions (using δ and σ from the U.S. and k = £160 via the average USD/GBP exchange rate), the model replicates the roughly 30-percentage-point increase in both participation and contributions at the 1% U.K. default. The model also predicts a 9.6-percentage-point drop in participation when workers move from an auto-enrollment to an opt-in employer, close to the empirical 12.8 percentage points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare and optimal policy.&lt;/strong&gt; Under utilitarian preferences (policymaker shares individuals&amp;rsquo; discount rate, no redistributive motive), the opt-in regime is always preferred to auto-enrollment regardless of policy incidence, because matching and tax incentives already induce over-saving relative to individuals&amp;rsquo; revealed time preferences. Under &lt;strong&gt;paternalistic&lt;/strong&gt; preferences (social discount factor = 1) or &lt;strong&gt;inequality-averse&lt;/strong&gt; preferences (Pareto weights inversely proportional to income, with degree of inequality aversion ν = 1 following Saez 2002), an auto-enrollment default at or near the employer matching threshold (6% of income) maximizes social welfare. A 6% auto-enrollment default improves welfare by 0.3% in lifetime consumption-equivalent for the bottom decile even under a utilitarian policymaker when incidence is on employers. These optimal policy rankings are robust to whether the opt-out cost is treated as fully welfare-relevant (π = 1) or welfare-irrelevant (π = 0), and hold under three incidence scenarios (employer profit reduction, match-rate adjustment, wage adjustment).&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core mechanism by which non-autoenrolled workers &amp;ldquo;catch up&amp;rdquo; at the median, and why does this reduce the implied opt-out cost relative to prior estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Non-autoenrolled workers who do not contribute in their first year are not permanently forgoing employer matching and tax benefits; they can contribute more later in the same job and earn similar cumulative benefits. The paper shows that at the median and 75th percentile, cumulative employee 401(k) contributions among opt-in workers equal those of autoenrolled workers after three years of tenure in 34 U.S. firms offering a 50%-up-to-6% match at a 3% default. This dynamic substitutability means the opportunity cost of initial non-participation is far smaller than one-period back-of-the-envelope calculations suggest. Previous studies, which implicitly or explicitly assumed static contribution decisions or examined only the first year, inferred opt-out costs of $1,000–$2,200; in a fully dynamic model the same inertia requires only ~$254.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does Fact I (higher default reduces participation) specifically rule out loss aversion and anchoring as the primary mechanism, and what does it support instead?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under loss aversion, contributions above the default feel like losses while contributions below the default feel like gains. Raising the default shifts some contributions from the loss domain into the gain domain, making low contributions relatively less attractive. Proposition 2 demonstrates formally that loss-averse preferences predict a weakly lower fraction contributing below the new (higher) default — the opposite of what is observed. Similarly, Proposition 3 shows that psychological anchoring shifts preferences toward the new default, also predicting more participation at low rates when the default rises. Only the opt-out cost model (Proposition 1) predicts that a higher default causes some workers to incur the cost to switch &lt;em&gt;away&lt;/em&gt; from the default and end up at lower contribution rates, matching the empirical finding that each 1-percentage-point rise in the default increases contributions strictly below the old default by approximately 1 percentage point.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the quantitative magnitude of the opt-out cost, and what modeling assumptions are responsible for it being much smaller than prior estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The baseline estimate is $254 per contribution-rate change (s.e. $11), roughly an order of magnitude smaller than prior estimates of $1,000–$3,000+. Table 4 decomposes the sources of the difference: using only first-year data changes the estimate only slightly (to $226). Assuming contributions cannot be changed within a job (&amp;ldquo;sticky contributions&amp;rdquo;) raises the cost to $308 with four years of data or $712 with one year of data. Eliminating income uncertainty raises the estimate to $465. Assuming immediate vesting raises it to $344. Assuming penalty-free DC withdrawals raises it to $609. Combining all these restrictions simultaneously yields $3,004 — closely matching the prior literature. The three key drivers are thus: (1) the ability to adjust contributions over time within a job; (2) the illiquidity of the DC account (early-withdrawal penalties); and (3) income uncertainty widening the inaction range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the paper validate the structural model out of sample, and what confidence does this provide in the long-run predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Two out-of-sample exercises are reported. First, the model estimated on 34 U.S. firms (introduction of auto-enrollment from 0% to 3% default) is used to predict workers&amp;rsquo; response when 86 other firms raised the default from one positive rate to a higher rate. The model prediction cannot be rejected at the 10% level in 8 of 11 response-moment cases, covering 71 of 86 firms and more than 85% of workers. Second, the model is re-calibrated to U.K. institutions (keeping U.S. preference estimates, setting k = £160 via exchange rate) and applied to the phased rollout of the U.K. Pension Act of 2008. The model replicates the roughly 30-percentage-point increase in both participation and contributions at the 1% default following the policy, and predicts a 9.6-percentage-point drop in participation when previously autoenrolled workers move to a new opt-in employer — compared with an empirical estimate of 12.8 percentage points (s.e. 5.5 pp).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the distributional implications of a universal 3% auto-enrollment policy for wealth at retirement?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The effect is concentrated at the bottom. For the top 7 income deciles, retirement wealth at age 65 changes by less than 2% relative to the opt-in counterfactual. For the top two deciles, total wealth at age 65 is actually reduced by less than 1% because high-earning workers who would voluntarily contribute above 3% are pulled down to the default. For the bottom decile, the policy raises total retirement wealth by more than 12%. Even at the bottom, roughly one-third of the early savings gains are later offset by lower contributions after age 45 as the wealth effect dominates, so even 20-year empirical follow-ups may overstate the policy&amp;rsquo;s lifetime effect at the bottom.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How large is crowd-out of liquid savings by auto-enrollment, and what explains the limited degree of substitution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Crowd-out is modest. For bottom-quintile workers, 89% of the increase in retirement savings at age 65 translates into higher total wealth; for middle-quintile workers, 62% passes through. The limited crowd-out arises because liquid assets serve a precautionary motive and DC accounts serve a lifecycle motive — the two assets are not close substitutes. Additionally, as in Kaplan and Violante (2014), the marginal propensity to consume out of liquid assets is high in the model, so autoenrolled workers reduce consumption rather than run down liquid balances. These predictions align with Beshears et al. (2021), who find no significant increase in unsecured debt after four years, and Chetty et al. (2014), who estimate an 80% pass-through to total savings in a different Danish policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why do previously autoenrolled workers contribute less when they switch to an opt-in employer, and how is this consistent with the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The most plausible explanation, and the one consistent with the model&amp;rsquo;s out-of-sample predictions, is a standard wealth effect: workers auto-enrolled early accumulate more retirement wealth and therefore have less incentive to contribute in a new job. The model predicts a 9.6-percentage-point participation drop for AE-to-non-AE movers, close to the empirical 12.8 pp. An alternative explanation — that previously autoenrolled workers rationally expect their new employer to soon adopt auto-enrollment and thus delay active enrollment — is partially ruled out by the finding that the empirical estimate is closer to the model prediction for job-switchers whose new employer is not expected to adopt auto-enrollment in the next 12 months.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the welfare implications of auto-enrollment under utilitarian, paternalistic, and inequality-averse policymakers, and how robust are these to the incidence assumption?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under utilitarian preferences (policymaker shares individuals&amp;rsquo; discount factor, no extra redistributive weight), the opt-in regime is always preferred regardless of whether the policy&amp;rsquo;s cost falls on employer profits, the match rate, or wages. The negative welfare effect is largest when incidence falls on wages (approximately 50% larger than under match-rate reduction). Under paternalistic preferences (social discount factor = 1), a 6% default (equal to the employer matching threshold) is optimal under all three incidence scenarios. Under inequality-averse preferences (ν = 1 Pareto weights), a 6% default is optimal when incidence falls on employers, and a 5% default when incidence falls on workers. These results are identical whether the opt-out cost is treated as fully welfare-relevant (π = 1) or welfare-irrelevant (π = 0). A 6% auto-enrollment default increases welfare by 0.3% in lifetime consumption-equivalent for the bottom income decile even under a utilitarian planner when incidence is on employers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the paper address heterogeneity in default effects across age and income groups within a parsimonious homogeneous preference model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model has only three estimated preference parameters (δ, σ, k), yet it endogenously replicates empirical heterogeneity. Conditional on participating, workers in their 20s are approximately 20 percentage points more likely to stay at the 3% default than workers in their late 50s and early 60s; the model attributes this to the option value of waiting: young workers can compensate for current non-saving by contributing more later, so the cost of opting out is effectively smaller for them. The lowest-income workers are approximately 40 percentage points more likely to remain at the default than the highest-paid; the model explains this primarily because the fixed opt-out cost of $254 represents a larger share of earnings for low-income individuals (and secondarily because high-income workers have more to gain from active contribution decisions due to higher marginal tax rates and a lower Social Security replacement rate). All model-predicted coefficients fall within the 95% confidence intervals of the empirical estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the paper conclude about the broader relevance of the &amp;ldquo;dynamic opt-out cost&amp;rdquo; framework beyond retirement saving?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper argues that wherever individuals can compensate for present inaction with future actions — as in retirement saving — the observed inertia at a default understates the freedom of choice preserved by the nudge, and short-run effects overstate long-term consequences. In contrast, in domains such as healthcare plan choice or school selection, future actions cannot easily offset present inertia; opt-out costs are likely to remain large; and the distinction between a nudge and a hard mandate collapses. The paper therefore argues that the appeal of &amp;ldquo;libertarian paternalism&amp;rdquo; (Thaler and Sunstein 2003) is domain-specific and is strongest precisely where intertemporal adjustment is possible.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Opt-out cost (k).&lt;/strong&gt; In this paper, a utility cost — estimated at $254 per contribution-rate change — that individuals must pay every time they choose a retirement contribution rate different from the current default. The cost is modeled as a consumption reduction and captures both real transaction costs (form-filling, adviser fees) and behavioral costs (cognitive cost of attention and optimal-choice search). It is fixed and homogeneous across individuals, and applies symmetrically in any direction of deviation from the default.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Auto-enrollment default contribution rate.&lt;/strong&gt; The positive contribution rate at which new hires are automatically enrolled in a defined-contribution plan, with the option to opt out by incurring the opt-out cost. In the paper&amp;rsquo;s estimation sample, this is 3% of salary. The default is exogenous at the start of each new job but endogenous thereafter: once established, the default for subsequent periods equals the worker&amp;rsquo;s contribution rate in the previous period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default eﬀect.&lt;/strong&gt; The empirically observed tendency of workers to remain at the default contribution rate rather than actively choosing a different rate. In this paper, the default effect is explained by opt-out costs rather than loss aversion or psychological anchoring — a distinction identified through the novel prediction that raising the default from a positive rate to a higher positive rate reduces overall participation (the &amp;ldquo;drop-out&amp;rdquo; effect), a pattern consistent only with opt-out costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Drop-out eﬀect.&lt;/strong&gt; The paper&amp;rsquo;s term (following Caplin and Martin 2017) for the empirical finding that increasing the auto-enrollment default contribution rate causes some workers to stop contributing altogether or to contribute at rates strictly below the initial default. This effect is used as a discriminating test between competing theories of the default effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic opt-out cost framework.&lt;/strong&gt; The paper&amp;rsquo;s core modeling insight: that opt-out costs must be estimated in a fully dynamic lifecycle model that allows workers to adjust contributions over time, to hold liquid assets and unsecured debt, and to face labor market risk. In a static or short-horizon model, the opportunity cost of initial non-participation appears large (because the worker permanently forgoes match and tax benefits), requiring large opt-out costs. In the dynamic model, the ability to compensate later shrinks the implied opportunity cost and hence the opt-out cost required to rationalize observed inertia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowd-out of liquid savings.&lt;/strong&gt; The extent to which higher DC retirement contributions induced by auto-enrollment reduce liquid asset holdings (or increase unsecured borrowing), rather than increasing total wealth. The paper estimates limited crowd-out (89% pass-through to total wealth for bottom-quintile workers, 62% for middle-quintile workers), attributable to the different roles of liquid assets (precautionary motive) and DC accounts (lifecycle motive) in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy incidence.&lt;/strong&gt; The channel through which employers balance their budget in response to higher matching costs created by auto-enrollment. The paper considers three scenarios: employers absorb costs through reduced profits; employers reduce the match rate; employers reduce wages. Optimal policy rankings and welfare magnitudes differ across these scenarios, but the qualitative conclusions — utilitarian policymaker prefers opt-in; paternalistic or inequality-averse policymaker prefers AE at 6% — are robust across incidence assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption-equivalent variation (γ).&lt;/strong&gt; The welfare metric used in the paper: the proportional increase in consumption in every period and every state of the world that would make the policymaker indifferent between an auto-enrollment policy at default d and the opt-in regime. A 6% default increases welfare by 0.3% in consumption-equivalent for the bottom income decile under a utilitarian policymaker when incidence is on employers.&lt;/p&gt;</description></item><item><title>Defying Distance? The Provision of Medical Services in the Digital Age</title><link>https://macropaperwarehouse.com/papers/defying-distance-the-provision-of-medical-services-in-the-digital-age/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/defying-distance-the-provision-of-medical-services-in-the-digital-age/</guid><description>&lt;p&gt;This paper asks whether digital platforms can improve healthcare outcomes by enabling needs-based matching between patients and physicians unconstrained by geography. Amanda Dahlstrand studies digital primary care in Sweden during 2016-2018, exploiting nationwide conditional random assignment between approximately 200,000 patients and 143 doctors employed by Europe&amp;rsquo;s largest digital primary care provider. Patients who selected the &amp;ldquo;first available doctor&amp;rdquo; option (82% of first visits) were effectively randomized to a doctor within each 3-hour shift-by-date stratum, generating quasi-experimental variation free of the patient-doctor sorting that confounds identification in physical primary care.&lt;/p&gt;
&lt;p&gt;The paper defines three observable dimensions of primary care physician skill: (1) identifying risky patients and triaging them to higher levels of care, measured by whether patients subsequently have an avoidable hospitalization within 90 days; (2) providing guideline-consistent treatment, measured by counter-guideline antibiotic prescriptions; and (3) leaving patients sufficiently informed so they do not unnecessarily seek additional in-person care within the following week. Doctor skill in each dimension is estimated via a value-added framework in a hold-out sample (Sample 1, the first 600 randomized consultations per doctor), using empirical Bayes shrinkage to reduce noise. Complementarities between doctor skill and patient risk are then estimated in a disjoint main sample (Sample 2).&lt;/p&gt;
&lt;p&gt;A central finding is that doctor skill is task-specific rather than governed by a single latent ability: skills across the three tasks are not positively correlated, meaning doctors within general practice have individual &amp;ldquo;specializations.&amp;rdquo; A patient ranked in the top 1% of avoidable hospitalization risk who is matched to a doctor ranked in the top 10% at reducing avoidable hospitalizations experiences a 90% reduction in that adverse outcome, relative to a patient with the same risk profile matched to the worst-performing doctor. Patients not estimated as risky show effects indistinguishable from zero when matched to the same high-skilled doctors, establishing a strong complementarity between doctor type and patient risk.&lt;/p&gt;
&lt;p&gt;Using the Average Match Function framework of Graham, Imbens, and Ridder (2014, 2020), the paper evaluates counterfactual reallocation policies. Reallocating only 2% of patients — those in the top 1% of predicted avoidable hospitalization risk — to doctors in the top 10% of triage skill reduces aggregate avoidable hospitalizations by 20% relative to random assignment, without adversely affecting counter-guideline prescriptions or other measured outcomes. Doctor skills across outcomes are not positively correlated, so this reallocation does not generate meaningful trade-offs. The paper benchmarks this matching policy against a selective hiring/expansion policy in which doctors with above-median skill in three tasks expand their hours by up to 70% at the expense of below-median peers; that policy yields no significant reduction in avoidable hospitalizations and only a 4% reduction in counter-guideline prescriptions — smaller gains than matching and harder to implement.&lt;/p&gt;
&lt;p&gt;The paper also documents that physical primary care quality is worse in lower-income and more deprived areas of Sweden (a negative relationship between deprivation index and patient-reported experience is statistically significant at the 1% level in a cross-section of roughly 120-150 primary care centers in Region Skane). Because the estimated risk of avoidable hospitalization and prior avoidable hospitalizations are concentrated in the lower end of the income distribution, needs-based digital matching reallocates triage skill toward lower-income patients, severing the correlation between local area income and service quality. Simulating positive assortative matching on patient income and doctor skill — approximating existing healthcare inequalities — leads to more avoidable hospitalizations than random assignment, because the most vulnerable patients tend to be the poorest. Scope conditions: findings derive from a single digital primary care provider in Sweden, 2016-2018, pre-pandemic, covering conditions amenable to video consultation and a patient pool younger and somewhat more urban than the average Swedish citizen.&lt;/p&gt;
&lt;p&gt;Q: What is the key identification strategy, and why is it valid in this setting but not in physical primary care?
A: Patients who selected the &amp;ldquo;drop in&amp;rdquo; (first available doctor) option — 82% of first visits — were assigned to whichever certified doctor was next in the roster within a 3-hour shift-by-date stratum, a by-product of the first-come-first-served queue. Neither patients nor doctors could intervene in this digital process. The author validates the assumption by regressing doctor characteristics on patient characteristics controlling for shift-by-date fixed effects and finds characteristics are balanced. In physical primary care, endemic patient-doctor sorting means doctors do not meet a common support of patient types, preventing causal identification of doctor effects.&lt;/p&gt;
&lt;p&gt;Q: How are doctor skill estimates constructed and why does the split-sample matter?
A: Doctor skill in each task is estimated as an empirical Bayes-shrunk random effect from a value-added regression on Sample 1, each doctor&amp;rsquo;s first 600 randomized consultations (40% of the sample). Sample 2 (60%) is entirely disjoint and used to estimate complementarities between doctor skill and patient risk. The split-sample design prevents overfitting: doctor skill was estimated on different patients than those in Sample 2. The Durbin-Wu-Hausman test does not reject random effects (p = 0.16).&lt;/p&gt;
&lt;p&gt;Q: What is the main quantitative result on avoidable hospitalization matching?
A: A patient ranked in the top 1% of predicted avoidable hospitalization risk matched to a doctor ranked in the top 10% at reducing avoidable hospitalizations could reduce that patient&amp;rsquo;s avoidable hospitalizations by 90%, relative to the worst-performing doctor in that skill. At the aggregate level, reallocating only 2% of patients (those in the top 1% risk group) to high-triage-skill doctors reduces avoidable hospitalizations across the full patient population by 20% compared to random assignment.&lt;/p&gt;
&lt;p&gt;Q: Does the avoidable hospitalization reallocation harm other outcomes?
A: No. The paper explicitly evaluates the Average Reallocation Effect on counter-guideline prescriptions and additional in-person care seeking when optimizing for avoidable hospitalizations, and finds no significant adverse effects on these other outcomes. The author attributes this to the fact that doctor skills across tasks are not positively correlated, so reallocating triage-skilled doctors does not systematically remove skill from other dimensions.&lt;/p&gt;
&lt;p&gt;Q: How does matching compare to selective hiring and hour expansion as a policy?
A: Even expanding the working hours of doctors with above-median skill across three tasks by as much as 70% yields no significant reduction in avoidable hospitalizations and only a 4% reduction in counter-guideline prescriptions — both smaller gains than the matching policy. Matching outperforms hiring expansion because patients have heterogeneous needs that can be identified from prior healthcare records, and doctors have differentiated skill sets relevant to some patients but not others.&lt;/p&gt;
&lt;p&gt;Q: What is the evidence that doctor skills are task-specific rather than reflecting a single latent ability?
A: The estimated doctor effects across the three tasks — triaging to avoid hospitalizations, guideline-consistent antibiotic prescribing, and minimizing unnecessary follow-up care — are not positively correlated with one another. This means a doctor who is effective at one task is not systematically effective at others, indicating individual specializations within general practice that are not accounted for in standard primary care organization.&lt;/p&gt;
&lt;p&gt;Q: How is patient risk for avoidable hospitalizations measured?
A: A propensity score is estimated from pre-digital physical healthcare data (2013-2015), regressing past number of avoidable hospitalizations on demographic and healthcare utilization variables — including age, a disease index of chronic diagnoses, and previous hospitalizations — all variables already available in patient medical records. The top 1% of predicted risk scores are classified as &amp;ldquo;risky.&amp;rdquo; Patients in the risky group had on average 0.35 avoidable hospitalizations in the prior 3 years, versus 0.01 for non-risky patients.&lt;/p&gt;
&lt;p&gt;Q: What is the distributional (equity) implication of needs-based matching versus income-assortative matching?
A: Estimated risk of avoidable hospitalization and the count of prior avoidable hospitalizations are concentrated in the lower end of the income distribution. Needs-based matching therefore reallocates triage skill toward lower-income patients. Simulating positive assortative matching on patient income and doctor skill — approximating observed inequalities in physical care — produces more avoidable hospitalizations than random assignment, because the most vulnerable patients are often the poorest. Needs-based digital matching can sever the link between local area income and service quality.&lt;/p&gt;
&lt;p&gt;Q: How does digital care usage sort by income and demographics in the data?
A: At the extensive margin, the deprivation index (Care Need Index) is similar among digital users and non-users in Region Skane. However, at the intensive margin, individuals with a higher deprivation index who use the digital service have more appointments in it; similarly, lower-income users use the service more intensively. Digital care users are younger than non-users and are more likely to live in cities than the average Swedish citizen.&lt;/p&gt;
&lt;p&gt;Q: What are avoidable hospitalizations and why are they the primary outcome?
A: Avoidable hospitalizations (also called hospitalizations for ambulatory care sensitive conditions) are hospital admissions defined in the medical literature as preventable by adequate and timely primary care. They are coded using ICD-10 diagnosis codes listed in Page et al. (2007). The most common diagnoses in the 90-day post-consultation window are respiratory and genitourinary, conditions commonly treated in digital care. The outcome is rare (0.2% of patients in the sample), but high-stakes: an estimated 1.1 potential life years are lost per avoidable hospitalization, and in Sweden they cost an estimated SEK 7.1 billion (~$820 million) annually (7% of inpatient curative and rehabilitative care costs).&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the counter-guideline antibiotic prescription outcome?
A: Non-adherence is coded against 16 guidelines from Sweden&amp;rsquo;s strategic programme against antibiotic resistance (Strama 2017, 2019), all designed to limit or narrow antibiotic use. The measured rate of non-adherence is described as quite low by international standards; the CDC estimates 28% of US antibiotic prescriptions are unnecessary, while the author&amp;rsquo;s sample rate is 2%. The guidelines require doctors to sometimes refuse patients who request antibiotics, introducing a behavioral compliance dimension to this skill.&lt;/p&gt;
&lt;p&gt;Q: What are the costs and feasibility considerations for implementing needs-based digital matching?
A: The paper characterizes matching as a &amp;ldquo;resource-neutral&amp;rdquo; policy because it reallocates existing doctors without hiring or training. The primary costs are a small increase in waiting time for some patients and the costs of importing data and developing the matching algorithm. Because the algorithm handles patient-doctor allocation while doctors retain all clinical decision-making, the policy functions as a complement to human skill rather than a substitute, which the author argues makes it less subject to &amp;ldquo;algorithm aversion.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Q: Why does the paper restrict to each patient&amp;rsquo;s first digital consultation only?
A: The first visit is the one subject to conditional random assignment; subsequent visits could reflect endogenous selection by patients who preferred a particular doctor or outcome. Using only first visits eliminates this concern. The restriction reduces the sample from approximately 378,000 to 210,171 patients (56% of the original), paired with 143 doctors who each had at least 600 randomized consultations.&lt;/p&gt;
&lt;p&gt;Conditional random assignment: The allocation mechanism by which patients selecting the &amp;ldquo;first available doctor&amp;rdquo; option in digital primary care were assigned to whichever certified doctor was next in the shift roster, conditional on 3-hour shift-by-date strata — a by-product of the first-come-first-served queue rather than an intended experimental design.&lt;/p&gt;
&lt;p&gt;Average Match Function (AMF): The conditional mean of a patient outcome given observable doctor type and patient type under random assignment, β(x,w) = E[Y|X=x, W=w], which serves as the building block for evaluating counterfactual reallocation policies.&lt;/p&gt;
&lt;p&gt;Average Reallocation Effect (ARE): The difference in expected patient outcomes between a counterfactual doctor-patient assignment and the status quo random assignment, taking into account the externality on the patient from whom a high-skilled doctor is moved.&lt;/p&gt;
&lt;p&gt;Task-specific doctor skill: The paper&amp;rsquo;s finding that primary care physician effectiveness is not governed by a single latent ability but varies across distinct tasks — triage/risk prediction, guideline-consistent prescribing, and minimizing unnecessary follow-up care — with skills across tasks not positively correlated.&lt;/p&gt;
&lt;p&gt;Avoidable hospitalization: A hospital admission coded to a diagnosis (per Page et al. 2007 ICD-10 classification) defined in the medical literature as preventable by adequate and timely primary care, used as the primary high-stakes outcome measure (0.2% incidence in the sample within 90 days of a digital consultation).&lt;/p&gt;
&lt;p&gt;Counter-guideline prescription: A prescription of an antibiotic in violation of one of 16 guidelines from Sweden&amp;rsquo;s Strama antibiotic resistance programme, all of which are designed to limit use or require narrower-spectrum first-line antibiotics; used as the primary guideline-adherence outcome (2% incidence in the sample).&lt;/p&gt;
&lt;p&gt;Empirical Bayes shrinkage: A procedure applied to raw doctor value-added estimates in which the noisy estimate of doctor quality is multiplied by the ratio of signal variance to total (signal plus noise) variance, yielding a best linear predictor of the underlying doctor random effect and reducing noise from small-sample estimation.&lt;/p&gt;</description></item><item><title>Demand Analysis under Latent Choice Constraints</title><link>https://macropaperwarehouse.com/papers/demand-analysis-under-latent-choice-constraints/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/demand-analysis-under-latent-choice-constraints/</guid><description>&lt;p&gt;Agarwal and Somaini study demand estimation in markets where consumers face latent choice constraints — situations where a consumer&amp;rsquo;s effective choice set is determined not only by her preferences but also by supply-side rationing or information frictions that restrict which options are actually available to her. Standard discrete choice methods assume consumers pick freely from the full product set, but this assumption fails in school and college admissions, entry-level labor markets, healthcare with selective admissions, and consumer markets with incomplete consideration sets. The paper provides a unified non-parametric identification framework for this class of models, proves necessity of the identifying instruments, proposes a computationally tractable estimator, and applies the framework to the California kidney dialysis market.&lt;/p&gt;
&lt;p&gt;The model combines a general random utility specification — accommodating multi-dimensional unobserved heterogeneity and product-level unobservables correlated with observed characteristics as in Berry (1994) and BLP (1995) — with a reduced-form acceptance policy function that governs which products accept which consumers. The consumer&amp;rsquo;s latent choice set is the set of products that accept her, and she picks her most preferred option within that set. Crucially, the acceptance decision may be arbitrarily correlated with consumer preferences, ruling out the independence assumptions common in the consideration-set literature.&lt;/p&gt;
&lt;p&gt;Identification rests on two sets of instruments. The first is a preference shifter, a consumer-product observable that affects utility but is excluded from the acceptance policy — distance to facility in the application. The second is a choice-set shifter, an observable that affects the acceptance decision but is excluded from consumer utility — short-term deviation of a facility&amp;rsquo;s caseload from its estimated target in the application. The main result (Theorem 1) establishes non-parametric point identification of the joint distribution of indirect utilities and acceptance decisions given both instruments. Proposition 1 establishes that the model is not identified when the choice-set shifter is absent — even when the preference shifter has full support — making both instruments necessary rather than merely sufficient.&lt;/p&gt;
&lt;p&gt;The application uses USRDS data on 41,913 new dialysis patients treated at 552 California facilities between 2015 and 2018. Most facilities are owned by Fresenius or DaVita. The choice-set shifter is the facility&amp;rsquo;s caseload deviation from target when a patient enters the market; facility and quarter fixed effects are included so that only short-term caseload variation drives identification. A reduced-form regression shows that higher caseload deviation significantly reduces the inflow of new patients to a facility, consistent with supply-side rationing. Patients also choose more distant facilities when nearby facilities have above-normal caseloads, providing further reduced-form evidence that rationing shapes allocations.&lt;/p&gt;
&lt;p&gt;A Gibbs sampler with data augmentation — drawing alternately from the distribution of latent choice sets conditional on utilities and from utility parameters conditional on choice sets — circumvents the curse of dimensionality that makes direct likelihood maximization over all possible choice sets infeasible.&lt;/p&gt;
&lt;p&gt;Estimation results show that the probability a patient is accepted at her first-choice facility is only 73.0%, with variation across facilities. Standard discrete choice models that ignore rationing misestimate facility quality, systematically assigning high desirability to low-caseload facilities in a manner that conflates easy access with genuine patient preference. A naive correction that includes the caseload measure in the utility function mischaracterizes the diversion pattern: rationed patients are marginal for the facility but strictly prefer it, so they divert differently from patients who voluntarily switch because of quality changes. Fresenius and DaVita facilities are estimated to be more selective than independent facilities, consistent with chain networks enabling coordinated patient-flow management across locations.&lt;/p&gt;
&lt;p&gt;Q: What is the core empirical problem the paper addresses?
A: Standard demand estimation inverts market shares to recover preference parameters under the assumption that consumers choose freely from the full product set. When choice sets are constrained by supply-side rationing or information frictions, the largest market share product need not be the one most preferred — it may simply be the one that accepts the most consumers. This makes the standard inversion inapplicable, and ignoring constraints yields biased preference estimates.&lt;/p&gt;
&lt;p&gt;Q: What does the paper&amp;rsquo;s model consist of?
A: The model has two components: (1) a random utility model for consumer preferences with rich observed and unobserved heterogeneity, allowing product-level unobservables correlated with observed characteristics; and (2) a reduced-form acceptance policy function sigma_jt taking values in {0,1} that determines whether product j accepts consumer i. The consumer&amp;rsquo;s latent choice set is the set of products that accept her; she picks her most preferred option within it. Utilities and acceptance decisions may be arbitrarily correlated.&lt;/p&gt;
&lt;p&gt;Q: What examples of latent choice constraints are covered by the framework?
A: The reduced form encompasses: selective admissions in healthcare (facility accepts patient if profitability exceeds a caseload-dependent threshold); two-sided matching markets where a pairwise stable allocation is described by cutoff scores (school admissions, entry-level labor markets); consideration set models where brand awareness advertising or inattention determines which products a consumer sees; fixed-sample consumer search; and product stock-outs. Each of these implies an acceptance policy function of the form specified in the paper&amp;rsquo;s reduced-form model.&lt;/p&gt;
&lt;p&gt;Q: What are the two identifying instruments and the intuition behind each?
A: The preference shifter yij is a consumer-product observable that affects the consumer&amp;rsquo;s indirect utility for product j but is excluded from that product&amp;rsquo;s acceptance decision. In the application this is distance: dialysis requires multiple weekly visits, so distance affects patient utility, but a facility&amp;rsquo;s decision to accept a patient does not depend on how far the patient lives. The choice-set shifter zij is an observable that affects the acceptance decision but is excluded from consumer preferences. In the application this is the deviation of facility caseload from its estimated target: short-term caseload swings affect whether a facility can take a new patient but, conditional on facility fixed effects, do not reflect facility quality as perceived by patients.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 1 establish and under what conditions?
A: Theorem 1 establishes non-parametric point identification of (i) the function gj mapping the preference shifter to its utility contribution, and (ii) the joint distribution of indirect utilities and acceptance indicators, for every consumer attribute vector and every value in the interior of the joint support of the instruments. Conditions required include: monotonicity of the acceptance policy in the choice-set shifter (higher z makes acceptance weakly less likely, with sigma=1 as z approaches negative infinity and sigma=0 as z approaches positive infinity); conditional independence of unobservables from the instruments given observed consumer attributes; and at least two products available.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 1 establish about necessity of the choice-set shifter?
A: Proposition 1 shows that if the choice-set shifter z has singleton support (no variation), then even when the preference shifter g has full support on R^|J|, the distribution of preferences is not identified wherever a choice set strictly smaller than the full product set has positive probability. The non-identification result applies on any open set where a constrained choice set has positive probability — it is not a knife-edge case. This makes the choice-set shifter a necessary condition for identification, not merely a convenient one.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle endogeneity of product characteristics?
A: Corollary 2 extends the baseline identification result to allow product-level unobservables that may be correlated with observed product characteristics, as in Berry (1994) and BLP (1995). Identification in this case requires an additional instrument that shifts product characteristics but is excluded from both preferences and choice sets — analogous to BLP supply-side instruments — alongside the two shifters already required. This extends Berry and Haile (2010) to settings with constrained choice sets.&lt;/p&gt;
&lt;p&gt;Q: What is the Gibbs sampler estimator and why is it needed?
A: With J products per market, the number of possible choice sets is 2^J, making direct likelihood computation infeasible for even moderate J. The Gibbs sampler uses data augmentation to alternate between: (a) drawing latent choice sets conditional on current utility parameters and observed choices; and (b) drawing utility parameters conditional on the augmented choice sets. Each conditional draw reduces to a standard problem, avoiding the curse of dimensionality. The Bernstein-von Mises theorem implies that the posterior mean of the sampling chain is asymptotically equivalent to the maximum likelihood estimator.&lt;/p&gt;
&lt;p&gt;Q: What is the reduced-form evidence for supply-side rationing in dialysis?
A: The regression of log(1 + new patient inflows to facility j in quarter q) on facility fixed effects, quarter fixed effects, and the caseload deviation z_jq yields a statistically significant negative coefficient on caseload deviation: above-target caseloads reduce new patient admissions even after controlling for facility-level and time-level averages. Additionally, patients whose nearest facilities have above-normal caseloads travel to more distant facilities, providing complementary evidence that rationing displaces patients geographically.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated probability of acceptance at a first-choice facility?
A: The structural estimates imply that a patient is accepted at her first-choice facility with probability only 73.0%, with variation across facilities. The implied 27.0% rejection rate is economically substantial, meaning a large share of observed allocations do not reflect unconstrained patient preference.&lt;/p&gt;
&lt;p&gt;Q: How do estimates from the constrained model differ from a standard discrete choice model?
A: The standard model, which ignores selective admissions, assigns higher utility to facilities with lower caseloads — a bias that conflates easy access with genuine patient preference. The constrained model separately identifies the facility&amp;rsquo;s acceptance propensity from the patient&amp;rsquo;s underlying preference, yielding different facility quality rankings. The largest facilities are not necessarily the most desirable once selective admissions are accounted for.&lt;/p&gt;
&lt;p&gt;Q: Why is the naive correction — including caseload in the utility function — insufficient?
A: The naive correction treats caseload as a quality attribute, implying that a patient turned away because of high caseload and a patient who voluntarily avoids a high-caseload facility are pulled from the same margin. In the constrained model, a rationed patient is marginal for the facility but strictly prefers it, so she diverts to a different set of alternatives than a patient who voluntarily switches. Not capturing this distinction produces quantitatively different diversion ratios.&lt;/p&gt;
&lt;p&gt;Q: What do the estimates say about chain versus independent facilities?
A: Fresenius and DaVita facilities are estimated to be more selective in their admissions than independent facilities. The paper interprets this as consistent with large chains having better ability to coordinate patient flows across their network of facilities, potentially directing turned-away patients to other chain locations.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the identification results?
A: Identification is established within each market, for consumer attribute vectors in the interior of support, and for utility-acceptance pairs in the interior of the joint support of the instruments. The results are non-parametric in that they do not restrict the functional form of preferences or acceptance policies beyond monotonicity and support conditions, and they allow unobservables affecting choice sets to be arbitrarily correlated with preference unobservables. The empirical application implements a parametric version for tractability.&lt;/p&gt;
&lt;p&gt;Latent choice constraint: A restriction on a consumer&amp;rsquo;s effective choice set arising from supply-side rationing or information frictions, such that the consumer can only choose among the products that accept her rather than freely among all products in the market. Distinct from price-based market clearing.&lt;/p&gt;
&lt;p&gt;Acceptance policy function: A reduced-form function mapping consumer attributes, consumer unobservables, and the choice-set shifter to a binary accept/reject decision by product j. Indexed by product and market, allowing arbitrary variation in selectivity across products and time. The consumer&amp;rsquo;s latent choice set is defined as the set of products whose acceptance policy equals 1.&lt;/p&gt;
&lt;p&gt;Choice-set shifter: A consumer-product observable that shifts the acceptance probability — making product j more or less likely to accept consumer i — while being excluded from consumer indirect utility. In the application: short-term deviation of facility caseload from its estimated target. Necessary (not merely sufficient) for non-parametric identification of the model.&lt;/p&gt;
&lt;p&gt;Preference shifter: A consumer-product observable that shifts consumer utility for product j and is separable from consumer-specific unobservables, but is excluded from that product&amp;rsquo;s acceptance policy function. In the application: distance from patient&amp;rsquo;s residence to the facility. Also necessary for identification.&lt;/p&gt;
&lt;p&gt;Curse of dimensionality in constrained choice: The computational problem that the number of possible latent choice sets grows as 2^J with the number of products J, making direct likelihood integration over choice sets infeasible for even moderate J. Resolved in this paper by a Gibbs sampler with data augmentation that conditions alternately on latent choice sets or utility parameters.&lt;/p&gt;
&lt;p&gt;Diversion ratio under selective admissions: The share of patients lost by a facility who are captured by each alternative facility. In a model with selective admissions, rationed patients (marginal for the facility) divert differently from patients who voluntarily switch (marginal for the consumer), because rationed patients strictly prefer the rejecting facility. The naive correction conflates these two margins, yielding quantitatively different and biased diversion ratio estimates.&lt;/p&gt;
&lt;p&gt;Non-parametric necessity of instruments: The property that both the preference shifter and the choice-set shifter are individually necessary conditions for point identification of the joint distribution of preferences and acceptance decisions, not merely convenient sufficient conditions. Absence of either instrument leaves the model non-identified on any open set where a constrained choice set has positive probability.&lt;/p&gt;</description></item><item><title>Demand Stimulus as Social Policy</title><link>https://macropaperwarehouse.com/papers/demand-stimulus-as-social-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/demand-stimulus-as-social-policy/</guid><description>&lt;p&gt;This paper estimates the distributional and social consequences of Department of Defense (DOD) contract spending using a city-level (CBSA) panel dataset spanning 2005–2016. The research question is whether demand stimulus — specifically DOD spending, the largest category of U.S. discretionary government spending — has differential effects across demographic groups and whether it improves social outcomes typically targeted by dedicated government programs. A secondary question is whether these effects are specific to DOD spending or common to any demand shock.&lt;/p&gt;
&lt;p&gt;The empirical strategy exploits variation in DOD contract spending from USAspending.gov, constructing a proxy for outlays over time using contract duration, and instrumenting with a Bartik-type shock (location&amp;rsquo;s average DOD share interacted with aggregate contract spending). The main specification is a two-year differenced panel regression with CBSA and time fixed effects. Social outcomes come primarily from the American Community Survey (ACS), covering 290 CBSAs; mortality data come from the CDC; crime data from the FBI/NACJD. For comparison, the authors construct a general demand shock series using the standard Bartik shift-share approach across two-digit industries, which is nearly uncorrelated with the DOD shock (correlation -0.07).&lt;/p&gt;
&lt;p&gt;Main findings on distributional effects: A 1 percent increase in DOD spending as a share of local earnings raises overall average ACS earnings by 0.43 percent but raises average earnings for households without a bachelor&amp;rsquo;s degree by 0.71 percent, and raises average earnings for Black households by a slightly larger amount, while Whites receive the majority of total income. The employment rate rises by 0.22 percentage points per percent increase in DOD spending. Labor force participation is largely unchanged in aggregate, but rises 0.08 percentage points for the middle-aged (41–61) and 0.14 percentage points for those with a bachelor&amp;rsquo;s degree.&lt;/p&gt;
&lt;p&gt;On social outcomes: The poverty rate falls 0.08 percentage points, driven entirely by those without a bachelor&amp;rsquo;s degree. Food stamp (SNAP) receipt falls 0.08 percentage points. Self-reported disability rates fall, particularly among households without a bachelor&amp;rsquo;s degree. Occupational prestige rises by 0.024 points overall (0.037 for those without a bachelor&amp;rsquo;s degree). Travel time to work falls by 6.7 minutes per day, implying an annual benefit exceeding $558 per worker at a value of time of $10/hour. Marriage rates rise and divorce rates fall for some demographic groups. Homeownership increases significantly for some groups. Mortality falls, with 2.61 fewer deaths per 100,000 among those age 45–65 and 8.49 fewer deaths per 100,000 among those over 65 per percent increase in DOD spending; health-related deaths account for the majority of the decline. Crime is largely unaffected, except for a statistically significant reduction in vehicle theft.&lt;/p&gt;
&lt;p&gt;Comparing DOD to general demand shocks: Although both raise total earnings by similar amounts ($0.56 and $0.63 per dollar of shock, respectively), the general demand shock produces only about half the employment rate response (14.3 vs. 24.5 percentage point increase for households without a bachelor&amp;rsquo;s degree), concentrates earnings gains among already-employed, higher-educated, and White households, produces weaker effects on disability and occupational prestige, increases mortality by approximately 100 deaths per 100,000, and increases crime (vehicle theft and aggravated assault). The differential mortality response is partly attributed to differential pollution effects: general demand shocks raise the median AQI substantially, while DOD shocks do not. The differential employment effects of DOD shocks are explained primarily by city and occupational composition rather than industry composition: DOD shocks are directed toward smaller, lower-earnings cities with lower employment rates and fewer college-educated residents, and toward construction, manufacturing, and production/maintenance occupations with high no-bachelor&amp;rsquo;s shares.&lt;/p&gt;
&lt;p&gt;Scope conditions: Results are identified using CBSA-level variation over 2005–2016. DOD spending is treated as predominantly supply-side-driven and not directly entering household utility or local infrastructure. The social outcome results are local partial-equilibrium estimates and do not account for general equilibrium spillovers across CBSAs.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy, and why is DOD spending considered a valid instrument for demand stimulus?
A: DOD contract data from USAspending.gov are used to construct a proxy for outlays (distributing contract obligations over contract duration), and this measure is instrumented with a Bartik-type shock (location&amp;rsquo;s average DOD share times aggregate contract growth). The Bartik IV isolates the component of DOD contracts associated with new production, addressing endogeneity and the &amp;ldquo;anticipated contracts&amp;rdquo; problem. DOD spending is treated as predetermined relative to local business cycles and does not directly enter household utility or local infrastructure, isolating the aggregate demand channel.&lt;/p&gt;
&lt;p&gt;Q: Which demographic groups receive the most total income from DOD spending, and which see the largest relative gains?
A: In absolute terms, the majority of wage and salary income from DOD spending accrues to Whites and to those without a bachelor&amp;rsquo;s degree. However, adjusting for existing income shares, Black households and households without a bachelor&amp;rsquo;s degree experience the largest proportional increases in average earnings: a 1 percent increase in DOD spending as a share of local earnings raises average earnings for no-bachelor&amp;rsquo;s households by 0.71 percent, compared to a 0.43 percent increase in overall average earnings.&lt;/p&gt;
&lt;p&gt;Q: How does DOD spending affect employment at the extensive margin, and what does this imply about who benefits?
A: A 1 percent increase in DOD spending as a share of local earnings raises the overall employment rate by 0.22 percentage points. The large employment response among those without a bachelor&amp;rsquo;s degree (24.5 percentage points in the comparative analysis) implies that DOD spending disproportionately benefits previously unemployed workers rather than simply raising wages for those already employed.&lt;/p&gt;
&lt;p&gt;Q: Does DOD spending increase labor force participation?
A: There is no detectable aggregate effect on labor force participation rates, suggesting limited effects of demand stimulus on the participation margin over short horizons. However, participation rises 0.08 percentage points for the middle-aged (41–61) and 0.14 percentage points for those with a bachelor&amp;rsquo;s degree. The population response is strongest for those without a bachelor&amp;rsquo;s degree, though the estimate is imprecise.&lt;/p&gt;
&lt;p&gt;Q: What are the poverty and welfare effects of DOD spending?
A: A 1 percent increase in DOD spending as a share of local earnings reduces the poverty rate by 0.08 percentage points, with the entire effect concentrated among households without a bachelor&amp;rsquo;s degree. SNAP (food stamp) receipt falls by 0.08 percentage points. Medicaid receipt falls significantly for young children, while children substitute into private health insurance, leaving overall child health insurance coverage unchanged.&lt;/p&gt;
&lt;p&gt;Q: How does DOD spending affect disability rates?
A: A 1 percent increase in DOD spending leads to a 0.001 percentage point reduction in self-reported disability rates among households without a bachelor&amp;rsquo;s degree. The effect is most apparent for this group, the middle-aged, and Whites. In the comparative analysis, the employment margin accounts for a disability decline of -0.051 for no-bachelor&amp;rsquo;s households, nearly half of the total disability decline of -0.114 for that group.&lt;/p&gt;
&lt;p&gt;Q: What are the occupational prestige and commute time effects?
A: A 1 percent increase in DOD spending raises a city&amp;rsquo;s average occupational prestige score (Siegel score) by 0.024 points, with the effect concentrated among no-bachelor&amp;rsquo;s households (0.037). Commute time falls by 6.7 minutes per day; at a value of time of $10/hour, this implies an annual benefit of approximately $558 per worker.&lt;/p&gt;
&lt;p&gt;Q: How does DOD spending affect household formation outcomes?
A: Marriage rates increase and the likelihood of single parenthood decreases for White households. Divorce rates decrease for middle-aged and Black households. White households become more likely to own homes and less likely to live in multi-family homes. Estimates for Black and Hispanic households are imprecise.&lt;/p&gt;
&lt;p&gt;Q: What are the mortality effects of DOD spending, and how do they compare to general demand shocks?
A: A 1 percent increase in DOD spending as a share of local income leads to 2.61 fewer deaths per 100,000 among those aged 45–65 and 8.49 fewer deaths per 100,000 among those over 65, with health-related deaths accounting for the majority of the decline. This implies the DOD must spend approximately $25 million to save a life aged 45–65, exceeding the typical value of a statistical life. By contrast, a general demand shock increases mortality by approximately 100 deaths per 100,000, consistent with Ruhm&amp;rsquo;s (2000) finding that mortality is procyclical; mortality increases from general shocks are also concentrated among those over 45.&lt;/p&gt;
&lt;p&gt;Q: What explains the divergent mortality effects of DOD and general demand shocks?
A: One mechanism explored is pollution: general demand shocks raise median AQI substantially while DOD shocks leave AQI largely unaffected, consistent with Ruhm&amp;rsquo;s (2000) emphasis on deteriorating health behaviors during expansions. The paper also points to differential occupational and geographic composition: DOD shocks flow to construction, manufacturing, and production/maintenance occupations rather than to higher-pollution or higher-accident-risk activities common in broad economic expansions.&lt;/p&gt;
&lt;p&gt;Q: How do the crime effects differ between DOD and general demand shocks?
A: DOD spending shocks are associated with a statistically significant reduction in vehicle theft but no significant change in other crime categories. General demand shocks, by contrast, appear to increase vehicle theft and aggravated assault. Voter turnout falls substantially in response to a general demand shock; both shock types reduce Democratic vote shares.&lt;/p&gt;
&lt;p&gt;Q: What is the key mechanism explaining why DOD shocks have stronger social effects than general demand shocks?
A: Despite similar average earnings effects for no-bachelor&amp;rsquo;s households (0.71 for DOD vs. 0.69 for general shocks), DOD shocks produce a much larger employment rate increase for that group (24.5 vs. 14.3 percentage points). The authors show that this employment margin accounts for large shares of the differential declines in poverty, food stamp receipt, disability, and improvements in marriage rates and occupational prestige.&lt;/p&gt;
&lt;p&gt;Q: What accounts for the differential employment effects on no-bachelor&amp;rsquo;s households between DOD and general demand shocks?
A: Of the 0.21 percentage point differential employment effect, roughly one quarter is associated with differences in the no-bachelor&amp;rsquo;s share across industries. Differences across cities and across occupations each account for much larger shares. DOD shocks are directed toward smaller, lower-income, lower-employment cities with fewer college-educated residents, while general demand shocks go to larger, richer cities with more elastic housing supply and higher education levels.&lt;/p&gt;
&lt;p&gt;Q: Which industries and occupations drive DOD&amp;rsquo;s stronger employment effects for no-bachelor&amp;rsquo;s workers?
A: Within industries, DOD-induced employment gains for no-bachelor&amp;rsquo;s workers are strongest in construction and manufacturing, with much milder effects from general demand shocks in these industries. The occupations benefiting most are military occupations (broadly defined) and Production and Maintenance occupations, which rank among the lowest in occupational prestige for no-bachelor&amp;rsquo;s workers.&lt;/p&gt;
&lt;p&gt;Q: How does DOD spending compare to targeted social programs in achieving distributional goals?
A: The paper argues that although DOD spending is not designed as social policy, its effects on earnings for households without a bachelor&amp;rsquo;s degree, poverty reduction, disability reduction, homeownership, and occupational upgrading mirror the stated objectives of many targeted programs (job training, housing subsidies, SNAP, Medicaid). At the same time, DOD-induced life savings cost approximately $25–45 million per life, exceeding the typical value of a statistical life, so the mortality benefits cannot alone justify the spending.&lt;/p&gt;
&lt;p&gt;Local DOD earnings multiplier: The dollar amount of earnings for a demographic group produced by a dollar of local DOD spending over a two-year period, estimated using a two-year differenced panel regression with CBSA and time fixed effects, instrumented by a Bartik-type shock.&lt;/p&gt;
&lt;p&gt;Bartik-type IV shock: An instrumental variable constructed as the product of a location&amp;rsquo;s average share of DOD contract spending and aggregate contract spending in a given period; used to isolate the component of DOD contracts associated with new production rather than anticipated or smoothed payments.&lt;/p&gt;
&lt;p&gt;General demand shock: A Bartik shift-share shock constructed from local industry employment shares and national industry-level growth rates across all private-sector industries, used as a comparison series to evaluate whether DOD spending effects are generic or specific to defense contracts (correlation with DOD shock: -0.07).&lt;/p&gt;
&lt;p&gt;Extensive margin of employment: The change in the employment rate (entry from unemployment or non-participation into employment) as distinct from hours or wage adjustments among the already-employed; identified in the paper as the primary mechanism linking DOD shocks to differential social outcomes for no-bachelor&amp;rsquo;s households.&lt;/p&gt;
&lt;p&gt;Deaths of despair: Drug-and-alcohol-related deaths and deaths by suicide, following Case and Deaton (2020); examined here at higher frequency as an outcome of labor market earnings changes induced by aggregate demand stimulus.&lt;/p&gt;
&lt;p&gt;Occupational prestige (Siegel prestige score): A summary measure of job quality based on survey-derived perceptions of occupational standing (Siegel 1971), aggregated to the CBSA level by demographic group; used as a measure of upward job-ladder mobility in response to demand stimulus.&lt;/p&gt;
&lt;p&gt;Source text origin: A classification of the text basis for a paper summary — full PDF or OA-HTML versus abstract-only; the pipeline hard-blocks summaries derived solely from abstract text.&lt;/p&gt;</description></item><item><title>Designing Dynamic Reassignment Mechanisms: Evidence from GP Allocation</title><link>https://macropaperwarehouse.com/papers/designing-dynamic-reassignment-mechanisms-evidence-from-gp-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/designing-dynamic-reassignment-mechanisms-evidence-from-gp-allocation/</guid><description>&lt;p&gt;This paper studies the design of dynamic reassignment mechanisms—centralized systems that must not only provide good initial matches but also accommodate changes in agents&amp;rsquo; preferences over time. The empirical setting is Norway&amp;rsquo;s system for allocating patients to general practitioners (GPs), where every individual is assigned a specific GP whose panel has a binding capacity cap. Since 2016, Norway has allowed patients to join waitlists for oversubscribed GPs while retaining their spot on their current GP&amp;rsquo;s panel, with reassignment proceeding strictly first-come, first-served (FCFS) as vacancies arise.&lt;/p&gt;
&lt;p&gt;The paper makes three contributions. First, it provides direct evidence of unrealized gains from trade: in December 2019, 15 percent of the 133,332 patients then standing on waitlists could have been immediately reassigned via a single run of the Top-Trading Cycles (TTC) algorithm, which identifies not only bilateral swaps but arbitrary cycles. A mechanical simulation holding patient choices fixed shows that running TTC monthly from November 2016 through December 2019 would have left 23 percent fewer patients on waitlists by end-2019, with average waiting times among reassigned patients 29 percent shorter.&lt;/p&gt;
&lt;p&gt;Second, the paper introduces a dynamic TTC mechanism and clarifies why static properties do not carry over. In the static case, TTC is both strategy-proof and Pareto-improving (Shapley and Scarf, 1974; Roth, 1982). In a dynamic setting, neither property holds. Repeated TTC is not strategy-proof because patients&amp;rsquo; GP choices affect how long they wait. More importantly, TTC may leave some patients worse off: a panel slot that would have gone to the first person on a waitlist under FCFS may instead go to a later-arriving patient who can form a trading cycle, effectively de-prioritizing patients whose GPs are undersubscribed. In the mechanical simulation, 4.5 percent of patients face longer waiting times under TTC.&lt;/p&gt;
&lt;p&gt;Third, the paper estimates a structural model of patient attention and GP choice using monthly Norwegian administrative data covering 4.78 million patients and 6,470 GP panels (2014–2019), restricting estimation to the Trondelag region (approximately 8 percent of the country). The model specifies: a Poisson attention process (patients consider switching only when an attention shock arrives); preferences over GPs as a function of travel time, GP fixed effects, and match characteristics; and a belief model mapping observed waitlist lengths into expected waiting times. Parameters are recovered via a Gibbs sampler with Metropolis-Hastings for the discount rate. Key estimates: the annual discount factor is approximately 0.91; a female patient under 45 would travel 7.3 minutes farther to see a female GP (6.3 minutes for a female patient over 45); GP fixed effects have a standard deviation of 31 minutes&amp;rsquo; travel-time equivalent; idiosyncratic taste shocks have a standard deviation of 12.6 minutes.&lt;/p&gt;
&lt;p&gt;The paper then simulates a stationary equilibrium for each counterfactual mechanism. Under the status quo in stationary equilibrium, 9.4 percent of patients are on a waitlist, 82.2 percent of GPs have a waitlist, and average expected waiting time is 16.7 months. Introducing TTC reduces average waiting time to 14.1 months and raises mean patient welfare by the equivalent of 0.75 minutes&amp;rsquo; travel time (more than 13 percent of the gain achievable under a no-capacity-constraints benchmark). Over half of this gain (0.4 minutes) comes directly from patients obtaining geographically closer GPs. Benefits are concentrated among younger patients, female patients, and recent movers; rural patients gain 2.1 minutes. However, patients with undersubscribed GPs face waiting times that rise from 16.7 to 22.8 months and are worse off by the perpetuity equivalent of 0.8 minutes.&lt;/p&gt;
&lt;p&gt;Two modified mechanisms are evaluated. Deferred Acceptance (DA), which strictly respects FCFS priority, achieves essentially no improvement over the status quo, illustrating a fundamental trade-off between eliminating envy and exploiting gains from trade. A &amp;ldquo;TTC with Priority&amp;rdquo; (TTCP) mechanism, which gives priority for panel vacancies to patients with undersubscribed GPs before running TTC, achieves 61 percent of TTC&amp;rsquo;s welfare gains (0.46 minutes flow payoff; 1.08 minutes NPV) while leaving patients with undersubscribed GPs no worse off than under the status quo. A benchmark simulation eliminating waitlists altogether raises mean welfare slightly (0.19 minutes) but lowers median welfare (−0.60 minutes), with gains concentrated among highly mismatched patients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core market failure the paper documents?&lt;/strong&gt;
A: Norway&amp;rsquo;s waitlist mechanism assigns panel vacancies strictly first-come, first-served without allowing patients to trade. This creates a &amp;ldquo;double coincidence of wants&amp;rdquo; problem: patients can simultaneously be on each other&amp;rsquo;s waitlists but cannot swap. In December 2019, 15 percent of 133,332 waiting patients could have been immediately reassigned via a single TTC run. A mechanical simulation shows that monthly TTC would have left 23 percent fewer patients on waitlists by end-2019 and reduced average realized waiting times among reassigned patients by 29 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does TTC fail to be strategy-proof in a dynamic setting?&lt;/strong&gt;
A: In the static case, TTC gives every agent an assignment at least as good as their endowment, making truthful reporting a dominant strategy. In a dynamic setting, a patient&amp;rsquo;s choice of GP determines not only which GP they receive but also how long they wait — patients who choose less-demanded GPs reach the front of the waitlist faster. This creates incentives to misreport preferences strategically, breaking strategy-proofness. The paper shows this formally and builds it into the equilibrium model by requiring patients to optimize over both GP choice and expected waiting time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does dynamic TTC harm some patients relative to the status quo?&lt;/strong&gt;
A: Under FCFS, the first person on a waitlist is guaranteed the next available slot on the target GP&amp;rsquo;s panel. Under TTC, a patient who arrived later but whose current GP is oversubscribed can form a trading cycle that redirects that slot, effectively jumping the queue. Patients with undersubscribed GPs — whose panel endowment is not a scarce resource that others want — cannot form cycles and are systematically de-prioritized. In the stationary equilibrium, their expected waiting time rises from 16.7 to 22.8 months, and they are worse off by the perpetuity equivalent of 0.8 minutes&amp;rsquo; travel time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the main parameter estimates and what do they imply?&lt;/strong&gt;
A: The annual discount factor is estimated at approximately 0.91 once GP fixed effects are included (rising to near 0.95 without them, because more desirable GPs have longer waitlists). Gender homophily is worth 6.3–7.3 minutes of travel time for female patients under 45. Age homophily is worth approximately 1 minute. The standard deviation of GP fixed effects is 31 minutes and idiosyncratic shocks are 12.6 minutes, both in travel-time equivalents, indicating substantial horizontal differentiation across GPs and across patients&amp;rsquo; idiosyncratic tastes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How important are moves as a driver of GP switching?&lt;/strong&gt;
A: Moves are the dominant driver. Among non-movers, older men consider switching just once every 25 years; temporary residents consider switching approximately once every 7.5 years (1.084 percent per month). Among patients who moved more than 30 minutes, a temporary resident has an 18.59 percent monthly probability of considering switching in the month of or month after the move. For a permanent resident making a long-distance move, the cumulative attention probability over the 8 months surrounding the move rises to 34 percent (versus 22 percent for a short-distance move). In the data, 26 percent of waitlist users moved municipality during 2017–2019, versus 6 percent of non-switchers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the stationary equilibrium under the status quo look like?&lt;/strong&gt;
A: In the long-run stationary equilibrium, 9.4 percent of patients are on a waitlist, 82.2 percent of GPs have a waitlist, and the average expected waiting time to switch GPs is 16.7 months. Each month, 2,299 patients on average draw attention shocks; 85.2 percent of these choose to join a waitlist, while the remainder either switch to an open GP or stay with their current GP. The average attentive patient expects to successfully obtain their chosen GP after 16.8 months.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the distributional consequences of TTC across patient subgroups?&lt;/strong&gt;
A: Female patients benefit especially because they are more likely to be attentive (and thus use waitlists) than males. Recent movers gain 2.3 minutes&amp;rsquo; travel-time equivalent. Patients who have never moved still gain 1.0 minutes. Rural patients gain 2.1 minutes (larger than average), reflecting their longer baseline travel times and greater geographic mismatch potential. Urban patients also benefit but less so. The one group that is harmed is patients with undersubscribed GPs, who face longer waits and a welfare loss of 0.8 minutes perpetuity equivalent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does the Deferred Acceptance mechanism fail to improve on the status quo?&lt;/strong&gt;
A: DA strictly respects FCFS waiting-time priority: no patient may be reassigned to a GP for whom another patient has been waiting longer. This means DA can only execute swaps in which all patients ahead of each participant on their respective waitlists are also reassigned in the same month. In practice, this virtually never occurs, so DA reassigns almost no patients earlier than the status quo Waitlists mechanism. The result illustrates a fundamental trade-off: fully respecting FCFS priority eliminates nearly all gains from trade.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does TTCP restore fairness while preserving most of the efficiency gains?&lt;/strong&gt;
A: TTCP modifies TTC by prioritizing patients with undersubscribed GPs over those with oversubscribed GPs when assigning panel vacancies, while still respecting the constraint that patients cannot be assigned a GP they prefer less than their current one. This gives patients with undersubscribed GPs a compensating advantage in the queue that offsets their inability to trade via cycles. TTCP achieves 0.46 minutes&amp;rsquo; mean flow payoff improvement versus 0.75 for TTC (61 percent of TTC&amp;rsquo;s gains), and an NPV measure of 1.08 minutes versus 1.25 for TTC. Patients with undersubscribed GPs are left no worse off than under the status quo.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens when waitlists are eliminated entirely?&lt;/strong&gt;
A: Under No Waitlists, attentive patients may only choose among GPs with open panels at the moment of attention. Mean welfare rises slightly (0.19 minutes) because patients spend less time mismatched while waiting, but median welfare falls by 0.60 minutes. The gains are concentrated among a minority of highly mismatched patients who prefer limited choice with no waiting over broader choice with long waits, while most patients prefer the option to wait for a more preferred GP. The authors note this may partly explain why formal waitlists are rare in other primary care systems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the welfare benchmark and how large are the gains?&lt;/strong&gt;
A: The benchmark is a &amp;ldquo;No Caps&amp;rdquo; scenario in which all panel caps are removed, representing the maximum achievable improvement. The mean welfare gain from TTC (0.75 minutes) represents more than 13 percent of this upper bound. The &amp;ldquo;Truthful TTC&amp;rdquo; benchmark, where patients submit full preference lists, yields 1.04 minutes, but its gains are also concentrated: the median patient is no better off than under the status quo Waitlists mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the scope conditions for these findings?&lt;/strong&gt;
A: The demand model is estimated on the Trondelag region of Norway (approximately 8 percent of the national population) over 2017–2019, a period when waitlists were growing rapidly rather than in steady state. Counterfactual comparisons are made in a stationary equilibrium calibrated to Trondelag. The model excludes patients under 16 (whose enrollment is managed by parents). The partially capitated payment structure and fixed panel caps are institutional features specific to Norway, though similar systems exist in Canada, the UK, Italy, and Sweden. GP characteristics are held fixed in the model. The analysis abstracts from health outcomes, focusing on preference-based welfare from GP assignment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Top-Trading Cycles (TTC) algorithm&lt;/strong&gt;: A centralized reassignment algorithm that takes agents&amp;rsquo; preference lists and objects&amp;rsquo; priority lists as inputs, has each agent &amp;ldquo;point to&amp;rdquo; their preferred object and each object &amp;ldquo;point to&amp;rdquo; their highest-priority current or waiting agent, identifies cycles of mutual pointing, and executes the trades in those cycles simultaneously. In the paper&amp;rsquo;s static application, TTC is both Pareto-improving (every participant receives an assignment at least as good as their endowment) and strategy-proof. In the dynamic setting studied here, neither property holds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic TTC mechanism&lt;/strong&gt;: A mechanism that runs the TTC algorithm repeatedly at the end of each period after naturally arising vacancies have been filled from waitlists. Because patients&amp;rsquo; GP choices affect how long they wait — not only which GP they receive — this mechanism is not strategy-proof and may leave patients with undersubscribed GPs worse off than under strictly FCFS waitlists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TTC with Priority (TTCP)&lt;/strong&gt;: A modified version of dynamic TTC that changes the priority ordering so that patients with undersubscribed current GPs are prioritized above patients with oversubscribed GPs when panel vacancies are allocated. This modification preserves patients&amp;rsquo; endowment rights but compensates the group harmed by standard TTC. In the paper&amp;rsquo;s simulations, TTCP achieves 61 percent of TTC&amp;rsquo;s mean welfare gains while leaving patients with undersubscribed GPs no worse off than under the status quo.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Patient attention model&lt;/strong&gt;: A model in which patients consider switching GPs only when they receive a Poisson-distributed attention shock. Attention rates vary by observable characteristics (age, gender, temporary vs. permanent residency, whether and how far the patient recently moved). The model interprets any switch request as evidence of both an attention shock and a preference for the requested GP over the current one. Patients who do not request switches may be either inattentive or attentive but satisfied.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Horizontal differentiation (GP preference heterogeneity)&lt;/strong&gt;: The extent to which different patients prefer different GPs for reasons unrelated to overall GP quality — primarily driven by geographic proximity, gender homophily (worth 6.3–7.3 travel-time-equivalent minutes for young female patients), and age similarity (approximately 1 minute). Horizontal differentiation is the fundamental source of gains from trade: if all patients preferred the same GP, there would be no mutual-benefit swaps to find.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deferred Acceptance (DA) algorithm&lt;/strong&gt;: The patient-proposing DA algorithm, which strictly respects FCFS waiting-time priority: no patient may be reassigned ahead of another patient who has been waiting longer for the same GP. In the dynamic context, DA achieves essentially no welfare improvement over the status quo because its strict respect for priority eliminates nearly all trading opportunities, illustrating the trade-off between envy-freeness and efficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Double coincidence of wants&lt;/strong&gt;: The situation in which two (or more) patients are simultaneously on each other&amp;rsquo;s waitlists and would mutually benefit from trading GP assignments, but cannot do so under the current mechanism because there is no vacancy on either panel. The paper&amp;rsquo;s direct evidence of this phenomenon — 15 percent of waiters could be immediately reassigned via one TTC run — motivates the counterfactual analysis.&lt;/p&gt;</description></item><item><title>Destabilizing Capital Flows amid Global Inflation</title><link>https://macropaperwarehouse.com/papers/destabilizing-capital-flows-amid-global-inflation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/destabilizing-capital-flows-amid-global-inflation/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bengui and Coulibaly ask whether the pattern of capital flows observed during the 2021–2023 global monetary tightening cycle — whereby capital flowed from low-inflation to high-inflation countries — was a stabilizing or destabilizing force for the global economy&amp;rsquo;s adjustment to cost-push shocks. Among the G7 and a broader sample of 26 jurisdictions, those with higher average CPI inflation (October 2021–March 2023) and larger cumulative interest rate hikes ran more negative current account balances over the same period, with the slope of the cross-sectional relationship between cumulative hikes and the current account equal to −1.29 (significant at 1%) and the slope between average inflation and the current account equal to −0.99 (significant at 1%), and over 75% of the top two quartile hikers running deficits while over 75% of the bottom two quartiles ran surpluses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors build a standard continuous-time two-country general equilibrium model with nominal rigidities (Calvo price-setting), internationally traded bonds, and cost-push shocks modeled as wage markup shocks that create an output-inflation trade-off. The baseline model features no home bias (equal weights on domestic and foreign goods) and two tradable goods. Extensions introduce (i) consumption home bias (parameter α ∈ [0, 1/2]) and (ii) non-tradable goods. Policy is analyzed under two regimes: (a) free capital mobility (no taxes on financial transactions) with optimal cooperative monetary policy, and (b) a managed capital flow regime in which a planner jointly optimizes both monetary policy and a tax wedge on the international bond (τ^D_t). A second-order approximation of household utility yields a loss function penalizing world and cross-country output gaps, PPI inflation differentials, and the demand imbalance term θ_t. The quantitative section replaces optimal monetary policy with standard Taylor rules (φ_π = 1.5, φ_y = 0.25) and calibrates a Home cost-push shock to generate a peak CPI inflation rate of about 7%, with an annual autocorrelation of 0.65.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central theoretical result (Proposition 2, &amp;ldquo;Topsy-Turvy Capital Flows&amp;rdquo;) is that, under the Marshall-Lerner condition (trade elasticity η &amp;gt; 1), a free capital mobility regime channels capital into the country with the most acute inflationary pressures — the very country whose central bank is most aggressively tightening — while the constrained-efficient managed regime would channel capital in the opposite direction. The mechanism operates through the supply side: capital inflows raise domestic households&amp;rsquo; wealth, reducing their labor supply and thereby raising real wages and firms&amp;rsquo; marginal costs. In the presence of non-tradable goods, an additional channel operates through the real exchange rate — capital inflows appreciate the domestic real exchange rate and inflate tradable-sector firms&amp;rsquo; marginal costs independently of labor supply. Both channels worsen the central bank&amp;rsquo;s output-inflation trade-off.&lt;/p&gt;
&lt;p&gt;In the quantitative exercise (Taylor rule setting, home bias α = 0.25, trade elasticity χ = 3), following the calibrated inflationary cost-push shock in Home:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Under &lt;strong&gt;free capital mobility&lt;/strong&gt;: Home inflation rises to 8% on impact; Home output gap reaches −8.4%; Foreign output gap reaches +2.4%; Home runs a trade deficit of 2.5% of GDP on impact; Home&amp;rsquo;s initial policy rate hike is nearly 10% while Foreign&amp;rsquo;s is less than 1%.&lt;/li&gt;
&lt;li&gt;Under the &lt;strong&gt;managed capital flow regime&lt;/strong&gt; (capital flows reversed to outflows from Home): Home inflation on impact falls to nearly 6% (a reduction of approximately 2 percentage points); Home output gap is −6.8% (improvement of about 1.5 percentage points); Foreign output gap is 0.8% (improvement of about 1.5 percentage points); Home runs a trade surplus of 0.6% of GDP; Home&amp;rsquo;s initial hike falls to approximately 8% (roughly 2 percentage points lower) while Foreign&amp;rsquo;s rises to approximately 2.5% (roughly 1.5 percentage points higher).&lt;/li&gt;
&lt;li&gt;The managed regime delivers average welfare gains of &lt;strong&gt;0.78% of current consumption (0.03% of permanent consumption)&lt;/strong&gt;. Welfare gains are increasing in the trade elasticity η: at η = 10 (consistent with Yi 2003&amp;rsquo;s bilateral trade flow estimates), gains reach approximately 0.08% of permanent consumption or 1.9% of current consumption.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The topsy-turvy result (free mobility channels capital in the wrong direction) holds conditional on the Marshall-Lerner condition (η &amp;gt; 1 in the baseline; equivalently, the trade elasticity χ &amp;gt; 1). With consumption home bias, the condition weakens to: the trade elasticity exceeds the degree of home bias (χ &amp;gt; 1 − 2α, which is weaker than Marshall-Lerner). When home bias is strong relative to the trade elasticity, a purchasing power effect may dominate the wealth effect, and free capital mobility may instead deliver too little capital flow toward the depressed country — the opposite inefficiency. The welfare analysis throughout assumes symmetric initial net foreign asset positions. The key insight is specific to environments in which monetary policy faces an output-inflation trade-off from cost-push shocks; it is directionally opposite to the aggregate demand externality prescription that arises in demand-shortage environments (e.g., currency unions with productivity shocks), where optimal policy instead calls for capital to flow toward the more depressed country.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the empirical motivation for the paper, and how is the stylized fact documented?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: During October 2021–March 2023, jurisdictions with higher average CPI inflation and larger cumulative policy rate hikes ran more negative current account balances. The cross-sectional slope between average inflation and the current account-to-GDP ratio is −0.99 (R² = 0.22, significant at 1%), while the slope between cumulative hikes and the current account is −1.29 (R² = 0.27, significant at 1%). Among the top two quartiles of cumulative hikers, over 75% of jurisdictions ran current account deficits, while among the bottom two quartiles over 75% ran surpluses. Data come from the BIS (inflation and policy rates) and the OECD Main Economic Indicators (quarterly current accounts), covering 26 jurisdictions excluding Argentina, Russia, and Turkey.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the core externality the paper identifies, and why do atomistic agents fail to internalize it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: When a household in the high-inflation country borrows from abroad for consumption smoothing (as the domestic central bank tightens), it raises domestic consumption and thereby reduces labor supply through a wealth effect, pushing up real wages and firms&amp;rsquo; marginal costs. The central bank must then tighten further to achieve the same inflation stabilization, or accept a worse inflation outcome. Because this effect operates through economy-wide wages and prices (general equilibrium), atomistic households do not internalize it when making individual borrowing decisions. The paper shows formally that a marginal increase in Home borrowing dθ_t raises welfare losses by an amount proportional to the product of the Phillips curve slope κ, the co-state variable φ^D_t (equal to the cross-country output gap differential y^D_t under optimal monetary policy), and the direct effect on cross-country marginal cost differences (1/2). When output is more depressed in Home (y^D_t &amp;lt; 0), additional borrowing by Home tightens the constraint and lowers welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What does the optimal capital flow management targeting rule say, and what is its economic interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: Proposition 1 states that under jointly optimal monetary and capital flow management, the demand imbalance (relative consumption) should satisfy θ_t = 2y^D_t. This means the planner generates a demand imbalance in favor of the less depressed country, reallocating spending away from the country with the most acute inflationary pressure. This is counterintuitive from a pure output stabilization view: policy deliberately shifts demand away from the country with the most depressed output. The logic is that reducing the domestic wealth of the high-inflation country lowers real wages, reduces firms&amp;rsquo; marginal costs, and thereby relaxes the output-inflation trade-off for that country&amp;rsquo;s central bank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the &amp;ldquo;topsy-turvy&amp;rdquo; capital flows result (Proposition 2), and under what condition does it hold?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: Under free capital mobility, standard neoclassical consumption-smoothing motives lead capital to flow into the country with the most depressed output (the high-inflation country): the trade deficit equals [(η−1)/η]·y^D_t. Under managed capital flows, the optimal regime instead mandates a trade surplus for the most depressed country: the trade balance equals −(1/η)·y^D_t. Comparing signs, the direction of capital flows is literally reversed — hence &amp;ldquo;topsy-turvy.&amp;rdquo; The result holds whenever Assumption 1 (η &amp;gt; 1, the Marshall-Lerner condition in the baseline model) is satisfied, which the authors argue has compelling empirical support (trade elasticities estimated at 7–17 in the literature).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the presence of home bias in consumption affect the externality and the topsy-turvy result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: With home bias (α &amp;lt; 1/2), capital inflows also appreciate the terms of trade, which lowers the relative price of imports in terms of domestic goods and reduces marginal costs for domestic tradable firms — a &amp;ldquo;purchasing power effect&amp;rdquo; that partially offsets the wealth effect. The optimal capital flow targeting rule becomes θ_t = [1 − (1−2α)/(2(1−α)η)]·2y^D_t. Under the condition that the trade elasticity exceeds the degree of home bias (χ &amp;gt; 1 − 2α, strictly weaker than Marshall-Lerner), the wealth effect dominates the purchasing power effect and the topsy-turvy result is preserved. Below a knife-edge curve in the (α, η) parameter space, the purchasing power effect dominates and free capital mobility results in too little rather than too much capital flowing toward the high-inflation country.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Does the externality always imply excessive capital flow volatility?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: No — this is a novel contribution relative to the prior literature. In the limiting case of a unit intratemporal elasticity (η → 1, the Cole-Obstfeld case), trade is balanced at all times under free capital mobility. Under managed capital flows, however, capital should flow from the most depressed to the least depressed country. This means the externality can result in too little rather than too much capital flow. The standard normative literature (e.g., Bianchi 2011) has focused on excessive capital flow volatility; the supply-side channel identified here shows that market failures can sometimes lead to insufficient external imbalances.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the paper&amp;rsquo;s mechanism differ from aggregate demand externalities as in Farhi and Werning (2016)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: Farhi and Werning (2016) study demand-shortage environments (fixed exchange rates or zero lower bound) where constraints on monetary policy mean output is demand-constrained. Their prescription is to channel capital toward the most depressed country to stimulate demand for undersupplied goods. In Bengui and Coulibaly, monetary policy is unconstrained but faces an output-inflation trade-off from cost-push shocks. Here, the depressed output reflects the central bank&amp;rsquo;s deliberate demand contraction to fight inflation, not an inability to stimulate. The optimal response is therefore to shift spending away from the high-inflation (most depressed) country to reduce supply pressure — the opposite direction. Formally, in the demand-shortage case with unit elasticity and home bias, the optimal trade balance targeting rule is nxt = [(1−2α)/(4(1−α))]·ỹ^D_t (trade deficit for most depressed country), while in the supply pressure case it is nxt = −[α/(1−α)]·y^D_t (trade surplus for most depressed country).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What does the non-tradable goods extension add to the baseline mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The baseline model (two tradable goods, no home bias) transmits the externality only through the wealth effect on labor supply: capital inflows raise consumption, reduce labor supply, and raise real wages and marginal costs. In the non-tradable goods extension, a second channel operates through the real exchange rate. Capital inflows raise demand for non-tradable goods, appreciating the domestic real exchange rate and inflating the price of the consumption basket relative to domestically produced tradable goods. This raises marginal costs for tradable-sector firms independently of any labor supply response, and is therefore unaffected by whether preferences exhibit a wealth effect on labor supply. The paper shows that the optimal policy problem in this extension is isomorphic to the baseline: the loss decomposition (equation 42) yields two additive terms proportional to the share of tradable goods (wealth effect on labor supply) and the share of non-tradable goods (wealth effect on demand for non-tradables), respectively.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the quantitative exercise show about cross-country policy rate dispersion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: Under free capital mobility with Taylor rules, the initial policy rate hike in Home following the calibrated shock is nearly 10%, while in Foreign it is less than 1% — a cross-country dispersion of roughly 9 percentage points. Under managed capital flows, Home&amp;rsquo;s initial hike falls to approximately 8% and Foreign&amp;rsquo;s rises to approximately 2.5% — a dispersion of roughly 5.5 percentage points. The authors interpret this as evidence that free capital mobility leads high-inflation countries to tighten excessively and low-inflation countries to tighten too little, generating an inefficiently large cross-country dispersion in monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the welfare gain from managed capital flows vary with the trade elasticity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: Welfare gains are increasing in the elasticity of substitution between domestic and foreign goods (η). At the baseline calibration of η = 2 (trade elasticity χ = 3, near the lower bound of empirical estimates), the gain is 0.78% of current consumption (0.03% of permanent consumption). At η = 10 (consistent with Yi 2003&amp;rsquo;s estimate needed to match bilateral trade flows), the gain rises to approximately 1.9% of current consumption (0.08% of permanent consumption). The welfare gain is defined as the percentage increase in permanent consumption required by a household under free capital mobility to be as well off as under managed capital flows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the role of Lemma 1 (irrelevance of capital flow regime for world variables)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: Lemma 1 shows that under optimal cooperative monetary policy, the paths of world output gap and world inflation are independent of the capital flow regime (i.e., independent of the path of θ_t). This follows because the &amp;ldquo;world&amp;rdquo; block of the model can be solved independently of the &amp;ldquo;difference&amp;rdquo; block and the demand imbalance. As a result, the entire normative analysis of capital flows reduces to the behavior of cross-country difference variables (y^D_t, π^D_t, and θ_t), greatly simplifying the analysis. It also implies that switching capital flow regimes does not affect the global total of output or inflation, only its distribution across countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What extensions do the authors suggest would enrich the analysis without invalidating the main insight?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: Three extensions are noted. First, additional monetary policy constraints — discretionary (non-commitment) policy, non-cooperative policy setting, or a currency union — would introduce extra stabilization constraints and generate additional terms in the capital flow management targeting rule but would not overturn the supply-side channel. Second, alternative goods pricing specifications (local currency pricing, deviations from the law of one price) would make additional variables like cross-country consumer price differentials relevant measures of policy tightness, again adding terms to the rule. Third, the insight is argued to apply more generally in heterogeneous-agent or multi-sector closed-economy models with nominal rigidities whenever private financial decisions affect the economy&amp;rsquo;s supply side through general equilibrium price effects.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Cost-push shock (wage markup shock):&lt;/strong&gt; In the paper&amp;rsquo;s model, a cost-push shock is a positive deviation of the wage markup (µ^w_t) from its steady-state value. It shifts the New Keynesian Phillips curve, creating an output-inflation trade-off: the central bank must accept either higher inflation or a larger negative output gap. It is not a demand shock; its policy implications are directionally opposite to demand shortage shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand imbalance (θ_t):&lt;/strong&gt; The log ratio of Home to Foreign consumption, defined as c_t − c^*_t = θ_t in the linearized model. Under free capital mobility and symmetric initial wealth, θ_t = 0 (consumption shares are equalized). Under managed capital flows, θ_t is the instrument of capital flow policy: setting θ_t &amp;gt; 0 shifts spending toward Home; θ_t &amp;lt; 0 shifts it toward Foreign. The loss function penalizes deviations of θ_t from zero as an independent inefficiency (cross-country consumption misallocation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Topsy-turvy capital flows:&lt;/strong&gt; The paper&amp;rsquo;s central finding that, following a cost-push shock, the direction of capital flows prescribed by constrained-efficient policy is opposite to the direction that free capital mobility generates. Under free mobility, capital flows into the high-inflation country (trade deficit there); under managed flows, capital should flow out of the high-inflation country (trade surplus there). The term is used to describe the directional reversal, not merely excessive magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Macroeconomic externality (supply-side):&lt;/strong&gt; The failure of atomistic agents to internalize the general equilibrium effect of their borrowing decisions on domestic firms&amp;rsquo; marginal costs (via real wages or the real exchange rate). This is the paper&amp;rsquo;s label for the source of inefficiency. It is classified as a supply-side externality to distinguish it from aggregate demand externalities (Farhi and Werning 2016), where the operative mechanism runs through demand for specific goods rather than through factor costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trade elasticity (χ):&lt;/strong&gt; In the baseline model, χ = η (elasticity of substitution between domestic and foreign tradable goods). With home bias, χ = 2(1−α)η. The trade elasticity plays the key role in determining whether the topsy-turvy result holds: the result requires χ &amp;gt; 1 (Marshall-Lerner in baseline) or, with home bias, χ &amp;gt; 1 − 2α (weaker condition). At χ = 1 (Cole-Obstfeld case), trade is balanced under free mobility, and managed flows call for capital to move from the most to the least depressed country — implying insufficient rather than excessive capital flows under free mobility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Purchasing power effect:&lt;/strong&gt; In the model with home bias, a capital inflow appreciates the terms of trade (the relative price of exports over imports), which raises the purchasing power of domestic firms and lowers their marginal costs. This effect partially offsets the wealth-effect-driven rise in marginal costs. Its strength is proportional to the degree of home bias (1−2α) relative to the trade elasticity 2(1−α)η. Under the paper&amp;rsquo;s weaker-than-Marshall-Lerner condition, the wealth effect dominates the purchasing power effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Managed capital flow regime:&lt;/strong&gt; A policy regime in which the government imposes taxes on international financial transactions (τ_t for Home, τ^&lt;em&gt;_t for Foreign) to control the demand imbalance θ_t, subject to the targeting rule θ_t = 2y^D_t (or its home-bias-adjusted counterpart). This regime accounts for the macroeconomic externality and delivers a constrained-efficient allocation given the presence of nominal rigidities. The tax wedge τ^D_t = (τ_t − τ^&lt;/em&gt;_t)/2 represents the gap in returns on the international bond faced by Home versus Foreign households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;World and difference formulation:&lt;/strong&gt; Following Engel (2011) and Groll and Monacelli (2020), the model is decomposed into &amp;ldquo;world&amp;rdquo; variables (averages: y^W_t, π^W_t) and &amp;ldquo;difference&amp;rdquo; variables (cross-country gaps: y^D_t, π^D_t). The targeting rules and Phillips curves separate additively into world and difference blocks, and Lemma 1 establishes that the capital flow regime affects only the difference block. This decomposition is the analytical device that isolates the role of capital flows.&lt;/p&gt;</description></item><item><title>Devaluations, Deposit Dollarization, and Household Heterogeneity</title><link>https://macropaperwarehouse.com/papers/devaluations-deposit-dollarization-and-household-heterogeneity/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/devaluations-deposit-dollarization-and-household-heterogeneity/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Ferrante and Gornemann study the aggregate and redistributive effects of currency devaluations in emerging market economies, focusing on a feature that prior open-economy HANK models had not jointly incorporated: households hold dollar-denominated deposits that are disproportionately concentrated among wealthier agents, and these deposits sit on the liability side of leveraged, agency-constrained banks. The paper asks how this combination of deposit dollarization and household wealth heterogeneity shapes the macroeconomic and distributional consequences of a currency depreciation, and what it implies for the optimal degree of exchange-rate smoothing by the central bank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Motivation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is calibrated to match cross-sectional micro-data from the 2013 Uruguayan Household Financial Survey, which records the currency denomination of household assets and liabilities. As documented by Drenik et al. [2018] and confirmed by the authors for Uruguay, the top quintile of the wealth distribution holds close to 70% of liquid savings in dollars, while households with zero or negative net wealth have essentially no direct foreign-currency exposure. The baseline calibration targets a deposit dollarization rate of 40% of aggregate bank deposits, in line with the cross-country average reported for Latin America. The spread between bank lending and deposit rates is calibrated at 8% annualized for household loans (consistent with Uruguayan bank data over the prior 15 years) and 2% for capital returns, implying a bank leverage ratio of approximately 6.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The framework is a small open economy New Keynesian model with two non-standard elements layered on a Bewley-Huggett-Aiyagari incomplete-markets household sector. First, households face idiosyncratic labor productivity risk and a borrowing constraint, generating a non-degenerate wealth distribution in which, at the calibrated steady state, approximately 8% of households are constrained borrowers, 22% are unconstrained borrowers, 27% hold zero liquid wealth and behave hand-to-mouth (HtM), 52% are net savers, and 1% are capitalists. Second, financial intermediaries face a Gertler-Karadi [2011] agency problem that generates an endogenous, time-varying spread between lending and deposit rates. Households can save in local- or foreign-currency bank deposits and in foreign bonds, but can only borrow through domestic banks. The currency composition of household portfolios, which is a linear function of household wealth in the baseline, maps through market clearing into the banks&amp;rsquo; currency mismatch, so that a wealthier-household preference for dollar deposits directly determines the bank&amp;rsquo;s foreign-currency liability share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central experiment is a 100 basis-point annualized increase in the foreign interest rate with persistence 0.85, which induces a currency depreciation.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Aggregate amplification&lt;/em&gt;: Combining a HANK household sector with leverage-constrained banks exposed to currency mismatch causes aggregate consumption to drop approximately twice as much as in a representative-agent New Keynesian (RANK) model with constrained banks, and output to decline more than 1% — roughly 30% larger than the 0.75% decline in the RANK model with financial frictions. In contrast, absent banking frictions, a bank-less HANK model would generate an output expansion because the standard expenditure switching channel dominates.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Channels&lt;/em&gt;: The paper decomposes the consumption decline into (a) a labor income channel — lower hours and wages caused by the financial accelerator contraction account for approximately two-thirds of the aggregate consumption decline — and (b) a borrowing rate channel — the endogenous rise in household lending spreads accounts for approximately one-third. In a counterfactual model in which the spread on household loans is held fixed, the decline in consumption and output is approximately 50% smaller than in the baseline, confirming that the borrowing rate channel and its general-equilibrium feedback onto wages and asset prices are responsible for more than half of the baseline output decline.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Distributional effects&lt;/em&gt;: Within the baseline model, unconstrained borrowers see their consumption fall on average by more than 3.5% on impact; constrained borrowers&amp;rsquo; consumption falls by more than 5% in the second period as interest payments jump. Zero-wealth HtM agents cut consumption roughly one-for-one with the more-than-2% decline in real labor income. Wealthier savers and capitalists are partially insulated through their dollar holdings, which gain real value during the depreciation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Portfolio composition and deposit dollarization&lt;/em&gt;: When the deposit dollarization rate is raised from the baseline 40% to 80% (to match high-dollarization countries such as Uruguay at the extreme), investment declines approximately 12% (versus 6% in the baseline) and aggregate consumption falls approximately 1.7% (versus 1% in the baseline), with the output decline more than twice as large as in the baseline. Wealthier households&amp;rsquo; consumption path is actually higher in the high-dollarization calibration because of larger windfall gains on their dollar portfolios, while poorer households bear the amplified downturn through stronger labor income and borrowing rate channels. This produces a novel distributional result: stronger currency hedging by richer households deepens the aggregate recession and worsens outcomes for poorer agents.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Monetary policy&lt;/em&gt;: In the baseline 40% dollarization calibration, reacting to exchange rate changes by raising domestic interest rates is welfare-detrimental for most households: the gain from partially stabilizing banks&amp;rsquo; balance sheets is more than offset by the contractionary effect of higher rates on aggregate demand and spreads. A modest response (κ_e ≈ 0.04 in the ex-ante welfare experiment) is preferred, conditional on aggregate dynamics. When dollarization is 80%, a small degree of exchange rate leaning (κ_e = 0.5) can improve welfare for most agents, as the benefit from protecting banks&amp;rsquo; balance sheets becomes larger relative to the cost of tighter monetary conditions.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What three stylized facts about liability dollarization motivate the model, and how does the model&amp;rsquo;s structure capture each?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The three facts are: (i) banks and firms borrow in foreign currency; (ii) foreign-currency bank debt is matched by dollar-denominated deposits from domestic households; (iii) those deposits are held predominantly by wealthier households. The model captures (i) and (ii) by having the bank hold a currency mismatch on its balance sheet — local-currency loans on the asset side, foreign-currency deposits on the liability side. Fact (iii) is captured by assuming a linear portfolio rule in which household dollar deposit share is an increasing function of wealth, calibrated to the slope observed in Uruguayan micro-data, with borrowers restricted to local-currency debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does a bank-less HANK open-economy model produce an output expansion rather than a contraction following a foreign interest rate shock in the calibration used?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Without banking frictions, the expenditure switching channel dominates. A rise in the foreign interest rate depreciates the real exchange rate by roughly 1%, making domestic goods cheaper and raising exports by approximately 2%. In the bank-less HANK, this export boost causes hours and real labor income to increase, and high-MPC households (HtM and constrained borrowers) raise consumption. There is no financial accelerator operating through the bank&amp;rsquo;s balance sheet to offset this stimulus, so output expands rather than contracts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Through what exact mechanism does bank currency mismatch transform an exchange rate depreciation into a financial accelerator event?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: A weaker domestic currency raises the real cost of repaying foreign-currency deposits (R_Dt jumps on impact), directly eroding bank net worth (N_t). As net worth falls and leverage rises, the bank&amp;rsquo;s incentive constraint tightens, requiring spreads on both capital loans and household loans to increase jointly (per equation 21, the ratio of spreads moves one-for-one with the ratio of diversion parameters). Lower asset prices further reduce the return on capital, feeding back into net worth in the standard Gertler-Karadi financial accelerator loop. In the RANK with banks benchmark, investment declines approximately 6% compared to only 1% in the frictionless RANK.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the borrowing rate channel, and how is it distinct from the balance-sheet exposure channel studied in De Ferra et al. [2020]?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The borrowing rate channel operates through the endogenous widening of bank lending spreads following a net worth erosion: when banks&amp;rsquo; leverage constraint binds more tightly, both the spread on firm capital and the spread on household loans rise simultaneously (equation 21). This forces even households who borrow only in local currency — and thus have no direct exchange-rate exposure on their liabilities — to face sharply higher borrowing costs, causing their consumption to fall steeply. De Ferra et al. [2020] study a different channel in which households borrow in foreign currency and suffer a direct balance-sheet loss from depreciation; the borrowing rate channel in this paper is distinct because it operates through financial intermediary frictions rather than through direct currency exposure of household debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How much of the aggregate consumption decline is attributable to the borrowing rate channel versus the labor income channel, and how do the authors establish these shares?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The decomposition exercise (Figure 6) simulates each household&amp;rsquo;s response to a single price path at a time while holding all other prices at steady state. The labor income channel — the decline in real wages and hours caused by the contraction in output — accounts for approximately two-thirds of the aggregate consumption decline. The borrowing rate channel accounts for approximately one-third. Separately, a counterfactual model in which the household loan spread is held fixed produces consumption and output declines roughly 50% smaller than the baseline, showing that the borrowing rate channel and its second-round effects on wages and asset prices together account for more than half of the output decline in general equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the distribution of dollar deposits across the wealth distribution affect the severity of the downturn, and what is the novel redistribution result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: Through market clearing for local-currency deposits (equation 44), a larger household demand for dollar deposits directly raises the bank&amp;rsquo;s foreign-currency liability share (x^D_bt), magnifying the bank&amp;rsquo;s currency mismatch. Raising the deposit dollarization rate from 40% to 80% causes bank net worth to decline twice as much as in the baseline, investment to fall roughly 12% versus 6%, and aggregate consumption to fall roughly 1.7% versus 1%, with output declining more than twice as much. The novel distributional result is that wealthier savers and capitalists are actually better off in the high-dollarization scenario because their windfall dollar gains are larger, while poorer households suffer a more severe recession through the labor income and borrowing rate channels. Hence, stronger currency hedging by the rich deepens the aggregate recession and worsens distributional outcomes for the poor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What happens when borrowers are assumed to hold foreign-currency debt rather than local-currency debt, as in De Ferra et al. [2020]?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: In this alternative calibration, borrowers face a direct balance-sheet loss from depreciation, causing constrained borrowers&amp;rsquo; consumption to drop more steeply on impact. However, since household loans represent only approximately 5% of annual GDP in the baseline, the boost to bank net worth from having dollar-denominated loan assets is modest compared to the reduction in the dollar deposit liability. As a result, the path for investment is very similar to the baseline, while on impact consumption drops about 20% more and output declines about 10% more than in the baseline model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What welfare implications arise from removing dollar deposits entirely from savers&amp;rsquo; portfolios?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: In a calibration where households hold only local-currency assets (with banks&amp;rsquo; currency mismatch maintained through external dollar borrowing), savers lose their windfall dollar gains during depreciation. The consumption of savers drops about 25% more than in the baseline on impact, and capitalists experience even larger changes. Because of general equilibrium feedback through wages and prices, poorer households also cut consumption more, causing aggregate consumption to fall approximately 20% more than in the baseline and output to decline approximately 5% more on impact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Under what dollarization conditions does exchange rate stabilization through monetary tightening improve welfare, and why?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: Under the baseline 40% dollarization, raising domestic interest rates in response to depreciation is welfare-detrimental for most households because higher rates depress asset prices, tighten the bank&amp;rsquo;s leverage constraint, worsen the borrowing rate channel and the labor income channel for low-net-worth agents, more than offsetting the benefit from partially stabilizing the bank&amp;rsquo;s balance sheet. Only a very modest response (κ_e ≈ 0.04) is preferred. When deposit dollarization is 80%, the benefit from protecting the bank&amp;rsquo;s balance sheet is proportionally larger; a moderate reaction (κ_e = 0.5) can improve welfare for most households, though further tightening (κ_e = 5) causes bank net worth to fall more than 20% and leads to a deeper recession, reversing the gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the quarterly average MPC in the model compare to external estimates, and why is the MPC distribution central to the paper&amp;rsquo;s mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: The quarterly average MPC in steady state is approximately 27%, which implies an annual MPC of approximately 71%, consistent with Hong [2020b]&amp;rsquo;s estimates for Peru. The MPC distribution is central because the amplification mechanisms — both the borrowing rate channel and the labor income channel — work by hitting high-MPC agents (HtM households and constrained borrowers) hardest. Without a sufficiently high mass of high-MPC agents, changes in spreads and labor income would have muted aggregate consumption effects. The presence of approximately 27% of households with zero liquid wealth at the borrowing spread is itself endogenously generated by the bank&amp;rsquo;s agency problem, which creates a wedge between saving and borrowing rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the HANK model without banks compare to the RANK model without banks in transmitting the foreign interest rate shock?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: Both HANK-without-banks and RANK-without-banks generate output expansions through the expenditure switching channel. However, in the bank-less HANK, aggregate consumption declines only half as much as in the frictionless RANK because high-MPC households amplify the positive real income effect from rising labor income. Some household groups (HtM agents and constrained borrowers) actually increase consumption on impact due to higher real labor income, the Fisher channel reducing the real value of domestic-currency debt, and portfolio gains for savers holding dollar assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What role does the monetary policy Taylor rule play during the baseline devaluation, and how does it interact with the financial accelerator?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The standard Taylor rule (coefficient 1.5 on domestic inflation) causes the central bank to raise rates in response to the CPI inflation spike accompanying the depreciation. Higher domestic rates compress the real exchange rate depreciation and reduce the boost to exports, but also directly increase banks&amp;rsquo; funding costs, contributing to the financial accelerator by compressing the return on capital. This interaction means that the baseline monetary policy passively amplifies the banking-sector contraction relative to a model with no monetary response.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Deposit dollarization&lt;/strong&gt;: The share of domestic bank deposits denominated in foreign currency, held by domestic households. In the paper&amp;rsquo;s calibration this is set at 40% of aggregate bank deposits (baseline) or 80% (high-dollarization alternative), reflecting the empirical range across Latin American countries. It determines the bank&amp;rsquo;s foreign-currency liability share and thus the severity of currency mismatch.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Currency mismatch (banks)&lt;/strong&gt;: The gap between the currency denomination of a bank&amp;rsquo;s assets (local-currency loans to households and firms) and its liabilities (foreign-currency deposits from households). In the model, when the domestic currency depreciates the real cost of dollar deposits rises, directly eroding bank net worth without any offsetting appreciation of loan assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Borrowing rate channel&lt;/strong&gt;: The mechanism by which a decline in bank net worth, caused by currency mismatch losses, tightens the bank&amp;rsquo;s incentive constraint and forces up the spread on household loans. This raises borrowing costs for households who have no direct foreign-currency exposure on their balance sheets, causing high-MPC borrowers to cut consumption sharply and thereby depressing aggregate demand and wages. This channel is distinct from the direct balance-sheet channel studied in De Ferra et al. [2020].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor income channel (in an open economy with banking frictions)&lt;/strong&gt;: The mechanism by which the financial accelerator — reduced credit supply and lower capital demand following bank net worth erosion — depresses output, hours, and wages, causing a decline in real labor income that hits high-MPC workers regardless of their asset-portfolio currency composition. Accounts for approximately two-thirds of the aggregate consumption decline in the baseline experiment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hand-to-mouth (HtM) agents&lt;/strong&gt;: In this paper&amp;rsquo;s setting, HtM behavior is not a permanent household state but arises endogenously for households who hold zero liquid wealth because the bank&amp;rsquo;s endogenous lending spread makes both saving and borrowing suboptimal for them in a given period. Their consumption moves approximately one-for-one with current labor income, making them a key amplifier of real income fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial accelerator (with currency mismatch)&lt;/strong&gt;: The Gertler-Karadi [2011] mechanism as augmented by exchange-rate exposure: a currency depreciation erodes bank net worth through the dollar deposit liability, tightening the leverage constraint, raising spreads on capital and household loans simultaneously, lowering the price of capital, further reducing net worth, and feeding back to reduce credit supply. The currency mismatch channel and the asset-price channel interact to amplify the initial shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Portfolio dollarization rule&lt;/strong&gt;: The assumption that each household&amp;rsquo;s share of savings held in foreign-currency deposits is a linear function of net wealth (x_i = λ_bar + λ·b_i, with λ &amp;gt; 0 and x_i = 0 for borrowers). This rule is calibrated to match the wealth-gradient of dollar holdings in the 2013 Uruguayan Household Financial Survey, and through market clearing it pins down the aggregate bank deposit dollarization rate and the distributional exposure of households to exchange rate shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange rate stabilization trade-off&lt;/strong&gt;: The central bank&amp;rsquo;s choice of how much to raise domestic interest rates in response to a depreciation (parameterized by κ_e in the augmented Taylor rule). A higher κ_e reduces the bank&amp;rsquo;s currency mismatch loss but simultaneously depresses asset prices and raises borrowing costs, potentially worsening the financial accelerator. The paper shows the net welfare effect depends critically on the level of deposit dollarization: at 40% dollarization aggressive leaning is harmful for most agents; at 80% dollarization a moderate response (κ_e = 0.5) can be welfare improving.&lt;/p&gt;</description></item><item><title>Digital Distractions with Peer Influence</title><link>https://macropaperwarehouse.com/papers/digital-distractions-with-peer-influence/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/digital-distractions-with-peer-influence/</guid><description>&lt;p&gt;This paper estimates the causal effects of mobile app usage on college students&amp;rsquo; academic performance, physical health, and labor market outcomes, while separately identifying behavioral (endogenous) and contextual (exogenous) peer effects in app usage — the first study to do so within a unified empirical framework. The analysis draws on administrative data for three freshman cohorts (2018–2020) at a mid-tier Chinese university, linked to individual-level mobile phone usage records from a major telecommunications carrier covering 6,430 students over four years (excluding COVID semester). High-frequency GPS data, hourly app usage records for the 2020 cohort, and two waves of university surveys supplement the main dataset.&lt;/p&gt;
&lt;p&gt;The identification strategy addresses three challenges: endogeneity of own app usage, endogeneity of peer group formation, and the reflection problem in peer effects. For own usage, two instrumental variables are used: (1) a shift-share instrument interacting the September 2020 launch of the blockbuster game Yuanshen with students&amp;rsquo; pre-college app usage intensity; and (2) China&amp;rsquo;s October 2019 minors&amp;rsquo; game restriction policy (prohibiting under-18s from playing online games 10 p.m.–8 a.m. and capping weekday gaming at 90 minutes/day) interacted with the evolving number of underage pre-college friends. For peer effects, the university&amp;rsquo;s random dormitory assignment within gender-class units provides exogenous peer variation; behavioral peer effects are further isolated using the minors&amp;rsquo; restriction policy interacted with roommates&amp;rsquo; pre-college underage friend networks, an instrument that affects roommates but not the focal student. Contextual peer effects are recovered by subtracting the estimated behavioral component from reduced-form estimates.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, app usage is contagious: a one standard deviation (s.d.) increase in roommates&amp;rsquo; in-college total app usage raises a student&amp;rsquo;s own usage by 5.8% (IV). Behavioral peer effects dominate: contextual peer effects are small and statistically insignificant. Second, own app usage severely harms academic performance: a one s.d. increase in total app usage reduces GPA for required courses by 36.2% of a within-cohort-major s.d. (IV), and a one s.d. increase in game app usage alone reduces GPA by 56.6% of a within-cohort-major s.d. The direct disruption effect of roommates&amp;rsquo; app usage reduces GPA by a further 20.6% of a within-cohort-major s.d.; combining the indirect channel (behavioral contagion), the total roommate effect reaches 22.7% of a within-cohort-major s.d., more than 60% of the own-usage effect. Third, the effect on physical education scores is roughly four times larger than on required-course GPA: a one s.d. increase in own app usage reduces PE scores by 2.74 points, while roommates&amp;rsquo; app usage has no direct effect on PE. Fourth, a one s.d. increase in own in-college app usage reduces initial wages upon graduation by 2.3% (12.1% of within-cohort-major wage s.d.); a one s.d. increase in roommates&amp;rsquo; usage reduces wages by 0.9% directly, with a total effect (including the contagion channel) of approximately 1.0% (5.3% of within-cohort-major s.d.). Controlling for cumulative GPA reduces the gaming-to-wage coefficient by roughly one-third, indicating that academic performance is an important but partial mediator.&lt;/p&gt;
&lt;p&gt;A back-of-the-envelope policy simulation extending the minors&amp;rsquo; gaming cap (3 hours/week) to college students — binding for 34.3% of student-month observations — projects an average wage increase of 0.9% at graduation, approximately half the wage premium from one additional year of work experience in developing countries.&lt;/p&gt;
&lt;p&gt;Mechanism evidence from GPS data shows that Yuanshen&amp;rsquo;s launch caused students to arrive at study halls 18.2 minutes later and leave 23.4 minutes earlier per day. High-frequency sleep data show that a one s.d. increase in nighttime app usage reduces sleep duration by approximately 30 minutes and raises the probability of sleeping late by 34 percentage points. Survey evidence indicates that heavy app users recognize the addictive nature of gaming, pointing to self-control problems rather than lack of awareness.&lt;/p&gt;
&lt;p&gt;The scope conditions are: single mid-tier Chinese university; 2018–2020 cohorts; outcomes through initial job placement only; peer group restricted to dormitory roommates; findings rely on IV exclusion restrictions conditional on student and time fixed effects.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question?
A: The paper asks how individual and peer mobile app usage affect college students&amp;rsquo; academic performance, physical health, and early labor market outcomes, and it separately identifies the behavioral (endogenous) versus contextual (exogenous) components of peer influence in app usage. This is claimed as the first study to disentangle these two types of peer effects within a unified empirical framework.&lt;/p&gt;
&lt;p&gt;Q: What data does the paper use?
A: Administrative records for 7,479 undergraduates across three freshman cohorts (2018–2020) at a medium-sized mid-tier Chinese university are linked to monthly mobile app usage records from a telecommunications provider covering 75% of the provincial population; 6,430 students are matched. The dataset also includes GPS location data at 5-minute intervals, hourly app usage for the 2020 cohort (used to infer sleep), and two waves of voluntary annual surveys with 1,798 respondents (24% response rate). Labor market outcomes — employment status, wages, post-graduate admissions — are available for the 2018 and 2019 cohorts.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address the endogeneity of own app usage?
A: Two sets of instruments are used. The first interacts the September 2020 launch of Yuanshen (the most popular game in China, with over 13 million Chinese users by 2021, the majority under age 25) with students&amp;rsquo; pre-college app usage, forming a shift-share instrument under the assumption that the game launch is orthogonal to unobserved GPA determinants conditional on student fixed effects. The second interacts China&amp;rsquo;s October 2019 minors&amp;rsquo; game restriction policy with the evolving count of a student&amp;rsquo;s underage pre-college friends; event studies confirm no pre-trends and a sharp, transitory drop in app usage post-policy that dissipates as friends age out of the restricted group.&lt;/p&gt;
&lt;p&gt;Q: How does the paper solve the reflection problem and separate behavioral from contextual peer effects?
A: Three-step procedure: (1) random dormitory assignment within gender-class units yields reduced-form peer effect estimates using roommates&amp;rsquo; pre-college app usage as the exogenous peer shifter; (2) behavioral peer effects are isolated via an IV using the minors&amp;rsquo; restriction policy interacted with roommates&amp;rsquo; (not the focal student&amp;rsquo;s) underage pre-college friend networks — an instrument that shifts roommates&amp;rsquo; app usage but is orthogonal to the focal student&amp;rsquo;s outcomes; (3) contextual peer effects are recovered as the residual from subtracting the estimated behavioral effect from the reduced-form estimate.&lt;/p&gt;
&lt;p&gt;Q: How large and significant are the behavioral versus contextual peer effects in app usage?
A: A one s.d. increase in roommates&amp;rsquo; in-college total app usage raises own usage by 5.8% (IV estimate, significant). For game apps alone the behavioral spillover is 10.7%, and for games plus video it is 6.5%. Contextual peer effects (identified from roommates&amp;rsquo; pre-college characteristics) are much smaller and statistically insignificant, indicating that peer influence operates primarily through the direct imitation of peers&amp;rsquo; actions rather than their background traits.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of own app usage on GPA?
A: The IV estimate shows a one s.d. increase in total in-college app usage reduces GPA for required courses by 0.716 points, equivalent to 36.2% of a within-cohort-major GPA s.d. (significant at 1%). For game apps alone, a one s.d. increase reduces GPA by 1.119 points, or 56.6% of a within-cohort-major s.d. OLS estimates are biased toward zero, likely because negative health shocks reduce both GPA and app usage simultaneously.&lt;/p&gt;
&lt;p&gt;Q: How large is the total peer effect of roommates&amp;rsquo; app usage on a student&amp;rsquo;s GPA?
A: Roommates&amp;rsquo; app usage directly lowers GPA by 0.408 points (20.6% of within-cohort-major s.d.) through disruption of the dormitory study environment or crowding out of group study. The behavioral contagion channel (5.8% increase in own usage per s.d. of roommates&amp;rsquo; usage) adds an additional 0.042 points, bringing the total effect to approximately 0.450 points, or 22.7% of a within-cohort-major s.d. — over 60% of the own-usage effect.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on physical education (PE) scores, and why do roommates&amp;rsquo; app usage not matter there?
A: A one s.d. increase in own total app usage reduces PE scores by 2.74 points (IV), approximately four times the magnitude of the effect on required-course GPA, consistent with health literature on excessive screen time. Roommates&amp;rsquo; app usage has no statistically significant direct effect on PE, which the authors attribute to the irrelevance of dormitory noise and study disruptions for outdoor physical activity.&lt;/p&gt;
&lt;p&gt;Q: What are the effects of app usage on wages at graduation?
A: Doubling total app usage during college reduces initial wages by approximately 2% (IV). A one s.d. increase in own usage reduces wages by 2.3%, or 12.1% of a within-cohort-major wage s.d. A one s.d. increase in roommates&amp;rsquo; usage directly reduces wages by 0.9% (4.8% of within-cohort-major s.d.); including the behavioral contagion channel, the total roommate effect is approximately 1.0% (5.3% of within-cohort-major s.d.). Controlling for cumulative GPA reduces the game-usage-to-wage coefficient by about one-third, implying GPA is a partial but not complete mediator.&lt;/p&gt;
&lt;p&gt;Q: What does the policy simulation of the gaming cap say?
A: Extending the minors&amp;rsquo; game restriction (3 hours/week cap) to college students would bind for 34.3% of student-month observations, reducing average monthly gaming from 12.1 hours to 8 hours (a one-third decrease). Incorporating the behavioral peer multiplier for gaming (0.078), average gaming further converges to approximately 7.65 hours in steady state. The implied wage gain at graduation is 0.9%, approximately half the wage premium from one additional year of work experience in developing countries (Lagakos et al., 2019 estimate).&lt;/p&gt;
&lt;p&gt;Q: What does the GPS evidence show about time allocation?
A: Following Yuanshen&amp;rsquo;s launch, the average student arrives at the study hall 18.2 minutes later and returns to the dormitory 23.4 minutes earlier per day. The minors&amp;rsquo; restriction reverses this: students with the average number of minor friends arrive at study halls 17.4 minutes earlier and return to the dorm 19.8 minutes later. Both game shocks also shift tardiness and absence rates for major-required courses in the expected directions, and the effects intensify over time with Yuanshen&amp;rsquo;s growing popularity.&lt;/p&gt;
&lt;p&gt;Q: What do the sleep data show?
A: A one s.d. increase in nighttime app usage (9 p.m.–3 a.m.) is associated with roughly 30 minutes less sleep (7% of the mean), a 34 percentage point higher probability of sleeping late, and a 4.5 percentage point higher probability of waking up late. Daytime app usage (8 a.m.–9 p.m.) is also associated with 7.2 fewer minutes of sleep (1.8% of mean) and a 3.7 percentage point higher probability of late wake-up. These results are descriptive (from the 2020 cohort hourly data) rather than IV-based.&lt;/p&gt;
&lt;p&gt;Q: What does the survey evidence show about mechanisms and self-awareness?
A: Heavier app users report worse physical health and higher stress, are less likely to have obtained professional certifications by graduation, submit fewer job applications, and express lower satisfaction with job offers. Notably, heavier users are more likely to acknowledge the addictive nature of apps and games, suggesting a self-control problem rather than informational deficiency. They also report better relationships with roommates and greater likelihood of following roommates&amp;rsquo; advice on post-graduation choices, a potential direct channel for peer labor market effects.&lt;/p&gt;
&lt;p&gt;Q: How representative is the sample, and what are the key scope conditions?
A: The university is a mid-tier institution in southern China with students predominantly from the 30th–80th CEE score percentile among provincial college-admitted applicants; it is less female (42% vs. 53% nationally) and more rural (40% vs. 27% nationally). Survey respondents oversample less advantaged backgrounds and are re-weighted. Findings pertain to dormitory roommates as the peer group; all labor market outcomes are initial wages upon graduation; the sample covers 2018–2021 with COVID semester excluded. The peer effects estimates rest on random dormitory assignment, which the authors verify by showing no within-dorm correlation in pre-college characteristics.&lt;/p&gt;
&lt;p&gt;Behavioral (endogenous) peer effects: The mechanism by which a peer&amp;rsquo;s actual behavior — here, contemporaneous app usage — directly influences a focal individual&amp;rsquo;s own behavior. In this paper, identified via IV using the minors&amp;rsquo; game restriction policy interacted with roommates&amp;rsquo; underage pre-college friend networks, which shifts roommates&amp;rsquo; usage but not the focal student&amp;rsquo;s characteristics.&lt;/p&gt;
&lt;p&gt;Contextual (exogenous) peer effects: The influence of peers&amp;rsquo; pre-determined background characteristics (e.g., pre-college app usage, reflecting motivation, study habits, attitudes toward academics) on a focal individual&amp;rsquo;s outcomes, independent of peers&amp;rsquo; actual in-college behavior. Recovered as the residual after subtracting estimated behavioral peer effects from reduced-form estimates; found to be small and insignificant in this setting.&lt;/p&gt;
&lt;p&gt;Shift-share instrument (Yuanshen): A quasi-experimental instrument constructed by interacting the mid-sample launch date of the blockbuster game Yuanshen (September 2020) with students&amp;rsquo; pre-college app usage intensity, under the assumption that pre-college usage predicts differential susceptibility to the shock while the launch itself is orthogonal to the university&amp;rsquo;s academic environment.&lt;/p&gt;
&lt;p&gt;Minors&amp;rsquo; game restriction policy: China&amp;rsquo;s October 2019 policy prohibiting individuals under 18 from playing online games between 10 p.m. and 8 a.m. and capping weekday gaming at 90 minutes per day (tightened to 3 hours/week in September 2021). Used both as an instrument for own app usage (via underage pre-college friends) and as an instrument for roommates&amp;rsquo; usage (via roommates&amp;rsquo; underage friends) to isolate behavioral peer effects.&lt;/p&gt;
&lt;p&gt;Reflection problem: The identification challenge first articulated by Manski (1993) arising because an individual&amp;rsquo;s behavior both affects and is affected by peers simultaneously, making it impossible to separately identify the direction of influence from observational data without exogenous variation in peer behavior.&lt;/p&gt;
&lt;p&gt;Source text origin: The paper&amp;rsquo;s own data provenance category distinguishing whether summaries are based on full working paper text (pdf or oa-html) versus abstract only — a distinction the paper itself does not use but that is relevant to the review pipeline running this analysis.&lt;/p&gt;
&lt;p&gt;Within-cohort-major GPA standard deviation: The unit used to scale all GPA effect sizes, defined as the standard deviation of GPA within students of the same graduation cohort and declared major. This normalization accounts for systematic differences in grading across fields and years, making effect magnitudes comparable across specifications.&lt;/p&gt;</description></item><item><title>Disincentive effects of unemployment insurance benefits</title><link>https://macropaperwarehouse.com/papers/disincentive-effects-of-unemployment-insurance-benefits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/disincentive-effects-of-unemployment-insurance-benefits/</guid><description>&lt;p&gt;This paper isolates the disincentive effects of pandemic unemployment insurance (UI) benefits on employment recovery, separating them from the simultaneously operating stimulative (demand) effects that previous studies conflate. The authors study the largest UI expansion in U.S. history — the CARES Act of March 2020 — which introduced three simultaneous provisions: a $600 weekly income supplement (FPUC) through end of July 2020, a 13-week extension of maximum benefit duration (PEUC), and expanded eligibility to workers previously ineligible for UI (PUA), together raising the median replacement rate to 145% and more than doubling the number of UI recipients.&lt;/p&gt;
&lt;p&gt;The empirical strategy uses high-frequency establishment-level data from Homebase (HB), a scheduling and payroll provider covering approximately 140,000 small U.S. businesses — predominantly restaurants and retailers — matched to Yelp price-tier data and Safegraph foot-traffic and spending data. The final estimation sample is 4,595 businesses within 1,195 local-industry cells, observed at weekly frequency from January 2019 to December 2020.&lt;/p&gt;
&lt;p&gt;The identification rests on comparing employment recovery of low-wage versus high-wage businesses within the same narrow local labor market (four-digit zip code), industry (two-digit NAICS), and price tier. Because neighboring businesses largely share the local demand stimulus from UI, differencing within local-industry cells removes common demand effects. The key variation is the expiration of the $600 supplement, which differentially compresses the replacement-rate gap between low- and high-wage businesses depending on local average wages — labor markets where the gap falls more sharply are the treated group.&lt;/p&gt;
&lt;p&gt;The main empirical finding is that a 100 percentage point decline in the replacement rate gap is associated with a 5.7 percentage point rise in low-wage business employment recovery relative to high-wage business employment recovery at 12 weeks after the $600 expiration. For the average labor market, the expiration of the $600 supplement decreased the replacement rate gap by 46 percentage points, implying a 2.6 percentage point closing of the low-versus-high-wage employment gap within 12 weeks. Importantly, hours per employee and hourly wages grew faster in low-wage businesses over the same period, consistent with a labor supply rather than a demand mechanism. When the comparison is conducted at the U.S. state level rather than within local-industry cells — as in Finamor and Scott (2021) — the effect disappears and reverses sign, illustrating how local demand effects obscure disincentive effects at broader geographic aggregations.&lt;/p&gt;
&lt;p&gt;To quantify the aggregate employment impact, the authors build and calibrate a McCall-style labor search model with heterogeneous firm wages, a UI-eligible and non-UI unemployed pool, and equilibrium reservation wages. The model is extended to include a probability (calibrated at 16.5%) that workers lose UI eligibility upon refusing a job offer, which reconciles the model with the empirical estimates; without this feature the baseline model substantially overstates the differential employment effect of the $600 expiration.&lt;/p&gt;
&lt;p&gt;The full model-implied aggregate employment loss from all CARES Act UI provisions combined is 3.4 percentage points on average between April and December 2020, representing approximately 20% of the average employment shortfall in the Leisure and Hospitality sector over that period. When each provision is implemented in isolation, the effects are modest ($600 supplement: 0.2 pp; extended duration: 0.2 pp; expanded eligibility: 1.0 pp), but their interaction generates the large combined effect. Expanded eligibility is identified as the most disruptive provision, particularly for low-wage businesses, because it depletes the pool of non-UI unemployed who are the primary source of hires for these firms. The unemployment duration elasticities implied by the model are modest and in line with the low-to-middle range of pre-pandemic estimates.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s scope is restricted to the disincentive channel and deliberately excludes the stimulative effects of UI; it studies small, in-person service sector businesses and the April–December 2020 recovery period only.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification challenge this paper addresses?
A: Prior empirical studies find only modest net effects of pandemic UI on employment, but it is unclear whether this reflects small disincentive effects or the near-cancellation of two opposing forces — UI suppressing labor supply while simultaneously stimulating local consumer demand. Identifying the disincentive effect alone requires a design that neutralizes the demand channel. The authors accomplish this by comparing low-wage and high-wage businesses within the same narrow local market, industry, and price tier, so that common local demand shifts from UI are differenced out.&lt;/p&gt;
&lt;p&gt;Q: What data does the empirical analysis use, and how is the sample constructed?
A: The primary data source is Homebase, covering approximately 140,000 small U.S. businesses with daily employment, hourly wages, and hours worked. The estimation sample is restricted to 4,595 businesses present throughout 2019, matched to Yelp price-tier classification and Safegraph weekly foot traffic and credit-card spending. Businesses are grouped into 1,195 local-industry cells defined by four-digit zip code, two-digit NAICS industry, and Yelp price tier (inexpensive vs. expensive). Within each cell, businesses are classified as low-wage or high-wage, with high-wage businesses paying on average $1.80 per hour more — about 8% above the average hourly wage of $10.90.&lt;/p&gt;
&lt;p&gt;Q: How is the replacement rate defined in the empirical framework?
A: The business-specific replacement rate is the ratio of average UI receipts (state benefit plus the pandemic supplement, converted to hourly units) to the pre-pandemic average hourly wage of that business. Because the supplement is uniform across workers, businesses with lower pre-pandemic wages face higher replacement rates; the replacement rate gap between low- and high-wage businesses within a local market is therefore a function of both state benefit levels and the local wage dispersion.&lt;/p&gt;
&lt;p&gt;Q: What does the event-study analysis around the $600 expiration show?
A: The event study exploits cross-labor-market variation in how much the replacement rate gap between low- and high-wage businesses declined when the $600 FPUC supplement expired at end of July 2020. Labor markets with a larger decline in the gap see faster relative recovery in low-wage business employment after expiration. A 100 percentage point decline in the replacement rate gap is associated with a 5.7 percentage point rise in the low-versus-high-wage employment recovery gap at 12 weeks post-expiration. For the average labor market, the $600 expiration reduced the replacement rate gap by 46 percentage points, implying a 2.6 percentage point narrowing of the employment recovery gap.&lt;/p&gt;
&lt;p&gt;Q: Why does the estimated effect disappear when broader geographic aggregations are used?
A: When businesses are compared within U.S. state borders rather than within local-industry cells, the estimated coefficient on the replacement rate gap turns positive and statistically insignificant. This occurs because at the state level, low-wage areas benefit disproportionately from the purchasing power increase that generous UI provides to local unemployed workers, so demand effects swamp and reverse the supply-side disincentive. This finding explains why Finamor and Scott (2021), using Homebase data with state fixed effects, find no negative association between replacement rates and labor market re-entry.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports a labor supply rather than demand interpretation of the differential recovery?
A: During the period of the $600 supplement, hours per employee and hourly wages grew faster in low-wage businesses than in high-wage businesses, even as low-wage businesses lagged in employment levels. If the differential recovery reflected demand deficiencies at low-wage businesses, hours per employee and wages should have grown faster at high-wage businesses instead. The observed pattern is consistent with labor supply shortfalls at low-wage firms.&lt;/p&gt;
&lt;p&gt;Q: What is the structure of the quantitative labor search model?
A: The model features a unit measure of workers and a fixed measure of firms, each posting a constant idiosyncratic wage drawn from an exogenous distribution. Unemployed workers receive job offers at a rate determined by labor market tightness and accept offers above their reservation wage. Reservation wages are equilibrium objects because UI benefits depend on the worker&amp;rsquo;s previous wage. The unemployed are split into UI-eligible and non-UI pools; the non-UI pool accepts jobs from lower in the wage distribution and is the primary supply source for low-wage firms. The model is calibrated to pre-pandemic U.S. service sector averages, with a pre-pandemic UI replacement rate of 0.51, a UI recipiency probability of 14%, and a non-UI replacement rate of 0.15.&lt;/p&gt;
&lt;p&gt;Q: Why does the baseline model overstate the empirical effect, and how is this reconciled?
A: The baseline model dramatically overstates the differential employment impact of the $600 expiration because the CARES Act&amp;rsquo;s expanded eligibility (modeled as a rise in the recipiency probability from 14% to 70%) nearly empties the non-UI unemployed pool, which is the dominant labor supply source for low-wage firms. In the data, the share of unemployed receiving UI nearly tripled for in-person leisure and hospitality workers, but not to the degree that the model&amp;rsquo;s implied employment collapse would require. The model is reconciled by introducing a 16.5% probability that a worker loses UI eligibility upon refusing a suitable job offer — consistent with UI law — which reduces the effective outside option and raises acceptance rates for low-wage firms.&lt;/p&gt;
&lt;p&gt;Q: What are the aggregate employment losses implied by the model?
A: When all three CARES Act provisions are implemented jointly, the model estimates that the disincentive effects held back aggregate employment recovery by 3.4 percentage points on average between April and December 2020 — approximately 20% of the average employment shortfall in the Leisure and Hospitality sector. Implemented in isolation, each provision generates only modest losses: the $600 supplement alone accounts for 0.2 percentage points, extended duration for 0.2 percentage points, and expanded eligibility for 1.0 percentage points. The large combined effect arises from the interaction of all three provisions, not from any single one.&lt;/p&gt;
&lt;p&gt;Q: What are the conditional (interaction) effects of each provision when the other two are in place?
A: Conditional on the other two provisions being active, the income supplement holds back employment recovery by 1.6 percentage points, the extended duration by 1.5 percentage points, and expanded eligibility by 2.9 percentage points. This interaction effect is the central quantitative finding: individually modest provisions combine to produce effects far exceeding their sum when implemented simultaneously.&lt;/p&gt;
&lt;p&gt;Q: What are the implied unemployment duration elasticities, and how do they compare to the literature?
A: The $600 supplement alone raises average unemployment duration by 8% against a 343% rise in the replacement rate, implying an elasticity of 0.02. Extended duration alone raises unemployment duration by 6% against a 150% increase in potential benefit duration, implying an elasticity of 0.03. Expanded eligibility alone raises unemployment duration by 19%, implying an elasticity of 0.04. When each provision is activated on top of the other two, the implied elasticities rise substantially: 0.24 for the $600 supplement, 0.43 for extended duration, and 0.28 for expanded eligibility. These are in the low-to-middle range of pre-pandemic estimates (Katz and Meyer, 1990: 0.3–0.5; Johnston and Mas, 2018: 0.4–0.8; Rothstein, 2011: 0.06; Farber and Valletta, 2015: 0.15).&lt;/p&gt;
&lt;p&gt;Q: What is the role of expanded eligibility specifically?
A: Expanded eligibility is identified as the most disruptive CARES Act provision, accounting for 1.0 percentage points of employment loss alone and 2.9 percentage points conditional on the other provisions. Mechanically, expanded eligibility converts non-UI unemployed workers into UI-eligible workers, draining the pool of workers willing to accept low-wage job offers. Because low-wage firms depend disproportionately on the non-UI pool for hiring, this provision disproportionately depresses their employment. Using CPS data, the authors document that the share of unemployed workers receiving UI in the in-person leisure and hospitality sector nearly tripled in 2020 relative to the pre-pandemic period.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions and limitations of the analysis?
A: The empirical analysis is restricted to small, in-person service sector businesses (restaurants and retailers) in the Homebase sample, which may not be representative of the broader labor market. The quantitative model is explicitly focused on disincentive effects only and does not capture the stimulative or demand effects of UI. The model also abstracts from re-opening restrictions and other pandemic-specific confounders. The analysis covers April to December 2020; the 2021 pandemic UI extensions are not studied. The job-refusal probability (chi = 16.5%) is a reduced-form calibration target rather than a structurally identified parameter.&lt;/p&gt;
&lt;p&gt;Replacement rate gap: The difference in business-specific UI replacement rates between low-wage and high-wage businesses within the same local labor market; defined as UI benefits (state benefit plus supplement) divided by the business&amp;rsquo;s pre-pandemic average hourly wage. Larger gaps indicate greater relative disincentive for workers to accept jobs at low-wage firms.&lt;/p&gt;
&lt;p&gt;Disincentive effect: The negative impact of higher UI replacement rates on workers&amp;rsquo; willingness to accept job offers and thus on business employment recovery, isolated from the simultaneous stimulative demand effect of UI spending.&lt;/p&gt;
&lt;p&gt;Non-UI unemployed pool: Workers who are ineligible for or have exhausted UI benefits and therefore receive only social benefits at a lower replacement rate (calibrated at 0.15 in the model). This group has a lower reservation wage and constitutes the primary labor supply source for low-wage firms.&lt;/p&gt;
&lt;p&gt;Local-industry cell: The paper&amp;rsquo;s unit of comparison — businesses sharing the same four-digit zip code (covering on average four neighboring zip codes), two-digit NAICS industry, and Yelp price tier. Within-cell differencing is the mechanism that removes common local demand effects.&lt;/p&gt;
&lt;p&gt;Benefit recipiency probability: The probability that a newly separated worker enters the UI-eligible unemployed pool, combining UI eligibility and takeup. Pre-pandemic this is calibrated at 14%; under the CARES Act it rises to 70%, targeting the observed near-tripling of UI recipients in the CPS data.&lt;/p&gt;
&lt;p&gt;Job-refusal eligibility loss: A probability (calibrated at 16.5%) that a UI-eligible worker who rejects a job offer loses UI status and transitions to the non-UI pool. Motivated by UI law prohibiting refusal of suitable work; reduces the effective outside option and reconciles the model&amp;rsquo;s predicted employment gap with the empirical estimate.&lt;/p&gt;
&lt;p&gt;Equilibrium residual wage dispersion: The wage dispersion observed in equilibrium conditional on worker observables. The model generates realistic dispersion by calibrating the non-UI replacement rate to match the lower half of the wage distribution and the firm wage offer variance to match the upper half; the presence of the non-UI state substantially increases residual dispersion relative to standard search models.&lt;/p&gt;</description></item><item><title>Distorted prices and targeted taxes in the New Keynesian Network model</title><link>https://macropaperwarehouse.com/papers/distorted-prices-and-targeted-taxes-in-the-new-keynesian-network-model/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/distorted-prices-and-targeted-taxes-in-the-new-keynesian-network-model/</guid><description>&lt;p&gt;This paper asks how governments should optimally adjust sector-specific taxes in response to sectoral shocks when monetary policy cannot be tailored to individual sectors. The authors work within a variant of Rubbo&amp;rsquo;s (2023) New Keynesian Network (NKN) model, augmented to include time-varying sectoral sales taxes and production subsidies. The model features N sectors connected through input-output linkages, with Calvo-type price rigidity that is heterogeneous across sectors, and encompasses both sectoral productivity (supply) shocks and demand shocks.&lt;/p&gt;
&lt;p&gt;The central finding, stated as Proposition 1, is that the first-best tax policy requires exactly 2N instruments—one sales tax and one production subsidy per sector—not just instruments in the shocked sector. The mechanism turns on a twofold distortion created by sticky prices. Because only a fraction of firms adjust prices at any time, relative prices are distorted both within sectors (price dispersion among firms) and across sectors (misalignment of relative prices). The production subsidy offsets the effect of shocks on marginal costs, incentivizing price-adjusting firms to leave seller prices unchanged and thereby eliminating within-sector dispersion. The sales tax—which applies to both household purchases and intermediate goods trade—steers demand across sectors so that market prices move as if fully flexible, closing sectoral output gaps even as seller prices remain constant. The optimal sales tax moves exactly one-for-one with the vector of natural prices. Crucially, budget neutrality holds to first order: the sales tax revenues fund the production subsidies.&lt;/p&gt;
&lt;p&gt;The strength of each instrument&amp;rsquo;s response depends on network proximity rather than price rigidity. For supply shocks, adjustment propagates downstream (governed by the Leontief inverse), so sectors that intensively use inputs from the shocked sector require larger responses. For demand shocks, adjustment propagates upstream first and then back downstream, so upstream suppliers to the shocked sector face the largest responses.&lt;/p&gt;
&lt;p&gt;Because the first-best policy requires observing sectoral shocks directly, the authors propose a simple 2N rule (Proposition 2) that responds only to observable sectoral seller-price inflation, with rule strength parameter ϕ_i per sector. As ϕ_i → ∞ the simple rule converges to the first-best. Crucially, the rule can be implemented by observing inflation only in the shocked sector and adjusting taxes and subsidies in other sectors proportionally to their input-output distance from that sector.&lt;/p&gt;
&lt;p&gt;The quantitative assessment calibrates the model to the U.S. economy using BEA 2017 input-output accounts with N = 373 sectors at the 6-digit classification. Sectoral price flexibility is drawn from Antonova (2025), ranging from 0.052 to 0.989 with a median of 0.277 (implying a median price duration of roughly 4.3 months). Shocks follow AR(1) processes with persistence ρ = 0.97. Supply shocks hit 10 energy-related sectors (roughly 10% of total sales); demand shocks hit 22 service-related sectors (roughly 7% of total sales). The key quantitative finding is that the simple 2N policy—both subsidy and tax together—delivers substantially greater welfare improvement than a subsidy-only policy (N instruments), particularly for supply shocks. When the subsidy is not accompanied by the corresponding sales tax, welfare gains are much smaller.&lt;/p&gt;
&lt;p&gt;The paper extends to an open economy with import-price shocks that act simultaneously as supply and demand shocks. Applied to the 2022 Ukraine war energy crisis: a 24% world-energy-price increase (IMF Global Energy Price index, 2022M1–2022M4) is used, with high-dependence Europe (energy import share γ_EU = 0.63, substitution elasticity η_EU = 1) contrasted against low-dependence U.S. (γ_US = 0.17, η_US = 4). In Europe, adverse supply effects dominate so the domestic energy sector contracts; in the U.S., demand substitution effects dominate so domestic energy expands. Simple 2N rules correlate 0.89 with the optimal policy across sectors for Europe and 0.94 for the U.S. A notable normative implication: the optimal policy raises sales taxes on energy to discourage consumption, in contrast to the actual European policy of subsidizing energy consumption during the 2022 crisis.&lt;/p&gt;
&lt;p&gt;Q: Why can monetary policy not achieve the first-best allocation in the NKN model?&lt;/p&gt;
&lt;p&gt;A: Monetary policy sets a single nominal interest rate that applies uniformly across all sectors, but sectoral shocks generate heterogeneous natural rates. Even if monetary policy stabilizes aggregate output, it cannot simultaneously close all sectoral output gaps and eliminate within-sector price dispersion. Rubbo (2023) shows that optimal monetary policy improves welfare but leaves a significant welfare loss remaining.&lt;/p&gt;
&lt;p&gt;Q: What is the core tradeoff in each sector that motivates the 2N result?&lt;/p&gt;
&lt;p&gt;A: With Calvo-type staggered pricing, adjusting a sector&amp;rsquo;s relative price to close its output gap creates price dispersion within the sector because not all firms adjust simultaneously; but holding seller prices constant to avoid dispersion leaves output gaps open due to the absence of relative price adjustment. Two instruments—production subsidy and sales tax—are required to address both sides of this distortion simultaneously, in keeping with the Tinbergen principle.&lt;/p&gt;
&lt;p&gt;Q: How exactly do the production subsidy and sales tax each work under the optimal policy?&lt;/p&gt;
&lt;p&gt;A: The production subsidy is paid to producers and affects the optimal seller price for a given marginal cost, incentivizing firms that can adjust prices to leave them unchanged. The sales tax is levied on buyers (households and downstream firms) and, because it is applied to both household consumption and intermediate goods trade, it steers demand across sectors to replicate the efficient allocation of expenditure. Under the optimal policy, seller prices are fully stabilized (ps_t = 0) while buyer (market) prices move as pt = τs_t = pn_t, mimicking flexible-price outcomes.&lt;/p&gt;
&lt;p&gt;Q: What determines which sectors receive larger optimal tax and subsidy responses?&lt;/p&gt;
&lt;p&gt;A: For supply (productivity) shocks, responses are governed by the matrix L̄ = XL, where L is the Leontief inverse measuring downstream proximity; sectors that are more intensive downstream users of the shocked sector require larger responses. For demand shocks, the relevant matrix measures upstream proximity, so sectors that supply inputs to the shocked sector face stronger responses. Critically, the level of the policy response is independent of sector-specific price rigidity; only the network structure matters.&lt;/p&gt;
&lt;p&gt;Q: Is the optimal 2N policy budget-neutral, and why only approximately?&lt;/p&gt;
&lt;p&gt;A: Budget neutrality holds to first order around the zero-profit steady state. The production subsidy applies to costs while the sales tax applies to sales; at the steady state these coincide, so the subsidy is exactly funded by the tax revenue. The approximation breaks down away from the zero-profit steady state because costs and sales diverge.&lt;/p&gt;
&lt;p&gt;Q: What is the simple 2N rule and how does it relate to the first-best?&lt;/p&gt;
&lt;p&gt;A: The simple rule sets sp_t = Iϕ · πs_t and τs_t = sp_t, where Iϕ = diag{ϕ_i} is a diagonal matrix of response coefficients for each sector&amp;rsquo;s seller-price inflation. As ϕ_i → ∞ for all i, the allocation converges to first-best; larger ϕ_i produces a stronger commitment to stabilize sectoral inflation, resulting in muted inflation rather than large tax and subsidy levels. In practice, the rule can be implemented by observing inflation only in the shocked sector and scaling responses in other sectors by their input-output distance from that sector.&lt;/p&gt;
&lt;p&gt;Q: What does the three-sector example (Energy, Manufacturing, Services) illustrate about supply vs. demand shocks?&lt;/p&gt;
&lt;p&gt;A: Under an adverse energy productivity shock, the optimal policy subsidizes Energy and Manufacturing (proportional to energy use in manufacturing) but not Services, since Services are not energy-intensive and thus not closely connected downstream. Under a positive manufacturing demand shock, the optimal policy subsidizes both Manufacturing and upstream Energy equally, reflecting that demand shocks propagate upstream first.&lt;/p&gt;
&lt;p&gt;Q: What does the calibrated quantitative exercise show about the welfare gains from using both instruments versus one?&lt;/p&gt;
&lt;p&gt;A: For both supply and demand shock scenarios, the simple 2N policy (subsidy plus tax) delivers substantially greater welfare improvement than using only monetary policy. When the subsidy is not accompanied by the corresponding sales tax, welfare gains are much smaller, confirming that both instruments together—not subsidies alone—are essential. This is identified as a key quantitative finding of the paper.&lt;/p&gt;
&lt;p&gt;Q: How robust are results to decreasing returns to scale in production?&lt;/p&gt;
&lt;p&gt;A: Under decreasing returns to scale, the optimal policy response is highly similar to the baseline: correlations between the two are 0.98 for supply shocks and 0.99 for demand shocks across sectors. The simple 2N rule continues to deliver significant welfare improvements. One difference is that demand shocks generate relatively higher welfare losses under decreasing returns, while productivity shocks lead to lower losses.&lt;/p&gt;
&lt;p&gt;Q: How does the open-economy extension change the analysis for import-price shocks?&lt;/p&gt;
&lt;p&gt;A: Import-price shocks enter the model as both supply shocks (raising input costs) and demand shocks (shifting expenditures toward domestic substitutes), so they require a policy response that accounts for both propagation channels simultaneously. The optimal open-economy policy is formally isomorphic to the closed-economy counterpart but with redefined upstream and downstream matrices and shock vectors. The relative importance of the supply versus demand channel depends on the economy&amp;rsquo;s import dependence and substitution elasticity.&lt;/p&gt;
&lt;p&gt;Q: How does the 2022 energy crisis illustrate the difference between the optimal policy and actual European policy?&lt;/p&gt;
&lt;p&gt;A: Using a 24% world-energy-price increase (IMF Global Energy Price index, 2022M1–2022M4), the model implies that with high European energy dependence (γ_EU = 0.63, η_EU = 1), adverse supply effects dominate and the optimal policy raises sales taxes on energy to discourage consumption and subsidizes domestic energy users proportional to downstream proximity. Actual European policy partly subsidized energy consumption, which the model identifies as welfare-reducing relative to the optimal response. For the low-dependence U.S. (γ_US = 0.17, η_US = 4), demand substitution toward domestic energy dominates, requiring additional subsidies to domestic energy producers.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the Diamond-Mirrlees result on intermediate good taxation?&lt;/p&gt;
&lt;p&gt;A: Diamond-Mirrlees (1971) recommends against taxing intermediate goods in an otherwise efficient economy to avoid introducing additional distortions. This paper considers an economy already subject to pricing frictions (Calvo staggered pricing), and shows that taxing intermediate goods through the sales tax—which applies to intermediate goods trade—is part of the optimal policy precisely because it corrects the pre-existing distortions. The paper thus does not contradict Diamond-Mirrlees but operates in a different setting where frictions are already present.&lt;/p&gt;
&lt;p&gt;New Keynesian Network (NKN) model: A multi-sector general equilibrium framework with N sectors connected through input-output linkages, Calvo-type staggered price setting that is heterogeneous across sectors, and monopolistically competitive firms; provides the canonical system of sectoral IS curves and Phillips curves used in this paper.&lt;/p&gt;
&lt;p&gt;2N policy: The paper&amp;rsquo;s central result that the first-best tax policy requires exactly two instruments per sector—one production subsidy and one sales tax—for a total of 2N instruments; characterized in Proposition 1 and named for this instrument count.&lt;/p&gt;
&lt;p&gt;Production subsidy (sp_t,i): A sector-specific transfer paid to producers that affects the optimal seller price for a given marginal cost; under the optimal policy it offsets the effect of shocks on marginal costs, incentivizing price-adjusting firms to leave seller prices unchanged and thereby eliminating within-sector price dispersion.&lt;/p&gt;
&lt;p&gt;Sales tax (τs_t,i): A sector-specific tax levied on buyers—both households and downstream firms purchasing intermediate goods—such that the buyer (market) price equals (1 + τs_t,i) times the seller price; under the optimal policy it replicates the efficient allocation of expenditure across sectors even when seller prices are fully stabilized.&lt;/p&gt;
&lt;p&gt;Downstream proximity (Leontief inverse L̄ = XL): A measure of the total direct and indirect use of a sector&amp;rsquo;s output by other sectors, governing the propagation and optimal policy response to supply (productivity) shocks; the ij-th element of L̄ captures how strongly a shock in sector j affects policy in sector i through downstream input-output linkages.&lt;/p&gt;
&lt;p&gt;Upstream proximity: A measure of how closely a sector supplies inputs to another sector, governing the propagation of demand shocks; demand shocks propagate first upstream (to input suppliers) before feeding back downstream.&lt;/p&gt;
&lt;p&gt;Budget neutrality: The property that the optimal 2N policy is self-financing to first order—sales tax revenues exactly fund the production subsidies around the zero-profit steady state—so the fiscal intervention does not require net government expenditure.&lt;/p&gt;
&lt;p&gt;Simple 2N rule: A practically implementable approximation to the first-best policy that sets subsidies and taxes proportional to observed sectoral seller-price inflation with response coefficients ϕ_i; converges to the first-best as ϕ_i → ∞ and can be implemented using only the inflation rate of the shocked sector plus network-distance weights from the input-output table.&lt;/p&gt;</description></item><item><title>Distributional Growth Accounting: Education and the Reduction of Global Poverty, 1980–2019</title><link>https://macropaperwarehouse.com/papers/distributional-growth-accounting-education-and-the-reduction-of-global-poverty-19802019/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/distributional-growth-accounting-education-and-the-reduction-of-global-poverty-19802019/</guid><description>&lt;h2 id="layer-1--core-argument"&gt;Layer 1 — Core Argument&lt;/h2&gt;
&lt;p&gt;This paper constructs the first estimates of the aggregate and distributional effects of worldwide educational expansion since 1980 by developing a &amp;ldquo;distributional growth accounting&amp;rdquo; framework that isolates the contribution of schooling to economic growth by income group. The framework integrates the canonical labor supply-and-demand model of education and the wage structure (à la Goldin and Katz 2007) with standard growth accounting tools, applied to a new microdatabase covering household surveys in 150 countries and representative of approximately 95% of the world&amp;rsquo;s population, alongside new country-specific estimates of private returns to primary, secondary, and tertiary schooling. Under conservative assumptions — relying on standard Mincerian returns, assuming capital income is unaffected by schooling, and abstracting from human capital externalities — education can account for approximately 50% of global economic growth, 70% of income gains among the world&amp;rsquo;s poorest 20% of individuals, and 40% of extreme poverty reduction since 1980; it also explains over 50% of improvements in the share of labor income accruing to women. A key mechanism is imperfect substitutability between skill groups: as educational expansion raises the supply of skilled workers, their relative wage falls, redistributing income toward low-skilled workers and amplifying education&amp;rsquo;s equalizing effect at the bottom of the distribution — a channel that canonical cross-country growth accounting misses, causing it to underestimate education&amp;rsquo;s contribution to poverty reduction by a factor of approximately three. Combining these indirect investment benefits from education with direct government redistribution (from a companion paper) brings the total contribution of public policies to extreme poverty reduction to at least 50%.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What is distributional growth accounting and how does it differ from standard growth accounting?&lt;/strong&gt;
A: Standard growth accounting (as in Barro and Lee 2015) combines cross-country data on average years of schooling with a uniform return to derive a counterfactual average income absent educational progress. Distributional growth accounting instead starts from microdata on the joint distribution of income and education within 150 countries, constructs income-group-specific counterfactuals, and accounts for both direct wage effects on individuals whose education changed and general equilibrium supply effects that alter relative wages across all workers. The standard approach is found to underestimate education&amp;rsquo;s contribution to the poorest 20%&amp;rsquo;s income growth by a factor of roughly three (23% vs. 71% in the benchmark specification), because cross-country averages cannot accurately locate the world&amp;rsquo;s poorest individuals and because two key channels — labor income shares being greater at the bottom, and supply-side wage redistribution — are omitted.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How is the counterfactual world income distribution constructed?&lt;/strong&gt;
A: In five steps applied to the 150-country microdata. First, education levels are downgraded within each survey until matching the 1980 distribution of educational attainment (using the Barro–Lee database), prioritizing individuals closest to the target level. Second, the earnings of downgraded workers are reduced using the &amp;ldquo;true&amp;rdquo; return to schooling, which lies between the initial return (prevailing before expansion, computed from the CES production function using the 2019 elasticity) and the final return observed in 2019 — for plausible parameterizations, the true return weights initial returns at 50–70%. Third, relative wages are adjusted to reflect supply effects: the increase in skilled-worker supply lowers their relative wage by 1/σ log points per log-point increase in relative supply. Fourth, counterfactual labor income is combined with unchanged capital income to yield counterfactual total income. Fifth, the share of actual income growth attributable to education is computed as the gap between the actual and counterfactual growth rates, expressed as a fraction of actual growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What role does imperfect skill substitution play, and how is σ calibrated?&lt;/strong&gt;
A: Imperfect substitution between skill groups (elasticity σ in a CES production function) is the mechanism through which educational expansion redistributes income. When skilled-worker supply rises, their relative wage falls and low-skilled workers&amp;rsquo; relative wage rises, so the income gains from education are shared more broadly than individual returns alone would suggest. With perfect substitutes (σ → ∞), supply effects vanish and education&amp;rsquo;s distributional impact is determined entirely by who directly received schooling. The elasticity is calibrated from the recent macroeconomics literature; in sensitivity analysis, the paper bounds the contribution of education to the poorest 20%&amp;rsquo;s income growth between 60% and 90% across plausible values of σ and private returns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why are the estimates described as conservative?&lt;/strong&gt;
A: Three reasons, each biasing the estimates downward. First, standard Mincerian returns are used, which are systematically lower than causal estimates from natural experiments — a meta-analysis of 15 papers and the paper&amp;rsquo;s own quasi-experimental validation (India, Indonesia, United States) confirm this; if anything, the framework underestimates schooling&amp;rsquo;s benefits in those settings. Second, capital income is assumed unaffected by schooling, abstracting from potential effects on capital accumulation and returns. Third, human capital externalities — for which there is now substantial empirical evidence — are ignored entirely. These conservative choices are deliberate; relaxing them would increase all headline estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does skill-biased technical change interact with the education contribution?&lt;/strong&gt;
A: In the CES model, the return to schooling is increasing in the skill bias of technology (AH/AL): a higher skill bias raises the marginal product of skilled workers relative to unskilled, making schooling more profitable. The benchmark counterfactual holds technology fixed at its 2019 value and reduces education to its 1980 level. An alternative counterfactual would hold technology at its 1980 value and increase education to its 2019 level; the difference between these two exercises identifies the contribution of skill-biased technical change in amplifying the benefits of schooling. Because 1980 microdata on the world income distribution are unavailable, this decomposition can only be performed for the subsample of 33 countries with surveys around 2000; for that sample, skill-biased technical change accounts for 20–30% of the income benefits of schooling, meaning education would still have yielded large gains even absent technological progress.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the quasi-experimental validations in India, Indonesia, and the United States show?&lt;/strong&gt;
A: Three large-scale schooling policy interventions — a school construction program in India (studied in Khanna 2023), Indonesia&amp;rsquo;s INPRES program (Duflo 2001 and 2004), and US compulsory schooling laws (Acemoglu and Angrist 2000) — are used to externally validate the framework. Using regional variation in exposure to each program and rich microdata on the income distribution, the paper documents two findings: (1) educational expansion had large causal effects on aggregate regional incomes comparable in magnitude to individual returns estimated in the same contexts; and (2) all three policies disproportionately benefited low-income earners, substantially reducing inequality. The distributional growth accounting framework reproduces both findings with &amp;ldquo;a remarkable degree of accuracy,&amp;rdquo; and if anything underestimates the benefits of schooling, providing validation of the methodological foundation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper quantify education&amp;rsquo;s role in gender inequality reduction?&lt;/strong&gt;
A: The framework is extended to gender by constructing a counterfactual for how large gender labor income gaps would be absent educational improvement since the early 1990s (the period for which female labor income share data are available). The counterfactual accounts for three gender-specific channels: differential educational expansion between men and women, heterogeneous returns to schooling by gender, and differential effects of schooling on female labor force participation. Comparing the counterfactual to actual trends in female labor income shares, education can explain 50–80% of the observed reductions in gender inequality, depending on specification and world region.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do public policies as a whole contribute to extreme poverty reduction?&lt;/strong&gt;
A: The paper&amp;rsquo;s estimate of education&amp;rsquo;s indirect investment benefits (40% of extreme poverty reduction) is combined with a companion paper&amp;rsquo;s (Gethin 2023) estimates of direct government redistribution — cash and in-kind transfers together accounting for approximately 30% of global poverty reduction since 1980, with in-kind transfers alone accounting for approximately 20%. Because the two contributions overlap (e.g., public education spending is both an indirect investment benefit and an in-kind transfer), the combined lower bound is reported as &amp;ldquo;at least 50%&amp;rdquo; of extreme poverty reduction attributable to public policies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does the distributional approach yield such different results from the standard approach for the poorest 20%?&lt;/strong&gt;
A: Two main reasons. First, cross-country data cannot accurately measure the incomes of the world&amp;rsquo;s poorest, because the poorest individuals are not all concentrated in the poorest countries — distributional accounting within countries is necessary to locate them precisely. Second, the standard approach misses two progressive channels: (a) labor income shares are higher at the bottom of the income distribution than average, so gains from schooling translate into larger income increases for the poor; and (b) supply effects redistribute schooling gains from high-skilled to low-skilled workers, a mechanism that is entirely absent from cross-country averages but directly captured in the microdata-based counterfactual.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Distributional growth accounting:&lt;/strong&gt; A framework, introduced in this paper, that combines a model of education and the wage structure with household microdata to construct income-group-specific counterfactuals, isolating the contribution of human capital accumulation to growth at each point of the income distribution rather than at the national-average level.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;True return to schooling (r&lt;/em&gt;):&lt;/em&gt;* In the CES framework with imperfect skill substitution, the &amp;ldquo;true&amp;rdquo; aggregate return to schooling used in the counterfactual lies strictly between the initial return (prevailing before educational expansion, counterfactually higher because skilled-worker supply was lower) and the final return (observed after expansion, lower due to skill-supply pressure). The true return is the return that equates the model&amp;rsquo;s predicted output loss to the actual output loss from reducing education; for plausible parameters it weights initial returns at 50–70%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply effects (general equilibrium effects of schooling):&lt;/strong&gt; When the supply of skilled workers rises, their relative wage falls and the relative wage of unskilled workers rises. These wage adjustments are not captured by individual-level Mincerian returns but are modeled via the CES elasticity of substitution σ. Supply effects are central to education&amp;rsquo;s progressive distributional impact: they compress the skill premium and raise earnings at the bottom of the distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect substitution between skill groups:&lt;/strong&gt; The CES production specification in which skilled (H) and unskilled (L) labor are combined with elasticity σ &amp;lt; ∞. This governs the magnitude of general equilibrium wage effects: a lower σ means a larger wage compression per unit of skilled-supply increase, amplifying the redistributive role of education. The paper calibrates σ from the macroeconomics literature and bounds results over plausible ranges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill-biased technical change (SBTC):&lt;/strong&gt; Technology that raises the marginal product of skilled workers relative to unskilled (captured by the ratio AH/AL in the CES production function). SBTC amplifies returns to schooling; in the subsample of 33 countries with around-2000 surveys, SBTC accounts for 20–30% of schooling&amp;rsquo;s income benefits, but education would still have generated substantial income gains absent SBTC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conservative assumptions (scope condition):&lt;/strong&gt; All headline quantitative results (50% of aggregate growth, 70% of poorest-20% income gains, 40% of extreme poverty reduction, &amp;gt;50% of gender inequality reduction) are explicitly conditioned on conservative assumptions: Mincerian rather than causal returns, no effect on capital income, and no human capital externalities. The paper argues these assumptions bias all estimates downward.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Summary based on HAL working paper (halshs-04423765v1, Working Paper 2023/25, November 2023). Period covered in working paper text: 1980–2022. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;</description></item><item><title>Diversification, Market Entry, and the Global Internet Backbone</title><link>https://macropaperwarehouse.com/papers/diversification-market-entry-and-the-global-internet-backbone/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/diversification-market-entry-and-the-global-internet-backbone/</guid><description>&lt;p&gt;This paper investigates how buyer demand for supplier diversification shapes entry incentives and market structure, using the global undersea fiber-optic cable industry as the empirical setting. The research question has two parts: first, how much of observed cable entry and surplus generation is attributable to buyers&amp;rsquo; diversification motives rather than standard price competition; and second, whether market forces produce too much or too little diversification relative to the social optimum.&lt;/p&gt;
&lt;p&gt;The empirical setting spans 2005–2021 and covers the worldwide network of undersea cables that carries more than 98% of all international internet traffic. Cables fail frequently — hundreds of faults per year — and industry professionals confirm that &amp;ldquo;no customer would buy capacity on a single cable.&amp;rdquo; The median monthly price for a 10Gbps lease fell from $55,500 in 2005 to $2,200 in 2021, and the number of active cables roughly doubled over the sample period.&lt;/p&gt;
&lt;p&gt;The authors use proprietary data from TeleGeography covering cable characteristics (construction costs, capacity, landing points, entry dates), quarterly bandwidth prices at the city-pair level, annual used bandwidth at the country-pair level, and 168 documented cable faults. Markets are defined as country-pairs in calendar quarters.&lt;/p&gt;
&lt;p&gt;The theoretical model begins with a representative buyer who splits bandwidth purchases equally across n symmetric cable operators to minimize expected disruption costs. Because disruption shocks are i.i.d. across cables, adding suppliers reduces the variance of realized bandwidth delivery, lowering the required over-provisioning buffer. This generates a &amp;ldquo;market expansion&amp;rdquo; channel: entry increases aggregate demand holding prices fixed, not just through price competition. The aggregate demand equation takes log-linear form with cable count indicators alongside price and demand shifters.&lt;/p&gt;
&lt;p&gt;The structural model adds a dynamic oligopoly game where firms make entry and exit decisions as a non-stationary Markov Perfect Equilibrium, with Cournot competition in each period. The three-step estimation procedure recovers: (1) price elasticities and diversification parameters from an IV demand regression using electricity generation cost shares as instruments; (2) marginal costs from firms&amp;rsquo; first-order conditions; (3) entry and fixed costs from a nested pseudo-likelihood (NPL) estimator, supplemented by construction cost data to separately identify entry costs given the near-absence of observed exits.&lt;/p&gt;
&lt;p&gt;Key demand results: the IV price elasticity is −1.36. The market expansion effect is large and exhibits decreasing marginal returns — entry of a second cable expands demand by as much as a 28.3% price decrease; a third cable is equivalent to a 19.3% price decrease; an eighth cable is equivalent to a 7.5% price decrease. The demand model achieves R² = 95%.&lt;/p&gt;
&lt;p&gt;The first counterfactual removes the diversification channel entirely (entry raises competition only). Without diversification, cable investment falls by 12%. The net present value of total surplus per market over the sample period averages $1.11 billion under the observed equilibrium; supplier diversification accounts for 11% of total surplus and 27% of consumer surplus.&lt;/p&gt;
&lt;p&gt;The second counterfactual quantifies two opposing distortions relative to the social optimum. Business-stealing creates excessive entry (entrants reduce incumbents&amp;rsquo; output), while diversity effects create insufficient entry (marginal entrants generate surplus through diversification they cannot fully capture). At end-of-sample (2021-Q4), diversity distortions in terms of number of entrants range from 54% to 125% of the business-stealing distortion. Business-stealing tends to dominate for most markets, producing moderately excessive entry. Relative to the market outcome, total surplus under the social planner&amp;rsquo;s solution is on average 10% higher: 53% of this welfare gap is attributable to diversity effects and 47% to business-stealing effects. These findings hold across market heterogeneity in entry costs, market size, and demand growth.&lt;/p&gt;
&lt;p&gt;The paper concludes that profit-maximizing suppliers fail to fully internalize diversification-related social benefits, and that targeted entry subsidies would pass cost-benefit tests in settings where diversity distortions dominate.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism by which supplier diversification expands demand?
A: When buyers split purchases across n cable operators whose disruption shocks are i.i.d., adding a supplier reduces the variance of realized delivered bandwidth. The buyer therefore needs to hold a smaller over-provisioning buffer to achieve the same expected level of used bandwidth B. This lowers the effective cost of a given quantity of used bandwidth, shifting the aggregate demand curve outward. As the number of suppliers grows to infinity, the expected disruption cost converges to zero.&lt;/p&gt;
&lt;p&gt;Q: How large is the market-expansion effect of diversification empirically?
A: The effect is large but exhibits decreasing marginal returns. Entry of a second cable expands demand by as much as a 28.3% price reduction holding prices fixed; the third cable is equivalent to a 19.3% price reduction; and the eighth cable is equivalent to a 7.5% price reduction. All cable-count coefficients are positive and statistically significant in the IV demand model.&lt;/p&gt;
&lt;p&gt;Q: How is price endogeneity addressed in the demand estimation?
A: Bandwidth prices are instrumented using the marginal cost of electricity generation — specifically, country-level electricity generation shares (coal, gas, oil) interacted with quarterly commodity price series for coal, gas, and oil (Brent crude, Australian coal price, EU natural gas price). The first-stage results indicate electricity costs are strong predictors of bandwidth prices. Accounting for endogeneity raises the price elasticity from an OLS level to −1.36 in absolute value, consistent with the expected direction of OLS bias.&lt;/p&gt;
&lt;p&gt;Q: What share of cable investment and surplus is attributable to diversification motives?
A: In the counterfactual where the diversification channel is eliminated — entry raises competition and lowers prices but provides no diversification benefit — cable investment falls by 12%. Under the observed equilibrium, the net present value of total surplus per market over 2005–2021 averages $1.11 billion; supplier diversification accounts for 11% of this total surplus and 27% of consumer surplus.&lt;/p&gt;
&lt;p&gt;Q: How are the two distortions — business-stealing and diversity — defined and separated?
A: Business-stealing distortion arises because entrants reduce incumbents&amp;rsquo; outputs and revenues, so private entry benefits exceed social benefits, leading to excessive entry. Diversity distortion arises because entrants create surplus for buyers through diversification but cannot fully capture it without perfect price discrimination (following Spence (1976) and Mankiw and Whinston (1986)), leading to insufficient entry. The authors disentangle these by comparing: (i) the social planner&amp;rsquo;s solution (eliminates both distortions), and (ii) a coordinated entry solution maximizing producer surplus (eliminates only business-stealing). The residual gap between the two identifies the diversity distortion.&lt;/p&gt;
&lt;p&gt;Q: What is the net direction and magnitude of distortion in equilibrium market structure?
A: At 2021-Q4, for most markets, business-stealing dominates, leading to moderately excessive entry. Diversity distortions in number of entrants range from 54% to 125% of the business-stealing distortion across markets. Relative to the market outcome, the social planner&amp;rsquo;s solution yields average total surplus that is 10% higher. Of that welfare gap, 53% is attributable to diversity effects and 47% to business-stealing effects.&lt;/p&gt;
&lt;p&gt;Q: How do market characteristics affect which distortion dominates?
A: The paper analyzes cross-market heterogeneity and identifies market features — including the size of entry costs, market size, and the rate of demand growth over time — as determinants of whether insufficient diversification or excessive entry is the binding distortion. Markets with higher entry costs or slower demand growth are more likely to exhibit insufficient diversification.&lt;/p&gt;
&lt;p&gt;Q: How are entry costs identified given the near-absence of cable exits in the data?
A: Because exit events are rare in a nascent industry — only a handful of exits observed, mostly after 2020 — entry and fixed costs cannot be separated by exit decisions alone. The authors address this by using cable-level construction cost data from TeleGeography to estimate entry costs outside the dynamic model. With entry costs in hand, firms&amp;rsquo; optimal entry decisions identify fixed costs. Scrap values are normalized to zero, consistent with industry reports that retired cables are typically abandoned on the seabed.&lt;/p&gt;
&lt;p&gt;Q: What role does the non-stationarity of the market environment play in the model?
A: The data covers the industry&amp;rsquo;s earliest growth phase, with demand growing by roughly three orders of magnitude (used bandwidth from 5 Tbps in 2005 to 2,886 Tbps in 2021) and prices falling by a factor of roughly 25. The authors use a non-stationary Markov Perfect Equilibrium concept in which strategies and transition functions are indexed by time, aligning with the treatment of high-tech commodities in Igami (2017).&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: Because profit-maximizing suppliers do not fully internalize the diversification-related social benefits of entry, entry rates can be sub-optimal from a welfare perspective when diversity distortions dominate. The authors suggest targeted entry subsidies would pass cost-benefit tests in such cases. For antitrust analysis, regulators who ignore the demand-expansion effect of incremental suppliers may incorrectly judge a market as sufficiently competitive. In merger review, authorities must account for firms&amp;rsquo; private incentives to provide diversification to reach accurate welfare conclusions.&lt;/p&gt;
&lt;p&gt;Q: How does the paper verify that diversification demand is not a spurious empirical artifact?
A: Several checks support the causal interpretation. The estimated demand parameters are consistent with the predictions of the consumer-level utility maximization problem derived analytically: decreasing marginal returns to diversification and a positive relationship between the number of suppliers and demand. The demand model achieves R² = 95%, suggesting limited unobserved confounders. Additionally, 78% of cable faults involve only a single cable, confirming that disruptions are geographically isolated and that cross-cable diversification provides genuine insurance value.&lt;/p&gt;
&lt;p&gt;Q: What are the main data limitations acknowledged by the authors?
A: The authors cannot observe cable-level revenue or market shares, nor contracts between buyers and sellers; only aggregate country-pair used bandwidth is observed. Price coverage is not comprehensive — TeleGeography collects prices on a voluntary basis from dozens of providers. The cable faults dataset (168 faults) represents only a subset of total faults, as collection focuses on publicly disclosed events. The demand model also does not explicitly account for substitution patterns across firms due to lack of firm-level market share data, though the high R² partly mitigates this concern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Diversification (in this paper&amp;rsquo;s sense):&lt;/strong&gt; Buyers&amp;rsquo; practice of splitting bandwidth purchases across multiple cable operators to reduce exposure to idiosyncratic disruption risk. Diversification across n cables with i.i.d. disruption shocks reduces the variance of realized delivered bandwidth and lowers the required over-provisioning buffer, making the effective cost of a given usage level B a decreasing function of n.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market Expansion Effect:&lt;/strong&gt; The channel through which entry of additional cable suppliers raises aggregate demand holding prices fixed. This occurs because each additional supplier reduces disruption risk, allowing buyers to demand more used bandwidth for the same price. It is distinct from the conventional competition channel (entry lowering prices).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Diversity Distortion:&lt;/strong&gt; The tendency toward insufficient entry arising because marginal entrants generate consumer surplus through diversification benefits but cannot fully capture this surplus absent price discrimination. Follows Spence (1976) and Mankiw and Whinston (1986).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business-Stealing Distortion:&lt;/strong&gt; The tendency toward excessive entry arising because entrants reduce incumbents&amp;rsquo; output and revenues, creating a gap between private and social returns to entry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-Stationary Markov Perfect Equilibrium:&lt;/strong&gt; The equilibrium concept used for the dynamic entry game, in which strategies and equilibrium selection rules are indexed by calendar time to accommodate substantial secular trends in demand and costs — as opposed to a stationary MPE which assumes a stable long-run distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Used Bandwidth vs. Purchased Bandwidth:&lt;/strong&gt; Used bandwidth B is the amount the buyer is committed to delivering (to downstream customers or for internal use). Purchased bandwidth Q is what the buyer actually contracts for across all cables; Q &amp;gt; B because the buyer holds an over-provisioning buffer against disruption risk. The ratio B/Q is a decreasing function of the disruption cost parameter gamma and an increasing function of the number of suppliers n.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nested Pseudo-Likelihood (NPL) Algorithm:&lt;/strong&gt; The baseline estimator for the dynamic game, following Aguirregabiria and Mira (2007). It iterates on the best-response mapping to impose equilibrium restrictions. The authors supplement NPL with two-step estimators (1-PML, 1-MD) and the spectral algorithm of Aguirregabiria and Marcoux (2021), which solves for the root of a nonlinear system using a quasi-Newton method and is robust to fixed-point instability.&lt;/p&gt;</description></item><item><title>Diversifying Society's Leaders? Determinants and Causal Effects of Admission</title><link>https://macropaperwarehouse.com/papers/diversifying-societys-leaders-determinants-and-causal-effects-of-admission/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/diversifying-societys-leaders-determinants-and-causal-effects-of-admission/</guid><description>&lt;p&gt;This paper studies why children from high-income families are more likely to attend Ivy-Plus colleges (Ivy League, Stanford, MIT, Duke, Chicago — 12 colleges total) and whether attending these colleges causally improves post-college outcomes. The authors construct a de-identified panel dataset linking federal income tax records, Department of Education college attendance data, College Board and ACT test scores, and application and admissions records from several Ivy-Plus and flagship public colleges covering approximately 2.4 million students across entering classes from 1998–2015.&lt;/p&gt;
&lt;p&gt;The central finding on the input side is that students from families in the top 1% of the income distribution (income above $611,000) are 2.3 times more likely to attend an Ivy-Plus college than middle-class students (defined as the 70th–80th percentiles of the national parental income distribution, approximately $91,000–$114,000) with comparable SAT/ACT scores. Two-thirds of this gap is attributable to higher admissions rates at Ivy-Plus colleges for high-income applicants; conditional on SAT/ACT scores, top-1% applicants are 58% more likely to be admitted than middle-class applicants. The remaining third splits between differences in application rates (roughly 20% of the total attendance gap) and matriculation rates (roughly 12%). In contrast, admissions rates at flagship public colleges are essentially uncorrelated with parental income conditional on test scores.&lt;/p&gt;
&lt;p&gt;Three admissions practices drive the high-income admissions advantage at Ivy-Plus colleges. First, legacy preferences: legacy applicants from the top 1% are admitted at more than five times the rate of non-legacy applicants with comparable test scores, demographics, and admissions ratings; children of alumni of a given Ivy-Plus college are not more likely to be admitted to other Ivy-Plus colleges, confirming that legacy status is not merely a proxy for unobservable credentials. Legacy preferences account for 52 of the estimated 168 &amp;ldquo;extra&amp;rdquo; top-1% students per average Ivy-Plus class (enrollment ~1,650). Second, non-academic ratings: students from the top 1% have markedly stronger non-academic credentials (extracurricular activities, leadership ratings) partly because they disproportionately attend private high schools whose students receive higher non-academic ratings despite no higher academic ratings; this accounts for 35 additional extra top-1% students. Third, athletic recruitment: the share of recruited athletes rises from 5% among admitted students from the bottom 60% to 13% among those from the top 1%, accounting for 27 additional extra top-1% students.&lt;/p&gt;
&lt;p&gt;On the output side, the authors estimate causal effects of attending an Ivy-Plus college using a new research design based on waitlisted applicants. The key identification assumption is that idiosyncratic variation in admissions decisions across waitlisted applicants at one Ivy-Plus college is uncorrelated with admissions decisions at other Ivy-Plus colleges — which the authors verify empirically. Under this assumption, comparisons of admitted vs. rejected waitlisted applicants identify causal effects for marginal students. The marginal student who attends an Ivy-Plus college instead of the average flagship public is approximately 50% more likely to reach the top 1% of the earnings distribution at age 33, nearly twice as likely to attend a highly-ranked graduate school, and 2.5 times as likely to work at a prestigious firm. Attending an Ivy-Plus college increases mean earnings by $101,000 at age 33 relative to a counterfactual mean of $143,000 at state flagships. Effects are concentrated in the upper tail of earnings — the impact on reaching the top quartile is small and statistically insignificant, while impacts on reaching the top 1% far exceed what a constant percentage treatment effect would predict. Effects are larger for students with weaker fallback options (i.e., whose home-state colleges channel fewer students to the top 1%).&lt;/p&gt;
&lt;p&gt;Critically, the three credentials driving the high-income admissions advantage — legacy status, athletic recruitment, and high non-academic ratings — are uncorrelated with or negatively correlated with post-college success once the college attended is held constant. Academic credentials (SAT/ACT scores, academic ratings) remain highly predictive of outcomes.&lt;/p&gt;
&lt;p&gt;Counterfactual simulations show that eliminating all three high-income admissions preferences and replacing those slots with students having the same test score distribution would increase enrollment from the bottom 95% of the parental income distribution by 8.8 percentage points — comparable in magnitude to the effect of race-based affirmative action on Black and Hispanic enrollment shares. Such a policy would have small effects on monetary leadership outcomes (e.g., Fortune 500 CEO share from bottom-95% families rises by only 0.4 pp, because Ivy-Plus graduates are a small fraction of all top earners) but larger effects on non-monetary leadership positions: the share of senators from the bottom 95% would rise by 1.7 pp and the share of Supreme Court justices by 5.4 pp. With need-affirmative policies (giving low-income students preferences comparable to those currently given to legacy applicants), the share of Supreme Court justices from families in the bottom 60% would rise by 17.5 pp. These predictions assume that the causal share of Ivy-Plus attendance in explaining observational differences in leadership outcomes is the same as that estimated for early-career outcomes, and they ignore general equilibrium effects.&lt;/p&gt;
&lt;p&gt;Q: How much more likely are top-1% students to attend an Ivy-Plus college than middle-class students with the same test scores?
A: Students from families in the top 1% (income above $611,000) are 2.3 times more likely to attend an Ivy-Plus college than students from the 70th–80th percentile of the parental income distribution (approximately $91,000–$114,000) with comparable SAT/ACT scores. This &amp;ldquo;missing middle&amp;rdquo; pattern is stable across entering classes from 1998 to 2018 and persists after controlling for race and ethnicity.&lt;/p&gt;
&lt;p&gt;Q: How is the overall attendance gap decomposed into application, admissions, and matriculation?
A: Differences in admissions rates explain two-thirds of the gap in Ivy-Plus attendance between top-1% and middle-class students conditional on test scores. Of the estimated 168 &amp;ldquo;extra&amp;rdquo; top-1% students per average Ivy-Plus class, 87 come from higher admissions rates for non-recruited athletes, 27 from athletic recruitment, and the remaining slack from application rate differences (accounting for roughly 20% of the overall attendance gap) and matriculation differences (roughly 12%).&lt;/p&gt;
&lt;p&gt;Q: How large is the admissions advantage for top-1% applicants at Ivy-Plus colleges?
A: Conditional on SAT/ACT scores, applicants from the top 1% are 58% more likely to be admitted to Ivy-Plus colleges than middle-class applicants. Students from the top 0.1% are 2.5 times more likely to be admitted than middle-class applicants with comparable test scores. At flagship public colleges, admissions rates are essentially constant across the income distribution conditional on test scores.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of legacy preferences and how is it established that legacy is not just a proxy for other credentials?
A: Legacy applicants from the top 1% are admitted at more than five times the rate of otherwise comparable non-legacy applicants at the college their parents attended. The paper isolates the legacy effect by showing that children of alumni at a given Ivy-Plus college are only slightly more likely to be admitted at other Ivy-Plus colleges — and the predicted counterfactual admissions rate for legacy students at other colleges closely matches their actual admissions rate — confirming that legacy status is not merely a proxy for other unobservable credentials. Legacy applicants constitute 2.5% of the overall applicant pool but over 9% of top-1% applicants.&lt;/p&gt;
&lt;p&gt;Q: How do non-academic credentials differ by parental income, and what drives the difference?
A: Top-1% applicants have markedly stronger non-academic ratings (measuring extracurricular participation and leadership traits) compared with other applicants, while the share achieving high academic ratings is essentially constant across the income distribution. Students from the top 1% are much more likely to have attended private high schools, whose applicants receive substantially higher non-academic ratings than students from public high schools with the same SAT/ACT scores. Non-academic ratings account for 35 of the estimated 168 extra top-1% students per Ivy-Plus class.&lt;/p&gt;
&lt;p&gt;Q: What is the research design for estimating causal effects, and what is the key identification assumption?
A: The authors focus on applicants who are waitlisted at a given Ivy-Plus college and compare those ultimately admitted versus rejected from the waitlist. The key identification assumption is that if different colleges&amp;rsquo; admissions committees make correlated assessments of underlying student merit but uncorrelated idiosyncratic admissions errors, then residual variation in admissions outcomes for waitlisted applicants at one college is orthogonal to students&amp;rsquo; long-run potential. The authors validate this empirically by showing that waitlist admission at one Ivy-Plus college is uncorrelated with admissions decisions and internal ratings at other Ivy-Plus colleges.&lt;/p&gt;
&lt;p&gt;Q: What are the causal effects of attending an Ivy-Plus college on post-college outcomes?
A: For the marginal student (one who attends an Ivy-Plus college instead of the average flagship public), attending an Ivy-Plus college increases the probability of reaching the top 1% of the earnings distribution at age 33 by approximately 50%, nearly doubles the probability of attending an elite graduate school, and increases the probability of working at a prestigious firm by approximately 2.5 times. Mean earnings at age 33 increase by $101,000 (relative to a counterfactual mean of $143,000 at state flagships). Effects on reaching the top quartile of earnings are small and statistically insignificant, while effects at the very top tail are disproportionately large.&lt;/p&gt;
&lt;p&gt;Q: Why do the findings differ from Dale and Krueger (2002) and related studies finding little effect of selective college attendance on earnings?
A: The authors replicate the matriculation design of Dale and Krueger (comparing outcomes conditional on the set of colleges to which students were admitted) and obtain estimates statistically indistinguishable from their waitlist design — the research designs are not the source of disagreement. Instead, the differences arise because (1) the authors have direct college fixed effects rather than relying on average test scores as a proxy for college quality, and (2) the authors focus on upper-tail outcomes (top 1% earnings, elite graduate schools, prestigious firms) rather than log mean earnings, where Ivy-Plus colleges have their largest effects.&lt;/p&gt;
&lt;p&gt;Q: Are the credentials that drive the high-income admissions advantage — legacy, athlete status, high non-academic ratings — predictive of better post-college outcomes?
A: No. Recruited athletes, students with higher non-academic ratings, and legacy students have equivalent or lower chances of reaching the upper tail of the income distribution, attending an elite graduate school, or working at a prestigious firm than comparable Ivy-Plus applicants once the college attended is held constant. By contrast, SAT/ACT scores and academic ratings are highly positively predictive of all three post-college outcome measures.&lt;/p&gt;
&lt;p&gt;Q: How much could changing admissions practices diversify Ivy-Plus enrollment and subsequently society&amp;rsquo;s leadership?
A: Eliminating legacy preferences, non-academic rating weights, and the differential recruitment of high-income athletes — and filling those slots with students having the same test score distribution as the current class — would increase enrollment from families in the bottom 95% of the parental income distribution by 8.8 percentage points, a magnitude comparable to race-based affirmative action&amp;rsquo;s effect on Black and Hispanic enrollment shares. For leadership positions, predicted effects are small for monetary outcomes (Fortune 500 CEOs from the bottom 95% would increase by only 0.4 pp) but larger for positions where Ivy-Plus graduates are a larger share: senators from the bottom 95% would increase by 1.7 pp and Supreme Court justices by 5.4 pp. A stronger need-affirmative policy (giving low-income students preferences equivalent to current legacy preferences) would increase the share of Supreme Court justices from the bottom 60% by 17.5 pp.&lt;/p&gt;
&lt;p&gt;Q: How are &amp;ldquo;elite&amp;rdquo; and &amp;ldquo;prestigious&amp;rdquo; employers defined in this study?
A: Elite firms are defined as those that disproportionately employ Ivy-Plus graduates relative to flagship public graduates, pulling firms from the top of that ratio ranking until 25% of Ivy-Plus attendee employment is accounted for. Prestigious employers are defined by the residual of that ratio after controlling for the firm&amp;rsquo;s predicted top-1% income probability — they are firms that disproportionately employ Ivy-Plus graduates conditional on their salaries, capturing high-status jobs that do not necessarily lead to the highest earnings. The paper validates this algorithmic approach against external rankings (Vault.com for law and consulting firms; Scimagoir for hospitals), finding substantial overlap.&lt;/p&gt;
&lt;p&gt;Q: How are treatment effect estimates adjusted for heterogeneity in students&amp;rsquo; fallback options?
A: Causal effects of Ivy-Plus attendance are much larger for students with weaker fallback options — specifically, students whose home-state flagship colleges channel fewer students to the top 1% of earnings. The authors exploit this heterogeneity to estimate the treatment effect for the marginal student who actually switches from a flagship public to an Ivy-Plus college. This heterogeneity also implies that the average causal effect across all admitted students may differ from the effect for the marginal admitted student.&lt;/p&gt;
&lt;p&gt;Q: What share of the overrepresentation of top-1% families at Ivy-Plus colleges is attributable to pre-application factors versus admissions practices?
A: Of the 245 &amp;ldquo;extra&amp;rdquo; top-1% students in an average Ivy-Plus class relative to an unconditionally income-neutral benchmark, 77 (31%) are attributable to the higher test scores of top-1% students (a pre-application factor). The remaining 168 (69%) reflect higher attendance rates conditional on test scores, of which the large majority is attributable to admissions practices (legacy, non-academic ratings, athletic recruitment) rather than application or matriculation rate differences.&lt;/p&gt;
&lt;p&gt;Ivy-Plus colleges: The twelve highly selective private colleges comprising the eight Ivy League institutions plus Stanford, MIT, Duke, and the University of Chicago — the focus group of the study, which together account for more than 10% of Fortune 500 CEOs, a quarter of U.S. senators, and three-fourths of Supreme Court justices appointed in the last half century despite enrolling less than 0.5% of Americans.&lt;/p&gt;
&lt;p&gt;Missing middle: The pattern by which attendance rates at Ivy-Plus colleges conditional on SAT/ACT scores are lowest for students from the middle class (70th–80th percentile of the parental income distribution, approximately $91,000–$114,000) — lower than both the top 1% and, slightly, the bottom 40% — producing a non-monotone income gradient in attendance.&lt;/p&gt;
&lt;p&gt;Legacy preference: An admissions advantage given to applicants whose parent(s) obtained an undergraduate degree from the college to which the student is applying. In the paper&amp;rsquo;s data, legacy applicants from the top 1% are admitted at more than five times the rate of non-legacy applicants with comparable test scores, demographics, and admissions ratings; the preference is college-specific (children of alumni are only slightly more likely to be admitted at other Ivy-Plus colleges).&lt;/p&gt;
&lt;p&gt;Waitlist research design: The paper&amp;rsquo;s primary identification strategy for causal effects, which exploits idiosyncratic variation in admissions decisions among waitlisted applicants. The design&amp;rsquo;s validity rests on the empirical finding that waitlist admissions at one Ivy-Plus college are uncorrelated with admissions decisions and internal ratings at other Ivy-Plus colleges, implying that residual variation conditional on being on the waitlist is orthogonal to students&amp;rsquo; long-run potential outcomes.&lt;/p&gt;
&lt;p&gt;Prestigious employers: Firms defined by the paper&amp;rsquo;s algorithm as disproportionately employing Ivy-Plus graduates conditional on those firms&amp;rsquo; predicted top-1% income probability — capturing high-status employment that does not necessarily lead to the highest earnings (e.g., prominent law firms, consulting firms, elite hospitals). Validated against external rankings (Vault.com, Scimagoir).&lt;/p&gt;
&lt;p&gt;Non-academic ratings: Numerical scores assigned by admissions officers measuring aspects of an application outside academic achievement, such as extracurricular activities and leadership traits. In the paper&amp;rsquo;s data, non-academic ratings differ substantially by parental income — particularly because top-1% applicants disproportionately attend private high schools whose students receive higher non-academic ratings — while academic ratings do not differ across the income distribution.&lt;/p&gt;
&lt;p&gt;Surrogate index: A prediction of later earnings outcomes (specifically, probability of reaching the top 1% at age 33 and mean income rank) constructed from individuals&amp;rsquo; graduate school attendance and employer fixed effects at ages 22–25, used to extend the outcome window for cohorts observed only early in their careers. The approach follows the terminology and methodology of Athey et al. (2019).&lt;/p&gt;</description></item><item><title>Do Credit Conditions Move House Prices?</title><link>https://macropaperwarehouse.com/papers/do-credit-conditions-move-house-prices/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/do-credit-conditions-move-house-prices/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; To what extent did an expansion and contraction of credit drive the 2000s housing boom and bust? The existing literature offers sharply divergent answers — ranging from credit explaining virtually none of the boom (Kaplan, Mitman, and Violante 2020) to credit explaining the majority of it (Favilukis, Ludvigson, and Van Nieuwerburgh 2017, who find credit alone explains 60% of the rise in price-to-rent ratios). Greenwald and Guren argue that the source of these divergent findings is a single structural assumption: the degree to which credit-insensitive agents (landlords and unconstrained savers) can absorb credit-driven demand for housing, which in turn depends on the degree of segmentation between the owner-occupied and rental housing markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanism.&lt;/strong&gt; The paper organizes the literature around a &amp;ldquo;tenure supply&amp;rdquo; curve, defined in price-rent ratio versus homeownership rate space. A perfectly inelastic (vertical) supply curve — corresponding to perfect segmentation, in which housing cannot move between the owner-occupied and rental sectors — implies that credit expansion bids up house prices with no change in the homeownership rate. A perfectly elastic (horizontal) supply curve — corresponding to a frictionless rental market with deep-pocketed landlords who price at the present value of rents — implies that credit expansion raises the homeownership rate but not the price-rent ratio, because landlord reservation prices are unaffected by credit. Intermediate degrees of segmentation produce intermediate outcomes: credit raises both the price-rent ratio and the homeownership rate, with the relative magnitudes determined by the slope of the tenure supply curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy.&lt;/strong&gt; To measure where reality falls on this spectrum, the authors estimate the relative elasticity of the price-rent ratio to an identified credit supply shock, compared to the elasticity of the homeownership rate to the same shock. This ratio is a sufficient statistic for the slope of the tenure supply curve. They use three distinct identification strategies from prior literature — (1) Loutskina and Strahan (2015), instrumenting for local credit supply using differential city-level exposure to changes in the conforming loan limit (CLL); (2) Di Maggio and Kermani (2017), exploiting the 2004 OCC preemption of state anti-predatory-lending laws for national banks; and (3) Mian and Sufi (2019), using differential city-level exposure to the 2003 private label securitization (PLS) expansion through bank funding composition. Regressions are estimated on annual CBSA-level panels using local projection IV (LP-IV) or event-study reduced-form methods. Key data include the CoreLogic repeat-sales house price index, the CBRE Torto-Wheaton same-store rent index (a repeat-rent index for multi-unit apartment buildings, constructed from newly-leased units), and Census Housing Vacancy Survey homeownership rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings.&lt;/strong&gt; All three instruments consistently find that credit supply shocks generate a significant increase in house prices and the price-rent ratio but a much smaller, rarely statistically significant, effect on the homeownership rate. Under the LS LP-IV, the price-rent ratio peaks at an increase of 0.471, while the homeownership rate response reaches only 0.037 at the 2-year horizon and peaks at 0.101 after 5 years. The ratio of price-rent to homeownership responses ranges from 3 to infinity across the three instruments and horizons. These estimates imply a substantial degree of segmentation — the no-segmentation model falls far outside the 95% confidence intervals at all horizons.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural Model and Calibration.&lt;/strong&gt; The authors construct a general equilibrium model featuring a representative borrower, landlord, and saver, with long-term fixed-rate mortgages subject to loan-to-value (LTV) and payment-to-income (PTI) limits following Greenwald (2018). The key modeling innovation is within-type heterogeneity in the benefit of owning versus renting, captured by logistic distributions for both borrowers and landlords. The dispersion parameter of the landlord distribution (σω,L) governs the slope of the tenure supply curve and is calibrated to minimize weighted distance to the LS empirical impulse responses. The resulting benchmark calibration yields σω,L = 2.877, with the benchmark model&amp;rsquo;s price-rent-to-homeownership ratio between 6.98 and 9.31 depending on the horizon — consistent with the empirical estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Results on the 2000s Boom.&lt;/strong&gt; The paper then uses the calibrated model to simulate a credit standard relaxation (LTV limits relaxed from 85% to 99%, PTI limits from 36% to 65%) from 1998 Q1 through 2007 Q1, with a reversion at the start of the bust. This credit relaxation alone explains 34% of the peak rise in price-rent ratios observed in the boom, with a lower bound of 26% accounting for parameter uncertainty. In contrast, the no-segmentation model explains -1%, while the full segmentation model explains 38%. Adding a 2 percentage point permanent decline in mortgage spreads alongside the credit standard relaxation allows the benchmark model to explain 72% of the observed rise in price-rent ratios and 80% of the rise in loan-to-income ratios, compared to only 4% in the no-segmentation model. In a &amp;ldquo;full boom&amp;rdquo; scenario where additional demand and supply shocks are added to match the entire boom in price-rent ratios and homeownership, removing the credit relaxation reduces the rise in price-rent ratios by 55% in the benchmark economy — larger than the 34% explained in isolation due to nonlinear interactions — compared to only 5% in the no-segmentation economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Extensions.&lt;/strong&gt; These results apply to the benchmark calibration in which landlords do not use credit and saver housing demand is fixed. When landlords are allowed to use credit (LTV limit of 65% relaxed to 85% during the boom), the role of credit is strengthened: the recalibrated model explains 80% of the rise in price-rent ratios from combined credit and rate changes, suggesting the benchmark is a lower bound. When savers are allowed to frictionlessly trade housing with borrowers, credit explains 54% of the rise in price-rent ratios even after recalibration — a roughly 25% reduction relative to the benchmark 72%, representing what the authors characterize as an extreme lower bound given that saver housing markets are in practice substantially segmented due to indivisibility, quality, and location differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Implications.&lt;/strong&gt; The findings imply that macroprudential policies tightening LTV and PTI ratios can be effective at restraining house price growth, but only in the presence of the significant rental market segmentation found in the benchmark economy. In the no-segmentation economy, removing the credit relaxation from the full boom reduces price-rent ratio growth by only 5%.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core theoretical insight that reconciles the divergent findings in the prior literature on credit and house prices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key difference is the degree to which credit-insensitive agents — specifically landlords and unconstrained savers — can absorb credit-driven demand for housing. Models with perfectly segmented rental markets (no rental sector or fixed homeownership rate) feature borrowers competing only with each other for a fixed stock, so credit expansion bids up prices. Models with frictionless rental markets feature deep-pocketed landlords who supply housing at a price equal to the present value of rents, which is unaffected by credit; credit expansion then raises the homeownership rate rather than prices. Intermediate degrees of frictions produce intermediate outcomes. This mechanism had not been recognized as the source of the literature&amp;rsquo;s divergence before this paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the &amp;ldquo;tenure supply curve&amp;rdquo; and why is its slope the key empirical object?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The tenure supply curve describes the menu of price-rent ratios at which landlords are willing to supply varying amounts of owner-occupied housing (given total housing stock), traced out in price-rent ratio versus homeownership rate space. Its slope determines how the equilibrium responds to a credit-induced demand shift: a steep (inelastic) supply curve translates credit expansion primarily into price-rent ratio increases; a flat (elastic) supply curve translates it primarily into homeownership rate increases. Identifying this slope empirically is therefore sufficient to discipline any macro-housing model&amp;rsquo;s predictions about the role of credit in price dynamics, for arbitrary underlying shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do the authors identify the slope of the tenure supply curve empirically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;They estimate the slope as the ratio of the causal elasticity of the price-rent ratio to that of the homeownership rate, with respect to an identified credit supply shock. Three instruments are used: (1) the Loutskina-Strahan shift-share instrument based on differential exposure to changes in the conforming loan limit, estimated by LP-IV on an unbalanced panel of 62 CBSAs from 1992 to 2016; (2) the Di Maggio-Kermani event study based on the 2004 OCC preemption of state anti-predatory-lending laws, covering 262 CBSAs for house prices and 82 CBSAs for homeownership from 2001 to 2010; and (3) the Mian-Sufi event study based on differential exposure to the 2003 PLS expansion via non-core deposit share, covering 245 CBSAs using ACS and FHFA data. In practice, they estimate the inverse slope (ratio of homeownership to price-rent response) because the first stage is far stronger using price-rent ratios as the endogenous variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the empirical results on the relative price-rent and homeownership responses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Across all three instruments, credit supply shocks significantly raise the price-rent ratio but have a much smaller, rarely statistically significant effect on the homeownership rate. Under the LS LP-IV, the price-rent ratio peaks at 0.471 after 2 years, while the homeownership rate reaches only 0.037 at 2 years and peaks at 0.101 at 5 years. The naive point-estimate ratios range from 2.93 to 12.83 at horizons 2 through 5, with the 4-year estimate negative (implying an infinite slope). The directly estimated inverse slope coefficients are small (0.05 to 0.24) and never statistically different from zero. The DK instrument yields slopes of 6.72 in 2005, 3.67 in 2006, and 3.40 in 2007. The MS instrument yields a slope of approximately 4.49 in both 2006 and 2007. The lower bound of the 95% confidence intervals corresponds to slopes of at least 1.8 to 8.4.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the key modeling contribution on the structural side?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key innovation is the introduction of within-type heterogeneity in ownership preferences for both borrowers and landlords, modeled as logistic distributions. This heterogeneity allows the model to generate a fractional and time-varying homeownership rate — a feature absent from most prior macro-housing models — and maps directly into the slopes of the demand and tenure supply curves. The dispersion in landlord ownership costs (σω,L) governs the supply curve slope and is calibrated to match the empirical impulse responses. Without this heterogeneity, the model would produce corner solutions with all housing owned by one type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the landlord dispersion parameter σω,L calibrated, and what is the estimated value?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The calibration minimizes a weighted sum of squared deviations between model and data impulse responses for the price-rent ratio and homeownership rate, using the LS LP-IV estimates. Deviations are weighted by the inverse of empirical standard errors. Because model impulse responses jump on impact while empirical responses are hump-shaped (due to search frictions), the calibration uses only horizons 2 through 5 years. The minimum-distance estimate yields σω,L = 2.877, alongside a mortgage spread shock persistence of 0.965 and a shock size of -0.041 (corresponding to an annualized CLL subsidy of approximately 17 basis points, within the 10-24bp range found in prior literature). The benchmark model&amp;rsquo;s implied price-rent-to-homeownership response ratio ranges from 6.98 to 9.31, consistent with the empirical estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What lower bound does the paper derive for σω,L, and how does the no-segmentation model compare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A credible set for σω,L is derived by targeting the upper and lower bounds of the 95% confidence interval for the estimated inverse slope. The lower bound for σω,L (targeting the top of the confidence interval) is 0.810; the lower bound targets the bottom of the confidence interval but is best matched by the full segmentation case (σω,L → ∞). The no-segmentation economy (σω,L = 0) produces inverse ratios between 4 and 32 times the empirical upper bound, placing it far outside the credible set.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the model&amp;rsquo;s quantitative finding on the role of credit standard relaxation in isolation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A credit standard relaxation (LTV from 85% to 99%, PTI from 36% to 65%) implemented from 1998 Q1 to 2007 Q1 and then reverted explains 34% of the peak rise in price-rent ratios in the benchmark model, with a lower bound of 26% conditional on parameter uncertainty. In the full segmentation model, the same relaxation explains 38%, while in the no-segmentation model it explains -1%. Credit standard relaxation also explains 51% of the rise in loan-to-income ratios in the benchmark, compared to 31% in the no-segmentation model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does adding a decline in mortgage rates contribute?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Adding a permanent 2 percentage point decline in mortgage spreads alongside the credit standard relaxation increases the benchmark model&amp;rsquo;s explained share of the price-rent ratio boom from 34% to 72%, and the loan-to-income ratio share from 51% to 80%. The no-segmentation model explains only 4% of the price-rent ratio boom and 38% of the loan-to-income ratio boom under the same combined experiment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the &amp;ldquo;full boom&amp;rdquo; counterfactual estimate the marginal contribution of credit?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The full boom experiment adds exogenous demand shocks (shifts to µω,B) and supply shocks (shifts to µω,L) on top of the credit relaxation and rate decline, calibrated to exactly reproduce the observed peak increase in both the price-rent ratio and the homeownership rate during the boom. Removing the credit relaxation from this full boom scenario reduces the rise in price-rent ratios by 55% and the rise in loan-to-income ratios by 74% in the benchmark economy. This exceeds the 34% figure from the credit-alone experiment due to strong nonlinear interactions: without the credit relaxation, binding PTI limits constrain households&amp;rsquo; ability to finance properties even when ownership preferences rise, dampening both price and credit growth. In the no-segmentation economy, removing the credit relaxation reduces price-rent ratio growth by only 5%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What are the implications of allowing landlords to use credit?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When landlords face an LTV limit of 65% relaxed to 85% during the boom, the credit expansion also shifts the tenure supply curve upward (as in Panel (d) of the supply-demand framework), leading to a larger price-rent ratio response and a smaller homeownership rate response than in the baseline. Without recalibration, this model explains 81% of the price-rent ratio rise. After recalibration of σω,L (which is required because landlord credit changes the mapping from empirical moments to structural parameters), the model explains 80% of the price-rent ratio rise. This implies the benchmark results are a lower bound on the role of credit in driving house prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the implications of allowing savers to frictionlessly trade housing with borrowers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When savers are allowed to frictionlessly adjust their housing demand (purchasing housing from or selling to borrowers as credit conditions change), the price-rent ratio response is dampened because savers absorb excess borrower demand. After recalibrating σω,L, the combined credit-and-rate experiment explains 54% of the price-rent ratio boom — roughly 25% less than the benchmark 72%. The authors regard this as an extreme lower bound because in practice saver and borrower housing markets are substantially segmented due to indivisibility, location, and quality differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What are the implications for macroprudential policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Macroprudential policies that tighten LTV and PTI limits are effective at slowing house price growth in the benchmark economy, where rental market frictions are substantial. In the full boom counterfactual, tightening credit standards reduces the rise in price-rent ratios by 55%. However, in the no-segmentation economy, the same tightening reduces price-rent ratio growth by only 5%, because landlords readily absorb credit-driven demand and pin prices to the present value of rents. The effectiveness of macroprudential policies is therefore deeply dependent on the degree of rental market segmentation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: Why do the authors prefer the CBRE Torto-Wheaton rent index over typical rent measures?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The TW index uses a repeat-rent methodology on newly-leased multi-unit apartments, which better captures current market conditions than median rent measures, which are biased by composition changes and are sticky due to long-term lease contracts. Since the price-rent ratio is meant to capture the rent a unit could command if leased instead of sold, newly-leased apartment rents are more appropriate for constructing this ratio. The TW index is available for 53 CBSAs from 1989 and 62 CBSAs from 1994.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: Why do the authors estimate the inverse slope rather than the slope directly?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The first stage for the homeownership rate response is very weak — the estimated coefficients are small and imprecise, so using the homeownership rate as an endogenous variable would suffer severe weak instrument problems. Instead, the authors use the price-rent ratio as the endogenous variable (with a much stronger first stage) and the homeownership rate as the outcome, obtaining the inverse slope (homeownership response per unit price-rent ratio response). The upper bounds of the 95% confidence intervals for the inverse slope range from 0.12 to 0.56 across horizons, corresponding to lower bounds on the slope of 1.8 to 8.4.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Tenure Supply Curve.&lt;/strong&gt; The menu of price-rent ratios at which landlords are willing to supply varying quantities of owner-occupied housing (i.e., sell rental units to potential homeowners) at a given total housing stock. Defined in price-rent ratio versus homeownership rate space. Distinct from the absolute supply of housing via the construction sector; shifts in the construction margin affect absolute quantities and prices but not necessarily the price-rent ratio or the ownership share. The slope of this curve — not the level — is the central empirical and structural object of the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market Segmentation (in the paper&amp;rsquo;s sense).&lt;/strong&gt; The degree to which credit-insensitive agents (landlords, unconstrained savers) cannot absorb credit-driven demand from constrained borrowers. Perfect segmentation means owner-occupied and rental housing are entirely non-fungible, so all credit-driven demand falls on a fixed supply of owned units. Zero segmentation means landlords (or savers) can frictionlessly convert between owned and rented housing at a price tied to present discounted rents. In this paper, segmentation is measured continuously by the slope of the tenure supply curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic (for segmentation).&lt;/strong&gt; The ratio of the causal elasticity of the price-rent ratio to the causal elasticity of the homeownership rate, both with respect to the same identified credit supply shock. This ratio identifies the slope of the tenure supply curve and is sufficient to calibrate a structural model to recover the role of credit in driving house prices for arbitrary combinations of shocks, even when those shocks differ from the identifying variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ownership Benefit Heterogeneity.&lt;/strong&gt; An additional idiosyncratic utility flow (positive or negative) that borrowers or landlords receive from owning versus renting a given unit, modeled as a logistic distribution. This within-type heterogeneity generates a fractional and time-varying homeownership rate in the model and maps directly into the slope of the demand and tenure supply curves. The dispersion parameter σω,L for landlords governs the slope of the tenure supply curve; higher dispersion implies a steeper (more segmented) supply curve and larger price-rent ratio responses to credit shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal Collateral Value (CB,t).&lt;/strong&gt; The shadow value to borrowers of the additional credit that can be collateralized by an additional dollar of housing value, equal to µB,t × FLTV × θLTV in the model. A relaxation of credit standards (raising θLTV or θPTI) or a decline in credit costs raises CB,t, increasing borrower reservation prices and shifting the housing demand curve outward. This is the channel through which credit conditions enter house price dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Projection IV (LP-IV).&lt;/strong&gt; A generalization of Jordà (2005) local projections to instrumental variables settings, as in Ramey (2016) and Ramey and Zubairy (2018), extended to a panel context with CBSA and time fixed effects. Used to estimate impulse responses of price-rent ratios, house prices, and homeownership rates to credit supply shocks at horizons 0 through 5 years, instrumenting for endogenous credit growth using the conforming loan limit shift-share instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conforming Loan Limit (CLL) Instrument.&lt;/strong&gt; A shift-share instrument for local credit supply constructed by interacting the share of mortgage originations in the prior year falling within 5% of the current year&amp;rsquo;s CLL with the percentage change in the national CLL. Cities where a larger fraction of loans cluster near the CLL threshold experience a larger credit supply shock when the CLL increases, because more loans shift from unsubsidized to GSE-subsidized rates. The instrument is constructed using the change in the national CLL only to avoid endogeneity from high-cost area adjustments.&lt;/p&gt;</description></item><item><title>Do Financial Concerns Make Workers Less Productive?</title><link>https://macropaperwarehouse.com/papers/do-financial-concerns-make-workers-less-productive/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/do-financial-concerns-make-workers-less-productive/</guid><description>&lt;h2 id="do-financial-concerns-make-workers-less-productive"&gt;Do Financial Concerns Make Workers Less Productive?&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The paper tests whether financial concerns distract workers sufficiently to meaningfully reduce their productivity, and whether receiving cash — by alleviating those concerns — can raise output even when total compensation is held fixed.&lt;/p&gt;
&lt;h3 id="setting-and-sample"&gt;Setting and Sample&lt;/h3&gt;
&lt;p&gt;The experiment involves 408 low-income male agricultural casual laborers in rural Odisha, India, recruited from 47 villages across five worksites in four districts. The study takes place during the lean agricultural season (March–June 2017 and 2018), when formal employment is scarce (workers found paid wage work on only 1.9 days per week on average). During this period, 86% of workers reported being &amp;ldquo;worried&amp;rdquo; or &amp;ldquo;very worried&amp;rdquo; about their finances, 68–71% carried outstanding loans, and 64–66% said they would have difficulty coming up with Rs. 1,000 (roughly four days of wages) in an emergency. Workers bring these burdens to the job: on a given day, approximately one in two workers reported thinking about financial worries while working.&lt;/p&gt;
&lt;h3 id="experimental-design"&gt;Experimental Design&lt;/h3&gt;
&lt;p&gt;Workers were employed for twelve days in a piece-rate manufacturing task — stitching sal tree leaves into disposable plates for restaurants. The payment-timing manipulation is the core of the identification strategy. Control workers received all accrued earnings as a lump sum on the final day (day 12). Treatment workers received their earnings in two installments: an interim payment of earnings to date on day 8 or 9 (randomly staggered across waves), with the balance paid on day 12. Total compensation was held constant across groups; only the timing of receipt differed. On day 5 (the &amp;ldquo;announcement day&amp;rdquo;), each worker learned his payment schedule individually. The design thus separates the announcement period (days 5 through the interim payment day, when workers know their schedule but have not yet received cash) from the post-pay period (days after the interim payment until the contract end). This enables the authors to test whether productivity effects arise from information about impending cash, or only once cash is physically in hand.&lt;/p&gt;
&lt;h3 id="first-stage-effects-on-financial-strain"&gt;First Stage: Effects on Financial Strain&lt;/h3&gt;
&lt;p&gt;Within three days of receiving the interim payment, treated workers increased loan repayments by Rs. 271, a 287% increase relative to the control group mean (p &amp;lt; 0.001), and were 40 percentage points (222%) more likely to repay any loan (p &amp;lt; 0.001). The majority of repayments occurred on the same evening as the cash disbursement — a 746% single-day increase in loan payments. Household expenditures on food, clothing, and essentials rose by 40% (Rs. 150) over three days (p &amp;lt; 0.001). Treatment workers also reported feeling more focused on the work task (11.5 percentage points more likely, p = 0.032) and were less likely to report thinking about financial worries while making plates (13.7 percentage points, p = 0.044).&lt;/p&gt;
&lt;h3 id="main-productivity-results"&gt;Main Productivity Results&lt;/h3&gt;
&lt;p&gt;In the post-pay period, treated workers increased output by 0.109 SD (6.9%) relative to the control group (p = 0.020). No treatment effect emerged during the announcement period (0.014 SD, p = 0.685); the post-pay and announcement-period effects are statistically distinguishable (p = 0.008). Because work hours are fixed and daily attendance is 98.3% with no treatment effect on attendance, these gains reflect improvements in how quickly workers produce plates per hour of work.&lt;/p&gt;
&lt;p&gt;Effects are concentrated among workers with below-median baseline wealth (fewer assets, less liquidity): for this subgroup, the interim payment increases output by 0.204 SD (13.0%, p = 0.003). For workers with above-median wealth, the effect is close to zero and statistically insignificant (p = 0.819).&lt;/p&gt;
&lt;h3 id="attentiveness-results"&gt;Attentiveness Results&lt;/h3&gt;
&lt;p&gt;Beyond total output, the authors measure attentiveness through three markers embedded in the finished plates: the number of &amp;ldquo;double holes&amp;rdquo; (paired stitching holes indicating a removed mistaken stitch), the number of leaves used, and the number of stitches used. These measures are collected unbeknownst to workers and combined into an &amp;ldquo;attentiveness index.&amp;rdquo; After receiving the interim payment, treated workers&amp;rsquo; attentiveness index increased by 0.077 SD across all workers (p = 0.092); among poorer workers, attentiveness increased by 0.17 SD (p = 0.041). This improvement occurred simultaneously with higher output speed — workers were producing plates faster while also making fewer mistakes, suggesting improved cognitive engagement rather than mere effort intensification.&lt;/p&gt;
&lt;h3 id="piece-rate-comparison"&gt;Piece-Rate Comparison&lt;/h3&gt;
&lt;p&gt;In separate supplementary rounds with 150 experienced workers, the authors varied piece rates (Rs. 2, 3, or 4) while holding overall earnings constant. Each one-rupee increase in the piece rate raised output by 0.020 SD (p = 0.042). Critically, piece-rate increases produced no detectable change in the attentiveness index (point estimate negative, statistically insignificant), and the piece-rate effect on output differs significantly from the attentiveness effect (p = 0.001). This indicates that consciou effort and automatic attentiveness can move independently: higher incentives increase pace but do not reduce attentional lapses, whereas financial relief increases both pace and attentiveness.&lt;/p&gt;
&lt;h3 id="alternative-explanations-ruled-out"&gt;Alternative Explanations Ruled Out&lt;/h3&gt;
&lt;p&gt;The authors systematically address gift exchange/fairness, trust, nutrition, and sleep. Fairness and gift-exchange stories are inconsistent with: (i) no detectable announcement-period effect; (ii) no decline in control-worker effort when treatment workers are paid before them; (iii) the pattern of effects being concentrated among poorer workers; and (iv) attentiveness being affected when it is not a sanctioned quality dimension for payment. Nutritional channels are inconsistent with overnight effect onset (nutritional stock changes are too slow biologically), no treatment effect on breakfast consumption patterns, and productivity effects persisting through the end of each workday. Sleep channels are inconsistent with no treatment effect on hours or quality of sleep.&lt;/p&gt;
&lt;h3 id="scope-conditions-and-implications"&gt;Scope Conditions and Implications&lt;/h3&gt;
&lt;p&gt;The effect operates through the actual arrival of cash, not its anticipation, consistent with a model in which automatic cognitive inputs — unlike consciously chosen effort — respond to current financial strain rather than expected future income. Effects are concentrated among more financially constrained workers within an already-poor sample. The authors do not identify the specific psychological mechanism (worry, anxiety, affect, or rumination) but interpret results as evidence that financial strain, at least partly through psychological channels, reduces earnings exactly when money is most needed.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why does the experiment focus on payment timing rather than an outright transfer of additional money?&lt;/strong&gt;
Varying only payment timing — not total pay — holds constant both the piece-rate incentive and total wealth across treatment and control. An outright cash transfer would raise total lifetime income, potentially reducing effort through a neoclassical income effect (more lifetime wealth lowers the marginal utility of current consumption). By holding total compensation fixed and only shifting when it arrives, the design isolates the effect of financial strain per se, separable from any wealth or incentive effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why is there no treatment effect during the announcement period, and why does this matter?&lt;/strong&gt;
Between day 5 (when workers learn their payment schedule) and the interim payment date, treated workers know cash is coming but have not yet received it. Output in this window shows no treatment effect (0.014 SD, p = 0.685), and the announcement effect is significantly smaller than the post-pay effect (p = 0.008). This matters because it rules out mechanisms that should operate on information alone — including gift exchange, trust updating, or effort responses to higher discounted expected income — and is consistent with a model in which financial strain falls only when cash is physically received (e.g., moneylenders do not relent until the loan is actually repaid).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the attentiveness index and how was it constructed?&lt;/strong&gt;
The attentiveness index averages three plate-level markers: (i) number of &amp;ldquo;double holes&amp;rdquo; — pairs of stitching holes indicating a mistaken stitch was removed; (ii) number of leaves used; and (iii) number of stitches used. Each component was normalized using the control group&amp;rsquo;s post-pay mean and standard deviation, then averaged and reverse-coded so that higher values denote better attentiveness (fewer mistakes, fewer leaves, fewer stitches). Workers were unaware these dimensions were being measured. The index thus captures the number of unforced steps a worker took to complete a plate — a behavioral trace of cognitive lapses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do the piece-rate rounds demonstrate that effort and attentiveness are separable?&lt;/strong&gt;
In supplementary rounds (150 workers, 2019), piece rates were experimentally varied among Rs. 2, 3, and 4 per plate with the base wage adjusted to hold total earnings constant, so financial strain was unchanged. A one-rupee increase in the piece rate raises output by 0.020 SD (p = 0.042), consistent with a standard effort response. The same increase produces no discernible change in the attentiveness index (point estimate: negative but not significant), and the output and attentiveness effects are significantly different from each other (p = 0.001). This shows that workers can speed up via conscious effort without reducing attentional lapses, whereas the cash infusion raises both pace and attentiveness simultaneously — a pattern inconsistent with pure motivation as the mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the staggered timing within the treatment group (Wave A vs. Wave B) contribute to identification?&lt;/strong&gt;
Treatment workers were randomized to receive their interim payment on day 8 (Wave A) or day 9 (Wave B). On day 9, Wave B workers have not yet been paid while Wave A workers have. If fairness concerns drove control workers to reduce effort upon seeing colleagues paid first, control workers on day 9 — having observed Wave A payments the evening before — should work less hard relative to Wave B treatment workers (who have also not yet been paid). The authors find no such pattern: the triple interaction (Cash × Payment Day × Wave B) is close to zero and insignificant, ruling out effort reductions from seeing peers paid earlier.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the magnitudes and timing of the spending response to the cash infusion?&lt;/strong&gt;
Within three days of the interim payment, treatment workers spent Rs. 900 in total — roughly two-thirds of the average interim payment of over Rs. 1,400. On the day of the payment itself, loan repayments rose by Rs. 169 (746% increase), and household expenditures rose by Rs. 70 (68% increase). Over three days, loan repayments increased by Rs. 271 (287%), the probability of repaying any loan rose by 40 percentage points (222%), and total household spending rose by 65% (Rs. 371). These patterns indicate that the two main sources of financial stress cited by workers — outstanding debt and inability to meet household essentials — were directly addressed, suggesting a meaningful reduction in financial strain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why are the productivity effects concentrated among poorer workers, and what are the two interpretations?&lt;/strong&gt;
Workers with below-median baseline wealth (fewer assets, lower liquidity) show a 0.204 SD (13.0%) productivity gain, while workers above the median wealth threshold show essentially no effect. The authors offer two interpretations. First, poorer workers may start from a higher level of financial strain, giving the intervention more scope to reduce it. Second, since all workers in the sample are objectively poor and report similar baseline financial worries and loan levels, the more likely explanation is that the interim payment is larger relative to the wealth and income buffer of poorer workers, making the same nominal cash infusion more meaningful for them. Both richer and poorer workers in the sample use the interim payment to repay loans and cover household needs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the authors rule out nutritional channels?&lt;/strong&gt;
Two tests address nutrition. First, workers were not at subsistence — 94% reported missing no meals the prior week — and increased food spending cannot change the nutritional stock overnight (the medical literature indicates nutritional-stock effects on cognition operate over longer time horizons). Second, and more precisely, all food consumed at the worksite during the workday was provided by the researchers, so differential pre-worksite breakfast consumption is the only plausible same-day biological channel. The authors find no treatment effect on breakfast consumption (whether workers had breakfast, how much, or what they ate). Further, if blood sugar or satiety drove effects, they should attenuate over the workday as all workers are given the same afternoon meal; instead, treatment effects persist and if anything increase through the final hours of the workday.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the self-report evidence on focus and worry show, and why is it treated as suggestive rather than primary?&lt;/strong&gt;
Two days after the interim payment, workers were asked an open-ended question about what they were thinking about while working. Treatment workers were 11.5 percentage points (15.5%) more likely to report feeling focused on the task (p = 0.032) and 13.7 percentage points (32.7%) less likely to report thinking about financial worries (p = 0.044). A supplementary test showed treated workers were 10 percentage points (31%) more likely to generate explanations for a low-income person&amp;rsquo;s negative affect that were unrelated to financial concerns (p &amp;lt; 0.05), suggesting a broadening of cognitive scope. These measures are treated as suggestive because they were collected only at a single point and are self-reported; the primary evidence rests on objective production data because it is more objective and collected at fine hourly resolution throughout the post-pay period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the paper say about optimal payment frequency as a policy implication?&lt;/strong&gt;
The authors are cautious in drawing a direct policy inference about paying workers more frequently. While the positive productivity effect of early payment points toward more frequent paydays reducing financial strain, this must be weighed against workers&amp;rsquo; self-control problems in consumption. In settings where workers face lumpy expenditure needs (e.g., monthly rent), more frequent payments could cause under-saving and worsen strain at the time of lumpy bills. The authors suggest payment frequency or size that matches expenditure needs, or more generally financial products that allow workers to time income receipts to coincide with expenses, as potentially more robust solutions — noting that such products appear largely absent in these markets.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Financial strain (as used in the paper):&lt;/strong&gt; A psychological burden arising from pressing present needs for resources — defined in the authors&amp;rsquo; model as increasing in both the current marginal utility of consumption (i.e., how valuable an additional rupee would be today) and the level of outstanding debt (including lender harassment pressure). Strain is present-oriented: it responds to current cash-on-hand and debt levels, not to expected future income, which is why anticipating a payment does not fully relieve it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Automatic input (a):&lt;/strong&gt; In the authors&amp;rsquo; behavioral model, one of two inputs into production. Unlike &amp;ldquo;effortful&amp;rdquo; input (e), which the worker consciously controls (speed of hands, consciously directed attention), the automatic input captures cognitive functions that are beyond the worker&amp;rsquo;s full control — background attentional processes that can be degraded by financial strain even when a worker is motivated and exerting high effort. The key behavioral assumption is that a falls when financial strain is high, independently of chosen effort.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attentiveness index:&lt;/strong&gt; A composite measure constructed from three unincentivized physical markers embedded in completed leaf plates: (i) number of double holes (pairs indicating a stitch was removed to correct a mistake); (ii) number of leaves used; (iii) number of stitches used. The index is normalized to the control group&amp;rsquo;s post-pay distribution and reverse-coded so higher values denote better attentiveness. Workers were unaware these dimensions were measured. The index captures attentional lapses — unforced errors that increase the number of steps and time needed to complete each plate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Announcement period:&lt;/strong&gt; The days between when workers are individually informed of their payment schedule (day 5) and when the interim payment is actually disbursed (day 8 or 9). This window serves as a within-experiment control: if effects arose from information about impending cash (e.g., through discounting, gift exchange, or trust), they should appear here. The consistent absence of treatment effects during this period is a key identification result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post-pay period:&lt;/strong&gt; The days from the interim payment until the contract end (day 12). The main productivity and attentiveness treatment effects are estimated in this window, comparing treatment workers (who have received cash) to control workers (who have not yet been paid).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lean season:&lt;/strong&gt; The months outside the peak agricultural planting and harvesting periods (roughly six to eight months per year in the study area) during which agricultural workers seek intermittent casual employment in manufacturing, construction, and other sectors. Employment rates are low (1.9 paid days per week on average), income is low and variable, and financial strain is correspondingly high. The experiment is intentionally conducted during this period to maximize baseline levels of financial concern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piece-rate elasticity of effort:&lt;/strong&gt; The responsiveness of output to changes in the marginal return per unit produced (the piece rate), holding financial strain constant. In the supplementary rounds, a one-rupee increase in the piece rate raises output by 0.020 SD. The authors interpret this as the upper bound on how much pure motivational effort can move output in this task, and use it to benchmark the cash infusion effects, which are roughly five times larger per unit of treatment variation and additionally move attentiveness (which piece-rate changes do not).&lt;/p&gt;</description></item><item><title>Do The Effects of Nudges Persist? Theory and Evidence from 38 Natural Field Experiments</title><link>https://macropaperwarehouse.com/papers/do-the-effects-of-nudges-persist-theory-and-evidence-from-38-natural-field-experiments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/do-the-effects-of-nudges-persist-theory-and-evidence-from-38-natural-field-experiments/</guid><description>&lt;p&gt;This paper asks why the Home Energy Report (HER) — a widely deployed social-comparison nudge that shows households how their electricity consumption compares to their neighbors — produces behavioral changes that persist long after the nudge is discontinued, while analogous nudges in other domains (charitable giving, financial savings, voter turnout, tax compliance) fade almost entirely within a year or two. The authors formalize a research design to decompose the HER&amp;rsquo;s long-run effectiveness into two channels: technology adoption (a change in the stock of energy-efficient capital in the home) and habit formation (a change in the stock of habits or skills in the resident).&lt;/p&gt;
&lt;p&gt;The identifying strategy exploits the administrative rule that when the initial resident in an HER experiment moves out, HER mailings stop immediately — but electricity consumption in the home continues to be observed as new residents occupy it. Under three assumptions — (1) treatment assignment did not influence the initial resident&amp;rsquo;s decision to move; (2) treatment assignment did not influence the type of resident who moved in; and (3) energy-efficient technology adopted in response to the HER remained in the home after the move — the post-move HER effect identifies the fraction of the long-run treatment effect attributable to technology adoption (ATK), and the remainder identifies the fraction attributable to habit formation (ATH).&lt;/p&gt;
&lt;p&gt;Data come from 38 natural field experiments administered by Opower between 2008 and 2013 across 21 U.S. residential energy providers, comprising 61,310,166 electricity bills for 1,810,096 homes. The mover sample, restricted to homes where the initial resident deactivated service at or after the receipt of their fourth HER, contains 5,890,855 bills for 139,908 homes. Treatment and control homes enter the mover sample at statistically indistinguishable rates and have similar baseline electricity consumption.&lt;/p&gt;
&lt;p&gt;The main findings: the HER reduced electricity consumption by 2.1 percent in the long run (the pre-move ATE). After the initial resident moved and the HER was discontinued, 1.1 percent of the reduction persisted in the home — attributable to technology. The habit channel accounts for the remaining 1.0 percent reduction. Normalizing by the ATE, 51.4 percent (s.e. = 13.1) of the long-run effectiveness is attributable to technology adoption and 48.6 percent to habit formation. The persistence of the post-move effect is robust across alternative specifications, different HER-receipt cutoffs, balanced panels, and exclusion of low-consumption move-period homes. A falsification test using rental homes — where tenants do not typically own appliances and the technology channel is therefore shut down — yields a null post-move effect, consistent with the balanced-habits assumption.&lt;/p&gt;
&lt;p&gt;The authors use these results to explain a broader empirical pattern: one year after discontinuation, social comparison nudges targeting compliance, charitable giving, savings, and voter turnout retain on average only 4 percent of their initial effect, while nudges targeting energy and water conservation retain 65 percent. The paper argues this divergence reflects the relative abundance of enabling technologies in conservation contexts versus their absence in compliance or voting contexts. The findings also have cost-benefit implications: ignoring HER-induced technology adoption overstates net benefits by as much as 65 percent, depending on assumed technology cost per kWh saved (ranging from $0.03 per kWh saved per Gillingham et al. 2018 to $0.12 per kWh saved per Billingsley et al. 2014).&lt;/p&gt;
&lt;p&gt;Scope conditions: results are specific to electricity-consumption nudges in the U.S. residential sector; the technology channel identification requires that adopted equipment stays in the home after a move; the decomposition rests on a linear production function for outcomes in habits and technology.&lt;/p&gt;
&lt;p&gt;Q: What is the Home Energy Report and how was it administered in these experiments?
A: The HER is a mailed social-comparison report that contrasts a household&amp;rsquo;s electricity consumption with that of similar neighbors. In each of the 38 waves, homes were observed for a 12-month baseline, then randomly assigned to treatment (receiving HERs) or control. HERs were mailed monthly, bimonthly, or quarterly; generation ceased when the initial resident deactivated electricity service.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central identification strategy?
A: The authors exploit a discontinuity created when the initial treated resident moves out: HER mailings stop, but the home&amp;rsquo;s electricity consumption continues to be measured as new residents move in. Under three assumptions about non-interference of treatment with moving decisions, balanced habits of subsequent residents, and stability of adopted technology, the post-move HER effect point-identifies the technology-adoption component (ATK) of the long-run average treatment effect (ATE). The habit-formation component (ATH) is then inferred as ATE minus ATK.&lt;/p&gt;
&lt;p&gt;Q: What are the three identifying assumptions and how are they tested?
A: Assumption 1 (no effect of treatment on moving rates) and Assumption 2 (balanced habits of subsequent residents) are tested with the data; treatment and control homes enter the mover sample at statistically indistinguishable rates and have similar baseline consumption, supporting Assumption 1. The rental-home falsification test supports Assumption 2: rental homes show a null post-move effect, consistent with renters having balanced habits because the technology channel is inactive in rentals. Assumption 3 (stable technology after a move) is untestable from the data; the authors note that violation of this assumption would imply the post-move effect is a lower bound on ATK, making the technology-adoption estimate conservative.&lt;/p&gt;
&lt;p&gt;Q: What are the main quantitative estimates of the decomposition?
A: The pre-move (long-run) ATE is -2.1 percent of baseline electricity consumption. The post-move effect (ATK) is -1.1 percent, and the habit-formation component (ATH) is -1.0 percent. Normalizing by the ATE, 51.4 percent (s.e. = 13.1) is attributed to technology adoption and 48.6 percent to habits.&lt;/p&gt;
&lt;p&gt;Q: How large is the HER effect in absolute terms during the comparison period?
A: During the comparison period, the HER reduced average daily electricity consumption by approximately -1.8 to -2.3 percent in the first year and -1.5 to -2.0 percent in the second year, with 95 percent confidence intervals excluding zero. In levels, these correspond to roughly -0.6 to -0.9 kWh per day — equivalent to using 2 to 4 sixty-watt incandescent bulbs for 5 fewer hours per day.&lt;/p&gt;
&lt;p&gt;Q: How persistent is the HER effect during the move period?
A: In the first year of the move period the HER continues to produce reductions of -1.7 and -1.4 percent; more than a year after the initial resident&amp;rsquo;s departure the estimated effect is -1.2 percent. All move-period estimates are statistically significant at conventional levels.&lt;/p&gt;
&lt;p&gt;Q: How does the paper explain variation in persistence across social-comparison nudge contexts?
A: One year after discontinuation, nudges targeting compliance, charitable giving, savings, and voter turnout retain on average only 4 percent of their initial effect, while nudges targeting energy or water conservation retain 65 percent on average. The paper argues the divergence reflects the relative availability of enabling technologies: households can adopt long-lived, input-efficient technologies (appliances, fixtures) to reduce energy and water use, but analogous technologies to facilitate compliance, donations, or voting are largely unavailable or absent.&lt;/p&gt;
&lt;p&gt;Q: How does this paper&amp;rsquo;s finding about technology adoption compare to Allcott and Rogers (2014)?
A: Allcott and Rogers (2014) used participation in utility-sponsored energy-efficiency programs as a proxy for technology adoption and found it explained no more than 2 percent of the HER&amp;rsquo;s long-run effectiveness. The authors reject this conclusion: their decomposition attributes 51.4 percent to technology, which is estimated precisely enough to statistically reject the 2 percent figure from Allcott and Rogers (2014). They attribute the discrepancy to the imperfect proxy used by Allcott and Rogers and low statistical power in analogous analyses.&lt;/p&gt;
&lt;p&gt;Q: What are the cost-benefit implications of accounting for HER-induced technology adoption?
A: Assuming monthly HERs for one year, a household electricity price of $0.10/kWh, and benefits accruing over two years, the baseline net benefit (ignoring technology costs) is $32.38 per household (electricity savings of $44.38 minus $12 administration cost). Using a technology cost of $0.03/kWh saved (Gillingham et al. 2018), net benefits fall to $27.14. Using $0.12/kWh saved (Billingsley et al. 2014), net benefits drop to $11.43 — a reduction of up to 65 percent from the baseline estimate. The HER still passes cost-benefit analysis but prior evaluations that ignore technology costs overstate net benefits substantially.&lt;/p&gt;
&lt;p&gt;Q: How robust are the decomposition results to alternative sample definitions and specifications?
A: The qualitative findings are stable across: alternative sets of control variables (Table A1); mover samples defined by receiving as few as 1 or as many as 5 HERs before moving (Table A2, with pre-move effects of -2.08 and post-move effects of -0.93 to -1.04 across cutoffs); balanced panels requiring fixed observation windows in each period (Table A3); and exclusion of homes showing unusually low consumption in the move period (Table A4, post-move effects of -1.19 to -1.48).&lt;/p&gt;
&lt;p&gt;Q: What policy implications does the paper draw for nudge design?
A: Policymakers seeking persistent nudge effects should target behaviors that can be augmented by readily available technologies, or pair social-comparison nudges with opportunities to adopt new technologies. In voting contexts, combining social-comparison nudges with opt-in mail-in or online ballot defaults could produce more persistent effects. In savings and charitable giving, pairing social comparisons with automatic contribution-rate defaults (as in Madrian and Shea 2001; Thaler and Benartzi 2004) is predicted to produce longer-lived effects than the nudge alone.&lt;/p&gt;
&lt;p&gt;Q: What methodological contribution does the paper offer beyond the HER application?
A: The mover-based decomposition is a generalizable research design for separating human capital (habits, skills) from physical capital (technology, infrastructure) as channels of policy effectiveness. The authors suggest it can be applied using other natural separation events — such as student graduation or employee departure — to assess the extent to which nudges build human capital in both recipients and the organizations in which they are embedded.&lt;/p&gt;
&lt;p&gt;Technology adoption channel (ATK): The component of the HER&amp;rsquo;s long-run average treatment effect attributable to increases in the stock of energy-efficient technologies in the home — identified empirically as the post-move HER effect that persists after the treated resident departs and the HER is discontinued.&lt;/p&gt;
&lt;p&gt;Habit formation channel (ATH): The component of the HER&amp;rsquo;s long-run treatment effect attributable to changes in the habits or skills of the resident — inferred as the residual after netting the technology component (ATK) from the total long-run effect (ATE).&lt;/p&gt;
&lt;p&gt;Post-move effect: The estimated difference in electricity consumption between treatment and control homes after the initial resident has moved out, the HER has been discontinued, and a new resident has taken occupancy; under the paper&amp;rsquo;s identifying assumptions this equals ATK.&lt;/p&gt;
&lt;p&gt;Balanced-habits assumption: The identifying assumption that treatment assignment did not influence the characteristics or habits of residents who subsequently moved into homes in the experimental sample, so that the habits of incoming residents are comparable across treated and control homes.&lt;/p&gt;
&lt;p&gt;Stable-technology assumption: The identifying assumption that energy-efficient technologies adopted in response to the HER remain in the home after the initial resident moves; relaxing this assumption implies the post-move effect is a lower bound on ATK.&lt;/p&gt;
&lt;p&gt;Home Energy Report (HER): A mailed social-comparison report that contrasts a recipient household&amp;rsquo;s electricity consumption with that of similar neighboring households; the treatment studied across all 38 experiments in this paper.&lt;/p&gt;
&lt;p&gt;Enabling technologies: Long-lived, input-efficient capital goods (appliances, lighting, insulation) that reduce the marginal cost of conservation and thereby lock in behavioral changes induced by a nudge; their relative abundance in energy and water conservation contexts — versus their absence in voting, giving, or compliance contexts — is the paper&amp;rsquo;s proposed explanation for cross-context variation in nudge persistence.&lt;/p&gt;</description></item><item><title>Does Deposit Insurance Promote Deposit Stability? Evidence from the Postal Savings System during the 1920s</title><link>https://macropaperwarehouse.com/papers/does-deposit-insurance-promote-deposit-stability-evidence-from-the-postal-savings-system-during-the-1920s/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/does-deposit-insurance-promote-deposit-stability-evidence-from-the-postal-savings-system-during-the-1920s/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Does deposit insurance promote financial depth by arresting the outflow of deposits from the banking system during periods of bank distress? The paper tests and quantifies the deposit-stabilizing effect of state-level deposit insurance schemes operating in the United States during the 1920s.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and identification.&lt;/strong&gt; Between 1908 and 1929, eight primarily Midwestern states adopted some form of deposit insurance. The paper exploits the discontinuity in deposit insurance coverage at state borders to identify the causal effect of insurance on depositor behavior. The identification strategy compares outcomes in contiguous city pairs straddling deposit-insurance (DI) and non-deposit-insurance (NDI) state borders — a quasi-experimental design that controls for observed and unobserved confounders by using narrow geographic areas where the only relevant policy difference is the presence or absence of deposit insurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Proxy for &amp;ldquo;mattress money.&amp;rdquo;&lt;/strong&gt; The paper uses postal savings deposits as a proxy for money withdrawn from the banking system. The U.S. Postal Savings System (established 1911) was backed by the full faith and credit of the federal government, with a maximum individual account limit of $2,500, and was widely viewed as a far safer alternative to commercial bank deposits. The authors validate this proxy by demonstrating, via Johansen cointegration tests, that the nationwide ratio of postal savings balances to total bank deposits is cointegrated (rank 1) with the currency-deposit ratio — a well-established indicator of banking distress.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis covers 1921–1929. The main postal savings dataset is drawn from Annual Reports of the Postmaster General. Bank suspension data are drawn from FDIC manuscript lists compiled in the 1930s by FDIC economist Clark Warburton, providing location, charter type, and suspension/reopening dates. The sample includes 74 city pairs across 14 states (7 DI: North Dakota, South Dakota, Nebraska, Kansas, Oklahoma, Texas, Mississippi; 7 NDI: Minnesota, Iowa, Missouri, Arkansas, Louisiana, Tennessee, Alabama), with an average distance between paired cities of approximately 18 miles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — postal savings regressions (Table 4).&lt;/strong&gt; Using OLS with city-pair and year fixed effects and standard errors clustered at the NDI city level, the paper finds that following a bank suspension within a 10-mile radius, postal savings deposits in NDI cities grew 16 percent more than deposits in the corresponding DI city. The effect is positive and statistically significant at the 20-mile radius but smaller — approximately 9 percent — and is statistically indistinguishable from zero at the 30-mile radius. The localized decay with distance is consistent with a geographically contained flight-to-safety response. Critically, when the same specification is estimated for periods after deposit insurance was discontinued, the effect at all radii is statistically nil, providing a falsification test ruling out omitted unobserved factors as the driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Persistence of effects (Table 5).&lt;/strong&gt; Arellano-Bond GMM dynamic panel regressions confirm that the disintermediation effects are persistent. The lagged dependent variable enters with a negative and statistically significant coefficient (approximately −0.20 for the 10-mile regression), indicating mean reversion, but the bank suspension coefficients remain robust. Implied long-run effects for the 10-mile and 20-mile equations are approximately 0.151 and 0.100, respectively, suggesting sustained rather than transitory deposit diversion away from the banking system in the absence of deposit insurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Banking capacity (Table 6).&lt;/strong&gt; Because the postal savings deposit limit constrained the intake of funds — particularly severely during distress episodes, as documented through narrative evidence from the 1915 Congressional Record — the postal savings regressions underestimate the true effect of deposit insurance. The paper therefore estimates an alternative specification at the county level, comparing deposits at state-chartered banks in paired DI and NDI border counties. The results indicate that deposit insurance is associated with approximately a 56 percent increase in county-level deposits at state-chartered banks (coefficient 0.574, significant at 5 percent, robust to inclusion or exclusion of year fixed effects). By contrast, the analogous coefficient for national banks — which were prohibited by the OCC from participating in state deposit insurance schemes — is positive but statistically insignificant, providing a placebo test consistent with the interpretation that deposit insurance, not unobserved county characteristics, drove the banking capacity difference.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; All effects are estimated for state-chartered bank deposits in predominantly agricultural, Midwestern border counties during 1921–1929, a period characterized by an average annual bank suspension rate of 2.22 percent (versus 0.3 percent during 1911–1920). The paper acknowledges that state deposit insurance schemes of this era generated moral hazard (as established by prior literature), and frames the contribution as quantifying the stability-enhancing component rather than the net welfare effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy implication.&lt;/strong&gt; The 56 percent banking capacity differential implies that deposit runoffs in the absence of insurance are substantially higher than the 3–10 percent runoff rates assumed in the Basel III Liquidity Coverage Ratio (LCR) framework, and more consistent with the 25–50 percent runoffs observed in non-systemic institutions in Denmark following an exogenous reduction in deposit insurance limits (Iyer et al., 2016).&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why is the Postal Savings System a valid proxy for &amp;ldquo;mattress money,&amp;rdquo; and what evidence supports this?&lt;/strong&gt;
The postal savings system was backed by the full faith and credit of the United States, making it categorically safer than commercial bank deposits, and was explicitly designed to attract savings hidden in mattresses. The authors validate the proxy empirically by showing that the nationwide ratio of postal savings balances to total bank deposits is cointegrated (Johansen test, rank 1) with the currency-deposit ratio — a series that rises during banking distress as depositors convert bank funds to currency. Contemporary narrative accounts from the 1915 Congressional Record further confirm that postal savings offices experienced sharp deposit inflows during local banking distress, with deposit intake frequently constrained by the $2,500 individual account cap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the identification strategy, and why does it address endogeneity concerns?&lt;/strong&gt;
The strategy exploits the discontinuity in deposit insurance at state borders by comparing relative postal savings deposit growth in contiguous city pairs — one city in a DI state, one in an adjacent NDI state — conditioning on bank suspensions within 10, 20, or 30 miles. The authors argue that deposit insurance legislation was a statewide political decision driven largely by partisan composition (Democrats favored it, Republicans opposed it), making it implausible that interests concentrated at border cities systematically determined which states adopted it. Six of the seven NDI control states introduced deposit insurance legislation but failed to pass it, underscoring that the policy variation was not determined by border-specific characteristics. A falsification test using the same city pairs after deposit insurance was discontinued shows zero effects, ruling out time-invariant unobserved heterogeneity as the driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the main quantitative results from the city-pair postal savings regressions?&lt;/strong&gt;
Following a bank suspension within 10 miles, postal savings deposits in NDI cities grew 16 percent more than in DI cities (coefficient 0.162, significant at 5 percent). At the 20-mile radius the differential is approximately 9 percent (coefficient 0.0933, significant at 5 percent). At the 30-mile radius the coefficient is 0.0997 and statistically indistinguishable from zero. These results are estimated with OLS using city-pair and year fixed effects and standard errors clustered at the NDI city level, based on 524 observations for the 10- and 20-mile specifications and 66 observations for the post-discontinuation falsification regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the paper establish that distance matters for the flight-to-safety effect?&lt;/strong&gt;
The monotonic decline in the estimated coefficient from 0.162 (10 miles) to 0.093 (20 miles) to a statistically insignificant 0.100 (30 miles) indicates that the diversion of deposits into postal savings was geographically localized. This pattern is consistent with depositors responding primarily to nearby bank failures rather than to distant ones, and it supports the interpretation that the effect is driven by local banking distress rather than by state-level or regional macroeconomic shocks that would affect all pairs symmetrically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Are the disintermediation effects of bank suspensions temporary or persistent?&lt;/strong&gt;
The Arellano-Bond GMM dynamic panel regressions (Table 5) show that the effects are persistent. The lagged dependent variable coefficient is approximately −0.205 (10-mile) and −0.188 to −0.201 (20-mile), indicating partial mean reversion but not full reversal. Year-1, Year-2, and implied long-run dynamic effects are all statistically significant and of similar magnitude (approximately 0.145–0.152 for the 10-mile equation and 0.096–0.100 for the 20-mile equation), indicating that once depositors shift funds to postal savings in response to bank suspensions, a substantial portion of the effect persists in subsequent years. This is consistent with prior literature showing that deposits leave the banking system quickly but return slowly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why are the postal savings coefficient estimates considered a lower bound on the true effect of deposit insurance?&lt;/strong&gt;
Two institutional features constrained the postal savings system from fully capturing flight-to-safety deposits. First, individual accounts were capped at $2,500, and narrative evidence shows that this limit was severely binding during distress — depositors attempted to place far more than the ceiling allowed. Second, the re-depositing rate of postal savings funds back into local banks was not 100 percent: during 1921–1923 only 32–47 percent of postal savings deposits were re-deposited in banks, compared to 72–82 percent in calmer years. Because the postal savings system could not absorb unlimited deposits and did not fully recycle absorbed funds into local banking, its level understates the true flight of deposits from the banking system in NDI states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the county-level banking capacity test address the censoring problem?&lt;/strong&gt;
The paper estimates log-ratio regressions comparing county-level deposits at state-chartered banks in DI versus NDI border counties, using a &amp;ldquo;DI Active&amp;rdquo; indicator that switches on when deposit insurance is in effect in a given state-year and switches off when schemes are discontinued. Because different states discontinued their insurance at different times, there is sufficient within-county variation to identify the DI coefficient even with year fixed effects. The estimated coefficient of 0.574 (without year FE) and 0.557 (with year FE) translates to approximately a 56 percent higher deposit level in state-chartered bank counties with deposit insurance, with virtually identical estimates across specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the placebo test for national banks, and what does it show?&lt;/strong&gt;
National banks were prohibited by the Office of the Comptroller of the Currency from participating in state deposit insurance schemes. If deposit insurance — rather than unobserved county characteristics — is responsible for the 56 percent banking capacity premium, then county deposits at national banks in DI states should show no corresponding premium. The Table 6 results confirm this: the DI Active coefficient for national bank deposits is positive (0.165 to 0.267) but statistically insignificant, providing a falsification result consistent with the causal interpretation for state-chartered banks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the paper situate deposit insurance&amp;rsquo;s stabilizing benefits relative to its moral hazard costs?&lt;/strong&gt;
The paper explicitly frames its contribution as quantifying the stability-enhancing component of deposit insurance separately from the moral hazard component. It cites extensive prior literature (Calomiris 1992, 1993; Wheelock 1992, 1993; Wheelock and Wilson 1994) establishing that the 1910s–1920s state schemes generated moral hazard: insured banks reduced capital-to-asset ratios, relaxed lending standards, and increased risk exposure. The paper does not contest those findings but argues that the two effects are analytically separable and that the stabilization benefit had significant quantitative magnitude — a benefit that should be accounted for when assessing the net welfare effects of deposit insurance design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the implications for the Basel III Liquidity Coverage Ratio framework?&lt;/strong&gt;
The Basel III LCR formula assumes that during distress 3 percent of &amp;ldquo;stable deposits&amp;rdquo; and 10 percent of &amp;ldquo;less stable deposits&amp;rdquo; run off. The paper&amp;rsquo;s finding that deposit insurance is associated with a 56 percent increase in banking capacity implies that in the absence of insurance, deposit runoffs are far higher than these Basel assumptions — substantially larger than 10 percent and more consistent with the 25–50 percent runoffs observed for non-systemic banks in Denmark following an insurance limit reduction (Iyer et al. 2016). The authors argue their results suggest that empirical grounding for the LCR runoff assumptions remains insufficient, consistent with critiques by Allen (2014) and Diamond and Kashyap (2016).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Postal Savings System (as &amp;ldquo;mattress money&amp;rdquo; proxy).&lt;/strong&gt; The U.S. Postal Savings System (1911–) accepted deposits up to $2,500 per individual, backed by the full faith and credit of the United States. In this paper, postal savings deposits are used as a quantitative proxy for money withdrawn from the banking system during distress — &amp;ldquo;money under the mattress&amp;rdquo; — validated by cointegration with the currency-deposit ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy discontinuity / border-pair design.&lt;/strong&gt; The identification strategy exploits the fact that deposit insurance was adopted at the state level, creating a sharp policy discontinuity at state borders. Contiguous city pairs straddling DI and NDI state borders are treated as quasi-experimental units, with the within-pair difference in postal savings deposit growth serving as the outcome, controlling for time-invariant city-level heterogeneity and common time effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relative Postal Savings Deposit Growth (RPS).&lt;/strong&gt; The dependent variable defined as the log-ratio of postal savings deposits in the NDI city to postal savings deposits in the DI city within a pair, and then first-differenced over time. This construction controls for city-pair-level time-invariant characteristics and isolates the differential response to bank suspensions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bank suspension.&lt;/strong&gt; In this paper&amp;rsquo;s context, a bank suspension is any closure of a bank (state-chartered or national) at a specific geographic location, as recorded in FDIC manuscript lists compiled by Clark Warburton during the 1930s. The variable used in regressions is the change in the number of suspensions within R miles (R = 10, 20, 30) of the paired postal savings offices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial depth / local banking capacity.&lt;/strong&gt; The paper uses county-level deposits at state-chartered banks as a measure of local banking market size. Deposit insurance is hypothesized to increase financial depth by preventing the diversion of funds out of the banking system during distress, and the 56 percent estimated premium is the paper&amp;rsquo;s primary measure of the insurance&amp;rsquo;s capacity-enhancing effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DI Active indicator.&lt;/strong&gt; A time-varying binary variable equal to 1 when deposit insurance was legally in effect in a given state at a given time, and 0 otherwise (including after repeal). Because different states repealed their schemes at different times (Oklahoma 1923, Texas 1927, South Dakota 1927, North Dakota 1929, Kansas 1929, Nebraska 1930, Mississippi 1930), this variable provides within-county variation that identifies the banking capacity coefficient after controlling for county and year fixed effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Moral hazard vs. stability-enhancing components.&lt;/strong&gt; The paper distinguishes analytically between the moral hazard effect of deposit insurance (insured banks undertake riskier projects, reduce capital buffers, relax lending standards) and the stability-enhancing effect (depositors retain funds in the banking system, preventing runs). The paper&amp;rsquo;s contribution is to quantify the latter component in isolation, using a setting where the two effects can be separated by focusing on depositor — rather than banker — behavior.&lt;/p&gt;</description></item><item><title>Dollar Dominance and the Transmission of Monetary Policy</title><link>https://macropaperwarehouse.com/papers/dollar-dominance-and-the-transmission-of-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dollar-dominance-and-the-transmission-of-monetary-policy/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;An emerging view in international macroeconomics contends that dollar invoicing of exports renders monetary policy ineffective for non-U.S. countries: because export prices are allegedly sticky in dollars, exchange rate depreciations cannot shift expenditure toward domestic goods, muting the classical Mundell-Fleming channel. McLeay and Tenreyro argue that this view rests on empirical assumptions that are not borne out by the data: goods priced in dollars tend to have more flexible prices and higher elasticities of substitution, not the monopoly power and sticky dollar prices assumed in dominant currency pricing (DCP) models. They propose a mixed currency pricing (MCP) framework that incorporates heterogeneous price flexibility and intra-sector international competition, and show that even with dollar pricing, depreciating the currency by loosening monetary policy can still boost exports and activity materially. The limit to any expansion is not demand, but supply capacity: after a depreciation, domestic dollar costs fall, flexible-price exporters lower prices slightly and gain large market share due to high demand elasticities, and the expansion runs until rising marginal costs offset the initial depreciation — producing limited reduced-form dollar pass-through as an equilibrium result rather than evidence of nominal stickiness. Empirical tests using monetary policy shocks in a sample of emerging and developing economies, case studies of Canada and Chile as commodity exporters, and three large devaluation episodes all find significant, material increases in exports and aggregate activity following exchange-rate depreciations, consistent with the MCP model&amp;rsquo;s predictions.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the specific empirical claim that DCP models rest on, and how do McLeay and Tenreyro challenge it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;DCP models (e.g., Gopinath et al. 2020) posit that exporters invoicing in dollars have monopoly power and face nominal rigidities that keep their dollar export prices sticky. The observable implication used to motivate this assumption was limited exchange rate pass-through to dollar export prices. McLeay and Tenreyro show that low pass-through is equally consistent with a flexible-price, high-elasticity equilibrium. When demand elasticities are high, firms optimally absorb exchange rate changes through quantities rather than prices; the reduced-form pass-through coefficient is small even without any nominal friction. Low pass-through is therefore not informative about the degree of nominal rigidities, and using it to calibrate sticky-price DCP models and draw normative conclusions about exchange rate policy is unwarranted.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. What are the three empirical facts that motivate the MCP framework&amp;rsquo;s assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Fact 1: Homogeneous products (commodities and commodity-like goods traded on organized exchanges or reference-priced, following Rauch 1999) represent a large share of goods exports, exceeding 70% for developing economies, around 60% for emerging economies, and around 35% for advanced economies; Sub-Saharan Africa, Latin America, and the Middle East all have shares above 50%. Fact 2: Homogeneous and more competitively produced goods have more flexible prices, documented across multiple countries — for instance, Nakamura and Steinsson (2008) find a median monthly price-change frequency of 10.8% for finished-good producer prices but 98.9% for crude materials. Fact 3: Dollar (vehicle currency) invoicing is most prevalent precisely in these homogeneous, competitive-good sectors; classical work by McKinnon (1979) and Magee and Rao (1980) emphasized that vehicle-currency invoicing facilitates continuous price comparability in competitive markets, and panel regressions corroborate a positive relationship between the share of exports invoiced in dollars and the homogeneous-goods share of exports.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What is the mechanism through which depreciation boosts exports in the MCP model, and why does this generate low observed pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With sticky wages (representing non-tradable input price stickiness more broadly), a monetary policy-induced depreciation lowers the domestic cost of production when expressed in dollars. For competitive exporters facing highly elastic demand, even a small reduction in the dollar price translates into a substantial gain in export quantities. Firms therefore lower their dollar prices slightly, trading some profit margin for a large increase in market share. As exports expand, domestic marginal costs rise (firms move up the upward-sloping marginal cost curve), partially offsetting the depreciation&amp;rsquo;s effect on dollar costs. In equilibrium, the net dollar price movement is small — producing the observed limited pass-through — but the quantity response is large. In the perfectly competitive limit (relevant for commodity exporters), the dollar price is unchanged by the world market, and the entire adjustment is through an expansion of export volumes until rising domestic marginal costs absorb the depreciation. The implied observation is identical to a sticky-price model for prices, but &amp;ldquo;the implications for export quantities are diametrically opposed.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. How does the MCP model nest existing frameworks, and what does it add relative to the DCP and PCP benchmarks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The MCP (mixed currency pricing) framework nests sticky-price DCP as a special case (by setting demand elasticities low and allowing full price stickiness) and produces behavior close to PCP (producer currency pricing) in the flexible-price, high-elasticity limit — restoring the allocative properties of the exchange rate from Obstfeld and Rogoff (1995). The distinctive addition is intra-sector international competition: domestic exporters face competition from international competitors producing highly substitutable varieties of the same good, so substitution elasticities can be high at the variety level even when macro-level elasticities between goods remain low. This follows a bottom-up approach to elasticities as in Feenstra et al. (2018). The model also allows heterogeneous nominal rigidities across producers, with exporters of dollar-invoiced homogeneous goods having flexible prices while non-tradable input prices (wages) remain sticky — the source of monetary non-neutrality and the mechanism for real exchange rate effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What is the role of supply capacity, and why is it &amp;ldquo;the limit&amp;rdquo; rather than demand?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the sticky-price DCP model, the constraint on the export response is on the demand side: dollar prices do not move, so demand is unchanged, and there is no export response at all. In the MCP model, demand responds immediately to the cost reduction — the constraint that eventually stops the expansion is supply capacity, captured by the slope of the marginal cost curve and macroeconomic constraints on non-tradable inputs. With a flat marginal cost curve (plentiful supply capacity), exports expand materially; with a steep curve or hard capacity constraints, the increase in marginal cost fully offsets the depreciation before much quantity adjustment occurs. This supply-side framing reorients the policy question: the limiting factor for monetary policy&amp;rsquo;s external effectiveness is not whether dollar prices can move, but whether the domestic economy has the productive capacity to expand tradable output. This also connects the paper to the Salter-Swan two-good framework and to Schmitt-Grohé and Uribe (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What do the macroeconomic empirical tests find, and how do they distinguish the MCP from sticky-price DCP?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses three empirical exercises. First, using a sample of developing and emerging economies, monetary policy expansions that generate exchange rate depreciations cause significant increases in both exports and aggregate economic activity — consistent with the MCP model&amp;rsquo;s material export response and inconsistent with the DCP prediction of no export channel. Second, focusing on Canada and Chile as commodity exporters where the MCP assumptions (competitive markets, flexible export prices) are especially applicable, the aggregate results are corroborated and sectoral evidence provides additional support. Third, three case studies of large devaluations in the sample document that they are followed by material increases in exports relative to trend. In all exercises, the direction and magnitude of export and output responses are consistent with a functioning expenditure-switching channel, even where exports are priced in dollars.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How does the paper reinterpret the pass-through evidence that motivated sticky-price DCP models, and what does this imply for normative conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Standard reduced-form pass-through regressions relate the change in dollar export prices to changes in the exchange rate. These regressions typically omit or fail to fully capture movements in marginal cost. In the MCP model, flexible-price firms fully pass through changes in marginal cost; the observed limited pass-through to export prices is an equilibrium result of the offsetting rise in marginal costs as export volumes expand, not evidence of a nominal friction. Because the standard regressions omit marginal cost dynamics, they risk attributing the equilibrium quantity-driven equilibrium to a pricing friction. This has direct normative implications: the case made by the IMF (2019, 2020) that dollar invoicing worsens the cost-benefit calculation for flexible exchange rates — and may bolster the case for capital controls — rests on interpreting low pass-through as evidence of stickiness. If low pass-through instead reflects high demand elasticities and supply-side adjustment, the normative argument for constraining exchange rate flexibility is weakened.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. How does the paper relate to the purchasing power parity puzzle and the Mussa puzzle?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The MCP framework offers explanations for two classic international macro puzzles without assuming nominal rigidities in export prices. On the PPP puzzle (the volatility and persistence of the real exchange rate, Rogoff 1996): in the MCP model, exporters&amp;rsquo; optimal reset prices move very little after exchange rate changes — not because of stickiness, but because demand is elastic and marginal costs rise quickly. This predicts limited movement in relative export prices, consistent with empirical evidence in Blanco and Cravino (2020) and Itskhoki and Mukhin (2025). On the Mussa puzzle (the large jump in nominal and real exchange rate volatility after the Bretton Woods collapse): the model&amp;rsquo;s mechanism via sticky wages is consistent with evidence that depreciations produce slow adjustment of non-tradable prices (Burstein, Eichenbaum, and Rebelo 2005), generating real exchange rate movements despite limited response in traded-good dollar prices.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Dominant currency pricing (DCP):&lt;/strong&gt; A framework in which non-U.S. exporters set and maintain prices in U.S. dollars, with sticky dollar prices. As formulated by Gopinath et al. (2020), DCP predicts that exchange rate depreciations by non-U.S. countries do not reduce dollar export prices and therefore do not stimulate export demand — muting the expenditure-switching channel of monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mixed currency pricing (MCP):&lt;/strong&gt; The framework introduced in this paper. It allows heterogeneous price flexibility and market structure across export sectors, nesting both sticky-price DCP and flexible-price PCP as special cases. Dollar-priced exports face elastic demand from international competition, have flexible prices, and respond to depreciations through quantities rather than prices. Non-traded inputs (wages) remain sticky, providing the source of monetary non-neutrality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expenditure-switching channel:&lt;/strong&gt; The mechanism by which exchange rate depreciations redirect spending toward domestically produced goods, boosting exports and aggregate demand. In PCP models, this works through a fall in relative export prices. In the MCP model, it works through an expansion in export quantities even when dollar prices change little.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange rate pass-through (to export prices):&lt;/strong&gt; The elasticity of dollar export prices with respect to the nominal exchange rate. In sticky-price DCP models, low pass-through reflects a nominal friction (prices cannot adjust). In the MCP model, low pass-through reflects high demand elasticities and offsetting marginal cost increases: it is an equilibrium outcome, not a friction, and therefore does not imply that export volumes are unresponsive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intra-sector international competition:&lt;/strong&gt; The market structure feature central to the MCP framework. Domestic exporters of a given good compete with foreign suppliers of highly substitutable varieties, making their demand elastic at the variety level even if aggregate elasticities across different goods categories are low. This follows Armington (1969) as implemented by Feenstra et al. (2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supply capacity constraint:&lt;/strong&gt; In the MCP model, the binding constraint on how much a depreciation can boost exports. With high demand elasticities, demand for domestic exports expands freely; the limit is set by how quickly rising domestic marginal costs absorb the improvement in export profitability. The supply constraint replaces the demand constraint that operates (mechanically, via zero price response) in sticky-price DCP models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Homogeneous goods (Rauch 1999 classification):&lt;/strong&gt; Goods traded on organized commodity exchanges or reference-priced in trade publications, as opposed to differentiated goods. McLeay and Tenreyro use this classification to establish that dollar-invoiced exports are disproportionately homogeneous, competitive, and flexible-priced — contrary to the DCP assumption of monopoly power and price stickiness.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on published open-access version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Double Robustness of Local Projections and Some Unpleasant VARithmetic</title><link>https://macropaperwarehouse.com/papers/double-robustness-of-local-projections-and-some-unpleasant-varithmetic/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/double-robustness-of-local-projections-and-some-unpleasant-varithmetic/</guid><description>&lt;p&gt;This paper provides formal theoretical results on the relative robustness of local projection (LP) and vector autoregression (VAR) confidence intervals for impulse response inference when the data generating process (DGP) is locally misspecified. The research question is whether the widely held belief that LP estimators are more robust to misspecification than VARs is theoretically justified, and if so, precisely under what conditions and with what consequences for VAR inference.&lt;/p&gt;
&lt;p&gt;The analytical framework models the DGP as a stationary structural VARMA(1, ∞) that is local to an SVAR(1), of the form y_t = Ay_{t-1} + H[I + T^{-ζ}α(L)]ε_t, where the MA component T^{-ζ}α(L)ε_t represents misspecification that vanishes at rate T^{-ζ} as sample size T grows. The key rate parameter is ζ ∈ (1/4, 1/2), which corresponds to misspecification large enough to be detected with probability approaching 1 by conventional Hausman-type specification tests, yet small enough that the bias-variance trade-off between LP and VAR remains non-trivial asymptotically. The framework encompasses under-specification of lag length, omitted variables, temporal aggregation, measurement error, and failure of shock invertibility — essentially all sources of dynamic misspecification relevant to linearized DSGE models.&lt;/p&gt;
&lt;p&gt;The main finding on LP is a &amp;ldquo;double robustness&amp;rdquo; result: the conventional LP confidence interval achieves correct asymptotic coverage for all ζ &amp;gt; 1/4, even when misspecification is large enough to be detected with certainty. The mechanism is that the omitted-variable bias in the LP regression is of order T^{-2ζ} = o(T^{-1/2}) when ζ &amp;gt; 1/4, because both the direct effect of omitted lags on the outcome and the covariance of the residualized regressor with omitted lags are each of order T^{-ζ}, so their product is negligible relative to the T^{-1/2} standard deviation. This is formally analogous to double robustness in partially linear regression and debiased machine learning: LP is consistent if either the outcome-equation controls or the first-stage controls are correctly specified.&lt;/p&gt;
&lt;p&gt;In stark contrast, the VAR estimator carries asymptotic bias of order T^{-ζ}, which is non-negligible relative to its T^{-1/2} standard deviation for ζ ≤ 1/2. This causes the conventional VAR confidence interval to severely undercover: for ζ ∈ (1/4, 1/2) the coverage converges to zero, and for ζ = 1/2 it converges to a level strictly below the nominal level.&lt;/p&gt;
&lt;p&gt;The &amp;ldquo;no free lunch&amp;rdquo; result formalizes the trade-off. Setting ζ = 1/2 and bounding the noise-to-signal ratio at M²/T, the worst-case scaled VAR bias equals M√(aVar(β̂_h)/aVar(δ̂_h) − 1). This worst-case bias is small if and only if the VAR asymptotic variance is close to that of LP. When the VAR standard error is less than half that of LP — which is typical in applied practice — worst-case coverage falls below 48% even for M = 1. Moreover, the least favorable misspecification takes the form of exponentially decaying MA coefficients peaking at horizon h, a pattern consistent with standard economic theories of adjustment costs, learning, or overshooting, and is difficult to rule out on prior grounds. The Hausman test also provides weak protection: when M = 1, the odds of the test failing to reject are nearly 3-to-1 at the 10% significance level.&lt;/p&gt;
&lt;p&gt;Simulations using the Smets and Wouters (2007) model with T = 240 observations confirm these results. With lag length selected by AIC (median selected p = 2), VAR confidence intervals materially undercover at all but very short horizons while LP achieves close to nominal coverage throughout. Increasing lag length to p = 4 or p = 8 ameliorates VAR undercoverage at short horizons but at the cost of making VAR confidence intervals essentially as wide as LP intervals, with substantial undercoverage persisting at longer horizons. For p = 4 the total misspecification measure is M ≈ 3.23; for p = 8, M ≈ 1.89.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are pointwise asymptotic in fixed model parameters and horizon; they abstract from order-T^{-1} small-sample biases from persistence or the nonlinearity of the impulse response transformation. The LP robustness result requires controlling for lags that are strong predictors of the outcome or impulse variables; omitting lags with small-to-moderate predictive power does not threaten coverage.&lt;/p&gt;
&lt;p&gt;Q: What is the precise sense in which LP confidence intervals are &amp;ldquo;doubly robust&amp;rdquo;?&lt;/p&gt;
&lt;p&gt;A: LP is doubly robust in the sense of partially linear regression: its bias from misspecified MA dynamics is the product of two errors, the estimation error in the outcome-equation lag controls γ̂ − γ_0 and the estimation error in the first-stage lag controls ν̂ − ν_0. In the local-to-SVAR model each error is of order T^{-ζ}, so their product is of order T^{-2ζ} = o(T^{-1/2}) whenever ζ &amp;gt; 1/4, making the omitted-variable bias negligible relative to the T^{-1/2} standard deviation. This means the asymptotic distribution of the LP estimator is completely invariant to the misspecification parameters α(L) and ζ.&lt;/p&gt;
&lt;p&gt;Q: How large does misspecification need to be before LP coverage is threatened?&lt;/p&gt;
&lt;p&gt;A: The LP double robustness result holds for all ζ &amp;gt; 1/4 regardless of the magnitude parameter M of the MA misspecification. Misspecification with ζ ∈ (1/4, 1/2) can be detected with probability approaching 1 asymptotically by standard specification tests — in particular, the Hausman test is consistent for this range — yet LP coverage remains exactly correct. There is no threshold M below which LP fails; robustness is structural, not contingent on misspecification being small.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does the VAR estimator have zero asymptotic bias?&lt;/p&gt;
&lt;p&gt;A: The VAR asymptotic bias is zero if and only if the lagged shocks ε_{j*,t-ℓ} for ℓ = 1, …, h lie in the span of the lagged data used for estimation. Two sufficient conditions from Corollary 3.2 are: (i) the true model is SVAR(p_0) and the estimation lag length p satisfies h ≤ p − p_0, so the extra lags absorb the residual MA structure; or (ii) the shock of interest is directly observed and ordered first, and h ≤ p. In these cases the VAR estimator is asymptotically equivalent to LP, with equal variance.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;no free lunch&amp;rdquo; result for VARs?&lt;/p&gt;
&lt;p&gt;A: For ζ = 1/2 and noise-to-signal ratio bounded by M²/T, the worst-case scaled VAR bias equals M√(aVar(β̂_h)/aVar(δ̂_h) − 1) (Proposition 4.1). This quantity is small if and only if aVar(δ̂_h) ≈ aVar(β̂_h), meaning the VAR has little efficiency advantage over LP. Put differently, the only way to guarantee robust VAR coverage is to include enough lags that the VAR confidence interval becomes as wide as the LP interval. There is no procedure that simultaneously offers narrower intervals than LP and reliable coverage.&lt;/p&gt;
&lt;p&gt;Q: How severe is the worst-case undercoverage of conventional VAR confidence intervals?&lt;/p&gt;
&lt;p&gt;A: From Corollary 4.3, even for M = 1 (a noise-to-signal ratio of just 1/T), worst-case VAR coverage falls below 48% whenever the VAR asymptotic standard deviation is less than half that of LP — a configuration typical in applied practice. For larger M the undercoverage is worse: the formula 1 − r(M√(aVar(β̂_h)/aVar(δ̂_h) − 1); z_{1-α/2}) can approach zero. Furthermore, the worst-case probability that VAR fails to cover AND the Hausman test fails to reject misspecification simultaneously exceeds 46% when the VAR standard deviation is less than half that of LP (Corollary 4.4).&lt;/p&gt;
&lt;p&gt;Q: Can the researcher detect the problematic misspecification using a Hausman test before it causes undercoverage?&lt;/p&gt;
&lt;p&gt;A: Only weakly. When M = 1, the Hausman test fails to reject misspecification with probability approximately 74% (odds of nearly 3-to-1) at the 10% significance level, since r(1; z_{0.95}) = 26%. At the 5% level the odds of non-rejection are nearly 5-to-1, since r(1; z_{0.975}) = 17%. The least favorable misspecification also cannot be ruled out on economic-theory grounds: the least favorable MA polynomial has exponentially decaying coefficients peaking at horizon h, consistent with adjustment costs, learning, or overshooting.&lt;/p&gt;
&lt;p&gt;Q: Does using a bias-aware critical value (Armstrong-Kolesár approach) resolve the VAR undercoverage problem?&lt;/p&gt;
&lt;p&gt;A: The bias-aware VAR confidence interval CI_B(δ̂_h; M) achieves correct asymptotic coverage by inflating the critical value based on the known bound M on misspecification. However, the bias-aware VAR interval tends to be wider than the LP interval. Specifically, M must be quite small — apparently below 1 — for the bias-aware VAR to dominate LP in width regardless of DGP and horizon. For M ≥ 2 (noise-to-signal ratio above 4/T), bias-aware VAR is dominated by LP in interval width. The practical conclusion is that the simpler LP interval is preferable in most empirically relevant settings.&lt;/p&gt;
&lt;p&gt;Q: What does the minimax model-averaging result say about optimal weighting of LP and VAR?&lt;/p&gt;
&lt;p&gt;A: From Corollary 4.2, the minimax optimal weight on LP when estimating a convex combination of LP and VAR estimators is M²/(1 + M²). For M = 1 (equal noise-to-signal threshold), the optimal weight is 50% on each. For M = 2, the LP estimator receives 80% weight. In the Smets and Wouters simulations, M ≈ 3.23 for p = 4 lags, corresponding to an optimal LP weight of approximately 91%, and M ≈ 1.89 for p = 8 lags, giving an optimal LP weight of approximately 78%.&lt;/p&gt;
&lt;p&gt;Q: What do the Smets and Wouters simulations show about AIC-selected VARs?&lt;/p&gt;
&lt;p&gt;A: In 5,000 simulated samples of T = 240 observations from the Smets and Wouters (2007) model, the AIC selects a median lag length of p = 2. At all but very short horizons, VAR confidence intervals materially undercover while LP confidence intervals throughout achieve close to nominal coverage. A bootstrap correction for VARs somewhat improves coverage but leaves large distortions. Increasing lag length to p = 4 or p = 8 moves coverage closer to nominal at short horizons (h ≤ p) but makes VAR confidence intervals essentially as wide as LP, and substantial VAR undercoverage persists at longer horizons.&lt;/p&gt;
&lt;p&gt;Q: Is the no-free-lunch result specific to univariate impulse responses?&lt;/p&gt;
&lt;p&gt;A: No. Proposition 4.2 extends the result to simultaneous inference on multiple impulse responses. For any k × 1 linear combination R of the impulse response vector, the worst-case squared bias is M² λ_max(R[aVar(β̂) − aVar(δ̂)]R&amp;rsquo;), where λ_max denotes the largest eigenvalue. Because VAR impulse response estimates are often highly correlated across horizons, undercoverage can be particularly severe in the multivariate (joint confidence ellipsoid) case. The no-free-lunch principle holds: the VAR ellipsoid offers non-negligible worst-case bias as long as it offers any efficiency gain relative to LP for any linear combination of horizon-specific impulse responses.&lt;/p&gt;
&lt;p&gt;Q: What is the practical recommendation for lag selection in LP and VAR?&lt;/p&gt;
&lt;p&gt;A: The paper offers three practical guidelines. First, LP researchers should control for those lags of the data that are strong predictors of the outcome or impulse variables, using conventional information criteria (such as AIC) applied to a VAR in all variables to select the number of lags for LP control — omitting lags with small-to-moderate predictive power does not threaten coverage. Second, VAR researchers should increase the lag length until the VAR confidence interval is no longer substantially narrower than the corresponding LP interval. Third, conventional specification tests do not suffice to guard against VAR coverage distortions.&lt;/p&gt;
&lt;p&gt;Local Projection (LP) Estimator: The LP estimator for the impulse response at horizon h is the OLS coefficient on the shock variable y_{j*,t} in a direct regression of y_{i*,t+h} on y_{j*,t}, the variables ordered before it, and lagged data. It is a &amp;ldquo;direct&amp;rdquo; estimator in that it does not iterate a one-step VAR forward.&lt;/p&gt;
&lt;p&gt;Double Robustness: A property of LP whereby its asymptotic bias from MA misspecification equals the product of two estimation errors — in the outcome-equation lag controls and in the first-stage residualization controls — each of order T^{-ζ}, making their product of order T^{-2ζ} = o(T^{-1/2}) for ζ &amp;gt; 1/4. This is the LP analogue of the double robustness of partially linear regression estimators in debiased machine learning.&lt;/p&gt;
&lt;p&gt;Local-to-SVAR Misspecification: A DGP of the form y_t = Ay_{t-1} + H[I + T^{-ζ}α(L)]ε_t in which the MA term T^{-ζ}α(L)ε_t represents misspecification that vanishes at rate T^{-ζ}. The rate parameter ζ governs the magnitude; ζ ∈ (1/4, 1/2) is the empirically relevant range where bias is detectable by specification tests yet the bias-variance trade-off between LP and VAR remains non-trivial.&lt;/p&gt;
&lt;p&gt;No Free Lunch (for VARs): The result that the worst-case scaled VAR bias equals M√(aVar(β̂_h)/aVar(δ̂_h) − 1), implying that the VAR confidence interval has reliable (robust) coverage if and only if the VAR asymptotic variance is close to that of LP — i.e., there is no way to simultaneously have shorter confidence intervals than LP and guaranteed coverage robustness.&lt;/p&gt;
&lt;p&gt;Noise-to-Signal Ratio: The quantity T^{-1}||α(L)||² = trace{Var(T^{-1/2}α(L)ε_t) Var(ε_t)^{-1}}, which measures the total magnitude of the MA misspecification relative to the variance of the shocks. The paper bounds this at M²/T and uses M as the sufficient statistic for worst-case bias and coverage.&lt;/p&gt;
&lt;p&gt;Bias-Aware Critical Value: An inflated critical value cv_{1-α}(b) solving r(b; cv_{1-α}(b)) = α, used to construct a VAR confidence interval CI_B(δ̂_h; M) that achieves correct asymptotic coverage by accounting for the worst-case bias M√(aVar(β̂_h)/aVar(δ̂_h) − 1). The paper shows this approach typically produces intervals at least as wide as LP for M ≥ 2.&lt;/p&gt;
&lt;p&gt;Asymptotic Bias of VAR (aBias): The scaled bias term T^{ζ}E[δ̂_h − θ_{h,T}] converging to aBias(δ̂_h) = trace{S^{-1}Ψ_h H Σ_{ℓ=1}^∞ α_ℓ D H&amp;rsquo;(A&amp;rsquo;)^{ℓ-1}} − e&amp;rsquo;&lt;em&gt;{i*,n} Σ&lt;/em&gt;{ℓ=1}^h A^{h-ℓ} H α_ℓ e_{j*,m}. This term is structurally absent from the LP asymptotics due to the double robustness mechanism.&lt;/p&gt;</description></item><item><title>Downward Rigidity in the Wage for New Hires</title><link>https://macropaperwarehouse.com/papers/downward-rigidity-in-the-wage-for-new-hires/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/downward-rigidity-in-the-wage-for-new-hires/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Hazell and Taska use wages posted on online job vacancies — matched to job titles and establishment identifiers from Burning Glass Technologies — to measure the wage for new hires at the job level (same job title and establishment) over 2010Q1–2020Q2. They find that this measure of the wage for new hires is rigid downward and flexible upward. At the job level, the nominal posted wage changes infrequently — on average once every 5–6 quarters — and conditional on changing, is four times more likely to rise than to fall. In the cyclical dimension, job-level posted wages rise strongly when state unemployment falls but do not fall when state unemployment rises; real wages exhibit the same asymmetric pattern. These results do not appear in the average wage for new hires (which aggregates across all job types), because time-varying job composition inflates the variance of average wages and raises standard errors roughly twentyfold relative to job-level regressions — explaining why prior work using worker-level survey data found no evidence of downward rigidity. A Heckman (1979) selection correction for firms&amp;rsquo; selection into vacancy posting suggests that selection bias in the job-level regression is moderate. The findings provide direct empirical support for models in which downward wage rigidity for new hires — specifically at the job level — amplifies unemployment fluctuations and generates asymmetric unemployment dynamics.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the central empirical claim of the paper?&lt;/strong&gt;
A: At the job level — defined as the same job title within the same establishment — the wage posted for new hires is rigid downward and flexible upward. It changes infrequently and, conditional on changing, rises far more often than it falls; and it responds to falls in unemployment but not to rises in unemployment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What data does the paper use, and what defines a &amp;ldquo;job&amp;rdquo;?&lt;/strong&gt;
A: The paper uses the Burning Glass Technologies dataset of wages posted on online vacancies, covering January 2010 to June 2020. A &amp;ldquo;job&amp;rdquo; is a job title within an establishment whose wages are paid at a given frequency (e.g., hourly or annual). The data come from the near-universe of online job postings — roughly 40,000 sources — and the main regression sample consists of jobs that post wages, have job title and establishment information, and post vacancies in multiple quarters, yielding approximately 3.05 million vacancies, representing about 0.8% of total US vacancies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do the authors validate that posted wages measure the wage for new hires?&lt;/strong&gt;
A: They construct a measure of the wage for new hires from the Current Population Survey (CPS) — workers switching jobs or entering from unemployment — at the state, industry, and occupation level. Regressing log CPS wages on log Burning Glass wages (using an IV split-sample procedure to correct for attenuation bias) yields a coefficient close to 1 across specifications and levels of aggregation, indicating that average posted wages move roughly one-for-one with average wages for new hires in representative survey data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How is the frequency of wage change estimated?&lt;/strong&gt;
A: Because wages are not observed in quarters without a vacancy posting, the authors adapt a constant-hazard model from the price-setting literature (following Nakamura–Steinsson and Klenow–Kryvtsov). The latent wage evolves stochastically between postings; the observed wage is treated as a draw from this process. The quarterly probability of wage change is estimated at 0.17–0.19 across specifications, implying implied durations of unchanged wages of 4–5 quarters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the asymmetry in the direction of wage changes?&lt;/strong&gt;
A: In the unweighted baseline, the quarterly probability of a wage decrease is 0.04, whereas the probability of a wage increase is 0.12 — roughly a three-to-one ratio in probabilities, summarized in the paper&amp;rsquo;s abstract as wages being &amp;ldquo;four times more likely to rise than to fall.&amp;rdquo; The distribution of non-zero wage changes also shows a pronounced pile-up of small positive changes relative to small negative changes, consistent with a downward constraint on wage setting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the first piece of cyclical evidence for downward rigidity?&lt;/strong&gt;
A: A binned scatterplot (Figure 1) of job-level wage growth against state-level quarterly changes in unemployment shows a strong, roughly linear relationship when unemployment is falling — wages rise with falls in unemployment, both for small and large declines. When unemployment rises, however, wages do not fall — neither for small nor for large increases in unemployment. This asymmetry is robust to regression-based analysis and to identified labor demand shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Are real wages also rigid downward?&lt;/strong&gt;
A: Yes. The paper reports that real wages (nominal posted wages deflated) are also rigid downward and flexible upward, mirroring the pattern for nominal wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the job-composition problem, and why does it matter?&lt;/strong&gt;
A: The average wage for new hires — the object measured in most prior work — aggregates across all job types that are actively hiring. If the composition of jobs hiring shifts over the business cycle (e.g., the share of lower-wage jobs rises in recessions), then average wages can fall even if no individual job cuts its wage, and can stay flat or rise even if every job cuts its wage. Job composition therefore confounds cyclicality estimates based on average wages. By tracking the same job title at the same establishment across successive vacancies, the authors purge wage changes driven by shifting composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why did prior work find no evidence of downward rigidity for new hires?&lt;/strong&gt;
A: Prior work used worker-level survey data (e.g., Bils 1985; Pissarides 2009 survey) that controls for worker characteristics but averages across jobs — the average wage for new hires. The volatility of job composition inflates the variance of this average measure. In the Burning Glass data, standard errors from regressions using average wages are roughly twenty times larger than those from job-level regressions, making it impossible to detect downward rigidity even if it exists. Point estimates in prior work suggested procyclicality but were too imprecise to exclude downward rigidity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does this paper relate to Gertler, Huckfeldt, and Trigari (2020) and Grigsby, Hurst, and Yildirmaz (2021)?&lt;/strong&gt;
A: Both papers attempt to control for job composition at the worker level. Gertler et al. focus on wages of workers hired from unemployment (less affected by composition than all new hires) and find weakly procyclical wages. Grigsby et al. use rich payroll data and worker-level matching to control for composition and also find weakly procyclical wages. The present paper complements these by using job-level data that directly purges composition without relying on worker characteristics, and adds evidence on the asymmetry of rigidity (not just average procyclicality).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the role of the Heckman selection correction?&lt;/strong&gt;
A: If firms select into vacancy posting depending on business-cycle conditions, the sample of observed posted wages may be non-random, biasing job-level wage-cyclicality estimates. The authors implement a standard Heckman (1979) two-step selection correction. The correction suggests that selection bias in the job-level regression is moderate — it does not overturn the finding of downward rigidity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the four main caveats the authors acknowledge?&lt;/strong&gt;
A: (1) The main sample is small — 0.8% of US vacancies — though the authors show it is broadly representative on observables and that wages track representative survey data. (2) The paper measures rigidity only for jobs that post wages; jobs that do not post wages might be more flexible, though the share of vacancies posting wages does not decline during contractions. (3) Posted wages may differ from realized (bargained) wages; however, wages are rigid even in occupations where bargaining is uncommon. (4) The Pandemic Recession is the main contractionary episode in the sample, and it involved labor supply shocks as well as demand shocks; the authors address this through identified labor demand shock regressions and by ending the sample in June 2020.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the implications for models of unemployment fluctuations?&lt;/strong&gt;
A: In the Diamond–Mortensen–Pissarides search model, Pissarides (2009) emphasizes that the wage for newly hired workers — not continuing workers — is the relevant margin for unemployment fluctuations. Shimer (2005) showed the standard calibration produces too-small unemployment fluctuations; wage rigidity for new hires can resolve this. The paper&amp;rsquo;s finding of downward-but-not-upward rigidity additionally supports models (e.g., Dupraz, Nakamura, and Steinsson, 2020) in which this asymmetry generates asymmetric unemployment dynamics — unemployment rises sharply in contractions but falls more slowly in expansions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do wages for new hires compare with wages for continuing workers in terms of rigidity?&lt;/strong&gt;
A: The paper finds approximate parity. The implied duration of unchanged wages from the job-level posted wage data (4–5 quarters) is similar to estimates for continuing workers in the prior literature. This is perhaps surprising because wages could in principle be more flexible for new hires than continuing workers — firms might cut wages for new hires even while insuring continuing workers (Beaudry and DiNardo, 1991). The results instead suggest that internal equity concerns (Bewley, 2002) or other forces produce similar rigidity for both groups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Job level wage&lt;/strong&gt;: The wage across successive vacancies posted by the same job title at the same establishment. This is the unit of observation in the paper&amp;rsquo;s main analysis and the object for which downward rigidity is documented. Distinct from the average wage for new hires (which aggregates across all job types).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Downward rigidity (as used in this paper)&lt;/strong&gt;: An empirical pattern in which wages at the job level do not fall during contractions — they do not respond to rising unemployment — while rising during expansions in response to falling unemployment. The claim is descriptive: the data show wages do not fall; the paper does not structurally identify the mechanism enforcing this floor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job composition problem&lt;/strong&gt;: The bias introduced when measuring cyclicality of the average wage for new hires using data that aggregates across different types of jobs. If the mix of job types hiring shifts with the business cycle, average wages can change even when no individual job changes its wage, and can mask individual-job wage changes. Job-level data resolve this by holding the job fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Burning Glass Technologies dataset&lt;/strong&gt;: A database of wages posted on online job vacancies, drawn from approximately 40,000 online sources (job boards and company websites), covering the near-universe of US online vacancies. The paper&amp;rsquo;s main regression sample uses the subset with posted wages, job title, establishment identifiers, and multiple quarters of postings, spanning January 2010 to June 2020.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Constant hazard model (wage change frequency)&lt;/strong&gt;: An estimation procedure adapted from the price-setting literature to recover the quarterly probability of wage change from a dataset in which wages are only observed when a vacancy is posted. The latent wage evolves with a constant hazard of change between observations; observed wage changes identify the hazard rates for increases and decreases separately.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average wage for new hires&lt;/strong&gt;: The mean wage across all workers newly entering employment (or across all new-hire jobs), used in prior work (Bils 1985 and related). Does not control for job composition. Shown in this paper to exhibit no detectable downward rigidity, with standard errors roughly twenty times larger than in job-level specifications — because job composition variance inflates the residual variance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heckman selection correction&lt;/strong&gt;: A two-step procedure (Heckman 1979) to correct for the possibility that firms that post vacancies — and post wages — are a selected sample that differs systematically across the business cycle. The paper applies this to assess whether selection into vacancy posting biases the job-level wage-cyclicality estimates; the correction suggests bias is moderate.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online accepted version (accepted manuscript, covers full paper including introduction, data, and Section 3; extraction terminated at line 595 before Sections 4–5). AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Dynamic Concern for Misspecification</title><link>https://macropaperwarehouse.com/papers/dynamic-concern-for-misspecification/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dynamic-concern-for-misspecification/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how an agent who fears that none of their probabilistic models is the correct description of the data-generating process (DGP) should update that fear as evidence accumulates, and what long-run behavior such an agent exhibits. The central contribution is making the concern for misspecification &lt;em&gt;endogenous&lt;/em&gt;: the better the agent&amp;rsquo;s structured models explain past observations, the less concerned the agent becomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Decision Criterion&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The agent posits a finite-dimensional parametric set of structured models Θ, holds a prior µ over Θ, and evaluates each action according to an &lt;em&gt;average robust control criterion&lt;/em&gt;. This criterion takes a weighted average (over models) of robust control assessments, where each assessment penalizes expected utility for probability distributions that deviate from the structured model in terms of relative entropy, scaled by a misspecification concern parameter λ &amp;gt; 0. A standard subjective expected utility maximizer is the limiting case as λ → 0 (no concern), and a maxmin agent is approached as λ → ∞.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Misspecification Concern&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The concern parameter λ is updated each period as a function of the likelihood ratio test (LRT) statistic of the structured models against unstructured alternatives, scaled by a time-normalizing sequence βₜ: λ(hₜ) = LRT(hₜ, Θ) / (2βₜ). The sequence βₜ determines how demanding the agent is in evaluating model fit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taxonomy of Agent Types&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three types emerge based on the speed of βₜ:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Statistician type&lt;/strong&gt; (βₜ = ct, linear): applies a time scaling that keeps the LRT asymptotically informative about the degree of misspecification. This is the unique type satisfying both &lt;em&gt;safety&lt;/em&gt; (long-run average payoff at least ε-close to the maxmin guarantee, almost surely) and &lt;em&gt;consistency under almost correct specification&lt;/em&gt; (no ε-regret when misspecification is small).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Lenient type&lt;/strong&gt; (t = o(βₜ)): attributes unexplained evidence to sampling variability; corresponds to the Law of Large Numbers intuition.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Demanding type&lt;/strong&gt; (βₜ = o(t)): overly penalizes small discrepancies, analogous to the Law of Small Numbers fallacy (Tversky and Kahneman, 1971).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Standard SEU maximization fails safety; robust control with an invariant λ (Hansen and Sargent, 2001; 2022) fails consistency under almost correct specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-Run Convergence Results (Theorem 1)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For a misspecified agent (no θ ∈ Θ with qθ_{a*} = p*_{a*}), the nature of the limit action a* depends on the agent type:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;em&gt;Lenient type&lt;/em&gt;: a* is a &lt;strong&gt;Berk-Nash equilibrium&lt;/strong&gt; — an SEU best reply to beliefs supported on the models with minimum relative entropy from the true DGP.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Demanding type&lt;/em&gt;: a* is a &lt;strong&gt;maxmin equilibrium&lt;/strong&gt; — a worst-case best reply to all models absolutely continuous with respect to the true DGP.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Statistician type&lt;/em&gt;: if behavior converges, a* is a &lt;strong&gt;c-robust equilibrium&lt;/strong&gt; — a robust control best reply to beliefs on the relative entropy minimizers, with the concern for misspecification endogenously set at minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;For a correctly specified agent (Proposition 2), every limit action is a &lt;strong&gt;self-confirming equilibrium&lt;/strong&gt;, regardless of the agent type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cycles and Limit Frequency (Section 4, Theorem 2)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The statistician type&amp;rsquo;s behavior need not converge. In natural settings, the agent cycles between actions: playing a &amp;ldquo;safe&amp;rdquo; action whose consequences are well-explained by Θ reduces concern for misspecification, eventually leading to a riskier action whose poorly-explained consequences raise concern again, inducing a return to the safe action. The paper proves that every limit &lt;em&gt;frequency&lt;/em&gt; (empirical distribution over actions) is a &lt;strong&gt;mixed c-robust equilibrium&lt;/strong&gt; — a generalization that allows mixing while tying the concern for misspecification to the frequency-weighted average relative entropy of each action.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Applications&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Monetary policy cycles&lt;/em&gt; (Sargent 1999, 2008): In a central bank model where the true DGP includes increased inflation variability under aggressive policy (a feature absent from the bank&amp;rsquo;s structured models), no pure c-robust equilibrium exists for small c. The model predicts persistent cycles between conservative and aggressive policy. The frequency of the conservative policy is increasing in the strength of the exploitable inflation-unemployment trade-off (θ&lt;em&gt;₁π + θ&lt;/em&gt;₁a).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Labor supply under complex tax schedules&lt;/em&gt; (Rees-Jones and Taubinsky, 2020): Agents with a &amp;ldquo;schmeduling&amp;rdquo; heuristic (linearizing the tax schedule) are misspecified. Berk-Nash equilibrium predicts these agents exert excess effort, with the bias increasing in the complexity (convexity) of the tax code. The c-robust equilibrium attenuates this bias: conditional on the equilibrium, minθ R(p*_a || qθ_a) &amp;gt; 0, so agents maintain positive concern for misspecification and pull back from the biased recommendation. The paper rationalizes the empirical finding that approximately 40% of agents hold the schmeduling belief but only about 20% fewer agents act on it — consistent with endogenous concern reducing the behavioral impact of the biased model.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Axiomatization (Section 5)&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper axiomatizes the static average robust control criterion (Theorem 3) using: a Variational Axiom (from Maccheroni, Marinacci, and Rustichini, 2006a), a Structured Savage axiom (Sure-Thing Principle for bets on the model identity), an Intramodel Sure-Thing Principle (STP for bets conditional on the model), and Uniform Misspecification Concern (the agent is equally concerned about misspecification regardless of which model is identified as best-fitting). Three additional dynamic axioms characterize preference evolution: Constant Preference Invariance (utility index stable over time), Dynamic Consistency over Models (Bayesian updating over structured models), and Q-Likelihood (misspecification concern increases in the LRT). A novel Asymptotic Frequentism axiom characterizes the statistician type: preferences must become arbitrarily similar (in a precise quantitative sense) after sufficiently long histories with the same outcome frequency.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the average robust control criterion and how does it generalize prior decision criteria?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: An agent evaluates action a by averaging over structured models θ a robust control assessment: for each θ, minimize expected utility over probability distributions within relative entropy distance (penalized by 1/λ) of qθ_a, then integrate over θ with prior µ. This nests SEU (λ → 0, perfect trust in models), standard robust control of Hansen and Sargent (2001) (µ is Dirac, single benchmark model), and maxmin expected utility of Gilboa and Schmeidler (λ → ∞). The key extension is allowing µ to be nondegenerate, so the agent is simultaneously uncertain about the best-fitting model and about whether any model is exact.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the role of the likelihood ratio test statistic in driving misspecification concern?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The LRT statistic compares the maximum likelihood of the structured models against the best unstructured alternative. It diverges almost surely when the agent is misspecified, regardless of how close the structured models are to the true DGP. The concern parameter λ(hₜ) = LRT(hₜ, Θ) / (2βₜ) uses a time-scaling sequence βₜ to keep this statistic interpretable. Without scaling, a misspecified agent&amp;rsquo;s concern would always explode to infinity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why does linear time scaling (βₜ = ct) uniquely characterize the statistician type as rational?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 1 establishes two properties: (1) ε-safety — every βₜ = ct-optimal policy achieves average payoff at least ε below the maxmin guarantee, almost surely; (2) ε-consistency under almost correct specification — for DGPs sufficiently close to Θ, the agent avoids long-run regret. Part 2 of Proposition 1 shows that no βₜ with βₜ = o(t) or t = o(βₜ) satisfies both properties simultaneously. SEU fails safety; invariant-λ robust control fails consistency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is a c-robust equilibrium and how does it differ from a Berk-Nash equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: A Berk-Nash equilibrium (Esponda and Pouzo, 2016) requires the action to be an SEU best reply to beliefs supported on the relative entropy minimizers of the true DGP. A c-robust equilibrium requires the same support condition but with the best reply taken under the average robust control criterion, where the concern for misspecification λ equals minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c — that is, the minimum relative entropy scaled by 1/c. The endogenous λ is positive whenever the agent is misspecified, so the agent does not fully trust even the best-fitting model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the paper explain that misspecified lenient types converge to Berk-Nash while demanding types converge to maxmin?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For the lenient type (t = o(βₜ)), the time scaling makes the concern for misspecification converge to 0 (the LRT grows slower than βₜ relative to t), so the agent effectively behaves as an SEU maximizer with beliefs on the KL-minimizing models — the Berk-Nash condition. For the demanding type (βₜ = o(t)), the LRT diverges relative to βₜ, so λ → ∞ and the agent&amp;rsquo;s preferences converge to worst-case evaluation over all models absolutely continuous with the true DGP — the maxmin condition. These are Theorem 1, parts 1 and 2.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why does the statistician type exhibit cycles rather than convergence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Section 4 and Corollary 1 show in the monetary policy application that no pure c-robust equilibrium exists for small c. Intuitively, the conservative policy (a=0) is a best reply to a high misspecification concern, but it produces outcomes well-explained by Θ, which drives concern down. The aggressive policy (a=1) is a best reply to a low concern, but it generates increased inflation variability not captured in Θ, which drives concern up sharply. There is no fixed point that is self-sustaining, so the agent cycles. Theorem 2 shows that the empirical frequency of actions still converges to a mixed c-robust equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the quantitative comparative statics for the monetary policy cycles?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Corollary 1 establishes that there exists a threshold c̄ &amp;gt; 0 such that for all c ≤ c̄: (1) no pure c-robust equilibrium exists; (2) a mixed c-robust equilibrium exists; and (3) in the maximal and minimal equilibria, the frequency of the conservative policy α*(0) is increasing in θ&lt;em&gt;₁π + θ&lt;/em&gt;₁a — a larger exploitable trade-off between inflation and unemployment implies more time spent on the aggressive policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the model rationalize the Rees-Jones and Taubinsky (2020) labor supply finding?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Rees-Jones and Taubinsky (2020) find that approximately 40% of agents have incentive-compatible beliefs consistent with the schmeduling heuristic (linearizing a convex tax schedule), but approximately 20% fewer agents act according to that heuristic. In a Berk-Nash equilibrium, the schmeduling agent exerts excess effort relative to the optimum; the more convex the tax code, the larger the excess. In a c-robust equilibrium, the agent retains a positive misspecification concern proportional to the deviation between the convex tax schedule and the linear approximation. Higher effort levels are more exposed to uncertainty in the marginal rate (the misspecified term θ+ε multiplies a higher average income z), so the concern for misspecification provides a natural force that reduces effort below the Berk-Nash prediction. The paper notes this finding is also consistent with an alternative interpretation in Rees-Jones and Taubinsky where all agents hold schmeduling beliefs but under-respond behaviorally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the mixed c-robust equilibrium and why does it always exist?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: A mixed c-robust equilibrium is a mixed action α* ∈ Δ(A) such that beliefs ν are supported on the relative entropy minimizers Θ(α*) — computed as the parameter minimizing the α*-weighted average relative entropy across actions — and every action in the support of α* is a best reply under the average robust control criterion with λ = minθ Σ_a α*(a) R(p*_a || qθ_a) / c. Proposition 3 proves existence by mapping this fixed-point condition to a Nash equilibrium in an auxiliary game between the agent and two adversarial Nature players, then invoking Reny (1999) on that game. A pure c-robust equilibrium need not exist, but mixing over actions allows the concern for misspecification to be calibrated to the frequency of poorly-explained actions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does Theorem 2 formally connect cycles to mixed c-robust equilibria?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Theorem 2 states that if βₜ = ct for all t and α* is a βₜ-limit frequency (i.e., the empirical action distribution converges to α* with positive probability under some optimal policy), then α* is a mixed c-robust equilibrium. The intuition is that when α* places weight on both a well-explained action and a poorly-explained action, the time-averaged relative entropy stabilizes at a fixed level, producing a stable endogenous concern for misspecification that makes the agent asymptotically indifferent between the actions in the support — sharply reducing the incentive to break the cycle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What does the axiomatization contribute beyond the learning results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The axiomatization (Section 5, Theorem 3) provides behavioral foundations observable from choices, without assuming the internal LRT mechanism. Two primary axioms pin down the average robust control criterion within the variational class: Structured Savage (Sure-Thing Principle for bets over model identity) and Uniform Misspecification Concern (equal concern for misspecification regardless of which model is revealed as best-fitting). Dynamic Consistency over Models pins down Bayesian updating. Q-Likelihood axiomatizes that the concern for misspecification is ordinally increasing in the LRT. The novel Asymptotic Frequentism axiom (Axiom 9) pins down the &lt;em&gt;quantitative speed&lt;/em&gt; of adjustment: long histories with the same empirical frequency must induce asymptotically similar preferences, and Proposition 5 shows this implies λ_{hₜ} / (LRT(hₜ, Q) / (2tₙ)) converges to a finite limit — exactly the statistician type&amp;rsquo;s linear scaling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the correlation between behavioral biases that the model predicts?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper derives three novel empirical predictions about the cross-sectional and time-series correlation of uncertainty attitudes: (1) long-run uncertainty aversion positively correlates with initial misspecification and with belief in the Law of Small Numbers; (2) these correlations are causal — repeated model failures and overly demanding evaluation induce a shift toward cautious behavior; (3) even holding misspecification and probability reasoning fixed, limit uncertainty attitudes are stochastic, depending on whether the limit action&amp;rsquo;s outcomes are well-explained by the structured models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How does Example 2 (Correlation Neglect) show that endogenous concern can amplify rather than attenuate biases?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In a double auction, a buyer who mistakenly treats their own valuation and the ask price as independent (Correlation Neglect, Esponda, 2008) bids below the optimum in Berk-Nash equilibrium. In a c-robust equilibrium, the positive correlation between valuations and prices produces a strictly positive minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}), so the agent maintains misspecification concern. Since lower bids are accepted with lower probability (and thus are less sensitive to model misspecification), the endogenous concern drives the agent to bid even lower — amplifying the bias rather than attenuating it. This example illustrates that the direction of the correction depends on the geometry of how the misspecification interacts with the payoff structure.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Average Robust Control Criterion&lt;/strong&gt;: The decision criterion proposed in the paper. An agent evaluates action a by taking the expectation over structured models θ (with prior µ) of min_{p_a ∈ Δ(Y)} [E_{p_a}[u(a,y)] + (1/λ) R(p_a || qθ_a)]. This is a weighted average of robust control assessments, each penalizing distributions that deviate from a structured model in relative entropy. The parameter λ &amp;gt; 0 governs the intensity of misspecification concern, with SEU as the limit at λ → 0 and maxmin at λ → ∞.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous Misspecification Concern&lt;/strong&gt;: Unlike prior robust control models where λ is fixed or set externally, here λ(hₜ) = LRT(hₜ, Θ) / (2βₜ) is a function of how well the structured models explain the observed history hₜ via the likelihood ratio test statistic. The better the models explain past data, the smaller λ becomes and the less the agent hedges.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Statistician Type&lt;/strong&gt;: An agent who scales the likelihood ratio test statistic with a linear time sequence βₜ = ct for some c &amp;gt; 0. This is the unique agent type satisfying both ε-safety (guaranteed long-run average payoff above the maxmin guarantee minus ε) and ε-consistency under almost correct specification (no long-run regret when misspecification is small). The statistician type&amp;rsquo;s linear scaling is the only one for which the LRT statistic retains asymptotic informativeness about the degree of misspecification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;c-Robust Equilibrium&lt;/strong&gt;: A fixed-point concept for the long-run behavior of the statistician type. Action a* is a c-robust equilibrium if it is an average robust control best reply to beliefs supported on Θ(a*) = argmin_θ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}), with misspecification concern λ = minθ R(p*&lt;em&gt;{a*} || qθ&lt;/em&gt;{a*}) / c. This generalizes Berk-Nash equilibrium by incorporating an endogenous hedging motive proportional to the minimum relative entropy between the true DGP and the best structured model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mixed c-Robust Equilibrium&lt;/strong&gt;: A generalization of c-robust equilibrium to mixed actions α* ∈ Δ(A) for environments where no pure equilibrium exists. The beliefs are supported on the models minimizing the α*-weighted average relative entropy, and the misspecification concern is tied to that average entropy. Every βₜ-limit frequency is a mixed c-robust equilibrium (Theorem 2). This concept characterizes the long-run time-average behavior when the statistician type cycles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Law of Small Numbers (LSN) Type / Demanding Type&lt;/strong&gt;: An agent for whom βₜ = o(t), meaning the time scaling grows sub-linearly. This agent is excessively sensitive to early model failures (analogously to the Law of Small Numbers fallacy of Tversky and Kahneman, 1971, where short-run frequencies are treated as the long-run norm). The long-run behavior of such a type converges to maxmin behavior rather than robust control.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymptotic Frequentism (Axiom 9)&lt;/strong&gt;: A novel axiom requiring that conditional preferences after sufficiently long histories with the same empirical outcome frequency must be arbitrarily similar (in a quantitative sense defined by measuring rods x, y, E) to a limiting preference. This axiom axiomatically pins down the statistician type&amp;rsquo;s linear time scaling: it implies that the ratio λ_{hₜ} / (LRT(hₜ, Q) / (2t)) converges to a finite limit c, exactly characterizing βₜ = ct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Berk-Nash Equilibrium&lt;/strong&gt;: The equilibrium concept (Esponda and Pouzo, 2016) that describes the long-run behavior of lenient (SEU) agents learning under misspecification. An action a* is a Berk-Nash equilibrium if it is an SEU best reply to beliefs supported on Θ(a*) — the KL-minimizing models — without any additional hedging against misspecification. The current paper shows that lenient types converge to Berk-Nash equilibria, while statistician types converge to c-robust equilibria that differ by incorporating a positive misspecification concern.&lt;/p&gt;</description></item><item><title>Dynamic Regulation with Firm Linkages: Evidence from Texas</title><link>https://macropaperwarehouse.com/papers/dynamic-regulation-with-firm-linkages-evidence-from-texas/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dynamic-regulation-with-firm-linkages-evidence-from-texas/</guid><description>&lt;p&gt;This paper evaluates the efficiency of linked environmental regulation, a targeting mechanism whereby inspectors who discover violations at one plant can increase enforcement pressure on other plants sharing the same owner. The central research question is whether linking inspection decisions across co-owned plants adds value over unlinked, plant-level targeting and over random enforcement. The paper develops a new empirical framework of dynamic moral hazard under linked regulation, applies it to Texas environmental enforcement data, and uses the estimated model to evaluate counterfactual regulatory designs.&lt;/p&gt;
&lt;p&gt;The empirical setting is the Texas Commission on Environmental Quality (TCEQ), which enforces the Resource Conservation and Recovery Act (RCRA, governing hazardous waste) and the Clean Water Act using a two-dimensional scoring system. A plant-level &amp;ldquo;site rating&amp;rdquo; score captures the individual plant&amp;rsquo;s compliance history, while a firm-wide &amp;ldquo;person rating&amp;rdquo; score aggregates the weighted average of plant scores across all plants under the same manager. Both scores feed into a multiplicative penalty escalation rule and a logit-form inspection probability function. The data are an unbalanced panel of 9,792 plants from 2012–2020, with detailed records of inspections, violations, penalties, scores, and ownership. The average plant is inspected with probability 0.289 per year and is linked with approximately 2 other plants through common ownership, though some firms own portfolios exceeding 50 plants.&lt;/p&gt;
&lt;p&gt;The model features firms endowed with private types (abatement cost parameters) that may be affiliated within a firm&amp;rsquo;s portfolio, choosing continuous pollution actions to maximize discounted payoffs net of expected penalties. The regulator observes only scores and minimizes social costs subject to a binding inspection budget. A key computational innovation is &amp;ldquo;continuation value sufficiency&amp;rdquo;: because fully solving the portfolio optimization over large plant sets is infeasible due to the curse of dimensionality, each plant&amp;rsquo;s decision is approximated using three state variables — its own plant score, the firm-wide score, and a scalar summarizing other co-owned plants&amp;rsquo; continuation values — governed by an AR(1) transition process. Estimation proceeds in three stages: OLS/logit for inspection and penalty parameters, simulated method of moments for type distribution and curvature parameters, and inversion of the regulator&amp;rsquo;s first-order conditions to recover sector-specific marginal social harms.&lt;/p&gt;
&lt;p&gt;Descriptive evidence confirms three preconditions for linked regulation to add value: violations are positively correlated within firm portfolios, inspections are targeted toward higher-scoring plants on both dimensions, and higher inspection probabilities (instrumented by scores) are associated with fewer violations conditional on plant fixed effects. The coefficient on predicted inspection probability in the deterrence regression (specification 3, plant fixed effects, inspected years only) is −3.920, and an increase in log scores from 0 to 1.5 (roughly the interquartile range) reduces expected violations by approximately 0.5.&lt;/p&gt;
&lt;p&gt;Structural estimates show that plant-level and firm-level type variance are similar (σ²_J = 0.209, σ²_F = 0.275), indicating moderate within-firm cost correlation. The curvature parameter y = 0.403 governs diminishing returns to negligence. In counterfactual experiments centered on a 30% budget increase (approximately 10 percentage point rise in per-plant inspection probability), unlinked plant-score-based escalations reduce social costs by 31.9% relative to random inspections. Linked firm-score-based escalations reduce social costs by 41.8% relative to random. The optimal mix — approximately 40% unlinked and 60% linked — reduces social costs by 42.2% relative to random. A back-of-the-envelope cost-benefit calculation calibrating utility-sector violation costs at $3,157 per violation and inspection costs at $740 finds a return of $11.77 in avoided social costs per additional dollar spent on inspections under the optimal mixed regime, versus $8.28 under random inspections.&lt;/p&gt;
&lt;p&gt;The scope conditions are specific: the framework applies to RCRA and Clean Water Act plants in Texas, which typically cannot reallocate production across facilities (unlike Clean Air Act firms), so the pollution-substitution channel documented for multi-plant Clean Air Act firms is not modeled. The penalty schedule is taken as fixed; only inspection allocation is treated as a policy choice.&lt;/p&gt;
&lt;p&gt;Q: What is linked regulation and why might it improve on unlinked enforcement?
A: Linked regulation allows the regulator to increase inspection and penalty pressure on all plants owned by a firm when any one plant accumulates violations. It is efficient when compliance costs (types) are correlated within firms — e.g., due to managerial practices — because a violation at one plant is informative about likely violations at co-owned plants. This correlation means the regulator can target scarce inspection resources toward portfolios that are likely to harbor multiple bad actors, rather than inspecting each plant independently.&lt;/p&gt;
&lt;p&gt;Q: How does Texas implement linked regulation in practice?
A: Texas uses a two-dimensional scoring system. The plant score (&amp;ldquo;site rating&amp;rdquo;) summarizes the individual plant&amp;rsquo;s violation history over the past five years, normalized by complexity points. The firm score (&amp;ldquo;person rating&amp;rdquo;) is the complexity-weighted average of plant scores across all plants under the same manager. Penalties are then multiplied by escalation factors based on both scores: a firm in the &amp;ldquo;unsatisfactory performer&amp;rdquo; tier (firm score ≥ 55) faces a 1.1× firm escalation, while a &amp;ldquo;high performer&amp;rdquo; (firm score &amp;lt; 0.1) faces a 0.9× multiplier. Because the firm escalation applies to all plants in the portfolio simultaneously, even a small change in firm score can produce large aggregate deterrence effects across a large portfolio.&lt;/p&gt;
&lt;p&gt;Q: What descriptive evidence supports the preconditions for linked regulation to add value?
A: Three pieces of evidence are presented. First, a scatterplot (Figure 1) shows a positive cross-sectional correlation between a plant&amp;rsquo;s average violations per inspection and the leave-one-out average violations per inspection of its co-owned plants, indicating within-firm cost correlation. Second, Table 2 logit regressions show that both plant score (coefficient 0.121) and firm score (coefficient 0.062) significantly predict inspection probability, conditional on year and NAICS fixed effects. Third, Table 3 shows that conditional on plant fixed effects, predicted inspection probability is negatively associated with violations (coefficient −3.246 in specification 2, rising to −3.920 in specification 3 restricted to inspected plant-years), confirming dynamic deterrence.&lt;/p&gt;
&lt;p&gt;Q: What is the curse of dimensionality problem and how is it resolved?
A: In a multi-plant firm, each plant&amp;rsquo;s optimal action depends on the scores of every other co-owned plant, producing a state space of dimension n_plants + 1. For firms with portfolios of 50+ plants this is computationally infeasible. The paper introduces &amp;ldquo;continuation value sufficiency&amp;rdquo;: each plant&amp;rsquo;s decision is reduced to three state variables — its own score s_j, the firm score s_f, and a scalar W_j aggregating other co-owned plants&amp;rsquo; continuation values. Transitions are approximated by plant-specific AR(1) processes. This reduces the portfolio problem from one high-dimensional value function to n_plant separate three-dimensional value functions, each solved independently within an inner fixed-point loop.&lt;/p&gt;
&lt;p&gt;Q: How are the type distribution parameters identified?
A: The mean type for each NAICS sector θ̄_g is identified by average violations per inspection within that sector — a higher mean type implies more violations conditional on inspection. The plant-level type variance σ²_J is identified by the share of total violation variance occurring across plants within the same firm. The firm-level type variance σ²_F is identified by the share of total violation variance occurring across firms. The curvature parameter y is identified by the responsiveness of violations to changes in predicted inspection probability (the coefficient from specification 3 of Table 3, which equals −3.920 empirically and −6.095 in simulation moments).&lt;/p&gt;
&lt;p&gt;Q: What are the main counterfactual results?
A: A 30% increase in the inspection budget (approximately +10 percentage points in per-plant inspection probability) is allocated under four regimes. Random inspections reduce violations per plant by 0.31 from a baseline of 0.98. Unlinked (plant-score) escalations reduce social costs by 31.9% more than random. Linked (firm-score) escalations reduce social costs by 41.8% more than random. The optimal mix (approximately 40% unlinked, 60% linked) reduces social costs by 42.2% more than random. In detected violations, all three targeted regimes perform similarly (+0.7% detected violations versus random), meaning the social cost advantage of linked regulation comes through greater undiscovered deterrence rather than through detection rates.&lt;/p&gt;
&lt;p&gt;Q: How does the decomposition into static, own-plant, and cross-plant effects clarify the mechanism?
A: For unlinked escalations: the static effect accounts for −5.4% of social cost relative to random, own-plant dynamic deterrence accounts for −30.6%, and the cross-plant effect is +4.1% (slightly adverse, because unlinked escalations do not account for portfolio-level incentives). For linked escalations: the static effect is −2.4%, own-plant deterrence is −24.5% (smaller than unlinked because linked escalations are less precisely targeted to individual plant histories), and cross-plant deterrence is −14.9% (large and beneficial). The dominance of cross-plant deterrence under linked escalations is the key mechanism explaining why linking outperforms unlinked targeting.&lt;/p&gt;
&lt;p&gt;Q: What does the cost-benefit calculation find?
A: Calibrating utility-sector violation social costs at $3,157 per violation (from Kang and Silveira 2021 for California water utilities post-2006) and inspection costs at $740, the paper finds a return of $11.77 in avoided social costs per additional dollar spent on inspections under the optimal linked/unlinked mix, versus $8.28 under random inspections. This suggests a large return to expanding enforcement budgets, with the gain amplified substantially by optimal targeting design.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions and limitations acknowledged?
A: The framework applies to RCRA and Clean Water Act plants in Texas, where firms (e.g., gas station chains) typically cannot reallocate production across facilities, so the pollution-substitution channel documented by Gibson (2019) for Clean Air Act firms is not modeled. The penalty schedule is taken as fixed — only inspection allocation is treated as a policy choice — because Texas&amp;rsquo;s bylaws are prescriptive about how violations translate into penalties while leaving inspection targeting largely to regulator discretion. Social harm parameters h_g are identified only up to a scale normalization. The paper also does not model why types are correlated within firms (bad managers versus specialization), as the counterfactual results depend only on the degree of correlation, not its source.&lt;/p&gt;
&lt;p&gt;Q: How well does the model fit the data?
A: The model matches the targeted moments well (Table 5). Mean violations by NAICS sector are closely reproduced (e.g., utility: 0.201 empirical vs. 0.184 simulated; trade: 0.252 vs. 0.236). Responsiveness of violations to inspection probability matches closely (−6.398 empirical vs. −6.095 simulated). A non-targeted fit statistic — the correlation between a plant&amp;rsquo;s own violation rate and its co-owned plants&amp;rsquo; violation rates — is 0.32 in simulation versus 0.26 in the data, which the authors characterize as a good out-of-sample fit given it was not directly targeted in estimation.&lt;/p&gt;
&lt;p&gt;Q: How do heterogeneous effects shed light on the distributional consequences of regulation?
A: The own-plant deterrence effect is positive for all plants including those with low types that are unlikely to be targeted, but is especially pronounced for high-type plants under unlinked escalations. Under linked escalations, high-type plants are deterred less to the extent they are co-owned with lower-type plants, because firm-score-based targeting aggregates across the portfolio. Cross-plant effects are predictably small under unlinked escalations and larger under linked escalations, especially for firms with high-type portfolios, since those are the firms whose firm scores respond most to individual violations.&lt;/p&gt;
&lt;p&gt;Linked regulation: An enforcement mechanism in which the discovery of violations at one plant triggers increased inspection and penalty pressure on all other plants under the same owner. It exploits within-firm correlation in compliance costs to target scarce regulatory resources more efficiently than plant-by-plant escalation alone.&lt;/p&gt;
&lt;p&gt;Escalation mechanism: A penalty and inspection design in which plants with worse compliance records — measured by accumulated compliance scores — face disproportionately greater scrutiny and higher penalties per additional violation. The TCEQ&amp;rsquo;s two-dimensional scoring system is an escalation mechanism operating simultaneously at the individual plant and firm portfolio level.&lt;/p&gt;
&lt;p&gt;Plant score / firm score: The plant score (&amp;ldquo;site rating&amp;rdquo;) is a normalized index of a single facility&amp;rsquo;s violation history over the past five years, divided by investigation count and complexity points; the firm score (&amp;ldquo;person rating&amp;rdquo;) is the complexity-weighted average of all plant scores across the firm&amp;rsquo;s portfolio. Higher scores indicate worse compliance records and trigger both higher penalties and higher inspection probabilities.&lt;/p&gt;
&lt;p&gt;Continuation value sufficiency: The paper&amp;rsquo;s solution to the curse of dimensionality in large plant portfolios. Rather than tracking the full joint score state across all co-owned plants, each plant&amp;rsquo;s optimal action is approximated using three variables — its own score, the aggregate firm score, and a scalar W_j summarizing co-owned plants&amp;rsquo; continuation values — with state transitions governed by a plant-specific AR(1) process.&lt;/p&gt;
&lt;p&gt;Dynamic moral hazard under linked regulation: The firm&amp;rsquo;s problem of choosing how much to invest in pollution mitigation at each plant over time, given that current actions affect future scores, future penalties, and — through the firm-wide score — future scrutiny of all co-owned plants. The moral hazard arises because abatement costs are private information not directly observable by the regulator.&lt;/p&gt;
&lt;p&gt;Complexity points: A normalization factor in the TCEQ scoring system that adjusts raw violation counts for plant size and sector, enabling comparable compliance histories across heterogeneous facilities. They were introduced in 2012 specifically to prevent mechanically larger facilities from appearing riskier simply due to their scale.&lt;/p&gt;
&lt;p&gt;Cross-plant deterrence effect: The reduction in pollution actions at co-owned plants induced by increases in the firm-wide score following a violation at one plant in the portfolio. In the counterfactual decomposition, this effect accounts for −14.9 percentage points of social cost reduction under linked escalations and is the primary mechanism by which linked regulation outperforms unlinked plant-level escalation.&lt;/p&gt;</description></item><item><title>Dynamics of the Long-Term Housing Yield: Evidence from Natural Experiments</title><link>https://macropaperwarehouse.com/papers/dynamics-of-the-long-term-housing-yield-evidence-from-natural-experiments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/dynamics-of-the-long-term-housing-yield-evidence-from-natural-experiments/</guid><description>&lt;p&gt;Each month a fraction of UK property leases are extended by 90 years or more, creating thousands of natural experiments in which the same property&amp;rsquo;s rent and capital value are revealed simultaneously. This paper uses these lease extensions — and Massachusetts and Cambridge rent-control removals as a second identification strategy — to estimate the expected long-term housing yield (annual rent-to-price ratio) and decompose its dynamics into rent-growth expectations and discount-rate components. The central finding is that housing yield movements are dominated by discount-rate shocks: variation in required returns on housing explains the overwhelming majority of yield variance, while expected rent growth contributes less than 10 percent. Housing booms are therefore primarily driven by falling required returns, not by rational expectations of higher future rents. The yield responds to real long-term interest rates with a slope significantly below one, consistent with a non-pecuniary convenience yield on housing that is not fully displaced by interest rate changes.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-do-the-natural-experiments-identify"&gt;Q1. What do the natural experiments identify?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Lease extensions reveal the market&amp;rsquo;s valuation of the same physical dwelling at two points — just before and just after the 90-year extension — with the extension itself creating a clean variation in the remaining lease term (and hence in the present value of ownership) without changing the property&amp;rsquo;s rent-generating characteristics.&lt;/strong&gt; This design separates the rent and price components of the yield at the property level, allowing identification of discount-rate and rent-growth contributions free of compositional differences across properties.&lt;/p&gt;
&lt;h3 id="q2-why-do-discount-rates-dominate-yield-variation"&gt;Q2. Why do discount rates dominate yield variation?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A present-value decomposition of the housing yield into expected rent growth and the discount rate assigns more than 90 percent of variance to the discount rate component, implying that periods of low housing yields (high prices relative to rent) reflect primarily that investors demand a lower return on housing — not that they expect rents to rise faster.&lt;/strong&gt; This result mirrors Campbell-Shiller findings for equity markets but is especially striking for housing, where naive narratives often attribute booms to expected rent appreciation.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-convenience-yield-interpretation-imply"&gt;Q3. What does the convenience yield interpretation imply?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Housing yields respond less than one-for-one to real interest rate movements — a slope well below one in the yield-rate regression — implying that housing carries a non-pecuniary convenience yield (liquidity, collateral value, direct utility of ownership) that buffers the required return on housing against interest rate changes.&lt;/strong&gt; When real rates rise, housing yields rise by less, so price-to-rent ratios decline by less than a frictionless model would predict.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;housing yield&lt;/strong&gt; : the annual rent-to-price ratio on residential property; the paper&amp;rsquo;s central object, decomposed into discount-rate and rent-growth components.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;discount-rate channel&lt;/strong&gt; : the dominant source of housing yield variation in this paper; movements in investors&amp;rsquo; required return on housing, not expected rent growth, drive the observed yield dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;convenience yield&lt;/strong&gt; : the non-pecuniary value of housing ownership (liquidity, collateral, direct utility) that drives a wedge between the housing yield and the risk-free real interest rate; explains the less-than-one slope in the yield-rate relationship.&lt;/p&gt;</description></item><item><title>Education and the Margins of Cyclical Adjustment in the Labor Market</title><link>https://macropaperwarehouse.com/papers/education-and-the-margins-of-cyclical-adjustment-in-the-labor-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/education-and-the-margins-of-cyclical-adjustment-in-the-labor-market/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; This paper asks how the cyclical sensitivity of wages varies with workers&amp;rsquo; educational attainment, what mechanisms drive the differences, and what the welfare consequences are of ignoring this heterogeneity. The starting point is a well-known asymmetry: less-educated workers have much higher and more volatile job separation rates, yet the standard macroeconomic literature has treated wages as roughly acyclical for a representative worker. Doniger asks whether this employment-centric picture is incomplete—and finds that it is, in a direction opposite to what the employment pattern would suggest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and methodology.&lt;/strong&gt; The paper uses two primary data sources: the National Longitudinal Survey of Youth 1979 (NLSY), which provides detailed job histories enabling identification of current and completed employer tenure, and the Current Population Survey (CPS) from 1995 to 2020, used both for employment flow statistics and, via biennial Job Tenure Supplements, for replication of the main wage findings. The sample is restricted throughout to males with 0–30 years of potential experience, following the conventions of the user-cost-of-labor (UCL) literature (Kudlyak, 2014; Basu and House, 2016). Workers are grouped into three educational categories: less than high school, high school or some college, and bachelor&amp;rsquo;s degree or more.&lt;/p&gt;
&lt;p&gt;A key methodological contribution is a new, more parsimonious estimator for the cyclical sensitivity of the UCL. Rather than the multi-step indicator-variable approach of Kudlyak (2014), the paper recovers the UCL sensitivity from interaction terms between a flexible function of tenure and the cyclical position at the time of hiring, estimated within an augmented Mincer regression. This estimator admits higher-frequency identification, enables transparent inference via the delta method, and facilitates nonparametric impulse response estimation via the Jorda (2005) local projection method. Cyclical position is measured primarily as the deviation of the unemployment rate from an HP-filtered trend (lambda = 100,000), with robustness checks using the Hamilton (2018) filter and GDP-based detrending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — employment.&lt;/strong&gt; Monthly separation rates from the CPS (1995–2020) show that workers with less than a high school degree separate at a rate of 9.4 percent per month, more than twice the 3.4 percent rate for workers with a bachelor&amp;rsquo;s degree or more, regardless of cyclical position. The volatility of the separation rate (measured by the time-series standard deviation) is also larger for the least educated (1.7) than for the most educated (0.6). All sub-components of separation-to unemployment, to inactivity, and job-to-job transitions-exhibit the same ordering. In response to a 100 basis point monetary policy contraction (Romer and Romer, 2004 shocks), employment of workers with less than a high school education falls significantly, while employment of college graduates or more is statistically unaffected.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — wages.&lt;/strong&gt; Using the NLSY, the cyclical sensitivity of the UCL to a 1 percentage point deviation of the unemployment rate from trend is estimated at approximately −15.5 percent for workers with a bachelor&amp;rsquo;s degree or more, −4.9 percent for high school or some college workers, and −1.4 percent (statistically indistinguishable from zero) for workers without a high school degree. In contrast, average hourly earnings (AHE) show much smaller and more compressed differences across education groups (−1.4, −1.1, and −1.0 percent respectively). The pattern of increasing procyclicality with education holds for new hires&amp;rsquo; wages (NHW) as well but is considerably less stark than for the UCL. Replication in the CPS confirms the ordering: UCL sensitivities are −7.0 percent for college graduates, −2.9 percent for high school or some college, and effectively zero for those without a high school degree.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; Counterfactual decompositions show that differences in the cyclical sensitivity of the wage-tenure profile—not just differences in job duration (separation rates)-account for the vast majority of the divergence across education groups. When separation rates are held constant across groups, the UCL sensitivity of the college-educated falls from -15.5 to −13.0 percent; when wage-tenure profile sensitivities are held constant, it falls to −6.3 percent, and the ordering across groups largely disappears. This finding is consistent with implicit contracting theory (Thomas and Worrall, 1988): longer expected employment durations for the more educated make it optimal to defer a greater share of the wage response to shocks over time, rendering near-term rigidities functionally less binding and producing more persistent effects of hiring-period conditions on subsequent wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; After controlling for cyclical sorting in match quality using the Hagedorn and Manovskii (2013) proxies (cumulated market tightness during tenure and leading up to the present job), the UCL sensitivity for college graduates falls modestly to −12.4 percent, confirming that match-quality composition effects account for only a minority of the documented pattern. The monetary policy shock analysis (Romer-Romer shocks identified from Greenbook forecast errors) yields a 35 percent decrease in the UCL for the most educated at the two-year horizon following a 100 basis point contraction, with no discernible effect for the least educated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare consequences.&lt;/strong&gt; Using a stylized New Keynesian model extended to two labor varieties with heterogeneous wage flexibility, the paper shows that ignoring the documented heterogeneity leads to underestimating the welfare costs of business cycle fluctuations by more than 15 percent under the baseline calibration (unit Frisch elasticity and unit elasticity of intertemporal substitution). Conditional on this model, the welfare loss due to fluctuations for the least educated is more than 15 times larger than for the most educated. The paper explicitly notes this is a conservative lower bound, because the model assumes pooled household consumption, and admitting idiosyncratic consumption risk would disproportionately burden less-educated workers who bear adjustment on the extensive (employment) rather than intensive (wage) margin.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the user cost of labor (UCL), and why does the paper use it rather than average hourly earnings or new hires&amp;rsquo; wages?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The UCL, formalized by Kudlyak (2014), is the present discounted value of wage payments an employer expects to make to a worker over the duration of the employment relationship, net of the continuation value of retaining that worker. It equals the new hire&amp;rsquo;s wage plus the expected wage wedge—the discounted stream of future wage differences between workers hired in the current period versus workers hired one period later. Unlike average hourly earnings or new hires&amp;rsquo; wages, the UCL captures the persistent effects of macroeconomic conditions at the time of hiring on all future remitted wages, making it the appropriate allocative wage concept from a macroeconomic standpoint. The paper documents that AHE understates the cyclicality of wages for all groups but especially for the most educated, because AHE omits the highly cyclically sensitive expected wage wedge that characterizes college-educated employment relationships.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How does the paper&amp;rsquo;s new estimator for the cyclical sensitivity of the UCL differ from the existing method, and what does this enable?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The existing Kudlyak (2014)/Basu and House (2016) method recovers the UCL by estimating a very large set of date-of-hire x current-date indicator interactions, constructing a time series of the UCL, and then analyzing that series—a multi-step procedure that loses covariances across steps and makes cross-sectional disaggregation or high-frequency identification impractical. The new method instead estimates the UCL sensitivity directly from coefficients on the interaction between a flexible tenure function and the cyclical position at hiring, estimated within a single augmented Mincer regression. The UCL semi-elasticity is recovered analytically from these coefficients via a formula that sums discounted weighted differences in the tenure-interaction coefficients across the tenure horizon. This single-step approach allows transparent inference via the delta method, enables fully interacted specifications for heterogeneous subgroups, permits the hiring-date frequency (e.g., weekly in NLSY) to differ from the wage observation frequency (annual or biannual), and permits estimation from repeated cross-sections—all of which were infeasible in the prior approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What are the quantitative magnitudes of the education gradient in UCL cyclicality, and how do they compare across wage measures?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using the NLSY with unemployment deviations from HP-filtered trend as the cyclical indicator: the UCL sensitivity is −15.5 percent (se 3.86) for workers with a bachelor&amp;rsquo;s degree or more, −4.9 percent (se 1.52) for high school or some college, and −1.4 percent (se 2.48, statistically insignificant) for those without a high school degree. By contrast, new hires&amp;rsquo; wages show sensitivities of −3.4, −1.8, and −1.2 percent respectively, and average hourly earnings show −1.4, −1.1, and −1.0 percent. The gradient is largest and most statistically significant for the UCL, indicating that the bulk of the education gap in cyclical wage sensitivity operates through the persistent effect of hiring-period conditions on subsequent wages rather than through the contemporaneous wage alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. What mechanism accounts for the UCL gradient — differential job durations or differential sensitivity of the wage-tenure profile?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper decomposes the UCL into the new hire&amp;rsquo;s wage and the expected wage wedge, and performs counterfactual exercises holding either separation rates or wage-tenure profile sensitivities constant across education groups (Table 3). Holding separation rates constant while allowing wage-tenure profiles to differ reduces the college-educated UCL sensitivity only modestly, from -15.5 to −13.0 percent; holding wage-tenure profile sensitivities constant while allowing separation rates to differ reduces the college-educated sensitivity to −6.3 percent and compresses the education gradient substantially. Thus, differential sensitivity of the wage-tenure profile—the degree to which wages continue to respond to hiring-period conditions over the course of the job-is the primary driver of the UCL gradient, with differential separation rates playing a secondary but non-trivial role. This finding confirms the prediction of Thomas and Worrall (1988) that lower separation rates support greater use of deferred payment and intertemporal risk sharing in optimal wage contracts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. How does the paper rule out cyclical sorting in match quality as the explanation for the UCL gradient?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers hired during recessions may be of systematically lower match quality, producing persistently lower wages not because wages are more cyclically sensitive for the same quality match but because recession hires are worse matches. Using the Hagedorn and Manovskii (2013) proxies for match quality - cumulated market tightness during the worker&amp;rsquo;s tenure on the present job (mjob) and on all prior jobs leading to it (mctj) - the paper augments the wage regression with full interactions between these proxies and the tenure-cyclicality terms. After controlling for match quality, the UCL sensitivity for college graduates falls from -15.5 to −12.4 percent (se 5.56); the point estimate remains large, statistically significant, and well above the estimates for lower-education groups. Figure 4 shows that match-quality adjustment primarily affects the first two years of the wage-tenure profile, after which the bias from cyclical sorting fades, confirming that scarring in remuneration for college graduates hired in recessions persists beyond what sorting can explain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What do monetary policy shocks reveal about the education gradient in wage sensitivity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Monetary policy shocks (identified from Greenbook forecast errors as in Romer and Romer, 2004) subject all labor markets to the same aggregate demand shock simultaneously, providing a cleaner test of differential responsiveness than cyclical regressions that may conflate demand composition and supply factors. Using Jorda (2005) local projections, a 100 basis point monetary policy contraction is associated with a 35 percent decrease in the UCL for workers with a bachelor&amp;rsquo;s degree or more at the two-year horizon, with statistically insignificant effects on the UCL of workers without a high school degree. The employment results are symmetric: less-educated workers&amp;rsquo; employment falls significantly after a monetary contraction, while college-educated workers&amp;rsquo; employment is unaffected. This cross-validation using monetary policy shocks supports the main thesis that more-educated workers absorb aggregate demand variation through the wage margin, while less-educated workers absorb it through the employment margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How does acyclical wages for the least educated affect interpretation of the existing macro literature on wage rigidity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The aggregate finding of Kudlyak (2014) and Basu and House (2016)-that the UCL is more procyclical than new hires&amp;rsquo; wages or average hourly earnings, casting doubt on wage rigidity as an amplification mechanism—holds only for educated workers. The paper finds that the UCL for workers without a high school degree is statistically acyclical by all three wage measures. This result restores a potential role for nominal wage rigidity in generating amplification and persistence of shocks for less-educated labor markets, including in the Diamond-Mortensen-Pisarides class of search models criticized by Kudlyak (2014) and in New Keynesian models criticized by Basu and House (2016). The paper therefore reconciles the literature on wage rigidity with the empirical finding of cyclical employment volatility concentrated among the less educated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What is the welfare calculation, and what are its key results and limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The welfare exercise uses a parsimonious New Keynesian model with two labor varieties (capturing more- and less-educated workers) and price and wage rigidities. The model is extended to admit heterogeneous wage flexibility, and the welfare costs of fluctuations are evaluated following the second-order approximation method of Gali et al. (2007). Under the baseline calibration (unit Frisch elasticity, unit elasticity of intertemporal substitution), the heterogeneous-worker economy incurs welfare costs of fluctuations that exceed those of the output-gap-equivalent representative agent economy by more than 15 percent. The welfare loss of the least-educated workers is more than 15 times that of the most educated. The paper explicitly characterizes this as a conservative lower bound: the model assumes pooled household consumption (within varieties), which implies equal consumption sensitivity across education groups, whereas in reality less-educated workers face income loss on the extensive margin without the wage smoothing available to the more educated. Relaxing this assumption, as in Krusell et al. (2009), could yield welfare losses an order of magnitude larger.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What does the CPS replication add, and what are its limitations relative to the NLSY baseline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The CPS replication (Table 7) confirms the main ordering: UCL sensitivities are −7.0, −2.9, and approximately 0 percent for college graduates, high school or some college, and less than high school respectively. This rules out the concern that the NLSY findings are artifacts of the single aging cohort that characterizes the NLSY 1979. However, the CPS must be treated as a repeated cross-section because the tenure data are only available biennially and individual-level panel linkage across tenure supplement waves is infeasible. As a result, the CPS estimates cannot include individual fixed effects and must rely more heavily on observable controls (industry, occupation) to absorb cyclical variation in workforce composition. The CPS also precludes the match-quality controls of Hagedorn and Manovskii (2013). Despite these limitations, the main qualitative and directional findings replicate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. What policy implications does the paper draw for monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper argues that because less-educated workers bear adjustment to aggregate demand shocks disproportionately through the employment margin while their wages are acyclical, welfare assessments that focus on the aggregate output gap underweight the costs borne by less-educated workers. The paper suggests that re-optimizing the monetary policy rule to account for documented heterogeneity would entail placing greater weight on the unemployment rate of the least-educated when measuring the output gap. More broadly, the K-shaped nature of labor market adjustment across education groups — wage scarring for the educated versus employment volatility for the less educated - implies that policies targeting either margin in isolation will miss welfare costs concentrated in the other group.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;User Cost of Labor (UCL).&lt;/strong&gt; The allocative wage from the employer&amp;rsquo;s perspective, defined as the present discounted value of expected future wage payments to a worker hired at date t, net of the continuation value of retaining that worker in the next period. Formally, UCL_t = w_{t,t} + E_t[sum beta^j(1-s)^j (w_{t+j,t} - w_{t+j,t+1})], decomposing into the new hire&amp;rsquo;s wage and the expected wage wedge. In this paper&amp;rsquo;s usage, the UCL is the appropriate measure of the cyclical impact of shocks on labor costs because it captures persistent effects of hiring-period conditions on the entire subsequent wage sequence, not just the contemporaneous wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expected Wage Wedge (EWW).&lt;/strong&gt; The component of the UCL beyond the new hire&amp;rsquo;s wage: the discounted stream of differences between wages a worker hired at date t will receive in future periods and the wages a worker hired one period later would receive in those same future periods. The EWW is non-zero whenever wages are history-dependent - i.e., whenever current macroeconomic conditions at the time of hiring affect future remitted wages. The paper finds that the EWW is larger, more negative, and more persistent for more-educated workers conditional on being hired during a cyclical downturn.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-enforcing implicit wage contract.&lt;/strong&gt; A labor contract in which the sequence of remitted wages is not pinned down period-by-period by spot-market forces but instead reflects an intertemporal risk-sharing arrangement between employer and worker that is sustained by the mutual benefit of the ongoing employment relationship. In this paper&amp;rsquo;s framework (drawing on Thomas and Worrall, 1988), lower separation rates make longer planning horizons feasible, which in turn expands the scope for deferring wage adjustments across time - effectively allowing more-educated workers and their employers to smooth the effects of cyclical shocks over longer horizons than is possible for less-educated workers with shorter expected job durations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cyclical sorting / match quality bias.&lt;/strong&gt; The compositional concern that workers hired during recessions may be of systematically different (in this context, lower) match quality than those hired during booms, so that the persistent wage depression observed for recession hires could reflect poor match quality rather than cyclically sensitive wages for equivalent-quality matches. The paper uses the Hagedorn and Manovskii (2013) proxies - cumulated labor market tightness during the current job and prior employment history - to control for cyclical variation in match quality and assess the residual sensitivity of the UCL for average-quality matches.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive versus intensive margin of labor market adjustment.&lt;/strong&gt; The distinction between adjustment through changes in the number of workers employed (extensive margin: hiring and separation) versus adjustment through changes in wages or hours conditional on employment (intensive margin). A central finding of the paper is that less-educated workers bear cyclical adjustment disproportionately on the extensive margin (more volatile separation rates, employment losses following monetary contractions) while their wages are acyclical, whereas more-educated workers exhibit the reverse: stable employment but highly cyclically sensitive wages, especially as measured by the UCL.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage scarring.&lt;/strong&gt; The persistent negative effect of hiring-period macroeconomic conditions on wages throughout the subsequent employment spell, beyond what is explained by contemporaneous market conditions. In this paper&amp;rsquo;s context, wage scarring is concentrated among more-educated workers: being hired when the unemployment rate is one percentage point above trend is associated with wages that remain depressed for several years, with the depression being larger and more persistent for college-educated workers than for those with less education. This is demonstrated via the expected wage wedge profiles in Figure 3 and is confirmed to survive controls for match-quality sorting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Output-gap-equivalent representative agent economy.&lt;/strong&gt; A conceptual benchmark constructed in the paper&amp;rsquo;s welfare analysis: a single-worker-type New Keynesian economy whose wage and labor supply elasticities are set equal to the output-elasticity-weighted averages of the two labor variety types in the heterogeneous economy. The paper shows that the heterogeneous-worker economy and this representative-agent benchmark produce identical aggregate output gap and price level paths (under Cobb-Douglas production, earnings elasticities are identical across varieties), but welfare diverges because period utility is more volatile for the variety with more rigid wages. The 15 percent excess welfare cost of the heterogeneous economy relative to this benchmark is the paper&amp;rsquo;s headline welfare result.&lt;/p&gt;</description></item><item><title>Efficiency Criteria, Income Taxation, and Heterogeneous Elasticities</title><link>https://macropaperwarehouse.com/papers/efficiency-criteria-income-taxation-and-heterogeneous-elasticities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/efficiency-criteria-income-taxation-and-heterogeneous-elasticities/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Can income tax schedules be justified as utilitarian-optimal without adopting extreme normative assumptions about how household welfare should be measured? The paper proposes a welfare criterion strictly stronger than Pareto efficiency—called &lt;em&gt;rationalizability with bounded curvature&lt;/em&gt;—and asks whether observed US income taxes satisfy it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Starting Point.&lt;/strong&gt; Any Pareto-efficient nonlinear income tax schedule can, in principle, be rationalized as utilitarian-optimal under &lt;em&gt;some&lt;/em&gt; cardinalization of household utilities (i.e., some choice of how to measure the cardinal scale of each household&amp;rsquo;s well-being). However, the paper shows that rationalizing Pareto-efficient taxes in this way often requires cardinalizations under which there is &lt;em&gt;no&lt;/em&gt; population upper bound on the curvature of utility with respect to consumption. Equivalently, a utilitarian planner&amp;rsquo;s marginal willingness to transfer resources to households must fall arbitrarily quickly with the size of those transfers—an extreme form of status quo bias violated by virtually all quantitative optimal-tax exercises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Proposed Criterion.&lt;/strong&gt; The authors restrict attention to cardinalizations with &lt;em&gt;locally bounded curvature&lt;/em&gt;: there exists a finite (though potentially arbitrarily large) upper bound on the coefficient of relative risk aversion across the population. This admits two interpretations: (i) ex post, it requires that the social value of transfers not change arbitrarily quickly with transfer size; (ii) ex ante, it corresponds to a decision-maker behind a veil of ignorance with bounded risk aversion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Theoretical Result.&lt;/strong&gt; Within a standard Mirrlees model of nonlinear income taxation with arbitrary preference heterogeneity and intensive-margin labor supply, the paper proves that a tax schedule can be rationalized with bounded curvature if and only if government revenues are both &lt;em&gt;decreasing and concave&lt;/em&gt; (not merely decreasing) with respect to a class of narrowly targeted &amp;ldquo;two-bracket&amp;rdquo; reforms—reforms that raise retention by $1 local to some income level $z$ and zero elsewhere. This contrasts with Pareto efficiency, which requires only that revenues be decreasing in these reforms (Bierbrauer, Boyer, and Hansen 2023). The additional requirement of revenue concavity is what distinguishes the bounded-curvature criterion from pure Pareto efficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistics.&lt;/strong&gt; The paper derives explicit sufficient-statistics expressions for the first- and second-order derivatives of tax revenue with respect to these targeted reforms. The second derivative depends on higher moments of the elasticity distribution, specifically the &lt;em&gt;income-conditional variance&lt;/em&gt; of compensated elasticities of taxable income (ETIs). Revenue convexity—which causes the second-order condition to fail—arises when income-conditional ETI variance is sufficiently high, even holding the mean ETI fixed. The economic mechanism is a &amp;ldquo;sort-and-extort&amp;rdquo; dynamic: a small tax reform sorts higher-elasticity households into income brackets where marginal taxes fall and lower-elasticity households into brackets where marginal taxes rise; repeating the reform then exploits this sorting by differentially taxing households by elasticity, as if applying group-specific tax schedules within a uniform income tax.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Findings.&lt;/strong&gt; Using the NBER panel of US tax returns from 1979 to 1990, the paper estimates income-conditional mean ETIs of approximately 0.2–0.3 at most income levels. Crucially, it estimates a &lt;em&gt;lower bound&lt;/em&gt; on income-conditional ETI variance by comparing elasticities of light versus heavy itemizers (defined by whether a household claims above or below the mean value of deductions in its income bracket). The low-elasticity group has an ETI of approximately zero and the high-elasticity group has an ETI of approximately one, implying a lower bound on ETI variance of roughly 0.2 at most incomes and approximately 0.25 at the top of the distribution. This lower bound is close to—and under plausible assumptions above—the threshold required for the second-order condition to fail. The authors conclude that the US income tax schedule in 1990 was likely Pareto efficient but likely &lt;em&gt;not&lt;/em&gt; rationalizable with bounded curvature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Welfare Gains.&lt;/strong&gt; In a calibrated model with a 50% top marginal tax rate, Pareto-tail shape of 2.5, mean ETI of 0.3, and ETI standard deviation of 0.75 (50% above the estimated lower bound), the planner gains significant welfare from either raising or lowering top marginal taxes. The welfare-maximizing top rate below the baseline is 13.3%, generating social value equivalent to a transfer of $1,966 per top earner. The welfare-maximizing top rate above the baseline is 71.2%, generating social value equivalent to a transfer of $972 per top earner. The revenue-maximizing rate is 80.9% under the baseline calibration, ranging from 74.6% to 86.8% as ETI standard deviation varies by ±25% of the lower bound.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The theoretical analysis is restricted to intensive-margin labor supply (abstracting from extensive-margin decisions); the empirical application focuses on top incomes where extensive-margin effects are likely small. The empirical period is 1979–1990, covering major federal and state tax reforms. Results concern local efficiency of the tax schedule, not global optimization.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What exactly is &amp;ldquo;rationalizability with bounded curvature&amp;rdquo; and how does it differ from Pareto efficiency?&lt;/strong&gt;
A: Pareto efficiency requires that no small reform makes someone better off without making anyone worse off. Rationalizability (with &lt;em&gt;any&lt;/em&gt; cardinalization) is equivalent to Pareto efficiency in this setting. Rationalizability with bounded curvature additionally restricts the cardinalization: there must exist a finite upper bound on the coefficient of relative risk aversion (or equivalently, on the curvature of utility with respect to consumption) across the population. This is a strictly stronger criterion than Pareto efficiency. A schedule can be Pareto efficient but not rationalizable with bounded curvature if the only cardinalizations that rationalize it require unbounded consumption utility curvature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why do &amp;ldquo;extreme&amp;rdquo; cardinalizations with unbounded curvature arise when rationalizing Pareto-efficient taxes?&lt;/strong&gt;
A: When a Pareto-efficient schedule is rationalized as utilitarian, the cardinalization must make the set of feasible, recardinalized utilities convex so it can be separated from the set of Pareto-improving allocations. The paper constructs such a cardinalization explicitly: it takes the form of a function whose second derivative approaches negative infinity as utility approaches its baseline value. This implies the planner&amp;rsquo;s marginal value of transfers to a household falls precipitously as the household is made even slightly better off—an extreme status quo bias. Theorem 2.b establishes that &lt;em&gt;all&lt;/em&gt; cardinalizations rationalizing a schedule with convex revenues must share this pathology.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the &amp;ldquo;sort-and-extort&amp;rdquo; mechanism and how does it generate revenue convexity?&lt;/strong&gt;
A: When elasticities of taxable income (ETIs) are heterogeneous within an income level and the income density is declining steeply, a reform that lowers marginal taxes around income $z$ brings more households into the local bracket (because there are more households just below $z$ than above). Crucially, it disproportionately attracts households with &lt;em&gt;higher&lt;/em&gt; ETIs, since they respond more strongly to the marginal tax cut and relocate from further away, where the density differs more. Repeating the reform therefore faces a higher-elasticity composition at $z$, generating larger positive behavioral effects—making revenues convex in the size of the reform. The second step (&amp;ldquo;extort&amp;rdquo;) involves raising taxes on the now-concentrated low-elasticity households at adjacent brackets, achieving as-if group-specific taxation within a single income tax schedule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the precise relationship between revenue convexity and ETI variance?&lt;/strong&gt;
A: The paper shows (Theorem 4) that the second-order revenue derivative with respect to a narrow two-bracket reform around income $z$ equals a positive function of the income density times the expression $-[1-R&amp;rsquo;_0(z)]\varepsilon(z) + [1-R&amp;rsquo;_0(z)]\alpha(z)[\varepsilon^2(z) + \text{var}_h[\varepsilon^h | z^h_0=z]]$. The first term is always negative (pushing toward revenue concavity). The second term, which includes the income-conditional variance of ETIs, can dominate and create revenue convexity when ETI variance is sufficiently large. In the benchmark case with a single household type at each income (no within-income heterogeneity), the variance term vanishes and revenues are always concave whenever decreasing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the sufficient statistics test for rationalizability at the top of the income distribution?&lt;/strong&gt;
A: At top incomes (assuming no income effects, no super-elasticities, and CES preferences), taxes are Pareto efficient if and only if $\tau_\text{top} &amp;lt; \frac{1}{1+\alpha_\text{top}\varepsilon_\text{top}}$, and they are rationalizable with bounded curvature if and only if additionally $\tau_\text{top} &amp;lt; \frac{2}{1+\alpha_\text{top}(\varepsilon_\text{top} + \sigma^2_\text{top}/\varepsilon_\text{top})}$, where $\tau_\text{top}$ is the top marginal tax rate, $\alpha_\text{top}$ is the Pareto tail shape, $\varepsilon_\text{top}$ is the mean ETI at the top, and $\sigma^2_\text{top}$ is the income-conditional ETI variance at the top.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper estimate a lower bound on income-conditional ETI variance?&lt;/strong&gt;
A: The authors divide households at each income level into &amp;ldquo;heavy&amp;rdquo; and &amp;ldquo;light&amp;rdquo; itemizers based on whether their total deductions exceed the local income-bracket mean. They then estimate group-specific ETIs using local polynomial regressions of log income changes on log marginal retention changes, interacting tax changes with heavy-itemizer indicators. The within-year difference in elasticities between groups provides a lower bound on within-income ETI variance, since the two-group decomposition captures only a fraction of true variance. The interaction coefficient is allowed to vary by year to isolate within-year, within-income variation in elasticities rather than between-year compositional changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the estimated magnitudes of mean and variance of ETIs?&lt;/strong&gt;
A: Income-conditional average ETIs are estimated at between 0.2 and 0.3 at most income levels, consistent with but somewhat below prior literature estimates. The low-elasticity group (light itemizers) has an ETI of approximately zero, while the high-elasticity group (heavy itemizers) has an ETI of approximately one. Given roughly equal group sizes, this implies a lower bound on ETI variance of approximately 0.2 at most incomes and approximately 0.25 at the ninety-fifth percentile. Subdividing the high-elasticity group into two, three, and four subgroups yields a lower bound of approximately 0.25 for variance at the top.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the back-of-the-envelope calculation work to assess whether the second-order test fails?&lt;/strong&gt;
A: With $\tau_\text{top} \approx 0.5$, $\alpha_\text{top} \approx 2.5$, and $\varepsilon_\text{top} \approx 0.3$ (from prior literature), the second-order condition fails if and only if ETI variance exceeds approximately 0.27. The authors&amp;rsquo; lower bound estimate of ETI variance is already approximately 0.25 (standard deviation approximately 0.5), just below this threshold. The authors note that if the true standard deviation exceeds the lower bound by more than 4%, the second-order condition fails, making it empirically likely that the 1990 US tax schedule was not rationalizable with bounded curvature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why does the paper focus on the top of the income distribution for the empirical test?&lt;/strong&gt;
A: The second-order condition is most likely to fail at high incomes for three reasons simultaneously: (i) the marginal tax rate is highest, (ii) ETI means are somewhat higher there, and (iii) the Pareto parameter $\alpha(z)$ is largest (income density falls steeply), which amplifies the sort-and-extort mechanism. The authors also note that extensive-margin labor supply responses—which are abstracted away in the theory—are likely small at high incomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the calibrated quantitative application reveal about optimal top tax policy?&lt;/strong&gt;
A: Calibrated with a 50% initial top marginal tax rate, Pareto tail shape of 2.5, mean ETI of 0.3, and ETI standard deviation of 0.75 (50% above the estimated lower bound), the model finds welfare gains in both directions of reform. The welfare-maximizing rate &lt;em&gt;below&lt;/em&gt; the baseline is 13.3%, yielding equivalent welfare gains of $1,966 per top earner. The welfare-maximizing rate &lt;em&gt;above&lt;/em&gt; the baseline is 71.2%, yielding equivalent gains of $972 per top earner. The revenue-maximizing rate is 80.9%, ranging from 74.6% to 86.8% when ETI standard deviation varies by ±25% of the lower bound. This sensitivity highlights that the optimal direction and magnitude of reform depend substantially on the uncertain degree of ETI heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper relate to the &amp;ldquo;inverse optimum&amp;rdquo; literature?&lt;/strong&gt;
A: The inverse optimum approach (Bourguignon and Spadaro 2012; Hendren 2020) infers the first-order welfare trade-offs implicit in an observed tax schedule. This paper goes further by inferring from second-order empirical moments—specifically the income-conditional ETI variance—whether taxes are consistent with &lt;em&gt;minimal&lt;/em&gt; requirements on how sensitive the planner&amp;rsquo;s trade-offs are to household welfare levels. Rather than assuming a welfare function, it tests whether &lt;em&gt;any&lt;/em&gt; welfare function with bounded curvature can rationalize the observed schedule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Is revenue convexity possible without within-income heterogeneity in preferences?&lt;/strong&gt;
A: Yes, but only under more specific conditions. The paper provides two supplemental examples. In the first, all households have constant-elasticity labor disutility but differ in both productivity and elasticity across income levels; when lower-income households have higher elasticities, a reform reducing marginal taxes at $z$ attracts higher-elasticity households and raises the average elasticity, leading to convex revenues. In the second, all households have the same initial elasticity but individual elasticities change in response to reforms. However, with the standard additively separable CES preferences and no within-income heterogeneity, revenues are always concave when decreasing—consistent with Werning&amp;rsquo;s (2007) observation that the Pareto planner&amp;rsquo;s problem is convex in this case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What is the role of random tax reforms in the paper&amp;rsquo;s logic?&lt;/strong&gt;
A: Random tax reforms serve as an expository bridge. The paper shows that if the second-order revenue effect of a two-bracket reform is positive at some income $z$, then a &amp;ldquo;randomized&amp;rdquo; reform that applies the reform with equal probability in positive and negative directions generates an expected Pareto improvement—because the convexity of revenues implies expected revenues rise, while for any household with bounded risk aversion the reform&amp;rsquo;s second-order utility effect is also positive when the reform is sufficiently narrow. This establishes that revenue convexity implies random Pareto inefficiency under bounded risk aversion, and then the paper shows the analogous deterministic result for rationalizability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What scope conditions attach to the sufficient conditions for rationalizability (Theorem 3)?&lt;/strong&gt;
A: Theorem 3 requires Assumptions 1 and 3 plus two boundary conditions: the ratio $\delta\text{Rev}(z)/(zg(z))$ must remain bounded away from zero as income approaches 0 or infinity, and at all incomes there must exist households with low enough compensated elasticities. Assumption 1 requires that average and marginal taxes have upper bounds below one, that marginal taxes have a lower bound, and that $zg(z)$ converges to zero at the boundaries. Assumption 3 is a regularity condition on how conditional moments of the elasticity distribution vary with income. These conditions ensure that the narrow, self-financing reforms considered in the necessity proof cannot generate welfare improvements once revenues are both decreasing and concave.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Rationalizability with Bounded Curvature.&lt;/strong&gt; The property that a tax schedule is utilitarian-optimal under some cardinalization of household utilities in which there exists a finite (though potentially arbitrarily large) upper bound on the curvature of utility with respect to consumption across the population. Formally, there exists a continuous function $\bar{\rho}$ such that, for all households, the absolute value of $[w_h \circ u_h]_{cc} / [w_h \circ u_h]_c$ is bounded by $\bar{\rho}$ evaluated at the household&amp;rsquo;s income. This criterion is strictly stronger than Pareto efficiency and strictly weaker than utilitarian optimality under a fixed cardinalization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-Bracket Reform.&lt;/strong&gt; A targeted tax reform that increases retention (post-tax income) by $1 at incomes local to some level $z$ over a small bracket of width $\ell$, and zero elsewhere (smoothed at the edges). As $\ell \to 0$, this becomes an infinitesimally narrow reform. The first- and second-order revenue effects of these reforms—denoted $\delta\text{Rev}(z)$ and $\delta^2\text{Rev}(z)$—are the paper&amp;rsquo;s key objects: Pareto efficiency requires $\delta\text{Rev}(z) &amp;lt; 0$ for all $z$, and rationalizability with bounded curvature additionally requires $\delta^2\text{Rev}(z) \leq 0$ for all $z$.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-Conditional ETI Variance.&lt;/strong&gt; The variance of compensated elasticities of taxable income (ETIs) among households with the same income level, $\text{var}_h[\varepsilon^h | z^h_0 = z]$. This is the paper&amp;rsquo;s primary empirical object of interest and the key determinant of whether revenues are convex or concave in the size of targeted reforms. Unlike the literature&amp;rsquo;s focus on mean ETIs by income bracket, this within-income variance captures heterogeneity among households sharing the same pre-reform income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sort-and-Extort Mechanism.&lt;/strong&gt; The two-step economic mechanism underlying revenue convexity from ETI heterogeneity. In the first step (&amp;ldquo;sort&amp;rdquo;), a marginal tax cut around income $z$ disproportionately attracts higher-ETI households from lower incomes (because they respond more strongly and relocate from further away), shifting the elasticity composition at $z$ upward. In the second step (&amp;ldquo;extort&amp;rdquo;), repeating the reform finds higher-elasticity households concentrated where marginal taxes fall and lower-elasticity households where taxes rise, effectively applying differential tax treatment by elasticity within a single income tax schedule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Pareto Parameter $\alpha(z)$.&lt;/strong&gt; Defined as $-d\log(zg(z))/d\log z$, where $g(z)$ is the income density. This captures the rate at which the income density is falling in income locally at $z$, and governs the strength of the sort-and-extort mechanism. High $\alpha(z)$ at top incomes (reflecting a steeply declining Pareto-type density) amplifies revenue convexity from ETI heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Super-Elasticity.&lt;/strong&gt; A concept that captures how a household&amp;rsquo;s compensated ETI would change if its income were different, holding preferences fixed. Formally, it is the derivative of the household&amp;rsquo;s elasticity with respect to its log income, decomposing into effects from changes in preference curvature and changes in the local curvature of the tax schedule. Super-elasticities are zero in the benchmark case of additively CES preferences and locally CES retention schedules but contribute additional terms to the second-order revenue expression in the general case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cardinalizing Function.&lt;/strong&gt; A strictly increasing function $w_h$ that maps household $h$&amp;rsquo;s indirect utility $V_h$ to a cardinalized utility level $w_h(V_h)$. The social planner maximizes the expectation of cardinalized utilities. Different choices of ${w_h}_h$ correspond to different stances on interpersonal comparisons, including unbounded curvature (rationalizing any Pareto-efficient schedule) or bounded curvature (the paper&amp;rsquo;s proposed restriction). Rawlsian social welfare is a limit of utilitarian welfare with increasingly concave cardinalizing functions.&lt;/p&gt;</description></item><item><title>Energy Transitions in Regulated Markets</title><link>https://macropaperwarehouse.com/papers/energy-transitions-in-regulated-markets/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/energy-transitions-in-regulated-markets/</guid><description>&lt;p&gt;This paper asks how rate-of-return (RoR) regulation in U.S. electricity markets affects the speed and efficiency of energy transitions, specifically the transition from coal to combined-cycle natural gas (CCNG) generation driven by fracking-induced cost declines. The authors build and estimate a structural model of regulated utility behavior in which utilities optimize investment, retirement, and hourly operations decisions against an incentive structure set by state Public Utility Commissions (PUCs).&lt;/p&gt;
&lt;p&gt;The regulatory environment combines two instruments: (1) an allowable rate of return that is decreasing in consumer electricity rates (incentive regulation), parameterized as s = (r/r₀)^{-γ}, where higher γ penalizes high-cost outcomes more severely; and (2) a &amp;ldquo;used-and-useful&amp;rdquo; standard in which a coal plant&amp;rsquo;s contribution to the rate base depends on its capacity utilization via a logit function. These two instruments create a tension: utilities want to lower costs to earn a higher RoR, but also want to run existing coal plants—even when uneconomical—to prove they are &amp;ldquo;used and useful&amp;rdquo; and thus maximize their rate base and profits.&lt;/p&gt;
&lt;p&gt;The authors estimate the model using publicly available EIA and EPA CEMS data spanning 2006–2017, covering 39 unique regulated utilities in the Eastern Interconnection across more than 4 million utility-hour observations (459 utility-years). Structural parameters are recovered via a nested fixed-point indirect inference approach that matches simulated regression coefficients to actual data; investment and retirement costs are estimated with a GMM nested fixed-point approach.&lt;/p&gt;
&lt;p&gt;Key reduced-form findings confirm the model&amp;rsquo;s two core mechanisms. First, a 10% increase in total variable costs is associated with a 2.5% decrease in variable profits per MW of capacity (with utility fixed effects), consistent with incentive regulation. Second, regulated utilities reduce coal generation by only a statistically insignificant 4.2 percentage points when coal fuel costs exceed import prices, compared to 16.1 percentage points for restructured utilities—consistent with regulated utilities running coal out-of-dispatch order to preserve used-and-useful status.&lt;/p&gt;
&lt;p&gt;In counterfactual simulations that impose 2018–20 natural gas prices ($2.01/MMBtu versus the 2006 price of $7.24/MMBtu) on utilities with their 2006 capital stocks, regulated utilities retire only 53% of coal capacity over 30 years and increase CCNG capacity by 296%, whereas a cost minimizer would retire most coal capacity while increasing CCNG by only 58%. The Averch-Johnson over-investment effect dominates: regulated utilities over-invest in CCNG while simultaneously over-using legacy coal.&lt;/p&gt;
&lt;p&gt;Carbon taxes on regulated utilities reduce short-run coal generation only 48% as much as when imposed on a cost minimizer (because the used-and-useful incentive partially offsets the carbon price signal), but in the long run result in 68% lower coal capacity and 77% lower coal generation relative to baseline by year 30—larger effects than for the cost minimizer. Eliminating the coal usage incentive (μ₂ = 0) produces 82% lower coal capacity and 92% lower coal generation over 30 years but requires utility variable profits to fall by over $300 million, threatening reliability without compensating transfers.&lt;/p&gt;
&lt;p&gt;Scope conditions: Results apply to regulated (non-restructured) utilities in the Eastern Interconnection, 2006–2017. The model estimates the coal-to-CCNG transition only; it explicitly does not model the ongoing transition to renewables and storage due to insufficient data variation.&lt;/p&gt;
&lt;p&gt;Q1: What is the central research question?
The paper asks whether and how rate-of-return regulation in U.S. electricity markets slows energy transitions, and what alternative regulatory structures or carbon tax policies could accelerate the transition away from coal. It addresses this both theoretically—through a structural model of regulated utility behavior—and empirically, through estimation and counterfactual simulation using data on 39 regulated utilities over 2006–2017.&lt;/p&gt;
&lt;p&gt;Q2: What are the two key regulatory instruments in the model, and what distortions do they create?
The first instrument is incentive regulation: the allowable rate of return declines as consumer electricity rates rise (s = (r/r₀)^{-γ}), so utilities have an incentive to lower costs. The second is the used-and-useful standard: a coal plant&amp;rsquo;s contribution to the rate base depends on its capacity utilization via a logit function, creating an incentive to run coal plants even when their fuel costs exceed import prices. Together, these instruments generate a tension between cost-reduction incentives and legacy-capacity-preservation incentives, causing the regulated utility to both over-invest in new CCNG capacity (Averch-Johnson effect) and over-use existing coal capacity relative to the cost-minimizing benchmark.&lt;/p&gt;
&lt;p&gt;Q3: What does the reduced-form evidence show about uneconomical coal usage?
In a triple-difference specification, regulated utilities reduce coal generation by only 4.2 percentage points (statistically insignificant) when coal fuel costs exceed import prices, compared to a 16.1 percentage point reduction for restructured utilities. CCNG generation responds similarly under both regulatory regimes (21.1 vs. 19.7 percentage points), confirming that the distortion is specific to legacy coal under RoR regulation and not a general feature of high-cost generation. The six states with the largest responsiveness of coal usage to low market prices are all restructured states; out-of-dispatch-order coal generation also correlates strongly with utility ownership share across states.&lt;/p&gt;
&lt;p&gt;Q4: What do the structural parameter estimates reveal about the rate base?
Each MW of CCNG capacity increases the rate base by $229,000. When fully utilized, each MW of coal capacity contributes 1.144 times as much as CCNG. When coal is not fully used, unused coal capacity contributes only 40% as much to the rate base as CCNG. NGT capacity contributes 79% more to the rate base than CCNG per MW. Operations cost estimates include O&amp;amp;M costs of $12.89/MWh for coal, $8.82/MWh for CCNG, and $44.63/MWh for NGT; a 100 MW coal ramp in one hour costs $4,770 versus $3,860 for CCNG.&lt;/p&gt;
&lt;p&gt;Q5: What happens in the 30-year long-run counterfactual under the baseline regulated utility?
Facing a sudden drop to 2018–20 natural gas prices ($2.01/MMBtu vs. $7.24/MMBtu in 2006), regulated utilities retire 53% of coal capacity and increase CCNG capacity by 296% over 30 years. The Averch-Johnson over-investment effect dominates: utilities invest heavily in CCNG while retaining and using legacy coal far longer than a cost minimizer would. The social planner effectively eliminates coal generation immediately (99% reduction in the first period) and retires almost all coal capacity over the horizon.&lt;/p&gt;
&lt;p&gt;Q6: How does a cost minimizer behave relative to the regulated utility in the same long-run counterfactual?
A cost minimizer immediately reduces coal generation by 50% in the first period and retires most coal capacity over 30 years while increasing CCNG capacity by only 58%—versus the regulated utility&amp;rsquo;s 296% CCNG increase. Thirty years after the shock, the cost minimizer has retired 71% more coal capacity than the regulated utility. The cost minimizer&amp;rsquo;s much smaller CCNG expansion reflects that it does not face Averch-Johnson incentives to over-invest in rate-base capital.&lt;/p&gt;
&lt;p&gt;Q7: What is the short-run vs. long-run impact of carbon taxes on regulated utilities compared to cost minimizers?
In the short run, carbon taxes on regulated utilities reduce coal generation only 48% as much as when imposed on a cost minimizer (34% vs. ~100% in immediate generation drop), because the used-and-useful incentive counteracts the carbon price signal. In the long run (30-year horizon), however, carbon taxes on regulated utilities result in 68% lower coal capacity and 77% lower coal generation relative to baseline—larger percentage reductions than for a cost minimizer—because the regulatory structure amplifies the retirement incentive over time once carbon costs erode the economic rationale for keeping coal in the rate base.&lt;/p&gt;
&lt;p&gt;Q8: What is the short-run operations counterfactual finding for carbon taxes in the sample period?
Using each utility-year in the analysis sample, imposing carbon taxes on regulated utilities reduces carbon costs by only about $500 million relative to baseline—41% of the $1.3 billion carbon cost savings from imposing the same carbon taxes on a cost minimizer. Despite this limited carbon reduction, electricity rates nearly triple from $77.58/MWh to $224.18/MWh under the regulated utility with carbon taxes, as the utility passes through most carbon costs to consumers; regulated utility variable profits also fall by over $500 million.&lt;/p&gt;
&lt;p&gt;Q9: What happens when the coal usage incentive is eliminated (μ₂ = 0)?
Setting the coal usage incentive parameter μ₂ = 0 (eliminating the logit slope on capacity utilization) causes coal capacity to fall 82% and coal generation to fall 92% relative to baseline over 30 years—a slightly larger generation decline than for the cost minimizer. However, this comes at the cost of more than twice the CCNG capacity due to the Averch-Johnson effect, and requires utility variable profits to fall by over $300 million, raising reliability concerns unless accompanied by compensating transfers.&lt;/p&gt;
&lt;p&gt;Q10: How does the paper&amp;rsquo;s mechanism relate to observed differences in coal exit rates between regulated and restructured states?
Between 2006 and 2018, 26.0% of coal capacity exited in restructured states versus only 17.2% in regulated states—a gap the authors attribute primarily to the used-and-useful incentive structure in RoR regulation. The structural model quantifies how this regulatory feature specifically distorts coal usage and retirement decisions; it is not explained by demand or cost differences across states, as confirmed by the triple-difference evidence showing the gap is specific to coal (not CCNG) and to regulated (not restructured) utilities.&lt;/p&gt;
&lt;p&gt;Q11: Why does the paper argue that alternative regulatory adjustments are insufficient to replicate cost-minimizing transitions?
Changing regulatory parameters—such as increasing the coal usage incentive or adjusting the electricity rate penalty—does not come close to replicating the speed of the energy transition under a cost minimizer in the long-run simulations. Regulatory adjustments that do approach cost-minimizing outcomes (such as eliminating μ₂) require large reductions in utility variable profits sufficient to risk reliability, consistent with why the 2022 Inflation Reduction Act relied on substantial investment transfers rather than carbon taxes as its primary clean energy instrument.&lt;/p&gt;
&lt;p&gt;Q12: What is the paper&amp;rsquo;s identification strategy?
Identification exploits the sharp, exogenous decline in natural gas fuel prices from fracking, which had heterogeneous implications across utilities depending on their initial capital mixes (coal-heavy vs. CCNG-heavy). By comparing investment, retirement, and operations decisions across utilities and over time—particularly between utilities that had CCNG exposure before the price decline and those that did not—the authors recover the structural regulatory and cost parameters. The IV specification for reduced-form evidence uses the current natural gas price interacted with the utility&amp;rsquo;s initial CCNG generation share as an instrument for fuel and import costs.&lt;/p&gt;
&lt;p&gt;Q13: What are the paper&amp;rsquo;s explicit limitations?
The paper estimates the coal-to-CCNG transition only and cannot speak to the transition to renewables and storage, because there is insufficient variation in the data to identify how regulators would treat CCNG as a legacy technology subject to used-and-useful standards, or how renewables and storage would contribute to the rate base. The authors note that over-investment in CCNG capacity may create future stranded asset problems for ratepayers and that usage incentives for CCNG are likely to further hinder the transition to renewables—but these are conjectures rather than estimated findings.&lt;/p&gt;
&lt;p&gt;Rate-of-return (RoR) regulation: A regulatory structure in which the PUC sets electricity rates so that utility revenues cover total variable costs plus an allowable return on the utility&amp;rsquo;s rate base (capital stock), with the allowable return parameterized as s = (r/r₀)^{-γ}, declining as consumer electricity rates rise.&lt;/p&gt;
&lt;p&gt;Used-and-useful standard: A prudence criterion under which a capital asset&amp;rsquo;s contribution to the rate base depends on its capacity utilization, modeled as a logit function of the generation-to-capacity ratio; fully used coal capacity contributes 1.144 times as much as CCNG per MW, while unused coal contributes only 40% as much.&lt;/p&gt;
&lt;p&gt;Rate base: The capital stock on which the PUC grants the utility its allowable rate of return; adjusted by prudence and used-and-useful assessments and described in the paper as &amp;ldquo;at best an arduous task&amp;rdquo; to quantify precisely.&lt;/p&gt;
&lt;p&gt;Averch-Johnson (AJ) over-investment effect: The tendency of regulated utilities to over-invest in capital because profits are proportional to the rate base; in this paper&amp;rsquo;s setting, this causes regulated utilities to increase CCNG capacity by 296% over 30 years following the natural gas price shock, compared to 58% for a cost minimizer.&lt;/p&gt;
&lt;p&gt;Incentive regulation: A modification of cost-plus RoR regulation in which the allowable rate of return declines as electricity rates rise; it provides efficiency incentives for cost reduction but does not achieve first-best outcomes and is insufficient to overcome the used-and-useful distortion for legacy coal.&lt;/p&gt;
&lt;p&gt;Out-of-dispatch-order generation: Running a generation unit when its fuel costs exceed the market import price; regulated utilities engage in this behavior with coal plants to maintain used-and-useful status and rate base contribution, whereas restructured utilities do not face this incentive.&lt;/p&gt;
&lt;p&gt;Nested fixed-point indirect inference: The estimation approach used to recover structural regulatory and operations parameters by minimizing the distance between regression coefficients from actual data and those from model-simulated data via a non-linear parameter search.&lt;/p&gt;</description></item><item><title>Enlightenment Ideals and Belief in Progress in the Run-up to the Industrial Revolution</title><link>https://macropaperwarehouse.com/papers/enlightenment-ideals-and-belief-in-progress-in-the-run-up-to-the-industrial-revolution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/enlightenment-ideals-and-belief-in-progress-in-the-run-up-to-the-industrial-revolution/</guid><description>&lt;p&gt;This paper tests Joel Mokyr&amp;rsquo;s claim that Britain&amp;rsquo;s industrialization was preceded and enabled by a cultural shift — specifically, that Enlightenment ideals produced a &amp;ldquo;progress-oriented&amp;rdquo; view of science that diffused to artisans and craftsmen. The central research question is whether and when the language of science became more progress-oriented in the build-up to the Industrial Revolution, and whether this shift was concentrated in volumes directly linked to industrial production.&lt;/p&gt;
&lt;p&gt;The authors assemble 173,031 unique volumes printed in England and written in English between 1500 and 1900, drawn from the Hathitrust Digital Library. Because copyright law prohibits downloading full text, they use HDL&amp;rsquo;s Extracted-Features &amp;ldquo;bag of words&amp;rdquo; dataset. After removing duplicates and Latin-language volumes from an initial set of 420,081, they apply Latent Dirichlet Allocation (LDA) with cross-validated perplexity minimization to identify an optimal T=60 topics. Topic-pair co-occurrence analysis identifies three categories — science, religion, and political economy — each anchored by three defining topics. Volume-level category weights are derived by multiplying each topic&amp;rsquo;s weight by its category coefficient. The resulting classification yields 50,090 science volumes, 102,565 political economy volumes, and 14,124 religion volumes.&lt;/p&gt;
&lt;p&gt;Progressive sentiment is measured using a seven-word dictionary (progress, improvement, stride, betterment, advance, rise, amelioration) assembled from thesaurus synonyms for &amp;ldquo;progress,&amp;rdquo; manually vetted by all four authors, and restricted to words attested in the Oxford English Dictionary before 1643 (Newton&amp;rsquo;s birth year). Sentiment for each volume equals the count of progress-dictionary words divided by total word count. An analogous optimism-sentiment placebo dictionary is constructed separately.&lt;/p&gt;
&lt;p&gt;Industrial relevance is scored using the digitized indexes of all five volumes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903); the top industrial root words are crane (weight 51), electr (42), weight (37), rope (27), and cost (27). Each volume receives an industry score equal to the weighted occurrence of industrial root words normalized by volume length.&lt;/p&gt;
&lt;p&gt;Three main findings emerge. First, the language of science and religion showed little overlap beginning in the 17th century — that is, the secularization of science predates the onset of industrialization. Science volumes shifted from approximately 40 percent religious content around 1700 to only about 10 percent by 1850, with scientific content rising correspondingly from roughly 40 percent to over 60 percent. This trend was stable from 1650 through 1900.&lt;/p&gt;
&lt;p&gt;Second, while scientific volumes became more progress-oriented during the Enlightenment, this progressive shift was concentrated in volumes at the nexus of science and political economy. Volumes of &amp;ldquo;pure&amp;rdquo; science were largely neutral with respect to progress sentiment, and those at the science-religion nexus had on average negative progress sentiment. The marginal effect of scientific content on progress sentiment was greatest for volumes mixing science and political economy, and most of the increase in predicted sentiment at that nexus occurred during the 18th century, remaining stable thereafter. A placebo test using optimism sentiment finds the opposite pattern: volumes at the science-political economy nexus were among the least optimistic, while the most optimistic language appeared at the religion-political economy nexus. This rules out the interpretation that the measured shift reflects a general increase in positive affect rather than specifically progress-oriented language.&lt;/p&gt;
&lt;p&gt;Third, volumes employing industrial terminology that also sat at the science-political economy nexus were distinctively progressive beginning in the mid-18th century. At the 90th percentile of industry score, predicted progress sentiment at the science-political economy nexus was positive throughout the sample; at zero industry score, it was negative until the mid-18th century. Volumes at the religion-political economy nexus showed modestly positive and time-stable progress sentiment regardless of industry score.&lt;/p&gt;
&lt;p&gt;The paper concludes that it was the pragmatic, applied volumes — those bridging science and political economy, written for artisans and a broader literate public rather than for the human-capital elite alone — that embodied the cultural values Mokyr identifies as central to Britain&amp;rsquo;s industrialization.&lt;/p&gt;
&lt;p&gt;Q: What gap in the existing literature does this paper address?&lt;/p&gt;
&lt;p&gt;A: Prior work on the cultural deep roots of economic growth rarely tracks how culture changes over time, relying instead on cross-sectional variation or qualitative case studies. Quantitative evidence that the language of science itself became more progress-oriented — and that this change reached beyond elite thinkers to artisans and craftsmen — had not been marshaled before. The paper provides inaugural quantitative support by analyzing 173,031 volumes spanning four centuries.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper restrict the progress-sentiment dictionary to words attested before 1643?&lt;/p&gt;
&lt;p&gt;A: Words that entered English only after 1643 (Newton&amp;rsquo;s birth year) could not have appeared in volumes from the early Enlightenment, so including them would bias sentiment scores toward the later part of the sample. The restriction ensures the dictionary is applicable and unbiased across the full 1500–1900 period. The final retained words are: progress, improvement, stride, betterment, advance, rise, amelioration.&lt;/p&gt;
&lt;p&gt;Q: How does LDA classify volumes, and how is T=60 selected?&lt;/p&gt;
&lt;p&gt;A: LDA treats each volume as a bag of words and derives a Dirichlet distribution such that observed documents are generated by repeated topic sampling. The number of topics T is selected by minimizing perplexity on held-out data via 4-fold cross-validation, rotating training and test sets across folds; this procedure yields T=60 as optimal. Each volume is then represented as a mixture over those 60 topics.&lt;/p&gt;
&lt;p&gt;Q: What are the three categories and their anchor topics?&lt;/p&gt;
&lt;p&gt;A: Political Economy is anchored by topics on law/public opinion, governance/parliament, and trade/price/labour. Religion is anchored by topics on church/Christian doctrine, God/faith/sin, and virtue/fame/religion. Science is anchored by topics on engineering/steam/electricity, chemistry/acid/heat, and geometry/equations/trigonometry. These three sets of topics were selected for high corpus-wide importance and mutual independence.&lt;/p&gt;
&lt;p&gt;Q: What does the finding on science-religion separation imply for timing?&lt;/p&gt;
&lt;p&gt;A: The separation of scientific and religious language was already visible by 1600 and firmly established by the mid-17th century, well before the Industrial Revolution conventionally dated to the mid-18th century. This supports Mokyr&amp;rsquo;s argument that the secularization of science was an Enlightenment-era precursor to industrialization rather than a product of it. The trend remained stable from 1650 through 1900.&lt;/p&gt;
&lt;p&gt;Q: How does the progressive sentiment differ between pure science and the science-political economy nexus?&lt;/p&gt;
&lt;p&gt;A: Volumes of pure science were largely neutral with respect to progress-oriented language and in some periods showed slightly negative predicted progress sentiment. The science-religion nexus showed consistently negative progress sentiment. By contrast, volumes at the science-political economy nexus showed the highest level of progressive sentiment beginning in the mid-18th century, and most of this growth in predicted sentiment occurred during the 18th century, after which it remained stable.&lt;/p&gt;
&lt;p&gt;Q: What does the placebo optimism test show?&lt;/p&gt;
&lt;p&gt;A: The optimism sentiment scores are nearly the mirror opposite of the progress scores: the most optimistic language appears at the religion-political economy nexus, while volumes at the science-political economy nexus are among the least optimistic. This dissociation rules out the interpretation that the measured progress-sentiment rise reflects a general shift toward positive language rather than a specific cultural embrace of science as a tool for improving human welfare.&lt;/p&gt;
&lt;p&gt;Q: How is the industrial score constructed and what are the most heavily weighted terms?&lt;/p&gt;
&lt;p&gt;A: The authors digitized the detailed indexes of all five volumes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903), restricted to words attested before 1643, and weighted each industrial root word by its index frequency. Each corpus volume&amp;rsquo;s industry score equals the sum of (word count × index weight) across all industrial words, normalized by volume length, yielding a score between 0 and 1. The top-weighted terms are crane (51), electr (42), weight (37), rope (27), and cost (27).&lt;/p&gt;
&lt;p&gt;Q: What is the key result linking industrial scores to progressive sentiment?&lt;/p&gt;
&lt;p&gt;A: At the science-political economy nexus, volumes with industry scores at the 90th percentile had persistently positive predicted progress sentiment throughout the sample, while volumes at that nexus with zero industry score had negative predicted sentiment until the mid-18th century. The shift to positive sentiment for high-industry volumes at this nexus occurred in the mid-18th century — roughly coinciding with the onset of Britain&amp;rsquo;s industrialization — and those volumes remained the most progress-oriented in the corpus thereafter.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s interpretation of the science-political economy nexus finding in relation to Mokyr?&lt;/p&gt;
&lt;p&gt;A: The authors interpret volumes at the science-political economy nexus as pragmatic, applied works aimed at a broader literate audience including artisans and craftsmen, not exclusively the human-capital elite. These are precisely the volumes Mokyr&amp;rsquo;s &amp;ldquo;Industrial Enlightenment&amp;rdquo; thesis predicts would carry progress-oriented cultural values into the mechanical and artisanal pursuits that drove industrialization. The finding that pure-science volumes were not especially progressive, while applied volumes bridging science and political economy were, is consistent with Mokyr&amp;rsquo;s argument that it was the diffusion of Enlightenment ideals to skilled practitioners — not just to elite scientists — that mattered.&lt;/p&gt;
&lt;p&gt;Q: What qualitative examples support the quantitative findings?&lt;/p&gt;
&lt;p&gt;A: Martin Clare&amp;rsquo;s The Motion of Fluids (1735) explicitly addresses &amp;ldquo;the Unlearned&amp;rdquo; and states in its preface that the work is meant to be &amp;ldquo;of singular Use and Benefit to Mankind&amp;rdquo; — a direct expression of the progress-oriented language the algorithm detects. George Stephenson&amp;rsquo;s 1831 railway report argues that rail infrastructure would allow Ireland to &amp;ldquo;reciprocate with England and with other nations, the products of industry,&amp;rdquo; exemplifying how progress-oriented language pervaded industrial writing by the early 19th century. These examples confirm that the high progress-sentiment scores for industrial volumes at the science-political economy nexus reflect genuine rhetorical content, not measurement artifacts.&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s limitations regarding early sample periods?&lt;/p&gt;
&lt;p&gt;A: The corpus is thin in earlier eras, particularly around 1550, so results from the earliest decades must be interpreted with caution. The HDL data derive from digitized scans with OCR output of very old books, introducing errors such as the &amp;ldquo;long-S&amp;rdquo; misread (e.g., &amp;ldquo;juftice&amp;rdquo; for &amp;ldquo;justice&amp;rdquo;) that require manual correction. Additionally, the bag-of-words model discards word order, which may obscure some semantic distinctions.&lt;/p&gt;
&lt;p&gt;Q: What future research directions do the authors identify?&lt;/p&gt;
&lt;p&gt;A: The authors propose applying the same textual analysis techniques to test whether English-language volumes began reflecting greater freedom of expression in the run-up to Britain&amp;rsquo;s economic takeoff, connecting to the literature on European political fragmentation and the marketplace of ideas. They also suggest applying the approach to corpora in other languages — Dutch (following McCloskey&amp;rsquo;s argument about bourgeois values) and Spanish (to examine whether the Counter-Reformation and Spain&amp;rsquo;s economic lag are reflected in cultural attitudes toward progress and science).&lt;/p&gt;
&lt;p&gt;LDA (Latent Dirichlet Allocation): An unsupervised generative statistical model that treats each document as a bag of words and extracts latent topics as multinomial distributions over vocabulary; used here to reduce 173,031 volumes to mixtures of 60 topics without imposing prior scholarly interpretations.&lt;/p&gt;
&lt;p&gt;Progressive Sentiment Score: The fraction of words in a volume belonging to a seven-word dictionary of progress synonyms (progress, improvement, stride, betterment, advance, rise, amelioration), normalized by total word count; measures the cultural orientation toward the betterment of humankind as embedded in text.&lt;/p&gt;
&lt;p&gt;Industrial Score: A volume-level measure equal to the weighted count of industrial root words — derived from the indexes of Appleby&amp;rsquo;s Illustrated Handbook of Machinery (1877–1903) — normalized by volume length; captures the degree to which a volume&amp;rsquo;s vocabulary overlaps with industrial production terminology.&lt;/p&gt;
&lt;p&gt;Science-Political Economy Nexus: The region of the topic simplex where volumes carry substantial weight in both the science and political economy categories but low weight in religion; the paper finds this is where progress-oriented language was most concentrated from the mid-18th century onward, interpreted as applied science aimed at artisans and a broader literate public.&lt;/p&gt;
&lt;p&gt;Industrial Enlightenment: Joel Mokyr&amp;rsquo;s (2009) concept describing the diffusion of Enlightenment ideals about the practical utility of science into the mechanical and artisanal pursuits that drove Britain&amp;rsquo;s industrialization; the paper provides quantitative support for this thesis by showing that industrial volumes at the science-political economy nexus were distinctively progress-oriented.&lt;/p&gt;
&lt;p&gt;Culture of Growth: Mokyr&amp;rsquo;s (2016) broader argument that a pan-European network of elite intellectuals fostered a progress-oriented view of science — the idea that scientific understanding could improve the human condition — and that this cultural norm, in combination with Britain&amp;rsquo;s stock of skilled craftsmen, made industrialization possible.&lt;/p&gt;
&lt;p&gt;Bag of Words: A representation of text that records only word frequencies within a document, discarding word order; used here both because HDL copyright restrictions prevent full-text download and because it is the input format required by LDA.&lt;/p&gt;</description></item><item><title>Environmental Consequences of Hydrocarbon Infrastructure Policy</title><link>https://macropaperwarehouse.com/papers/environmental-consequences-of-hydrocarbon-infrastructure-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/environmental-consequences-of-hydrocarbon-infrastructure-policy/</guid><description>&lt;p&gt;Covert and Kellogg study policies that aim to &amp;ldquo;keep carbon in the ground&amp;rdquo; by blocking fossil fuel infrastructure investment, with the Dakota Access Pipeline (DAPL) as their empirical application. DAPL moves more than 500,000 barrels per day of oil from the Bakken Shale of North Dakota to the U.S. Gulf Coast and was completed in June 2017 amid substantial opposition. The central research question is whether blocking pipeline construction actually keeps oil in the ground or merely shifts transport to alternative modes — specifically crude-by-rail — and what the net environmental and economic consequences are.&lt;/p&gt;
&lt;p&gt;The paper develops a two-period model of crude oil production and transportation mode choice. In the model, oil shippers decide in period 1 whether to commit to pipeline capacity under ship-or-pay contracts, then in period 2 allocate flows between the committed pipeline and the more flexible but costlier railroad alternative. Pipeline construction is an irreversible sunk cost with zero ongoing marginal cost; rail involves no sunk cost but substantial ongoing marginal costs including quadratic adjustment costs that capture capital investment in rail cars and loading/unloading facilities. Equilibrium pipeline capacity is determined by a shippers&amp;rsquo; indifference condition: expected per-barrel returns from pipeline access equal the FERC-regulated tariff.&lt;/p&gt;
&lt;p&gt;The empirical model is estimated using monthly Bakken oil production and transportation data, price differentials across three coastal destinations (Gulf, East, West), and drilling productivity data. Crude-by-rail marginal costs are estimated via 2SLS, yielding static marginal cost intercepts of $9.49/bbl to the East Coast, $12.64/bbl to the Gulf Coast, and $8.69/bbl to the West Coast, plus a dynamic adjustment cost of $1.28/bbl per mbbl/d of flow change. The upstream supply model follows Anderson, Kellogg, and Salant (2018), with old-well production following exponential decline (estimated decay parameter β = 0.955) and new-well drilling responding to current and lagged prices with a total long-run elasticity of 1.32. Shippers&amp;rsquo; beliefs about future oil prices are calibrated to an AR(1) process fit to historical price volatility (persistence φ₁ = 0.9925, volatility σ_G = 0.098). Model validation confirms a predicted expected return to pipeline commitment of $6.17/bbl against DAPL&amp;rsquo;s actual tariff of $5.50–$6.25/bbl.&lt;/p&gt;
&lt;p&gt;The main counterfactual asks what would have happened had DAPL&amp;rsquo;s construction been enjoined. In expectation, blocking DAPL reduces pipeline flows by 306 mbbl/d. Expected crude-by-rail flows increase by 248 mbbl/d, offsetting 81% of the pipeline reduction. Bakken oil production falls by only 58 mbbl/d, a 4% reduction. The modal shift from pipeline to rail worsens local environmental outcomes: per-barrel local pollution damages from rail transport substantially exceed those from pipelines, dominated by locomotive NOx emissions in populated areas. Foreclosing DAPL increases net local pollution damages by $444,000 per day (the decrease in pipeline-related harm of $144,000/day is more than offset by the increase from rail of $588,000/day). The total cost of blocking DAPL is $45/tonne of CO2 abated — $28/tonne from lost producer surplus and $17/tonne from increased local pollution damages — a figure comparable to the contemporaneous U.S. government social cost of carbon estimate of $42/tonne.&lt;/p&gt;
&lt;p&gt;An upstream production tax achieving the same CO2 reduction costs only $1.01–$2.68/tonne CO2 abated, an order of magnitude less, because it does not induce the distortionary modal shift to rail. Two caveats apply: if 57% of Bakken production reductions leak to other basins, the cost of blocking DAPL rises from $45/tonne to $104/tonne; and if reductions represent production delays rather than permanent reductions, effective abatement is further diminished. The analysis is scoped to Bakken crude oil and land transportation alternatives. The finding that blocking infrastructure increases local pollution is atypical of CO2 abatement policies, which usually generate local pollution co-benefits.&lt;/p&gt;
&lt;p&gt;Q: What is the core economic mechanism by which blocking a pipeline can keep oil in the ground?
A: When a pipeline is foreclosed, crude oil can still move by railroad, but rail transport involves substantial ongoing marginal costs. These costs create a wedge between upstream (Bakken) and downstream (Gulf Coast) prices that depresses upstream supply. Only when downstream prices are high enough to cover both rail marginal cost and this wedge will rail fully substitute for the pipeline; at lower prices, some production is uneconomical and stays in the ground. In the model, this price-depressing wedge is the mechanism that reduces production — but it operates only partially, since rail can substitute for much of the pipeline&amp;rsquo;s flow.&lt;/p&gt;
&lt;p&gt;Q: How much of the blocked pipeline flow substitutes to rail versus stays in the ground?
A: In expectation, blocking DAPL reduces pipeline flows by 306 mbbl/d. Expected crude-by-rail flows increase by 248 mbbl/d, offsetting 81% of the pipeline reduction. Bakken oil production falls by only 58 mbbl/d, or approximately 4%. In a specific simulated month (December 2019), 348 mbbl/d (67%) of the 520 mbbl/d of foregone pipeline flows would still move by rail.&lt;/p&gt;
&lt;p&gt;Q: How are crude-by-rail costs estimated, and what is the role of adjustment costs?
A: The authors estimate a 2SLS model of rail flows on price differentials, allowing for quadratic adjustment costs to capture investments and disinvestments in rail cars and loading facilities. Static marginal costs are $9.49/bbl (East Coast), $12.64/bbl (Gulf Coast), and $8.69/bbl (West Coast). The adjustment cost parameter γ is estimated at $1.28/bbl per mbbl/d, meaning a 10 mbbl/d monthly increase in rail flows raises marginal shipping cost by $12.76/bbl — a substantial share of total rail costs. Adjustment costs are necessary to reconcile the model with the sluggish observed response of rail flows to price differentials.&lt;/p&gt;
&lt;p&gt;Q: What is the structure of the upstream oil supply model and what are its key parameter estimates?
A: The model distinguishes &amp;ldquo;old&amp;rdquo; production from pre-existing wells, which follows exponential decline with estimated decay parameter β = 0.955, and &amp;ldquo;new&amp;rdquo; production from newly drilled wells, which is price-responsive with a total long-run elasticity of 1.32 — comparable to the 1.1–1.2 estimated by Newell and Prest (2019) across major U.S. shale plays. This structure implies that total production is highly inelastic in the short run (dominated by old wells) but responds to persistent price shocks over the long run through changes in drilling rates.&lt;/p&gt;
&lt;p&gt;Q: How do the local pollution damages of rail compare to those of pipeline transport?
A: At a social cost of carbon of $100/tonne, local air pollution damages from rail transport to the Gulf Coast are $1.66/bbl (plus $0.73/bbl in spill/accident costs), versus only $0.35/bbl local pollution (plus $0.11/bbl spills) for pipelines. Locomotive NOx emissions are the dominant factor, both because locomotives have high NOx emission factors and because these emissions often occur in densely populated areas. CO2 damages at $100/tonne SCC are roughly similar across modes ($0.79–0.83/bbl), so local pollution is the key differentiator.&lt;/p&gt;
&lt;p&gt;Q: What is the net welfare impact of foreclosing DAPL, and how is it decomposed?
A: Foreclosing DAPL reduces producer surplus by $716,000/day, increases net local pollution damages by $444,000/day (the $588,000/day increase from rail more than offsets the $144,000/day decrease from pipeline), and reduces CO2 emissions by 25.2 mtonnes/day from the 58 mbbl/d production reduction. The cost per tonne of CO2 abated is $28/tonne from lost producer surplus and $17/tonne from increased local pollution damages, totaling $45/tonne — broadly comparable to the U.S. government&amp;rsquo;s contemporaneous SCC estimate of $42/tonne. This means the policy&amp;rsquo;s abatement cost is approximately equal to the social value of each tonne abated, leaving little or no net social gain even before accounting for leakage.&lt;/p&gt;
&lt;p&gt;Q: How does the model validate against observed data and institutional parameters?
A: The model predicts an expected return to committed DAPL pipeline shipment of $6.17/bbl, which closely matches the actual DAPL tariff for committed shippers of $5.50–$6.25/bbl. The authors also validate simulated crude-by-rail flows against actual flows across destinations. The close match on the tariff is particularly meaningful because it tests the model&amp;rsquo;s equilibrium condition for pipeline capacity investment rather than a within-sample fit.&lt;/p&gt;
&lt;p&gt;Q: How does an upstream production tax compare to blocking DAPL as a policy instrument?
A: A production tax normalized to achieve the same CO2 reduction requires only $3.68/bbl if imposed after shippers have committed to DAPL (holding capacity fixed), or $3.24/bbl if announced before commitments are made (reducing pipeline capacity to 443 mbbl/d). The production tax reduces combined producer surplus and government revenue by only $96,000–$109,000/day versus $716,000/day under the DAPL ban, and reduces local pollution damages by $82,000/day rather than increasing them. The resulting cost per tonne CO2 abated is $1.01–$2.68 — an order of magnitude smaller than the $44.63/tonne for blocking DAPL.&lt;/p&gt;
&lt;p&gt;Q: What is the production leakage caveat and how large is its effect?
A: If blocking DAPL causes Bakken production to fall, production from other U.S. or global oil basins may increase, partially or fully offsetting the CO2 reduction. Following Prest (2022) and Prest et al. (2023), the authors note that if 57% of the Bakken production reduction leaks to other basins, the cost of blocking DAPL rises from $45/tonne to $104/tonne. Leakage would increase the cost per tonne for the upstream tax as well, but the relative advantage of the tax over the pipeline ban is unaffected by this caveat.&lt;/p&gt;
&lt;p&gt;Q: What is the production delay caveat?
A: Even absent leakage, the paper cautions that production reductions from either policy may represent production delays rather than permanent reductions — oil not extracted today may be extracted later as prices rise or technology improves. To the extent that reductions are temporary, the effective carbon abatement is smaller than the authors compute, and the cost per tonne of CO2 abated is correspondingly higher. The paper does not quantify this effect but flags it as a material caveat.&lt;/p&gt;
&lt;p&gt;Q: What institutional features drive pipeline capacity investment and risk allocation?
A: Pipelines are irreversible investments subject to ex-post holdup, so construction financing requires firm ship-or-pay commitments from shippers before construction and before future prices are known, meaning oil price risk is borne primarily by shippers rather than the pipeline owner. Pipeline tariffs are regulated by FERC on a cost-of-service basis. In the DAPL case, shippers executed binding ten-year ship-or-pay contracts in June 2014, and shippers&amp;rsquo; beliefs about future oil prices at that date — calibrated to historical price volatility using an AR(1) process with estimated persistence φ₁ = 0.9925 and volatility σ_G = 0.098 — determine equilibrium capacity investment.&lt;/p&gt;
&lt;p&gt;Q: How does the paper&amp;rsquo;s finding relate to the typical co-benefit structure of climate policies?
A: Most CO2 abatement policies generate local pollution co-benefits (reduced NOx, SOx, particulates), so the abatement cost is partially offset by local pollution gains. Blocking DAPL reverses this: the pipeline-to-rail modal shift increases local pollution damages, making local pollution a cost rather than a co-benefit of the policy. The authors note this is atypical but not unprecedented — urban densification and post-combustion emissions controls in fossil fuel boilers also present CO2–local pollution trade-offs.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Infrastructure foreclosure policy: A &amp;ldquo;keep it in the ground&amp;rdquo; strategy that blocks construction of specialized fossil fuel transportation infrastructure (pipelines) with the aim of inhibiting production of the fuels that would have been transported, without requiring direct acquisition or buyout of mineral rights.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Ship-or-pay agreement: A firm, up-front capacity commitment in which a pipeline shipper agrees to pay for reserved pipeline capacity whether or not they ultimately use it, made before construction and before future prices are realized; the institutional mechanism by which oil price risk is transferred from pipeline owners to shippers.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Crude-by-rail adjustment costs: Quadratic costs modeled as linear in the period-to-period change in rail volumes to a given destination, capturing capital investments and disinvestments in rail cars, loading facilities, and unloading terminals needed to expand or contract crude-by-rail capacity; estimated at $1.28/bbl per mbbl/d of monthly flow change.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Production leakage: The partial or full offset of production reductions in one oil basin (Bakken) by production increases in other U.S. or global basins in response to the same price signals; at 57% leakage, the cost of blocking DAPL rises from $45/tonne to $104/tonne of CO2 abated.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Old-well vs. new-well production dynamics: The distinction between production from pre-existing wells (which follows an exponential decline path insensitive to current prices, β = 0.955) and production from newly drilled wells (which responds to current and lagged upstream prices with long-run elasticity 1.32); this structure makes total short-run supply highly inelastic while allowing substantial long-run price responsiveness through drilling adjustments.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Local pollution damages from NOx: The dominant component of environmental harm from crude-by-rail transport, arising from locomotive NOx emissions that are both large in magnitude and concentrated in densely populated areas along rail corridors; at $100/tonne SCC, monetized local pollution damages from rail exceed CO2 damages for all three coastal destinations, whereas for pipelines CO2 damages exceed local pollution costs.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Cost per tonne of CO2 abated: The authors&amp;rsquo; metric for comparing infrastructure foreclosure to alternative policies; computed as the sum of lost producer surplus and net change in local pollution damages divided by the quantity of CO2 emissions avoided from reduced oil production and consumption; equals $45/tonne for blocking DAPL versus $1.01–$2.68/tonne for an equivalent upstream production tax.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Equal Pay for Similar Work</title><link>https://macropaperwarehouse.com/papers/equal-pay-for-similar-work/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/equal-pay-for-similar-work/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the labor market effects of &amp;ldquo;Equal Pay for Similar Work&amp;rdquo; (EPSW) policies — laws that require firms to pay equal wages to workers of different protected-class identities (e.g., different genders) who perform &amp;ldquo;similar&amp;rdquo; work within a firm. EPSW has become increasingly prevalent: as of January 2023, more of the U.S. workforce falls under state EPSW laws than state &amp;ldquo;Equal Pay for Equal Work&amp;rdquo; (EPEW) laws. Despite this spread, the equilibrium consequences of EPSW were previously unknown.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors develop two theoretical models. The first is a static cooperative game (whose outcomes coincide with the Nash equilibria of a non-cooperative simultaneous-wage-offer game). Homogeneous firms with constant-returns-to-scale production compete for a continuum of heterogeneous workers. Workers belong to one of two groups A or B (e.g., men and women), with group A constituting a β ≥ 1 majority. Each worker&amp;rsquo;s productivity v is drawn from a group-specific distribution (FA or FB); firms&amp;rsquo; willingness to pay equals each worker&amp;rsquo;s productivity, but can embed taste-based discrimination. The analysis is framed as applying &amp;ldquo;within job&amp;rdquo; in a local labor market — only workers performing &amp;ldquo;similar&amp;rdquo; work in the eyes of the law.&lt;/p&gt;
&lt;p&gt;The second model is a dynamic search-and-bargaining framework with an arbitrary number of firms, search frictions, reallocation frictions, and Nash-in-Nash bargaining. EPSW is introduced as a surprise, and constrained firms choose whether to segregate for one group or remain desegregated (paying a common wage to all workers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Theoretical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Without EPSW, Bertrand competition among firms drives every worker&amp;rsquo;s wage to equal her productivity; any wage gap between groups A and B exactly reflects the difference in average productivities (EA(v) − EB(v)), whether or not those productivity differences stem from discrimination.&lt;/p&gt;
&lt;p&gt;With EPSW, the equilibrium is qualitatively transformed. In the static model (Proposition 2), firms generically fully segregate their workforces: one firm hires all A-group workers and the other hires all B-group workers. EPSW functions as an enforcement mechanism for this segregation analogous to location choices in Hotelling&amp;rsquo;s model — poaching a worker from the competing firm is costly because EPSW then requires the poaching firm to pay equal wages to all workers it employs. In the core with EPSW (Proposition 3), the wage gap moves in favor of the majority group (A-group, β &amp;gt; 1) in the sense that all core outcomes except one strictly increase the A-group wage advantage. Moreover, firm profits and the magnitude of the wage gap co-move: firms benefit from selecting equilibria with larger wage gaps. The directional conclusion — EPSW benefits the majority group — holds regardless of the distributions of the two groups&amp;rsquo; productivities, conditional only on β &amp;gt; 1 for the wage gap; for the log wage gap the additional regularity condition βEA[v] &amp;gt; EB[v] is required.&lt;/p&gt;
&lt;p&gt;In the dynamic search model (Proposition 4), all firms eventually segregate under any equilibrium, with the long-run wage ratio moving in favor of the group toward which more firms segregate. Under equitable search and sufficiently low reallocation frictions (Proposition 5), more firms segregate toward the majority group when βEA[v] &amp;gt; EB[v]. Firms that are nearly segregated at the time of EPSW enactment segregate sooner than others (Proposition 6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Setting and Design&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors test these predictions using Chile&amp;rsquo;s 2009 EPSW (Law 20.348), the country&amp;rsquo;s first equal pay law, which prohibited paying women less than men (or vice versa) for similar work. Firms with 10 or more long-term workers at the time of announcement (June 2009) face formal grievance procedures and financial penalties (69–1,384 USD per worker-month of violation); firms below this threshold face no financial penalty, providing a clean threshold-based treatment assignment.&lt;/p&gt;
&lt;p&gt;The data are matched employer-employee administrative records from the Chilean unemployment insurance system covering January 2005 – December 2013, a random sample of approximately 4% of all firms stratified by size. The main estimation sample restricts to firms with 6–13 total workers at announcement (41% of active firms), and the design is a difference-in-differences (event study) comparing treated (≥ 10 long-term workers) to control (&amp;lt; 10 long-term workers) firms. The identifying assumption is parallel trends between similarly sized firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, EPSW increases full gender segregation across firms. The share of fully gender-segregated firms increases by 4.4 percentage points (baseline: 34.3% of firms were fully segregated at announcement). Simultaneously, the share of nearly-but-not-fully segregated firms (majority gender share ∈ [0.8, 1)) declines by 4.0 percentage points — a &amp;ldquo;missing mass&amp;rdquo; of near-segregated firms consistent with the search model&amp;rsquo;s prediction that firms on the margin of full segregation segregate most readily (e.g., by separating the sole worker of the &amp;ldquo;wrong&amp;rdquo; gender). Moreover, firms that are nearly segregated at announcement experience an 8.7 percentage point increase in full segregation post-EPSW, compared to 2.8 percentage points for firms not nearly segregated at announcement.&lt;/p&gt;
&lt;p&gt;Second, EPSW shifts the gender wage gap in favor of the local labor market majority group. In male-majority local labor markets (defined by industry × county), EPSW increases the gender wage gap in favor of men by 4.3 percentage points. In female-majority local labor markets, EPSW decreases the gender wage gap (i.e., in favor of women) by 6.2 percentage points. The wage gap change is primarily driven by reductions in minority-group wages: women&amp;rsquo;s average wages in male-majority markets fall by 3.3 percentage points, and men&amp;rsquo;s average wages in female-majority markets fall by 4.5 percentage points; there are no statistically significant changes in majority-group wages. Because men dominate Chile&amp;rsquo;s overall labor market (approximately 5/6 of all workers are employed in majority-male local labor markets), the overall effect of EPSW is to increase the gender wage gap (in favor of men) by 2.7 percentage points. Pre-treatment coefficients are statistically indistinguishable from zero across all specifications, supporting the parallel trends assumption. These findings are robust across six alternative specifications covering different samples, fixed-effect structures, and controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Theoretical results apply within a set of &amp;ldquo;similar&amp;rdquo; workers in a given local labor market — the paper does not predict differential effects across job types within a firm (e.g., custodians vs. lawyers) that do not perform similar work. Empirical results are identified for firms with 6–13 workers and pertain to Chile&amp;rsquo;s formal sector (informal labor share ~25% in 2009). Predictions on the wage ratio (log wage gap) require the additional regularity condition βEA[v] &amp;gt; EB[v], which is consistent with the Chilean data.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core mechanism by which EPSW leads firms to fully segregate in the static model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: EPSW makes cross-group poaching prohibitively costly. If a firm that hires only A-group workers were to hire even a positive measure of B-group workers, EPSW would — by transitivity — require it to pay the same wage to all workers. This eliminates the firm&amp;rsquo;s ability to exploit productivity heterogeneity across workers; it would have to raise all wages to match the highest worker, destroying profit. As a result, firms segregate in equilibrium to avoid the bite of EPSW entirely: each firm caters to one group, and the within-group wage schedule remains unconstrained. The mechanism is analogous to Hotelling&amp;rsquo;s location model: segregation serves as the enforcement device for avoiding the equal-pay constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the equal profit condition generate a wage gap in favor of the majority group?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In any core outcome under EPSW (Proposition 3), the Equal Profit Condition requires both firms to earn the same total profit. When there are β &amp;gt; 1 A-group workers (more than B-group workers), the firm serving A-group workers must pay higher average wages per worker to extract the same total profit from a larger pool, relative to the firm serving a smaller B-group. This mechanically raises A-group average wages relative to B-group average wages. Crucially, this directional conclusion — EPSW widens the majority-group wage advantage — holds regardless of the shapes of FA and FB, meaning it is robust to any underlying discriminatory or non-discriminatory productivity differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the baseline (without-EPSW) wage gap, and how does EPSW change it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Without EPSW, Proposition 1 establishes that every worker is paid exactly her productivity in any core outcome (full employment, wages = productivity). Therefore, the wage gap equals EA(v) − EB(v) and the wage ratio equals EA(v)/EB(v): any gap reflects only productivity differences (including discrimination embedded in willingness to pay). Under EPSW, Proposition 3 shows that all core outcomes except a single (measure-zero) one strictly widen the wage gap beyond this level. The wage ratio result (Proposition 3, Part 4) requires the additional condition βEA[v] &amp;gt; EB[v] — that the majority group is not sufficiently less productive or more discriminated against to reverse the direction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the dynamic search model modify the static predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the dynamic model (Proposition 4), full segregation is achieved in finite time T in any equilibrium, not instantaneously. Prior to T, firms make sequential segregation decisions; workers displaced by firm desegregation choices are replaced at rate ρ ∈ [0,1]. The long-run wage ratio is determined by the ratio nA/nB — the number of firms segregating toward group A versus B. If nA &amp;gt; nB, the long-run wage ratio moves in favor of A; if nA = nB, the policy has no long-run effect on the wage ratio. The key departure from the static model is that this outcome depends not only on the majority group size but also on search intensities and reallocation frictions (high firm tenure/low d can make segregating toward the majority costly if the firm already employs many minority-group workers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Under what conditions does the dynamic model predict that more firms segregate toward the majority group?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 5 states that for sufficiently large d (fast worker turnover / low reallocation frictions) and equitable search (equal search intensity across firms within a group), the number of firms segregating toward A satisfies nA ∈ [xA−1, xA+1], where xA is defined by an equal-profit condition. Moreover, if βEA[v] &amp;gt; EB[v] (the majority group is collectively more valuable), then nA ≥ nB. Without equitable search, the conclusion holds under more stringent conditions: for any search intensity vector r, there exist d* and β* such that for d &amp;gt; d* and β &amp;gt; β*, any equilibrium yields nA &amp;gt; nB. Empirically, 94% of local-labor-market-by-month units in Chile exhibit more firms segregating toward the majority gender post-EPSW, consistent with these conditions being met.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why do firms that are nearly segregated at announcement respond most strongly to EPSW?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Proposition 6 establishes that firms with a low ratio of minority-group to majority-group search intensity (i.e., nearly segregated in employment) segregate earliest, provided the discount rate is sufficiently low. The intuition is that for a nearly segregated firm, the cost of segregating — separating the few minority-group workers — is small relative to the costs of remaining desegregated (paying a common wage that compresses profit, and being unable to poach new workers). Empirically, firms nearly segregated at announcement (majority gender share ∈ [0.8,1) at announcement) show an 8.7 percentage point increase in full segregation post-EPSW, roughly three times larger than the 2.8 percentage point effect for firms not nearly segregated at announcement. This &amp;ldquo;missing mass&amp;rdquo; pattern (decline in near-segregation matched by increase in full segregation) is also consistent with Proposition 6.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the heterogeneous effect of EPSW on the wage gap by local labor market type?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The empirical design allows the wage gap effect to differ by local labor market (LLM) majority type (male vs. female). In male-majority LLMs (firm industry × county pairs where males comprise more than 50% of workers in June 2009), EPSW increases the gender wage gap in favor of men by 4.3 percentage points (SE = 0.0116). In female-majority LLMs, EPSW decreases the gender wage gap (in favor of women) by 6.2 percentage points (SE = 0.0234). These findings precisely match the theoretical prediction that EPSW benefits whichever group is in the majority of the local labor market. The dynamic event studies show no pre-trends in either subsample; effects begin at announcement (τ = 0) and grow over time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What drives the wage gap change — majority wages rising or minority wages falling?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The change is primarily driven by a reduction in the minority group&amp;rsquo;s average wages, not an increase in majority wages. Women&amp;rsquo;s average wages in male-majority labor markets fall by 3.29 percentage points (SE = 0.0111) in treated versus control firms post-EPSW. Men&amp;rsquo;s average wages in female-majority labor markets fall by 4.45 percentage points (SE = 0.0178) in treated versus control firms post-EPSW. There are no statistically significant changes in the average wages of the majority group of workers within any LLM type. This is consistent with the model&amp;rsquo;s mechanism: segregation reduces competition for minority-group workers (fewer firms competing for them), depressing their wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the aggregate (economy-wide) effect of EPSW on the gender wage gap in Chile?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because approximately 5/6 of all Chilean workers are employed in male-majority local labor markets (men have higher labor force participation, with female labor force participation at roughly 30% in 2009), the overall effect of EPSW is to increase the gender wage gap in favor of men by 2.74 percentage points (SE = 0.0102). This is a net effect that averages the positive (pro-male) gap increase in male-majority markets and the negative (pro-female) gap decrease in female-majority markets, weighted by market sizes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the identification strategy deal with anticipation and compositional changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Treatment status is assigned based on firm size at the time of policy announcement (June 2009) rather than enactment (November 2009), creating an intent-to-treat framework: some &amp;ldquo;treated&amp;rdquo; firms may fall below the threshold by enactment, and some &amp;ldquo;control&amp;rdquo; firms may rise above it, both attenuating the estimates (implying estimated effects are plausible lower bounds). The no-anticipation assumption is supported by the absence of statistically significant pre-trends in either the segregation or wage-gap specifications. To address compositional changes in worker characteristics across LLMs induced by EPSW itself, the wage regressions include time fixed effects interacted with human capital dimensions (education, contract type, age decade) and firm comparison groups, controlling for observable composition shifts. Placebo tests at alternative firm-size thresholds find no statistically or economically meaningful effects, supporting the causal interpretation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does EPSW in Chile compare to EPEW theoretically and in the literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: EPEW requires equal pay only for workers doing exactly equal work, which creates an easily exploitable loophole: firms can proliferate job titles or marginally differentiate duties to avoid compliance. EPSW closes this by requiring equal pay across a coarser &amp;ldquo;similar work&amp;rdquo; category, making evasion harder. Theoretically, the prior EPEW literature (Bhaskar et al. 2002, Kaas 2009, Lagerlöf 2020, Lanning 2014) generated ambiguous directional predictions — equal pay laws could either increase or decrease wage disparities within the same paper. The authors attribute this ambiguity to EPEW models&amp;rsquo; requirement that workers be exactly equally productive. By contrast, EPSW applies across workers with heterogeneous productivities, and the authors derive unambiguous predictions: full segregation and a wage gap shift toward the majority group, both of which are confirmed empirically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the analogy to &amp;ldquo;best-price guarantees&amp;rdquo; in product markets?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper draws a methodological parallel to most-favored-customer (MFC) clauses in product markets. MFC clauses commit firms to rebating past consumers if prices fall, which directly equalizes payments across buyers but unintentionally raises firm market power. In the EPSW setting, the policy plays the role of a best-wage guarantee — but because firms compete for workers, the constraint binds off the equilibrium path. Firms segregate so that no firm is ever exposed to the equal-pay constraint in equilibrium, yet the threat of the constraint (if a firm deviates and hires from both groups) effectively differentiates labor costs across groups, driving the unintended wage effects. This is related to &amp;ldquo;artificial&amp;rdquo; switching costs that create local market power in consumer markets (Klemperer, 1987).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Equal Pay for Similar Work (EPSW):&lt;/strong&gt; A legal constraint requiring that within a firm, workers belonging to different protected-class identities (e.g., different genders) who perform &amp;ldquo;similar&amp;rdquo; work receive equal wages. Distinguished from &amp;ldquo;Equal Pay for Equal Work&amp;rdquo; (EPEW) by its coarser similarity standard, which cannot be evaded by minor job-title differentiation. In the model, this constraint is formalized as: a firm cannot hire positive measures of workers from two different groups such that all workers in one group receive strictly higher wages than all workers in the other group; by transitivity, a firm hiring from both groups must pay almost all workers the same wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Core Outcome:&lt;/strong&gt; The solution concept used in the static model, drawing on cooperative game theory (Shapley–Shubik assignment game). An outcome (specifying which firm hires each worker and at what wage) is in the core if no firm and subset of workers can form a blocking coalition that makes both the firm and each worker in the coalition strictly better off. The paper uses this concept because its pure-strategy Nash equilibrium outcomes (in the associated non-cooperative simultaneous wage-offer game) exactly coincide with the core outcomes under the restriction that firms pay the same wage to all workers of the same type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Full Segregation:&lt;/strong&gt; A labor market outcome in which each firm employs workers from only one group (all A-group workers at one firm, all B-group workers at the other). The paper proves (Proposition 2) that EPSW generically forces full segregation in equilibrium, because any deviation to hire from both groups exposes the firm to the equal-pay constraint. Empirically measured as a binary indicator for whether all workers at a given firm in a given month are of the same gender.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Near Segregation:&lt;/strong&gt; A firm-level state in which the majority gender constitutes 80–99% of the firm&amp;rsquo;s workforce (the majority gender share is in [0.8, 1)). The paper uses this as a complementary outcome to full segregation; theory (Proposition 6) predicts a decline in near segregation post-EPSW because firms in this state face the lowest cost of transitioning to full segregation. Empirically, the near-segregation share falls by 4.0 percentage points post-EPSW, mirroring the 4.4 percentage point rise in full segregation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Labor Market (LLM):&lt;/strong&gt; Defined in the empirical analysis as a firm&amp;rsquo;s geographic county interacted with its industry code, creating 321 × 21 potential cells. The LLM is classified as male-majority or female-majority based on the share of female workers across all firms in the industry-county pair in June 2009. This is the unit at which the &amp;ldquo;majority group&amp;rdquo; for Proposition 3&amp;rsquo;s wage gap prediction is defined, and the level at which the heterogeneous wage effects of EPSW are estimated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equal Profit Condition:&lt;/strong&gt; A necessary condition of any core outcome (with or without EPSW): both firms must earn the same total profit in equilibrium. Under EPSW with full segregation, this condition determines the relative average wages of the two groups — because firm sizes differ (β A-group workers vs. 1 B-group worker), equal profit requires the firm serving the larger group to pay higher average wages, mechanically moving the wage gap in favor of the majority group.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash-in-Nash Bargaining:&lt;/strong&gt; The bargaining protocol used in the dynamic search model, following Horn and Wolinsky (1988). Each bilateral worker-firm bargain splits the available surplus in proportion to exogenous bargaining power parameter Δ ∈ (0,1), taking as given the outcome of all other bilateral bargains. A worker&amp;rsquo;s disagreement point is the wage she would receive from bargaining with the next firm in her search order. This generates the result that a worker&amp;rsquo;s realized payoff is increasing in the number of segregated (non-EPSW-constrained) firms competing for her, connecting firm segregation decisions to wage determination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reallocation Friction:&lt;/strong&gt; In the dynamic search model, represented by a low departure probability d ∈ (0,1) for existing employees. When d is low, firms retain a large fraction of their workforce across periods, making segregation costly because the firm must separate from any existing workers of the &amp;ldquo;wrong&amp;rdquo; group. The paper shows (Proposition 5) that for sufficiently large d (low frictions), the equal-profit condition approximately pins down the number of firms segregating toward each group, and for d above a threshold, the majority group attracts weakly more segregating firms.&lt;/p&gt;</description></item><item><title>Evaluating macroeconomic outcomes under asymmetries: Expectations matter</title><link>https://macropaperwarehouse.com/papers/evaluating-macroeconomic-outcomes-under-asymmetries-expectations-matter/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/evaluating-macroeconomic-outcomes-under-asymmetries-expectations-matter/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates whether and how assumptions about household and firm expectations alter the macroeconomic implications of asymmetries commonly embedded in macroeconomic models. Specifically, it asks: when a model features a nonlinearity — such as an asymmetric monetary policy rule or a nonlinear Phillips curve — do the longer-run average outcomes and the distributional properties of inflation and unemployment depend on whether agents have &lt;em&gt;rational expectations&lt;/em&gt; (RE, accounting for the possibility of future shocks) versus &lt;em&gt;perfect foresight&lt;/em&gt; (PF, not anticipating future shocks)?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper works within a standard three-equation New Keynesian model comprising an IS curve (linking the unemployment gap to the policy rate and the natural rate of interest via Okun&amp;rsquo;s law with coefficient c ≈ 2), a forward-looking Phillips curve, and a monetary policy rule. The model is parameterized at a quarterly frequency with β = 0.99, κ = 0.01, φπ = 1.5, φu = −0.25, shock persistence ρ_rn = 0.9, and shock standard deviation σ_rn = 0.0025 (calibrated to match a 1-percentage-point standard deviation of the unemployment gap under the symmetric baseline rule).&lt;/p&gt;
&lt;p&gt;The key methodological distinction is the specification of the expectations operator. Under RE, agents use the true stochastic transition matrix for the natural rate (approximated via the Rouwenhorst method with 105 grid points). Under PF, agents instead use a transition matrix that always places probability one on the steady-state value of the natural rate next period — i.e., they do not anticipate future shocks. The model is solved globally with a discrete state space projection (parameterized expectations) method, applied identically to RE and PF cases. The authors first derive analytical results in a simplified three-state environment and then present numerical results from 3,000 simulations of 1,000 periods each.&lt;/p&gt;
&lt;p&gt;Two types of asymmetry serve as case studies: (i) an asymmetric monetary policy rule — the &amp;ldquo;Shortfalls rule&amp;rdquo; — under which the central bank does not tighten in response to a tight labor market (negative unemployment gap), in the spirit of the FOMC&amp;rsquo;s 2020 framework update; and (ii) a nonlinear (kinked) Phillips curve that steepens by a factor of three when the labor market is tight (unemployment gap &amp;lt; 0), consistent with empirical evidence in Smith, Timmermann, and Wright (2025).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The core finding is that the sign and magnitude of longer-run average outcomes under asymmetric macroeconomic environments can differ substantially — and can even reverse — depending on whether agents have rational expectations or perfect foresight.&lt;/p&gt;
&lt;p&gt;For the &lt;strong&gt;Shortfalls rule&lt;/strong&gt;, under PF the model implies a longer-run tradeoff: average unemployment gap is −0.32 percentage points and average inflation gap is +0.25 annualized percentage points relative to the symmetric Deviations rule. PF thus suggests policymakers can lower average unemployment at modest inflationary cost. Under RE, however, this apparent tradeoff disappears entirely: the average unemployment gap is essentially zero (−0.05 percentage points) while average inflation is elevated by approximately 1.02 annualized percentage points. The gap in average inflation outcomes between RE and PF thus exceeds one percentage point, and the labor market benefit implied by PF is absent under RE.&lt;/p&gt;
&lt;p&gt;For the &lt;strong&gt;nonlinear Phillips curve&lt;/strong&gt; (under a symmetric deviations rule with φu = 0), the results again diverge across expectations assumptions, and the direction of the effects reverses. Under PF, the kinked Phillips curve implies average inflation of +0.41 annualized percentage points and a near-zero unemployment gap (+0.30 percentage points). Under RE, the average inflation gap is essentially zero while the average unemployment gap rises to +0.63 percentage points — the opposite directional pattern from PF.&lt;/p&gt;
&lt;p&gt;The mechanism driving the RE–PF divergence is the interaction between forward-looking price-setters and an inflation-stabilizing central bank. Under RE, anticipated future episodes in which the asymmetry may bind (e.g., the Shortfalls rule providing accommodation, or the Phillips curve steepening) cause firms to set higher prices today. The central bank responds to the resulting pickup in inflation expectations with tighter policy, generating a persistent contractionary offset. This channel is absent under PF because agents expect no future shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main conclusions are robust across three extensions: (i) &lt;em&gt;Bounded rationality&lt;/em&gt; (following Gabaix 2020, with m_br = 0.97): outcomes move toward the PF case, confirming that what matters is the degree to which agents internalize the probability of future shocks; (ii) &lt;em&gt;Cost-push shocks&lt;/em&gt; instead of natural rate shocks: the RE–PF divergence under a Shortfalls rule is broadly similar in direction and magnitude to the baseline; (iii) &lt;em&gt;Alternative shock specifications&lt;/em&gt;: the qualitative conclusions are maintained.&lt;/p&gt;
&lt;p&gt;Crucially, under the symmetric Deviations rule the RE and PF solutions are identical in all cases, confirming that the divergence is specific to models with macroeconomic asymmetries, not an artifact of the solution method.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central methodological claim about perfect foresight solutions in asymmetric models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper argues that in macroeconomic models with asymmetries or nonlinearities, perfect foresight solutions — in which agents do not account for the possibility that future shocks may occur — can yield longer-run average outcomes and distributions that differ from their rational expectations counterparts in magnitude and potentially in sign. The paper is explicit that this is not a critique of PF methods per se, as PF is often necessary for estimating larger models; rather, the point is that researchers should check the robustness of conclusions about longer-run averages using simplified models solvable under both approaches.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How is the difference between RE and PF operationalized in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The sole technical distinction lies in the specification of the conditional expectations operator Et. Under RE, this operator uses the true stochastic Markov transition matrix for the natural rate (P^RE), which assigns positive probability to all feasible future states. Under PF, agents use a degenerate transition matrix (P^PF) that assigns probability one to the mean value of the natural rate next period regardless of the current state — effectively, agents expect no future innovations. The same global solution method (discrete state space projection with 105 Rouwenhorst grid points) is applied to both, so differences in equilibrium outcomes are entirely attributable to the expectation specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the analytical results for the Shortfalls rule in the simplified three-state model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the simplified environment with the natural rate taking three equiprobable values (low, steady-state, high) and no persistence, the analytical solution shows that under PF the average unemployment gap is −Δ/(1 + φπκ) &amp;lt; 0 and the average inflation gap is Δκ/(1 + φπκ) &amp;gt; 0, where Δ parameterizes the degree of additional accommodation in the high-demand state. Under RE, the average unemployment gap is exactly zero and the average inflation gap is Δ/(φπ − 1) &amp;gt; 0. The inflation gap under RE exceeds that under PF by Δ(1 + κ)/[(φπ − 1)(1 + φπκ)] &amp;gt; 0, and the unemployment gap under RE exceeds that under PF by Δ/(1 + φπκ) &amp;gt; 0. Thus, PF spuriously implies an exploitable long-run tradeoff that vanishes under RE.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the analytical results for the nonlinear Phillips curve in the simplified model, and how do the directions of the effects compare to the Shortfalls rule case?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under PF with a nonlinear (kinked) Phillips curve, the average inflation gap is positive (= Δpc &amp;gt; 0) while the average unemployment gap is zero. Under RE, the signs reverse: the average unemployment gap is positive (= Δpc/κ &amp;gt; 0) and the average inflation gap is zero. The difference is ūRE − ūPF = Δpc/κ &amp;gt; 0 and π̄RE − π̄PF = −Δpc &amp;lt; 0. This sign reversal relative to the Shortfalls rule case illustrates that the directional error introduced by PF is not uniform but depends on the specific asymmetry — the key feature is always the absence, under PF, of the forward-looking price-setting channel interacting with monetary policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the quantitative magnitude of the RE–PF divergence in the numerical model for the Shortfalls rule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the fully parameterized numerical model (Table 2), under a Shortfalls rule the average inflation gap is 1.02 annualized percentage points under RE versus 0.25 annualized percentage points under PF — a difference of roughly 0.77 percentage points. The average unemployment gap is −0.05 percentage points under RE versus −0.32 percentage points under PF — a difference of 0.27 percentage points. The paper also notes that model-implied averages for inflation and nominal interest rates &amp;ldquo;under perfect foresight can easily differ by at least one percentage point from their rational expectations counterparts.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do the simulated distributions differ between RE and PF under a Shortfalls rule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under PF, the simulated distributions of unemployment and inflation gaps exhibit a pronounced kink near the steady-state value (zero gap), reflecting the asymmetric treatment of expansions and contractions. Under RE, the distributions are substantially more symmetric, shifted to the right for inflation (mean of 1.0 versus 0.25 under PF). Standard deviations of the unemployment and inflation gaps are somewhat larger under PF (1.42 and 1.10, respectively) than under RE (1.33 and 1.03), because under RE the contractionary force from inflation expectations moderates the amplitude of fluctuations. These distributional differences have direct implications for how policymakers interpret the risks associated with state-contingent policies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the role of the forward-looking pricing–central bank interaction in generating RE–PF differences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The key mechanism is as follows: under RE, the possibility that the asymmetry may bind in the future (e.g., a positive demand shock triggering more accommodation under the Shortfalls rule, or a tight labor market steepening the Phillips curve) causes forward-looking firms to raise prices today in anticipation of future inflation. This increase in current inflation leads the central bank — whose mandate includes inflation stabilization — to raise policy rates, generating a contractionary offset even when the economy is not currently in the high-demand state. Under PF, agents do not form these anticipatory expectations, so this channel is entirely absent, and the asymmetry affects outcomes only when it directly binds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Does the RE–PF divergence arise under a symmetric Deviations rule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The paper shows analytically and numerically that when the monetary policy rule is symmetric (the Deviations rule, responding equally to deviations above and below target), the RE and PF solutions are identical. Unemployment and inflation gaps are both zero on average under either expectations assumption, and the policy rate gap is essentially zero (0.01 annualized percentage points) in both cases. This equivalence result confirms that the RE–PF divergence is not an artifact of the solution method or parameterization but is specifically generated by the interaction between an asymmetry and agents&amp;rsquo; forward-looking behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What do the bounded rationality results imply about the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The extension following Gabaix (2020), with a myopia parameter m_br = 0.97, produces results that lie between the full-RE and PF cases: the adoption of the Shortfalls rule yields average unemployment of −0.26 percentage points (intermediate between RE&amp;rsquo;s −0.05 and PF&amp;rsquo;s −0.32) and average inflation of 0.62 annualized percentage points (between RE&amp;rsquo;s 1.02 and PF&amp;rsquo;s 0.25). This gradient confirms that the key driver is the extent to which agents internalize the probability of future shocks: the more forward-looking agents are, the more strongly the anticipatory pricing channel operates and the less favorable (and more inflationary) the apparent policy tradeoff becomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the results for the nonlinear Phillips curve in the numerical model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under the numerically calibrated nonlinear Phillips curve model (Panel B.3 of Table 3, with the slope increasing by a factor of three when the unemployment gap is negative), the average unemployment gap under RE is 0.63 percentage points versus 0.30 under PF, and the average inflation gap under RE is essentially zero (0.01 annualized percentage points) versus 0.41 under PF. The authors note that &amp;ldquo;the average outcomes for both unemployment and inflation can differ by roughly 0.3 to 0.4 percentage points between rational expectations and perfect foresight&amp;rdquo; in this case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the paper&amp;rsquo;s advice for researchers who must use perfect foresight methods?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper explicitly states that PF methods remain valuable, especially for estimating or simulating larger models with heterogeneity at the micro level where RE solutions are computationally prohibitive. The authors recommend that researchers relying on PF to solve larger models &amp;ldquo;check the robustness of their conclusions on longer-run averages and the distribution of outcomes using simplified models which can be solved under both perfect foresight and rational expectations.&amp;rdquo; To support this, the authors provide multiple versions of code for solving simple macroeconomic models under various asymmetries and expectations assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the paper position its contribution relative to prior work on RE vs. PF in asymmetric models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper acknowledges that Adam and Billi (2007) and Nakov (2008) previously documented that, at the zero lower bound, households&amp;rsquo; anticipation of future ZLB episodes leads to lower average inflation — an RE–PF difference in the spirit of this paper&amp;rsquo;s findings. However, the paper&amp;rsquo;s contribution is to show that the sign and quantitative implications of a given asymmetry can change depending on the expectations assumption, and to systematically characterize this sensitivity across multiple types of asymmetry (asymmetric policy rules and nonlinear Phillips curves). The paper also categorizes the existing literature by expectations assumptions in Table A.1, showing that many papers examining macroeconomic asymmetries use only one approach.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shortfalls Rule&lt;/strong&gt;: A monetary policy rule, motivated by the FOMC&amp;rsquo;s 2020 Statement on Longer-Run Goals and Monetary Policy Strategy, under which the central bank responds only to shortfalls of employment from its maximum level — i.e., it does not tighten policy in response to a tight labor market (negative unemployment gap) during an expansion. Formally, it = φπ πt + φu ut when ut ≥ 0 (labor market slack), and it = φπ πt only when ut &amp;lt; 0 (labor market tight). Contrasts with the symmetric Deviations rule that responds to deviations of employment in both directions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deviations Rule&lt;/strong&gt;: A symmetric monetary policy rule in which the central bank responds to the unemployment gap regardless of its sign — tightening in expansions and easing in contractions. Serves as the baseline against which the Shortfalls rule is compared, and as the case in which RE and PF solutions are identical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Perfect Foresight (PF) Equilibrium&lt;/strong&gt;: An equilibrium in which agents solve their optimization problems assuming that no future shocks will occur — they expect all endogenous variables to converge to their longer-run (steady-state) values next period, regardless of the current state. In the paper&amp;rsquo;s notation, the PF transition matrix P^PF assigns probability one to the mean state next period. In linear models, PF and RE yield identical outcomes; in models with asymmetries, they diverge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rational Expectations (RE) Equilibrium&lt;/strong&gt;: An equilibrium in which households and firms correctly account for the full stochastic distribution of future shocks in forming their expectations. Agents use the true Markov transition matrix P^RE for the natural rate process. This allows forward-looking pricing behavior to incorporate the possibility that the economy may enter states in which asymmetries bind in the future.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nonlinear (Kinked) Phillips Curve&lt;/strong&gt;: A Phillips curve in which the slope coefficient κ̃t is state-contingent, increasing when the unemployment gap is negative (labor market is tight). In the paper&amp;rsquo;s numerical implementation, the slope triples (κ̃ = 3κ) when ut &amp;lt; 0, consistent with empirical evidence in Smith, Timmermann, and Wright (2025) on structural breaks in the Phillips curve. The nonlinearity generates an asymmetric inflationary response: a given level of unemployment produces more inflation when the labor market is tight than when it is slack.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Steady State&lt;/strong&gt;: The equilibrium to which the economy converges in the absence of additional shocks, taking into account the stochastic nature of the environment (i.e., accounting for the possibility of future shocks). Used as the initial condition for computing impulse response functions under RE. Contrasts with the deterministic steady state (zero gaps), which serves as the initial condition under PF.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Parameterized Expectations (Global Solution) Method&lt;/strong&gt;: The numerical solution algorithm used in the paper to solve for equilibrium policy functions for unemployment and inflation gaps over the state space. Implemented identically for RE and PF cases, differing only in the transition matrix used. Applied with 105 Rouwenhorst grid points for the natural rate. The paper shows this method is orders of magnitude faster than the more common shooting algorithm (0.04 seconds vs. 10.8 seconds) while yielding identical policy functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bounded Rationality (Gabaix 2020)&lt;/strong&gt;: An extension of the baseline model in which agents discount the influence of future expectations by a myopia parameter m_br ∈ (0, 1), applied to both the IS curve and the Phillips curve. The parameter m_br = 0.97 (following McKay, Nakamura, and Steinsson 2017) limits the degree to which distant future states affect current decisions. Produces outcomes intermediate between full RE and PF, confirming that the key dimension of variation is the extent to which agents internalize the probability of future shocks.&lt;/p&gt;</description></item><item><title>Exchange Rates and Asset Prices in a Global Demand System</title><link>https://macropaperwarehouse.com/papers/exchange-rates-and-asset-prices-in-a-global-demand-system/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/exchange-rates-and-asset-prices-in-a-global-demand-system/</guid><description>&lt;p&gt;The paper develops an asset demand system to analyze, jointly and across all countries, how international portfolio holdings and flows, exchange rates, short-term rates, long-term yields, and equity prices are determined in equilibrium. The authors specify a nested logit model of asset demand (substitution across countries within an asset class, and across asset classes) and introduce a new instrumental-variables identification strategy based on the size distribution of countries and bilateral distances; estimating on portfolio-holdings data for 37 countries and three asset classes from 2003 to 2020, they find demand is relatively inelastic, with mean demand elasticities of 27.9 (s.e. 1.9) for short-term debt, 3.2 (0.4) for long-term debt, and 1.2 (1.1) for equity. A variance decomposition attributes 82% of exchange-rate variation, 86% of short-term-rate variation, and 60% of log market-to-book equity variation to &amp;rsquo;latent demand&amp;rsquo; (the residual demand shifter), while portfolio flows (54%) and macro variables (43%) dominate long-term yields. Applying the framework to the European sovereign debt crisis, latent demand explains essentially all of the Italian long-term-yield variation and 74% of the Portuguese, whereas macro fundamentals are relatively more important for Greece (46% vs. 32% for latent demand), which the authors read as consistent with Greece being insolvent while Italy and Portugal were solvent but perceived as vulnerable. Estimating the convenience yield on US assets, they find, in units of expected annual returns, 1.41% on the US dollar, 2.71% on US long-term debt, and 0.50% on US equity. All estimates are specific to their sample, model, and identification assumptions.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-a-global-demand-system-and-what-does-it-explain"&gt;Q1. What is a &amp;lsquo;global demand system&amp;rsquo; and what does it explain?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors represent the equilibrium of an international macro model as an asset demand system and replace traditional optimal portfolios with estimated asset demand functions that match observed international portfolio holdings, so that portfolio flows and shifts in asset demand explain all movements in exchange rates and asset prices.&lt;/strong&gt; This lets them reinterpret the exchange rate disconnect (Meese and Rogoff 1983) as the finding that shifts in asset demand through macro variables explain much less variation than portfolio flows and latent demand, and to identify which countries&amp;rsquo; latent demand matters for exchange rates and asset prices.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-nested-logit-model-of-asset-demand"&gt;Q2. What is the nested logit model of asset demand?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Asset demand follows a nested logit model with substitution across countries in the inner nest and across asset classes in the outer nest, where demand depends on expected returns (asset prices or yields and real exchange rates), macro variables (GDP, GDP per capita, inflation, equity volatility, sovereign rating), bilateral distance (the gravity effect), a domestic-ownership indicator (home bias), and latent demand.&lt;/strong&gt; The nested structure gives more flexible substitution than the logit model of Koijen and Yogo (2019), while latent demand captures heterogeneous beliefs about risk exposure across investors and assets.&lt;/p&gt;
&lt;h3 id="q3-how-are-the-demand-elasticities-identified"&gt;Q3. How are the demand elasticities identified?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors develop an instrumental-variables strategy in which an exogenous component of one investor group&amp;rsquo;s demand shifters generates variation in residual supply that identifies another group&amp;rsquo;s demand elasticity, isolating cross-sectional variation in residual supply from the size distribution of countries and the bilateral distances between them.&lt;/strong&gt; Intuitively, smaller issuer countries in close proximity to larger investor countries have lower residual supply and thus higher asset prices and/or real exchange rates (the example contrasts Dutch with Australian long-term debt).&lt;/p&gt;
&lt;h3 id="q4-what-are-the-estimated-demand-elasticities-and-why-do-they-matter"&gt;Q4. What are the estimated demand elasticities, and why do they matter?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Averaged across years and issuer countries, the mean demand elasticities are 27.9 (s.e. 1.9) for short-term debt, 3.2 (0.4) for long-term debt, and 1.2 (1.1) for equity — so, e.g., a country&amp;rsquo;s aggregate equity demand falls about 1.2% per 1% rise in its price.&lt;/strong&gt; The authors present these as empirical targets for international macro models that rely on inelastic demand and demand shocks unrelated to fundamentals to resolve long-standing puzzles, and they note the estimates are broadly consistent with prior, more granular estimates for narrower sets of countries and asset classes once differences in aggregation and identification are accounted for.&lt;/p&gt;
&lt;h3 id="q5-what-does-the-variance-decomposition-reveal"&gt;Q5. What does the variance decomposition reveal?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Latent demand is relatively more important for exchange rates, short-term rates, and equity prices — explaining 82% of exchange-rate variation (of which foreign-exchange reserves explain 10%), 86% of short-term-rate variation, and 60% of log market-to-book equity variation — whereas portfolio flows (54%) and macro variables (43%) are relatively more important for long-term yields (latent demand explains only about 3%).&lt;/strong&gt; For equity, North American investors explain 13% and European investors 26% of the log market-to-book variation.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-framework-interpret-the-european-sovereign-debt-crisis"&gt;Q6. How does the framework interpret the European sovereign debt crisis?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Applied to extreme long-term-yield movements in Greece, Italy, and Portugal, the decomposition shows macro variables are relatively more important for Greece (46% vs. 32% for latent demand), while latent demand explains all of the Italian and 74% of the Portuguese yield variation, with European investors alone explaining 98% of the Italian and 65% of the Portuguese movements.&lt;/strong&gt; The authors read this as consistent with the narrative that Greece was insolvent while Italy and Portugal were solvent but perceived as vulnerable.&lt;/p&gt;
&lt;h3 id="q7-what-are-the-estimated-convenience-yields-on-us-assets"&gt;Q7. What are the estimated convenience yields on US assets?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Computing counterfactual prices that remove the special demand for US assets, the authors estimate convenience yields, in units of expected annual returns, of 1.41% on the US dollar, 2.71% on US long-term debt, and 0.50% on US equity.&lt;/strong&gt; In the absence of special status, a value-weighted US-dollar exchange rate would be 5.23% higher, the US long-term yield 0.73% higher, and US market-to-book equity 3.35% lower, consistent with the view that the dollar is the global reserve currency and US Treasury debt the global safe asset.&lt;/p&gt;
&lt;h3 id="q8-how-does-the-framework-connect-to-monetary-policy"&gt;Q8. How does the framework connect to monetary policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors note in their conclusion that, because unconventional monetary policy fundamentally concerns changes in the supply of long-term debt and its impact on exchange rates and asset prices through substitution effects, the demand-system approach is suited to study the simultaneous and cumulative impact of conventional and unconventional monetary policy across many countries — and they flag this as a direction for future research rather than a result of the current paper.&lt;/strong&gt; This scope condition matters: the present paper estimates the demand system and its decompositions, not the effects of monetary policy itself.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;asset demand system / demand system asset pricing&lt;/strong&gt; : an approach (introduced in Koijen and Yogo 2019 and here extended to international finance) that estimates asset demand functions on portfolio holdings data and analyzes the equilibrium relation between holdings/flows and prices, in place of traditional optimal portfolios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;nested logit asset demand&lt;/strong&gt; : the specific functional form for demand, with substitution across countries in the inner nest and across asset classes in the outer nest, allowing flexible substitution patterns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;latent demand&lt;/strong&gt; : the residual component of demand shifters — capturing heterogeneous beliefs about risk exposure — that, together with portfolio flows and macro variables, accounts for movements in exchange rates and asset prices; it is the dominant driver of exchange rates and short-term rates in the decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;demand elasticity (inelastic markets)&lt;/strong&gt; : the percentage change in a country&amp;rsquo;s aggregate asset demand per 1% change in its price; the paper&amp;rsquo;s low estimates (especially 1.2 for equity) are offered as empirical targets for &amp;lsquo;inelastic markets&amp;rsquo; macro-finance models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;convenience yield&lt;/strong&gt; : the extra demand for (and hence lower expected return on) US assets owing to their special status as global reserve currency and safe asset; measured here as 1.41% (USD), 2.71% (US long-term debt), and 0.50% (US equity) in expected-annual-return units.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;gravity effect and home bias&lt;/strong&gt; : the empirical regularities that portfolio holdings decline with bilateral distance (gravity) and are tilted toward domestic assets (home bias), which the demand system captures via distance and a domestic-ownership indicator.&lt;/p&gt;</description></item><item><title>Expectation-driven term structure of equity and bond yields</title><link>https://macropaperwarehouse.com/papers/expectation-driven-term-structure-of-equity-and-bond-yields/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/expectation-driven-term-structure-of-equity-and-bond-yields/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What drives the joint historical dynamics of the term structure of equity yields and nominal bond yields — and can a single unified equilibrium model explain the procyclical equity yield slope, the switch in bond-stock correlation from positive to negative after the late 1990s, the maturity-declining predictability of dividend strip returns, and standard aggregate stock market puzzles?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Departure from Prior Literature.&lt;/strong&gt; Existing equilibrium models (habit formation, long-run risk, disaster risk) rely on time-varying risk premia to explain asset prices. Recent survey evidence challenges this: De La O and Myers (2021) show that most aggregate stock price movements are driven by cash-flow growth expectations rather than return expectations, and Van Binsbergen et al. (2013) show that equity yields are driven mainly by dividend growth expectations. This paper constructs an equilibrium model in which equity (bond) yield variation is attributable to subjective dividend growth (GDP growth) expectations, with a constant subjective risk premium implied by CRRA utility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Architecture.&lt;/strong&gt; The representative agent has CRRA utility with risk-aversion coefficient γ = 4 and subjective discount factor β = 1.0065 (calibrated to the average 10-year equity yield). The agent departs from rational expectations by having the &amp;ldquo;belief in the law of small numbers&amp;rdquo; (Tversky and Kahneman 1971): she perceives small samples to represent their population as well as large samples, leading to subjective learning gains that differ from the rational Kalman gain. The subjective belief updating rule is a modified Kalman filter in which the likelihood is exaggerated by factor (1+θ), producing a subjective learning gain ν that exceeds the Kalman gain K when overreaction applies and falls below it when underreaction applies.&lt;/p&gt;
&lt;p&gt;The model has three blocks of fundamentals, each decomposed into a stable and a transitory component. (1) Real GDP growth is decomposed into PCE growth (stable, with a random-walk trend state µ_g) and a volatile gap component (stationary state x_g, persistence ρ_g = 0.941). (2) Inflation is decomposed into core inflation (stable, with trend state µ_π) and a volatile gap (persistence ρ_π = 0.932). (3) Real aggregate dividend is decomposed into a long-duration dividend component dl (levered on log real GDP with leverage λ = 3) and the share of long-duration dividend ds (stationary with persistence ρ_d = 0.94). This cross-sectional decomposition uses firm-level long-term earnings growth (LTG) forecasts from IBES as a model-free equity duration measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation.&lt;/strong&gt; State-space parameters are estimated by maximum likelihood with the Kalman filter on data from NYSE/NASDAQ/AMEX firms (CRSP/Compustat), quarterly, from 1987Q4 to 2019Q4. Subjective learning gains are estimated by minimizing RMSE between model-implied expectations and consensus forecasts: 1-year real GDP growth and inflation from the Survey of Professional Forecasters (SPF, 1981Q3–2019Q4), and 1-year aggregate dividend growth extended from De La O and Myers (2021) to 2019Q4. Equity yield data are from Giglio et al. (2021); bond yields are end-of-quarter zero-coupon nominal yields from Gürkaynak et al. (2007).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Equity Term Structure Dynamics.&lt;/strong&gt; The model&amp;rsquo;s subjective dividend growth expectations drive equity yields. The 1-year model-implied equity yield correlates 0.68 with data; the 10-year correlates 0.79; the 10Y–1Y slope correlates 0.59 with data. Consistent with &amp;ldquo;belief in the law of small numbers,&amp;rdquo; the agent overreacts to dividend news (estimated learning gains νl_d = 0.166 and νs_d = 0.458, both below their Kalman gains, which under the level-to-growth translation implies overreaction to dividend growth news, confirmed by negative CG(2015) regression slope coefficients of −0.69 at 1Y and −0.97 at 5Y).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Procyclical Equity Yield Slope.&lt;/strong&gt; During recessions, the average equity yield slope (10Y–1Y) in the model is −3.77%; during expansions it is +3.96%, matching the data (−5.50% in recessions, +3.93% in expansions). The sign reversal is driven primarily by the dividend-specific component of the decomposition: in recessions, short-run dividend growth expectations fall much more sharply than long-run expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bond Pricing.&lt;/strong&gt; The model&amp;rsquo;s 1-year and 10-year nominal bond yields achieve correlations of 0.92 and 0.95 with their data counterparts, inheriting the explanatory power of Zhao (2020) for the bond market. The agent underreacts to GDP growth and inflation news (estimated learning gains well below Kalman gains, confirmed by positive CG(2015) slope coefficients of +2.08 at 1Y for GDP growth and +1.01 at 1Y for inflation).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bond-Stock Correlation Switch.&lt;/strong&gt; In data, 10Y bond vs. dividend strip return correlation (5Y strip) goes from +0.46 before 2000 to −0.49 after 2000. The model produces +0.14 before and −0.56 after (for the 5Y strip). Decomposing the change in bond-stock return covariance: the &amp;ldquo;inflation real effect&amp;rdquo; (correlation between expected inflation and real growth) accounts for approximately 27–31% of total changes (for 5Y to 10Y strips); the &amp;ldquo;real growth correlation&amp;rdquo; channel — stronger co-movement between real GDP and real dividend growth expectations after 2000 — accounts for approximately 89–95% of total changes. The paper identifies this real bond hedging channel as the dominant and previously unexamined driver.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Dividend Strip Return Predictability.&lt;/strong&gt; The price-dividend ratio predicts annual market excess returns with R² of 10.3% (data) vs. 9.0% (model). Strip return predictability is downward-sloping by maturity: in data, the R² is 20.2% for 5-year strips and 14.5% for 10-year strips; the model generates 14.2% and 10.4% respectively. This is decomposed into three sources: bond return predictability (small contribution), dividend forecast error predictability (dominant for short maturities), and forecast revision predictability (negative contribution that offsets). The downward slope occurs because current news has smaller impact on long-term dividend expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Market Puzzles.&lt;/strong&gt; The model-implied log dividend-price ratio correlates 0.86 with data, with AR(1) coefficient 0.96 (data: 0.95). Model-implied average market return is 9% (data: 8%); annualized return volatility 12% (data: 16%). The model replicates the switch of the bond-stock aggregate return correlation from +0.13 before 2000 to −0.46 after 2000 (data: +0.39 to −0.64).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply to U.S. equity and bond markets over 1987Q4–2019Q4 (with bond learning using data back to 1959Q1). The model assumes a representative agent with CRRA utility and constant subjective risk premium. It is silent on the term structure of expected returns in the statistical sense (which requires identification of latent states under the physical measure). The aggregate market results require a reduced-form specification for stochastic equity duration H_t linked to the value-weighted LTG average.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core psychological mechanism generating subjective beliefs, and how does it differ from the diagnostic expectations approach?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The agent has the &amp;ldquo;belief in the law of small numbers&amp;rdquo; (Tversky and Kahneman 1971): she treats small samples as equally representative of their population as large samples. Formally, this is embedded by exaggerating the likelihood in the Bayesian update: p(x_t|I_t) ∝ p(y_t|x_t)^{1+θ} × p(x_t|I_{t-1}), where θ captures the magnitude of cognitive bias. The resulting subjective learning gain ν = (1+θ)P̃ / [(1+θ)P̃ + σ²_ε] can exceed the Kalman gain K when θ is large (overreaction) or fall below it when θ is small (underreaction). This differs from diagnostic expectations (Bordalo et al. 2019, 2020a,b), which are based on the representativeness heuristic; the paper notes the two notions of news are highly correlated in simulation (Table IA.2) and that both can imply overreaction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the model generate overreaction to dividend growth news even though the dividend-level learning gains are smaller than the Kalman gains?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model separates dividend learning into level and growth. Section 2.2 derives that underreaction to dividend level news (νl_d &amp;lt; Kl_d, νs_d &amp;lt; Ks_d, estimated values 0.166 and 0.458 against Kalman gains 0.19 and 0.49 respectively) translates into overreaction to dividend growth news. This is confirmed by the CG(2015) rationality test: regressing forecast errors on lagged forecast revisions yields slope coefficients of −0.69 (1Y) and −0.97 (5Y) for real dividend growth, both statistically significant (t-statistics −3.63 and −3.22). In contrast, the same test yields positive slope coefficients for GDP growth (2.08 at 1Y) and inflation (1.01 at 1Y), confirming underreaction for these series.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How well does the model match subjective dividend growth expectations in the survey data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model-implied 1-year subjective dividend growth forecast is estimated by minimizing RMSE against the consensus dividend growth forecast series (extended from De La O and Myers 2021 to 2019Q4, with a replication correlation of 0.92 over the overlapping sample). The unconditional correlation between model-implied and data 1-year forecasts is 0.80. Although only 1-year forecasts are used in estimation, the model also achieves a correlation of 0.80 for 2-year forecasts, providing an out-of-sample validation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What explains the higher volatility of short-term equity yields relative to long-term equity yields?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Short-term subjective dividend growth expectations are more volatile because the agent&amp;rsquo;s short-run expectation mean-reverts toward the less volatile long-run (levered) GDP growth expectation. In the model&amp;rsquo;s two-component dividend structure, the transitory dividend-share component xd has persistence ρ_d = 0.94 and its effect on equity yields decays as maturity increases (via the factor (1−ρ^n_d)/n). Similarly, the effect of the transitory GDP growth state x_g decays with maturity. Long-term equity yields are thus anchored by the slower-moving trend components µ_g and µ_d. In the data from Giglio et al. (2021), 1-year yields have a standard deviation of 8.89% annualized vs. 2.70% for 10-year yields; the model generates 8.22% and 1.89% respectively.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the quantitative importance of the &amp;ldquo;real growth correlation&amp;rdquo; channel vs. the &amp;ldquo;inflation real effect&amp;rdquo; channel in explaining the bond-stock correlation switch?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For the switch in bond-stock return correlation (using the 10-year nominal bond and various maturity dividend strips), the decomposition in Table 4 shows that the &amp;ldquo;real growth correlation&amp;rdquo; channel accounts for 89.1% (5Y strip), 92.1% (7Y strip), and 94.8% (10Y strip) of total bond-stock covariance changes, while the &amp;ldquo;inflation real effect&amp;rdquo; (correlation between expected inflation and expected real growth) accounts for 27.3%, 29.3%, and 31.1% respectively. The &amp;ldquo;volatility of shocks to expected inflation and real growth&amp;rdquo; makes a negative contribution (−16.4%, −21.4%, −25.9%), mostly attributable to more volatile beliefs during the 2008 global financial crisis. The real growth correlation channel reflects that after 2000, real bonds provide a better hedge to aggregate real dividend risks because real GDP growth expectations and real dividend growth expectations became more positively correlated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Does the same real growth correlation story hold for the &amp;ldquo;Fed model&amp;rdquo; (bond-stock yield correlation)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes, but with a quantitatively different balance. For yield correlations (Table 5), the &amp;ldquo;real growth correlation&amp;rdquo; channel accounts for 72.4%–80.1% of bond-stock yield covariance changes (5Y to 10Y strip), while the &amp;ldquo;inflation real effect&amp;rdquo; now accounts for 41.2%–43.9%. The inflation real effect is proportionally larger for yield levels because persistent expected inflation correlates strongly with the level of expected real GDP growth — even though inflation expectations do not move fast enough at high frequency to explain return correlation, they co-move strongly with expected growth at low frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the model generate a downward-sloping term structure of return predictability?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The strip excess return is decomposed into three components (Equation 44): maturity-matched bond excess return (Bond), dividend forecast error within the holding period (FE), and forecast revision regarding dividend growth after the holding period (FR). For short maturities, bond predictability contributes little (R² ≈ 6.7% for 5Y strip), while FE predictability (R² ≈ 31.5%) and FR predictability (R² ≈ 35.6%) dominate. As maturity increases, the current news has smaller impact on long-term dividend expectations, reducing the predictability of FE (R² ≈ 26.6% for 10Y) and FR (R² ≈ 26.5% for 10Y). Taken together, total model-implied strip R² declines from 14.2% (5Y) to 10.4% (10Y), matching the data pattern (20.2% to 14.5%). The paper identifies forecast revision predictability as a new channel not previously documented.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Why do forecast errors and forecast revisions have opposite signs in the predictability regressions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bad news (high equity yields, i.e., low current stock prices) triggers excessively pessimistic subjective dividend growth expectations because the agent overreacts to dividend news. These overly pessimistic forecasts tend to be disappointed in the future — actual dividend realizations exceed the forecast — producing positive subsequent forecast errors (FE is positively predicted by high yields, with R² ≈ 31.5% for 5Y strips). However, as dividend levels mean-revert, higher subsequent realizations cause the agent to revise down the forecast for dividend growth thereafter, leading to negative forecast revisions (FR is negatively predicted by high yields, with R² ≈ 35.6% for 5Y strips, opposite sign from FE). The net effect on return predictability is thus a combination of positive (FE) and negative (FR) contributions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the model handle the aggregate market dividend-price ratio and its persistence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The aggregate stock price is modeled as the sum of dividend strip prices up to a stochastic horizon H_t, which is parameterized as a linear function of the value-weighted average of LTG forecasts: H_t = a + b·LTG_t. Parameters a and b are estimated by minimizing RMSE between model-implied and data log dividend-price ratio. The model-implied ratio achieves a correlation of 0.86 with data, an AR(1) coefficient of 0.96 (data: 0.95), and an annualized volatility of 26% (data: 30%). The time-variation is driven entirely by strip yield variations and exogenous LTG movements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Is the overreaction to dividend news and underreaction to GDP/inflation news consistent in a single framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The model&amp;rsquo;s subjective learning framework (based on &amp;ldquo;belief in the law of small numbers&amp;rdquo;) generates both over- and underreaction depending on the estimated subjective learning gain relative to the Kalman gain. For GDP growth and inflation, the learning gains (ν*_g = 0.012, νgap_g = 0.065; ν*_π = 0.049, νgap_π = 0.228) are below their Kalman gains (0.29 and 0.67 for GDP components; 0.67 and 0.48 for inflation components), producing underreaction. The paper hypothesizes this is related to the Fed&amp;rsquo;s dual mandate: agents rationally assign lower weight to GDP and inflation shocks expecting the Fed will stabilize them. For dividend growth, a level-to-growth translation converts level underreaction into growth overreaction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What are the robustness checks, and what do they show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper checks three alternative equity duration measures: those from Dechow et al. (2004), Weber (2018), and Gonçalves (2021b), as well as the book-to-market ratio following Lettau and Wachter (2007). Table IA.1 shows that replacing LTG with these measures still produces model-implied equity yields that replicate key data moments with high time-series correlations. Changing the cross-sectional breakpoint for long-duration dividends from the median LTG to the 40th or 60th percentile leaves results similar. The paper also presents an Internet Appendix extension in which the agent has ambiguity about real GDP and dividend growth (model misspecification fear), yielding equity yields and returns even closer to data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the paper&amp;rsquo;s contribution to the bond market relative to Zhao (2020)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The bond pricing block closely follows Zhao (2020), inheriting its explanatory power for bond market stylized facts. The model&amp;rsquo;s 1-year and 10-year nominal bond yields achieve correlations of 0.92 and 0.95 with data, respectively. The new contribution is the joint model covering both equity and bond markets simultaneously, enabling the decomposition of bond-stock covariance and the identification of the real growth correlation as the dominant driver of the bond-stock correlation switch — a channel not addressed by Zhao (2020), which focused on bond market puzzles alone.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Equity Yield (Dividend Strip Yield).&lt;/strong&gt; Defined as ey^(n)_t = (1/n)(d$_t − p^(n)_t), where p^(n)_t is the log price of the n-period dividend strip (a claim to the nominal dividend n periods ahead) and d$_t is the log nominal aggregate dividend. It decomposes into the bond yield, a subjective dividend growth component, and a (constant) risk premium component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Belief in the Law of Small Numbers.&lt;/strong&gt; A cognitive bias (Tversky and Kahneman 1971) in which the agent perceives small samples to represent their population as well as large samples. Modeled by exaggerating the likelihood in Bayesian updating: p(x_t|I_t) ∝ p(y_t|x_t)^{1+θ} × p(x_t|I_{t-1}). This generates a subjective learning gain ν that can exceed the Kalman gain (overreaction) or fall below it (underreaction) depending on θ and the signal-to-noise ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Subjective Learning Gain.&lt;/strong&gt; The coefficient ν in the subjective Kalman filter update ẽ_t x_t = ρẽ_{t-1}x_{t-1} + ν(y_t − ρẽ_{t-1}x_{t-1}). It equals (1+θ)P̃ / [(1+θ)P̃ + σ²_ε], where P̃ is the subjective predictive variance. When ν &amp;gt; K (the rational Kalman gain), the agent overreacts to news; when ν &amp;lt; K, the agent underreacts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-Duration Dividend Component.&lt;/strong&gt; The portion of aggregate real dividend (dl_t) attributable to &amp;ldquo;long-duration&amp;rdquo; firms — those with above-median analyst LTG forecasts in CRSP/Compustat/IBES data. Levered on log real GDP with leverage parameter λ = 3, it carries aggregate risk. The complementary short-duration dividend share ds_t is stationary and carries no aggregate risk. The decomposition allows the model to exploit cross-sectional cash-flow duration information when learning about future aggregate dividend growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real Growth Correlation Channel.&lt;/strong&gt; A bond-stock covariance component defined as Cov(RGDP^(N), RDIV^(n)), where RGDP^(N) is the real GDP growth expectation component of 10-year nominal bond returns and RDIV^(n) is the real dividend growth expectation component of n-period strip returns. This channel captures whether real bonds hedge aggregate real dividend risks. The paper shows this channel accounts for approximately 89–95% of the post-2000 bond-stock covariance change for dividend strips.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inflation Real Effect.&lt;/strong&gt; The covariance component Cov(INFL^(N)_B, RGDP^(n) + RDIV^(n)), defined as the correlation between shocks to expected inflation (embedded in nominal bond returns) and shocks to expected real growth (in strip returns). In the paper&amp;rsquo;s framework this is distinct from the standard inflation risk premium story, as it concerns the correlation between subjective beliefs rather than realized covariances under the physical measure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forecast Error (FE) and Forecast Revision (FR) Predictability.&lt;/strong&gt; Two of three components of realized strip excess return (Equation 44). FE = ∆d$&lt;em&gt;{t+1:t+h} − ẽ_t∆d$&lt;/em&gt;{t+1:t+h} is the realized dividend growth forecast error within the holding period; FR = (ẽ_{t+h} − ẽ_t)∆d$_{t+h+1:t+n} is the forecast revision for dividend growth beyond the holding period. Because the agent overreacts to dividend news, bad news triggers overly pessimistic forecasts (positive subsequent FE) and, as dividends mean-revert, downward forecast revisions (negative FR). These two have opposite signs in predictive regressions, generating the downward-sloping term structure of return predictability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fed Model.&lt;/strong&gt; The empirical positive correlation between equity yields (real) and nominal bond yield levels. The paper shows that this yield-level correlation switched from strongly positive (≈ 0.85 before 2000) to significantly negative (≈ −0.60 to −0.62 after 2000) for 5Y–10Y dividend strips, and that the same real growth correlation and inflation real effect decomposition applies, albeit with the inflation real effect proportionally larger (≈ 40%) for yield levels than for returns (≈ 30%) because persistent inflation expectations co-move with the level of expected real GDP growth.&lt;/p&gt;</description></item><item><title>Explicit consumption functions with borrowing constraints: A continuous-time approach</title><link>https://macropaperwarehouse.com/papers/explicit-consumption-functions-with-borrowing-constraints-a-continuous-time-approach/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/explicit-consumption-functions-with-borrowing-constraints-a-continuous-time-approach/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks whether an explicit, global, closed-form solution exists for the consumption function in the standard income fluctuation problem with a borrowing constraint and constant income, a problem that has resisted closed-form solution since at least Schechtman (1976). All prior continuous-time work (Park 2006, Holm 2018, Fischer 2024) produced only &lt;em&gt;implicit&lt;/em&gt; expressions; Achdou et al. (2022) produced explicit expressions valid only locally, near zero assets or as assets diverge to infinity, and only for r &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; A single agent with CRRA utility (coefficient of relative risk aversion γ &amp;gt; 0) maximizes discounted utility over an infinite horizon, subject to the flow budget constraint da/dt = ra + y − c, with a borrowing constraint a(t) ≥ 0. The agent receives a constant, deterministic income stream y ≥ 0 and discounts at rate ρ, with the impatience condition ρ &amp;gt; r maintained throughout. The paper takes a continuous-time formulation arrived at by letting the discrete period length Δ → 0, nesting Helpman (1981)&amp;rsquo;s discrete-time analysis as a special case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key analytical device.&lt;/strong&gt; A one-to-one mapping exists between initial assets a and the time T it takes for the consumer to fully run down her assets. This map, denoted T = h(a; y), is well-defined, strictly increasing, and concave in a (established in Proposition 1 via the Hadamard-Lévy theorem). Expressing the optimal consumption function as c*(a; y) = y · exp(ρh(a;y)/γ) evaluated at t = 0 reduces the problem to explicitly inverting the transcendental equation relating a to T.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main result (r = 0).&lt;/strong&gt; For the case of a zero net real interest rate, the transcendental equation can be solved explicitly using the second branch W₋₁(·) of the Lambert W function. The closed-form consumption function is (Theorem 2 and Corollary 2.1):&lt;/p&gt;
&lt;p&gt;c*(a; y) = y · exp(ρ h(a;y) / γ), where h(a; y) = −(a/y + γ/ρ) − (γ/ρ) W₋₁(f(a;y)), and f(a;y) = −exp(−b(a + γy/ρ)/y), b := ρ/γ.&lt;/p&gt;
&lt;p&gt;This is a &lt;strong&gt;global&lt;/strong&gt; solution (valid for all a ≥ 0), in contrast to the local solutions in prior work. The paper notes that for the illustrative parameter values r = 0.01, γ = 0.5, ρ = 0.08, y = 3 (broadly consistent with average U.S. real interest rates in 2025), there is a visually sizable gap between the constrained and unconstrained consumption functions except as a → ∞, where the two converge (in line with the asymptotic linearity result of Benhabib et al. 2015).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main result (r &amp;gt; 0).&lt;/strong&gt; For positive interest rates, the Lambert W function cannot invert a sum of exponentials with different exponents (an open mathematical problem). The paper instead derives a global closed-form &lt;strong&gt;approximation&lt;/strong&gt; valid for r ∼ 0, by expanding e^(−rT) ≈ 1 − rT to first order and applying the same Lambert W inversion. The approximating consumption function has the same structural form but with modified coefficients b_r, c_r, d_r that collapse to their r = 0 counterparts as r → 0 (Proposition 2). Numerical comparison against the implicit-expression solution of Park (2006) confirms the approximation is close for small r.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Characterization of the MPC and supermodularity (Section 3).&lt;/strong&gt; Leveraging the explicit expression, the paper derives the full Jacobian vector and Hessian matrix of c*(a; y) in closed form (Propositions 3 and 4). Key findings, all proved formally and holding under the impatience condition ρ &amp;gt; r:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consumption is increasing in both assets and permanent income&lt;/strong&gt; (both entries of the Jacobian are strictly positive — Corollary 2.2). The second result (∂c*/∂y &amp;gt; 0 for all a) is new for the borrowing-constrained setting; Achdou et al. (2022) provided only suggestive evidence for the limiting case a ∼ 0.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consumption is strictly concave in both assets and permanent income&lt;/strong&gt; (both diagonal entries of the Hessian are strictly negative — Corollary 2.3). Concavity in assets was known (Carroll and Kimball 1996); concavity in permanent income under borrowing constraints is new.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The consumption function is supermodular&lt;/strong&gt;: the cross-derivative ∂²c*/∂a∂y is strictly positive (Corollary 2.3). This means assets and permanent income are complements in generating consumption. Equivalently, the MPC out of permanent income is strictly increasing in the level of initial assets — a counter-intuitive result, since high MPCs are usually associated with poor (low-asset) agents. An identical result was obtained by Commault (2025) for a life-cycle model &lt;em&gt;without&lt;/em&gt; borrowing constraints; the current paper confirms it holds in the presence of a borrowing constraint. By symmetry of the Hessian, the MPC out of assets is also strictly increasing in permanent income.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Intuition for supermodularity.&lt;/strong&gt; When assets are low, an increase in permanent income produces little additional consumption because the risk of hitting the borrowing constraint is high. When assets are higher, the agent has buffer savings, faces a lower constraint-risk, and can smooth the higher future income stream into current consumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results are derived under CRRA utility, constant (deterministic) income, no stochastic variation, and the impatience condition ρ &amp;gt; r. The exact closed form applies to r = 0; the approximation is characterized as valid for r ∼ 0 and is not a local expansion in assets.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the longstanding gap in the literature that this paper addresses?&lt;/strong&gt;
A: Since Zeldes (1989) noted that no closed-form solution exists for the consumption function with stochastic income and CRRA utility, researchers settled for numerical solutions or local analytical approximations. In the constant-income/borrowing-constraint version studied here, Park (2006), Holm (2018), and Fischer (2024) derived only implicit continuous-time expressions. Achdou et al. (2022) gave explicit local solutions valid near a ∼ 0 or a → ∞ under r &amp;gt; 0. No prior work produced an explicit, global closed-form for any case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. Why does moving to continuous time enable progress that discrete time did not?&lt;/strong&gt;
A: In discrete time, the consumption function is piecewise linear (Helpman 1981), with kinks at the sequence of asset thresholds µ(T) for T = 0, Δ, 2Δ, …. As Δ → 0, the piecewise-linear function converges to a smooth function whose governing ODE can be solved analytically. This convergence to smoothness, illustrated in Figure 1, is what enables the application of the Lambert W function to invert the resulting transcendental equation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. What is the role of the Lambert W function, specifically its second branch W₋₁?&lt;/strong&gt;
A: The optimal asset-depletion time T satisfies the transcendental equation e^(bT) = yT + c (for r = 0), which cannot be solved with elementary functions. Via the change of variables z := −bT − bc/y, the equation reduces to ze^z = α, whose solution is z = W(α). The argument α lies in (−1/e, 0) for a ∈ (0, +∞), and it is precisely on this interval that the Lambert W function is double-valued; the relevant branch is W₋₁ (the second, lower branch), which is well-defined and strictly less than −1 on (−1/e, 0). It is the properties of W₋₁ on this domain — specifically that 1 + W₋₁(α) &amp;lt; 0 — that drive the sign conclusions for the Hessian.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. Why does the Lambert W approach fail for r &amp;gt; 0, and what is the approximation strategy?&lt;/strong&gt;
A: For r &amp;gt; 0, Equation (8) contains two exponentials with different exponents — e^((ρ−r)T/γ) and e^(−rT) — and their sum cannot be inverted by the Lambert W function, which handles only a linear-plus-single-exponential structure. Inverting a sum of exponentials with different exponents is stated in the paper to be an open problem. The approximation strategy exploits the fact that for r ∼ 0, e^(−rT) ≈ 1 − rT + o(r), reducing the equation to a single-exponential transcendental form (Equation 15) with modified coefficients b_r, d_r, c_r, all of which converge to their r = 0 analogues as r → 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What does Proposition 1 establish, and why is it necessary before stating the main theorem?&lt;/strong&gt;
A: Proposition 1 establishes that the mapping µ(T) from depletion time T to initial assets a is smooth (infinitely differentiable), bijective (one-to-one and onto) on ℝ₊, and strictly convex. The Hadamard-Lévy theorem then guarantees that its inverse h(a;y) = µ⁻¹(a) exists, is unique, is strictly increasing, and is strictly concave in a. This is a necessary prerequisite for Theorem 2 because h(a;y) is the central object in the closed-form consumption function; without establishing its existence and uniqueness, Theorem 2 would have no well-defined object.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What does the Jacobian characterization (Proposition 3 and Corollary 2.2) contribute?&lt;/strong&gt;
A: Proposition 3 gives explicit formulas for ∂c*/∂a = (ρ/γ) · w/(1+w) and ∂c*/∂y in terms of w = W₋₁(f(a;y)). Corollary 2.2 proves both are strictly positive using the property w &amp;lt; −1 on (−1/e, 0), which ensures w/(1+w) &amp;gt; 0 and that the bracketed term in the expression for ∂c*/∂y is strictly positive. The contribution is that the positivity of ∂c*/∂y for all a was previously unproven in a borrowing-constrained setting with constant income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. What is the structure of the Hessian matrix and what signs do its entries take?&lt;/strong&gt;
A: All four entries of Hc are proportional to w/(1+w)³. Since w &amp;lt; −1, we have 1 + w &amp;lt; 0, so (1+w)³ &amp;lt; 0, making w/(1+w)³ &amp;gt; 0. The diagonal elements ∂²c*/∂a² = −(ρ²/γ²y) · w/(1+w)³ and ∂²c*/∂y² = −(ρ²a²/γ²y³) · w/(1+w)³ are both strictly negative (concavity). The off-diagonal elements ∂²c*/∂a∂y = (aρ²/γ²y²) · w/(1+w)³ are strictly positive (supermodularity/complementarity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What is the precise counter-intuitive implication of supermodularity for MPC heterogeneity?&lt;/strong&gt;
A: Supermodularity (∂²c*/∂a∂y &amp;gt; 0) means the MPC out of permanent income — conventionally associated with low-wealth households — is in fact &lt;em&gt;increasing&lt;/em&gt; in the level of initial assets. This contradicts the conventional narrative that high MPCs are a hallmark of poor agents. The paper&amp;rsquo;s intuition is that low-asset agents face high risk of hitting the constraint, suppressing their consumption response to income news, while high-asset agents can freely smooth the increased income stream. The same supermodularity implies, by the symmetry of the Hessian, that the MPC out of assets is also increasing in permanent income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. How does this result relate to Commault (2025)?&lt;/strong&gt;
A: Commault (2025) proved, in a life-cycle model with a permanent/transitory stochastic income process but &lt;em&gt;without&lt;/em&gt; borrowing constraints, that the MPC out of permanent income is increasing in assets. The current paper obtains the same qualitative finding in the opposite environment — constant income &lt;em&gt;with&lt;/em&gt; a borrowing constraint. The paper treats these as complementary, noting that the result thus appears robust to these different modeling choices.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Q10. What does concavity in permanent income (∂²c&lt;/em&gt;/∂y² &amp;lt; 0) add that was not previously known?&lt;/em&gt;*
A: Carroll and Kimball (1996) established concavity of the consumption function in assets for a broad utility class. Concavity in permanent income — that the marginal consumption response to a windfall increase in y is diminishing — had been proved by Commault (2025) only in the absence of borrowing constraints. The current paper provides the first formal proof of this property in a setting with a borrowing constraint (albeit for constant, deterministic income and CRRA utility in continuous time).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. What is the potential use of these closed-form results for numerical methods?&lt;/strong&gt;
A: The paper notes in the conclusion that the closed-form solutions for r = 0 and the approximation for r ∼ 0 can serve as benchmarks for assessing the reliability of continuous-time numerical methods when computing objects such as the MPC out of assets. Because the exact solution is known analytically, numerical implementations can be compared against it to detect discretization errors or convergence failures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. What parameter values are used to illustrate the consumption function, and what do they imply?&lt;/strong&gt;
A: The paper uses r = 0.01, γ = 0.5, ρ = 0.08, y = 3, where r = 0.01 is described as roughly in line with the average real interest rate in the U.S. in 2025. With these values, Figure 1 shows a visually sizable gap between the constrained and unconstrained consumption functions at low to moderate asset levels, with the two converging as a → ∞ as guaranteed by asymptotic linearity (Benhabib et al. 2015).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Income fluctuation problem (with borrowing constraint):&lt;/strong&gt; The standard infinite-horizon single-agent savings problem in which the agent faces a non-negativity constraint on assets (a(t) ≥ 0), so that the agent cannot borrow. In the paper&amp;rsquo;s formulation: maximize ∫ e^(−ρt)u(c(t))dt subject to da/dt = ra + y − c and a(t) ≥ 0, with constant income y and CRRA utility. The borrowing constraint creates the concavity of the consumption function and was the source of intractability in prior closed-form attempts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lambert W function (second branch W₋₁):&lt;/strong&gt; A special transcendental function defined as the solution to we^w = x. It is double-valued on (−1/e, 0); the second branch W₋₁ takes values strictly less than −1 on this interval. In this paper, the transcendental equation linking initial assets to asset-depletion time is reduced to the form ze^z = α, enabling explicit inversion via W₋₁. The property that 1 + W₋₁(α) &amp;lt; 0 on (−1/e, 0) is the algebraic engine driving all sign results in the Hessian.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asset-depletion time T = h(a; y):&lt;/strong&gt; The time it takes for the optimal consumer to fully run down her initial assets before settling into perpetual income consumption of y. The paper establishes a bijective mapping from initial assets a to depletion time T (Proposition 1); the closed-form solution is obtained by explicitly inverting this mapping. In the paper&amp;rsquo;s formulation, h(a; y) = µ⁻¹(a) where µ(T) is derived from the ODE governing the consumption path.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supermodularity of the consumption function:&lt;/strong&gt; The property that the cross-derivative ∂²c*/∂a∂y is strictly positive, meaning assets a and permanent income y act as complements in generating consumption. This is an equilibrium property of the consumption function (not an assumption on the utility function), and the paper identifies it as new to the income fluctuation literature. It implies the MPC out of permanent income is increasing in a, and the MPC out of assets is increasing in y.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;MPC out of permanent income (∂c&lt;/em&gt;/∂y):&lt;/em&gt;* The marginal increase in current consumption per unit increase in the constant income stream y, holding initial assets constant. This object is less studied than the MPC out of a transient asset windfall. In the paper&amp;rsquo;s setting, it is shown to be strictly positive for all a (Corollary 2.2) and, counter-intuitively, strictly increasing in a (supermodularity).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Global vs. local closed-form solution:&lt;/strong&gt; A global solution holds for all values of the state variable (here, all a ≥ 0), while a local solution is valid only in the neighborhood of a particular value (e.g., a ∼ 0 or a → ∞). Achdou et al. (2022) produced local closed-form expressions; the current paper&amp;rsquo;s Theorem 2 (r = 0) is the first global explicit closed-form for this class of problems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piecewise-linear consumption function (discrete time):&lt;/strong&gt; In Helpman (1981)&amp;rsquo;s discrete-time formulation with period length Δ = 1, the optimal consumption function is piecewise linear in assets, with slope changes at the asset thresholds µ(T) for integer T. As Δ → 0, this becomes a smooth function, enabling the passage to the continuous-time closed form derived in the paper.&lt;/p&gt;</description></item><item><title>Failing Banks</title><link>https://macropaperwarehouse.com/papers/failing-banks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/failing-banks/</guid><description>&lt;p&gt;Correia, Luck, and Verner ask a foundational question in banking: why do banks fail? Specifically, they seek to adjudicate between two theoretical views — the solvency view (failures caused by deteriorating asset quality and insolvency) and the bank runs view (failures caused by depositor coordination failure that can bring down otherwise solvent banks) — using the longest micro-level panel of U.S. commercial bank balance sheets assembled to date.&lt;/p&gt;
&lt;p&gt;The authors construct a panel covering approximately 37,000 distinct banks across two samples: a historical sample of all national banks from 1863 to 1941 (sourced from OCC Annual Reports, digitized via OCR) and a modern sample of all commercial banks from 1959 to 2024 (from FFIEC Call Reports merged with the FDIC failure list). More than 5,000 banks fail across the full sample, with 2,887 failures before 1935 and 2,233 after 1959. The sample spans institutional regimes before and after the Federal Reserve (founded 1913) and the FDIC (founded 1933/1934).&lt;/p&gt;
&lt;p&gt;Three sets of findings emerge. First, failing banks are characterized by deteriorating fundamentals well before failure: rising non-performing loans and declining solvency (equity-to-assets falls by 8 percentage points in the five years before failure in the modern sample), increasing reliance on expensive noncore funding (rising by 18% of assets in the decade before modern-era failures), and a boom-bust pattern in real assets (expanding by 34% from ten years to three years before failure before contracting). These patterns are consistent across the pre-FDIC and modern eras.&lt;/p&gt;
&lt;p&gt;Second, bank failures are highly predictable from publicly available accounting data. Using simple regression models with insolvency risk, noncore funding reliance, and asset growth as predictors, the area under the ROC curve (AUC) for predicting failure within one year reaches 86% in the historical sample and 90–95% in the modern sample. Pseudo-out-of-sample performance is nearly as strong as in-sample performance. A bank in the top 5th percentile of both insolvency risk and noncore funding vulnerability faces a three-year failure probability of 27% in both the historical and modern samples, compared to unconditional rates of 2.5% (historical) and 1% (modern) — a 10- to 25-fold increase.&lt;/p&gt;
&lt;p&gt;Third, while large deposit outflows consistent with bank runs were common in pre-FDIC failures — deposits declined on average by 14% immediately before failure in 1880–1934, and by 21% in the period before the banking holiday — failures with runs are as predictable as failures without runs, and they occur in banks with similarly weak fundamentals. Recovery rates on failed banks&amp;rsquo; assets averaged only 52% of book value in pre-FDIC failures. Using a framework comparing recovery rates to leverage, the majority of pre-FDIC failed banks appear to have been fundamentally insolvent. Even under the extreme assumption of zero value destruction from failure, runs on banks that were not fundamentally insolvent account for fewer than 8% of pre-FDIC failures; under an assumption of 20% value destruction from failure, this share rises to 22%.&lt;/p&gt;
&lt;p&gt;OCC bank examiners classified fewer than 2% of pre-FDIC failures as caused by runs or liquidity issues; most were attributed to losses, fraud, or external shocks. The aggregate failure rate is also largely predictable: regressing the actual bank failure rate on predicted aggregate failure risk yields an R-squared of 40%.&lt;/p&gt;
&lt;p&gt;Scope conditions: the historical sample covers only national banks (market share ranging from ~80% in the 1870s to ~45% in the 1930s); the modern sample excludes de novo banks (younger than three years); deposit outflow data for the historical period begin in 1880; and FDIC failure transaction data for the modern period begin in 1993.&lt;/p&gt;
&lt;p&gt;Q: What are the two main theoretical views the paper evaluates, and how does the paper distinguish between them?
A: The solvency view holds that bank failures are caused by deteriorating asset quality and insolvency, with the runnable nature of liabilities playing no essential causal role. The bank runs view holds that the runnable nature of demandable deposits is central, with depositor coordination failure capable of bringing down otherwise solvent banks (Diamond and Dybvig, 1983) or weak-but-solvent banks (Goldstein and Pauzner, 2005). The paper distinguishes between them using three empirical tests: predictability of failures from fundamentals, deposit outflows before failure, and asset recovery rates in failure.&lt;/p&gt;
&lt;p&gt;Q: How predictable are bank failures, and what does predictability imply for the bank runs view?
A: In the historical pre-FDIC sample (1863–1934), the in-sample AUC for predicting failure within one year is 86%; in the modern sample (1959–2024) it is 90–95%. Pseudo-out-of-sample AUC is nearly as strong as in-sample AUC. High predictability is consistent with the solvency view and fundamental-based panic run models, but is inconsistent with non-fundamental self-fulfilling runs (Diamond and Dybvig, 1983), which should strike randomly. Predictability also cuts against the assumption of rational, forward-looking depositors in fundamental-run models, since attentive depositors would act on observable signals and accelerate failure, reducing predictability.&lt;/p&gt;
&lt;p&gt;Q: What is the boom-bust pattern in failing banks&amp;rsquo; assets?
A: In the decade before failure, failing banks&amp;rsquo; real total assets expand by 34% from ten years to three years before failure, then contract over the final two years. The boom-and-bust pattern is present in both the historical and modern samples but is more pronounced in the modern period. The boom is driven primarily by loan growth (particularly real estate lending and C&amp;amp;I lending in the modern sample) rather than by growth in liquid assets, consistent with the view that rapid credit expansion produces future credit losses.&lt;/p&gt;
&lt;p&gt;Q: How does noncore funding behave in failing banks, and why does it matter?
A: In failing banks in the modern sample, noncore funding (time deposits plus wholesale funding) rises by 18% of assets over the decade before failure, while demand deposits decline as a share of assets. In the historical sample, noncore (wholesale) funding also rises gradually. Noncore funding is a signal of failure for multiple reasons: it is more expensive than core deposits, eroding profitability; it can finance risky asset growth; it reflects realized losses being funded at the margin; and it increases funding fragility, making banks more vulnerable to shocks.&lt;/p&gt;
&lt;p&gt;Q: How strong is the joint signal from insolvency and noncore funding?
A: A bank in the top 5th percentile of both insolvency risk and noncore funding vulnerability faces a three-year failure probability of 27% in the historical sample and 27% in the modern sample. The unconditional three-year failure probability is 2.5% in the historical sample and 1% in the modern sample. This amounts to a 10- to 20-fold increase in failure probability, illustrating that the combination of solvency and funding weakness is a powerful joint predictor.&lt;/p&gt;
&lt;p&gt;Q: Were deposit outflows common before the FDIC, and did they decline after its introduction?
A: In the 1880–1934 historical sample, deposits in failing banks declined on average by 14% between the last call report and failure, with 25% of pre-FDIC failures preceded by outflows exceeding 20%; during the period before the banking holiday the average deposit decline was 21%. In contrast, in the modern sample (1993–2024), average pre-failure deposit outflows were only 2.5%, and outflows exceeding 20% occurred in only 3% of failures, consistent with deposit insurance insulating most depositors.&lt;/p&gt;
&lt;p&gt;Q: Are failures with large deposit outflows (runs) less connected to weak fundamentals than other failures?
A: No. The paper finds that failures with large deposit outflows are as predictable as failures without large deposit outflows. The relationship between insolvency risk or noncore funding and three-year failure probability is similar for failures with and without large deposit outflows. This implies that runs did not disproportionately strike banks with otherwise strong fundamentals.&lt;/p&gt;
&lt;p&gt;Q: What do asset recovery rates reveal about the insolvency status of pre-FDIC failed banks?
A: Recovery rates on pre-FDIC failed banks averaged 52% of book value of assets. Under the extreme assumption that receivership destroys zero bank value, runs on non-fundamentally-insolvent (weak but solvent) banks account for fewer than 8% of pre-FDIC failures. Under the equally extreme assumption that failure destroys 20% of bank value, this share rises to 22%. The majority of pre-FDIC failed banks therefore appear to have been fundamentally insolvent.&lt;/p&gt;
&lt;p&gt;Q: What did contemporary OCC bank examiners attribute as the causes of bank failures?
A: OCC bank examiners classified most pre-FDIC failures as caused by losses, fraud, or external economic shocks. Runs and liquidity issues together account for fewer than 2% of OCC-classified failures, notwithstanding the common occurrence of large deposit outflows before many of these failures. This examiner evidence supports the solvency view.&lt;/p&gt;
&lt;p&gt;Q: Can bank-level fundamentals predict systemic banking crises and aggregate failure waves?
A: Yes. The authors aggregate out-of-sample predicted failure probabilities to construct a predicted aggregate bank failure rate. The R-squared from regressing the actual aggregate bank failure rate on this predicted rate is 40%, indicating that spikes in bank failures during systemic crises are substantially accounted for by the prior deterioration of bank-level fundamentals.&lt;/p&gt;
&lt;p&gt;Q: Why is predictability higher in the modern sample than in the historical sample?
A: The authors identify several reasons. Accounting data quality is higher in the modern sample. Historical national banks operated as unit branches with less geographic diversification, making idiosyncratic shocks more important and harder to predict. Modern-era failures are preceded by larger lending booms that produce more predictable downstream losses. Additionally, in the modern context bank failures are largely supervisory decisions, and frictions in the supervisory process may delay closure and thereby increase predictability.&lt;/p&gt;
&lt;p&gt;Q: What role do the authors assign to depositor inattention?
A: The high predictability of failures combined with the finding that many failing banks had high predicted failure probabilities before actually failing suggests that depositors were often slow to react to observable signals of bank weakness. The authors note this points to behavioral frictions such as neglect of downside risk (Gennaioli et al., 2012) and sleepy or inattentive depositors (Hanson et al., 2015; Jiang et al., 2023), rather than the rational, forward-looking depositor assumption embedded in standard bank run models.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s overall interpretive conclusion about the relative importance of solvency versus runs?
A: The primary cause of bank failures is almost always and everywhere a deterioration of bank solvency. Runs were more common in the historical pre-FDIC data as a mechanism triggering failure, but they typically closed banks that were already fundamentally insolvent. Non-fundamental, self-fulfilling runs on otherwise healthy banks appear to be an uncommon cause of bank failures. Under the solvency view, even when runs occur, they are the trigger and final mechanism rather than the root cause.&lt;/p&gt;
&lt;p&gt;Insolvency risk: A bank&amp;rsquo;s proximity to default, proxied in the historical sample by surplus profits relative to equity (capturing profitability and capitalization) and in the modern sample by net income to assets. High insolvency risk reflects declining profitability and eroding capital buffers.&lt;/p&gt;
&lt;p&gt;Noncore funding: Expensive, risk-sensitive funding sources outside core demand deposits, including time deposits, wholesale funding (bills payable, rediscounts), and non-deposit wholesale borrowings. Banks relying heavily on noncore funding face higher funding costs, reduced profitability, and greater fragility to funding shocks.&lt;/p&gt;
&lt;p&gt;Fundamental run: A run triggered when bank fundamentals are so weak (theta at or below the lower threshold in the Goldstein-Pauzner framework) that all depositors have an incentive to withdraw regardless of others&amp;rsquo; actions — the bank is effectively insolvent and failure is inevitable.&lt;/p&gt;
&lt;p&gt;Panic-based run: A run triggered when bank fundamentals are moderately weak (below the threshold equilibrium in Goldstein-Pauzner) but the bank would have been able to pay all creditors absent the run; the run itself destroys value and causes failure.&lt;/p&gt;
&lt;p&gt;Non-fundamental (self-fulfilling) run: A run on an otherwise solvent bank driven purely by depositor coordination failure, as in Diamond and Dybvig (1983); failure arises from one of two equilibria and is not predicted by fundamentals.&lt;/p&gt;
&lt;p&gt;Recovery rate: Funds ultimately collected by the receiver throughout receivership proceedings divided by the book value of assets at suspension; used as a proxy for the degree of fundamental insolvency at failure. Pre-FDIC recovery rates averaged 52% of book value.&lt;/p&gt;
&lt;p&gt;Area Under the ROC Curve (AUC): A measure of binary classification performance used to quantify the predictability of bank failures; an uninformative predictor has AUC of 0.5, while AUC of 1.0 indicates perfect classification. In this paper, AUC ranges from 86% (historical, one-year horizon) to 95% (modern).&lt;/p&gt;
&lt;p&gt;Boom-bust pattern: The systematic tendency of failing banks to experience rapid loan-driven asset growth in the years preceding failure followed by asset contraction in the final two years before failure — present in both the historical and modern samples, more pronounced in the latter, with real assets expanding by 34% from ten to three years before failure.&lt;/p&gt;</description></item><item><title>Financial Frictions: Micro versus Macro Volatility</title><link>https://macropaperwarehouse.com/papers/financial-frictions-micro-versus-macro-volatility/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-frictions-micro-versus-macro-volatility/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; How do consumer credit spreads — the gap between household borrowing rates and deposit rates — affect aggregate business cycle dynamics and the distribution of consumption across the wealth distribution? And what is the welfare trade-off between macroeconomic stabilization and household-level consumption volatility when bank capital requirements are tightened?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Approach.&lt;/strong&gt; The empirical analysis draws on Danish administrative register data for 2003–2018, combining approximately 15.5 million household-year observations. Income tax return data, which capture housing wealth, portfolio wealth, bank deposits, and bank and mortgage debt, are merged with bank-level reporting of interest rates submitted to Danmarks Nationalbank (MFI data). Household-specific credit spreads are constructed as the difference between the loan rate at a household&amp;rsquo;s primary loan bank and the deposit rate at its primary deposit bank in a given year. Consumption is imputed from household balance sheets following the method of Crawley and Kuchler (2023). The empirical specifications include household and time fixed effects, and quantile regressions are run across bins of the net wealth distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The authors develop a Heterogeneous Agent New Keynesian (HANK) model with explicit banking intermediation. Banks, subject to an agency friction following Gertler and Karadi (2011) — in which bankers can divert a fraction λ = 0.381 of assets — combine household deposits with net worth to invest in corporate equity and consumer loans. This leverage constraint generates an endogenous, countercyclical spread between borrowing and saving rates. Households face idiosyncratic income risk and a kink in their budget constraint at zero net worth due to the spread. The supply side features New Keynesian sticky prices (Rotemberg quadratic adjustment costs) and a Taylor rule. Aggregate shocks include monetary policy surprises, total factor productivity (TFP), and capital quality shocks (affecting bank net worth). The model is solved by first-order perturbation using the method of Bayer and Luetticke (2020) and calibrated to Danish macro and micro moments for 2003–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The average consumer credit spread in Denmark is strongly countercyclical, with a cross-correlation with HP-filtered output of −0.44 in the data (−0.31 in the model).&lt;/li&gt;
&lt;li&gt;Higher credit spreads increase the transition rate into the zero net wealth state for households with moderately positive wealth at the beginning of the year, and reduce the outflow rate for households already at zero net wealth.&lt;/li&gt;
&lt;li&gt;Pooled OLS (with household and time fixed effects) finds that a higher spread is negatively associated with consumption (coefficient −0.266), and the interaction between spread and log income is positive (coefficient 1.366), indicating that higher spreads raise income sensitivity of consumption. For below-median wealth households, the income–consumption link is stronger and the negative spread effect on consumption is larger.&lt;/li&gt;
&lt;li&gt;The consumption-income elasticity derived from quantile regression estimates has a standard deviation of 2.4 percent and a cross-correlation with output of −0.53 when spread variation is incorporated; holding spreads constant roughly halves the volatility (to 1.3 percent) and reduces the countercyclicality (cross-correlation −0.31).&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Model Aggregate Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Consumer credit is procyclical (cross-correlation with output 0.56 in data, 0.67 in model) and more than twice as volatile as output (standard deviation ratio 2.11 in data, 1.51 in model).&lt;/li&gt;
&lt;li&gt;Capital quality shocks and monetary policy shocks are amplified at the aggregate level through a financial accelerator working through endogenous spread movements. TFP shocks generate little spread amplification because households&amp;rsquo; labor supply responses partially insulate banks&amp;rsquo; net worth.&lt;/li&gt;
&lt;li&gt;A 1 percentage point contractionary monetary policy shock leads to a sharp, persistent decline in aggregate output and investment, and is amplified relative to a constant-spread HANK benchmark.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Distributional Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In response to a contractionary monetary policy shock, consumption of households at the 10th percentile of the consumption distribution (who are indebted) falls sharply in the short run, while consumption of the 90th percentile (wealthy households) rises in the short run due to higher returns on savings. The responses converge across the distribution in the medium run as spreads normalize.&lt;/li&gt;
&lt;li&gt;When the consumer credit spread is held constant, consumption paths move in parallel across the wealth distribution, demonstrating that endogenous spread movements are the key driver of distributional effects for monetary policy and capital quality shocks.&lt;/li&gt;
&lt;li&gt;The MPC is countercyclical in the model, with a cross-correlation with output of −0.60 (unconditional), compared with −0.53 for the empirically-estimated consumption-income elasticity. The consumption-income elasticity and MPC are correlated at 90 percent in the model at the annual rate.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Macroprudential Regulation.&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;A tightening of bank capital requirements reducing leverage by 10 percent (diversion parameter λ rising from 0.381 to 0.445) reduces output volatility by 5.5 percent and investment volatility by 10.1 percent, and does so at apparently no long-run aggregate cost in the HANK setting (precautionary savings stimulate output and consumption in the stationary equilibrium).&lt;/li&gt;
&lt;li&gt;However, the regulation increases the annual consumer credit spread by 40 basis points, raises household consumption volatility across the wealth distribution (from about 8 percent to 10 percent for the poorest households under idiosyncratic shocks alone), and generates welfare losses across all deciles equivalent to 0.24–4.28 percent of consumption (with aggregate welfare loss of 0.79 percent).&lt;/li&gt;
&lt;li&gt;When aggregate shocks are included, the lower cyclical sensitivity of spreads partially mitigates welfare losses for the poorest 80 percent of the population, but the overall welfare effect remains negative with an aggregate loss equivalent to 0.58 percent of consumption. The paper thus documents a trade-off between macro volatility (stabilized) and micro volatility (increased).&lt;/li&gt;
&lt;li&gt;Results are robust to the extension of the model to three assets (including illiquid assets), which provides a better fit to micro data without materially changing the welfare conclusions.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the specific Danish dataset used, and how is consumption constructed?&lt;/strong&gt;
A: The dataset covers 2003–2018 from Statistics Denmark administrative registers, combining income tax return data (which report end-of-year balances on all bank accounts, housing wealth, portfolio wealth, bank deposits, bank loans, and mortgage debt) with bank-level MFI interest rate reporting submitted to Danmarks Nationalbank. The total sample is approximately 15.5 million household-year observations (about 1.76–1.97 million households per year). Consumption is imputed as after-tax labor income plus after-tax financial income minus the change in end-of-year net worth, following Crawley and Kuchler (2023). Households with self-employment, housing transactions in the current or prior year, negative imputed consumption, or in the bottom and top 1 percent of wealth or income distributions are excluded.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How are household-specific credit spreads constructed from the administrative data?&lt;/strong&gt;
A: Each household&amp;rsquo;s primary loan bank is defined as the bank where it holds the largest loan balance at end of calendar year, and the primary deposit bank as the one holding the largest deposit balance. The household-specific spread is the difference between the loan rate applied by the primary loan bank and the deposit rate applied by the primary deposit bank, both measured as averages over the calendar year. If a household has no loans, the loan rate of the primary deposit bank is used. This construction yields a household-level interest rate spread that moves countercyclically at the aggregate level (cross-correlation with HP-filtered output of −0.44).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What do the empirical results say about the relationship between spreads and the probability of a household reaching zero net wealth?&lt;/strong&gt;
A: Equation (2) is estimated as a linear probability model for the transition to zero net wealth (defined as net assets within plus or minus two weeks of 2007 median weekly income). Higher spreads significantly increase the transition rate into zero net wealth for households with moderately positive net wealth at the beginning of the year (those in the third to sixth net wealth bins), and reduce the outflow rate from zero net wealth for households already in that state. Higher spreads also appear to increase debt repayments for indebted households (third to fifth bins), making it more difficult for them to accumulate wealth. Households at the extremes of the wealth distribution (very poor or very wealthy) show essentially no sensitivity of transition rates to spread movements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What do the consumption regressions in Table 1 find, and what is the key identification caveat?&lt;/strong&gt;
A: The pooled regression (column 1) finds a positive income–consumption coefficient of 0.372, a negative spread coefficient of −0.266, and a positive income–spread interaction of 1.366, all statistically significant with standard errors clustered at the household level (15,610,327 observations, R² = 0.591). When interacted with below-median wealth (column 2), the income coefficient is larger (0.397 versus 0.335 for above-median), the spread effect is more negative for below-median wealth (−0.362 versus −0.101 for above-median), and the income–spread interaction is stronger for below-median wealth (1.640 versus 0.875). The authors explicitly note that these results should not be given a causal interpretation, as income and consumption are likely jointly determined. Institutional features of the Danish mortgage market (covered bonds, competitive market, rates independent of borrower credit situation) minimize confounding from mortgage rate correlation with consumer credit spreads.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How do the quantile regression results and the derived consumption-income elasticity demonstrate countercyclical MPC?&lt;/strong&gt;
A: Quantile regressions across five-percent bins of the net wealth distribution show that income coefficients decline with wealth (from nearly 0.5 for the poorest to about 0.35 for the wealthiest households), spread coefficients are negative for households with negative, zero, and moderately positive wealth and positive for significantly wealthy households, and the income–spread interaction term is positive for all but the richest households (largest near zero net wealth). The consumption-income elasticity is computed as β₀,ⱼ + β₂,ⱼ × spread at the household level, then averaged cross-sectionally. When only wealth distribution shifts are allowed, the elasticity&amp;rsquo;s standard deviation is 1.3 percent and its cross-correlation with HP-filtered output is −0.31. When spread variation is also incorporated, standard deviation rises to 2.4 percent and the cross-correlation becomes −0.53. This measure is highly correlated (90 percent) with the model MPC, supporting the inference that the MPC is countercyclical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the structure of the banking sector in the HANK model, and how does the agency friction generate a countercyclical spread?&lt;/strong&gt;
A: A continuum of banks combines household deposits with net worth to invest in corporate equity and consumer loans. Bankers can divert a fraction λ = 0.381 of assets, and if they do so, depositors can recover only the remaining fraction (1 − λ). This threat of diversion constrains the supply of deposits, resulting in banks needing to earn excess returns — Et(RK,t+1 − RS,t+1) &amp;gt; 0 — on their assets relative to the deposit rate. The leverage ratio is bounded above by ϱt/λ, where ϱt is a value multiplier that depends on current and expected future excess returns. When an adverse shock (capital quality shock or monetary tightening) reduces banking sector net worth, the leverage constraint tightens, banks reduce asset supply, and the spread between the return on capital (and hence the consumer loan rate, which is proportional to RK at markup ωB = 0.0075) and the deposit rate rises. This generates the observed countercyclical credit spread.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: In the model, how do aggregate shocks affect the distribution of consumption, and why is the monetary policy shock particularly distributional?&lt;/strong&gt;
A: A one-percent capital quality shock reduces both wages and bank net worth, causing spreads to rise. In the baseline economy, rising borrowing rates lead to a large reduction in consumption for indebted households (10th percentile) while the constant spread model shows near-parallel movements across the distribution. A one-percentage-point monetary policy shock reduces equity returns, depressing bank net worth and (with a lag) raising spreads. Indebted households face both lower labor income and higher borrowing costs, producing a sharp consumption decline at the 10th percentile; wealthy households gain from higher returns on savings, so their consumption rises in the short run. Responses converge as spreads return to normal over the medium run. This matches empirical evidence from Holm, Paul, and Tischbirek (2021) for Norway. For TFP shocks, banks&amp;rsquo; net worth is less affected because households&amp;rsquo; higher labor supply partially offsets the productivity decline, so spreads move little and distributional effects are smaller (driven mainly by wage effects across the distribution).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the financial accelerator in the HANK model compare to the RANK version?&lt;/strong&gt;
A: In response to capital quality shocks and monetary policy shocks, the HANK model with banking frictions generates amplification relative to a constant-spread HANK benchmark, confirming the presence of a financial accelerator. However, relative to the RANK model, the incomplete markets model implies slightly less amplification of aggregate investment and consumption. This is because, in the HANK model, households facing higher credit spreads increase their labor supply (precautionary motive), which partially stabilizes aggregate income and moderates the financial accelerator. The finding that heterogeneous agent aspects are less important at the aggregate level is consistent with Berger, Bocola, and Dovis (2020). For TFP shocks, the financial accelerator through spreads is largely absent in both HANK and RANK, as spread changes are minor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the long-run aggregate effects of tightening bank capital requirements (reducing leverage by 10 percent) in the HANK versus RANK model?&lt;/strong&gt;
A: In the RANK model, higher capital requirements increase the annual spread between the return on capital and the deposit rate by 25 basis points, reduce the aggregate capital stock by 2.4 percent, output by 0.5 percent, and aggregate consumption by 0.8 percent. In the HANK model, the spread increases by 40 basis points annually, but the mechanism differs: much of the spread change is absorbed by a reduction in the deposit rate (from 3.81 percent to 3.54 percent annually) rather than an increase in the capital return. Households respond to the lower deposit rate and higher credit costs by increasing precautionary savings and labor supply, so aggregate output and consumption actually rise slightly in the HANK stationary equilibrium. The capital requirements thus appear costless at the aggregate level in the HANK model — but this masks welfare costs that operate through the idiosyncratic risk channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the quantitative welfare costs of macroprudential regulation, and how do they vary across the wealth distribution and between idiosyncratic and aggregate shocks?&lt;/strong&gt;
A: Welfare is measured as the fraction of lifetime consumption households are willing to give up to stay in the unregulated baseline. In the face of idiosyncratic shocks only, welfare losses range from 0.24 to 0.43 percent of consumption for the first seven wealth deciles, and reach 4.28 percent for the richest decile (primarily because of the reduction in the return on their savings), with an average welfare loss of 0.79 percent. When aggregate shocks are added, the losses are substantially reduced for the poorest 80 percent (due to lower cyclical sensitivity of spreads), but remain large for the wealthiest decile (4.23 percent) and in aggregate (0.58 percent). These results are robust to the three-asset model extension, where the poorest households are approximately welfare-neutral under the regulation when aggregate shocks are included (0.00 percent), but aggregate welfare losses remain at 0.75 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the three-asset model extension (with illiquid assets) affect the key results?&lt;/strong&gt;
A: In the three-asset extension, households can hold illiquid capital (calibrated with an adjustment probability of φk = 0.0025 per quarter, targeting the Danish ratio of bank deposits to output of 34 percent), creating wealthy hand-to-mouth households who have illiquid assets but no liquid assets. The consumption impulse responses across the wealth distribution remain very similar to the two-asset baseline: endogenous spread movements generate heterogeneous consumption dynamics in response to capital quality and monetary shocks, while constant-spread models produce near-parallel responses. The three-asset model provides a better fit to the micro data (consumption-spread-income relationship across the wealth distribution), but the welfare conclusions from macroprudential regulation are essentially unchanged: welfare losses across the distribution in the stationary equilibrium, partially mitigated when aggregate shocks are added, with losses concentrated in the richest decile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What robustness checks are reported for the empirical consumption regressions?&lt;/strong&gt;
A: Three robustness exercises are reported. First, capitalizing car purchases using their official tax value (rather than treating car purchases as current expenditure) yields coefficients similar to the baseline (Table 10). Second, excluding households who purchase a car in the current or prior year (reducing the sample to 13.24 million observations) also leaves results unchanged. Third, first-differenced specifications (equation 42, with and without household fixed effects) produce results similar to the levels specification; the main exception is the spread effect for above-median wealth households when household fixed effects are omitted from the differenced specification (Table 11). The income–spread interaction is consistently positive and significant across all robustness checks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What evidence does the paper provide that the model&amp;rsquo;s MPC is countercyclical and that credit spreads are the primary driver?&lt;/strong&gt;
A: Figure 7 shows impulse response functions of the average MPC to each of the three aggregate shocks. In all three cases, the MPC rises in recessions (countercyclical). The key mechanism is that adverse shocks cause spreads to rise, increasing the mass of households at the kink in the budget constraint (zero liquid assets), where MPCs are highest. When the consumer credit spread is held constant, the MPC remains countercyclical but close to constant, indicating that spread movements account for most of the cyclical variation in MPC. Eliminating the spread altogether implies an acyclical MPC (Table 12, Appendix D). The unconditional cross-correlation of the model MPC with output is −0.60, compared with −0.53 for the empirically estimated consumption-income elasticity in the Danish data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Consumer credit spread (borrowing-saving spread):&lt;/strong&gt; In the paper, this is the difference between the gross real interest rate on consumer loans (RL,t) charged by banks and the gross real return on deposits (RS,t) received by savers. It is not an abstract measure of credit conditions but a household-specific, bank-derived rate gap that moves countercyclically due to banking agency frictions and creates a kink in households&amp;rsquo; budget constraints at zero net worth. Distinct from mortgage spreads (which in Denmark are market-determined and independent of borrower credit conditions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Kink in the budget constraint:&lt;/strong&gt; The household budget constraint has a kink at zero net assets because borrowers face RL,t &amp;gt; RS,t; households at exactly zero liquid assets (type IV in the paper&amp;rsquo;s taxonomy) face a discrete jump in the cost of additional borrowing. This kink creates a mass point in the wealth distribution at zero net wealth, and households at this kink have higher MPCs than unconstrained savers or borrowers. The size of the mass point increases when the spread rises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial accelerator (in the HANK-with-banking context):&lt;/strong&gt; The amplification mechanism in which shocks that reduce banking sector net worth tighten banks&amp;rsquo; leverage constraints, raise credit spreads, reduce asset supply to both the corporate sector and households, and further depress investment and consumption — which in turn reduces bank net worth further. In this paper, the accelerator operates through the consumer credit spread channel in addition to the standard corporate lending channel, and is present for capital quality and monetary policy shocks but not materially for TFP shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Countercyclical MPC:&lt;/strong&gt; The MPC — defined as the response of consumption to a small transitory income shock — rises during recessions and falls during expansions in this model. The mechanism is that recessions are associated with higher consumer credit spreads, which expand the mass of households at or near the zero net wealth kink (high MPC), and contract the mass of unconstrained savers (low MPC). This is a distinct source of MPC cyclicality from the wealth distribution channel alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Agency friction (diversion problem):&lt;/strong&gt; Banks can divert a fraction λ of their assets; if they do so, depositors can recover only the fraction (1 − λ) and the bank is liquidated. This threat limits depositors&amp;rsquo; willingness to supply funds, resulting in an incentive-compatibility constraint on bank leverage: assets cannot exceed ϱt/λ (where ϱt is the bank&amp;rsquo;s franchise value multiplier). When ϱt declines (because expected excess returns fall), the constraint binds more tightly and the spread between the return on assets and the deposit rate must be positive to sustain bank participation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Macro versus micro volatility trade-off:&lt;/strong&gt; The paper uses this phrase to describe the finding that tighter bank capital requirements (restricting leverage) reduce the cyclical volatility of aggregate output and investment (macro volatility falls) while simultaneously increasing the volatility of individual household consumption streams due to higher credit spreads and lower deposit returns (micro volatility rises). Welfare costs from increased micro volatility outweigh the aggregate stabilization benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Consumption-income elasticity (d log c / d log y):&lt;/strong&gt; A time-varying cross-sectional average measure derived from quantile regression parameter estimates, equal to β₀,ⱼ + β₂,ⱼ × RSi,t for household i in wealth bin j. It is used in the paper as an empirical proxy for the MPC (not a direct estimate), and is shown to be highly correlated with the model MPC (cross-correlation of 90 percent at the annual rate). Its cyclicality is stronger when spread variation is incorporated (standard deviation 2.4 percent, cross-correlation with output −0.53) than when spreads are held fixed (standard deviation 1.3 percent, cross-correlation −0.31).&lt;/p&gt;</description></item><item><title>Financial Intermediation and Aggregate Demand: A Sufficient Statistics Approach</title><link>https://macropaperwarehouse.com/papers/financial-intermediation-and-aggregate-demand-a-sufficient-statistics-approach/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-intermediation-and-aggregate-demand-a-sufficient-statistics-approach/</guid><description>&lt;p&gt;This paper develops a sufficient statistics approach to measuring the aggregate demand effects of financial intermediation disturbances — shocks to the ability of financial intermediaries to supply credit. The central contribution is characterizing, in a general class of models with heterogeneous firms and financial frictions, the aggregate demand impact of a disruption to intermediary balance sheets as a function of a small set of sufficient statistics observable from data: the elasticity of investment to intermediary net worth, the share of investment financed through intermediaries, and the sensitivity of asset prices to intermediary capacity. The approach does not require full model estimation, allowing model-free measurement of the aggregate demand loss from identified intermediary distress episodes. Applied to the 2008–2009 financial crisis, the paper estimates that the shock to financial intermediary balance sheets generated an aggregate demand reduction of 3–4 percentage points of GDP — substantially larger than estimates from reduced-form regressions that do not account for general equilibrium propagation.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-are-the-key-sufficient-statistics"&gt;Q1. What are the key sufficient statistics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The three sufficient statistics are: (1) the elasticity of investment to intermediary net worth — how much investment falls per dollar of balance sheet loss; (2) the share of investment financed through intermediaries — how broadly the balance sheet shock propagates; (3) the sensitivity of asset prices to intermediary capacity — how much collateral values fall when intermediaries are distressed.&lt;/strong&gt; Together these three moments summarize the aggregate demand impact of a balance sheet shock without requiring the researcher to specify the full structural model.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-sufficient-statistics-approach-give-larger-estimates-than-reduced-form-regressions"&gt;Q2. Why does the sufficient statistics approach give larger estimates than reduced-form regressions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Reduced-form regressions typically compare investment of firms exposed to distressed versus healthy intermediaries, capturing the partial equilibrium direct effect of credit supply reduction; the sufficient statistics approach accounts for the general equilibrium propagation — the fall in asset prices and investment that affects even firms not directly borrowing from distressed intermediaries.&lt;/strong&gt; The 3–4 percentage point estimate includes these spillovers; the reduced-form estimate misses them.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-policy-implication"&gt;Q3. What is the policy implication?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The larger aggregate demand estimate implies that recapitalizing intermediaries during financial crises generates larger macroeconomic benefits than direct-effect estimates would suggest, strengthening the case for bank bailouts, TARP-style capital injections, and central bank emergency lending as counter-recessionary tools.&lt;/strong&gt; The sufficient statistics framework also provides a natural way to compare intervention magnitudes: a policy that restores $X of intermediary capital generates an aggregate demand boost proportional to the measured elasticity.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;sufficient statistics for financial intermediation&lt;/strong&gt; : the small set of model-free moments (investment elasticity to net worth, intermediary financing share, asset price sensitivity) that summarize the aggregate demand impact of intermediary distress, derived in this paper from a general class of heterogeneous-firm models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;general equilibrium propagation&lt;/strong&gt; : the amplification of an intermediary balance sheet shock through asset price declines and economy-wide investment responses, which the sufficient statistics approach captures and reduced-form regressions miss; the source of the larger 3–4 pp GDP estimate relative to partial equilibrium benchmarks.&lt;/p&gt;</description></item><item><title>Financial shocks and leverage of financial institutions: When do they matter?</title><link>https://macropaperwarehouse.com/papers/financial-shocks-and-leverage-of-financial-institutions-when-do-they-matter/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/financial-shocks-and-leverage-of-financial-institutions-when-do-they-matter/</guid><description>&lt;p&gt;This paper investigates the role of leverage of financial institutions in amplifying the transmission of financial shocks to the macroeconomy, with particular attention to whether that amplification differs across economic regimes. The authors develop a new endogenous regime-switching structural vector autoregression (RS-SVAR) model with time-varying transition probabilities, in which the probability of switching regime depends on the contemporaneous state of the economy (endogenous switching). The model extends the Sims and Zha (2006) and Sims, Waggoner, and Zha (2008) Markov-switching SVAR framework by: (1) incorporating a time-varying transition matrix in which the probability of staying in a regime is a logistic function of lagged endogenous variables; and (2) introducing new identification techniques for RS-SVARs, including non-recursive zero restrictions, sign restrictions, and narrative sign restrictions, which can in some cases uniquely identify structural shocks rather than merely set-identify them.&lt;/p&gt;
&lt;p&gt;The leverage measure is market-based — book assets divided by market equity — constructed from CRSP/Compustat institution-level data covering publicly listed depository institutions, bank holding companies, and nonbank financial institutions. The sample runs monthly from December 1988 to December 2019. The five-variable VAR includes industrial production growth, core CPI inflation, the 2-year Treasury rate, market leverage of financial institutions, and the Chicago Fed&amp;rsquo;s National Financial Conditions Index (NFCI). The authors estimate three model variants that substitute in turn the leverage of: (i) all depository institutions, (ii) Global Systemically Important Banks (GSIBs), and (iii) securities brokers and dealers.&lt;/p&gt;
&lt;p&gt;The model identifies two coefficient regimes — a &amp;ldquo;financial constraint&amp;rdquo; regime and &amp;ldquo;normal times&amp;rdquo; — using the criterion that the first regime has higher smoothed probability during September 2008 to August 2009. The financial constraint regime covers the end of the Savings and Loan crisis, the 1990/91 recession, the Russian debt default, the Global Financial Crisis (GFC), and the European sovereign debt crisis.&lt;/p&gt;
&lt;p&gt;The core finding is that real effects of financial shocks are amplified in the financial constraint regime but not in normal times. In the financial constraint regime, the output response to a financial shock is significantly negative, large, and protracted; GSIB leverage initially rises sharply (as falling asset prices erode equity) and then declines as institutions deleverage. In normal times, the output growth response is negative but non-persistent, and market leverage remains insignificant over the entire horizon.&lt;/p&gt;
&lt;p&gt;The counterfactual experiment holding GSIB market leverage constant as of October 2008 is the sharpest quantitative result: if GSIB leverage had not risen further at the onset of the GFC, the decline in industrial production growth would have been approximately 20 percentage points smaller, with a faster subsequent recovery in output growth and inflation and higher short-term interest rates. The counterfactual probability of staying in the financial constraint regime would have fallen as low as 0.1 for some draws, compared to the actual probability remaining elevated. By contrast, for a system using depository institution leverage, the lower-bound counterfactual probability of staying in the constraint regime does not fall below 0.90, indicating substantially weaker heterogeneity effects for the broader depository sector.&lt;/p&gt;
&lt;p&gt;Securities brokers and dealers show leverage that rises more on impact than other institutions and then declines immediately, consistent with their willingness to expand balance sheets going into the crisis amplifying losses and forcing a sharp post-crisis contraction.&lt;/p&gt;
&lt;p&gt;A separate counterfactual holding the NFCI constant (rather than leverage) shows that the probability of staying in the constraint regime does not decline, confirming that market leverage and the financial conditions index provide distinct characterizations of the financial system and have different implications for shock propagation and regime persistence. Results are robust to substituting the GZ corporate spread for the NFCI and to imposing narrative restrictions for shock identification.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question?
A: The paper asks whether and how the leverage of financial institutions amplifies the transmission of financial shocks to the real economy, and whether this amplification differs between a financial constraint regime and normal times. A secondary question concerns heterogeneity: do GSIBs, depository institutions broadly, and nonbank securities dealers transmit shocks differently?&lt;/p&gt;
&lt;p&gt;Q: What is novel about the econometric framework?
A: The RS-SVAR model allows the probability of remaining in a given coefficient regime to vary over time as a logistic function of lagged endogenous variables, so regime switching is endogenous to the state of the economy rather than governed by a fixed transition matrix. The paper also introduces sign restrictions, zero restrictions, and narrative sign restrictions into the RS-SVAR class, enabling identification of both structural shocks and regimes within a single framework; in roughly 20 percent of posterior draws these sign restrictions uniquely identify the financial shock.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper use market leverage rather than book leverage?
A: Market leverage (book assets divided by market equity) is argued to be more timely than book leverage because book equity incorporates losses with a delay, giving institutions time to adjust book leverage to avoid regulatory limits. Market capitalization reflects market participants&amp;rsquo; assessment of an institution&amp;rsquo;s creditworthiness, and low market-to-book ratios signal that institutions are more leveraged than their books indicate. Market leverage is therefore a more informative early-warning indicator of financial fragility and the need for rapid deleveraging.&lt;/p&gt;
&lt;p&gt;Q: How are the two regimes identified?
A: For each estimated regime, the authors count the number of months between September 2008 and August 2009 (inclusive) for which the smoothed probability of being in that regime exceeds 0.70; the regime with the higher count is labeled &amp;ldquo;financial constraint&amp;rdquo; and ordered first. Shock identification uses sign restrictions: in the financial constraint regime, a positive financial shock must have a contemporaneously negative effect on output, inflation, and the short-term interest rate, but positive effects on the financial conditions index and leverage; in normal times, only the financial conditions index is required to respond positively on impact.&lt;/p&gt;
&lt;p&gt;Q: What regimes does the model assign historically?
A: The smoothed probability of the financial constraint regime is elevated during the end of the Savings and Loan crisis, the 1990/91 recession, the Russian debt default, the GFC and associated recession (where the probability reaches 1.0 at end-2008 and beginning-2009 before declining sharply to approximately 0.6 percent in 2009/2010), and the European sovereign debt crisis.&lt;/p&gt;
&lt;p&gt;Q: What do the impulse responses show in the financial constraint regime?
A: In the financial constraint regime, the output response to a positive financial shock (tightening) is significantly negative, large, and protracted. GSIB leverage initially rises due to a sharp decline in asset prices eroding market equity, then falls as GSIBs deleverage in response. The authors interpret this pattern as evidence that deleveraging produces procyclical financial amplification effects with adverse real consequences.&lt;/p&gt;
&lt;p&gt;Q: What do the impulse responses show in normal times?
A: In normal times, the output growth response is large and negative but non-persistent, in contrast to the financial constraint regime. Market leverage remains statistically insignificant across the entire horizon in normal times, indicating that the leverage amplification channel is inactive outside of financial constraint episodes.&lt;/p&gt;
&lt;p&gt;Q: What does the GSIB leverage counterfactual show quantitatively?
A: Holding GSIB market leverage constant as of October 2008 implies a decline in industrial production growth that is approximately 20 percentage points smaller than actually occurred, along with a faster recovery in output growth and inflation and higher short-term interest rates. The counterfactual probability of staying in the financial constraint regime declines to as low as 0.1 for some posterior draws, compared to remaining elevated in the actual data.&lt;/p&gt;
&lt;p&gt;Q: How do depository institutions compare to GSIBs in the counterfactual?
A: For the model using broad depository institution leverage, the lower-bound counterfactual probability of staying in the financial constraint regime does not fall below 0.90, compared to as low as 0.1 for the GSIB specification. This implies that GSIB deleveraging has substantially more detrimental macroeconomic effects and a much larger effect on regime persistence than the broader depository sector.&lt;/p&gt;
&lt;p&gt;Q: What is distinctive about securities brokers and dealers?
A: Broker-dealer market leverage rises more on impact than leverage of other financial institutions following a financial shock, and then immediately declines due to rapid deleveraging. The authors interpret this as reflecting that dealers&amp;rsquo; willingness to expand balance sheets ahead of the crisis amplified growth and losses, followed by a sharp post-crisis contraction — a pattern consistent with the procyclical leverage mechanism described in Adrian and Shin (2014).&lt;/p&gt;
&lt;p&gt;Q: How do the authors distinguish the role of market leverage from the financial conditions index?
A: A counterfactual holding the NFCI constant (rather than leverage) as of October 2008 shows that the probability of staying in the financial constraint regime does not decline, unlike the leverage counterfactual. This demonstrates that market leverage and the NFCI provide distinct characterizations of financial conditions and have different implications for the propagation of shocks and the persistence of the constraint regime.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results?
A: Substituting the GZ corporate bond spread for the NFCI yields very similar results, specifically that the probability of staying in the constraint regime declines much more in the counterfactual than in the actual data, suggesting the findings are not driven by the choice of financial conditions proxy. Imposing narrative restrictions for shock identification (exploiting the known high-stress period around Lehman&amp;rsquo;s failure in September 2008) yields results that are &amp;ldquo;rather robust&amp;rdquo; relative to the baseline sign-restriction identification.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The results confirm the leverage ratio as a useful financial stability indicator, with particular emphasis on market leverage as providing timely information for monitoring. The heterogeneity findings suggest that regulatory attention to GSIB leverage is especially warranted, since GSIB deleveraging can have substantially more detrimental macroeconomic effects and a much larger influence on the persistence of financial constraint regimes than deleveraging by the broader depository sector. The leverage ratio is characterized as complementary to the risk-weighted capital ratio as a regulatory tool.&lt;/p&gt;
&lt;p&gt;Market leverage: Measured as book assets divided by market equity (not book equity), constructed from CRSP/Compustat institution-level data at monthly frequency. The paper argues market leverage is more timely than book leverage because market equity immediately reflects losses, preventing institutions from masking fragility through delayed book adjustments.&lt;/p&gt;
&lt;p&gt;Financial constraint regime: One of two identified coefficient regimes in the RS-SVAR, characterized by a significantly negative, large, and protracted output response to financial shocks and by active leverage amplification. Identified empirically as the regime with the highest smoothed probability during September 2008 to August 2009.&lt;/p&gt;
&lt;p&gt;Endogenous regime switching: A modeling approach in which the probability of transitioning between regimes depends on lagged values of the endogenous variables themselves (via a logistic function), rather than being governed by a fixed constant transition matrix. This allows regime dynamics to respond to the state of the economy.&lt;/p&gt;
&lt;p&gt;Time-varying transition probabilities: The diagonal elements of the coefficient-regime transition matrix follow a logistic transformation of a linear function of lagged endogenous variables, so the probability of remaining in any given regime changes each period as a function of current financial and macroeconomic conditions.&lt;/p&gt;
&lt;p&gt;Procyclical financial amplification: The mechanism by which financial institution deleveraging in response to falling asset prices further tightens financial conditions and reduces real output, generating a feedback loop. The paper provides empirical evidence for this channel operating specifically in financial constraint regimes.&lt;/p&gt;
&lt;p&gt;Heterogeneity of financial institutions: The finding that GSIBs, broad depository institutions, and securities brokers and dealers differ substantially in how their leverage affects the transmission of financial shocks. GSIB deleveraging is shown to have much more detrimental macroeconomic effects and a much larger influence on the probability of remaining in the financial constraint regime than depository institution deleveraging more broadly.&lt;/p&gt;
&lt;p&gt;Narrative sign restrictions in RS-SVARs: An identification technique extended from Antolin-Diaz and Rubio-Ramirez (2018) to the regime-switching context, which uses known historical episodes (here, the Lehman failure in September 2008) to impose restrictions on which regime the economy was in or on the sign of structural shocks at particular dates, thereby aiding identification of both shocks and regimes.&lt;/p&gt;</description></item><item><title>Firm Accommodation After Workplace Disability: Labor Market Impacts and Implications for Subsidy Design</title><link>https://macropaperwarehouse.com/papers/firm-accommodation-after-workplace-disability-labor-market-impacts-and-implications-for-subsidy-design/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-accommodation-after-workplace-disability-labor-market-impacts-and-implications-for-subsidy-design/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies (1) how firm accommodation decisions respond to financial incentives in the context of workplace disability under workers&amp;rsquo; compensation, (2) what the causal effect of accommodation is on workers&amp;rsquo; subsequent labor market outcomes, and (3) whether the equilibrium level of accommodation is socially efficient, and what the welfare implications of wage subsidies for accommodation are.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Context and Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The analysis uses the universe of Oregon workers&amp;rsquo; compensation claims from 2005 through 2017 — over 131,000 disabling claims — linked to longitudinal quarterly earnings records from the Oregon Employment Department. The setting exploits Oregon&amp;rsquo;s Employer at Injury Program (EAIP), which subsidizes employers who provide &amp;ldquo;transitional work&amp;rdquo; accommodations (primarily through wage subsidies) to workers with temporary workplace disabilities. EAIP accounts for roughly 25 percent of claims on average, with the wage subsidy component representing over 96 percent of EAIP expenses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors exploit a policy change in July 2013 that reduced the EAIP wage subsidy rate from 50 percent to 45 percent. They construct a firm-level &amp;ldquo;exposure&amp;rdquo; measure — the fraction of a firm&amp;rsquo;s claims that used EAIP in a baseline period (2005–2009) — and estimate a continuous difference-in-differences specification in which the interaction of exposure and a post-2013 indicator instruments for accommodation. The identifying assumption is strong parallel trends: firms with low baseline exposure are unlikely to respond to the subsidy reduction, while high-exposure firms respond more, generating cross-firm variation in accommodation rates after 2013. An MTE framework (Heckman and Vytlacil 2005) is then used to explore heterogeneous treatment effects along an unobserved resistance-to-treatment dimension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;The subsidy reduction from 50% to 45% decreased accommodation rates by &lt;strong&gt;2.9 percentage points&lt;/strong&gt; (9.3 percent) for claims in firms with average exposure, implying a subsidy elasticity of accommodation of 0.9.&lt;/li&gt;
&lt;li&gt;The policy change led to a &lt;strong&gt;0.95 percentage point decrease in employment&lt;/strong&gt; and a &lt;strong&gt;$120 decrease in quarterly earnings&lt;/strong&gt; four quarters after disability for claims in average-exposure firms (roughly 1.3–1.5 percent declines relative to means), with no significant effect on worker turnover to other firms.&lt;/li&gt;
&lt;li&gt;IV estimates of the effect of accommodation itself (using predicted EAIP as instrument) show &lt;strong&gt;accommodation increases the probability of employment four quarters after disability by 33 percentage points&lt;/strong&gt; and &lt;strong&gt;increases quarterly earnings by approximately $4,100&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;The MTE analysis reveals &lt;strong&gt;negative selection on gains&lt;/strong&gt;: workers with workplace disabilities who are least likely to receive accommodation have the highest potential gains from it, driven largely by severe disabilities with high accommodation costs.&lt;/li&gt;
&lt;li&gt;Descriptive and IV evidence is consistent with accommodation operating primarily as &lt;strong&gt;general human capital investment&lt;/strong&gt;: accommodation has no statistically significant effect on the probability of moving to a new firm, and earnings gains are not systematically lower for workers who change employers after accommodation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Structural Model and Counterfactual Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A two-period frictional labor market model with risk-averse workers, risk-neutral firms, Nash bargaining, imperfect experience rating in workers&amp;rsquo; compensation, and firm accommodation as human capital investment is developed and estimated. Two inefficiency sources are identified: (1) a human capital externality — because accommodation builds general human capital, firms cannot capture the full surplus when workers separate, reducing accommodation incentives; and (2) a fiscal externality — imperfectly experience-rated firms do not fully internalize the workers&amp;rsquo; compensation cost savings from accommodation, further depressing it below the efficient level. Counterfactual simulations show:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Eliminating wage subsidies (from 50% to 0%) reduces accommodation rates from &lt;strong&gt;33% to 11%&lt;/strong&gt;, leading to a &lt;strong&gt;7% decline in post-disability employment&lt;/strong&gt; and a &lt;strong&gt;15% decline in post-disability quarterly wages&lt;/strong&gt; (roughly $1,358).&lt;/li&gt;
&lt;li&gt;A revenue-neutral reform eliminating wage subsidies reduces average welfare and the welfare of &lt;strong&gt;more than 90% of workers&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Welfare gains from the subsidy are &lt;strong&gt;larger for low-skilled workers&lt;/strong&gt; than high-skilled workers.&lt;/li&gt;
&lt;li&gt;Conditional on experiencing disability, eliminating wage subsidies decreases welfare by about &lt;strong&gt;10%&lt;/strong&gt;, while increasing the subsidy to 100% raises welfare for disabled workers by around &lt;strong&gt;30%&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Firm profit is maximized at a subsidy rate around 80%, after which higher taxes offset accommodation gains.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the Employer at Injury Program (EAIP), and how does it differ from standard workers&amp;rsquo; compensation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: EAIP is an optional component of Oregon&amp;rsquo;s workers&amp;rsquo; compensation system that subsidizes employers for the costs of accommodating workers with temporary disabilities during a transitional return-to-work period. Unlike standard workers&amp;rsquo; compensation premiums (which are experience-rated at the firm level), EAIP is funded through a flat payroll tax on all firms that is not experience-rated — meaning firms that use EAIP do not pay higher premiums. The wage subsidy component accounts for over 96 percent of EAIP expenses; other reimbursable costs (worksite modifications up to $5,000, retraining up to $1,000, clothing up to $400) are rarely used. Eligible employers must be the employer at which the disability occurred, and accommodation is limited to a transitional period during which workers cannot simultaneously receive time-loss benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How is firm-level &amp;ldquo;exposure&amp;rdquo; constructed, and what is the rationale for using it as an instrument?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Exposure is the fraction of a firm&amp;rsquo;s workers&amp;rsquo; compensation claims that used EAIP during a five-year baseline period from 2005 to 2009 — a separate historical period chosen to reduce volatility and avoid mean-reversion. The rationale draws on prior work (Aizawa et al., 2022) showing that firm fixed effects account for nearly 25 percent of variation in accommodation, far more than worker or disability characteristics (1 and 3 percent, respectively), suggesting permanent firm-level heterogeneity in the relative benefits and costs of accommodation. Firms with zero historical exposure are unlikely to change accommodation behavior in response to a subsidy reduction, while high-exposure firms respond more, creating differential quasi-experimental variation in accommodation rates after July 2013.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the first-stage and reduced-form results from the DID specification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: The first-stage DID coefficient shows that a ten-percentage-point increase in exposure is associated with a one-percentage-point decrease in EAIP take-up after 2013, implying a 2.9 percentage point decrease for claims in firms with average exposure (mean 0.27). The corresponding reduced-form results show a 0.35 percentage point decrease in employment four quarters post-disability and a $45 decrease in quarterly earnings for every ten-percentage-point increase in exposure, scaling to 0.95 percentage points and $120 at average exposure. There is no statistically significant effect on the probability of moving to a new firm. Pre-trend tests show parallel accommodation trends across exposure terciles prior to 2013, supporting the identifying assumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What do the IV estimates imply about the causal effect of accommodation on labor market outcomes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: Under the exclusion restriction that the subsidy change affects labor market outcomes only through accommodation, the IV estimates imply that receipt of accommodation increases the probability of employment four quarters after disability by &lt;strong&gt;33 percentage points&lt;/strong&gt; (against a mean of 72 percent) and increases quarterly earnings by approximately &lt;strong&gt;$4,100&lt;/strong&gt; (against a mean of $7,807). There is no significant effect on the probability of working at a new firm four quarters later. The authors note these large estimates reflect local average treatment effects for compliers — workers whose accommodation status was changed by the instrument — who disproportionately have high unobserved resistance to treatment and high accommodation returns, explaining the magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the MTE framework reveal about the distribution of accommodation effects and selection?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The MTE curves show that workers with the highest unobserved resistance to treatment (least likely to receive accommodation) have the highest potential employment and earnings gains from accommodation. This negative selection on gains arises because these workers tend to have worse employment outcomes in the untreated state, consistent with more severe disabilities commanding higher accommodation costs. IV weights are concentrated at high-resistance values, explaining the large IV estimates. Negative selection on gains is also found along observable dimensions: workers in self-insured firms, healthcare support occupations, women, and those with wounds/cuts/burns show larger gains but lower likelihood of receiving accommodation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What evidence supports characterizing firm accommodation as general rather than firm-specific human capital investment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: Three pieces of evidence point toward general human capital. First, the IV estimate shows accommodation has no statistically significant effect on the probability of working at a new firm four quarters after disability. Second, a triple-interaction specification (DID interacted with new-firm indicator) yields suggestive evidence of even larger earnings gains for workers who move to a new firm post-accommodation, though this is not statistically significant — a pattern inconsistent with firm-specific human capital. Third, the subset of claims that receive non-wage EAIP benefits (worksite modifications, retraining) do show lower mobility, but this comprises fewer than 5 percent of the sample, meaning the predominant form of investment in the context is general in nature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the two sources of market inefficiency in accommodation identified in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: The first is a human capital externality operating through worker turnover. Because accommodation builds general human capital that workers carry to new employers, a firm accommodating a worker does not capture the portion of future surplus that accrues to future employers upon separation. In a Nash bargaining framework with lack of commitment, this dynamic inefficiency is larger when industry-wide turnover rates are higher — consistent with the descriptive finding that accommodation rates are strongly negatively associated with industry separation rates. The second is a fiscal externality from imperfect experience rating: firms whose workers&amp;rsquo; compensation premiums are not fully linked to their own claim costs do not fully internalize the cost-savings from accommodation (i.e., reduced time-loss benefit payments), leading them to accommodate at inefficiently low rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How is heterogeneity incorporated in the structural estimation, and what do the estimated parameters show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The model incorporates observed heterogeneity (firm insurance status, worker skill type — measured by pre-disability wages — firm baseline exposure, and pre/post policy change) and unobserved heterogeneity mapped to the MTE framework&amp;rsquo;s unobserved resistance to treatment. Indirect inference matches cross-sectional accommodation rates, earnings by subgroup, and the DID coefficients. Key findings: net output during the disability period is negative (accommodation is a costly short-run investment), while post-disability output is higher for accommodated workers. Low-skilled workers experience larger productivity gains from accommodation than high-skilled workers. Accommodation cost shock variance is lower for higher unobserved types, meaning high-gain workers are also more sensitive to subsidy changes, consistent with the large IV estimates. The model fits the DID coefficients for accommodation, employment, and wages well.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What do the counterfactual simulations show about the welfare effects of varying the subsidy rate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: Eliminating wage subsidies from the current 50% rate reduces the accommodation rate from 33% to 11% and lowers post-disability employment by 7 percentage points and post-disability quarterly wages by 15% ($1,358). From a welfare perspective, eliminating subsidies in a revenue-neutral reform reduces average ex-ante worker welfare and lowers welfare for more than 90% of workers. Conditional on experiencing disability, eliminating subsidies reduces welfare by about 10% while raising the subsidy to 100% increases welfare of disabled workers by around 30%. Firm profit is increasing in the subsidy rate up to about 80%, then decreases. Ex-ante worker welfare gains from the current 50% subsidy relative to no subsidy are modest in consumption-equivalent terms (at most 0.6% increase in consumption), partly because the disability probability is low (2.2%) and because unaccommodated workers still receive two-thirds wage replacement through time-loss benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What distributional implications do wage subsidies have across worker and firm types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: Welfare gains from higher wage subsidies are larger for low-skilled workers than high-skilled workers, so the subsidy has a redistributive dimension beyond efficiency correction. Welfare gains are also larger for workers in imperfectly experience-rated firms, where the fiscal externality creates the greater wedge from the efficient level. Self-insured firms, which already internalize workers&amp;rsquo; compensation cost savings and thus accommodate closer to the optimal rate, benefit less from the subsidy and can even be made worse off if subsidies are set very high (since they bear higher flat payroll taxes with smaller marginal accommodation gains). The fraction of worker-firm matches experiencing welfare gains exceeds 90% under the benchmark subsidy level, indicating broad rather than narrowly concentrated gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How do the experience-rating channel and the worker-turnover channel interact in comparative statics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: Model comparative statics show that reducing the job-to-job transition rate of workers with disabilities to one-quarter of its estimated value substantially raises accommodation rates, and this effect is more pronounced for imperfectly experience-rated firms than for self-insured firms. This occurs because self-insured firms already have a strong incentive to accommodate (to reduce workers&amp;rsquo; compensation premiums), so turnover is less marginal for them. Forcing all firms to be self-insured (perfect experience rating) would substantially increase accommodation rates in currently imperfectly rated firms. Lowering the accommodation cost during the disability period (increasing net output during the disability period) also raises accommodation rates for both firm types.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Firm Accommodation (EAIP):&lt;/strong&gt; In this paper&amp;rsquo;s specific sense, accommodation refers to a firm&amp;rsquo;s decision to offer a worker with a temporary workplace disability &amp;ldquo;transitional work&amp;rdquo; — alternative tasks, modified duties, or flexible arrangements — during their recovery period, funded in part through Oregon&amp;rsquo;s Employer at Injury Program wage subsidy. Accommodation is distinct from simple early return to work; it functions as a form of human capital investment by potentially providing skill development opportunities and preventing human capital depreciation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exposure (Instrument):&lt;/strong&gt; A firm-level continuous measure defined as the fraction of a firm&amp;rsquo;s workers&amp;rsquo; compensation claims that used EAIP during a five-year baseline period (2005–2009). Exposure captures permanent, time-invariant firm-level propensity to accommodate, and is used to construct a difference-in-differences instrument for the causal effect of accommodation by interacting exposure with a post-2013 indicator (when the subsidy rate was cut from 50% to 45%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect Experience Rating:&lt;/strong&gt; The degree to which a firm&amp;rsquo;s workers&amp;rsquo; compensation insurance premium adjusts to reflect that firm&amp;rsquo;s own claims costs, rather than being set at an industry average. Fully experience-rated (self-insured) firms internalize 100% of claim costs and thus have strong incentives to accommodate. Partially experience-rated firms face a fiscal externality: because their premiums do not fully reflect their own time-loss benefit expenditures, they do not capture all the cost savings from accommodating workers, leading to under-accommodation relative to the social optimum.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Human Capital Externality (Dynamic Inefficiency in Accommodation):&lt;/strong&gt; The mechanism — analogous to Acemoglu and Pischke (1999) and Fang and Gavazza (2011) — by which worker turnover reduces firms&amp;rsquo; incentives to invest in general human capital (here, accommodation). When accommodation raises workers&amp;rsquo; general productivity, part of the future surplus from this investment accrues to future employers upon job-to-job separation. With Nash bargaining and lack of commitment (re-bargaining in the second period), the accommodating firm cannot capture this surplus, creating a dynamic inefficiency that is more severe in high-turnover industries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative Selection on Gains:&lt;/strong&gt; The empirical finding, established via the MTE framework, that workers with workplace disabilities who are least likely to receive accommodation (highest unobserved resistance to treatment) have the largest potential employment and earnings gains from accommodation. This pattern arises because workers with more severe disabilities have high accommodation costs (making firms unwilling to accommodate them) but also face far worse counterfactual labor market outcomes without accommodation, creating large potential gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal Treatment Effect (MTE):&lt;/strong&gt; Following Heckman and Vytlacil (2005), the treatment effect of accommodation evaluated at a specific quantile of unobserved resistance to treatment — defined here as the propensity score value at which a worker is indifferent between treatment and non-treatment. The MTE curve maps out the full distribution of treatment effects and reveals who benefits (and by how much), how IV estimates are weighted averages over this distribution, and which compliers drive the large IV estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General vs. Firm-Specific Human Capital (in Accommodation Context):&lt;/strong&gt; Accommodation is characterized as general human capital investment if the productivity and earnings gains it produces are transferable across employers — i.e., if accommodated workers who move to new firms retain their wage gains. It is firm-specific if gains are tied to the current match. In this paper, general human capital is supported by the null effect of accommodation on new-firm employment probability, suggestive evidence of non-lower (possibly larger) earnings gains for new-firm movers, and the observation that fewer than 5% of claims use non-wage EAIP benefits associated with firm-specific investment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revenue-Neutral Counterfactual:&lt;/strong&gt; A counterfactual policy experiment in which the wage subsidy rate for accommodation is varied while imposing that both the time-loss benefit program and the EAIP wage subsidy program remain budget-balanced. Higher subsidy rates raise firm accommodation, reduce time-loss benefit payouts (lowering base premiums for imperfectly experience-rated firms), but require a higher flat EAIP payroll tax on all firms, some of which is passed through to workers via lower first-period wages.&lt;/p&gt;</description></item><item><title>Firm dynamics and random search over the business cycle</title><link>https://macropaperwarehouse.com/papers/firm-dynamics-and-random-search-over-the-business-cycle/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-dynamics-and-random-search-over-the-business-cycle/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;How do aggregate economic fluctuations reallocate workers across the firm productivity distribution over the business cycle? In particular, to what extent do recessions impede workers&amp;rsquo; movement up the job ladder toward more productive firms?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a tractable random search model combining three features that had not previously been integrated in a single quantitative framework: (i) firm dynamics driven by idiosyncratic productivity shocks, with endogenous entry and exit; (ii) on-the-job search, generating a job ladder in which workers gradually move toward more productive firms; and (iii) aggregate productivity shocks. Multi-worker firms post employment contracts, choose hiring rates, and decide whether to continue or exit. The key tractability result — called &amp;ldquo;size-independence&amp;rdquo; (Result 1) — shows that, under a constant-returns hiring cost technology, firms&amp;rsquo; optimal policies (contract value, hiring rate, exit decision) are all independent of firm size, so the relevant state space reduces from the full joint distribution of firm productivity and size to the employment-weighted distribution of firm productivity alone. A further result (&amp;ldquo;rank-monotonic equilibrium,&amp;rdquo; Result 2) guarantees, under a sufficient convexity condition on hiring costs (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1), that the optimal employment contract is increasing in firm productivity, so the job ladder maps one-for-one onto the firm productivity ladder. The optimal wage contract then admits a closed-form solution.&lt;/p&gt;
&lt;p&gt;The model is calibrated to British data for 1997–2018. Worker-level transition rates (unemployment-to-employment, employment-to-unemployment, and job-to-job) are drawn from the British Household Panel Survey (BHPS). Firm-level data on labor productivity (value added per worker) and employment costs per worker come from the Annual Respondents Database (ARD) and Annual Business Survey (ABS), merged with the Business Structure Database (BSD). The numerical solution adapts ideas from Krusell and Smith (1998), approximating the employment-weighted productivity distribution by a small set of moments and parameterizing value functions as polynomials in the aggregate state; standard linearization methods are inapplicable because endogenous firm entry and exit introduces a discontinuity in value functions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Model validation via the OP decomposition.&lt;/em&gt; The paper&amp;rsquo;s central validation exercise uses the Olley-Pakes (OP) decomposition of a labor productivity index constructed from firm-level data. The aggregate employment-weighted labor productivity index is decomposed into (a) the unweighted average firm productivity and (b) an interaction term (the &amp;ldquo;OP term&amp;rdquo;), which captures the covariance between employment shares and productivity — i.e., how well workers are allocated to productive firms. In the British firm-level data, approximately 20 percent of the variance of the aggregate labor productivity index is accounted for by this interaction (OP) term, with the remaining ~80 percent attributable to the unweighted average of firm productivity. The baseline model, with this moment untargeted, successfully replicates this 80/20 split. By contrast, the leading benchmark model of Moscarini and Postel-Vinay (2016) (MPV2016), calibrated to the same British data, attributes nearly all of the variance of labor productivity to the OP/worker reallocation term, grossly overstating the importance of job-ladder dynamics.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Structural decomposition of labor productivity.&lt;/em&gt; Using the calibrated baseline model to decompose the variance of aggregate labor productivity over the post-war British business cycle (&amp;ldquo;GDP shocks&amp;rdquo; going back to 1955), the baseline model attributes approximately 30 percent to the direct effect of the aggregate productivity shock, approximately 50 percent to changes in the distribution of active firms (the &amp;ldquo;firm ladder&amp;rdquo; or firm selection component), and approximately 20 percent to the worker reallocation component (the OP interaction term). This result is robust to an alternative calibration with a lower curvature of the hiring cost function (c1 = 1).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Persistence and mechanisms.&lt;/em&gt; The impact of recessions on the job ladder is persistent: while the aggregate productivity shock is typically close to its pre-recession value four years after a typical recession onset, the overall allocation of workers to firms remains clearly worse relative to the pre-recession level at that same horizon. The Great Recession, viewed through the lens of the model, is a large but not unusually large recession.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Firm selection with multiple aggregate shocks.&lt;/em&gt; An unexpected finding concerns the direction of firm selection. With a single aggregate productivity shock, the model generates a standard &amp;ldquo;cleansing&amp;rdquo; mechanism: negative shocks raise the firm exit threshold, so surviving firms are on average more productive. However, when additional shocks to the exogenous separation rate (δ) and hiring cost scale (c0) are included — as required to match the volatility of labor market flows — firm selection instead amplifies the decline in labor productivity. The mechanism is a general equilibrium one: a higher separation rate lowers the optimal wage contract (since greater separation risk is passed on to workers), which in turn lowers the entry-exit threshold. Less productive firms become viable because their employees face higher unemployment risk and therefore accept lower wages; moreover, a larger pool of unemployed workers makes it easier for low-productivity firms to recruit.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Wage flexibility tension.&lt;/em&gt; The model implies a pass-through elasticity of wages to productivity shocks of approximately 0.7, well above the 0.05–0.2 range typically found empirically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All calibration and quantitative results pertain to Britain for the period 1997–2018 (firm-level data) and 1955–2018 (GDP-based aggregate shocks). The model abstracts from decreasing returns to scale in production and from nominal rigidities. The tractability results rely on specific assumptions about the hiring cost function; the rank-monotonicity condition requires sufficient convexity (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central tractability result and why does it matter for computational feasibility?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Result 1 (&amp;ldquo;size-independence&amp;rdquo;) shows that, because both the production technology and the hiring cost function are constant returns to scale, the firm&amp;rsquo;s present discounted value of profits is linear in employment. As a result, per-worker profits are independent of firm size, and optimal firm policies — the hiring rate, the contract value offered to workers, and the continuation/exit decision — all depend only on the firm&amp;rsquo;s current productivity, not on its size. This collapses the state space from the full joint distribution of firm productivity and employment size to the employment-weighted measure of firm productivity Lt(p), a uni-dimensional object. Without this result, the model would require tracking the entire joint firm distribution, making it computationally intractable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is a rank-monotonic equilibrium (RME) and what conditions guarantee it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: An RME is a recursive equilibrium in which the optimal contract offered by a firm is weakly increasing in that firm&amp;rsquo;s current productivity realization, for all aggregate states. Result 2 provides sufficient conditions: (i) the Markov process for firm-specific productivity satisfies first-order stochastic dominance (more productive firms today are more likely to be more productive tomorrow), (ii) the distribution of offered contracts is everywhere differentiable (ruling out mass points), and (iii) the hiring cost function satisfies hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1 — a sufficient convexity condition. The economic interpretation of the convexity condition is that firms must find retention (offering higher wages) sufficiently costly relative to new hiring that more productive firms optimally choose to use the wage margin to limit quits. The baseline calibration yields c1 ≈ 5.9 (so costs are highly convex in the hiring rate), though results are also reported for the minimum permissible c1 = 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What does the optimal employment contract look like in a rank-monotonic equilibrium, and what does it reveal about rent extraction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In an RME, the optimal contract V(p,ω,L) is a weighted average of the value of unemployment U(ω,L) and the firm-workers&amp;rsquo; joint surplus S(p,ω,L), where the weights are determined endogenously by the employment-weighted measure of firm productivity L. Specifically, the contract integrates the surplus of all firms with productivity below p, weighted by the share of employed workers at those firms, and divided by the mass of job seekers willing to accept the contract. As the employed workers&amp;rsquo; relative search intensity s approaches zero, the contract converges to the value of unemployment — workers receive no rents. The endogenous bargaining weight evolves with the aggregate state over the business cycle, unlike standard Nash bargaining models with a fixed exogenous weight.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What firm-level moments are used to calibrate the steady-state model, and what is the logic behind the parameter-moment mapping?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Eight moments are targeted. From the BHPS worker data: the average UE rate (0.058) pins down the scale of hiring costs c0; the average EU rate (0.003) pins down the exogenous separation rate δ; and the average EE (job-to-job) rate (0.016) pins down the relative search intensity s. From the firm-level ARD/BSD data: average firm size (12.1 employees) pins down the entry probability µ; the share of job destruction from firm exits (0.526) disciplines the flow value of unemployment b; the autocorrelation of firm employment ln(n) (0.949 annually) disciplines the persistence of idiosyncratic productivity ρp; the interquartile range of firm-level labor productivity (1.129 log points) disciplines the volatility of idiosyncratic shocks σp; and the regression coefficient of firm employment growth on lagged labor productivity (0.136) disciplines the curvature of hiring costs c1. The baseline calibration fits all eight moments closely.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the calibrated model match non-targeted moments, and what does this establish?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model generates several realistic features not targeted in calibration. It produces a realistic Pareto tail for the employment-size distribution (Pareto tail exponent of 1.033 in the model vs. 1.066 in the data), which arises from the combination of size-independent growth rates and firm entry and exit — conditions identified in the literature as generating power law distributions. The model also matches the dispersion of employment costs per worker across firms (capturing about 70 percent of the interquartile range of ECi,t), the slope of a regression of employment costs on labor productivity (model: 0.685 vs. data: 0.704), and the slope of a regression of employment growth on employment costs (model: 0.162 vs. data: 0.131). These non-targeted matches provide independent validation of the model&amp;rsquo;s wage-determination mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why is a single aggregate productivity shock insufficient to match labor market fluctuations, and what additional shocks are needed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: With a single aggregate productivity shock calibrated to match the autocorrelation and standard deviation of log GDP, the model generates labor market fluctuations that are roughly an order of magnitude smaller than in the data. For example, the standard deviation of the EU transition rate is 4.1×10⁻⁴ in the single-shock model versus 2.3×10⁻³ in the data. Adding a discount rate shock (ω,r) partially helps but still leaves the job-finding rate (UE) more than 50 percent too smooth. Adding a separation rate shock (ω,δ) substantially increases EU and UE volatility but generates insufficient EE (job-to-job) volatility. The combination (ω,δ,c0) — adding a shock to the scale of hiring costs c0 — brings the standard deviations of EU and UE close to the data (2.0×10⁻³ and 4.0×10⁻⁴ vs. data 2.3×10⁻³ and 2.7×10⁻⁴), though the model still generates slightly under half the observed volatility in EE rates. This combination is the baseline for the quantitative analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the OP decomposition, how is it computed from the firm-level data, and what does it measure in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The aggregate labor productivity index LPt is constructed from firm-level data as the employment-share-weighted average of log value added per worker across firms. The OP decomposition writes this as LPt = LPt_bar + OPt, where LPt_bar is the unweighted (simple) average of firm-level productivity and OPt is the covariance between employment shares and labor productivity (the &amp;ldquo;interaction term&amp;rdquo;). In the data, OPt increases when workers are disproportionately employed at above-average-productivity firms. In the model, LPt_bar maps onto the average (log) productivity of active firms — the support of the job ladder — while OPt maps onto the difference between the employment-weighted and the unweighted averages of firm productivity, directly measuring how high up the ladder workers are located relative to the set of active firms. Around 20 percent of the variance of LPt in the British data is accounted for by OPt, and the model replicates this.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the Great Recession appear in the OP decomposition, and does the model fit the decomposition during this episode?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: During the Great Recession (2008q2–2009q3 in the UK), around 20 percent of the overall fall in the labor productivity index is accounted for by the fall in the OP interaction term, with the remaining 80 percent coming from the fall in the unweighted average firm productivity. The model, even though it does not target this decomposition in calibration, successfully matches both the average firm productivity component and the interaction (OP) component during the Great Recession. This matching holds both in the baseline calibration (c1 ≈ 5.9) and in the alternative calibration with c1 = 1. The model also matches the analogous decomposition for employment costs per worker (ECt), an additional non-targeted validation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why does firm selection amplify rather than cleanse in the baseline multi-shock calibration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the single-shock (productivity ω only) model, a negative productivity shock lowers surplus at all firms, raising the exit threshold pE and thus selecting out low-productivity firms — the standard &amp;ldquo;cleansing&amp;rdquo; mechanism. In the multi-shock baseline, the additional separation rate shock (δ) generates a less intuitive mechanism. A higher δ lowers the optimal wage contract (since increased separation risk is passed on to workers: ∂V/∂δ ≤ 0), which reduces the value of continued employment. This lowers the joint firm-worker surplus threshold for exit, making it viable for low-productivity firms to remain active. Moreover, the larger pool of unemployed workers (generated by the δ shock) depresses the outside option of workers and makes it easier for low-productivity firms to recruit. As a result, the entry-exit threshold pE,t falls — the set of active firms becomes less productive on average — producing a negative firm selection contribution to labor productivity and a positive (amplifying rather than cleansing) contribution to the variance of LPt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the structural variance decomposition of labor productivity in the baseline model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Simulating the baseline model over the post-war British business cycle (1955–2020, GDP shocks), the variance of aggregate labor productivity LPt decomposes into three structural terms: approximately 30 percent (0.296) from the direct effect of the aggregate productivity shock ln(ωt); approximately 50 percent (0.541) from changes in the average productivity of active firms E[KP bar_t(ln p)] — the &amp;ldquo;firm ladder&amp;rdquo; or firm selection component; and approximately 20 percent (0.163) from the worker reallocation component OPt = E[LP bar_t(ln p)] − E[KP bar_t(ln p)]. This decomposition implies that roughly 70 percent of fluctuations in labor productivity are driven by worker reallocation broadly defined (the firm ladder plus the interaction term), with the firm selection component being the largest single driver. The result is robust to the alternative c1 = 1 calibration (30/49/22 percent split).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the baseline model compare to MPV2016 in the variance decomposition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the multi-shock calibration (ω,δ,c0), the MPV2016 model calibrated to the same British data attributes approximately 97.7 percent (0.977) of the variance of LPt to the worker reallocation (OP) term, with essentially none attributed to a firm selection term (since there is no firm entry and exit in MPV2016). This is nearly five times the 20 percent share attributed to worker reallocation in the data and in the baseline model. In the single-shock (ω) calibration, both models attribute a more modest share to worker reallocation (7.2 percent for the baseline model, 0.1 percent for MPV2016 with c1=5), and the difference narrows considerably. The contrast thus stems from the interaction of firm dynamics with multiple aggregate shocks: allowing for endogenous firm entry and exit is critical to prevent the model from overstating the role of the job ladder.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How persistent is the impact of recessions on the job ladder, based on the model simulations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper simulates the structural decomposition of labor productivity starting from each of seven post-war British recessions (defined by two consecutive quarters of negative GDP growth). On average across these recessions, the aggregate productivity shock ln(ωt) is close to its pre-recession level by four years after the recession onset. However, the overall employment-weighted average productivity E[LP bar_t(ln p)] — reflecting workers&amp;rsquo; position on the job ladder — remains clearly below its pre-recession value at the four-year horizon, indicating persistent misallocation. The OP interaction term accounts for approximately 20 percent of the total drop in the employment-weighted productivity measure three years after a typical recession onset. Through the model&amp;rsquo;s lens, the Great Recession is a large recession but not an outlier relative to the historical distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What does the counterfactual with countercyclical unemployment benefits reveal about the tradeoff between firm selection and worker reallocation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When the flow value of unemployment is made countercyclical (falling in recessions, rising in expansions — mimicking US unemployment insurance extension programs), the model generates a sign reversal in the firm selection (&amp;ldquo;firm ladder&amp;rdquo;) component. With countercyclical b, the unemployment value rises in recessions, which raises the minimum wage firms must offer and raises the exit threshold pE,t: fewer low-productivity firms survive, improving the composition of active firms. However, countercyclical benefits also amplify the slowdown in job-to-job reallocation: the higher value of unemployment reduces workers&amp;rsquo; willingness to accept job offers, and all firms cut recruitment since optimal wage contracts must rise. The OP interaction term therefore falls more sharply than in the baseline model. The counterfactual with ϵb,ω ∈ {−100, −50} finds that the positive &amp;ldquo;firm ladder&amp;rdquo; effect dominates on net, so the overall allocation of workers to firms improves relative to the baseline after a typical recession under countercyclical unemployment benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What is the numerical solution method, and why are standard linearization approaches inapplicable?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model is solved in two steps. First, aggregate shocks are shut down and the steady-state rank-monotonic equilibrium is solved numerically by discretizing the firm productivity process (401 grid points via Tauchen&amp;rsquo;s method) and iterating on the value function and the employment-weighted productivity measure until convergence. Second, aggregate shocks are reintroduced using a simulation-based approach adapted from Krusell and Smith (1998): the employment-weighted distribution of productivity is summarized by Nm = 2 moments (plus the unemployment rate), and the value functions are parameterized as polynomials in the aggregate state, with coefficients updated by regression until convergence. Standard linearization methods (Reiter 2009) are inapplicable because the endogenous entry-exit decision creates a kink (discontinuity) in value functions at the productivity threshold pE, making first-order approximations around the steady state inaccurate. Accuracy tests based on den Haan (2010) show that the polynomial approximation generates errors of at most 0.065 percent for value functions and at most 1 percentage point for the unemployment rate across simulation paths.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;1. Rank-Monotonic Equilibrium (RME)&lt;/strong&gt;
A recursive equilibrium in which the optimal state-contingent employment contract V(p,ω,L) offered by a firm is weakly increasing in the firm&amp;rsquo;s current productivity realization p, for all aggregate states (ω,L). This property implies that the job ladder maps one-for-one onto the firm productivity ladder: workers always prefer to work at more productive firms. The paper shows this property holds under a sufficient convexity condition on hiring costs (hc&amp;rsquo;&amp;rsquo;(h)/c&amp;rsquo;(h) ≥ 1) and first-order stochastic dominance of the productivity process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;2. Size-Independence&lt;/strong&gt;
The property that a firm&amp;rsquo;s optimal policies — the hiring rate h(p), the employment contract V(p), and the entry/exit decision χ(p) — are all independent of the firm&amp;rsquo;s current employment size n. This follows from constant returns to scale in production and hiring, which implies that firm profits are linear in employment. Size-independence reduces the model&amp;rsquo;s relevant state space to the employment-weighted distribution of firm productivity, enabling tractability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;3. Employment-Weighted Distribution of Firm Productivity (L_t(p))&lt;/strong&gt;
The measure recording, for each productivity level p, the total employment at firms with productivity at most p. This is the sufficient statistic for the state of the job ladder at any point in time: combined with the aggregate shock ω, it determines all equilibrium policy functions and value functions. In the model, it replaces the full joint distribution of firm productivity and employment size that would otherwise be required.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;4. OP Decomposition (Olley-Pakes Decomposition)&lt;/strong&gt;
The decomposition of the aggregate employment-weighted labor productivity index LPt into: (a) the unweighted average firm productivity LPt-bar, which summarizes the productivity of active firms (the support of the job ladder); and (b) an interaction term OPt, the covariance between employment shares and firm-level productivity, which measures how well workers are allocated across the productivity distribution (i.e., how high up the ladder workers sit given the set of active firms). In the model, (a) maps to E[KP bar_t(ln p)] and (b) maps to OPt = E[LP bar_t(ln p)] − E[KP bar_t(ln p)].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;5. Contract Posting&lt;/strong&gt;
The wage-setting protocol in which each firm commits upon entry to a full state-contingent employment contract — a schedule mapping each future realization of aggregate and idiosyncratic productivity to a wage and continuation decision — and is bound by an equal treatment constraint to offer the same contract to all employees. Workers cannot renegotiate based on outside offers. This protocol produces a well-defined closed-form for the optimal contract in an RME and differs from alternating-offer bargaining (Nash bargaining) in that the bargaining weights are endogenous rather than fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;6. Firm-Workers&amp;rsquo; Joint Surplus (S_t(p))&lt;/strong&gt;
The total present discounted value accruing to the firm-worker pair: firm profits per worker plus the contract value promised to workers. Because utility is transferable (risk neutrality) and the firm fully commits to its contract, this surplus depends only on the firm&amp;rsquo;s current productivity and the aggregate state — not on the promised contract value V. The surplus S_t(p) is the key object determining firm entry/exit (the firm continues if and only if S_t(p) ≥ U_t) and optimal hiring (the marginal return to an additional hire equals S_t(p) − V(p)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;7. Cleansing vs. Anti-Cleansing Firm Selection&lt;/strong&gt;
In models with endogenous firm entry and exit, a negative aggregate shock can either raise or lower the productivity threshold for firm survival. &amp;ldquo;Cleansing&amp;rdquo; refers to the standard mechanism where a negative productivity shock raises the exit threshold, selecting out low-productivity firms and improving the average quality of survivors. &amp;ldquo;Anti-cleansing&amp;rdquo; (as in the baseline multi-shock calibration) occurs when separation rate or hiring cost shocks lower the optimal wage contract and reduce the exit threshold, allowing less productive firms to survive and worsening average firm productivity.&lt;/p&gt;</description></item><item><title>Firm Responses and Wage Effects of Foreign Demand Shocks with Fixed Labor Costs and Monopsony</title><link>https://macropaperwarehouse.com/papers/firm-responses-and-wage-effects-of-foreign-demand-shocks-with-fixed-labor-costs-and-monopsony/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/firm-responses-and-wage-effects-of-foreign-demand-shocks-with-fixed-labor-costs-and-monopsony/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; The paper asks three related questions in the context of Belgium, a small open economy: (1) What do firms&amp;rsquo; responses to demand shocks reveal about their cost structures? (2) What are the worker and wage impacts of foreign demand shocks? (3) How sensitive are the aggregate wage effects of foreign demand shifts to firms&amp;rsquo; cost structures and imperfect competition in the labor market?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The analysis combines administrative micro-data from Belgium for 2002–2014, provided by the National Bank of Belgium. The linked dataset covers 995,739 firm-year observations from private, non-financial firms with at least one FTE employee, and integrates: (a) a Business-to-Business (B2B) VAT transactions registry capturing all annual domestic firm-to-firm sales above €250; (b) customs records and intra-EU declarations for imports and exports at the 8-digit product level; (c) annual accounts containing data on sales, labor costs, intermediate inputs, capital, and firm characteristics; and (d) employer-employee matched data from the Belgian social security administration (BCSS) for a random sample of 500,000 workers in firms with 10 or more FTE employees, covering 2003–2014.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy.&lt;/strong&gt; To isolate variation in firms&amp;rsquo; sales driven by foreign demand rather than supply-side factors, the authors construct a firm-specific foreign demand instrument following Hummels et al. (2014) and Dhyne et al. (2021). The instrument is the weighted average of changes in world import demand facing a firm, using lagged export shares as weights and excluding Belgian imports from the world import measure. Crucially, the instrument captures both direct foreign demand exposure (for exporters) and indirect exposure through the domestic production network — including the foreign demand shocks passing through to upstream domestic suppliers via buyer-supplier links. Firm and industry-year fixed effects control for time-invariant heterogeneity and industry-level trends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Empirical Facts.&lt;/strong&gt; Within-firm analysis over four-year windows finds that intermediate input purchases respond nearly proportionally to changes in sales (slope coefficient 0.82), while labor costs respond less than proportionally (slope coefficient 0.57). The less-than-proportional response of labor costs — with the employment slope of 0.48 and the average wage slope of 0.09 — is consistent with sizable fixed overhead costs in labor inputs and upward-sloping labor supply curves. Output prices co-move more with input prices than with average wages, consistent with labor constituting a smaller share of variable costs than intermediate inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;IV Estimates of Firm Responses.&lt;/strong&gt; In response to a foreign demand shock inducing a 10 percent instantaneous increase in a firm&amp;rsquo;s sales, the firm&amp;rsquo;s cumulative sales over four years increase by approximately 7.6 percent (balanced panel). Over the same four-year horizon, total input purchases increase by about 7.0–7.8 percent, while labor costs increase by only 3.5–4.1 percent — a substantially less-than-proportional response. Roughly one-quarter of the labor cost change comes from changes in average wages rather than employment changes. Domestic input purchases increase by 5.3–6.0 percent, indicating that firms pass on a large share of foreign demand shocks to their domestic suppliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural Parameters.&lt;/strong&gt; The implied IV estimate of the labor cost elasticity with respect to sales is 0.53 (standard error 0.08), statistically significantly below one. The implied elasticity of total input purchases is 1.05 (standard error 0.15), close to one, so the fixed share of intermediate inputs is approximately zero. The labor supply elasticity estimated from the ratio of wage and employment responses is approximately 3.9 in the full sample and 2.3 in the stayer subsample; the implied wage markdown is 21 percent and 30 percent respectively. Incorporating upward-sloping labor supply into equation (15), the estimated share of total labor inputs that is fixed overhead is approximately 53 percent. By comparison, the fixed share of total costs (labor and intermediate inputs combined) is approximately 29 percent in Belgium — higher than the 18–22 percent found in U.S. data (De Loecker et al. 2020) and the 20 percent found in U.S. manufacturing plants (Ederhof et al. 2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General Equilibrium Counterfactuals.&lt;/strong&gt; The authors parameterize and solve a small open economy general equilibrium model with monopsonistic competition in labor markets, monopolistic competition in product markets, and fixed and variable labor and intermediate input costs. Using the Dekle-Eaton-Kortum (2007) &amp;ldquo;hat algebra&amp;rdquo; technique, they simulate a 5 percent increase in foreign tariffs on all Belgian exports and compare four counterfactual economies: (1) baseline Belgium with fixed costs and imperfect labor market competition (ε = 3.9); (2) fixed costs and perfectly elastic labor supply (ε = ∞); (3) no fixed costs with imperfect competition; (4) no fixed costs and perfectly competitive labor markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings on Wages.&lt;/strong&gt; In the baseline Belgian economy, a 5 percent increase in foreign tariffs produces a 4.9 percent fall in the average real wage. With fixed costs but perfectly elastic labor supply, the real wage falls by 4.8 percent — nearly identical. With upward-sloping labor supply but no fixed costs, the real wage falls by only 3.0 percent; without fixed costs and with perfectly competitive labor supply, the fall is only 2.8 percent. The paper concludes that fixed overhead costs in labor substantially amplify real wage declines, while incorporating upward-sloping labor supply appears quantitatively less consequential for aggregate wage outcomes. Standard models that assume no fixed costs and perfectly elastic labor supply — the typical modeling choice in the trade literature — may substantially understate (by roughly 43–75 percent of the true effect) the aggregate wage decline from a negative foreign demand shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; Fixed overhead costs reduce labor&amp;rsquo;s share of variable costs. When labor is a smaller share of variable costs, output prices are less sensitive to changes in wages. With a fixed aggregate labor supply, the economy must lower prices through wage reductions to restore equilibrium after a negative demand shock; the required wage decline is larger when fixed labor costs are taken into account. The findings are robust to adjustment cost specifications, a nested logit extension of the labor market model, and controlling for location-year fixed effects and import price changes.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What two motivating empirical facts about Belgian firms does the paper establish?&lt;/strong&gt;
A1: First, within-firm four-year changes show that intermediate input purchases respond nearly proportionally to changes in sales (slope coefficient 0.82), while labor costs respond less than proportionally (slope coefficient 0.57). The labor cost response decomposes into an employment slope of 0.48 and a wage slope of 0.09. Second, output prices co-move more strongly with input (intermediate goods) prices than with average wages, consistent with labor constituting a smaller share of variable costs than intermediate inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the instrument for foreign demand shocks capture indirect exposure through production networks?&lt;/strong&gt;
A2: The instrument for firm k is a weighted average of changes in world import demand, where the weights reflect both the firm&amp;rsquo;s own direct export shares across countries and products and the firm&amp;rsquo;s indirect export exposure through its domestic buyers&amp;rsquo; export shares. The term H̃_{kn,t-1} captures the share of firm k&amp;rsquo;s total sales purchased by firm n directly and indirectly through all upstream chains. This means even non-exporting firms receive a non-zero instrument through their sales to directly-exporting firms. In fact, non-directly-exporting firms sell on average nearly 10 percent of their output indirectly to foreign markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the estimated magnitude of the labor supply elasticity facing Belgian firms, and what does it imply for wage markdowns?&lt;/strong&gt;
A3: In the full main estimation sample (balanced panel), the IV estimate of the firm-specific labor supply elasticity is approximately 3.9, implying a wage markdown of about 21 percent relative to the marginal revenue product of labor. In the stayer subsample (incumbent workers only, holding workforce composition fixed), the estimated labor supply elasticity is approximately 2.3, implying a markdown of about 30 percent. The paper can reject perfect competition (infinite elasticity, zero markdown) at a significance level of 0.06 in the full sample and 0.001 in the stayer sample using the closure method.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the estimated labor cost elasticity with respect to demand-driven sales changes, and what does it imply about fixed labor costs?&lt;/strong&gt;
A4: The IV estimate of the labor cost elasticity with respect to sales is 0.528 (standard error 0.085), statistically significantly below one. If labor supply were perfectly elastic, this would directly imply a fixed labor cost share of approximately 47 percent. Incorporating the estimated upward-sloping labor supply curve through equation (15), the model implies that approximately 53 percent of total labor inputs are fixed overhead. For context, occupational data from Belgium&amp;rsquo;s 2014 Structure of Earnings Survey shows that clerical support workers and managers together account for 21 percent of total earnings, and adding professionals raises this to 51 percent — broadly consistent with the estimated fixed share.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the estimated elasticity of input purchases with respect to sales imply about fixed intermediate input costs?&lt;/strong&gt;
A5: The IV estimate of the elasticity of total input purchases with respect to sales is 1.050 (standard error 0.150), close to one. The implied fixed share of total intermediate inputs is therefore approximately zero. However, there is substantial heterogeneity by input type: purchases from the manufacturing sector (roughly half of all input purchases) have an elasticity close to one, whereas service-sector inputs (roughly 30 percent of total input purchases) have an implied fixed cost share of approximately 36 percent, with a size-weighted average cumulative response of 4.3 percent against a total cumulative sales increase of 6.7 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper rule out alternative explanations for the less-than-proportional response of labor costs?&lt;/strong&gt;
A6: The paper considers three main alternatives. First, adjustment costs: even in the presence of labor adjustment costs, under a homothetic constant-returns production function a permanent shock should eventually produce a proportional labor response. The paper focuses on four-year cumulative responses where firm responses change little after the first couple of years, and shows identification of fixed costs holds even in models with quadratic or Calvo-style adjustment costs. Second, a non-homothetic CES production function without fixed costs: Appendix B.3 shows that such a specification predicts that if the labor cost elasticity is below one, the input purchase elasticity must be above one — at odds with the data, which shows the input purchase elasticity is close to one while the labor cost elasticity is well below one. Third, variable markups: a uniform markup change would reduce both elasticities proportionally, not create the large gap between labor cost and input purchase elasticities observed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why are firms&amp;rsquo; domestic suppliers affected by foreign demand shocks, and how large are the pass-through effects?&lt;/strong&gt;
A7: Firms pass on foreign demand shocks to their domestic suppliers through buyer-supplier production network links. When a foreign demand shock increases a firm&amp;rsquo;s sales by 10 percent instantaneously, its domestic input purchases increase cumulatively by approximately 5.3–6.0 percent over four years. Total input purchases increase by 7.0–7.8 percent over the same period; the difference between total and domestic input purchases reflects service inputs (which have smaller responses) and the composition of imported versus domestic inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the aggregate real wage effect of a 5 percent increase in foreign tariffs on Belgian exports in the baseline model?&lt;/strong&gt;
A8: In the baseline counterfactual representing the actual Belgian economy (with fixed overhead costs and labor supply elasticity ε = 3.9), a uniform 5 percent increase in foreign tariffs on all Belgian exports produces a 4.9 percent fall in the average real wage. The median firm reduces output by 3.8 percent, marginal costs by 4.8 percent, and wages by 7.9 percent. The fall in wages is driven by a general equilibrium mechanism: since the foreign price is exogenous and trade balance must hold, wages are the key adjusting margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How much does the modeling of fixed overhead costs versus imperfect labor market competition matter for the aggregate wage counterfactual?&lt;/strong&gt;
A9: Fixed overhead costs account for nearly all of the amplification relative to the standard model. With fixed costs but perfectly elastic labor supply, the real wage falls 4.8 percent — almost identical to the 4.9 percent in the baseline. Without fixed costs but with the estimated upward-sloping labor supply, the fall is only 3.0 percent. Without either, the fall is 2.8 percent. Thus, incorporating fixed overhead costs in labor raises the estimated wage decline by approximately 1.9 percentage points, while incorporating imperfect labor market competition adds only about 0.1 percentage points. The paper concludes that fixed overhead costs, not monopsony, are the essential feature for accurately predicting tariff impacts on wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the mechanism by which fixed overhead costs amplify the aggregate wage decline from a negative demand shock?&lt;/strong&gt;
A10: Fixed overhead costs reduce the share of labor in firms&amp;rsquo; total variable costs. When labor constitutes a smaller fraction of variable costs, output prices are less sensitive to changes in wages. With aggregate labor supply fixed, the economy restores equilibrium after a negative demand shock by reducing prices through wage cuts. To achieve the same magnitude of price reduction when labor is a smaller fraction of variable costs, wages must fall by a larger amount — amplifying the aggregate wage impact. Fixed overhead costs in labor also make foreign inputs relatively more important in variable costs, as shown empirically in Appendix D.1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Is the conclusion about the relative importance of fixed costs versus labor market imperfections robust to alternative specifications of the labor market?&lt;/strong&gt;
A11: Yes. The paper extends the model to a nested logit structure for worker preferences (following Lamadon et al. 2022), which allows Belgium to contain multiple labor markets (defined as industry-region nests), permits heterogeneous markdowns across markets, and is still identified from the data. Empirically, incorporating multiple labor markets and heterogeneous markdowns does not quantitatively alter the aggregate counterfactual predictions for the wage effects of foreign demand shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Are heterogeneous responses to the foreign demand shock observed across exporters, importers, and domestic-only firms?&lt;/strong&gt;
A12: The paper finds no systematic differences in the elasticities of labor cost and input purchases between firms that trade internationally and those that do not. This implies that exporters and importers have higher absolute fixed costs (consistent with fixed export and import costs) but comparable fixed cost shares — since these firms tend to be larger and thus spread higher absolute fixed costs over larger output volumes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Do the findings about fixed overhead costs extend beyond foreign demand shocks?&lt;/strong&gt;
A13: Yes. The paper shows in Appendix D.4 that a uniform 5 percent reduction in the productivity of all Belgian manufacturing firms generates qualitatively and quantitatively similar conclusions: fixed overhead costs amplify the predicted wage effects of domestic productivity shocks, while imperfect competition in the labor market matters to a lesser but still meaningful extent.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Fixed Overhead Costs (Fixed Labor Costs / Fixed Intermediate Input Costs):&lt;/strong&gt; In the paper&amp;rsquo;s model, each firm has firm-specific fixed overhead input requirements for labor (denoted ℓ̄_k^f) and intermediate inputs (denoted q̄_k^f) that must be satisfied regardless of the firm&amp;rsquo;s output level. These fixed requirements are separate from the variable inputs used in production. Fixed labor costs may reflect administration, worker management, facility maintenance, and other tasks that do not directly translate into output. Fixed intermediate input costs include waste management, accounting services, and electricity payments that occur irrespective of sales. The share of total labor inputs that is fixed is identified by how much less than proportionally labor costs respond to demand-driven changes in sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monopsonistic Competition in the Labor Market:&lt;/strong&gt; The paper models each firm as facing an upward-sloping firm-specific labor supply curve arising from workers&amp;rsquo; heterogeneous idiosyncratic preferences over non-wage firm attributes (amenities). Because workers&amp;rsquo; idiosyncratic tastes are private information, firms cannot price-discriminate and thus face an increasing marginal cost of labor. Each firm is infinitesimal within the aggregate labor market but has wage-setting power at the firm level. This gives rise to a constant-elasticity firm-level labor supply curve ℓ_k = A_k w_k^ε, where ε is the labor supply elasticity facing the firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Markdown:&lt;/strong&gt; The firm&amp;rsquo;s equilibrium wage is marked down relative to the marginal revenue product of labor by the factor ε/(1+ε), which is less than one when ε is finite. With a labor supply elasticity of 3.9, the implied markdown is approximately 21 percent; with a supply elasticity of 2.3 (stayer sample), the markdown is approximately 30 percent. Perfect competition corresponds to ε = ∞ and a markdown of zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor Cost Elasticity:&lt;/strong&gt; The elasticity of a firm&amp;rsquo;s total labor cost with respect to a demand-driven change in the firm&amp;rsquo;s sales, as derived from the model&amp;rsquo;s comparative statics (equation 15). This elasticity depends on both the variable share of labor inputs (ℓ_k^v / ℓ_k) and the labor supply elasticity ε. It lies strictly between zero (all labor fixed) and one (all labor variable), and is declining in ε for a given variable share. The paper estimates this elasticity at 0.528 via IV, implying substantial fixed overhead in labor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Total Foreign Demand Shock:&lt;/strong&gt; The firm-level measure of foreign demand used as an instrument, defined as the weighted average of changes in world import demand (excluding Belgium) across country-product pairs, where the weights reflect both the firm&amp;rsquo;s own lagged direct export shares and its indirect exposure through the domestic production network (via the Leontief inverse matrix H̃). This measure captures both direct exporter exposure and indirect upstream exposure for non-exporting firms that supply to exporters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect Export Exposure:&lt;/strong&gt; The share of a firm&amp;rsquo;s output that reaches foreign markets indirectly through sales to domestic buyers who subsequently export. Defined recursively: the total export share of firm k equals its direct export revenue share plus the sum over all domestic buyers of the product of k&amp;rsquo;s revenue share from that buyer and the buyer&amp;rsquo;s own total export share. Even non-direct-exporting firms sell on average approximately 10 percent of their output indirectly to foreign markets in the Belgian data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dekle-Eaton-Kortum Hat Algebra:&lt;/strong&gt; A technique for solving general equilibrium counterfactuals in trade models by expressing all outcomes as proportional changes (&amp;ldquo;hats&amp;rdquo;) relative to the observed equilibrium, without needing to recover the underlying structural parameters. The paper uses this approach to compute counterfactual wages under alternative tariff scenarios, holding fixed the observed firm-level expenditure shares from the reference year (2012) while allowing parameters such as productivity and technology weights to vary across counterfactual economies to rationalize identical observed firm-level observables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Worker Rents:&lt;/strong&gt; In the monopsony model, inframarginal workers earn rents defined as the excess return over what would be required to make them indifferent between employers. These rents arise because firms cannot price-discriminate across workers with heterogeneous amenity valuations. The additional rents accruing to workers from a demand-driven increase in firm sales decompose into: (1) wage increases for incumbent workers multiplied by current employment, (2) rents for new hires (the excess of their wage bill over the amount required to induce them to switch to the expanding firm), and (3) a correction term related to the fraction of the labor cost increase borne by expanding employment rather than wages.&lt;/p&gt;</description></item><item><title>FraNK: Fragmentation in the NK Model</title><link>https://macropaperwarehouse.com/papers/frank-fragmentation-in-the-nk-model/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/frank-fragmentation-in-the-nk-model/</guid><description>&lt;p&gt;Moro and Nispi Landi develop FraNK, a multi-country New Keynesian model designed to study geoeconomic fragmentation — defined, following Aiyar et al. (2023), as a policy-driven reversal of economic integration guided by strategic considerations. The model extends Gali and Monacelli (2005) along three dimensions: it is multi-country rather than small-open-economy; it assumes incomplete international financial markets, relaxing perfect risk sharing; and it incorporates commodities as intermediate inputs in production, capturing both domestic and imported commodity sourcing. A fragmentation shock is modeled as a simultaneous increase in three tax rates imposed on rival countries: a tax on imports of final goods, a tax on imports of commodities, and a tax on the purchase of foreign bonds (capital controls).&lt;/p&gt;
&lt;p&gt;The paper proceeds in two stages. First, under a symmetric two-bloc calibration, closed-form analytical results establish the distinct macroeconomic channels of each tax. The good import tax operates through both demand (households reduce consumption of foreign goods) and supply (firms face higher real marginal costs), with the demand channel dominating: output falls unambiguously and PPI inflation decreases, though CPI inflation rises on impact due to the direct pass-through of import prices. The commodity import tax operates exclusively through supply — raising intermediate input costs — so both output and PPI inflation move in the same direction: output falls and PPI inflation rises. The bond tax is neutral under symmetric calibration: because each country&amp;rsquo;s net foreign asset position is unchanged (each country reduces its holdings of rival-bloc bonds by exactly as much as it reduces its own issuance), output and inflation are unaffected.&lt;/p&gt;
&lt;p&gt;Second, the model is calibrated to four asymmetric regions: the United States (US), US-allied countries including the European Union (WE), the China-Russia-aligned bloc (CR), and a neutral rest of the world (NE). Bloc assignment follows Den Besten et al. (2023), using a political alignment index combining sanctions data, military imports, Belt and Road Initiative participation, and UNGA voting on Russia&amp;rsquo;s invasion of Ukraine. The US and WE impose all three taxes on CR, and vice versa; NE neither imposes nor receives taxes.&lt;/p&gt;
&lt;p&gt;Five main findings emerge from the asymmetric simulation. First, fragmentation predominantly affects CR and WE: both experience substantial declines in consumption and production across all three tax scenarios, with CR most affected when goods or asset taxes are applied. Second, the US is largely insulated: its lower trade and financial exposure to the rival bloc relative to WE limits the pass-through of fragmentation. Third, spillovers to neutral NE are nearly negligible: the expenditure-switching channel (which raises demand for untaxed NE goods) and the global income channel (which reduces demand for all goods as the world becomes poorer) roughly cancel each other out. Fourth, fragmentation is not necessarily inflationary: whether PPI inflation rises or falls depends on the relative weight of commodities in production and the mix of taxes applied — a goods tax lowers PPI inflation, while a commodity tax raises it. Fifth, the bilateral exchange rates most affected are those of the CR bloc, which appreciate under goods and asset taxes and depreciate under commodity taxes.&lt;/p&gt;
&lt;p&gt;Sensitivity analyses confirm robustness across higher elasticity of substitution between domestic and foreign goods (eta raised from 1.5 to 5), lower elasticity of substitution between labor and commodities (xi lowered from 0.4 to 0.1), tighter financial market integration (bond transaction costs multiplied by 5), and permanent shocks (persistence rho raised to 1). Under permanent shocks, the goods-tax effect on PPI inflation approaches zero — consistent with the closed-form result — while commodity-tax effects on production become larger and more persistent.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question of FraNK?
A: The paper asks how geoeconomic fragmentation — modeled as policy-driven increases in taxes on rival countries&amp;rsquo; goods, commodities, and bonds — affects output, inflation, exchange rates, and capital flows at both the global and country level. It also asks whether different sources of fragmentation (real versus financial) have distinct macroeconomic implications, and whether neutral countries experience meaningful spillovers.&lt;/p&gt;
&lt;p&gt;Q: How does the model depart from the Gali-Monacelli (2005) benchmark?
A: Three departures are made. The model is multi-country (N countries) rather than a single small open economy facing the rest of the world. Financial markets are incomplete, so international risk sharing is imperfect — a realistic assumption in a fragmented world. And intermediate-good production uses a CES bundle of labor and a commodity bundle that includes both domestic and imported commodities, which is essential for capturing commodity market disruptions such as those following Russia&amp;rsquo;s invasion of Ukraine.&lt;/p&gt;
&lt;p&gt;Q: What are the three tax instruments and what does each represent?
A: The goods import tax (tau_ijt) is a tariff on final goods imports, representing trade barriers. The commodity import tax (tau_O_ijt) is a tariff on imported commodity inputs, representing sanctions or restrictions on energy and raw material trade. The bond tax (theta_ijt) is a capital control discouraging purchases of bonds issued by rival countries, representing financial fragmentation or sanctions on financial assets.&lt;/p&gt;
&lt;p&gt;Q: What does the closed-form symmetric-calibration result establish about output?
A: Under the symmetric calibration, both the goods import tax and the commodity import tax reduce output unambiguously (Proposition 3.3). The bond tax is neutral for output under symmetry because each country&amp;rsquo;s net foreign asset position is unchanged — any reduction in holdings of rival-bloc bonds is exactly matched by a reduction in own-bond issuance, leaving net positions and aggregate demand unaffected (Proposition 3.4).&lt;/p&gt;
&lt;p&gt;Q: Why does the goods import tax reduce PPI inflation while the commodity import tax raises it?
A: The goods import tax operates through two opposing channels: a demand channel (households substitute away from foreign goods, reducing aggregate demand) and a supply channel (import taxes raise firms&amp;rsquo; real marginal costs). The closed-form solution establishes that the demand channel dominates, so PPI inflation falls. The commodity import tax operates only through the supply channel — raising the cost of intermediate inputs directly — so PPI inflation rises unambiguously. CPI inflation rises on impact under the goods tax because import prices are directly included in the CPI even as PPI falls.&lt;/p&gt;
&lt;p&gt;Q: Under what condition does simultaneous fragmentation (goods and commodity taxes together) produce PPI inflation?
A: When both taxes are imposed simultaneously, the net effect on PPI inflation is ambiguous. The paper shows analytically that PPI inflation rises if and only if omega * gamma_O_tilde &amp;gt; gamma_tilde * (phi/sigma), where omega is the commodity weight in production, gamma_O_tilde captures commodity import weights, and gamma_tilde captures goods import weights. That is, fragmentation tends to be stagflationary the larger the weight of commodities in the production function, consistent with the empirical finding in Caldara et al. (2024) of stagflationary effects from elevated geopolitical risk.&lt;/p&gt;
&lt;p&gt;Q: Why is the US more shielded from fragmentation than its WE allies?
A: The US has relatively lower trade and financial exposure to the CR bloc compared to WE. Because the trade and financial weights calibrated from UN Comtrade, IMF CPIS, BIS LBS, and IMF CDIS data place WE in closer economic relationships with CR countries, a tax on CR imports or assets falls more heavily on WE than on the US. This asymmetry is a direct consequence of the calibration: no structural or strategic advantage of the US is assumed beyond its actual pattern of trade and financial linkages.&lt;/p&gt;
&lt;p&gt;Q: What happens to the CR bloc&amp;rsquo;s exchange rate under each tax scenario?
A: Under the goods import tax, the CR exchange rate appreciates: CR&amp;rsquo;s own tax reduces demand for US/WE goods, increasing domestic demand relative to the rest of the world, and the reduced demand for CR bonds from abroad raises CR interest rates, further attracting capital. Under the commodity import tax, the CR exchange rate depreciates: lower commodity demand reduces CR commodity prices and production, shifting labor toward goods, increasing goods supply, and lowering the CR price level relative to trading partners. Under the bond tax, the CR exchange rate also appreciates, as reduced CR demand for US/WE bonds is interpreted by markets as a shift in capital flows favoring CR assets.&lt;/p&gt;
&lt;p&gt;Q: What explains the near-zero spillovers to neutral countries?
A: Two forces operate on NE in opposite directions. The expenditure-switching channel raises demand for NE goods and commodities, as taxing countries divert purchases away from taxed rival goods toward untaxed NE products — a positive demand shock for NE. The global income channel reduces demand for all goods, including NE&amp;rsquo;s, as the taxing and taxed regions become poorer and reduce imports from everywhere. In the calibration these two forces approximately cancel, leaving NE macroeconomic variables nearly unchanged.&lt;/p&gt;
&lt;p&gt;Q: How is the commodity sector modeled, and why does this matter for the commodity tax result?
A: Each country has a representative commodity firm using a linear production function (Y_iOt = A_iO * H_iOt), where A_iO is interpretable as a per-capita endowment of natural resources. Intermediate-good firms use a CES bundle of labor and commodities (domestic and imported) with elasticity xi=0.4 between the two. When the commodity import tax is imposed, firms face higher commodity input costs, raising real marginal costs and PPI inflation while depressing production. The asymmetry between commodity exporters (CR, NE) and importers (WE) under this tax is the main source of differential regional effects.&lt;/p&gt;
&lt;p&gt;Q: How are financial openness differences across country pairs captured, and what effect do they have?
A: Bond transaction costs psi_ijF differ across pairs: psi_12F = psi_21F = 0.01 for the US-WE pair (reflecting high financial integration), while all other pairs have psi_ijF = 1 — one hundred times higher — reflecting limited cross-bloc financial integration. The sensitivity analysis multiplies all psi_ijF by 5 (less open financial markets) and finds that bond position volatility falls but qualitative results are unchanged, confirming that the financial openness calibration does not drive the main results.&lt;/p&gt;
&lt;p&gt;Q: What are the main caveats acknowledged by the authors?
A: The model omits capital accumulation, so investment dynamics are absent. Cross-country production networks (global value chains) are not modeled, which the authors acknowledge limits the richness of the production structure relative to Baqaee-Farhi (2024) style models. Domestic financial markets are assumed frictionless. The model has no role for dollar dominance in the global economy, which may matter for exchange rate and capital flow dynamics in reality. These are flagged as directions for future research.&lt;/p&gt;
&lt;p&gt;Q: What is the key result for permanent (rho=1) versus temporary (rho=0.9) fragmentation shocks?
A: Under permanent shocks, output reductions become permanent rather than transitory. For the goods import tax, the effect on PPI inflation approaches zero in the permanent case, consistent with the closed-form prediction that the demand channel effect on PPI vanishes when the tax persists indefinitely (households no longer have an intertemporal substitution motive). The commodity tax permanent shock induces a larger and more persistent fall (rise) in production for commodity importers (exporters). Bond tax permanent shock has larger magnitude effects but is otherwise qualitatively similar to the temporary case.&lt;/p&gt;
&lt;p&gt;Q: How does FraNK relate to the existing DSGE literature on sanctions and trade wars?
A: The paper positions FraNK as providing a unified framework covering all three forms of fragmentation (goods, commodity, and financial) simultaneously, with nominal rigidities allowing for inflation analysis, closed-form analytical results for transparency, and a multi-country setup rather than small-open-economy. Ghironi et al. (2024) study sanctions in a three-country model but without nominal rigidities. Itskhoki and Mukhin (2022) analyze sanctions on Russia but in a small-open-economy. Attinasi et al. (2023) and Conteduca et al. (2024b) use richer production networks (Baqaee-Farhi) but are static and exclude financial fragmentation. FraNK trades production network richness for dynamics, nominal rigidities, financial fragmentation, and analytical tractability.&lt;/p&gt;
&lt;p&gt;Geoeconomic fragmentation: A policy-driven reversal of economic integration, often guided by strategic or geopolitical considerations, operationalized in FraNK as simultaneous increases in taxes on rival countries&amp;rsquo; goods imports, commodity imports, and bond purchases.&lt;/p&gt;
&lt;p&gt;Fragmentation shock: A simultaneous increase in three tax rates — goods import tax (tau), commodity import tax (tau_O), and bond tax (theta) — applied by each bloc against the other, representing the policy instruments through which integration is reversed.&lt;/p&gt;
&lt;p&gt;Demand channel (goods tax): The mechanism by which a goods import tax reduces aggregate demand, as households substitute away from now-more-expensive foreign goods, reducing output and — because this channel dominates the supply channel — lowering PPI inflation.&lt;/p&gt;
&lt;p&gt;Supply channel (commodity tax): The mechanism by which a commodity import tax raises intermediate input costs for firms, increasing real marginal costs and PPI inflation while reducing output — a purely cost-push effect with no offsetting demand-side force.&lt;/p&gt;
&lt;p&gt;Bond tax neutrality: Under symmetric calibration, capital controls on rival-bloc bonds are macroeconomically neutral because each country&amp;rsquo;s net foreign asset position is unchanged: the reduction in holdings of rival bonds is exactly matched by a reduction in own-bond issuance, leaving the IS curve and Phillips curve unaffected.&lt;/p&gt;
&lt;p&gt;Expenditure-switching channel: The force by which fragmentation between two blocs diverts import demand toward untaxed third-country (neutral) goods, generating a positive demand spillover for NE countries that roughly offsets the global income channel.&lt;/p&gt;
&lt;p&gt;Global income channel: The negative spillover to neutral countries arising from the reduction in world income caused by fragmentation between the taxing blocs, which reduces demand for all goods including those of neutral producers, approximately canceling the expenditure-switching channel.&lt;/p&gt;</description></item><item><title>From Doubt to Devotion: Trials and Learning-Based Pricing</title><link>https://macropaperwarehouse.com/papers/from-doubt-to-devotion-trials-and-learning-based-pricing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/from-doubt-to-devotion-trials-and-learning-based-pricing/</guid><description>&lt;p&gt;This paper studies a dynamic mechanism design problem in which an informed seller sells an experience good to a skeptical buyer who learns about the product through consumption. The central question is: how does a seller leverage proprietary data about product-buyer match quality together with the buyer&amp;rsquo;s ability to learn, and what are the welfare implications in equilibrium?&lt;/p&gt;
&lt;p&gt;The model features a seller who privately observes a binary match quality (theta in {H, L}) between their service and the buyer. The buyer does not observe match quality and has an initially unknown private value v for the good, drawn from a Myerson-regular distribution F with support [v_low, v_high] and normalized mean E[v] = 1. If the match is high, the buyer receives instantaneous utility rewards according to a Poisson process with flow rate lambda*I, where I in [0,1] is the seller-controlled access level. Upon receiving the first reward, the buyer perfectly learns both match quality theta and their own value v. The seller commits to a dynamic mechanism over time horizon T = [0, T] specifying access and prices conditional on reported histories. Both parties are risk-neutral and there is no discounting in the baseline.&lt;/p&gt;
&lt;p&gt;Two benchmark cases show the first-best is attainable absent both key features simultaneously. If trade is static (prices set only at time 0) or if the seller is uninformed about theta, the seller achieves first-best revenue of lambda&lt;em&gt;mu_0&lt;/em&gt;T by selling the entire service upfront. Proposition 1 establishes both cases; this implies that consumer data on theta is not required for maximizing social welfare, and it is weakly dominant for a seller to never collect consumer data in static environments.&lt;/p&gt;
&lt;p&gt;The central result is that the combination of dynamic pricing and seller private information breaks the first-best. A high-type seller can deviate by offering a &amp;ldquo;Myersonian free trial&amp;rdquo;: provide full access up to time tM (defined as argmax_t {(1 - exp(-lambda&lt;em&gt;t))&lt;/em&gt;(T - t)}), then offer the remaining service at post-trial price lambda&lt;em&gt;vM&lt;/em&gt;(T - tM), where vM is the Myerson monopoly price. The buyer accepts the trial regardless of beliefs (participation is weakly dominant) and purchases the post-trial service if and only if v &amp;gt;= vM. This deviation yields payoff pi_F = (1 - exp(-lambda&lt;em&gt;tM))&lt;/em&gt;(1 - F(vM))&lt;em&gt;lambda&lt;/em&gt;vM*(T - tM). Proposition 2 states that the first-best cannot be implemented in any equilibrium if and only if pi_F &amp;gt; lambda&lt;em&gt;mu_0&lt;/em&gt;T. Corollary 1 shows this condition holds for sufficiently large T, since pi_F grows proportionally with T while the first-best also grows with T but the ratio converges to a constant less than 1 only for some parameter configurations and exceeds 1 for others.&lt;/p&gt;
&lt;p&gt;Theorem 1 (the main mechanism design result) characterizes the boundary of the IC-IR feasible payoff set: any mechanism on this boundary is outcome-uniquely implemented by a trial mechanism, defined by a triple (v0, t0, p0) — a trial length, a post-trial value threshold, and a trial price. During [0, t0] uninformed buyers receive full access; after t0 only buyers who received a reward with v &amp;gt;= v0 continue at a premium. Trial length t0 is weakly increasing in the weight placed on the low-type seller and in the prior mu_0; post-trial threshold v0 is weakly decreasing in the same objects (Proposition 3).&lt;/p&gt;
&lt;p&gt;Equilibrium payoffs (Proposition 5) are precisely the IC-IR feasible pairs satisfying pi_H &amp;gt;= pi_F, implemented by pooling trial mechanisms in which both seller types propose identical mechanisms and the buyer updates beliefs only through private consumption signals. Under the D1 refinement (Proposition 6), only mechanisms with trial length tM and post-trial threshold vM survive. These have the shortest trial and highest post-trial price of all equilibrium mechanisms, minimize social surplus, and may leave both seller types strictly worse off than in a world without private information — directly contrasting the static informed principal result of Koessler and Skreta (2016) where data always helps the seller.&lt;/p&gt;
&lt;p&gt;When the seller can control service quality q in addition to access I (Section 6), the relevant equilibrium mechanisms become dynamic tiered pricing rather than binary trials: a low-quality, high-ad-load free tier provides learning opportunities while reducing information rents; convinced buyers upgrade to a premium ad-free tier. Counterintuitively, enriching the seller&amp;rsquo;s screening technology can reduce both revenue and social efficiency in equilibrium because additional instruments create additional signaling opportunities that distort outcomes further.&lt;/p&gt;
&lt;p&gt;Q: What is the core tension that prevents the first-best from being an equilibrium?&lt;/p&gt;
&lt;p&gt;A: When the seller is privately informed and pricing is dynamic, the high-type seller anticipates a greater likelihood of the buyer receiving a utility shock than the buyer&amp;rsquo;s own prior implies. This belief gap makes it profitable for the high-type seller to deviate from a proposed first-best mechanism by offering a free trial that &amp;ldquo;proves&amp;rdquo; high match quality and then extracting rent from convinced buyers. Because this deviation is profitable — yielding pi_F &amp;gt; lambda&lt;em&gt;mu_0&lt;/em&gt;T under some parameters — the first-best pooling contract unravels. The interaction of both ingredients (dynamic pricing and informed seller) is necessary: either ingredient alone is insufficient to break the first-best (Proposition 1).&lt;/p&gt;
&lt;p&gt;Q: What exactly is the Myersonian free trial and why does the buyer always accept it?&lt;/p&gt;
&lt;p&gt;A: The Myersonian free trial provides full service access up to time tM = argmax_t {(1 - exp(-lambda&lt;em&gt;t))&lt;/em&gt;(T - t)} at (approximately) zero price, then offers the remaining service at price lambda&lt;em&gt;vM&lt;/em&gt;(T - tM) where vM is the Myerson monopoly price. The buyer accepts the trial regardless of their prior belief about match quality because the trial itself is free and provides non-negative payoff. After the trial, the buyer purchases the post-trial service if and only if they received a reward with v &amp;gt;= vM; otherwise they exit. The deviation payoff is pi_F = (1 - exp(-lambda&lt;em&gt;tM))&lt;/em&gt;(1 - F(vM))&lt;em&gt;lambda&lt;/em&gt;vM*(T - tM).&lt;/p&gt;
&lt;p&gt;Q: Under what parametric conditions can the first-best not be supported in equilibrium?&lt;/p&gt;
&lt;p&gt;A: By Proposition 2, the first-best cannot be implemented if and only if pi_F &amp;gt; lambda&lt;em&gt;mu_0&lt;/em&gt;T. Corollary 1 states that for sufficiently large T this always fails, since as T grows, pi_F grows proportionally (the post-trial term (T - tM) dominates) while tM converges to a finite value. More precisely, for large T, pi_F / (lambda&lt;em&gt;mu_0&lt;/em&gt;T) converges to (1 - exp(-lambda*tM)) * (1 - F(vM)) * vM / mu_0, which exceeds 1 under appropriate parameter configurations. Conversely, when mu_0 is high or the service horizon is short, the first-best may remain implementable.&lt;/p&gt;
&lt;p&gt;Q: What is a trial mechanism and how does Theorem 1 characterize it?&lt;/p&gt;
&lt;p&gt;A: A trial mechanism is defined by a triple (v0, t0, p0): uninformed buyers receive full access on [0, t0] and no access thereafter; a buyer who reports a reward of value v &amp;gt;= v0 at time t receives full service for the remainder [t, T] at a price increment of lambda&lt;em&gt;v0&lt;/em&gt;(T - t0); the trial itself is priced at p0. Theorem 1 states that any payoff pair on the boundary of the IC-IR feasible set is outcome-uniquely attained by such a trial mechanism with appropriately determined (v0, t0, p0). The proof uses a relaxed problem retaining only two key constraint families: local incentive constraints on value reporting (IC-V) and a global intertemporal constraint preventing buyers from hiding the arrival of rewards forever (IC-U).&lt;/p&gt;
&lt;p&gt;Q: How does the trial length respond to changes in prior belief mu_0 and distributional spread?&lt;/p&gt;
&lt;p&gt;A: Proposition 3 states that t0 is weakly increasing in mu_0: as market belief becomes more optimistic, both seller types extract higher revenue from the trial, so the mechanism designer extends the trial. Proposition 4 adds that for a uniform distribution on [1-delta, 1+delta], trial length t0 is weakly increasing in delta (greater spread). The post-trial threshold v0 is weakly decreasing in mu_0, meaning that a more optimistic prior leads to a less exclusive post-trial cutoff.&lt;/p&gt;
&lt;p&gt;Q: What are the equilibrium payoffs and how does the high-type seller&amp;rsquo;s free-trial option constrain them?&lt;/p&gt;
&lt;p&gt;A: Proposition 5 states that (pi_L, pi_H) is an equilibrium payoff if and only if it lies in the IC-IR feasible set and pi_H &amp;gt;= pi_F. The lower bound pi_H &amp;gt;= pi_F reflects the high-type seller&amp;rsquo;s outside option: they can always deviate to the Myersonian free trial. Corollary 4 then shows that all &amp;ldquo;reasonable&amp;rdquo; equilibrium payoffs (those with pi_H &amp;gt;= pi_L, surviving a mild off-path refinement) are implemented by trial mechanisms with complete pooling — both seller types propose the same mechanism and the buyer updates beliefs only through private consumption signals, not the mechanism&amp;rsquo;s structure.&lt;/p&gt;
&lt;p&gt;Q: What does the D1 refinement select and why do it lead to worse outcomes?&lt;/p&gt;
&lt;p&gt;A: Proposition 6 shows that the only equilibrium trial mechanisms surviving the D1 criterion have trial length tM and post-trial threshold vM — the Myersonian free trial parameters. These have the shortest trial and highest post-trial price among all equilibrium mechanisms, resulting in the minimum social surplus. The intuition is that the high-type seller signals credibly by proposing mechanisms that generate high revenue from post-trial price discrimination (which the low type cannot profit from), pushing toward maximum learning-based discrimination. All D1-surviving payoffs are Pareto dominated by the point H (the unconstrained IC-IR optimum) for any prior mu_0, and Pareto dominated by point B when mu_0 is small.&lt;/p&gt;
&lt;p&gt;Q: Can having consumer preference data hurt the seller, and under what conditions?&lt;/p&gt;
&lt;p&gt;A: Yes. The distortion from signaling incentives can be so large that both seller types earn strictly less in the D1-surviving equilibrium than they would if neither possessed private information (where the first-best is attained). This result holds when the condition of Proposition 2 is satisfied — i.e., when pi_F &amp;gt; lambda&lt;em&gt;mu_0&lt;/em&gt;T. This contrasts sharply with the static result of Koessler and Skreta (2016), in which the ex-ante profit-maximizing mechanism is always supportable in equilibrium and data always (weakly) helps sellers.&lt;/p&gt;
&lt;p&gt;Q: How do trial mechanisms differ from the prior literature on signaling through introductory prices?&lt;/p&gt;
&lt;p&gt;A: The earlier literature (Milgrom and Roberts 1986; Bagwell 1987; Bagwell and Riordan 1991; Judd and Riordan 1994) uses two-period models with no seller commitment, so all pricing behavior is necessarily trial-like by model restriction. The present model instead allows the seller full flexibility to design any dynamic mechanism — including selling everything ex-ante, which would prevent buyers from gaining information rent. Trials emerge endogenously as the equilibrium outcome rather than being imposed by the model structure, and the paper provides new economic content on what determines trial length and price thresholds.&lt;/p&gt;
&lt;p&gt;Q: What happens when the seller controls service quality in addition to access?&lt;/p&gt;
&lt;p&gt;A: Section 6 extends the baseline by allowing the seller to choose (I, q) from a subset of [0,1]^2, where I governs the Poisson arrival rate and q scales the reward value (utility from a reward is v*q). Theorem 2 shows that the relevant equilibrium mechanisms now take the form of dynamic tiered pricing: a low-quality tier (interpreted as high ad load) provides learning opportunities while reducing information rents; once convinced, buyers upgrade to a premium high-quality tier. Enriching the screening technology in this way can reduce both revenue and social efficiency in equilibrium, because additional instruments create additional signaling opportunities that distort outcomes further from the revenue-maximizing benchmark.&lt;/p&gt;
&lt;p&gt;Q: What are the two sources of welfare loss relative to the first-best in D1-surviving equilibria?&lt;/p&gt;
&lt;p&gt;A: The welfare analysis in Appendix F identifies two sources. First, exclusion inefficiency: buyers with values v in [v_low, vM) who would generate positive surplus are excluded from post-trial service. Second, service truncation inefficiency: service access is cut off after trial length tM for buyers who were never convinced (theta = L type realizations and high-type buyers with v &amp;lt; vM), reducing total surplus below the first-best of mu_0 * lambda * T. Both losses are minimized (welfare is maximized) among trial mechanisms by longer trials and lower post-trial cutoffs, precisely the opposite of what D1 selects.&lt;/p&gt;
&lt;p&gt;Q: Does the model extend to continuous seller types or multiple buyer types?&lt;/p&gt;
&lt;p&gt;A: Appendix K outlines an extension to continuous seller types theta drawn from a distribution G on [theta_low, theta_high], where rewards arrive at rate lambda&lt;em&gt;I&lt;/em&gt;theta. The main economic forces persist: higher seller types anticipate faster buyer learning and have stronger incentives to offer trials. The main results generalize: equilibrium mechanisms are trial mechanisms, and under D1, pooling equilibria with maximum post-trial discrimination are selected. Appendix G similarly notes that the multiple-buyer-type extension preserves complete pooling and the D1 selection result.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the &amp;ldquo;global intertemporal constraint&amp;rdquo; (IC-U) in the proof of Theorem 1?&lt;/p&gt;
&lt;p&gt;A: The canonical approach to dynamic mechanism design (Eso and Szentes 2007; Pavan, Segal, and Toikka 2014) relaxes the problem to only local incentive constraints on the initial report. This fails here because the informed seller causes buyer and seller to disagree on the evolution of buyer beliefs, making the timing of trade matter and requiring tracking of incentive constraints at every point in time. The paper identifies two key binding constraints in the relaxed problem: (IC-V) the buyer does not misreport their reward value, and (IC-U) the buyer does not remain silent about the arrival of a reward forever. Retaining only these two constraint families yields a tractable bang-bang solution for the optimal access policy, which is then verified to satisfy all original IC-IR constraints.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for platform design and data collection strategy?&lt;/p&gt;
&lt;p&gt;A: The results imply that the value of consumer data depends critically on market dynamics. In static markets, collecting data about consumer match quality is weakly beneficial for sellers (Proposition 1, first point). In dynamic markets with buyer learning and sufficiently long service horizons, the same data can strictly reduce seller revenue by enabling a deviation that unravels first-best pricing. This suggests platforms in dynamic digital markets should weigh whether possessing and acting on proprietary match data improves or worsens their equilibrium position, and that regulatory attention to consumer data collection in dynamic markets may have welfare-ambiguous effects.&lt;/p&gt;
&lt;p&gt;Trial mechanism: A dynamic mechanism parameterized by (v0, t0, p0) in which the seller provides full service access during [0, t0] for uninformed buyers, offers continued service after t0 only to buyers who received a reward with value v &amp;gt;= v0, and charges a post-trial price of p0 + lambda&lt;em&gt;v0&lt;/em&gt;(T - t0) for those who qualify. In the paper&amp;rsquo;s usage, this is the unique outcome-implementing mechanism on the boundary of the IC-IR feasible payoff set.&lt;/p&gt;
&lt;p&gt;Myersonian free trial: The limiting trial mechanism as the trial price epsilon approaches zero, with trial length tM = argmax_t {(1 - exp(-lambda&lt;em&gt;t))&lt;/em&gt;(T - t)} and post-trial threshold vM equal to the Myerson monopoly price. It yields payoff pi_F = (1 - exp(-lambda&lt;em&gt;tM))&lt;/em&gt;(1 - F(vM))&lt;em&gt;lambda&lt;/em&gt;vM*(T - tM) to the high-type seller, and constitutes the binding outside option constraining equilibrium payoffs.&lt;/p&gt;
&lt;p&gt;Belief gap: The divergence between the seller&amp;rsquo;s and buyer&amp;rsquo;s beliefs about the rate at which the buyer will receive Poisson rewards. Because the high-type seller knows theta = H, they anticipate a higher probability of reward arrival than the buyer&amp;rsquo;s prior implies. This gap makes the buyer&amp;rsquo;s belief process non-martingale from the seller&amp;rsquo;s perspective, breaking the standard dynamic mechanism design approach and creating profitable deviation incentives.&lt;/p&gt;
&lt;p&gt;IC-IR feasible payoff set: The set of seller payoff pairs (pi_L, pi_H) achievable by mechanisms satisfying both incentive compatibility (for seller type reports and buyer learning reports) and individual rationality (non-negative ex-ante payoffs for all parties). Theorem 1 establishes that the boundary of this set is uniquely implemented by trial mechanisms.&lt;/p&gt;
&lt;p&gt;Dynamic tiered pricing: The equilibrium mechanism form that emerges when the seller controls both access I and service quality q. It features a low-quality tier (high ad load) providing learning opportunities at reduced information rent, and a premium tier offering full quality to buyers convinced of high match quality. This generalizes trial mechanisms to settings with richer screening technology.&lt;/p&gt;
&lt;p&gt;Global intertemporal constraint (IC-U): The constraint requiring that, upon receiving a Poisson reward, the buyer finds it suboptimal to remain silent about its arrival forever. Together with the local value-reporting incentive constraint (IC-V), these two constraints constitute the binding restrictions in the paper&amp;rsquo;s relaxed mechanism design problem, replacing the full continuum of incentive constraints that would otherwise be intractable.&lt;/p&gt;
&lt;p&gt;D1 criterion: A standard equilibrium refinement from signaling games applied here to the space of mechanism proposals. Among all pooling equilibrium trial mechanisms, D1 selects only those with parameters (tM, vM) — the shortest trial length and highest post-trial threshold — because the high-type seller has a strictly larger set of buyer responses for which deviation to a high-discrimination mechanism is profitable. These surviving mechanisms Pareto dominate no other equilibrium mechanism and minimize social surplus.&lt;/p&gt;</description></item><item><title>Gendered Spheres of Learning and Household Decision-Making over Fertility</title><link>https://macropaperwarehouse.com/papers/gendered-spheres-of-learning-and-household-decision-making-over-fertility/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/gendered-spheres-of-learning-and-household-decision-making-over-fertility/</guid><description>&lt;p&gt;This paper investigates whether information asymmetries within households about maternal health risk can explain persistent spousal disagreement over fertility in a high-fertility, high-maternal-mortality setting. The authors develop a theoretical model and conduct a randomized field experiment among approximately 500 couples in peri-urban Lusaka, Zambia, where the lifetime risk of maternal death is 1 in 59 women and the maternal mortality ratio is 398 deaths per 100,000 live births.&lt;/p&gt;
&lt;p&gt;The central mechanism is a communication barrier that arises from conflicting fertility preferences between spouses. When husbands have higher desired fertility than wives (4.43 vs. 4.19 children on average in the study sample), wives who are better informed about maternal health risk lack the incentive to credibly transmit that information to their husbands. Strategic communication concerns — not a generically lower propensity of men to learn from women — drive this asymmetry. The model predicts a pooling equilibrium in which no informative communication flows from wives to husbands when preference divergence is sufficiently large.&lt;/p&gt;
&lt;p&gt;The experiment randomized whether the maternal mortality information curriculum was delivered to the husband or the wife in each couple, with both spouses in all arms also receiving a family planning curriculum. This design isolates the incremental effect of the maternal mortality information and permits identification of direct versus spillover effects within the household.&lt;/p&gt;
&lt;p&gt;Consistent with the model, treated husbands significantly update their beliefs about maternal health risk factors, and their wives also update — information flows from husbands to wives. By contrast, treated wives update their own beliefs, but their husbands do not update at all. The test that spillover effects are symmetric is rejected (p-value = 0.097 for risk factors index; p-value &amp;lt; 0.001 for direct vs. indirect effects on men). The communication asymmetry is most pronounced among husbands who, at baseline, want a child as soon as possible — precisely the households with the greatest preference conflict.&lt;/p&gt;
&lt;p&gt;Both treatment arms reduce fertility. Households in which the husband is treated experience a 43% reduction in the probability of having a child or being pregnant in the year following the intervention. The fertility reduction is strongest when the wife faces higher ex ante risk based on her birth history, consistent with the model&amp;rsquo;s prediction that treatment effects are concentrated among households with high maternal health costs.&lt;/p&gt;
&lt;p&gt;The transfers evidence is the key differentiator between the two arms. When the wife is treated, fertility declines but is accompanied by a significant reduction in transfers from husband to wife, consistent with the wife updating her own beliefs without being able to convey them to her husband, who then reduces compensation. When the husband is treated, fertility declines without the same reduction in transfers — and treated husbands report higher communication with their spouse about family planning and higher relationship satisfaction. This combination is consistent with the husband treatment resolving the information gap directly, enabling efficient contracting, whereas the wife treatment leaves the information asymmetry in place.&lt;/p&gt;
&lt;p&gt;The study is conducted in informal settlements of Lusaka, a prime-age urban sample in which the average woman is 28 years old with 2.6 children at baseline. Scope conditions: results apply to a setting with very high maternal mortality, large baseline spousal fertility gaps, and strong traditional beliefs (55.5% of men cite marital infidelity as a leading cause of maternal complications). Generalizability to lower-risk or lower-preference-gap settings is explicitly circumscribed by the model&amp;rsquo;s comparative statics.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline gender gap in knowledge of maternal health risk?
A: Men are less likely than women to identify high parity (72.0% vs. 77.7%) and advanced maternal age (74.3% vs. 84.6%) as risk factors. In seven hypothetical scenarios rating complication likelihood on a 0–10 scale, men report lower scores than women in six out of seven cases. Despite Zambia&amp;rsquo;s 1-in-59 lifetime maternal mortality risk, only 27.6% of men (vs. 53.4% of women) report having attempted to discuss maternal health risk with their spouse.&lt;/p&gt;
&lt;p&gt;Q: What drives the gender gap in knowledge?
A: The authors argue the gap stems from &amp;ldquo;gendered spheres of direct and indirect knowledge accumulation of maternal labor and delivery outcomes.&amp;rdquo; Women are embedded in social networks where maternal mortality episodes are more salient: 11.0% of women report knowing a close friend who died giving birth, vs. 6.8% of men knowing a close friend whose wife died. The gap widens with social distance to the victim, suggesting women&amp;rsquo;s networks give them systematically more exposure to maternal mortality events.&lt;/p&gt;
&lt;p&gt;Q: How does the model explain the failure of within-household communication?
A: The model places husband and wife preferences as minimizing the distance between realized fertility and their respective net fertility optima (ideal fertility minus weighted maternal health cost). When the husband&amp;rsquo;s ideal fertility is high enough, he makes transfers to induce the wife to bear more children than her private optimum. Given these incentives, a wife who is informed about high health costs has an interest in exaggerating the cost to extract larger transfers. Because the husband anticipates this, no informative communication occurs in equilibrium — the only equilibrium is a pooling equilibrium where the wife&amp;rsquo;s message is uninformative regardless of her true cost realization.&lt;/p&gt;
&lt;p&gt;Q: What is the specific asymmetry in belief updating observed in the experiment?
A: Among treated husbands, both husbands and their wives update beliefs about maternal risk factors — information flows from husband to wife. Among treated wives, only the wife updates; her husband does not. The Wald test rejects equal direct and indirect effects on men at p &amp;lt; 0.001 and rejects symmetric spillovers at p = 0.097 for the risk factors index. There is no symmetric restriction binding for women&amp;rsquo;s updating across arms.&lt;/p&gt;
&lt;p&gt;Q: How large is the fertility effect and which arm drives it?
A: Households in which the husband is treated experience a 43% reduction in the probability of having a child or being pregnant in the year following the intervention. This effect is described as of the same order of magnitude as other household-level interventions shown to reduce pregnancy (citing Ashraf, Field, and Lee 2014). The fertility reduction is strongest among households where the woman faces higher ex ante risk based on birth history, consistent with the model&amp;rsquo;s Prediction 5 that effects are concentrated where theta_j is high.&lt;/p&gt;
&lt;p&gt;Q: How do transfers differ between the wife-treated and husband-treated arms?
A: When the wife is treated, the fertility decline is accompanied by a significant reduction in transfers from husband to wife. When the husband is treated, the fertility decline is not accompanied by a similar reduction in transfers. The authors interpret this pattern as: wife treatment leaves the husband uninformed, so he reduces transfers when he observes her reducing fertility without understanding why; husband treatment resolves the information gap, allowing efficient renegotiation without penalizing the wife.&lt;/p&gt;
&lt;p&gt;Q: Which husbands fail to update beliefs even when their wife is treated?
A: Husbands who at baseline want a child &amp;ldquo;as soon as possible&amp;rdquo; do not update their beliefs in response to their wife&amp;rsquo;s treatment status. These men also reduce transfers to their wife more than other groups when she is treated. In the model, these are precisely the households with the highest conflict of interest (high alpha_H), where the pooling equilibrium prediction is sharpest.&lt;/p&gt;
&lt;p&gt;Q: What is the role of traditional beliefs about maternal mortality?
A: 55.5% of men and 42.0% of women report (without prompting) marital infidelity as a leading cause of maternal labor and delivery complications — greater weight than assigned to lack of healthcare and poor health status combined. This stigma directly reduces women&amp;rsquo;s willingness to raise concerns about birth complications with their spouse, reinforcing the communication barrier the model formalizes.&lt;/p&gt;
&lt;p&gt;Q: What are the welfare implications of targeting men vs. women with information?
A: The fertility reduction from husband treatment is not inferior to that from wife treatment, but husband treatment also produces improvements in marital surplus — treated husbands report higher communication with spouse about family planning, higher relationship satisfaction, and greater closeness — whereas wife treatment reduces transfers to the wife, indicating she bears a financial cost. The authors argue male-targeted information can reduce unmet need for family planning while enhancing rather than exacerbating household conflict.&lt;/p&gt;
&lt;p&gt;Q: Does this paper provide field experimental evidence on strategic communication models?
A: The authors claim this is the first field experimental evidence directly testing models of strategic communication (Crawford and Sobel 1982; Mailath 1987; Crawford 1998, 2019), wherein persistent preference differences and conflict of interest impede communication and beliefs updating. Prior tests of these models were conducted in the lab; this paper provides the first real-world behavioral test with consequential decisions (fertility) in a high-stakes setting.&lt;/p&gt;
&lt;p&gt;Q: What is the unmet need for family planning in the study sample?
A: Overall, 32% of women in the sample report not using modern contraceptives at baseline. Of the 33% of women who want no more children, 27% are not using any modern contraceptive (8% of the overall sample). Of the 52% of women who wish to delay giving birth by at least one year, 23% are not using any modern contraceptive (12% of the overall sample).&lt;/p&gt;
&lt;p&gt;Q: How does the model characterize the husband&amp;rsquo;s partial internalization of maternal health costs?
A: The husband&amp;rsquo;s utility function includes the maternal health cost theta_j scaled by delta (0 ≤ delta ≤ 1), capturing how much weight he places on his wife&amp;rsquo;s risk. When delta is sufficiently high and the husband&amp;rsquo;s ideal fertility (alpha_H) is sufficiently low, or when his disutility of transfers (gamma) is sufficiently low, informative communication can occur after the husband is treated. When delta is low, the husband discounts his wife&amp;rsquo;s risk and communication barriers are more severe regardless of treatment.&lt;/p&gt;
&lt;p&gt;Maternal health cost (theta): A random variable representing the welfare cost borne by the wife from childbearing, including mortality risk and morbidity. In Zambia, distributed with a higher mean than the worldwide distribution. Enters the wife&amp;rsquo;s utility directly and the husband&amp;rsquo;s utility only scaled by delta, his degree of internalization of her cost.&lt;/p&gt;
&lt;p&gt;Gendered spheres of learning: The paper&amp;rsquo;s term for the systematic differential in experiential exposure to maternal mortality outcomes between men and women, arising from gender-segregated social networks. Women witness maternal mortality events more directly through closer social ties, while men&amp;rsquo;s networks provide systematically less exposure.&lt;/p&gt;
&lt;p&gt;Communication barrier (pooling equilibrium): The equilibrium outcome in the model where no informative signal is transmitted from an informed wife to her uninformed husband about the true realization of maternal health cost. Arises because the wife&amp;rsquo;s incentives to misreport are independent of the true cost realization, making any message uninformative when preference conflict is sufficiently large.&lt;/p&gt;
&lt;p&gt;Intra-household information spillover: The transmission of information learned by one spouse to the other as a consequence of the treated spouse&amp;rsquo;s belief update. The paper documents asymmetric spillovers: information flows from treated husbands to their wives, but not from treated wives to their husbands.&lt;/p&gt;
&lt;p&gt;Husband&amp;rsquo;s demand for children (alpha_H): The husband&amp;rsquo;s ideal fertility level, which governs the degree of preference conflict within the household. Baseline husband desire for a child as soon as possible serves as the empirical proxy for high alpha_H and is the key moderator of spillover and transfer effects.&lt;/p&gt;
&lt;p&gt;Degree of internalization (delta): The parameter in the husband&amp;rsquo;s utility function (0 ≤ delta ≤ 1) capturing how much weight he places on his wife&amp;rsquo;s maternal health cost. When delta is high and gamma (disutility of transfers) is low, communication can occur in equilibrium after the husband is treated.&lt;/p&gt;
&lt;p&gt;Unmet need for family planning: Women who wish to space or limit births but are not using modern contraception. In the study sample, 32% of women report not using modern contraceptives at baseline, with substantial shares among both those wanting no more children and those wishing to delay.&lt;/p&gt;</description></item><item><title>Genetic Prediction and Adverse Selection</title><link>https://macropaperwarehouse.com/papers/genetic-prediction-and-adverse-selection/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/genetic-prediction-and-adverse-selection/</guid><description>&lt;p&gt;This paper asks how much adverse selection would arise in critical illness insurance (CII) markets if consumers can observe polygenic indexes (PGIs) — genetic risk scores derived from millions of genetic variants — while insurers are legally barred from using genetic information. The authors develop an econometric method that measures selection under current PGI technology, then extends identification to expected future PGI accuracy using heritability bounds, even though future PGIs are not yet observable in data.&lt;/p&gt;
&lt;p&gt;The primary dataset is the UK Biobank (UKB), comprising approximately 446,570 genotyped individuals of European-like ancestry linked to NHS electronic health records. The authors study seven single-disease CII contracts (Alzheimer&amp;rsquo;s disease, breast cancer, coronary artery disease, colorectal cancer, prostate cancer, schizophrenia, and type 2 diabetes) and multiple-disease bundled contracts paying a lump sum upon onset. The econometric model assumes a probit disease probability, Gaussian PGI structure, and identification relies on published heritability estimates to pin down future PGI predictive power. The key selection metric is the implicit tax proposed by Hendren (2013): the percentage markup a marginal consumer must pay above her actuarially fair price due to adverse selection. The authors use the minimum implicit tax up to the 80th percentile of risk (t80) as their summary statistic, with market unraveling benchmarked at t80 between 43% and 83% from prior literature.&lt;/p&gt;
&lt;p&gt;The paper reports three main findings, all scoped to a population of 35-year-olds in the standard insurer risk class (those whose predicted risk falls within 0.75–1.25 times the population mean).&lt;/p&gt;
&lt;p&gt;First, under current PGI technology with full consumer adoption, selection is noticeable but heterogeneous across diseases. t80 ranges from 17.9% for coronary artery disease to 117.9% for Alzheimer&amp;rsquo;s disease. Coronary artery disease and colorectal cancer fall in the middle of the no-unraveling range; breast cancer, schizophrenia, and type 2 diabetes fall between the no-unraveling and unraveling ranges; Alzheimer&amp;rsquo;s disease and prostate cancer (t80 = 59.8%) reach or exceed the unraveling range. The current prostate cancer PGI explains 9.9% of liability variance, adding 8.3 percentage points over the 22.9% explained by non-genetic covariates.&lt;/p&gt;
&lt;p&gt;Second, under expected future PGI accuracy — bounded below by SNP heritability and above by twin heritability — selection becomes potentially crippling. Under the lower bound (Scenario 3L), t80 ranges from 57.5% for breast cancer to above 1,000% for Alzheimer&amp;rsquo;s. Under the upper bound (Scenario 3U), t80 exceeds 100% for all seven single-disease contracts and exceeds 1,000% for three of them. For prostate cancer, the reference case, t80 reaches 86.8% under Scenario 3L and 426.9% under Scenario 3U — far above Hendren&amp;rsquo;s unraveling benchmarks. For multiple-disease male contracts, t80 = 30.8% under current technology, rising to 54.4% (Scenario 3L) and 243.9% (Scenario 3U).&lt;/p&gt;
&lt;p&gt;Third, variation in selection across contracts is driven primarily by: the predictive power of the future PGI, the incremental predictive power over non-genetic covariates, and disease prevalence. Alzheimer&amp;rsquo;s and schizophrenia — high heritability, low prevalence — display the highest implicit taxes; breast and colorectal cancer — lower SNP heritability, lower incremental R2 — display the lowest.&lt;/p&gt;
&lt;p&gt;These findings are corroborated by a calibrated Akerlof-Einav-Finkelstein equilibrium model using HRS data: current PGI availability reduces equilibrium market quantity from 30% to 21.4%; future PGI availability drives equilibrium quantity to zero in a full adverse selection death spiral. Partial take-up robustness checks show that even at 50% consumer adoption, selection remains problematically high under future PGI accuracy for most contracts. The analysis is restricted to individuals of European-like ancestry due to data availability constraints.&lt;/p&gt;
&lt;p&gt;Q: What is the core market failure the paper analyzes?
A: The paper analyzes adverse selection arising from an asymmetric information gap: consumers can observe PGI-based disease risk predictions from consumer genetic tests (e.g., 23andMe), while insurers in many jurisdictions are legally prohibited from requesting or using genetic information. This creates a situation where high-risk consumers have private information allowing them to sort into insurance, driving up average claims costs and potentially unraveling the market.&lt;/p&gt;
&lt;p&gt;Q: What is a polygenic index (PGI) and why does it differ from classical genetic testing?
A: A PGI is a weighted sum of millions of genetic variants (typically over one million) each with individually tiny effects, constructed using effect-size estimates from genome-wide association studies (GWASs). This contrasts with traditional genetic testing focused on rare single-gene mutations (e.g., BRCA for breast cancer or PKD for kidney disease), which are rare, explain small shares of population-level disease variance, and can largely be inferred from family history. PGIs target common polygenic diseases and are the primary driver of the adverse selection concern because they aggregate diffuse genetic signals into a meaningful risk prediction.&lt;/p&gt;
&lt;p&gt;Q: What are the current PGI R2 values for the seven diseases studied?
A: Estimated on the liability scale in the UKB, current PGI R2 values are: Alzheimer&amp;rsquo;s disease 7.1%, breast cancer 6.7%, coronary artery disease 2.5%, colorectal cancer 2.2%, prostate cancer 9.9%, schizophrenia 4.9%, and type 2 diabetes 7.4%. These represent the share of liability variance explained by each disease&amp;rsquo;s current PGI in the study sample.&lt;/p&gt;
&lt;p&gt;Q: How does the paper identify the degree of selection under future PGI technology that does not yet exist in the data?
A: The identification strategy combines three elements: the normality of PGI distributions, the relationship between current and future PGIs (the current PGI is modeled as a noisy version of the future PGI with an independent Gaussian error), and published heritability estimates that bound the future PGI&amp;rsquo;s predictive power. Theorem 1 establishes that under five stated assumptions — including a probit disease model and known future R2 from heritability studies — the full joint distribution of loss, current PGI, future PGI, and non-genetic covariates is identified from observed data.&lt;/p&gt;
&lt;p&gt;Q: What heritability bounds are used for the future PGI scenarios, and why two bounds?
A: Scenario 3L sets future PGI R2 equal to each disease&amp;rsquo;s SNP heritability (estimated from common genetic variants), which the authors treat as a conservative lower bound because future PGIs will also incorporate rarer variants with better effect-size precision. Scenario 3U sets future PGI R2 equal to twin heritability, treating it as an upper bound since the theoretical maximum predictive power of a PGI is the trait&amp;rsquo;s narrow-sense heritability. For prostate cancer, these bounds are 18.0% (SNP) and 57.0% (twin); for Alzheimer&amp;rsquo;s, SNP heritability is 33.1% and twin heritability is 58%.&lt;/p&gt;
&lt;p&gt;Q: What is the implicit tax and how is it used as a benchmark?
A: The implicit tax t(r) for a consumer with private risk r equals the percentage by which her insurance cost exceeds her own actuarially fair price when she must pool with all consumers of equal or higher risk. It measures how much the marginal buyer overpays due to adverse selection. The authors follow Hendren (2013) in reporting t80, the minimum implicit tax up to the 80th percentile. Hendren&amp;rsquo;s benchmarks: t80 between 7–35% for markets that did not unravel; t80 between 43–83% for markets that had unraveled.&lt;/p&gt;
&lt;p&gt;Q: What are the single-disease contract results under current PGI technology (Scenario 2)?
A: With full consumer adoption of current PGI technology, t80 ranges from 17.9% for coronary artery disease to 117.9% for Alzheimer&amp;rsquo;s disease. Coronary artery disease (17.9%) and colorectal cancer (26.5%) fall in the middle of Hendren&amp;rsquo;s no-unraveling range. Breast cancer (36.9%), schizophrenia (42.1%), and type 2 diabetes (37.0%) fall between the no-unraveling and unraveling ranges. Alzheimer&amp;rsquo;s disease (117.9%) and prostate cancer (59.8%) reach or exceed the unraveling range.&lt;/p&gt;
&lt;p&gt;Q: What are the single-disease contract results under future PGI technology?
A: Under the lower bound (Scenario 3L, R2 = SNP heritability), t80 ranges from 57.5% for breast cancer to above 1,000% for Alzheimer&amp;rsquo;s disease. Under the upper bound (Scenario 3U, R2 = twin heritability), t80 exceeds 100% for all seven contracts and exceeds 1,000% for three (Alzheimer&amp;rsquo;s, schizophrenia, and at least one other). These figures substantially exceed Hendren&amp;rsquo;s unraveled-market benchmarks for virtually all contracts.&lt;/p&gt;
&lt;p&gt;Q: What drives cross-disease variation in the implicit tax?
A: The authors identify three main drivers: the expected accuracy of future PGI (higher heritability → higher implicit tax), the incremental predictive power of the future PGI over non-genetic covariates observable by insurers (more incremental information → more adverse selection), and disease prevalence (lower prevalence concentrates risk heterogeneity, amplifying selection). Alzheimer&amp;rsquo;s disease and schizophrenia — high heritability and low prevalence — have the highest implicit taxes. Breast and colorectal cancers — lower SNP heritability and lower incremental R2 — have the lowest.&lt;/p&gt;
&lt;p&gt;Q: What do the multiple-disease bundled contract results show?
A: For the male multiple-disease contract under Scenario 2 (current PGI), t80 = 30.8%, comparable to Hendren&amp;rsquo;s no-unraveling range. Under Scenario 3L, t80 = 54.4%; under Scenario 3U, t80 = 243.9%, both in or above the unraveling range. The female contract yields qualitatively similar results. Implicit taxes in bundled contracts are generally lower than in single-disease contracts, suggesting some diversification of genetic risk across diseases.&lt;/p&gt;
&lt;p&gt;Q: What does the calibrated equilibrium model find?
A: Using an Akerlof (1970) / Einav-Finkelstein-Cullen (2010) supply-and-demand model calibrated to match a 30% market participation rate and a 50% loss ratio in the UK CII market, and using HRS data on individual risk aversion, the model finds that current PGI availability reduces equilibrium quantity from 30% to 21.4%. Future PGI availability (both Scenario 3L and 3U) drives equilibrium quantity to zero — a complete adverse selection death spiral with no trade.&lt;/p&gt;
&lt;p&gt;Q: How robust are results to partial consumer adoption of genetic testing?
A: At 10% consumer take-up, selection is low regardless of PGI accuracy. At 50% take-up, selection remains problematically high for all single-disease contracts under future PGI accuracy (Scenarios 3L and 3U). For multiple-disease contracts at 50% take-up, t80 falls just below Hendren&amp;rsquo;s unraveling threshold under Scenario 3L but enters the unraveling range under Scenario 3U. This suggests market problems would materialize once predictive power exceeds the SNP heritability bound and take-up exceeds roughly 50%.&lt;/p&gt;
&lt;p&gt;Q: What role do risk preferences play, and do they confound the results?
A: The authors test whether risk tolerance correlates with disease risk in the UKB using a self-reported general risk tolerance measure. They find extremely low correlations between risk tolerance and each disease. This is consistent with low correlation between relative risk aversion and disease risk in the HRS calibration, and supports the finding that correlation between risk and risk preferences is unlikely to meaningfully affect the main results.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s assessment of preventive treatment as a mitigating factor?
A: The authors acknowledge that genetic testing could enable personalized preventive medicine, which would reduce actual disease incidence among high-risk individuals. However, they argue this is unlikely to substantially affect their main findings because the most commonly covered diseases under CII are cancers, for which preventive behaviors have bounded effectiveness.&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s policy implications?
A: The paper situates the genetic information problem within the standard regulatory framework for selection markets, distinguishing laissez-faire (allow genetic underwriting — efficient but potentially unfair to high-risk consumers), government provision (unattractive for non-essential CII), and managed competition (community rating combined with subsidies and risk adjustment). The authors argue that a full ban on genetic underwriting — the current policy in many countries — may become untenable as PGI accuracy improves, because it generates potentially crippling adverse selection. Some level of community rating may remain desirable for redistribution, but needs to be paired with subsidies or risk adjustment to prevent market collapse.&lt;/p&gt;
&lt;p&gt;Q: What are the main data and scope limitations?
A: The analysis is restricted to individuals of European-like ancestry because most large GWASs were conducted in European ancestry samples and PGIs perform poorly across ancestries. The UKB sample was aged 40–69 at recruitment and the analysis adjusts for age-dependent covariates; the HRS replication uses approximately 20,000 individuals. The equilibrium model ignores moral hazard and uses a parsimonious binary loss framework. The paper does not specify a timeline for when PGI accuracy will reach heritability bounds.&lt;/p&gt;
&lt;p&gt;Polygenic Index (PGI): A weighted sum of an individual&amp;rsquo;s genetic variants across the genome (typically over one million variants), constructed using effect-size estimates from a genome-wide association study (GWAS) conducted in an independent sample. It is a noisy proxy for the individual&amp;rsquo;s true additive genetic factor for a disease, and its predictive power is bounded above by the trait&amp;rsquo;s narrow-sense heritability.&lt;/p&gt;
&lt;p&gt;Implicit Tax: A measure of adverse selection defined by Hendren (2013) as the percentage by which a consumer with private risk r must overpay relative to her own actuarially fair price if she is pooled with all consumers of equal or higher risk. The minimum implicit tax up to the 80th percentile of risk (t80) serves as the paper&amp;rsquo;s primary summary statistic; t80 above roughly 43% is associated with market unraveling in prior literature.&lt;/p&gt;
&lt;p&gt;SNP Heritability: The share of variance in a disease&amp;rsquo;s liability attributable to the set of common genetic variants (SNPs) used in heritability estimation. Used in this paper as a conservative lower bound on the predictive power of future PGIs, because future PGIs will additionally capture rarer variants.&lt;/p&gt;
&lt;p&gt;Twin Heritability: An estimate of a trait&amp;rsquo;s narrow-sense (additive) heritability computed by comparing resemblance of monozygotic twins (sharing 100% of their genomes) to dizygotic twins (sharing ~50% on average). Used as an upper bound on future PGI predictive power, since heritability is the theoretical maximum R2 for a PGI.&lt;/p&gt;
&lt;p&gt;Standard Risk Class: The set of consumers whose predicted disease risk (based on non-genetic covariates observable to insurers) falls between 0.75 and 1.25 times the population-wide average risk, following standard insurance underwriting practice. Insurers charge the same premium to all consumers in this class; any variation in risk within the class due to private genetic information constitutes the source of adverse selection analyzed in this paper.&lt;/p&gt;
&lt;p&gt;Private Risk Function: The probability rho(g, w) of contracting the disease conditional on both the consumer&amp;rsquo;s observed PGI g and non-genetic factors w. Contrasted with the non-genetic private risk function pi(w), which conditions only on non-genetic covariates. The dispersion of the private risk distribution across consumers in the same risk class determines the degree of adverse selection.&lt;/p&gt;
&lt;p&gt;Adverse Selection Death Spiral: The Akerlof (1970) mechanism in which high-risk consumers disproportionately purchase insurance, causing insurers to raise premiums, which deters low-risk consumers, which further raises the average risk of purchasers, ultimately driving equilibrium quantity to zero. The paper&amp;rsquo;s calibrated equilibrium model finds this outcome under future PGI accuracy for the HRS CAD contract.&lt;/p&gt;</description></item><item><title>Germs in the Family: The Short- and Long-Term Consequences of Intra-Household Disease Spread</title><link>https://macropaperwarehouse.com/papers/germs-in-the-family-the-short-and-long-term-consequences-of-intra-household-disease-spread/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/germs-in-the-family-the-short-and-long-term-consequences-of-intra-household-disease-spread/</guid><description>&lt;p&gt;This paper studies the short- and long-term consequences of intra-household respiratory disease transmission from older to younger siblings in Danish families. The central research questions are: (1) how do respiratory illnesses spread from preschool-aged older siblings to younger infant siblings during the first year of life, and (2) how does respiratory disease exposure during infancy causally affect younger siblings&amp;rsquo; long-term economic, human capital, and health outcomes?&lt;/p&gt;
&lt;p&gt;The study uses population-level Danish administrative data covering 1,230,180 children from 37 birth cohorts (1981–2017), linking records from the National Patient Register, income and labor market registers, education registers, and psychiatric care registers. The identification strategy combines birth order variation in respiratory disease vulnerability with within-municipality variation in local respiratory disease prevalence among children aged 13–71 months. The authors construct a municipality-level disease exposure index—cumulative respiratory hospitalizations per 100 children aged 13–71 months in a child&amp;rsquo;s municipality over their first 12 months of life—and estimate the differential effect of this index on younger versus older siblings, controlling for municipality fixed effects, birth year-month fixed effects, and an extensive set of individual and family background characteristics.&lt;/p&gt;
&lt;p&gt;The descriptive findings are stark: younger siblings have 2–3 times higher rates of hospitalization for acute respiratory conditions during their first year of life compared to older siblings at the same age, with the gap largest at ages two and three months. The gap is larger for winter births, shorter birth spacing, and when older siblings attend childcare centers—all patterns consistent with the older sibling serving as a disease vector.&lt;/p&gt;
&lt;p&gt;On the causal estimates, moving from the 25th to the 75th percentile of the disease exposure index distribution increases the younger sibling&amp;rsquo;s acute respiratory hospitalizations in the first year of life by 0.023 (32.9 percent above the sample mean), with effects more than twice as large for exposure in the first six months compared to the second six months.&lt;/p&gt;
&lt;p&gt;In the long run, an interquartile increase in first-year respiratory disease exposure reduces younger siblings&amp;rsquo; wage earnings (conditional on employment) at ages 25–32 by 0.8 percent and total income by 0.8 percent, and reduces their income percentile rank by 0.3 percentage points. There is no significant effect on labor force participation at the extensive margin. Effects on earnings are approximately twice as large when exposure is measured in the first six months of life. These earnings effects are comparable in magnitude to those from a 10 percent reduction in birth weight or a 9 percent increase in ambient air pollution at birth, and correspond to roughly two-thirds of the adult earnings impact of in utero exposure to the 1918 Spanish Influenza. When the disease index interaction is included, the main birth order coefficient declines by approximately 70 percent, suggesting intra-household disease transmission is an important channel underlying the documented birth order earnings disadvantage.&lt;/p&gt;
&lt;p&gt;Additional findings include: a 0.5 percentage point reduction in high school graduation and a 0.6 percentage point reduction in college graduation (interquartile effects); a 0.01 standard deviation penalty in ninth grade Danish test scores; a 20 percent increase (0.016 per hundred per year) in chronic respiratory hospitalizations at ages 16–26; and a 6.1 percent increase (0.5 additional visits per hundred per year) in psychiatric clinic visits at ages 16–26. Breastfeeding mitigates short-term effects, with 15 months of breastfeeding sufficient to entirely offset the elevated hospitalization risk.&lt;/p&gt;
&lt;p&gt;Scope conditions: findings apply to second-born relative to first-born children in Danish sibling pairs with at least 11 months birth spacing; long-term estimates are net of parental compensatory responses and any immunity benefits, and thus represent lower bounds of the uncompensated biological impact of respiratory illness in infancy.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the birth order gap in acute respiratory hospitalizations during infancy, and what patterns support an intra-household transmission mechanism?
A: Younger siblings have 2–3 times higher hospitalization rates for acute respiratory conditions in the first year of life compared to older siblings at the same age, with the gap especially large at ages two and three months. The gap is larger for winter births (when respiratory viruses circulate more), for siblings with shorter birth spacing, and when the older sibling attends a childcare center. Hospitalizations for non-infectious digestive diseases and injuries show no analogous birth order differences, ruling out differential parental healthcare-seeking as an explanation.&lt;/p&gt;
&lt;p&gt;Q: How is the disease exposure index constructed and what variation does it exploit?
A: The index is the cumulative count of acute respiratory hospitalizations per 100 children aged 13–71 months in a child&amp;rsquo;s municipality over their first 12 months of life, with the older sibling excluded from the count when applicable. It exploits irregular spatial and temporal waves of respiratory viruses (such as RSV and influenza) across Danish municipalities. The interquartile range of this index captures meaningful variation in community disease burden faced by infants across different places and years.&lt;/p&gt;
&lt;p&gt;Q: What is the first-stage relationship between the disease index and infant hospitalizations?
A: Moving from the 25th to the 75th percentile of the disease index increases younger siblings&amp;rsquo; acute respiratory hospitalizations in the first year of life by 0.023 (a 32.9 percent increase relative to the sample mean), while the effect on older siblings is substantially smaller. The interaction coefficient in the preferred specification implies that one additional hospitalization per 100 community children aged 13–71 months raises the younger sibling&amp;rsquo;s hospitalization count by 0.012 more than the older sibling&amp;rsquo;s. Effects are more than twice as large for exposure in the first compared to the second six months of life.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated long-term effects on adult earnings, and how do they compare to benchmarks in the literature?
A: An interquartile increase in first-year respiratory disease exposure reduces younger siblings&amp;rsquo; wage earnings at ages 25–32 by 0.8 percent and total income by 0.8 percent, with a 0.3 percentage point reduction in income percentile rank. These magnitudes are comparable to a 1 percent earnings reduction from a 10 percent birth weight reduction (Black et al., 2007), a 1 percent earnings reduction from a 9 percent increase in ambient air pollution (Isen et al., 2017b), and roughly two-thirds of the in utero Spanish Influenza effect (Almond, 2006).&lt;/p&gt;
&lt;p&gt;Q: Does the birth order earnings disadvantage reflect intra-household disease transmission?
A: When the interaction between birth order and the disease index is excluded, the regression finds a 1.9 percent birth order earnings disadvantage for second-born children (consistent with Black et al., 2005 range of 1.2–4.2 percent). When the interaction is included, the main birth order coefficient declines by approximately 70 percent, suggesting that disease transmission from older to younger siblings is an important channel driving the birth order earnings penalty.&lt;/p&gt;
&lt;p&gt;Q: Are effects larger for exposure in the first versus second six months of life?
A: Yes, consistently across all outcomes. The interaction coefficient for acute respiratory hospitalizations is more than twice as large when exposure is measured in the first versus second six months. Effects on wage earnings are approximately 60 percent larger for first-half exposure, and effects on income rank are two to three times larger. This is consistent with biomedical evidence that infants&amp;rsquo; immune systems mature around six months when solid food introduction begins.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on educational outcomes?
A: An interquartile increase in first-year respiratory disease exposure reduces the likelihood of high school graduation by 0.5 percentage points (0.6 percent at the sample mean) and college graduation by 0.6 percentage points (1.7 percent at the sample mean), with effects approximately 60 percent larger when measuring first-half exposure. A 0.01 standard deviation reduction in ninth grade Danish test scores is also found. A back-of-the-envelope calculation using Danish returns to schooling suggests the reduction in educational attainment can explain approximately half of the estimated earnings effect.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on chronic respiratory and mental health outcomes?
A: An interquartile increase in first-year exposure increases chronic respiratory hospitalizations (asthma, COPD) at ages 16–26 by 0.016 per hundred per year (20 percent above the sample mean), with significant increases also apparent at ages one to two. For mental health, the same exposure is associated with 0.5 additional psychiatric clinic visits per hundred per year at ages 16–26 (6.1 percent above the sample mean), with effects becoming more significant in the early twenties. Effects on mental health from this paper are smaller than those estimated for more extreme fetal and early childhood shocks such as Ramadan exposure or maternal bereavement.&lt;/p&gt;
&lt;p&gt;Q: What does the acute respiratory trajectory look like beyond infancy?
A: Elevated acute respiratory hospitalizations persist at age one, then there is a reduction at ages two to three consistent with an immunity formation hypothesis, but this protective effect disappears by age four. There is no significant increase or decrease in acute respiratory hospitalizations at older ages, in contrast to the persistent increase found for chronic respiratory conditions.&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is found in short-term effects?
A: Effects on infant respiratory hospitalizations are larger for low birth weight children, for male infants (consistent with the fragile male hypothesis), for siblings with shorter birth spacing, and for sibling pairs where the older child attends childcare. The monotonic decline in effect size with increasing birth spacing is the opposite of what would be predicted if differential parental time investment were the main mechanism, supporting intra-household disease spread as the operative channel.&lt;/p&gt;
&lt;p&gt;Q: What is the role of breastfeeding as a moderator?
A: Using supplementary data on breastfeeding duration (covering 2009–2016, matched to 7.6 percent of the sample), the authors find that the impact of disease exposure on younger siblings&amp;rsquo; infancy hospitalizations declines significantly with longer breastfeeding duration. A linear specification implies that 15 months of breastfeeding entirely offsets the elevated hospitalization risk from higher disease exposure. Second-born children breastfed for less than half a month are particularly vulnerable to acute respiratory infections.&lt;/p&gt;
&lt;p&gt;Q: How do the authors validate the identifying assumption?
A: Three validation exercises are used. First, results are robust to adding municipality-specific linear and quadratic trends and maternal fixed effects. Second, using family background characteristics as outcomes in the interaction regression, at most two of fourteen coefficients are significant in any specification, and all effect sizes are less than one percent of sample means. Third, using alternative disease indices based on non-infectious digestive diseases and injuries shows no differential effects for younger siblings, ruling out a parental healthcare-seeking confound.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The authors highlight breastfeeding support policies (paid family leave, workplace lactation accommodations), RSV vaccination campaigns for pregnant women and monoclonal antibody prophylaxis for infants, sick pay regulations, and childcare attendance policies as levers to reduce infant respiratory disease burden. They argue that current cost-benefit evaluations of such policies likely undercount the long-term human capital and earnings benefits. The COVID-19 pandemic illustrates the mechanism: restrictions reduced RSV spread during 2020 potentially benefiting infants with older siblings, while the subsequent RSV surge in 2021–2022 may have exposed later cohorts to above-average disease burden.&lt;/p&gt;
&lt;p&gt;Respiratory Disease Exposure Index: A municipality-level cumulative measure of acute respiratory hospitalizations per 100 children aged 13–71 months assigned to each child over their first 12 months of life (or first and second six months separately), designed to proxy for community respiratory disease burden faced by infants from slightly older children, with the child&amp;rsquo;s own older sibling excluded from the count.&lt;/p&gt;
&lt;p&gt;Intra-Household Disease Transmission: The mechanism by which preschool-aged older siblings, exposed to respiratory viruses in group childcare settings, bring home those viruses and infect younger infant siblings who are in a vulnerable stage of immune and brain development, creating a within-family externality in health outcomes.&lt;/p&gt;
&lt;p&gt;Differential Birth Order Effect (Identification): The quasi-experimental design exploits the interaction between birth order (younger siblings are more exposed to older siblings&amp;rsquo; illnesses) and local disease prevalence variation to identify causal impacts, netting out the main effects of both birth order and local disease environment through municipality and birth year-month fixed effects.&lt;/p&gt;
&lt;p&gt;Immunity Formation Hypothesis: The conjecture that early respiratory disease exposure may have a protective effect on later acute respiratory illness through immune system training; supported in the data by reduced acute hospitalizations at ages two to three, though this protection disappears by age four and does not prevent chronic respiratory disease development.&lt;/p&gt;
&lt;p&gt;Dynamic Complementarities with Sibling Health Spillovers: An extension of the Cunha-Heckman framework: while standard models incorporate investment complementarities across time periods for a given child, this paper&amp;rsquo;s findings imply that sibling health spillovers create differential returns to early-life health investments by birth order, since disease asymmetries between older and younger siblings are not incorporated in existing theoretical models.&lt;/p&gt;
&lt;p&gt;Net Long-Term Effects: The estimated long-run impacts incorporate not only the direct biological effects of respiratory illness on the younger sibling but also any parental compensatory responses and immunity benefits; thus they represent lower bounds of the uncompensated biological impact, as parental compensation would attenuate the measured sibling difference.&lt;/p&gt;</description></item><item><title>Growth Experiences and Trust in Government</title><link>https://macropaperwarehouse.com/papers/growth-experiences-and-trust-in-government/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/growth-experiences-and-trust-in-government/</guid><description>&lt;p&gt;This paper investigates whether individuals who have experienced stronger GDP growth over their lifetimes are more likely to trust their national government. The authors — Besley, Dann, and Dray — assemble a newly harmonized global dataset comprising approximately 3.3 million respondents across 166 countries since 1990, drawn from 11 major opinion surveys (Afrobarometer, Americasbarometer, Arabarometer, Asiabarometer, European Social Survey, Gallup World Poll, Integrated Values Survey, Latinobarometer, Life in Transition Survey, South Asia Barometer, and World Justice Project). They supplement this with longer-run U.S. evidence from the American National Election Studies (ANES) going back to 1958, covering respondents born as early as the 1880s, and longitudinal Swiss evidence from the Swiss Household Panel (SHP) which allows individual fixed-effects estimation.&lt;/p&gt;
&lt;p&gt;The core methodological contribution is the exploitation of country-cohort variation in lifetime GDP growth experiences. Following Malmendier and Nagel (2011), the authors construct a weighted average of past growth realizations across an individual&amp;rsquo;s lifetime, with weights decaying linearly over time (lambda = 1), so that more recent growth receives greater weight. The baseline specification includes country fixed effects, cohort-by-subcontinent fixed effects, survey-by-survey-year fixed effects, controls for log GDP per capita at year of birth, and individual characteristics (sex, marital status, education, religious denomination). More demanding specifications add country-by-survey-year and country-by-age fixed effects. For Switzerland, individual fixed effects are included, fully absorbing time-invariant personal characteristics.&lt;/p&gt;
&lt;p&gt;The main finding is that a one standard deviation increase in lifetime GDP growth experience — corresponding to approximately 2 percentage points of additional growth — is associated with a 2.1 percentage point increase in the probability of trusting the national government, significant at the 1 percent level. This corresponds to roughly 0.042 standard deviations of the trust outcome and approximately 5 percent of the global mean trust in government. The effect is quantitatively meaningful: it approximates between one-quarter and one-half of the difference in average trust between older and younger cohorts in India and Italy, respectively. For the U.S. ANES sample, a one standard deviation increase in growth experience (about 0.2 percentage points) increases trust in the federal government by 2.4 percentage points, explaining more than two-thirds of the average trust gap between Baby Boomers (born 1946–1964) and Millennials (born 1981–1996).&lt;/p&gt;
&lt;p&gt;Several scope conditions and heterogeneity findings sharpen the interpretation. First, the growth-trust link is specific to government institutions: there is no statistically significant effect of growth experience on interpersonal trust or trust in religious organizations, indicating the channel runs through perceptions of state performance rather than generalized social capital. Second, a recency heuristic operates: the linearly decaying weighting function (lambda = 1) outperforms both an unweighted lifetime average (lambda = 0) and a formative-years weighting. Growth experienced during formative years (ages 18–25) or before birth has no detectable effect on trust in government; the pre-birth result serves as a placebo test. Third, the positive growth-trust relationship is stronger in democracies than in autocracies, which the authors interpret as democracies producing citizens more responsive to government performance signals. Fourth, a &amp;ldquo;trust paradox&amp;rdquo; emerges: unconditionally, average trust in government is lower in democracies than in autocracies, and longer democratic experience is associated with lower trust, which the authors attribute to democratic institutions generating greater citizen skepticism about government performance. Fifth, core results are robust to controlling for other lifetime politico-economic experiences including inflation, banking and currency crises, epidemics, political unrest, executive turnover, stock market returns, and income inequality. The Swiss evidence further shows that private income growth experience does not drive the result — only aggregate macroeconomic growth does.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s core quantitative finding on the growth-trust relationship?
A: Using the global harmonized dataset of 3.3 million respondents across 166 countries, a one standard deviation increase in lifetime GDP growth experience (corresponding to approximately 2 percentage points of additional growth) is associated with a 2.1 percentage point increase in the probability of trusting the national government, significant at the 1 percent level. Using only the Gallup World Poll subsample (roughly half the observations), the estimated effect is somewhat larger at 3.6 percentage points per standard deviation increase. These estimates remain statistically significant under more demanding specifications with country-by-survey-year and country-by-age fixed effects, though the magnitudes decrease as these interacted fixed effects absorb variation in recent growth experiences.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure individual lifetime growth experience?
A: The growth experience variable is a weighted average of all past annual GDP per capita growth rates since an individual&amp;rsquo;s birth, with weights that decay linearly over time (lambda = 1 in the Malmendier-Nagel framework). Under this parameterization, the measure simplifies to how much recent economic performance (in the year prior to the survey) exceeds the long-run mean over the respondent&amp;rsquo;s lifetime, scaled by the respondent&amp;rsquo;s midpoint of life. This implies younger individuals are more sensitive to recent growth outcomes because their shorter life histories give recent events relatively greater weight. The authors validate this lambda = 1 choice via a grid search over alternative weighting structures using minimum residual sum of squares as the criterion.&lt;/p&gt;
&lt;p&gt;Q: How is reverse causality addressed?
A: The empirical strategy identifies the relationship using past, cumulative growth experiences measured prior to the survey, so current trust in government cannot cause past growth. Survey-year fixed effects absorb all aggregate time trends simultaneously affecting trust and growth. The authors also conduct a placebo test showing that GDP growth occurring before an individual&amp;rsquo;s birth has a precisely estimated null effect on their trust in government, which would not be the case if unobserved societal trends were jointly driving both growth histories and political perceptions.&lt;/p&gt;
&lt;p&gt;Q: Does growth experience affect interpersonal trust or trust in non-state institutions?
A: No. The estimated coefficient on lifetime growth experience is statistically insignificant at conventional levels when interpersonal trust replaces trust in government as the dependent variable, with narrow confidence intervals indicating a precisely estimated null. Similarly, growth experience has no systematic effect on trust in religious organizations such as churches or mosques. The authors interpret these null results as evidence against the alternative explanation that broad modernizing social changes are jointly driving both growth experiences and political trust.&lt;/p&gt;
&lt;p&gt;Q: What do the U.S. ANES results add?
A: The ANES data, which extends back to 1958 and captures cohorts born as early as the 1880s, provide a within-country test controlling for state fixed effects, generation dummies, and rich individual characteristics including partisan affiliation and partisan strength. A one standard deviation increase in U.S. growth experience (approximately 0.2 percentage points) raises trust in the federal government by 2.4 percentage points, significant at the 1 percent level. This estimate is quantitatively large enough to explain more than two-thirds of the average trust gap between Baby Boomers and Millennials. Results are robust to adding state-by-survey-year fixed effects and birth-state-by-generation fixed effects, and hold for a broader &amp;ldquo;trust in government index&amp;rdquo; covering beliefs about waste, corruption, and responsiveness of the federal government.&lt;/p&gt;
&lt;p&gt;Q: What do the Swiss Household Panel results contribute?
A: The SHP allows individual fixed-effects estimation, exploiting within-person changes in growth experience and trust over time from 1999 onward, which absorbs all time-invariant individual characteristics that could confound the global and U.S. cross-cohort results. The growth experience coefficient remains positive and significant, with a one standard deviation increase yielding a 1.9 percentage point increase in trust in the Swiss federal government (significant at the 1 percent level). The Swiss data also uniquely allow the authors to test whether personal income growth experience drives the result; they find no significant effect of private income growth experience on trust in government, only aggregate macroeconomic growth matters.&lt;/p&gt;
&lt;p&gt;Q: Does the recency heuristic hold — does growth in formative years matter?
A: No. The authors find no detectable effect of growth experienced specifically during formative years (ages 18–25) on trust in government. Additionally, in a grid-search exercise assessing model fit across different lambda values, the linearly decaying weighting scheme (lambda = 1, giving more weight to recent growth) outperforms both equal-weighted lifetime averages (lambda = 0) and weighting schemes that emphasize earlier life experiences (lambda less than 0). The pre-birth placebo result (null effect) and the absence of a formative-years effect together indicate that the operative mechanism is about evaluating current government performance based on recent macroeconomic experience, not the imprinting of long-lasting political dispositions during youth.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;trust paradox&amp;rdquo; and how is it documented?
A: The trust paradox refers to the empirical finding that average trust in government is lower in democracies than in autocracies at the cross-country level, and that longer experience with democratic institutions within countries is associated with lower levels of trust in government in the micro data. This is counterintuitive given the standard view that good institutions should foster confidence in government. The authors suggest the paradox likely reflects democracies cultivating greater citizen skepticism and more critical judgment of government performance, rather than indicating that democratic governance actually performs worse. Importantly, the positive effect of growth experience on trust remains present in democracies, and the growth-trust relationship is actually stronger in democratic regimes, consistent with citizens in democracies being more responsive to government performance signals.&lt;/p&gt;
&lt;p&gt;Q: How is the growth-trust finding related to corruption perceptions and living standards?
A: Using the Gallup World Poll, the authors find that stronger lifetime growth experience is associated with lower perceived corruption in government, greater satisfaction with personal living standards, and higher likelihood of feeling one lives comfortably on one&amp;rsquo;s present income. These results are consistent with citizens attributing economic success to government competence and integrity, and with growth translating into perceptions of improved personal circumstances through both direct income effects and indirect public goods provision.&lt;/p&gt;
&lt;p&gt;Q: Are the results robust to controlling for other lifetime politico-economic experiences?
A: Yes. When the authors include lifetime experience measures for political unrest, executive turnover, epidemic exposure, banking crises, currency crises, and inflation (both levels and volatility) simultaneously in equation (3), the growth experience coefficient remains consistently positive, stable, and significant across all specifications. Among the other experience variables, only lifetime unrest and epidemic exposure are independently negative and statistically significant at conventional levels. F-tests reject the null hypothesis that the crisis and growth experience coefficients are equal in magnitude. The U.S. results are also robust to adding lifetime experiences with S&amp;amp;P 500 returns, unemployment, and top-income-share inequality measures.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The authors note that sustained economic growth may itself be a mechanism for building political trust, with positive downstream effects for policy compliance — a connection they document has been relevant during the COVID-19 pandemic (where higher-trust societies showed lower mobility during lockdowns and higher vaccine acceptance). The growth-trust channel could have implications for increasing compliance across a range of policy domains including climate action and tax morale. Governments that deliver sustained economic growth can expect citizens to update their trust upward, particularly in democracies where citizens are more performance-responsive, while governments that preside over stagnation or contraction face predictable erosion of political legitimacy across cohorts.&lt;/p&gt;
&lt;p&gt;Growth experience: A weighted average of all past annual GDP per capita growth realizations since an individual&amp;rsquo;s birth, with weights that decay linearly over time following Malmendier and Nagel (2011), so that more recent growth receives greater weight. Under the paper&amp;rsquo;s preferred parameterization (lambda = 1), the measure equals how much last year&amp;rsquo;s GDP per capita exceeds the respondent&amp;rsquo;s lifetime mean, scaled by the respondent&amp;rsquo;s midpoint of life.&lt;/p&gt;
&lt;p&gt;Trust in government: A binary dummy variable equal to one if a survey respondent expresses &amp;ldquo;a great deal&amp;rdquo; or &amp;ldquo;quite a lot&amp;rdquo; of trust or confidence in the national government, constructed from harmonized responses across 11 major opinion surveys. The paper treats this as reflecting respondents&amp;rsquo; perceptions of government performance rather than a deep interpersonal trust relationship.&lt;/p&gt;
&lt;p&gt;Trust paradox: The empirical regularity documented in the paper whereby average trust in government is unconditionally lower in democracies than in autocracies at the cross-country level, and whereby longer democratic experience within countries is associated with lower individual trust in government. The authors attribute this to democratic institutions generating more critical citizen judgment of government performance.&lt;/p&gt;
&lt;p&gt;Recency heuristic: The finding that more recent growth experiences carry greater weight in forming trust in government, as captured by the linear decay weighting scheme (lambda = 1) outperforming equal-weighted or early-life-weighted alternatives. Growth before birth and growth during formative years (ages 18–25) have no detectable effect, while recent macroeconomic performance is the operative signal.&lt;/p&gt;
&lt;p&gt;Cohort-level variation: The within-country differences in lifetime growth experiences across birth cohorts that form the paper&amp;rsquo;s primary identification strategy. Because different cohorts in the same country have lived through different sequences of growth episodes, differences in trust across cohorts within a country can be attributed to differential growth exposure rather than time-invariant country characteristics.&lt;/p&gt;
&lt;p&gt;Formative years effect: The hypothesis, tested and rejected in the paper, that economic experiences during ages 18–25 have a lasting imprint on political attitudes analogous to formative-years effects found in other political behavior literatures. The paper finds no statistically significant association between growth experienced during these years and trust in government.&lt;/p&gt;
&lt;p&gt;Source text origin: In the pipeline context relevant to this paper&amp;rsquo;s acquisition, this refers to whether a summary was generated from full working paper text (&amp;ldquo;pdf&amp;rdquo; or &amp;ldquo;oa-html&amp;rdquo;) versus abstract only (which is hard-blocked). The working paper was obtained from LSE Research Online (eprint 129614), classified as published version under CC BY 4.0.&lt;/p&gt;</description></item><item><title>Health Shocks, Health Insurance, Human Capital, and the Dynamics of Earnings and Health</title><link>https://macropaperwarehouse.com/papers/health-shocks-health-insurance-human-capital-and-the-dynamics-of-earnings-and-health/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/health-shocks-health-insurance-human-capital-and-the-dynamics-of-earnings-and-health/</guid><description>&lt;p&gt;Capatina and Keane build and calibrate a life-cycle model of labor supply and savings for U.S. men that incorporates health shocks, endogenous human capital accumulation via learning-by-doing, employer-sponsored health insurance (ESHI), means-tested social insurance, and endogenous medical treatment decisions. The model is calibrated to White males using the Medical Expenditure Panel Survey (MEPS) for 2000–2013, supplemented by CPS, HRS, and PSID data; separate calibrations are presented for Black and Hispanic men with high school or less education.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central research question is how health shocks affect labor supply, earnings, and earnings inequality over the life cycle, and through which mechanisms. Four channels are identified and quantified: (1) the direct labor supply effect — sick days and reduced tastes for work caused by health shocks; (2) the human capital effect — reduced work experience from health-shock-induced employment exits, which deteriorates future job and wage offers in a snowball dynamic; (3) the health-productivity effect — reduced functional health directly lowering wage offers; and (4) the behavioral effect — anticipation of health risk induces low-skill workers lacking ESHI to curtail labor supply to maintain means-tested transfer eligibility.&lt;/p&gt;
&lt;p&gt;The key quantitative findings from eliminating serious health shocks for working-age men (ages 25–64) are: the expected present value of lifetime earnings (PVE) for White men rises by 11% on average, and inequality in PVE falls by 12% (coefficient of variation). For White men with high school or less education the increase in PVE is 17.9%. For the typical White male the four channels contribute 5.7%, 2.7%, 1.4%, and 0.8% respectively. For low-skill White high school men the same channels contribute 10.7%, 14.8%, 1.3%, and 9.8% — with the human capital and behavioral effects dramatically larger for the low-skill group. For comparison, a severe health shock at age 40 reduces the present value of remaining lifetime earnings by 5.6% (approximately $53.9k) for a typical college man and by 11.5% (approximately $55.0k) for a typical high school man.&lt;/p&gt;
&lt;p&gt;Human capital amplification operates through employment persistence: a major health shock causes full-time employment to drop by 12 percentage points one year after the shock for the average man, and by 20 percentage points for high school men, with recovery still incomplete eight years later (employment remains 7.8 pp and 10 pp below baseline, respectively). Holding human capital fixed as in the pre-shock baseline causes employment to recover quickly, confirming that persistent wage-offer deterioration is the mechanism.&lt;/p&gt;
&lt;p&gt;On health insurance policy, the model evaluates providing public insurance to all workers lacking ESHI. This substantially increases medical utilization, improves health and life expectancy (survival to age 65 rises from 82% to 87% when health shocks are eliminated, as a related benchmark), reduces Medicaid and free-care costs, and raises labor supply among low-skill workers by weakening means-tested transfer incentives. The net program cost in a balanced budget simulation is modest, and all agent types are ex ante better off. By contrast, expanding Medicaid access creates perverse labor supply disincentives — workers reduce labor supply to maintain eligibility — does little to improve health, and makes almost all agents worse off in a balanced budget scenario.&lt;/p&gt;
&lt;p&gt;Scope conditions: the primary calibration covers non-institutionalized civilian White males; results for Blacks and Hispanics are presented only for the high school or less education group due to small samples. The model period ends at 2013, before ACA implementation.&lt;/p&gt;
&lt;p&gt;Q: What is the model&amp;rsquo;s overall estimate of how much health shocks reduce lifetime earnings for White men?
A: Eliminating serious health shocks at working ages (25–64) would increase the expected present value of lifetime earnings (PVE) for the average White male by 11% and reduce inequality in PVE by 12% as measured by the coefficient of variation. For White men with high school or less education the PVE gain is larger at 17.9%.&lt;/p&gt;
&lt;p&gt;Q: What are the four channels through which health shocks affect earnings, and how large is each for the average White male versus a low-skill high school male?
A: The four channels are (1) direct labor supply via sick days and reduced tastes for work, (2) human capital deterioration from lost work experience worsening future job/wage offers, (3) reduced health productivity lowering wage offers, and (4) behavioral responses to health risk reducing labor supply to preserve transfer eligibility. For the average White male the contributions to PVE are 5.7%, 2.7%, 1.4%, and 0.8%, respectively. For low-skill White high school men the same channels contribute 10.7%, 14.8%, 1.3%, and 9.8% — the human capital and behavioral effects are roughly five to twelve times larger for the low-skill group.&lt;/p&gt;
&lt;p&gt;Q: Why is the human capital effect so much larger for low-skill high school men than for college men?
A: Low-skill high school men are much more likely to exit full-time employment following a major health shock and are slow to return. Lifetime work years decline by 1.89 for the typical high school man versus only 0.84 for the typical college man following a major shock at age 40. Because job offer probabilities depend on lagged employment, absence from the labor market creates a snowball effect that persistently depresses offer quality; human capital accounts for 42% of the earnings decline for high school men versus 34% for college men.&lt;/p&gt;
&lt;p&gt;Q: How does the paper characterize the persistent employment effects of a major health shock?
A: For the average man, full-time employment drops by 12 percentage points one year after a severe shock and remains 7.8 pp below baseline after eight years. For high school men the initial drop is 20 pp, still 10 pp below baseline after eight years; for college men the figures are 7 pp and 3 pp. When human capital is held fixed at the pre-shock baseline — so wage and job offers do not deteriorate due to lost experience — employment recovers quickly for workers of all skill levels, confirming the human capital mechanism drives the persistence.&lt;/p&gt;
&lt;p&gt;Q: How does the behavioral effect operate for low-skill workers?
A: Workers without ESHI who face health risk have an incentive to maintain sufficiently low income and assets to qualify for means-tested social insurance, which provides a consumption floor approximating Medicaid, Food Stamps, SSDI, and SSI. This perverse incentive leads low-skill workers to curtail labor supply preemptively. When health risk is eliminated, this incentive disappears and labor supply rises, generating the behavioral effect of 9.8% of PVE for low-skill high school men versus only 0.8% for the average White male.&lt;/p&gt;
&lt;p&gt;Q: How does the paper correct for under-reporting of health shocks among the uninsured?
A: The measurement model assumes health shocks are correctly measured for the treated, but uninsured workers who do not seek treatment only record a shock with a shock-specific probability less than one. A key identifying assumption is that, conditional on health status, risk factors, age, and education, the true frequency of health shocks does not differ by insurance status per se — ruling out ex ante moral hazard. The measurement model parameters are calibrated to match observed frequencies of health shocks and high risk in MEPS for the uninsured.&lt;/p&gt;
&lt;p&gt;Q: What does the model estimate regarding the effect of a severe health shock on cumulative earnings relative to existing reduced-form evidence?
A: The model predicts an average cumulative (non-discounted) earnings loss of $42.8k over ten years following a severe shock for men aged 50, compared with Smith&amp;rsquo;s (2004) estimate of $37k from the HRS. The paper argues Smith&amp;rsquo;s estimate identifies effects on workers who actually experience shocks, who are a selected sample with low baseline earnings (as untreated shocks are more likely to be severe, and non-treaters tend to have low earnings). The model&amp;rsquo;s &amp;ldquo;average effect&amp;rdquo; — comparing a world where everyone experiences the shock to one where no one does — yields a substantially higher loss of $59.8k.&lt;/p&gt;
&lt;p&gt;Q: What are the key findings from the public insurance experiment (providing insurance to the uninsured)?
A: Providing public insurance to all workers lacking ESHI substantially increases medical utilization among the previously uninsured, who are intrinsically less healthy. This improves health and life expectancy, raising Social Security costs. However, it also generates positive labor supply incentives for low-skill workers (reducing their reliance on means-tested transfers), substantially reduces Medicaid and free-care costs, and increases tax revenue. On balance, the net program cost in a balanced budget simulation is modest, and all types of workers are ex ante better off.&lt;/p&gt;
&lt;p&gt;Q: Why does expanding Medicaid access produce perverse results in contrast to providing public insurance?
A: Medicaid is means-tested, so expanded access requires workers to maintain sufficiently low income and assets to remain eligible. This creates disincentives to work and save — workers reduce labor supply to preserve eligibility. The result is reduced earnings, lower tax revenue, little improvement in health (as access to care depends on maintaining low income), and almost all agents being worse off in a balanced budget scenario.&lt;/p&gt;
&lt;p&gt;Q: What role does insurance play beyond consumption smoothing in this model?
A: Beyond lowering out-of-pocket (OOP) costs and smoothing consumption, insurance grants access to care: in the US system, proof of insurance is often required before treatment, so uninsured workers may not have the option to treat at all. The model captures three distinct option sets for the uninsured — all options available, treatment not available, or default not available — each motivated by different real-world contexts. Non-treatment worsens health transition probabilities, so the access-granting role of insurance independently affects health trajectories beyond its cost-reducing role.&lt;/p&gt;
&lt;p&gt;Q: What explains the observed positive association between education, income, insurance, and health transitions in the data, and how does the model generate this without education entering the health production function directly?
A: The association between education and health is largely driven by the positive correlation between education and latent health types; controlling for latent health type in a descriptive logit largely eliminates the education coefficient. The association between insurance and health transitions is driven by the fact that the insured are more likely to receive treatment; controlling for treatment and true shocks eliminates the insurance coefficient. Education affects health indirectly through its effects on treatment decisions — via wages, job offers with ESHI, and consumption capacity — without appearing as a direct argument in the health production function.&lt;/p&gt;
&lt;p&gt;Q: How large are the effects of health shocks on key population health statistics according to the model?
A: Eliminating serious health shocks at working ages would increase the fraction of working-age men in good health from 60% to 75% and raise the probability of survival to age 65 from 82% to 87%. Average annual sick days of 16.42 would be eliminated, implying a 6% increase in work days for employed workers and an employment rate increase from 88% to 91%. Average annual medical costs would fall from $4,618 to $1,132.&lt;/p&gt;
&lt;p&gt;Q: How do the results for Black and Hispanic men compare to White men?
A: The results are qualitatively similar, but the magnitudes for Black men are somewhat larger. Eliminating health shocks would raise PVE for Whites, Blacks, and Hispanics with high school or less education by 17.9%, 23.7%, and 17.7%, respectively. Separate access-to-care probabilities are calibrated for each group, reflecting racial disparities in access that explain part of the observed differences in health outcomes and treatment rates.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the consumption floor (means-tested social insurance) in shaping equilibrium outcomes for low-skill workers?
A: The consumption floor guarantees a minimum household consumption level approximating Medicaid, Food Stamps, SSDI, and SSI. It shields low-skill workers from the full cost of health shocks, reducing both the consumption-smoothing value of ESHI and precautionary saving incentives. However, it also creates a powerful disincentive for low-skill workers without ESHI to work, as earning above the eligibility threshold would eliminate benefits. This mechanism amplifies earnings inequality by generating perverse labor supply behavior concentrated among low-skill, uninsured workers.&lt;/p&gt;
&lt;p&gt;Functional Health (H): A discrete stock variable (Poor, Fair, or Good) measuring aspects of health that directly affect worker productivity and tastes for work; distinguished from asymptomatic health risk. Transitions depend on lagged health, latent health type, age, persistent health shocks, and whether shocks are treated.&lt;/p&gt;
&lt;p&gt;Asymptomatic Health Risk (R): A binary state (low or high) capturing risk factors such as obesity, high cholesterol, and hypertension that increase the probability of future health shocks but do not affect current productivity.&lt;/p&gt;
&lt;p&gt;Human Capital Effect: The channel by which health shocks reduce lifetime earnings not directly but indirectly — by causing employment exits that slow work experience accumulation, which in turn deteriorates future job offer probabilities and wage offers in a persistent, self-reinforcing (snowball) dynamic.&lt;/p&gt;
&lt;p&gt;Behavioral Effect: The reduction in labor supply — and associated earnings loss — that occurs because workers facing health risk and lacking ESHI have an incentive to keep income and assets low enough to maintain eligibility for means-tested social insurance, even absent any contemporaneous health shock.&lt;/p&gt;
&lt;p&gt;Tied Wage-Hours-Insurance Offer: The model&amp;rsquo;s labor market structure in which employment offers jointly specify a wage rate, hours (no offer, part-time, or full-time), and whether the offer includes ESHI; workers accept or reject the bundle rather than choosing hours and insurance independently.&lt;/p&gt;
&lt;p&gt;Source Text Origin: The paper&amp;rsquo;s own term distinguishing how the full text of a paper was obtained (PDF, OA-HTML, or abstract-only); used in the summarization pipeline. [Note: this concept is from the summarization pipeline metadata, not from the paper itself — omitting.]&lt;/p&gt;
&lt;p&gt;Treatment/Payment Options: The set of decisions available to a worker after a health shock occurs — whether to seek treatment and, if treated, whether to pay the out-of-pocket cost or default on bills. The available choice set differs by insurance status and context: the uninsured may face denial of access (option to treat unavailable) or required prepayment (default unavailable), or may have all options including free care.&lt;/p&gt;
&lt;p&gt;Latent Health Type: An unobserved permanent individual characteristic capturing innate biological resilience and pre-age-25 health investments; determines baseline transition probabilities for functional health conditional on shocks. Positively correlated with latent skill type within education groups.&lt;/p&gt;</description></item><item><title>Heterogeneity and the Macro-Economic Effects of Changes in Loan-to-Value Limits</title><link>https://macropaperwarehouse.com/papers/heterogeneity-and-the-macro-economic-effects-of-changes-in-loan-to-value-limits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/heterogeneity-and-the-macro-economic-effects-of-changes-in-loan-to-value-limits/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;De Veirman and de Jong develop a new approach to estimating the macroeconomic effects of changes in regulatory loan-to-value (LTV) limits on mortgage loans. The central questions are: (1) how do changes in an LTV cap translate into changes in the average LTV and, through that channel, into house prices and real output; and (2) how do heterogeneity in the cross-sectional LTV distribution, non-linearity, and asymmetry shape those effects?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation and Gap&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior empirical literature on macroprudential LTV policy typically pools across countries using coded indicator variables, which imposes the restriction that all LTV policy actions have the same effect regardless of the size of the change or the position of the limit relative to the distribution. Standard TANK models with homogeneous borrowers imply either full symmetry or threshold asymmetry precisely at the point where the constraint ceases to bind. The authors are the first to relate borrower heterogeneity to non-linearity and asymmetry in LTV policy effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical application focuses on the Netherlands, which introduced an LTV cap of 106 percent on August 1, 2011, subsequently reduced in annual one-percentage-point steps to 100 percent by January 2018. Cross-sectional LTV distributions are constructed from the De Nederlandsche Bank Loan Level Data (LLD), covering 77-81 percent of outstanding Dutch mortgage debt in 2012Q4-2014Q4, restricted to borrowers aged 35 or younger as a proxy for first-time buyers. A survey-based average LTV series spanning 1979-2015 was fielded in January 2016 across the CentERpanel and LISS panel (7,943 respondents combined; 2,238 usable observations after cleaning), measuring LTV at the time of first home purchase. This survey-based annual LTV series, together with the log relative house price, log real GDP, and the real mortgage rate, forms a four-variable Vector Error Correction Model (VECM) estimated over 1981-2015, with a single cointegrating vector identified by Johansen maximum likelihood.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors&amp;rsquo; core innovation is to translate changes in the LTV cap into changes in the cross-sectional average LTV by applying each successive cap level to the underlying distribution: observations above the cap are moved to the cap value (with adjustments for exceptions in the ex post variant). These implied annual changes in the average LTV serve as a succession of impulses fed into the VECM. Two variants are implemented: an ex ante approach using only the pre-cap 2010M8-2011M7 distribution, and an ex post approach that uses the most recent empirical distribution prior to each cap change. The Cholesky identification ordering is [LTV, house prices, GDP, mortgage rate].&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Non-trivial macroeconomic effects of Dutch LTV policy: Under the ex post approach (the preferred estimate), the imposition of the cap at 106 percent in 2011 and its gradual reduction to 100 percent by 2018 imply, twenty years after the first shock, that relative house prices are 4.84 percent lower and real GDP is 1.15 percent lower than they would have been in the absence of the cap sequence. The bulk of these responses materializes within ten years, at 4.18 percent and 1.05 percent respectively.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Non-linearity: For a given underlying distribution, changes in the cap have progressively larger effects as the cap tightens. In the ex ante approach, the fraction of households constrained by the cap rises from approximately 20 percent at a limit of 105 percent to approximately 40 percent at a limit of 100 percent. A 10 percentage point tightening from 110 to 100 percent implies a long-run relative house price response of 6.12 percent, while a tightening from 100 to 90 percent implies a response of 14.27 percent — a pronounced non-linearity traceable to the substantial mass of observations in the 90-110 range of the Dutch distribution.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Heterogeneity matters substantially: In mean-preserving comparisons using Pearson-family approximations to the pre-cap Dutch distribution, the macroeconomic effects of the actual Dutch LTV policy sequence are 2.58 times larger in the high standard deviation case (standard deviation 25 percent above the Dutch baseline of 17.09) than in the low standard deviation case (standard deviation 25 percent below). Specifically, twenty-year house price responses are 12.34 percent (high SD) versus 4.79 percent (low SD), and GDP responses are 2.93 percent versus 1.14 percent.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Asymmetry is conditional on the position of the cap relative to the distribution: For the Dutch distribution, symmetry is a good approximation for LTV limits at around 80 percent or lower, where the cap is binding for the bulk of households. Asymmetry is pronounced for higher levels. At an initial cap of 100 percent, the absolute effect of a ten-percentage-point tightening is 2.33 times that of a ten-percentage-point loosening. At 80 percent, the asymmetry ratio is only 1.17. Tightenings have smaller effects when they start from a point where few households are constrained; conversely, loosenings can have larger effects when starting from a point where many are constrained.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Homogeneity assumption understates effects above the mean LTV: Under the homogeneous-borrower benchmark (all borrowers at the Dutch mean of 93.72 percent), asymmetry is infinite at cap levels of 100 and 95 percent but zero at other levels — a feature that causes effects to be entirely absent for caps above the mean. In the heterogeneous Dutch setting, an increase in the LTV limit from 95 to 105 percent raises house prices by 10.72 percent in the long run; the homogeneous case implies no effect at all.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Caveats&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper does not address welfare or financial stability effects. The VECM impulse responses do not establish economic causality. Anticipation effects — if households front-loaded high-LTV purchases before the cap — would cause the procedure to overstate the effect. The LTI robustness check (which smooths the loan-to-income ratio due to noisy survey responses) yields twenty-year responses of 3.32 percent (house prices) and 0.74 percent (GDP), somewhat lower than the baseline, indicating that not controlling for LTI tends to overstate the LTV-macroeconomy connection. The approach requires a usable pre-cap or recent-prior LTV distribution; it is not directly portable to settings where a loosening is studied and no recent pre-cap distribution is available.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the fundamental identification challenge this paper faces, and how does the proposed approach address it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The standard challenge is that LTV caps are changed infrequently and have no long time series suitable for regression, so panel studies typically pool countries and use coded dummy variables that impose size-independence of effects. The authors bypass this by using the cross-sectional LTV distribution itself: they measure how each cap level would truncate the underlying distribution and track the implied change in the cross-sectional mean LTV, which is then fed as a shock into a time-series VECM. This approach does not require the cap to have been in place previously, imposes no cross-country coefficient restrictions, and explicitly accounts for the size of the policy change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the ex ante and ex post approaches to translating cap changes into average LTV changes, and how do their cumulative estimates differ?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The ex ante approach applies all successive cap levels to the single pre-cap distribution of 2010M8-2011M7 (after correcting for the June 2011 sales-tax reduction from 6 to 2 percent), without allowing for exceptions. The ex post approach uses the most recent empirical distribution prior to each cap change and accounts for the observed share of borrowers above the cap as exceptions. The ex ante approach yields a cumulative decline in the average LTV of 3.08 percentage points over 2011-2018; the ex post approach yields 1.96 percentage points, roughly one percentage point less. The difference is largely concentrated in 2011-2012 and stems from the ex ante approach not accounting for exceptions to the cap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the paper correct for the coincident 2011 sales-tax reduction, and why does this matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In June 2011, the Dutch sales tax on housing purchases fell from 6 to 2 percent, approximately coinciding with the August 2011 imposition of the LTV cap. Without correction, the observed drop in high LTVs in the 106-cap period would conflate the two policy changes. The authors apply a tiered correction: LTVs at or below 100 percent are left unchanged (the data show no notable change in that range); LTVs between 100 and 110 percent are reduced proportionally to the share of total closing costs attributable to the tax; LTVs at or above 110 percent are reduced by the full magnitude of the tax decline. This yields the &amp;ldquo;tax-adjusted pre-cap distribution&amp;rdquo; with a mean of 93.72 percent, down from 94.46 percent in the unadjusted data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Why does the fraction of constrained households matter so much, and how does it drive non-linearity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The key mechanism is that the average LTV changes when and only when the cap binds for a given borrower. The larger the share of borrowers whose LTV (in the counterfactual uncapped distribution) would exceed the cap, the larger the share of individual LTVs that move in lockstep with any change in the cap, and therefore the larger the aggregate average LTV response and, through the VECM, the house price and GDP response. As the Dutch cap tightened from 105 to 100 percent, the constrained fraction rose from roughly 20 percent to roughly 40 percent, and the annual implied decline in the average LTV grew from 22 basis points to 42 basis points — illustrating monotonically increasing non-linearity within the ex ante approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the survey design address the risk of selection bias relative to alternative data sources such as the American Housing Survey?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The survey, fielded in January 2016 across both the CentERpanel and LISS panel, asks retrospectively about respondents&amp;rsquo; first home purchase, irrespective of whether they still reside there. This avoids the selection bias in the American Housing Survey, where the first-time-buyer flag captures only those still living in the first home — disproportionately selecting homes that are traded less frequently. A single-wave design also avoids the methodological discontinuities that arise from combining multiple survey waves. The resulting series covers 2,238 observations over 1979-2015 (average 60.49 per year).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What does the VECM cointegration evidence suggest about the long-run relationship between LTV, house prices, GDP, and the real mortgage rate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Augmented Dickey-Fuller tests do not reject a unit root in any of the four series in levels, while all four are stationary in first differences (with the borderline case of log relative house price inflation when an intercept is included). Both the Johansen L-Max and Trace tests reject no cointegration at the 1 percent level, and neither test indicates more than one cointegrating vector. The authors therefore estimate a single-cointegrating-vector VECM with one lag (selected by the Schwarz Information Criterion) over 1981-2015. The long-run relation is normalized so that the coefficient on the log relative house price is one.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What do the impulse responses in the baseline VECM specification imply for the long-run macro effects of Dutch LTV policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under the preferred ex post approach, twenty years after the first shock in 2011 the VECM implies that relative house prices are 4.84 percent lower and real GDP is 1.15 percent lower than the no-cap counterfactual. The bulk of the response materializes within ten years, with house prices 4.18 percent lower and GDP 1.05 percent lower at the ten-year horizon. The twenty-year real mortgage rate response is positive but negligibly small. When the ex ante approach is used instead, responses are larger owing to the larger cumulative LTV impulse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the paper conduct the mean-preserving heterogeneity exercise, and what are the key quantitative results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors generate Pearson-family distributions that match the first four moments of the Dutch pre-cap distribution (mean 93.72, standard deviation 17.09, skewness -1.16, kurtosis 5.97 under the convention that a normal has kurtosis 3), truncated to support (0, 200]. Two alternative distributions are constructed with standard deviations 25 percent below (12.97) and 25 percent above (21.61) the Pearson proxy, holding mean, skewness, and kurtosis constant. The same VECM and Cholesky ordering are applied. Twenty-year house price responses are 12.34 percent (high SD), 8.46 percent (Pearson proxy), and 4.79 percent (low SD). Twenty-year GDP responses are 2.93, 2.01, and 1.14 percent respectively. The ratio of high-to-low-SD responses is 2.58 for both variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does asymmetry vary across different initial levels of the LTV cap for the Dutch distribution, and what is the intuition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: At a starting cap of 100 percent, a ten-percentage-point tightening produces a long-run house price response 2.33 times larger (in absolute value) than a ten-percentage-point easing from the same starting point. At 80 percent the asymmetry ratio falls to 1.17, meaning the effects of tightening and easing are nearly symmetric. The intuition is that at 80 percent the cap is binding for the bulk of the distribution, so both tightenings and easings move a similarly large fraction of borrowers and have large, roughly comparable effects. At 100 percent, far fewer borrowers are currently constrained, so an easing from 100 to 110 moves almost no one whereas a tightening from 100 to 90 moves substantially more.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the comparison of the heterogeneous-borrower and homogeneous-borrower cases reveal about the implications for TANK and HANK models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under the homogeneous benchmark — all borrowers at the mean Dutch LTV of 93.72 percent — changes in the cap produce infinite asymmetry at cap levels of 100 and 95 percent (tightening has a full effect, easing has zero effect) but zero asymmetry and zero effect for any cap level above 95 percent. For example, an increase in the cap from 95 to 105 percent has no effect in the homogeneous case but raises house prices by 10.72 percent in the heterogeneous case. In sum, homogeneous-borrower models — including TANK frameworks and linearized models with always-binding constraints such as Iacoviello (2005) — overstate asymmetry in a narrow range around the mean LTV and simultaneously understate the effects of cap changes above the mean LTV. The results are more consistent with heterogeneous-agent frameworks, though the authors note they are not aware of any existing HANK paper that investigates asymmetry and non-linearity specifically in response to changes in the borrowing limit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What do the robustness checks show about sensitivity of results to LTV measurement choices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The results are robust to all alternative Cholesky orderings, to using the real mortgage rate computed as the nominal rate minus current (rather than two-year moving average) inflation, to using the computed LTV without cross-checking, and to using the directly reported LTV after cross-checking. The most notable alternative is the directly reported LTV without cross-checking, which yields a twenty-year house price response of 3.81 percent and a GDP response of 0.72 percent (ex post approach), somewhat lower than the baseline of 4.84 and 1.15 percent but in the same direction. A further robustness check using an LTV series that extrapolates 2011-2015 values from the Loan Level Data yields larger estimates (cumulative twenty-year house price response of 6.65 percent and GDP response of 1.40 percent), reflecting the LLD series&amp;rsquo; more moderate drop in 2014.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the policy implication regarding the importance of distributional information for gauging LTV policy effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The results imply that knowing the mean of the LTV distribution is not sufficient for estimating the effects of cap changes: the variance — and specifically the fraction of borrowers constrained by the cap — is critical. This is analogous in spirit to the finding of Krueger, Mitman, and Perri (2016) that matching the tails of the wealth distribution, and not just the mean, is essential for determining the aggregate consumption effects of shocks. Existing empirical literature that focuses on the first moment of the LTV distribution will therefore systematically mismeasure the macro effects of LTV limits, and the direction of the bias depends on where the cap stands relative to the distribution.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Loan-to-value (LTV) cap / limit:&lt;/strong&gt; The regulatory maximum on the ratio of total mortgage loan amount to the purchase price of the property (excluding buyer-incurred closing costs such as sales taxes and notary fees). In the Netherlands, this was set at 106 percent from August 2011 and reduced annually by one percentage point to 100 percent by January 2018. The paper explicitly distinguishes the cap (the regulatory threshold) from the average LTV (the cross-sectional mean of the distribution, which the cap may or may not bind for all borrowers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Underlying (or pre-cap) LTV distribution:&lt;/strong&gt; The cross-sectional distribution of LTV ratios that would prevail in the absence of any LTV cap — approximated in the paper by the empirical distribution in the twelve months before the cap was introduced (2010M8-2011M7, adjusted for the June 2011 sales-tax cut). The shape, mean, and variance of this distribution determine the fraction of borrowers who are constrained by any given cap level and therefore govern the magnitude and symmetry of policy effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mean-preserving change in heterogeneity:&lt;/strong&gt; A change in the standard deviation of the LTV distribution that holds the mean (and, in the paper&amp;rsquo;s stylized scenarios, also the skewness and kurtosis) constant. The paper uses this construct to isolate the effect of dispersion per se on the macroeconomic consequences of cap changes, showing that a 25 percent increase in the standard deviation relative to the Dutch baseline more than doubles the macro effects relative to a 25 percent decrease.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex ante approach:&lt;/strong&gt; The method of translating cap changes into average LTV changes that uses only the pre-cap distribution, applying successive cap levels to that single distribution. It does not require an LTV cap to have been in place and is therefore applicable for prospective analysis. It does not account for exceptions to the cap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex post approach:&lt;/strong&gt; The method that uses the most recent empirical LTV distribution preceding each cap change as the proxy for the counterfactual uncapped distribution, and that explicitly accounts for the observed share of borrowers above the cap (treated as exceptions). Preferred by the authors when feasible because it incorporates information about how the underlying distribution has evolved for reasons unrelated to the current cap change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymmetry ratio:&lt;/strong&gt; The ratio of the absolute value of the long-run house price (or GDP) response to a ten-percentage-point tightening in the cap to the absolute value of the response to a ten-percentage-point easing from the same initial cap level. A ratio exceeding one indicates that tightenings have larger effects than easings of equal magnitude from the same starting point. In the paper, this ratio is shown to depend critically on where the initial cap sits relative to the underlying distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-linearity in LTV effects:&lt;/strong&gt; The property that changes in the cap from a lower starting point have larger macroeconomic effects than changes from a higher starting point, for a given underlying distribution. This arises because the fraction of constrained borrowers increases as the cap is tightened, so a further tightening moves a larger share of individual LTVs. In the paper, this is documented through the increasing year-on-year effects in Table 1 and the large difference between the house price response to a tightening from 110 to 100 percent (6.12 percent) versus from 100 to 90 percent (14.27 percent).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pearson system (as used in this paper):&lt;/strong&gt; A parametric family of distributions in which every combination of the first four moments (mean, variance, skewness, kurtosis) corresponds to a unique distribution. The authors use it to construct smooth approximations to the empirical Dutch distribution with the same mean, skewness, and kurtosis but varying standard deviations, enabling a controlled comparison of heterogeneity scenarios.&lt;/p&gt;</description></item><item><title>Heterogeneous innovations and growth under imperfect technology spillovers</title><link>https://macropaperwarehouse.com/papers/heterogeneous-innovations-and-growth-under-imperfect-technology-spillovers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/heterogeneous-innovations-and-growth-under-imperfect-technology-spillovers/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Jo and Kim ask two related questions: (1) How do firms use different types of innovation when learning others&amp;rsquo; technology takes time? (2) How does this process alter the aggregate implications of firm innovation, particularly in the context of increasing competition?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper develops a discrete-time infinite-horizon endogenous growth model with multi-product firms pursuing two types of innovation — &amp;ldquo;own-innovation&amp;rdquo; (improving existing product quality) and &amp;ldquo;creative destruction&amp;rdquo; (entering new product markets by displacing incumbents) — subject to a novel friction called &amp;ldquo;imperfect technology spillovers.&amp;rdquo; The friction takes the specific form of lagged learning: creative destruction builds on the one-period-lagged technology of the target market&amp;rsquo;s incumbent, while only the incumbent can observe the current frontier technology level. This one-period lag creates a technology gap (Δ = q_t / q_{t−1}) between the incumbent&amp;rsquo;s frontier and the level available to rivals. Four possible technology gap values arise in equilibrium: Δ₁ = 1 (no gap), Δ₂ = λ (one successful own-innovation), Δ₃ = η (one successful creative destruction), and Δ₄ = η/λ. The step sizes satisfy λ² &amp;gt; η &amp;gt; λ, meaning a single creative destruction improves quality more than a single own-innovation, but two consecutive own-innovations dominate a single creative destruction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Mechanisms.&lt;/strong&gt; The learning friction generates two novel mechanisms. First, the &amp;ldquo;market-protection effect&amp;rdquo;: incumbents with a technology advantage (Δ &amp;gt; 1) intensify own-innovation to widen the gap and protect their product lines when competitive pressure rises. Formally, own-innovation probability is highest for Δ₂ products and declines monotonically (z₂ &amp;gt; z₃ &amp;gt; z₄ &amp;gt; z₁), and ∂z₂/∂x &amp;gt; ∂z₃/∂x &amp;gt; 0 while ∂z₁/∂x &amp;lt; 0, conditional on value coefficients. Second, the &amp;ldquo;technological barrier effect&amp;rdquo;: higher overall own-innovation and creative destruction intensity widens the average technology gap across products, reducing rivals&amp;rsquo; conditional probability of successfully taking over a product market. This is distinct from the standard Schumpeterian effect (lower expected future profits) and from the escape-competition effect in step-by-step models (which apply only to neck-and-neck, single-product firms).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Strategy.&lt;/strong&gt; The empirical analysis combines the USPTO PatentsView database, the Longitudinal Business Database (LBD), the Longitudinal Firm Trade Transactions Database (LFTTD), the Census of Manufactures (CMF), Compustat, and NBER-CES data, covering the universe of U.S. patenting firms from 1976 to 2016, with main analyses from 1982 to 2007. Own-innovation is proxied by the self-citation ratio of patents (the ratio of self-citations to total backward citations); creative destruction by new products added and low-self-citation patents. Exogenous competitive pressure comes from China&amp;rsquo;s WTO accession in 2001, instrumented by the industry-level NTR tariff gap (the gap between non-NTR and NTR rates in 1999) following Pierce and Schott (2016).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Findings.&lt;/strong&gt; Pre-shock (1982–1999): patents with lower self-citation ratios (closer to creative destruction) have significantly longer backward citation gaps (coefficient −2.29 to −2.59, p &amp;lt; 0.01 across specifications), confirming that learning others&amp;rsquo; technology takes more time. Creative-destruction-type patents also have higher market value (Kogan et al. stock return measure) and scientific value (forward citations), with self-citation ratio negatively associated with both (e.g., coefficient on self-citation for market value: −0.289 without firm FE; −0.110 with firm FE, p &amp;lt; 0.01). Conditional on patenting, higher self-citation ratios are negatively associated with employment growth (coefficient −0.256, p &amp;lt; 0.05), number of industries added (−0.158, p &amp;lt; 0.05), and products added (−0.274, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Post-shock (DID): foreign competition had no statistically significant effect on overall patent counts, but firms with above-average innovation intensity in industries with high NTR gaps significantly increased their self-citation ratio — indicating a shift toward own-innovation. The triple-interaction coefficient is 0.795 (p &amp;lt; 0.01) with baseline controls. For a firm with average lagged innovation intensity (0.18) in an industry with an average NTR gap (0.291), this corresponds to a 4.2 percentage point increase in the seven-year growth rate of the self-citation ratio, representing a 15.0% increase relative to the average growth rate of 28.2 percentage points. Consistent with the technological barrier effect, firm entry rates are lower in industries with higher TFPR-skewness-based technological barriers (coefficient −0.012 to −0.016, p &amp;lt; 0.05).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Analysis.&lt;/strong&gt; Calibrated to the U.S. manufacturing sector in 1992, the model matches six target moments including average number of products (2.3), products added (0.3), firm entry rate (7.6%), average productivity growth (1.9%), high-growth-firm employment growth (22.5%), and import penetration (15.3%). Creative destruction contributes approximately 1.88 times more to growth per unit than own-innovation (step size ratio 0.075/0.04). The aggregate R&amp;amp;D-to-sales ratio (untargeted) is 4.6% in the model vs. 4.1% in data.&lt;/p&gt;
&lt;p&gt;A counterfactual increasing outside entrants by 83% (matching the rise in import penetration from 15.3% to 25.1% between 1992 and 2007) generates a 1.51% increase in aggregate creative destruction arrival rate x, but firm-level creative destruction probability falls 1.33% and startup creative destruction also falls 1.33%. The aggregate R&amp;amp;D-to-sales ratio falls 1.6% and creative destruction R&amp;amp;D intensity falls 1.2%. Average domestic productivity growth declines 11.0%, with growth from creative destruction falling 13.0% and growth from domestic startups falling 1.7%. The total mass of domestic firms falls 6.4%.&lt;/p&gt;
&lt;p&gt;In economies with creative destruction costs 80 times higher than the U.S. baseline, the same competitive pressure shock raises rather than lowers total R&amp;amp;D (by 1.0%), but domestic growth still falls 9.7%, because the marginal decline in creative destruction impedes the growth contribution and firm entry even when aggregate innovation spending rises.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the key friction that distinguishes this model from the existing multi-product firm literature (e.g., Klette and Kortum 2004; Akcigit and Kerr 2018)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The key friction is &amp;ldquo;imperfect technology spillovers,&amp;rdquo; modeled as lagged learning: creative destruction can only build on the one-period-lagged technology of the target product (q_{j,t−1}), while the product&amp;rsquo;s current owner observes the frontier technology (q_{j,t}). In models without this friction — such as Akcigit and Kerr (2018) — rivals can instantly learn and copy frontier technology, so firms have no technological advantage and cannot protect their markets. In the current model, own-innovation by the incumbent widens the gap between q_{j,t} and q_{j,t−1}, creating a barrier that a rival must overcome even after successful creative destruction. This makes own-innovation an endogenous function of the technology gap, a feature absent from existing multi-product firm frameworks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the model predict that own-innovation increases with the technology gap up to a point, then decreases?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: From Corollary 1, the ordering z₂ &amp;gt; z₃ &amp;gt; z₄ &amp;gt; z₁ reflects competing forces. Products with gap Δ₂ = λ gain the most from additional own-innovation in terms of reducing the probability of losing the product line (equation 2), so own-innovation is highest there. Products with Δ₃ = η or Δ₄ = η/λ already have substantial technological advantages from prior creative destruction, so the marginal value of own-innovation in reducing market loss probability is lower. Products with Δ₁ = 1 have no advantage at all: if a rival succeeds in creative destruction, the incumbent loses the product regardless of own-innovation (equation 1), so z₁ is lowest. Beyond a certain gap level, the incumbent is sufficiently protected that additional own-innovation has diminishing returns in deterrence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the market-protection effect formally, and for which products is it strongest?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The market-protection effect (Corollary 2) is the positive response of a firm&amp;rsquo;s own-innovation to an increase in the aggregate creative destruction arrival rate x, conditional on the value coefficients A₁ and A₂ being fixed. It is strongest for products with Δ₂ = λ (∂z₂/∂x is the largest and positive), positive but weaker for Δ₃ = η (∂z₃/∂x &amp;gt; 0), of ambiguous sign for Δ₄ = η/λ, and negative for Δ₁ = 1 (∂z₁/∂x &amp;lt; 0). The asymmetry reflects the asymmetric payoff to own-innovation across gap levels: for Δ₂ products, successful own-innovation can turn a losing situation into a winning one because it shifts the technology gap from Δ₁ to Δ₂ from the rival&amp;rsquo;s perspective, effectively defeating the rival&amp;rsquo;s creative destruction attempt. This mechanism provides a micro-foundation for why frontier firms (like Google or NVIDIA) keep innovating intensely despite their technological leads, a pattern the standard step-by-step model cannot explain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the technological barrier effect and how does it differ from the Schumpeterian effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The technological barrier effect refers to the reduction in rivals&amp;rsquo; incentive for creative destruction caused by an increase in the average technology gap across product lines. When incumbents do more own-innovation or when outside firms do more creative destruction, the distribution of technology gaps shifts rightward (density at Δ₁ falls; density at Δ₂, Δ₃, Δ₄ rises). This raises the average technology barrier rivals must overcome to successfully take over a product market, reducing the conditional takeover probability x^{takeover} and the expected value of creative destruction B. In the U.S. counterfactual, the technological barrier effect accounts for 17.0% of the total change in the aggregate creative destruction rate x and 15.0% of the change in startup creative destruction x_e. In contrast, the Schumpeterian effect refers to the reduction in expected future profits from owning a product due to increased displacement risk (through the value coefficient A₂), a mechanism present in standard quality-ladder models. Both operate simultaneously but the technological barrier effect is a novel feature of this framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How is own-innovation vs. creative destruction measured empirically, and what validates this measure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The self-citation ratio (the share of a patent&amp;rsquo;s backward citations that cite the same assignee&amp;rsquo;s earlier patents) is used as the primary measure: a higher ratio indicates greater reliance on the firm&amp;rsquo;s own prior knowledge, hence a higher probability that the innovation improves an existing product line (own-innovation). This is validated empirically in three ways. First, patents with lower self-citation ratios have significantly larger backward citation gaps (coefficient −2.29 to −2.59 across fixed-effect specifications on 728,721 observations), consistent with creative destruction requiring more time to learn others&amp;rsquo; technology. Second, lower self-citation patents have higher market value and scientific value (forward citations), consistent with η &amp;gt; λ (creative destruction contributes more per event to quality). Third, firm-level regressions show that lower self-citation ratios are associated with higher employment growth, more products added, and more industries entered, consistent with creative destruction contributing more to firm expansion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the DID identification strategy work, and what are the main results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The identification exploits the removal of trade policy uncertainty (TPU) after China&amp;rsquo;s WTO accession in 2001. The treatment variable is the industry-level NTR gap (the gap between non-NTR and NTR tariff rates in 1999): industries with larger gaps experienced a larger reduction in uncertainty and thus a greater increase in Chinese import competition. The DID compares patenting firms across periods (1992–1999 vs. 2000–2007) and across high- vs. low-NTR-gap industries, with a triple interaction for firm-level innovation intensity (lagged five-year average patents per employee, normalized within two-digit NAICS). The main finding (Table 4): the NTR gap × Post interaction has no significant effect on overall patent counts (coefficient 0.238 without controls, standard error 0.237), but the triple interaction (NTR gap × Post × innovation intensity) has a positive and significant effect on the growth rate of the self-citation ratio (0.732 without controls, p &amp;lt; 0.05; 0.795 with baseline controls, p &amp;lt; 0.01). This implies that innovation-intensive firms in high-competition industries shifted their composition toward own-innovation, while overall patenting was unchanged — consistent with an offsetting rise in own-innovation and fall in creative destruction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the aggregate growth effects of increasing competitive pressure in the calibrated model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Using an 83% increase in outside entrants (matching the 1992–2007 rise in import penetration from 15.3% to 25.1%), average domestic productivity growth falls 11.0%. Decomposing: growth from domestic own-innovation falls 11.4%, growth from domestic creative destruction falls 13.0%, and growth from domestic startups falls 1.7% (Table 9). The aggregate R&amp;amp;D-to-sales ratio falls 1.6% and the creative destruction R&amp;amp;D intensity falls 1.2%, indicating that the decline in creative destruction R&amp;amp;D outweighs the rise in own-innovation R&amp;amp;D. The total mass of domestic firms falls 6.4% and the average number of products per firm falls 5.5%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do results differ in economies with high creative destruction costs vs. the U.S.?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When creative destruction costs (χ̃) are set 80 times higher than the U.S. baseline, the initial equilibrium has much lower creative destruction: R&amp;amp;D-to-sales ratio is 1.39% (vs. 4.58% in U.S.), creative destruction R&amp;amp;D intensity is 8.6% (vs. 63.9%), average number of products is 1.0 (vs. 2.3), and average domestic productivity growth is 1.4% (vs. 1.9%). Under the same competition shock, total R&amp;amp;D actually rises by 1.0% in this high-CD-cost economy (because own-innovation increases more than creative destruction falls, given the already low baseline of creative destruction), in contrast to the −1.6% in the U.S. However, domestic growth still falls 9.7% even in this economy, driven by reductions in creative destruction by incumbents and startups combined with a decline in the mass of domestic incumbents. This result holds even with a fixed firm mass (Table E5), confirming the mechanism is not solely due to entry/exit dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the technological barrier effect&amp;rsquo;s quantitative contribution to the decline in creative destruction?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the U.S. counterfactual (Table 8 and associated decomposition), 17.0% of the total change in the aggregate creative destruction arrival rate x and 15.0% of the total change in startup creative destruction x_e are attributable specifically to the technological barrier effect — that is, to the shift in the technology gap distribution µ(Δℓ) holding all else equal. The conditional takeover probability x^{takeover} declines from 73.2% to 73.0%. The density at Δ₁ (the easiest gap to overcome) falls 0.4%, while densities at Δ₃ and Δ₄ rise 1.1% and 1.4% respectively, driven by increased creative destruction by outside firms and intensified own-innovation by incumbents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the policy implications the paper draws from its framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper argues that policies evaluating innovation should account for composition, not just aggregate R&amp;amp;D levels or patent counts. Increased overall innovation driven by defensive own-innovation contributes less to economic growth than creative destruction and restricts firm entry — so it is less beneficial than it appears. In low-creativity economies (e.g., European economies with high regulatory barriers to creative destruction), increased foreign competition may raise aggregate R&amp;amp;D while still lowering domestic growth, misleading policymakers who track only total innovation spending. The model also suggests that the mixed empirical findings in the competition-innovation literature (Aghion et al. 2005; Bloom et al. 2016; Autor et al. 2020) can be reconciled by accounting for compositional shifts: the net effect of competition on total innovation is ambiguous because it raises own-innovation for technologically advantaged firms while reducing creative destruction for all firms.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Imperfect Technology Spillovers:&lt;/strong&gt; The novel friction introduced in this paper, modeled as lagged learning: firms attempting creative destruction can only access the one-period-lagged technology of the target product market (q_{j,t−1}), while the incumbent product owner observes and can improve from the current frontier (q_{j,t}). This asymmetry creates a persistent technological advantage for incumbents and enables strategic defensive innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Own-Innovation:&lt;/strong&gt; R&amp;amp;D investment by a firm to improve the quality of its existing product lines. Successful own-innovation raises product quality by a step size λ &amp;gt; 1. Own-innovation does not require learning others&amp;rsquo; technology and, in the model, constitutes the incumbents&amp;rsquo; defensive margin against creative destruction. At the aggregate level, it contributes more to total growth than creative destruction because it succeeds more frequently, but per successful event it contributes less (λ &amp;lt; η).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Creative Destruction:&lt;/strong&gt; R&amp;amp;D investment enabling a firm to enter a new product market by displacing the incumbent. Successful creative destruction improves the lagged quality of the target product by a step size η &amp;gt; λ, where λ² &amp;gt; η &amp;gt; λ. It requires learning the incumbent&amp;rsquo;s one-period-lagged technology, takes longer to develop (evidenced empirically by longer backward citation gaps), and contributes more to firm growth and product expansion per event than own-innovation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technology Gap (Δ):&lt;/strong&gt; The ratio of a product&amp;rsquo;s current-period technology to its previous-period technology (Δ_{j,t} = q_{j,t}/q_{j,t−1}). This gap summarizes the technological advantage the incumbent holds in a product market under imperfect spillovers. Four values are possible in equilibrium: Δ₁ = 1, Δ₂ = λ, Δ₃ = η, Δ₄ = η/λ. The gap determines both the incumbent&amp;rsquo;s own-innovation incentive and the rival&amp;rsquo;s probability of successfully completing a product takeover conditional on creative destruction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Market-Protection Effect:&lt;/strong&gt; The mechanism by which incumbents with a technological advantage (Δ &amp;gt; 1) increase own-innovation in response to heightened competitive pressure (an increase in the aggregate creative destruction arrival rate x). This effect is maximized for products with Δ₂ = λ and positive but diminishing for Δ₃. It is absent for Δ₁ = 1 products (where own-innovation cannot prevent displacement) and is formally distinct from the escape-competition effect in step-by-step innovation models, which applies only to neck-and-neck single-product firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technological Barrier Effect:&lt;/strong&gt; The reduction in rivals&amp;rsquo; incentive for creative destruction caused by an increase in the average technology gap across the economy&amp;rsquo;s product lines. When incumbents intensify own-innovation and/or when outside creative destruction increases, the distribution of technology gaps shifts toward higher Δ values, reducing the conditional probability that a rival successfully takes over any given product market. This feedback mechanism endogenously suppresses creative destruction and firm entry beyond what the Schumpeterian effect alone would predict.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Self-Citation Ratio:&lt;/strong&gt; The share of a patent&amp;rsquo;s backward citations that cite patents previously owned by the same firm. Used in the paper as a continuous proxy for the likelihood that a patent represents own-innovation vs. creative destruction: a ratio of 1 (100% self-citations) implies 100% probability of own-innovation; a ratio of 0 implies 100% probability of creative destruction. This measure follows Akcigit and Kerr (2018) and is validated in the paper against learning time, quality, and firm growth outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;NTR Gap (Trade Policy Uncertainty Shock):&lt;/strong&gt; The industry-level difference between non-NTR (column 2) and NTR (column 1) U.S. tariff rates in 1999, used as an instrument for the exogenous increase in Chinese competitive pressure following China&amp;rsquo;s WTO accession and the U.S. granting of Permanent Normal Trade Relations (PNTR) in 2002. Industries with larger NTR gaps experienced a greater reduction in trade policy uncertainty and thus a larger increase in competitive pressure from foreign firms.&lt;/p&gt;</description></item><item><title>Homeownership, Polarization, and Inequality</title><link>https://macropaperwarehouse.com/papers/homeownership-polarization-and-inequality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/homeownership-polarization-and-inequality/</guid><description>&lt;p&gt;This paper asks why job polarization and income inequality are higher in large U.S. cities, and proposes a novel housing-market mechanism that operates independently of — but interacts with — the skill-biased technical change (SBTC) explanations dominant in the existing literature.&lt;/p&gt;
&lt;p&gt;The core argument is that large cities have experienced faster growth in house prices relative to both wages (price-wage ratio) and rents (price-rent ratio) since 1980. This excess price growth has priced middle-income households out of homeownership in expensive cities. Because low-income households cannot afford to own anywhere and high-income households can afford to own everywhere, it is specifically middle-income (middle-skilled) households whose location choice becomes entangled with their tenure choice. These households increasingly sort toward smaller, more affordable cities where they can purchase a home. This selective out-migration hollows out the middle of the income distribution in large cities, producing greater employment polarization and income inequality there.&lt;/p&gt;
&lt;p&gt;Empirically, the paper uses Census and ACS data from 1980 to 2019 covering 465 commuting zones (CZs). Polarization is measured following Autor and Dorn (2013) by assigning 3-digit occupations to income percentiles fixed at 1980 levels; inequality is measured by the Gini coefficient and variance of log annual wages. Housing costs are captured by hedonic price and rent indices and three derived ratios. OLS and IV results (instrumented using the interaction of land unavailability and long-run changes in real interest rates) show that doubling of prices is associated with a 1 percentage point decline in the middle-skilled employment share; doubling of the price-rent ratio is associated with an 11.3 percentage point decline; doubling of the price-wage ratio with a 5.3 percentage point decline. Inequality follows the same pattern: doubling prices raises 100x the variance of log wages by 2.3 points; doubling the price-rent ratio raises it by 11.7 points; doubling the price-wage ratio by 7.7 points.&lt;/p&gt;
&lt;p&gt;The migration mechanism is documented using 2001–2019 CPS ASEC data, which — uniquely among available sources — reports reasons for moving. A doubling of the price index, price-wage ratio, or price-rent ratio in the origin state relative to the destination raises the probability that a middle-income (2nd–4th quintile) household moves for housing-related reasons by approximately 5–10 percentage points in absolute terms, implying a 50–80% relative increase compared with low- or high-income households making a housing-related move.&lt;/p&gt;
&lt;p&gt;The theoretical framework extends the standard spatial equilibrium (Rosen-Roback) model with two additions: skill heterogeneity and housing tenure choice. Households face a minimum house size constraint and a payment-to-income (PTI) constraint (calibrated at lambda = 0.308). These constraints create distinct skill thresholds for homeownership that vary by city; the interaction between location and tenure choices applies only to middle-skilled households who can afford ownership in cheap but not expensive cities.&lt;/p&gt;
&lt;p&gt;In the quantitative model, calibrated separately for 1980 and 2019 with two locations (top 30 CZs vs. the rest), counterfactual experiments show that holding price-wage ratios at their 1980 levels reduces the excess polarization gap between large and small CZs by 93% and the excess inequality gap by 40%. Holding price-rent ratios constant reduces the polarization gap by 96% and the inequality gap by 27%. By contrast, shutting down SBTC entirely reduces the polarization gap by only 54% and the inequality gap by 73%. These results establish that while SBTC is an important driver, its effect on polarization and inequality is substantially amplified by faster house price growth in large cities; without the housing affordability channel, the effect of SBTC on disproportionate polarization would be 63–81% smaller and on the inequality gap 18–36% smaller.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks why job polarization and income inequality are systematically higher in large U.S. cities than in small ones. Prior literature attributed this to skill-biased technical change, external labor demand shocks, or IT-driven displacement of routine jobs; this paper proposes a complementary, housing-market-based explanation that does not rely on features of the production technology.&lt;/p&gt;
&lt;p&gt;Q: What is the core mechanism linking house prices to polarization?
A: When price-wage and price-rent ratios are higher in large cities, middle-income households face binding minimum-size and payment-to-income constraints that prevent them from owning a home there but not in cheaper cities. Because homeownership carries financial advantages, these households sort toward smaller, more affordable cities. Low-income households cannot afford ownership anywhere and high-income households can afford it anywhere, so only the middle group&amp;rsquo;s location choice is distorted by tenure considerations. This selective out-migration hollows out the middle of the income distribution in expensive large cities.&lt;/p&gt;
&lt;p&gt;Q: What empirical patterns in CZ-level data motivate the paper?
A: Doubling CZ size is associated with a 1.9 percentage point greater fall in the middle-skilled employment share and a 2.7 point higher growth in 100x the variance of log wages from 1980 to 2019. Larger CZs also experienced 3.4% higher price growth, 3.1% higher price-wage ratio growth, and a 10% greater increase in price-rent ratios. These associations persist after controlling for initial CZ size and other characteristics.&lt;/p&gt;
&lt;p&gt;Q: What do the OLS and IV results show about house prices and polarization?
A: A doubling of house prices is associated with a 1 percentage point decline in the middle-skilled share; a doubling of the price-rent ratio with an 11.3 percentage point decline; and a doubling of the price-wage ratio with a 5.3 percentage point decline. IV results using the interaction of land unavailability and the change in real interest rates as an instrument confirm the negative relationship remains statistically significant, suggesting a causal interpretation is plausible.&lt;/p&gt;
&lt;p&gt;Q: What do the OLS and IV results show about house prices and income inequality?
A: A doubling of prices is associated with a 2.3 point increase in 100x the variance of log wages; a doubling of the price-rent ratio with an 11.7 point increase; and a doubling of the price-wage ratio with a 7.7 point increase. IV results suggest a causal relationship between price growth and income inequality at the CZ level.&lt;/p&gt;
&lt;p&gt;Q: What evidence does the paper provide for the migration mechanism?
A: Using 2001–2019 CPS ASEC data (which reports stated reasons for moving, unlike the ACS), the paper estimates logit regressions of interstate migration for housing-related reasons. A doubling of the price index in the origin state relative to the destination raises the probability of a housing-related move for middle-income (2nd–4th quintile) households by 5–6 percentage points; a doubling of the price-wage ratio raises it by 6–7 percentage points; and a doubling of the price-rent ratio raises it by 7–10 percentage points. These effects imply a 50–80% relative increase in housing-related migration probability for the middle quintiles compared with the bottom or top quintile. Housing-related movers constitute over 12% of all interstate migrants in the sample.&lt;/p&gt;
&lt;p&gt;Q: What is the key finding about homeownership rates?
A: There is no statistically significant relationship between the change in homeownership rates and the growth in prices, price-rent, or price-wage ratios from 1980 to 2019. This is consistent with the model&amp;rsquo;s mechanism, in which middle-income households who cannot afford ownership in large cities move away rather than simply switching to renting there — so aggregate local ownership rates need not fall.&lt;/p&gt;
&lt;p&gt;Q: How does the theoretical model generate the polarization result?
A: The model extends the Rosen-Roback spatial equilibrium framework with skill heterogeneity and housing tenure choice. Two skill thresholds — one for minimum-size-constrained ownership and one for unconstrained ownership — interact with the price-wage and price-rent ratios of each city. Proposition 1 proves that a city with higher price-wage and price-rent ratios will have a lower middle-skilled share, because middle-skilled workers (those who can afford to own in cheap but not expensive cities) are drawn to cheaper locations. Proposition 2 shows that in a world with only renters or only owners, skill shares would be identical across cities regardless of price differences — the polarization result requires heterogeneity in tenure choice.&lt;/p&gt;
&lt;p&gt;Q: What does the no-SBTC counterfactual show?
A: Holding the parameters governing local returns to skills at their 1980 levels (shutting down skill-biased technical change) reduces the difference in the decline in the middle-skilled share between large and small CZs by 54% and the gap in the increase in the variance of log wages by 73%. This is broadly consistent with prior literature attributing the bulk of disproportionate polarization and inequality in big cities to SBTC.&lt;/p&gt;
&lt;p&gt;Q: What do the constant price-ratio counterfactuals show?
A: When price-wage ratios are held at 1980 levels (but SBTC is allowed to operate), the excess polarization gap between large and small CZs falls by 93% and the excess inequality gap by 40%. When price-rent ratios are held at 1980 levels, the polarization gap falls by 96% and the inequality gap by 27%. When both are held constant simultaneously, the polarization gap falls by 89% and the inequality gap by 27%. These results show that the effect of SBTC on polarization would be 63–81% smaller in the absence of the housing affordability amplification channel.&lt;/p&gt;
&lt;p&gt;Q: Who are the largest losers from rising price-wage ratios in large cities?
A: The counterfactual welfare analysis identifies middle-skilled workers with skill levels between approximately 0.29 and 0.80 as the primary losers. In the counterfactual with fixed price-wage ratios, workers with skills from 0.29 to 0.57 who previously could not afford ownership in large cities are now able to own there, and those with skills from 0.57 to 0.80 spend a smaller share of income on housing. This group either lost homeownership opportunities or was induced to move to less productive CZs by the actual price growth that occurred.&lt;/p&gt;
&lt;p&gt;Q: How is the quantitative model calibrated and structured?
A: The model is calibrated separately for 1980 and 2019 as two stationary spatial equilibria. It features two locations (the top 30 CZs, which account for 49.3% of employment, and the remaining CZs). Key parameters include a Frechet elasticity of 6.1, an agglomeration externality of 0.04, a PTI constraint of 0.308, and an annual discount factor of 0.96. Land shares differ between large and small CZs (0.3965 vs. 0.2239). The model finds that the price-rent ratio was relatively stable in large cities but fell in small ones, while the price-wage ratio increased much more in large CZs — both indicators point to purchasing a home becoming relatively more expensive in large CZs.&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s policy implications?
A: Zoning reforms and other policies that increase housing supply in large, unaffordable cities could produce a more efficient spatial allocation of labor, greater aggregate productivity, and more economically diverse — less polarized and less unequal — cities, while also reducing the wealth gap between owners and renters. Policies that promote homeownership by reducing the cost of owning without raising housing supply may reduce local polarization and inequality but could lower aggregate output and do not necessarily increase homeownership rates.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to existing explanations for city-level polarization?
A: The paper&amp;rsquo;s housing-market mechanism is explicitly complementary to SBTC-based explanations (Baum-Snow, Freedman, and Pavan, 2018; Cerina et al., 2023), external demand shock explanations (Davis, Mengus, and Michalski, 2020), and IT-displacement explanations (Eeckhout, Hedtrich, and Pinheiro, 2024). The paper&amp;rsquo;s key added contribution is that even if SBTC were the primary driver of disproportionate polarization, its measured effect would be substantially smaller in the absence of faster house price growth in large cities — the housing market amplifies rather than replaces the technology channel.&lt;/p&gt;
&lt;p&gt;Job polarization (city-level): The hollowing out of middle-income employment shares in a commuting zone, measured as the change in the share of workers in occupations assigned to the 21st–80th income percentile (using the 1980 occupation-to-percentile mapping fixed over time). In this paper, polarization is greater in cities where price-wage and price-rent ratios grew faster, attributed to selective out-migration of middle-skilled households.&lt;/p&gt;
&lt;p&gt;Price-wage ratio: The ratio of hedonic house prices to median annual wages in a commuting zone, constructed from Census and ACS data. A higher price-wage ratio tightens the payment-to-income constraint on potential homebuyers and is the primary driver of the skill threshold for homeownership in the model.&lt;/p&gt;
&lt;p&gt;Price-rent ratio: The ratio of hedonic house prices to rents in a commuting zone. In the model, a higher price-rent ratio reduces the financial advantage of owning over renting, raising the skill threshold at which ownership becomes optimal. The paper treats price-rent and price-wage ratios as distinct channels that both independently amplify polarization.&lt;/p&gt;
&lt;p&gt;Housing tenure choice: The household decision to own or rent, modeled as a discrete choice made at the start of life that interacts with location choice. Ownership requires satisfying both a minimum house size constraint and a payment-to-income (PTI) constraint (lambda = 0.308). The interaction between tenure and location choices is the paper&amp;rsquo;s key model innovation; it exists only for middle-skilled workers whose income is sufficient for ownership in cheap but not expensive cities.&lt;/p&gt;
&lt;p&gt;Skill threshold for homeownership (s*_i): The minimum skill level at which a worker in city i chooses to own rather than rent, defined by Lemma 2. This threshold is decreasing in local labor productivity and increasing in price-wage and price-rent ratios. Workers with skill below s*_i in all cities always rent; those with skill above s*_i in all cities always own; those in between face city-dependent tenure choice that distorts their location decision.&lt;/p&gt;
&lt;p&gt;Skill-biased technical change (SBTC): In the paper&amp;rsquo;s quantitative model, SBTC is represented by faster growth in the skill dispersion parameter (alpha_it) in large CZs, reflecting differential productivity growth concentrated at the top of the skill distribution. The paper finds SBTC accounts for 54% of the polarization gap and 73% of the inequality gap in its counterfactual, but argues its effect is amplified 4–5x by the housing affordability channel.&lt;/p&gt;
&lt;p&gt;Payment-to-income (PTI) constraint: The constraint that a homebuyer cannot spend more than a fraction lambda (calibrated at 0.308) of annual labor earnings on the annual housing payment (user cost times price times quantity). This constraint, together with the minimum house size, determines the income threshold for ownership and makes location and tenure choices interdependent for middle-skilled workers.&lt;/p&gt;</description></item><item><title>How Bad Are Weather Disasters for Banks?</title><link>https://macropaperwarehouse.com/papers/how-bad-are-weather-disasters-for-banks/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-bad-are-weather-disasters-for-banks/</guid><description>&lt;p&gt;Using FEMA disaster declarations matched to SHELDUS property-damage estimates and Call Report data for 1995–2018, this paper finds that weather disasters — even at their most severe — have had modest effects on U.S. bank safety over the last quarter century. For single-county banks exposed to 95th-percentile disasters, Z-scores decline by roughly 9 percent at a five-year horizon under the panel estimates; reaching failure thresholds from sample mean Z-score levels would require a disaster approximately 6.7 standard deviations more destructive than a 95th-percentile event. Federal disaster aid does not appear to be the primary driver of this resilience, since banks exposed to weather events without FEMA declarations exhibit similar stability. Instead, the paper points to a loan demand channel — multi-county bank lending increases roughly 0.25 percentage points per standard deviation of damage at five years without an accompanying interest-rate increase — and to local banks&amp;rsquo; apparent avoidance of mortgage lending in flood-prone areas beyond what official flood maps predict, consistent with local information about true flood risk limiting exposure before disasters strike.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-severe-are-weather-disaster-effects-on-bank-safety"&gt;Q1. How severe are weather disaster effects on bank safety?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper finds that weather disasters at any severity level produce small and often statistically insignificant effects on the key bank safety measures — charge-offs, capital ratios, return-on-assets volatility, and Z-scores — at single-county banks, with the largest measured effect being roughly a 9 percent decline in Z-scores at the 95th percentile of disaster damage at a five-year horizon.&lt;/strong&gt; The regression framework uses bank and state-year fixed effects, with SHELDUS damage as the continuous severity measure and FEMA disaster declarations as a binary indicator. For multi-county banks, charge-offs increase by roughly 10 percent at five years, but net income also rises, suggesting disaster-area loan demand partially offsets credit losses. The paper&amp;rsquo;s calculation is that pushing a typical bank from its mean Z-score of 135.9 to the failure threshold would require a Z-score decline of 127.9 — far exceeding the estimated −9 percent impact of a 95th-percentile disaster, which would need to be approximately 6.7 standard deviations more destructive to close that gap.&lt;/p&gt;
&lt;h3 id="q2-is-bank-resilience-an-artifact-of-federal-disaster-aid"&gt;Q2. Is bank resilience an artifact of federal disaster aid?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper presents evidence that federal disaster aid is not the primary source of bank resilience, since banks exposed to weather events that did not receive FEMA disaster declarations exhibit similarly modest effects on bank safety measures.&lt;/strong&gt; The test is designed to separate the insurance mechanism (FEMA aid replacing household income and debt service capacity) from intrinsic bank resilience. The fact that non-FEMA disasters produce comparable stability redirects attention to the demand-side and local-knowledge channels as the more fundamental explanations for the resilience finding.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-loan-demand-channel-and-how-large-is-it"&gt;Q3. What is the loan demand channel and how large is it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Multi-county banks experience an increase in lending of roughly 0.25 percentage points per standard deviation of SHELDUS damage at a five-year horizon, and the authors find no accompanying increase in loan interest rates, which is consistent with a demand-side shift rather than a tightening of lending standards.&lt;/strong&gt; The demand interpretation is that disasters create a wave of borrowing demand as households and firms repair or replace damaged assets, and the increased loan volume helps offset the increase in charge-offs. The pattern is found at multi-county banks — which can serve affected and unaffected areas simultaneously — but not at single-county banks, consistent with lending capacity mattering for capturing the demand increase.&lt;/p&gt;
&lt;h3 id="q4-what-does-local-knowledge-mean-in-this-context"&gt;Q4. What does &amp;ldquo;local knowledge&amp;rdquo; mean in this context?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Local banks originate approximately 6.4 percent fewer log mortgage dollars per application in FEMA flood zones than would be predicted by the official flood map classifications alone, with the gap widening to 7–8 percent in areas that have experienced more than five FEMA flood declarations compared to areas with fewer than three, which is consistent with local lenders holding information about true flood risk not captured in official maps.&lt;/strong&gt; The finding is consistent with local banks having access to community-level information — observed flooding history, property-level characteristics, local drainage and elevation — that is not incorporated into official FEMA flood zone classifications. This pre-disaster selectivity limits mortgage accumulation in the highest-risk areas before disasters occur.&lt;/p&gt;
&lt;h3 id="q5-what-are-the-implications-for-climate-risk-assessment"&gt;Q5. What are the implications for climate risk assessment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper explicitly frames the historical resilience documented for 1995–2018 as informing rather than settling assessments of physical risk to banks from future climate change, since more frequent or more severe disasters could overwhelm the demand-offset and local-knowledge mechanisms that the paper identifies as sustaining bank performance.&lt;/strong&gt; The key qualification is temporal scope: the demand-side recovery effect requires that affected areas have the income and economic capacity to service new loans, and the local-knowledge effect requires that banks have experienced enough repeated flooding to develop accurate private flood risk assessments. Both conditions could become less reliable as climate change alters the frequency, geography, and severity of weather events relative to the historical distribution.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Z-score&lt;/strong&gt; : a bank-level distance-to-insolvency measure equal to (return on assets + capital ratio) divided by return-on-assets volatility; higher values indicate greater distance from failure; used here as the primary measure of disaster impact on bank safety.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;SHELDUS&lt;/strong&gt; : the Spatial Hazard Events and Losses Database for the United States, providing county-level property damage estimates for weather events; used in this paper as the continuous measure of disaster severity in panel regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;single-county bank&lt;/strong&gt; : a bank whose entire depositor base is drawn from one county, making it fully exposed to local disaster effects with no geographic diversification across other counties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;loan demand channel&lt;/strong&gt; : the mechanism by which disasters increase demand for credit from households and firms repairing or replacing damaged assets, generating new loan volume that partially offsets credit losses at banks serving affected areas.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;local knowledge&lt;/strong&gt; : the paper&amp;rsquo;s label for the informational advantage that local banks appear to have about true flood risk beyond what official FEMA flood zone classifications capture, inferred from lower mortgage originations in areas with a history of repeated flooding.&lt;/p&gt;</description></item><item><title>How Do Rising U.S. Interest Rates Affect Emerging and Developing Economies? It Depends</title><link>https://macropaperwarehouse.com/papers/how-do-rising-u.s.-interest-rates-affect-emerging-and-developing-economies-it-depends/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-do-rising-u.s.-interest-rates-affect-emerging-and-developing-economies-it-depends/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper examines how the effects of rising U.S. interest rates on emerging market and developing economies (EMDEs) depend on the underlying source of the interest rate increase. Specifically, it asks: what mix of inflation, reaction, and real shocks has driven changes in U.S. interest rates in recent years; how do these different shock types affect EMDE financial markets, capital flows, borrowing costs, and fiscal outcomes; and how do they affect the likelihood of EMDE financial crises?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation and Context&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Written in late 2022 against the backdrop of the Federal Reserve&amp;rsquo;s most aggressive tightening cycle since the 1990s, the paper argues that the standard practice of treating all interest rate increases as equivalent is misleading. Whether rising U.S. rates reflect strengthening growth, rising inflation expectations, or a perceived hawkish shift in the Fed&amp;rsquo;s reaction function carries very different implications for EMDEs already burdened by post-COVID debt at record highs and scarring from the pandemic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three distinct empirical approaches are used, chosen to match the data frequency and parsimony requirements of each research question.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;A sign-restricted Bayesian VAR model with stochastic volatility is estimated on monthly U.S. data (January 1982 - September 2022) using four variables: 2-year Treasury yield, 10-year Treasury yield, S&amp;amp;P 500 index, and 5-year breakeven inflation expectations. Sign restrictions identify three shocks: (i) &lt;em&gt;real shocks&lt;/em&gt; raise both yields, equity prices, and inflation expectations; (ii) &lt;em&gt;inflation shocks&lt;/em&gt; raise yields and inflation expectations but lower equity prices; (iii) &lt;em&gt;reaction shocks&lt;/em&gt; raise yields but lower both equity prices and inflation expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Panel local projection models (Jorda 2005) are estimated at quarterly frequency for 17-38 EMDEs over 1997Q2-2019Q4, excluding the 2008Q4-2009Q4 global financial crisis and the COVID-19 pandemic. The models link the VAR-identified quarterly shock series (normalized to represent a 25-basis-point move in the 2-year yield) to EMDE financial, real, and fiscal variables, including local-currency bond yields, EMBI+ sovereign spreads, capital flows, real GDP components, CPI inflation, the real effective exchange rate, primary fiscal balance, government revenues, expenditures, gross debt, and debt composition.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;A panel logit model with random effects is estimated on annual data for 139 EMDEs over 1985-2018, linking the three shock types to the probability of banking, currency, and sovereign debt crises (as defined by Laeven and Valencia 2020).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Key Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Shock decomposition&lt;/em&gt;: Real shocks account for the largest share of variance in 2-year U.S. yields over the full sample (39 percent at a 10-month horizon); inflation shocks explain 14 percent and reaction shocks 13 percent. However, since the start of 2022, reaction and inflation shocks together account for approximately three-quarters of the cumulative increase in yields, with real shocks playing a negligible role.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Financial market and macroeconomic spillovers&lt;/em&gt;: Conditional on a 25-basis-point shock, reaction shocks produce significantly adverse EMDE outcomes: widening sovereign spreads (EMBI+), declining capital flows, real exchange rate depreciation, and unlike inflation shocks, statistically significant declines in private consumption and fixed investment. Inflation shocks raise domestic EMDE CPI significantly. By contrast, real shocks are associated with declining sovereign spreads, rising capital flows, real exchange rate appreciation, and higher real exports, with other real GDP components unaffected.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Fiscal outcomes&lt;/em&gt;: In response to inflation and especially reaction shocks, EMDE governments improve their primary balances almost exclusively through expenditure cuts, consistent with tighter credit availability constraining fiscal space. Real shocks also improve primary balances, but through both revenue gains and expenditure reductions. Government debt declines in response to all three shock types, though the decline is statistically significant only for real shocks.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Debt composition&lt;/em&gt;: Reaction shocks shift debt composition toward shorter maturities and foreign-currency instruments (the latter reflecting exchange rate depreciation mechanically raising the local-currency value of foreign-currency debt). Real shocks shift composition toward longer maturities and higher external creditor participation, consistent with improved market access.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Heterogeneity by credit rating&lt;/em&gt;: Investment-grade and noninvestment-grade EMDEs show broadly similar responses to reaction shocks, with the exception of statistically larger yield responses for noninvestment-grade economies. The paper notes this finding contrasts with several prior studies that find stronger fundamentals buffer spillovers.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Crisis probabilities&lt;/em&gt;: A 25-basis-point increase in 2-year U.S. yields driven by a reaction shock almost doubles the baseline probability of financial crisis in the average EMDE, from 3.5 percent to 6.6 percent. Extrapolating the nonlinear logit relationship to the 114-basis-point reaction-shock-driven increase in 2-year yields that occurred from January through September 2022 implies the probability of financial crisis in the average EMDE rising approximately 36 percentage points, to nearly 40 percent. The paper cautions that no comparable yield episode occurred in the 1985-2018 estimation sample, so this extrapolation carries substantial uncertainty. Inflation shocks are associated with only small, statistically insignificant changes in crisis probability; real shocks reduce the probability of sovereign debt crisis while raising currency crisis probability by less than reaction shocks do.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Historical episode analysis&lt;/em&gt;: The 2013 taper tantrum was dominated by reaction shocks, causing 10-year yields to rise by approximately 100 basis points; sovereign spreads widened by 60 basis points in the May-June 2013 window and capital flows dropped sharply. The 2022 tightening episode was driven by reaction and inflation shocks (reaction shocks adding 114 basis points to 2-year yields through September 2022), with five-year breakeven inflation expectations breaching 3 percent for the first time in the two-decade history of the series. The 2004-2006 build-up to the global financial crisis involved a mix of all three shock types with real shocks prominent, and EMDE financial conditions remained broadly benign.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How are the three shock types identified, and what makes this identification strategy credible?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The identification uses sign restrictions imposed on a Bayesian VAR with stochastic volatility. A real shock is identified as one that simultaneously raises 2-year yields, 10-year yields, S&amp;amp;P 500 equity prices, and inflation expectations. An inflation shock raises all yields and inflation expectations but lowers equity prices the equity decline signals that higher rates are not accompanied by stronger growth prospects. A reaction shock raises all yields but lowers both equity prices and inflation expectations the fall in inflation expectations distinguishes it from an inflation shock and signals that markets perceive the Fed is tightening beyond what current inflation warrants. Covering both short- and long-maturity yields in the sign restrictions ensures the identified shocks capture both conventional and unconventional (e.g., quantitative easing tapering) policy moves.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What share of 2-year yield variation do the three shocks each explain over the full sample?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At a 10-month horizon, real shocks explain 39 percent of the forecast error variance in 2-year U.S. Treasury yields, making them the dominant driver over the full sample (January 1982 - September 2022). Inflation shocks account for 14 percent and reaction shocks for 13 percent. Together the three identified shocks explain roughly two-thirds of total yield variation; the remaining one-third reflects residual or unclassified movements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How did the composition of shocks driving 2-year yields change from 2021 into 2022?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Starting in September 2021, as inflation mounted and the Fed pivoted toward aggressive tightening, reaction and inflation shocks became the dominant drivers of 2-year yield increases. By September 2022, reaction and inflation shocks together accounted for approximately three-quarters of the cumulative increase in yields from the beginning of 2022, with reaction shocks alone contributing 114 basis points to the 2-year yield.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the financial market effects of a 25-basis-point reaction shock on EMDEs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Reaction shocks produce significant adverse effects on EMDE financial markets within one quarter: 10-year local-currency government bond yields rise significantly, EMBI+ sovereign spreads widen significantly, capital flows decline significantly, and the real effective exchange rate depreciates significantly. Short-term (3-month) yields and equity prices also deteriorate, but these movements are not statistically significant at conventional levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How do financial market effects of inflation shocks compare to reaction shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Inflation shocks generate adverse directional effects similar to reaction shocks rising 10-year yields, declining capital flows, real exchange rate depreciation, and falling equity prices but with the notable difference that, except for equity prices, these effects are generally not statistically significant. The paper thus finds that reaction shocks are more potent drivers of EMDE financial market tightening than inflation shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do real shocks affect EMDE financial conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Real shocks produce outcomes broadly opposite to those from inflation and reaction shocks. They are associated with significant declines in EMBI+ sovereign spreads, significant increases in capital flows, significant real effective exchange rate appreciation, and significant increases in equity prices. Ten-year government bond yields do rise consistent with global bond market integration but this occurs alongside improving risk sentiment, not financial stress.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the macroeconomic (real activity) effects of the three shock types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Reaction shocks produce a statistically significant decline in real GDP components, particularly in private consumption expenditure and gross fixed capital formation (fixed investment), within one quarter. Real shocks lead to higher real exports consistent with beneficial demand spillovers from stronger U.S. activity while leaving other GDP components unchanged. Inflation shocks induce a large and statistically significant increase in domestic EMDE CPI inflation, while real shocks reduce it; neither produces significant real GDP effects beyond the export channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do EMDE fiscal balances respond differently to the three shock types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both inflation and especially reaction shocks are followed by an improvement in the EMDE primary balance (smaller deficit or larger surplus), achieved almost exclusively through declines in government expenditure. The paper attributes this to tighter credit availability and higher borrowing costs constraining fiscal space. Real shocks also improve primary balances, but the mechanism differs: both revenue increases and expenditure decreases contribute to the improvement. Declines in gross government debt occur in response to all three shocks but are statistically significant only for real shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the composition of government debt shift in response to the different shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following inflation and reaction shocks, debt held by external creditors declines significantly as a share of total government debt, consistent with reduced access to global credit markets. Short-term debt eventually rises following both shock types. Foreign-currency debt rises considerably following reaction shocks likely reflecting the mechanical effect of currency depreciation boosting the local-currency value of pre-existing foreign-currency obligations. Conversely, following real shocks, external creditor participation rises significantly (improved market access), foreign-currency debt shares remain broadly stable, and short-term debt declines significantly (consistent with maturity extension by fiscal authorities seeking to minimize rollover risk under favourable conditions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Do investment-grade and noninvestment-grade EMDEs respond differently to reaction shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper finds little evidence of important differences between investment-grade and noninvestment-grade EMDEs in their responses to reaction shocks across most variables. Noninvestment-grade economies do show statistically larger increases in 10-year bond yields, and larger increases in EMBI+ spreads and 3-month yields than investment-grade economies though the latter two differences are not statistically distinguishable. For fiscal, GDP, and capital flow outcomes, the two groups respond similarly. The paper notes this finding is inconsistent with several prior studies but consistent with others, concluding the role of fundamentals remains unresolved.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the probability of financial crisis in EMDEs respond to the three shock types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the baseline (explanatory variables at sample means), the average EMDE faces a 3.5 percent probability of experiencing any type of financial crisis in a given year, with currency and banking crises the most common and sovereign debt crisis the least. Reaction shocks drive by far the largest increase: a 25-basis-point increase in 2-year yields from a reaction shock almost doubles the crisis probability to 6.6 percent. Inflation shocks produce small and statistically insignificant effects. Real shocks reduce the probability of sovereign debt crisis (consistent with their benign effects on financial markets) while raising currency crisis probability by less than reaction shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What does the nonlinear logit relationship imply for the 2022 tightening cycle specifically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because the logit function is nonlinear, a doubling of the shock size leads to a more-than-proportional increase in crisis probability. Applying the estimated model to the 114-basis-point reaction-shock contribution to 2-year yields from January to September 2022, the model implies that the probability of financial crisis in the average EMDE increased by approximately 36 percentage points, to nearly 40 percent. The paper emphasizes this estimate carries wide uncertainty because no comparable yield increase occurred during the 1985-2018 estimation period, placing this extrapolation well outside the sample&amp;rsquo;s support.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What crisis dynamics were already materializing in 2022 consistent with the model predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By the time of writing (late 2022), seven EMDEs had experienced currency depreciations of at least 30 percent against the U.S. dollar meeting the Laeven and Valencia (2020) threshold for a currency crisis and 21 EMDEs had reached agreements with the IMF for additional financing. The paper notes these developments had occurred despite standard macroeconomic factors (interest rate differentials and flight-to-safety flows) not fully explaining the magnitude of depreciations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What robustness tests were conducted, and did they alter the main conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The VAR decomposition was re-estimated using weekly rather than monthly data. The three-shock model was simplified to two shocks (real versus monetary, combining inflation and reaction). The VAR was extended to include real GDP and PCE inflation with contemporaneous exclusion restrictions to insulate shock identification from current macroeconomic conditions. Inflation expectations were replaced with the Haubrich, Pennacchi, and Ritchken (2012) model-based measure throughout, rather than only pre-2003. For the crisis probability models, panel probit with random effects and panel logit with fixed effects were estimated alongside the baseline panel logit with random effects. In all cases, the results were not materially different: inflation and reaction shocks remained more adverse than real shocks for EMDE financial and fiscal variables, and only reaction shocks produced statistically significant increases in overall crisis probability. One noteworthy robustness finding: when combining inflation and reaction into a single monetary shock, the relative importance of the inflation component appears somewhat larger than when the two are separated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What are this paper&amp;rsquo;s main contributions relative to existing literature?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper makes three stated contributions. First, it is the first to decompose the evolution of U.S. interest rates over the COVID-19 pandemic recession, subsequent recovery, and 2021-22 inflation surge into the separate contributions of real, inflation, and reaction shocks. Second, it extends prior work on EMDE spillovers (e.g., Arteta et al. 2015; Hoek, Kamin, and Yoldas 2021, 2022) by showing how different shock types affect government budget balances, revenues, expenditures, and debt composition, and by expanding the EMDE country sample. Third, it is the first to examine how real, inflation, and reaction shocks differentially affect the probability of banking, currency, and sovereign debt crises in EMDEs.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Reaction shock&lt;/strong&gt;: In this paper&amp;rsquo;s framework, a change in U.S. interest rates caused by a perceived shift in the Federal Reserve&amp;rsquo;s reaction function toward a more hawkish policy stance. Identified as a shock that raises both 2-year and 10-year Treasury yields while simultaneously lowering equity prices and lowering inflation expectations. The fall in inflation expectations distinguishes this shock from an inflation shock and signals that markets believe the Fed is tightening beyond what current inflation alone would warrant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inflation shock&lt;/strong&gt;: A change in U.S. interest rates caused by rising expectations of U.S. inflation. Identified as a shock that raises both yields and inflation expectations but lowers equity prices. The equity decline signals that higher rates reflect inflationary pressure rather than improved growth prospects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Real shock&lt;/strong&gt;: A change in U.S. interest rates driven by improved prospects for U.S. real economic activity. Identified as a shock that simultaneously raises both yields, equity prices, and inflation expectations. The equity increase distinguishes this shock from the other two and signals that higher rates are accompanied by strengthening U.S. growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sign-restricted Bayesian VAR with stochastic volatility&lt;/strong&gt;: The paper&amp;rsquo;s primary model for decomposing U.S. yield movements. Sign restrictions on four variables (2-year yield, 10-year yield, S&amp;amp;P 500, 5-year inflation expectations) identify the three shock types without requiring timing restrictions. Stochastic volatility is incorporated to handle the heteroskedastic financial data and the COVID-19 period&amp;rsquo;s unusual size and nature; the model covers February 1982 to September 2022 at monthly frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Panel local projection (Jorda 2005)&lt;/strong&gt;: The empirical framework linking the VAR-identified shock series to EMDE outcomes at quarterly frequency. Direct estimation of impulse responses at each horizon h avoids the misspecification accumulated in iterated VAR forecasts and permits straightforward incorporation of state-dependent (investment-grade vs. noninvestment-grade) heterogeneity via a dummy-variable interaction specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital flows (as used in this paper)&lt;/strong&gt;: Defined specifically as increases in net portfolio and other investment liabilities of EMDEs, excluding foreign direct investment liabilities. This definition isolates the more volatile, financially driven flows rather than the longer-horizon FDI component.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial crisis typology (Laeven and Valencia 2020)&lt;/strong&gt;: The crisis classification underlying the logit analysis. Sovereign debt crises are defined as a government default or restructuring of debt owed to private creditors. Banking crises require significant distress in the banking system combined with significant policy intervention measures. Currency crises are defined as a sharp nominal depreciation of at least 30 percent against the U.S. dollar. The paper uses these definitions from Laeven and Valencia (2020), extended through 2018 in Kose et al. (2021).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Primary budget balance improvement via expenditure compression&lt;/strong&gt;: In the paper&amp;rsquo;s framework, the fiscal adjustment mechanism triggered specifically by inflation and reaction shocks: EMDE governments improve their primary balance (reduce deficits or increase surpluses) almost exclusively by cutting expenditures, rather than raising revenues, as a response to the credit tightening and higher borrowing costs associated with adverse U.S. interest rate shocks.&lt;/p&gt;</description></item><item><title>How Do You Identify a Good Manager?</title><link>https://macropaperwarehouse.com/papers/how-do-you-identify-a-good-manager/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/how-do-you-identify-a-good-manager/</guid><description>&lt;p&gt;This paper develops a novel experimental method to identify the causal contribution of managers to team performance, and uses it to evaluate which characteristics predict managerial effectiveness and how manager selection mechanisms affect organizational outcomes.&lt;/p&gt;
&lt;p&gt;The core identification challenge is that managers are not randomly assigned to teams in the field, and field managers are a highly non-random sample, making it difficult to infer which traits genuinely predict managerial performance. The authors address this by repeatedly randomly assigning managers to multiple teams in a controlled laboratory experiment, then estimating each manager&amp;rsquo;s average causal contribution to group output after conditioning on group members&amp;rsquo; individual productive skills. The intuition is that a good manager is someone who consistently causes their team to produce more than the sum of their parts.&lt;/p&gt;
&lt;p&gt;The experiment was conducted at the University of Essex lab with 555 participants (46% female, mean age 25, ethnically diverse) forming 728 groups of three across four rounds. Each group consisted of one manager and two workers who performed a Collaborative Production Task requiring coordination across three problem-solving modules (numerical, spatial, and analytical reasoning). The team score was the minimum module score — a weakest-link structure making coordination essential. Prior to group testing, all participants completed individual assessments of task-specific skill, fluid intelligence (CFIT), emotional perceptiveness (Reading the Mind in the Eyes Test, RMET), economic decision-making skill (the Assignment Game, which measures resource allocation under comparative advantage), Big 5 personality, and demographic characteristics. Manager selection was randomly varied at the session level: in 20 sessions, the participant with the strongest preference for leadership became manager (self-promotion); in 19 sessions, managers were assigned by lottery.&lt;/p&gt;
&lt;p&gt;The main quantitative findings are as follows. First, there are large, stable, and statistically significant manager effects: a manager one standard deviation above average improves team performance by approximately 0.23 standard deviations (p = 0.04). This estimate is roughly 90% the size of the combined productive skill coefficient for the two workers (approximately 0.26 sd), indicating that a good manager is roughly twice as valuable as a good individual worker. Manager contributions predict out-of-sample group performance in a leave-one-out procedure (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Second, among randomly assigned managers, only two predictors significantly explain managerial performance: fluid intelligence (CFIT) and economic decision-making skill (Assignment Game scores), both significant at below the 1% level. Gender, age, and ethnicity do not predict managerial performance.&lt;/p&gt;
&lt;p&gt;Third, self-promoted managers perform substantially worse than lottery-assigned managers, by approximately 0.10 standard deviations — roughly equivalent to being assigned a manager with fluid intelligence one full standard deviation below average. The mechanism is overconfidence: people who strongly prefer management roles are significantly more overconfident (d = 0.41 sd, p &amp;lt; 0.01) and exhibit a strong negative correlation between self-reported social skills and actual emotional perceptiveness on the RMET (r = -0.37, p &amp;lt; 0.001). Among self-promoted managers, self-reported extraversion and political skill are negatively correlated with managerial performance (rho = -0.24 and -0.26, p &amp;lt; 0.05); no such negative relationship appears among lottery managers.&lt;/p&gt;
&lt;p&gt;Fourth, selecting managers on economic decision-making skill rather than self-promotion improves average manager quality by 0.6 standard deviations — equivalent to replacing an average worker in every group with a worker at the 99th percentile of individual productivity.&lt;/p&gt;
&lt;p&gt;The three mechanisms through which good managers improve performance are: (1) monitoring — good managers (1 sd above average) cut monitoring errors from 16% to 8%; (2) optimal task allocation according to comparative advantage — groups with optimally assigned workers score 0.52 sd higher (p &amp;lt; 0.01); (3) worker motivation in late-stage effort — teams led by a 1-sd-above-average manager solve 0.6 more problems in the final two minutes versus only 0.3 more in the first two minutes.&lt;/p&gt;
&lt;p&gt;The experiment was conducted in a university lab in the UK, and the sample skews toward graduate students with limited work experience. Generalizability to field settings is supported by prior evidence that peer productivity spillover experiments yield similar magnitudes in lab versus field settings, and that the estimated manager effects are similar to Lazear et al. (2015) estimates from a large employer dataset.&lt;/p&gt;
&lt;p&gt;Q: What is the core methodological innovation of this paper?
A: The paper requires repeated random assignment of managers to multiple teams, combined with controls for individual productive skill measured prior to group work. This allows identification of each manager&amp;rsquo;s average causal contribution to group output, rather than confounding management quality with team composition or individual worker ability. The key estimand is the standard deviation of individual manager effects (sigma_alpha), interpreted as the impact of having a manager one standard deviation above average.&lt;/p&gt;
&lt;p&gt;Q: How large is the estimated manager effect, and how does it compare to worker effects?
A: A manager one standard deviation above average improves team performance by approximately 0.23 standard deviations (p = 0.04 by randomization inference). This is roughly 90% the size of the combined productive skill effect of both workers together (approximately 0.26 sd), implying a good manager is nearly twice as valuable as a good individual worker. Without conditioning on production skills, the manager effect rises to 0.29 sd.&lt;/p&gt;
&lt;p&gt;Q: What characteristics predict managerial performance among randomly assigned managers?
A: Only two measures predict managerial performance in the lottery arm: fluid intelligence (CFIT) and economic decision-making skill (scores on the Assignment Game), both significant at below the 1% level. These predictors are robust to controls for demographics, education, work experience, emotional perceptiveness, and personality traits. Gender, age, and ethnicity do not predict managerial performance.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;Assignment Game&amp;rdquo; and why is it a strong predictor?
A: The Assignment Game (Caplin et al., 2024) places participants in a simulated managerial role where they must assign fictional workers to tasks. Performing well requires understanding comparative advantage intuitively, managing an attentionally demanding numerical environment, and avoiding biases such as anchoring. The paper argues its strong predictive power reflects that good managers excel at allocating workers according to comparative advantage — which the experiment directly identifies as a key mechanism.&lt;/p&gt;
&lt;p&gt;Q: How do self-promoted managers perform relative to lottery-assigned managers?
A: Self-promoted managers perform approximately 0.10 standard deviations below lottery managers, and this gap is robust across model specifications. The performance deficit is roughly equivalent to being assigned a manager whose fluid intelligence is one full standard deviation below average. This finding implies that common organizational practice of selecting managers partly via self-nomination actively reduces team productivity.&lt;/p&gt;
&lt;p&gt;Q: Why do self-promoted managers underperform?
A: The paper attributes underperformance primarily to overconfidence. People strongly preferring management roles are significantly more overconfident than those without strong preferences (d = 0.41 sd, p &amp;lt; 0.01). Self-promoted managers specifically overestimate their social skills: among them, self-reported people skills are strongly negatively correlated with actual emotional perceptiveness on the RMET (r = -0.37, p &amp;lt; 0.001), and self-reported extraversion and political skill are negatively correlated with managerial performance (rho = -0.24 and -0.26, p &amp;lt; 0.05). None of these negative relationships appear among lottery managers.&lt;/p&gt;
&lt;p&gt;Q: Who wants to be a manager, and does it differ by gender?
A: The three variables most strongly correlated with wanting to be in charge are extraversion, risk appetite, and being male. The relationship between high extraversion and preference for management is driven largely by men. Women are much less likely to nominate themselves for leadership roles despite being equally or more effective on average — a finding consistent with broader experimental evidence on gender and leadership self-selection.&lt;/p&gt;
&lt;p&gt;Q: How large are the potential gains from skill-based manager selection?
A: Compared to self-promotion, selecting managers based on economic decision-making skill yields managers who are 0.6 standard deviations better in terms of estimated manager effects. In terms of group performance, this is equivalent to replacing an average worker in every group with a worker at the 99th percentile of individual productivity. Selecting on both economic decision-making and fluid intelligence outperforms random assignment, selection on social skills, or selection on worker task performance (the Peter Principle).&lt;/p&gt;
&lt;p&gt;Q: What are the three mechanisms through which good managers improve team performance?
A: First, monitoring: good managers (1 sd above average) reduce monitoring errors — defined as having a worker on a module substantially above the minimum score at task end — from 16% to 8% (bivariate correlation with manager performance = -0.40, p &amp;lt; 0.001). Second, optimal task allocation: the probability of finding the optimal comparative-advantage-based assignment is positively associated with manager performance (rho = 0.19, p &amp;lt; 0.01), and groups with always-optimal starting assignments score 0.52 sd higher than those with never-optimal assignments (p &amp;lt; 0.01). Third, worker motivation: team performance in the final two-minute period is about 50% more influential for overall outcomes than the first two minutes (p = 0.038), and 1-sd-above-average managers generate 0.6 more problems solved in the final period versus 0.3 in the first, consistent with differential motivational effects emerging over time.&lt;/p&gt;
&lt;p&gt;Q: What is the Peter Principle, and how does this paper relate to it?
A: The Peter Principle refers to the practice of promoting employees based on their performance as line workers rather than their suitability for management — promoting individuals to their level of incompetence. Benson et al. (2019) document this selection pattern empirically. This paper shows that selecting managers on worker task skill is inferior to selecting on economic decision-making skill or fluid intelligence, confirming that task skill is not the right criterion for manager selection even if it predicts individual worker output.&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate that manager effects are real and not noise?
A: The paper uses randomization inference with 5,000 simulated allocations to compute p-values, obtaining p = 0.04 for the main manager effect. Robustness checks include controlling for pre-existing social relationships, manager risk appetite, variance of individual scores, and granular skill measures — all yielding estimates near 0.22 sd. A leave-one-out out-of-sample prediction test confirms manager contributions significantly predict held-out group performance (p &amp;lt; 0.01), while the analogous worker out-of-sample estimate is less than half the magnitude and not statistically significant.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions on the experimental results?
A: The experiment is conducted in a university lab in the UK with graduate students averaging 25 years of age and two years of work experience, limiting direct generalizability to experienced workers or senior management. The task lasts approximately 15 minutes, which may not capture longer-run managerial dynamics. Compensation equalized average earnings between managers and workers, which differs from most real-world settings. The authors note their effect-size estimates closely match Lazear et al. (2015) from a large employer, and that Herbst and Mas (2015) find lab peer-productivity experiments generalize to the field.&lt;/p&gt;
&lt;p&gt;Manager Effect (sigma_alpha): The standard deviation of individual managers&amp;rsquo; average causal contributions to group performance, estimated via repeated random assignment and conditioning on individual productive skill. Represents the impact of having a manager one standard deviation above average, estimated at approximately 0.23 standard deviations of group output.&lt;/p&gt;
&lt;p&gt;Collaborative Production Task: A novel lab group task in which a manager and two workers solve problems across three modules (numerical, spatial, analytical reasoning), with team score defined as the minimum module score (weakest-link structure). Managers are responsible for worker assignment, monitoring, and motivation; workers face no financial performance incentives.&lt;/p&gt;
&lt;p&gt;Economic Decision-Making Skill: Defined by Caplin et al. (2024) as the ability to make good resource allocation decisions, assessed via the Assignment Game in which participants must optimally assign workers to tasks under comparative advantage. The single strongest predictor of managerial performance in the lottery arm.&lt;/p&gt;
&lt;p&gt;Monitoring Failure: Defined in the paper as having any group member working on a module at task end whose score is substantially greater (e.g., 10 points higher) than the minimum module score — meaning the worker&amp;rsquo;s effort is not contributing to the group score. Occurs in 16% of groups overall; managers one sd above average reduce this to 8%.&lt;/p&gt;
&lt;p&gt;Self-Promotion (as selection mechanism): A treatment condition in which the participant with the strongest stated preference for being manager (on a 1-10 scale) is assigned the managerial role. Contrasted with lottery assignment; self-promoted managers perform approximately 0.10 sd worse than lottery managers.&lt;/p&gt;
&lt;p&gt;Overconfidence (in managerial context): The gap between self-assessed skill (particularly social/interpersonal skill) and objectively measured skill (e.g., RMET score). Self-promoters are significantly more overconfident (d = 0.41 sd), and overconfidence is strongly negatively correlated with actual emotional perceptiveness (r = -0.33, p &amp;lt; 0.001).&lt;/p&gt;
&lt;p&gt;Comparative Advantage Allocation: The practice of assigning each worker to the module in which they have the highest relative (not absolute) performance advantage. Captured via whether a manager selects the optimal one-to-one assignment given pre-measured individual module scores; groups with always-optimal allocation score 0.52 sd higher.&lt;/p&gt;</description></item><item><title>Ideas Have Consequences: The Impact of Law and Economics on American Justice</title><link>https://macropaperwarehouse.com/papers/ideas-have-consequences-the-impact-of-law-and-economics-on-american-justice/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/ideas-have-consequences-the-impact-of-law-and-economics-on-american-justice/</guid><description>&lt;p&gt;This paper quantifies the effect of the Manne Economics Institute for Federal Judges — an intensive two-week economics training program run by the Law and Economics Center from 1976 to 1998 — on the decision-making of U.S. federal judges. The research question is whether exposure to a coherent set of economic ideas can directly shift the policy decisions of sitting policymakers, as distinct from effects operating through partisan affiliation or formal legal rules.&lt;/p&gt;
&lt;p&gt;The program trained nearly half of all federal judges over its two decades of operation. By 1990, forty percent of federal judges had attended; by the late 1990s, roughly half of circuit court cases had a Manne-trained judge on the panel. Instructors included Milton Friedman, Armen Alchian, Harold Demsetz, Martin Feldstein, Paul Samuelson, and Orley Ashenfelter, covering supply-and-demand theory, the Coase Theorem, externalities, property rights, and criminal deterrence following Becker (1968). The program was funded by pro-business foundations and had a recognized conservative-leaning orientation, though it invited both Republican- and Democrat-appointed judges and was popular across party lines.&lt;/p&gt;
&lt;p&gt;The identification strategy is a differences-in-differences design exploiting staggered attendance timing. Because the program was oversubscribed and admitted judges on a first-come-first-served basis — with applicants bumped to later cohorts when capacity was reached — the timing of attendance within the ever-attending population has a quasi-random component. The preferred control group consists exclusively of other ever-attending judges who had not yet attended, rather than never-attenders, because never-attenders differ systematically on observables and show a pre-existing positive trend in economics language use, likely from ambient diffusion through clerks, law schools, and organizations such as the Federalist Society. Judge fixed effects and circuit-by-year (or courthouse-by-year) fixed effects absorb time-invariant judge characteristics and court-level time trends. Elastic-net-selected covariates predicting attendance timing, fully interacted with year fixed effects, are added as robustness controls. Standard errors are clustered by judge.&lt;/p&gt;
&lt;p&gt;The data cover approximately 200,000 published circuit court opinions (1970–2005) from Bloomberg Law, a 5% random sample of circuit cases hand-coded for ideological direction from the Songer-Auburn database, machine-coded regulatory agency outcomes, a newly collected antitrust case dataset, and approximately 1.03 million district court criminal sentencing records (1992–2003) from TRAC.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, after attending the Manne program, judges increase their use of economics language in written opinions by approximately one-third of a standard deviation, measured via word-embedding similarity to an economics lexicon; this effect is statistically significant in the short-run event-study window but does not persist over the full career. Second, Manne attendance raises conservative voting in economics-related cases (labor and regulation) by approximately one-quarter of a standard deviation — corresponding to judges deciding in the conservative direction about 20 percent more often relative to the mean — with no significant effect on non-economics cases; the interaction effect is robust across specifications including never-attenders. Third, post-Manne judges vote more frequently against federal labor and environmental regulatory agencies, a result that is statistically significant and economically meaningful with no detectable pre-trends. Fourth, post-Manne judges impose longer and more frequent prison sentences, with no increase in sentencing harshness for drug crimes — consistent with Manne instructors having explicitly advocated drug legalization — and with the harshness gap between Manne and non-Manne judges widening after the 2005 Booker decision expanded judicial sentencing discretion. Fifth, there is some evidence of increased voting against antitrust enforcement, though this result is more sensitive to specification. Persuasion rates computed following DellaVigna and Gentzkow (2010) are slightly larger than those estimated for partisan media interventions such as Fox News and are closest to the effect of a 10-week Washington Post subscription on Democratic governor vote share. Neither the legalist model (judges follow statutes mechanically) nor the attitudinal model (judges follow party affiliation) can explain these within-judge, within-party shifts.&lt;/p&gt;
&lt;p&gt;Q: What is the central identification challenge and how do the authors address it?
A: The key threat is that judges who chose to attend the Manne program — or who attended at a particular time — may differ systematically from non-attenders in ways correlated with their decision trajectories. The authors address this in two steps. First, they restrict the control group to other ever-attending judges who had not yet attended, exploiting the first-come-first-served oversubscription rule that created quasi-random variation in timing among applicants. Second, they use judge fixed effects plus circuit-by-year fixed effects, and add elastic-net-selected biographical covariates (e.g., birth cohort indicators) interacted with year fixed effects as a robustness check. Republican affiliation — the most salient ideological predictor of attendance — is not a statistically significant predictor of attendance timing, supporting the exclusion restriction.&lt;/p&gt;
&lt;p&gt;Q: Why are never-attenders excluded from the preferred control group?
A: Never-attenders differ from attenders on observables including political party and show a positively trending use of economics language in their opinions even before any treatment, suggesting ambient diffusion of economics ideas through law clerks, law school curricula, and organizations such as the Federalist Society. Including never-attenders in the control group produces a near-zero coefficient on the language outcome, which the authors interpret as reflecting spillovers rather than a true null effect; the coefficient on conservative voting in the interaction specification, however, remains positive and significant even when never-attenders are included.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the effect on economics language use?
A: The within-judge effect of Manne attendance on the word-embedding similarity between judicial opinions and an economics lexicon is approximately one-third of a standard deviation, statistically significant in the short-run event-study window (covering six years before and after attendance). The effect shrinks and becomes non-significant when the full career of Manne judges is examined (rather than just the event-study window), consistent with broad diffusion of economics language across the judiciary over time rather than a persistent individual-level treatment effect.&lt;/p&gt;
&lt;p&gt;Q: How large is the effect on conservative voting, and is it concentrated in particular case types?
A: Post-Manne attendance raises conservative voting in economics-related cases (labor and regulation) by approximately one-quarter of a standard deviation, corresponding to judges deciding in the conservative direction about 20 percent more often relative to the mean liberal-conservative decision rate. There is no statistically significant effect on non-economics cases. The interaction coefficient — the differential effect on economics versus non-economics cases — is positive and significant across all specifications including the full sample with never-attenders, making this the most robust directional result in the paper.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on regulatory agency voting?
A: Post-Manne judges vote more frequently against federal labor agencies (National Labor Relations Board, OSHA, Department of Labor, Federal Labor Relations Authority, Office of Worker&amp;rsquo;s Compensation Programs) and the Environmental Protection Agency. The event study shows a positive and significant increase that persists across the event-study window with no detectable pre-trends. This result is robust to both the baseline specification and the elastic-net-controls specification.&lt;/p&gt;
&lt;p&gt;Q: What is the effect on criminal sentencing, and what heterogeneity is found?
A: Post-Manne judges impose both more frequent prison sentences and longer sentences, consistent with Becker&amp;rsquo;s deterrence framework taught in the program&amp;rsquo;s criminal law curriculum. The sentencing effects are absent for drug crimes, consistent with Manne instructors — including Milton Friedman — having explicitly advocated against the drug war and for drug legalization. The gap in sentencing harshness between Manne and non-Manne judges widens after the 2005 United States v. Booker decision, which made the Federal Sentencing Guidelines advisory rather than mandatory; this is consistent with the program having shaped latent judicial preferences that are expressed more fully when formal constraints are relaxed.&lt;/p&gt;
&lt;p&gt;Q: How do the persuasion rates compare to benchmark media studies?
A: The persuasion rates computed following DellaVigna and Gentzkow (2010) are slightly larger than those estimated for partisan media interventions such as Fox News (DellaVigna and Kaplan, 2007) and are closest to the persuasion rates implied by a 10-week subscription to the Washington Post on Democratic governor vote share (Gerber et al. 2009). The comparison contextualizes the Manne program as a moderately high-intensity ideational intervention relative to documented cases of political persuasion.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for theories of judicial behavior?
A: The findings are inconsistent with both the legalist/formalist model — under which judges apply statutes and precedent without regard to extra-legal factors, predicting zero effect — and the attitudinal model — under which judges simply follow partisan preferences, also predicting zero effect since the program attended judges of both parties. The within-judge, within-party shifts point to a third channel: judicial worldviews and economic ideas, independent of formal law and partisan affiliation, shape high-stakes precedent-setting decisions.&lt;/p&gt;
&lt;p&gt;Q: Can the authors distinguish between a pedagogical (informational) and an ideological persuasion mechanism?
A: They cannot definitively distinguish between the two. Both mechanisms predict increased economics language, more conservative rulings in economics cases, deregulatory voting, and harsher non-drug sentences. The drug-crime heterogeneity is somewhat more consistent with a nuanced pedagogical channel, since Manne instructors explicitly discussed drug legalization, but this pattern is also consistent with complex ideological effects. Evidence on decision quality (citation rates, judicial promotion) is mixed and not robust, providing no clean test of the informational mechanism.&lt;/p&gt;
&lt;p&gt;Q: What does the antitrust evidence show?
A: Post-Manne judges tend to vote against antitrust claimants (i.e., in favor of less antitrust enforcement), but this result is more sensitive to specification than the regulatory agency and sentencing results and is not always statistically significant across specifications. The authors treat it as suggestive rather than conclusive.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the literature on economics education and normative beliefs?
A: Prior work finds that economics students are less redistributive (Selten and Ockenfels 1998), view surge prices more favorably (Frey and Meier 2005), favor profit maximization (Rubinstein 2006), and that economics professors are less ideologically liberal than other social scientists (Jelveh et al. 2018). The present paper extends this literature by studying established professionals (judges) making high-stakes real-world decisions, and by documenting a direct policy impact rather than a change in survey responses or experimental choices.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the dataset and the program coverage?
A: The circuit court dataset covers approximately 200,000 published opinions from 1970 through 2005. The district court sentencing dataset covers approximately 1.03 million cases from 1992 through 2003 (event study sample). The Manne program ran from 1976 to 1998, with roughly twenty judges per cohort; by 1990 forty percent of federal judges had attended, and by the late 1990s roughly half of circuit court cases had a Manne-trained panelist. Biographical information comes from the Federal Judicial Center; program attendance lists come from Butler (1999) supplemented by FOIA-obtained annual reports.&lt;/p&gt;
&lt;p&gt;Manne Economics Institute for Federal Judges: An intensive two-week economics training program for sitting U.S. federal judges, run by the Law and Economics Center from 1976 to 1998, covering supply-and-demand theory, the Coase Theorem, externalities, property rights, deterrence theory, and related topics; funded by pro-business foundations; admitted judges on a first-come-first-served basis and trained nearly half of all federal judges over its operation.&lt;/p&gt;
&lt;p&gt;Word-embedding economics language measure: A continuous measure of how closely a judicial opinion&amp;rsquo;s vocabulary aligns with a lexicon of law-and-economics phrases, constructed using word2vec embeddings (Mikolov et al. 2013) trained on the corpus of judicial opinions; measures the semantic proximity of opinion text to the Ellickson (2000) economics lexicon in embedding space, capturing implicit and contextual use of economics reasoning rather than raw phrase counts.&lt;/p&gt;
&lt;p&gt;Deterrence theory (Becker model): The framework, drawn from Becker (1968), taught in the Manne program&amp;rsquo;s criminal law curriculum, which holds that optimal crime deterrence requires setting the expected penalty — the economic cost of punishment times the probability of detection — high enough to outweigh the expected benefits of crime; treated in the paper as the theoretical basis for predicting harsher sentencing among post-Manne judges, and contrasted with retribution- or rehabilitation-based sentencing rationales that dominated before its diffusion.&lt;/p&gt;
&lt;p&gt;Conservative judicial decision (economics cases): In the paper&amp;rsquo;s usage, a ruling against the liberal/pro-plaintiff position in a case involving labor or regulation, as hand-coded by the Songer-Auburn database; includes ruling against a labor agency, rejecting a regulatory claimant, or voting against antitrust enforcement; the paper finds Manne attendance shifts judges in this direction in economics cases but not in non-economics cases.&lt;/p&gt;
&lt;p&gt;First-come-first-served oversubscription: The admission rule of the Manne program during its oversubscribed heyday (from the second cohort in 1977 through the late 1980s), under which applicants who did not secure a spot were bumped to the next year&amp;rsquo;s cohort; the authors argue this rule generates quasi-random variation in the timing of attendance among ever-attending judges, conditional on applying, providing the identifying variation for the differences-in-differences design.&lt;/p&gt;
&lt;p&gt;Persuasion rate: A summary statistic, following DellaVigna and Gentzkow (2010), measuring the fraction of the &amp;ldquo;persuadable&amp;rdquo; population that is convinced by a treatment; used in the paper to benchmark the Manne program&amp;rsquo;s effect size against documented media persuasion interventions such as Fox News and Washington Post subscriptions.&lt;/p&gt;</description></item><item><title>Identification and Estimation of Dynamic Random Coefficient Models</title><link>https://macropaperwarehouse.com/papers/identification-and-estimation-of-dynamic-random-coefficient-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/identification-and-estimation-of-dynamic-random-coefficient-models/</guid><description>&lt;p&gt;This paper studies linear panel data models where regression coefficients are individual-specific (random coefficients) and regressors may be predetermined — that is, sequentially exogenous rather than strictly exogenous, as occurs when a lagged dependent variable appears on the right-hand side. The canonical example is the AR(1) model Yit = gamma_i + beta_i * Yi,t-1 + epsilon_it, where both the intercept and the autoregressive coefficient vary across individuals. The setting is short panels (small T), which rules out learning about individual-level coefficient values.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central finding, building on Chamberlain (1993, 2022), is that the mean of the coefficient distribution is not point-identified in this dynamic setting. Chamberlain established this for discrete regressors; the paper&amp;rsquo;s Proposition 1 extends the non-identification result to continuous regressors under stronger assumptions. The paper then characterizes finite lower and upper bounds for the mean, variance, and CDF of the random coefficient distribution. The identification strategy recasts the problem as an infinite-dimensional linear program and exploits the dual representation of that program (following Galichon and Henry (2009) and Schennach (2014)) to derive tractable closed-form bounds for the mean and optimization-based bounds for the variance and CDF.&lt;/p&gt;
&lt;p&gt;For the mean parameter, the bounds take a closed-form expression involving the individual OLS estimator, the pooled OLS estimator, and cross-sectional moments of the data. The bounds remain finite even when the data are unbounded, provided certain moments of the data are finite. Tighter (refined) bounds are available when instrumental variables are brought in as additional unconditional moment restrictions. A numerical illustration shows how the outer identified set for E(beta_i) with a true value of 0.5 shrinks as T increases: at T=3 the outer set is approximately [0.216, 0.617]; at T=5 it narrows to approximately [0.306, 0.613]; the corresponding sharp identified sets (available for T=3 through T=5) range from [0.401, 0.593] at T=3 to [0.473, 0.532] at T=5.&lt;/p&gt;
&lt;p&gt;The paper proposes computationally tractable inference procedures matched to each parameter. For mean parameters, the closed-form bounds permit a delta-method asymptotic approach augmented with Stoye&amp;rsquo;s (2020) smooth approximation to handle cases where the sample analog of the bound width can be negative (due to overidentification or mild misspecification). The resulting confidence intervals are valid and robust to overidentification. For the variance and CDF of the coefficient distribution, the paper uses the Andrews and Shi (2017) procedure for inference on a continuum of moment inequalities, which remains computationally feasible.&lt;/p&gt;
&lt;p&gt;The empirical application estimates a generalization of Guvenen&amp;rsquo;s (2007, 2009) lifecycle earnings models using the Panel Study of Income Dynamics (PSID). Where Guvenen compared a restricted income profile (RIP, homogeneous persistence rho) against a heterogeneous income profile (HIP, heterogeneous time trend beta_i), this paper allows persistence rho itself to vary across households (rho_i). The key empirical findings are: (1) under both the RIP and HIP specifications, the estimated average earnings persistence E(rho_i) is significantly below 1; (2) the two specifications produce similar mean-persistence estimates once heterogeneity in rho_i is permitted, suggesting that misspecifying HIP as RIP or vice versa may not cause serious model misspecification when earnings persistence is allowed to vary; (3) the identified sets for the variance of rho_i provide evidence of genuine heterogeneity in earnings persistence across households, implying that households face different levels of earnings risk, which in turn contributes to heterogeneity in their consumption and savings behavior.&lt;/p&gt;
&lt;p&gt;Q: Why is the mean of the random coefficient not point-identified in a short dynamic panel?
A: Chamberlain (1993, 2022) first established this non-identification for discrete regressors. The paper&amp;rsquo;s Proposition 1 extends the result to continuous regressors under stronger assumptions. The fundamental obstacle is Lemma 1: E(beta_i) is point-identified if and only if there exists an unbiased estimator of beta_i in the individual time series, and no such estimator exists in short panels where T is small relative to the number of individual parameters.&lt;/p&gt;
&lt;p&gt;Q: How does the paper characterize the identified set for the mean parameter?
A: The identification problem is recast as an infinite-dimensional linear program. Using the dual representation (Galichon and Henry, 2009; Schennach, 2014), Theorem 1 yields a closed-form interval [L, U] = [BR - (1/2)&lt;em&gt;sqrt(ER&lt;/em&gt;DR), BR + (1/2)&lt;em&gt;sqrt(ER&lt;/em&gt;DR)], where BR is a weighted average of the individual OLS estimator and the pooled OLS estimator, ER is a non-negative term capturing cross-sectional variation in design matrices, and DR is a non-negative term related to residual variation. The bounds are finite whenever the relevant moments of the data are finite, even with unbounded data.&lt;/p&gt;
&lt;p&gt;Q: How are the bounds tightened using instruments?
A: Proposition 2 introduces refined bounds [LS, US] by incorporating additional unconditional moment restrictions from instruments Sit. The refined bounds use a larger set of restrictions and are weakly tighter than the baseline bounds. The empirical application employs up to 59 regressors with homogeneous coefficients (handled by Proposition 3), and instruments from lagged earnings levels and differences, substantially increasing the number of moment conditions.&lt;/p&gt;
&lt;p&gt;Q: How are the variance and CDF of the coefficient distribution identified?
A: Theorem 2 provides a general duality result for any parameter theta of the coefficient distribution. The lower bound is the maximum of E[min_{b} {m(Wi,b) + sum_k lambda_k phi_k(Wi,b)}] over Lagrange multipliers lambda, and the upper bound is the minimum of the corresponding maximum. Proposition 5 and Proposition 6 specialize this to the second moment (variance) of beta_i, with the upper bound requiring an eigenvalue assumption (Assumption 9) that the smallest eigenvalue of the individual design matrix R&amp;rsquo;R is bounded away from zero. Proposition 7 derives lower and upper bounds for the CDF P(e&amp;rsquo;Bi &amp;lt;= c) using a two-step optimization that separates the support into two regions.&lt;/p&gt;
&lt;p&gt;Q: What guarantees computational tractability of the optimization problems?
A: Proposition 4 establishes that GL(lambda, w) is globally concave in lambda for every w, and GU(lambda, w) is globally convex in lambda for every w. This means the optimization problems for the lower and upper bounds are concave maximization and convex minimization problems respectively, which can be solved with standard convex optimization methods.&lt;/p&gt;
&lt;p&gt;Q: How does the inference procedure for mean parameters handle overidentification and misspecification?
A: In finite samples, the sample analog of the bound-width term D_hat_S can be negative, which would make the estimated bounds degenerate. The paper adopts Stoye&amp;rsquo;s (2020) approach using the smooth approximation s(x,y) = sqrt((xy + sqrt((xy)^2 + r^2))/2). The (1-alpha)-level confidence interval combines a standard bound-based interval with an interval for a pseudo-true parameter mu*_e, ensuring validity under both correct specification and mild overidentification or misspecification.&lt;/p&gt;
&lt;p&gt;Q: How does this paper&amp;rsquo;s approach to inference on the variance and CDF differ from that for the mean?
A: For the mean, closed-form bounds permit a straightforward delta-method asymptotic argument and explicit confidence intervals. For the variance and CDF, the paper uses the Andrews and Shi (2017) procedure for inference on a continuum of moment inequalities, constructing a test statistic TAS(theta) = sup_{lambda} max{sqrt(N)&lt;em&gt;(mu_hat_GL - theta)/sigma_hat_GL, sqrt(N)&lt;/em&gt;(theta - mu_hat_GU)/sigma_hat_GU}^2, 0, with the confidence set being the set of theta values not rejected. This procedure is computationally more demanding but remains feasible.&lt;/p&gt;
&lt;p&gt;Q: What are the main empirical findings from the PSID application?
A: In both the RIP and HIP specifications extended to allow heterogeneous persistence rho_i, the estimated average earnings persistence E(rho_i) is significantly below 1. Both specifications produce similar mean-persistence estimates once rho_i heterogeneity is permitted, suggesting that the HIP vs. RIP misspecification debate may be less consequential when persistence itself varies across households. The identified sets for the variance of rho_i provide evidence of genuine unobserved heterogeneity in earnings persistence.&lt;/p&gt;
&lt;p&gt;Q: What is the economic significance of heterogeneous earnings persistence?
A: Heterogeneity in earnings persistence rho_i means households face different levels of earnings risk: a household with high rho_i experiences earnings shocks that are more persistent, reducing its ability to smooth consumption over time and strengthening its motive for precautionary savings. The paper argues this heterogeneity contributes directly to heterogeneity in consumption and savings behavior, making rho_i a first-order parameter in lifecycle consumption models such as those of Hall and Mishkin (1982), Blundell, Pistaferri, and Preston (2008), and Arellano, Blundell, and Bonhomme (2017).&lt;/p&gt;
&lt;p&gt;Q: How does the paper situate itself relative to Guvenen (2007, 2009)?
A: Guvenen showed that allowing for heterogeneity in the time trend of earnings (HIP: heterogeneous income profile) yields estimated persistence significantly below 1, whereas imposing no such heterogeneity (RIP: restricted income profile) yields persistence near 1. This paper generalizes both models by additionally allowing persistence itself to vary across households (rho_i). The finding that both HIP and RIP deliver similar E(rho_i) estimates significantly below 1 suggests that Guvenen&amp;rsquo;s contrast may be partly an artifact of restricting persistence to be homogeneous.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the identification results?
A: The results apply to short panels (small T, large N), accommodate discrete, continuous, and unbounded data, and require the idiosyncratic error epsilon_it to be mean-independent of the full history of strictly exogenous regressors and of the current history of predetermined regressors. The bounds for the mean are finite under finite moment conditions on the data. The bounds for the variance additionally require the eigenvalue assumption (Assumption 9). The paper notes that the results extend to probit and logit models with individual-specific coefficients, panel VAR models, and systems of panel data regressions, though these extensions are not developed in detail.&lt;/p&gt;
&lt;p&gt;Dynamic random coefficient model: A linear panel data model in which both the intercept and slope coefficients are individual-specific (gamma_i, beta_i), the regressor is predetermined (sequentially exogenous rather than strictly exogenous), and T is small — so individual coefficient values cannot be estimated from the time series alone.&lt;/p&gt;
&lt;p&gt;Partial identification: The property that a parameter of interest (such as E(beta_i)) cannot be consistently estimated from the data (it is not point-identified), but finite lower and upper bounds on its value can be characterized. The paper shows this is the generic situation for dynamic random coefficient models in short panels.&lt;/p&gt;
&lt;p&gt;Dual representation of infinite-dimensional linear programs: The technique, following Galichon and Henry (2009) and Schennach (2014), of converting an infinite-dimensional linear programming problem (which arises when data or coefficients are continuous) into an equivalent dual problem that yields tractable closed-form or convex-optimization-based bounds.&lt;/p&gt;
&lt;p&gt;Refined bounds (instrument-augmented bounds): Tighter identified sets for the mean parameter obtained by incorporating additional unconditional moment restrictions from instruments Sit, beyond the baseline moment conditions. These correspond to Proposition 2 and make the identification interval weakly narrower.&lt;/p&gt;
&lt;p&gt;Sequential exogeneity (predetermined regressor): The assumption E(epsilon_it | gamma_i, beta_i, Zi1,&amp;hellip;,ZiT, Xi1,&amp;hellip;,Xit) = 0, which allows the regressor Xit (e.g., Yi,t-1) to be correlated with future errors but not current or past errors. This is weaker than strict exogeneity and is what makes the model dynamic and identification challenging.&lt;/p&gt;
&lt;p&gt;Heterogeneous income profile (HIP) vs. restricted income profile (RIP): In Guvenen&amp;rsquo;s framework, HIP allows the time trend of earnings to vary across individuals (heterogeneous beta_i), while RIP does not. The paper extends both by also allowing the AR(1) persistence parameter rho to vary across individuals (rho_i), yielding an empirically more general earnings process.&lt;/p&gt;
&lt;p&gt;Earnings persistence (rho_i): The individual-specific autoregressive coefficient in the lifecycle earnings process. High rho_i means earnings shocks last longer, increasing earnings risk, reducing the household&amp;rsquo;s ability to smooth consumption, and strengthening precautionary savings motives. The paper finds evidence that rho_i varies meaningfully across U.S. households in the PSID.&lt;/p&gt;</description></item><item><title>Identification of Time-Inconsistent Models: The Case of Insecticide-Treated Nets</title><link>https://macropaperwarehouse.com/papers/identification-of-time-inconsistent-models-the-case-of-insecticide-treated-nets/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/identification-of-time-inconsistent-models-the-case-of-insecticide-treated-nets/</guid><description>&lt;p&gt;This paper addresses two related problems: the formal identification of time-inconsistent preferences in dynamic discrete choice models with unobserved heterogeneous types, and the structural estimation of those preferences using data from a health intervention in rural Orissa, India. The identification challenge is fundamental — even the standard exponential discount factor delta is generically not identified in dynamic choice models (Rust 1994; Magnac and Thesmar 2002), and this non-identification extends a fortiori to the hyperbolic (beta, delta) parameterization. The paper&amp;rsquo;s first contribution is constructing identification conditions that overcome these results through two exclusion restrictions: a variable z that affects utility only through the perceived value of future states (played in the application by elicited beliefs about state evolution), and a variable r that acts as an imperfect signal of agent type but is uninformative about choices conditional on type.&lt;/p&gt;
&lt;p&gt;The general model accommodates a finite but unknown number of agent types — time-consistent (beta=1), time-inconsistent naive (beta&amp;lt;1, unaware of future present-bias), and time-inconsistent sophisticated (beta&amp;lt;1, aware of future present-bias) — as well as sub-types within each class. The paper proceeds in four identification steps when types are unobserved: identifying the total number of types (via the rank of an observable matrix), recovering type-specific choice probabilities, assigning type identities, and recovering preference parameters. For time-consistent and sophisticated agents, both beta and delta are point-identified. For naive agents, the parameters are set-identified in general, with point identification available under a monotonicity condition (Assumption 14) or by imposing a common exponential discount factor across types (Assumption 15).&lt;/p&gt;
&lt;p&gt;The empirical application studies demand for insecticide-treated nets (ITNs) and their periodic retreatment — a health-protective technology with low up-front cost but substantial future benefits — among households in malarious areas of rural Orissa. A key design feature is that households were offered either a standard ITN contract (with the option to purchase retreatment later) or a commitment contract bundling two consecutive retreatments, allowing the commitment product choice to serve as a noisy type signal r. Elicited beliefs about future state variables serve as the excluded z variable.&lt;/p&gt;
&lt;p&gt;The main empirical findings are: approximately 21% of the population is time-consistent, 49% are naive time-inconsistent, and 30% are sophisticated time-inconsistent — so time-inconsistent agents account for approximately 79% of the sample. The preferred estimates of the hyperbolic parameter beta are 0.16 for naive agents and 0.08 for sophisticated agents, indicating substantial present-bias in both groups. These estimates of the population type distribution and type-specific beta parameters are described as new to the literature.&lt;/p&gt;
&lt;p&gt;A counterfactual exercise quantifies the welfare cost of present-bias: the median undiscounted additional expected total cost of malaria during the study period attributable to under-investment in ITNs exceeds the price of a treated net by a factor of approximately six. However, because time-inconsistent households heavily discount future malaria costs, the discounted total costs of malaria are low for many inconsistent agents relative to the ITN price, explaining low demand from the agents&amp;rsquo; own subjective perspective. The paper also finds that commitment products are not disproportionately chosen by sophisticated agents — take-up of the commitment contract is actually higher among naive households — contradicting the deterministic mapping from commitment product purchase to sophistication that is commonly assumed in the literature. Finally, differences in per-period utilities across agent types exist but are not substantively important in explaining differential outcomes in the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core identification problem the paper addresses, and why is it hard?&lt;/strong&gt;
A: Even the standard exponential discount factor delta is generically not identified in dynamic discrete choice models (Rust 1994; Magnac and Thesmar 2002). This non-identification extends a fortiori to both beta and delta in the hyperbolic (beta, delta) model. When agents are also heterogeneous in unobserved type, the additional problem of identifying the population distribution of types — itself a key policy parameter — must be solved jointly with preference identification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What two exclusion restrictions provide the key identifying variation?&lt;/strong&gt;
A: The first restriction is a variable z that affects utility only via the perceived value of future states but not per-period utility (Assumption 3); in the application this is played by elicited subjective beliefs about future state evolution. The second is a variable r that predicts agent type but, conditional on type and observables, provides no additional information about choices (Assumption 16); in the application r includes elicited time-preference indicators and the choice of the commitment versus standard ITN contract.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does the paper require at least three periods?&lt;/strong&gt;
A: Three periods are the minimum required to capture the notions of time-inconsistency studied here: with only two periods, no time-inconsistency problem would arise. Three periods allow the researcher to separately observe how an agent plans in period 1, how the agent actually behaves in period 2 (potentially deviating from the period-1 plan), and how the agent behaves in the terminal period 3 where the problem reduces to a static discrete choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is point-identified versus set-identified across agent types?&lt;/strong&gt;
A: For time-consistent agents, all per-period utilities and the (single) discount factor delta are point-identified. For sophisticated agents, both beta and delta are separately point-identified under the rank conditions in Assumptions 10-11. For naive agents, the parameters are in general only set-identified (Lemma 4 provides sharp bounds); point identification holds under either a monotonicity condition (Assumption 14) or the assumption that naive and sophisticated agents share the same exponential discount factor (Assumption 15).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper identify the total number of types in the population?&lt;/strong&gt;
A: The number of types equals the rank of a directly identified matrix P formed from the joint distribution of actions and states in adjacent time periods (Proposition 1). The rank provides a lower bound in general and equals the true number of types when the state space is sufficiently rich and type-specific choice probabilities vary sufficiently across the state space (Assumptions 17 and 19).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper distinguish naive from sophisticated agents among the identified type-specific choice probabilities?&lt;/strong&gt;
A: A key diagnostic is the function delta_hat_tau(x2,z2), which compares an agent&amp;rsquo;s period-1 view of the future against what would be expected given period 2-3 choices. For time-consistent and sophisticated agents, this function is constant across the state space (x2,z2); for naive agents it varies across the state space (Lemma 7, Proposition 2). This variation arises because naive agents incorrectly anticipate their future behavior in period 1, generating a wedge between planned and actual continuation values that shifts with the state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What fraction of the sample is time-inconsistent, and what are the estimated beta parameters?&lt;/strong&gt;
A: Approximately 79% of the sample is time-inconsistent: 49% are naive and 30% are sophisticated. The preferred estimates of the hyperbolic (present-bias) parameter beta are 0.16 for naive agents and 0.08 for sophisticated agents. Both estimates indicate substantial present-bias. The paper states that these estimates of the population type distribution and the type-specific beta values are new to the literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the welfare cost of present-bias in terms of malaria risk?&lt;/strong&gt;
A: Present-bias leads to lower ITN purchases and fewer retreatments, which increases the likelihood of contracting malaria. The median undiscounted additional expected total cost of malaria during the study period attributable to under-investment in ITNs exceeds the price of a treated net by a factor of approximately six. However, because inconsistent agents heavily discount future health costs, the discounted total costs of malaria are low relative to the ITN price for many such agents, which explains low demand from the agents&amp;rsquo; own subjective perspective despite large social costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the paper find about commitment products and agent sophistication?&lt;/strong&gt;
A: The commitment contract — bundling two consecutive retreatments — was designed to appeal to sophisticated present-biased agents who anticipate their future self-control problems. Contrary to the deterministic mapping from commitment product purchase to agent sophistication commonly assumed in the literature, take-up of the commitment contract is actually higher among naive households than sophisticated ones. The paper argues this is possible because the model allows commitment product choice to only imperfectly predict type, enabling a richer analysis than prior work that rules out type heterogeneity by assumption.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Are differences in per-period utilities across types an important alternative explanation for observed behavior?&lt;/strong&gt;
A: Per-period utilities do vary across agent types, but the paper finds they are not substantively important in explaining differential outcomes in the sample. This finding supports the interpretation that time-inconsistent preferences — rather than heterogeneity in static preferences over states — are the primary driver of the behavioral differences observed across agent types in this context.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the role of elicited beliefs in the identification strategy?&lt;/strong&gt;
A: Elicited beliefs about the future evolution of state variables serve as the excluded variable z that shifts the forward-looking component of the value function while leaving per-period utility unchanged. The use of expectational data, as advocated by Manski (2004), provides a natural and interpretable source of identifying variation for the discount parameters. The paper argues that this plausible exclusion restriction contributes to the encouraging Monte Carlo simulation results relative to other work in the identification literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens to identification under partial sophistication?&lt;/strong&gt;
A: When agents are partially sophisticated — aware of some but not all of their future present-bias, so that beta_tilde in [beta, 1] rather than exactly equal to beta or 1 — the three time-preference parameters (delta, beta, beta_tilde) are not point-identified in general (Proposition 4 provides a set identification result). Point identification requires that the exponential discount factor delta be identified separately. The paper shows that partial and complete sophistication can be distinguished from time-consistency by whether the function delta_hat varies across the state space, and partially sophisticated types can be distinguished from fully sophisticated types under an additional variability condition (Assumption 23, Proposition 3).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hyperbolic (beta-delta) discounting:&lt;/strong&gt; A model of time-inconsistent preferences in which future utility at time s discounted from time t carries the factor beta*delta^(s-t), where beta&amp;lt;1 introduces an additional present-bias relative to pure exponential discounting. The parameter beta governs the wedge between the discount rate applied to immediate versus purely future tradeoffs; delta governs the intertemporal rate of substitution between any two future periods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sophisticated vs. naive agents:&lt;/strong&gt; Both types are time-inconsistent (beta&amp;lt;1) and both are aware of their current present-bias. Sophisticated agents (tau_S) also correctly anticipate the extent of their future present-bias (beta_tilde = beta), while naive agents (tau_N) incorrectly believe their future self will behave as if beta_tilde = 1. This difference in beliefs about future behavior drives distinct choice dynamics across the three periods, providing the key observable variation used to distinguish the two types.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exclusion restriction (z variable):&lt;/strong&gt; A state variable that enters the transition probabilities and thus the value of future states but does not enter the current per-period utility function (Assumption 3). Variation in z shifts the forward-looking component of the Bellman equation while holding current utility fixed, providing the identifying variation needed to separately recover discount parameters from per-period utility parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Type indicator / type proxy (r):&lt;/strong&gt; An observed variable that is informative about an agent&amp;rsquo;s time-preference type but, conditional on type and other observables, provides no additional information about choices (Assumption 16). In the application, r includes elicited time-preference indicators and whether the agent chose the commitment versus standard ITN contract. Critically, the mapping from r to type is imperfect, so r does not directly reveal type for each individual.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conditional choice probability (CCP) inversion:&lt;/strong&gt; Following Hotz and Miller (1993), the type-specific conditional choice probabilities P_tau(a_t|x_t, z_t) — directly identified from data given type — can be inverted to recover per-period utility differences and combinations of discount parameters without solving the full dynamic programming problem. This approach underpins the constructive identification arguments throughout the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Commitment contract:&lt;/strong&gt; A product design in which two consecutive ITN retreatments are bundled at purchase, intended to mitigate the time-inconsistency problem by removing the future self-control decision about retreatment. The commitment contract is theoretically predicted to be preferred by sophisticated present-biased agents; the paper finds this prediction fails empirically, with naive households showing higher take-up.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Present-bias welfare cost:&lt;/strong&gt; The undiscounted additional expected total cost of malaria attributable to under-investment in ITNs driven by present-bias. The paper estimates this cost exceeds the price of a treated net by a factor of approximately six at the median, capturing the gap between the social planner&amp;rsquo;s valuation of ITN adoption and the discounted valuation of time-inconsistent agents.&lt;/p&gt;</description></item><item><title>Ideological Alignment and Evidence-Based Policy Adoption</title><link>https://macropaperwarehouse.com/papers/ideological-alignment-and-evidence-based-policy-adoption/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/ideological-alignment-and-evidence-based-policy-adoption/</guid><description>&lt;p&gt;This paper investigates how the ideological alignment between knowledge-disseminating institutions and policymakers affects the adoption of evidence-based policies. The core research question is whether, and through which mechanisms, the ideology of the messenger — rather than the content of the message — determines whether local policymakers act on rigorous research evidence.&lt;/p&gt;
&lt;p&gt;The authors conduct a country-wide randomized controlled trial (RCT) across 5,678 touristic Spanish municipalities. The policy recommendation derives from Hinnosaar et al. (2021), an RCT demonstrating that minor improvements to municipalities&amp;rsquo; Wikipedia pages (adding photographs, local festival information, touristic landmark details) increased overnight tourist stays by 9%. This policy was chosen because it is ideologically neutral, low cost, within local policymakers&amp;rsquo; remit, and its implementation is directly traceable via Wikipedia edit histories.&lt;/p&gt;
&lt;p&gt;Municipalities were randomized into five treatment arms and a control group (approximately 950 municipalities each), stratified by ruling party ideology, population, and touristic accommodation count. Three arms received the same policy brief endorsed by: (1) an ideologically aligned think tank (FAES for right-wing municipalities, Fundación Alternativas for left-wing), (2) the ideologically opposite think tank, or (3) an ideologically nonsalient researcher from the London School of Economics. Two further arms received links to newspaper articles covering the same research from either an ideologically aligned outlet (El Mundo for right, Eldiario.es for left) or an ideologically opposite outlet. The control group received no information. The experiment ran from May to December 2022, with multiple reminder emails sent across the period.&lt;/p&gt;
&lt;p&gt;The main outcome is a binary indicator for whether a municipality&amp;rsquo;s Wikipedia page was changed in line with the recommended guidelines during the study period, coded blind to treatment status by two independent coders.&lt;/p&gt;
&lt;p&gt;Key findings: Pooled across all treatment arms, information provision increased the probability of policy adoption by approximately 0.98 percentage points (a 38% relative increase over the control group baseline), but this effect is only marginally above conventional significance thresholds (p-value = 0.13). The aggregate effect masks sharp heterogeneity by ideological alignment. When the informing institution&amp;rsquo;s ideology aligns with the policymaker&amp;rsquo;s, policy adoption increases by 1.68 percentage points (think tank) and 1.67 percentage points (newspaper) relative to the control group — equivalent to a 66% and 65% relative increase, respectively, both statistically significant at the 5% level. By contrast, information from an ideologically opposite institution produces a coefficient that is negligible and statistically indistinguishable from zero, indicating that misaligned information is no more effective than receiving no information at all. The ideologically nonsalient LSE researcher arm produced an intermediate effect (0.94 percentage points, 37% relative increase), but the p-value (0.27) exceeds conventional thresholds, and the effect is not statistically distinguishable from either the aligned or the control condition. Policy briefs and newspaper articles are equally effective when ideologically aligned (difference of 0.1 percentage points, p-value = 0.82).&lt;/p&gt;
&lt;p&gt;To decompose mechanisms, the authors propose a three-stage framework: (1) selective exposure to information, (2) belief updating, and (3) policy implementation. Email click-through rates (access to the full policy brief or article once the informing institution is revealed) do not differ significantly across treatment arms, ruling out selective exposure as the operative mechanism. A post-intervention online survey experiment with 1,600 policymakers from 1,196 municipalities shows that those receiving information from an aligned or nonsalient institution updated their beliefs about policy effectiveness significantly more than those receiving information from an opposite institution, implicating belief updating as one operative channel. However, comparing the survey experiment (where nonsalient and aligned treatments produce similar belief updating) with the main experiment (where the aligned arm adopts at nearly twice the rate of the nonsalient arm, though not statistically distinguishable) suggests that ideological alignment also affects the third stage — policy implementation — beyond mere belief updating.&lt;/p&gt;
&lt;p&gt;The estimated monetary cost of ideological misalignment is 2,192 euros per municipality per year, calculated using the impact of Wikipedia changes on touristic revenues from Hinnosaar et al. (2021).&lt;/p&gt;
&lt;p&gt;Scope conditions: The context is Spanish local government, a policy that is explicitly non-ideological, low-cost, and easily implemented. Generalizability to ideologically charged or costly policies is not established. Left-wing municipalities show larger responses to aligned information, though this heterogeneity is not statistically significant at conventional levels.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline rate of policy adoption in the control group, and what does the aligned-institution treatment achieve in absolute terms?&lt;/p&gt;
&lt;p&gt;A: The paper reports that ideologically aligned institutions increase the share of municipalities implementing recommended Wikipedia changes by 1.68 percentage points (think tank) and 1.67 percentage points (newspaper) relative to the control group. Working backward from the stated 66% and 65% relative increases, this implies a control group baseline of approximately 2.5 percentage points. The aligned effects are statistically significant at the 5% level.&lt;/p&gt;
&lt;p&gt;Q: Does information from an ideologically opposite institution have any effect on policy adoption?&lt;/p&gt;
&lt;p&gt;A: No. The coefficient for opposite-ideology treatment arms is negligible in magnitude, closely resembling the near-zero coefficients from the placebo analysis conducted for the same months in 2019 (pre-intervention). The authors conclude that receiving information from an ideologically opposite institution is statistically indistinguishable from receiving no information at all. This null result is consistent across heterogeneity analyses by mayor ideology, municipality population, Wikipedia page length, and party type.&lt;/p&gt;
&lt;p&gt;Q: How does the ideologically nonsalient (LSE researcher) treatment compare to aligned and opposite arms?&lt;/p&gt;
&lt;p&gt;A: The nonsalient arm increases policy adoption by 0.94 percentage points (a 37% relative increase), approximately half the effect of the aligned arm (1.68 percentage points). However, the p-value is 0.27, and the effect is not statistically different from either the aligned arm (p-value = 0.34) or the control group at conventional confidence levels. The result should therefore be interpreted with caution.&lt;/p&gt;
&lt;p&gt;Q: Are policy briefs or newspaper articles more effective in promoting policy adoption?&lt;/p&gt;
&lt;p&gt;A: Neither format is significantly more effective than the other. Conditional on ideological alignment, the difference between policy brief and newspaper article effects is 0.1 percentage points with a p-value of 0.82. Both are equally effective when ideologically aligned with the receiving policymaker, a finding the authors describe as a novel contribution to the policy communication literature.&lt;/p&gt;
&lt;p&gt;Q: Does ideological alignment affect whether policymakers choose to access the full information (selective exposure)?&lt;/p&gt;
&lt;p&gt;A: No. Click-through rates on the links to policy briefs or newspaper articles — measured after policymakers have seen the informing institution&amp;rsquo;s identity — do not differ significantly across treatment arms. The observed average click-through rate is 6.42%. This null result is consistent with the hypothesis that policymakers do not strategically filter information acquisition based on the messenger&amp;rsquo;s ideology, at least for non-ideological policies.&lt;/p&gt;
&lt;p&gt;Q: What does the survey experiment reveal about belief updating?&lt;/p&gt;
&lt;p&gt;A: In the post-intervention survey experiment with 1,600 policymakers, participants first reported beliefs about a purportedly beneficial (but actually harmful) policy, then were randomly assigned to receive information about its negative effects from an aligned, opposite, or nonsalient think tank. Those receiving information from an aligned or nonsalient institution updated their beliefs significantly more than those receiving information from an ideologically opposite institution. This implicates belief updating — not just selective exposure — as a channel through which ideological alignment affects policy adoption.&lt;/p&gt;
&lt;p&gt;Q: Why do the authors conclude that ideological alignment also affects the third stage (policy implementation) beyond belief updating?&lt;/p&gt;
&lt;p&gt;A: In the survey experiment, aligned and nonsalient institutions produce statistically similar belief updating. Yet in the main field experiment, the aligned arm adopts policy at nearly twice the rate of the nonsalient arm (1.68 vs. 0.94 percentage points), although this difference is not statistically significant. The authors interpret this gap as suggestive evidence that ideological alignment affects policy implementation through channels beyond belief updating — such as career concerns, party cues, or the political economy of implementation — though they acknowledge the evidence is indirect and the treatment difference is not statistically distinguishable.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated economic cost of ideological misalignment?&lt;/p&gt;
&lt;p&gt;A: The authors estimate a cost of 2,192 euros per municipality per year attributable to ideological misalignment between the informing institution and the receiving policymaker. This calculation uses the estimated impact of Wikipedia changes on touristic revenues from Hinnosaar et al. (2021) and reflects not the cost of not implementing the policy, but the marginal cost of using an ideologically opposite rather than aligned institution to disseminate the research evidence.&lt;/p&gt;
&lt;p&gt;Q: How did outside researchers&amp;rsquo; predictions compare to actual results?&lt;/p&gt;
&lt;p&gt;A: Researchers surveyed on the Social Science Prediction Platform correctly anticipated the rank ordering of treatment effectiveness (aligned &amp;gt; nonsalient &amp;gt; opposite &amp;gt; control) but substantially overestimated adoption rates in every arm. They predicted relative increases of 144%, 103%, and 48% for aligned, nonsalient, and opposite conditions respectively, compared to actual relative increases of roughly 65%, 37%, and ~0%. Email opening rates were the most accurately predicted (49% predicted vs. 38% actual). The results highlight the difficulty of translating evidence into policy even for simple, low-cost interventions.&lt;/p&gt;
&lt;p&gt;Q: What are the main threats to validity and how are they addressed?&lt;/p&gt;
&lt;p&gt;A: Three main threats are considered. First, differential email opening rates across treatment arms: addressed by showing the informing institution was revealed only after email opening, and confirmed by finding no significant differences in opening rates across groups. Second, spillovers between municipalities: the endline survey shows only 5 of 236 control-group respondents reported receiving any information from external sources; spillover distance analyses in Table D.II find no significant effect on control municipalities&amp;rsquo; adoption rates. Third, contamination bias in multi-arm RCTs with strata fixed effects: addressed by replicating main results using the Goldsmith-Pinkham et al. (2022) method, yielding nearly identical estimates.&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is observed across left- and right-wing municipalities?&lt;/p&gt;
&lt;p&gt;A: The positive effect of receiving information from an ideologically aligned institution appears larger for left-wing municipalities, with coefficients approximately three times larger than for right-wing municipalities, but this difference is not statistically significant at conventional confidence levels. The authors caution that the strength of ideological alignment may differ systematically between the partner think tanks on the left and right, making direct comparisons between left- and right-wing effects difficult to interpret cleanly.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to prior work on evidence-based policymaking?&lt;/p&gt;
&lt;p&gt;A: The closest prior work is Hjort et al. (2021) and Mehmood et al. (2024), which examine the impact of scientific evidence access on actual policy adoption, and DellaVigna and Kim (2022), which identifies ideology as a factor in the diffusion of innovative policies across governments. The present paper&amp;rsquo;s main contribution is being the first to isolate the causal effect of ideological alignment on policy adoption using a large-scale field experiment with real, authoritative ideological institutions — rather than surveys or hypothetical scenarios — while using a non-ideological policy recommendation to avoid confounding messenger ideology with policy ideology.&lt;/p&gt;
&lt;p&gt;Ideological alignment: In this paper&amp;rsquo;s usage, the congruence between the political ideology of the institution disseminating research evidence (think tank or newspaper) and the political ideology of the local government receiving that information. Alignment is operationalized by matching right-wing municipalities with right-leaning institutions (FAES, El Mundo) and left-wing municipalities with left-leaning institutions (Fundación Alternativas, Eldiario.es).&lt;/p&gt;
&lt;p&gt;Evidence-based policy adoption: The actual implementation by local policymakers of a policy recommendation derived from published peer-reviewed research — measured here as whether a municipality&amp;rsquo;s Wikipedia page was edited in line with specific recommended guidelines during the study period, not merely expressed intention or stated support.&lt;/p&gt;
&lt;p&gt;Knowledge brokers: Institutions, such as think tanks, that serve as intermediaries between academic researchers and policymakers, translating and disseminating research findings in accessible formats (policy briefs) to bridge the gap between evidence and policy.&lt;/p&gt;
&lt;p&gt;Nonsalient ideology: A condition in which the informing institution carries no salient or recognizable partisan affiliation, operationalized here by a foreign research university professor (LSE) whose institutional identity does not carry a clear left-right signal in the Spanish political context.&lt;/p&gt;
&lt;p&gt;Three-stage policy adoption framework: The authors&amp;rsquo; conceptual structure positing that ideology can interfere at three sequential stages: (1) selective exposure — whether policymakers choose to access information once the messenger&amp;rsquo;s ideology is revealed; (2) belief updating — whether policymakers revise their assessment of a policy&amp;rsquo;s effectiveness upon receiving evidence; and (3) policy implementation — whether policymakers act on updated beliefs to adopt the policy.&lt;/p&gt;
&lt;p&gt;Selective exposure: The tendency of individuals to avoid information from sources whose ideology conflicts with their own prior beliefs; in this paper, operationalized as differential click-through rates on links to policy briefs or news articles after the informing institution&amp;rsquo;s identity is revealed.&lt;/p&gt;
&lt;p&gt;Motivated reasoning: A documented tendency, also observed in policymakers, to reject or discount evidence that contradicts ideologically held prior beliefs — the mechanism proposed to explain why opposite-ideology information fails to update beliefs as effectively as aligned-ideology information.&lt;/p&gt;</description></item><item><title>Inequality and asset prices during Sudden Stops</title><link>https://macropaperwarehouse.com/papers/inequality-and-asset-prices-during-sudden-stops/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/inequality-and-asset-prices-during-sudden-stops/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the cross-sectional dimension of Fisher&amp;rsquo;s (1933) debt-deflation mechanism as it operates during Sudden Stop crises — episodes characterized by large, abrupt reversals in the current account. The central question is how the distribution of wealth and leverage across households shapes the macroeconomic dynamics of financial crises, and whether greater inequality makes Sudden Stops more or less severe.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses panel microdata from the Mexican Family Life Survey (MxFLS) across three waves (2002, 2005, 2009), covering a representative sample of approximately 8,400 households in 150 localities. The 2009 wave captures a Sudden Stop in which Mexico&amp;rsquo;s current account reversed by 1.5 percentage points of GDP, per capita consumption fell 7 percent, and housing prices fell 4 percent below pre-crisis trend by 2010. Households are sorted by net wealth and leverage ratio — defined as total debt divided by total assets — to identify how balance sheet heterogeneity drove differentiated asset-holding dynamics during the crisis.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a Bewley small open economy model with heterogeneous agents, incomplete markets, aggregate risk (simultaneous shocks to the international interest rate and total factor productivity), and an occasionally-binding loan-to-value (LtV) collateral constraint. Households hold two assets: a one-period risk-free international bond and a risky domestic collateralizable asset (land). Households face persistent non-insurable idiosyncratic risk in both labor income and dividend returns; the latter creates an endogenous risk-wealth tradeoff, since larger asset holdings raise future income volatility while simultaneously expanding debt capacity. The model is calibrated to Mexican data — matching the leverage ratio distribution in 2005 (10 percent of households financially constrained) and a net foreign asset position of −35 percent of GDP — and solved using the FiPIt algorithm combined with the Krusell-Smith stochastic-simulation approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical evidence from Mexico&amp;rsquo;s 2009 crisis reveals sharply divergent asset dynamics across the household balance sheet distribution. Wealthy households (top net-wealth decile) with low leverage increased their real estate holdings by 61.4 percent (annualized, relative to the average) between 2005 and 2009, consistent with a crisis-dampening effect whereby unconstrained agents absorb fire-sales. Wealthy households in the top decile of both net wealth and leverage ratio — financially constrained — reduced their real estate holdings by 36.6 percent, consistent with a crisis-amplifying effect. Cross-country descriptive evidence shows that Sudden Stop episodes are associated with significantly larger contractions in consumption and GDP in more unequal economies (Gini index, World Bank data, 58 Sudden Stop episodes identified by Bianchi and Mendoza 2020).&lt;/p&gt;
&lt;p&gt;In the calibrated model, the crisis-dampening effect dominates relative to the representative agent baseline: the heterogeneous-agents economy produces a smaller decline in asset prices (−0.99 percent vs. −2.57 percent in the representative agent model during crisis episodes), but a larger and more persistent consumption decline (−2.97 percent vs. −1.17 percent) and current account reversals (1.56 percentage points vs. 0.09 percentage points). The wealth Gini index generated by the calibrated model is 0.61, close to the untargeted 2005 Mexican estimate of 0.73. The aggregate equity premium generated is 5.1 percent, close to the data estimate of 6.5 percent; of this, 55.3 percent is attributable to the risk component, 35.9 percent to the persistence effect, and 8.6 percent to the constraint effect.&lt;/p&gt;
&lt;p&gt;When comparing the baseline emerging economy (wealth Gini 0.61) to an advanced economy calibration in which idiosyncratic dividend risk is set to zero (wealth Gini 0.29), crises are milder and less frequent in the more equal economy: consumption drops 1.0 percentage point less, asset prices drop 0.2 percentage points less, and the net foreign debt position is 6.2 percentage points larger relative to GDP. The implied slope coefficient from the model relating consumption declines during Sudden Stops to the income Gini (−11.1) closely matches the cross-country empirical estimate (−11.5). An economy with an income Gini index 0.10 points lower experiences a decline in consumption 1.1 percentage points smaller during a crisis.&lt;/p&gt;
&lt;p&gt;An impulse response to a two-standard-deviation aggregate shock confirms that, conditional on starting from a perfectly equal (symmetric) initial distribution via complete redistribution, declines in consumption and asset prices are approximately 0.5 percentage points smaller than in the baseline economy with the stationary ergodic distribution as initial condition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Redistributive Dividend Tax&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A flat 30 percent dividend income tax, redistributed as lump-sum transfers, reduces Sudden Stop severity by lowering average asset prices by 9.6 percent relative to the benchmark, which shrinks effective debt capacity and limits bond adjustment during crises. The average current account reversal during a crisis falls by 0.54 percentage points, and aggregate consumption falls by 0.63 percentage points less than in the benchmark. Crisis probability under the benchmark threshold falls from 4.3 to 1.83 percent (less than half). Average welfare improves by a gain equivalent to 2.8 percent of consumption. However, 26.7 percent of households — those more leveraged and three times wealthier than the beneficiaries — experience welfare losses averaging 6.8 percent of consumption, due to asset price declines and tighter financial conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overall Conclusion&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Both the empirical evidence and the model suggest that economies with lower inequality, whether due to reduced idiosyncratic risk (as in advanced versus emerging economy calibrations) or wealth redistribution across agents with identical idiosyncratic risk processes, experience less severe Sudden Stop crises.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two cross-sectional channels through which household heterogeneity affects the debt-deflation mechanism, and in which direction do they move asset prices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The dampening effect operates when unconstrained wealthy households — who hold diversified portfolios and have precautionary savings in bonds — purchase fire-sold assets from constrained households, relieving downward pressure on asset prices. The amplifying effect operates when highly leveraged households, once pushed into binding credit constraints by declining asset prices, must further liquidate asset positions, deepening the price decline and tightening the collateral constraint for additional households via the pecuniary externality. These two effects move in opposite directions, so the net effect of inequality on crisis severity is theoretically ambiguous and depends on calibration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What specific empirical evidence from Mexico&amp;rsquo;s 2009 Sudden Stop supports both cross-sectional effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Using MxFLS microdata, Table 1 in the paper shows that wealthy households (top net-wealth decile) with low leverage (deciles I–VII of leverage) increased their real estate holdings by 61.4 percent between 2005 and 2009 — evidence for the dampening effect. Wealthy households in the top decile of both net wealth and leverage reduced their real estate holdings by 36.6 percent — evidence for the amplifying effect. Between 2005 and 2009, the share of financially constrained households (leverage ratio above 0.168, the 90th percentile) increased by 1.7 percentage points, while the share of financial savers dropped by 5.0 percentage points. The pre-crisis period (2002–2005) shows no comparable divergence, ruling out a mechanical mean-reversion explanation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the risk-wealth tradeoff, and why is it central to generating a realistic wealth and leverage distribution in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: The risk-wealth tradeoff arises because idiosyncratic dividend risk is endogenous to asset holdings: holding more risky domestic assets increases debt capacity (relaxing borrowing constraints) but also raises future income volatility, since the variance of household flow income is convex in asset holdings. For households earning high dividend realizations, there exists a threshold beyond which precautionary savings motives — driven by rising income risk — dominate the benefit from expanded debt capacity, causing these households to begin accumulating bonds and eventually become net savers. This mechanism generates an empirically plausible distribution in which some households are financially constrained at the LtV limit, others are unconstrained borrowers, and a fraction are net savers holding both domestic assets and positive international bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the model calibration match the stationary distribution of Mexican households?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: Three parameters governing the dividend income risk process (average dividend yield, autocorrelation, and standard deviation) are jointly calibrated to match three statistics from the MxFLS 2005 distribution of households: 14.1 percent financial savers (data: 14.2 percent), 75.9 percent unconstrained indebted (data: 75.8 percent), and 10.0 percent financially constrained (data: 10.0 percent). The collateral fraction κ = 0.168 is set equal to the 90th percentile of the leverage ratio distribution in 2005, reflecting that the average delinquency rate for commercial bank household credit was 10.3 percent between 2004 and 2008. The discount factor β = 0.90 matches the average net foreign asset position relative to GDP of −35 percent for Mexico.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the heterogeneous-agents model compare to the representative agent model in terms of crisis dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: In the heterogeneous-agents benchmark, the average current account reversal during a Sudden Stop is 1.56 percentage points, consumption falls 2.97 percent, and asset prices fall 0.99 percent below the steady state. In the representative agent model with the same average leverage ratio (κ = 0.12), the current account reversal is only 0.09 percentage points, consumption falls 1.17 percent, and asset prices fall 2.57 percent. The crisis-dampening effect in the heterogeneous economy produces a smaller asset price drop but a larger consumption decline, because leveraged households must make larger consumption adjustments when hit by negative idiosyncratic shocks in addition to the aggregate shock. Impulse response analysis shows the heterogeneous-agents economy generates current account reversals 1.9 percentage points larger than the representative agent, and consumption responses approximately four times larger.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the mechanism by which comparing emerging and advanced economy calibrations shows that lower inequality leads to less severe crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The advanced economy calibration sets idiosyncratic dividend risk to zero, eliminating the risk-wealth tradeoff and resulting in a wealth Gini of 0.29 (compared to 0.61 in the baseline). Without dividend risk, households have weaker incentives to accumulate assets as a precautionary buffer against income volatility, so they hold less debt on average and the long-run net foreign debt relative to GDP is 6.2 percentage points larger (i.e., less debt). During a Sudden Stop under this calibration, consumption drops 1.0 percentage point less, asset prices drop 0.2 percentage points less, and the economy is less frequently in crisis. The model-implied slope of consumption decline on income Gini is −11.1, matching the cross-country empirical estimate of −11.5.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the impulse response analysis reveal about the effect of wealth redistribution on crisis severity, holding idiosyncratic risk constant?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: The impulse response analysis compares the baseline heterogeneous-agents economy (with the stationary ergodic distribution as the initial condition) against a version in which all households are given a perfectly symmetric initial distribution — identical bond and asset holdings equal to long-run averages — while retaining the same idiosyncratic risk processes. The symmetric initial condition corresponds to a complete redistribution of wealth without changing fundamentals. In the first three periods after a two-standard-deviation aggregate shock, the symmetric economy shows declines in consumption and asset prices approximately 0.5 percentage points smaller than the baseline. This demonstrates that even holding the risk environment constant, reducing wealth dispersion mitigates crisis severity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the equity premium decomposition work in the heterogeneous-agents model, and which components are quantitatively most important?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The aggregate equity premium is decomposed into five components (Equation 7 in the paper): a constraint effect (positive, increasing in the measure and intensity of constrained households), a risk effect (positive, from the negative covariance between the individual stochastic discount factor and individual equity return, weighted more heavily on constrained households), a persistence effect (positive, from the covariance between idiosyncratic dividend return and asset holdings, since high-dividend households accumulate more assets), a trading cost effect (approximately zero in aggregate), and a no-short-sales effect (negative, since households at the short-sales constraint add to asset demand without increasing the marginal benefit of saving). In the calibrated model, the equity premium is 5.1 percent; the risk effect accounts for 55.3 percent, the persistence effect for 35.9 percent, and the constraint effect for 8.6 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the mechanism by which the dividend income tax reduces crisis severity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: A flat 30 percent dividend income tax lowers average after-tax dividend returns, reducing households&amp;rsquo; incentive for precautionary accumulation of domestic assets and weakening the risk-wealth tradeoff. As a result, households demand fewer domestic assets and fewer international bonds in normal times. The reduced demand for the domestic asset lowers the equilibrium asset price by 9.6 percent on average relative to the benchmark, which — through the pecuniary externality embedded in the LtV constraint — tightens borrowing constraints, raising the share of financially constrained households from 5.6 to 7.8 percent. Nevertheless, the reduction in equilibrium debt positions means that during a crisis, bond adjustments and consumption drops are more limited: the average current account reversal during crises falls by 0.54 percentage points, and aggregate consumption falls by 0.63 percentage points less than in the benchmark. Crisis probability under the benchmark threshold falls from 4.3 to 1.83 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Who benefits and who loses from the dividend income tax, and by how much?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: Among the simulated population, 73.3 percent of households experience welfare gains averaging 6.2 percent of consumption in consumption-equivalent terms, while 26.7 percent experience welfare losses averaging 6.8 percent of consumption. The average welfare gain across all households is equivalent to 2.8 percent of consumption. The households experiencing losses are more leveraged and three times wealthier on average than those that benefit; the policy reduces their net worth through lower asset prices and tightens their financial constraints. The welfare analysis accounts for the transition to the new tax policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Why does the representative agent model miss the cross-sectional effects that are central to the paper&amp;rsquo;s mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: In the representative agent model, all households behave identically and either collectively want to buy or sell assets, but since there is no one to trade with domestically, actual asset holdings remain unchanged by cross-sectional forces. Additionally, the average debt constraint multiplier in the representative agent equals the single household&amp;rsquo;s multiplier, whereas in the heterogeneous model a small fraction of highly constrained households can have much larger individual multipliers, amplifying the aggregate debt-deflation effect. In the calibrated stationary model, 10 percent of constrained households own 7.7 percent of assets and have a consumption share of 9.0 percent, while 75.9 percent of unconstrained indebted households hold 88.1 percent of assets with a consumption share of 78.1 percent — distributional features invisible to a representative agent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What robustness does the model validation provide for the quantitative results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The model reproduces the untargeted net wealth and asset distributions across deciles from MxFLS 2005 closely, with slight underestimation at the top deciles; the exception is the bottom decile of debt (where the model cannot generate households with negative net wealth since default is not modeled). The aggregate law of motion for the Krusell-Smith algorithm fits with R² = 0.99 for bond position and R² = 0.93 for asset price, and Den Haan (2010) accuracy checks show maximum forecast errors of 2.8 (current account) and 1.1 (asset price). The model replicates the untargeted magnitude of current account reversals observed in Mexican Sudden Stops. The wealth Gini of 0.61 is close to the untargeted 2005 Mexican estimate of 0.73, and the equity premium of 5.1 percent is close to the data estimate of 6.5 percent.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Sudden Stop&lt;/strong&gt;: An episode characterized by a large, abrupt reversal in the current account, typically triggered by a sudden halt in foreign capital inflows. In this paper, Sudden Stops are modeled as endogenous crises that arise from the interaction of a negative aggregate shock (simultaneous rise in the international interest rate and decline in total factor productivity) with an occasionally-binding LtV collateral constraint. The paper follows Bianchi and Mendoza (2020) in identifying 58 such episodes over the past four decades.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt-deflation mechanism (cross-sectional dimension)&lt;/strong&gt;: The paper studies Fisher&amp;rsquo;s (1933) debt-deflation spiral — in which declining asset prices tighten credit constraints, forcing further asset sales, further depressing prices — through the lens of household heterogeneity. The cross-sectional dimension refers to the fact that different households (wealthy unconstrained vs. highly leveraged constrained) respond differently to price declines, generating two opposing effects: dampening (wealthy buyers absorb fire-sales) and amplifying (constrained households fire-sell additional assets).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-wealth tradeoff&lt;/strong&gt;: A novel feature of the model in which holding more risky domestic assets simultaneously (a) expands debt capacity by relaxing the LtV constraint and (b) increases future income volatility through higher exposure to idiosyncratic dividend risk, since the variance of household flow income is convex in asset holdings. This tradeoff generates the endogenous transition of households from indebted to net-saver status and gives rise to the empirically plausible distribution of savers, unconstrained borrowers, and constrained households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Loan-to-value (LtV) collateral constraint&lt;/strong&gt;: A borrowing limit requiring that households&amp;rsquo; international debt (negative bond holdings) cannot exceed a fixed fraction κ of the market value of their domestic asset holdings. In the paper, κ = 0.168 (the 90th percentile of the Mexican leverage ratio distribution in 2005). The constraint is occasionally binding and generates a pecuniary externality: households fail to internalize that their individual portfolio choices affect the aggregate asset price, which in turn determines the borrowing limits of all other households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pecuniary externality&lt;/strong&gt;: The externality arising from the LtV constraint in which each household&amp;rsquo;s choice of asset holdings affects the equilibrium asset price, thereby changing the borrowing limits of all households simultaneously. This externality drives the debt-deflation spiral and is the source of Sudden Stop crises in the model: no single household internalizes the aggregate impact of its fire-sales on credit conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fire-sale&lt;/strong&gt;: In the context of this paper, the forced liquidation of domestic asset holdings by financially constrained households during a crisis. Fire-sales are triggered when the LtV constraint becomes binding, forcing households to sell assets to reduce debt; the resulting price decline tightens the constraint further, producing additional fire-sales. The paper documents that, during Mexico&amp;rsquo;s 2009 Sudden Stop, wealthy constrained households (top decile of both net wealth and leverage) reduced real estate holdings by 36.6 percent, while wealthy unconstrained households increased holdings by 61.4 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dampening and amplifying effects&lt;/strong&gt;: Two opposing cross-sectional effects on asset prices during a crisis. The dampening effect: unconstrained wealthy households purchase depressed assets fire-sold by constrained households, relieving downward pressure on prices and weakening the debt-deflation spiral. The amplifying effect: highly leveraged households that are pushed into binding constraints by falling prices must also fire-sell assets, further depressing prices and tightening financial conditions. The net impact on crisis severity depends on which effect dominates, which the paper establishes empirically and quantitatively is inequality-dependent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equity premium decomposition&lt;/strong&gt;: A decomposition derived in the paper (Equation 7) that expresses the aggregate excess return on the risky domestic asset as the sum of five components: a constraint effect (positive, from the measure and intensity of binding LtV constraints), a risk effect (positive, from the covariance of individual stochastic discount factors with individual equity returns), a persistence effect (positive, from the covariance of idiosyncratic dividend returns with asset holdings due to return persistence), a trading cost effect (approximately zero in aggregate), and a no-short-sales effect (negative). In the calibrated model, the risk and persistence effects account for 91 percent of the 5.1 percent equity premium.&lt;/p&gt;</description></item><item><title>Inference Based on Time-Varying SVARs Identified with Sign Restrictions</title><link>https://macropaperwarehouse.com/papers/inference-based-on-time-varying-svars-identified-with-sign-restrictions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/inference-based-on-time-varying-svars-identified-with-sign-restrictions/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; The paper asks how to conduct valid Bayesian inference in time-varying structural vector autoregressions (SVARs) identified with sign restrictions, a setting in which existing algorithms are shown to be theoretically flawed. As an empirical illustration, the authors use the new framework to examine three questions about the 2022–2023 Federal Reserve tightening cycle: (i) how did the Fed respond to the state of the economy; (ii) how would more dovish or hawkish stances have fared; and (iii) was the Fed behind the curve in 2021, and at what cost?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper defines a class of rotation-invariant time-varying SVARs, building on Bognanni (2018). A model belongs to this class when its prior over sequences of structural parameters is invariant to orthogonal transformations of those sequences—i.e., it assigns equal prior density to all observationally equivalent structural parameter sequences (Proposition 1 establishes that observational equivalence corresponds exactly to orthogonal rotation of the sequence). The authors prove an if-and-only-if characterization (Proposition 2): a prior belongs to this class if and only if the induced prior over sequences of orthogonal matrices is uniform and independent of the time-varying reduced-form parameters.&lt;/p&gt;
&lt;p&gt;A specific member of this class, the Random Correlations SVAR (RC-SVAR), is constructed by combining a prior over time-varying reduced-form parameters based on Archakov and Hansen&amp;rsquo;s (2021) parametrization of correlation matrices with a uniform prior over sequences of orthogonal matrices. The RC-SVAR is preferred over alternatives (Primiceri 2005&amp;rsquo;s decomposition, which is order-dependent; Bognanni&amp;rsquo;s 2018 discounted Wishart model, whose marginal likelihood significantly underperforms) because, for the type of empirical applications considered, it generally implies a higher log-predictive score than most orderings of the Primiceri (2005) model.&lt;/p&gt;
&lt;p&gt;The authors introduce three algorithms. Algorithm 1 (simple acceptance sampling) is theoretically correct but computationally infeasible when sign restrictions span many periods because the probability of satisfying all restrictions simultaneously converges to zero as sample length T grows. Algorithm 2, the current approach in the literature (Baumeister and Peersman 2013; Bognanni 2018; Debortoli, Galí and Gambetti 2020), draws orthogonal matrices period-by-period from the sign-restriction-truncated uniform distribution; the authors show this does not draw from the correct target posterior because the resulting prior over orthogonal matrices is not independent of the reduced-form parameters and therefore the prior does not satisfy the rotation-invariance condition. Algorithm 3, the paper&amp;rsquo;s contribution, uses a Gibbs sampler that incorporates the Particle Gibbs with Ancestor Sampling (PGAS) method of Lindsten, Jordan and Schon (2014) to draw sequentially from the correct target posterior conditional on sign restrictions over an arbitrary number of periods.&lt;/p&gt;
&lt;p&gt;An important additional contribution is the allowance for time-varying sign restrictions—restrictions that are imposed only in selected periods—enabling researchers to tailor identification to institutional knowledge about when particular restrictions are economically appropriate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Empirical Application.&lt;/strong&gt; The RC-SVAR is estimated at a quarterly frequency with five variables: output growth (log difference of real GDP), core inflation (log difference of core PCE price index), the federal funds rate, money growth (log difference of M2), and the Moody&amp;rsquo;s Baa corporate bond yield relative to the 10-year Treasury yield (credit spread). The sample runs from 1959:Q1 to 2023:Q2, with a constant and two lags (n=5, p=2, m=11). Four independent MCMC chains of 20,000 draws are used, keeping every tenth draw after discarding the first 2,500; 1,800 particles approximate the reduced-form posterior and 3,600 particles approximate the posterior of the orthogonal matrices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt; Decomposing the unexpected change in the federal funds rate from 2022:Q2 to 2023:Q2 into contributions from the predictable component, the systematic monetary policy response to non-monetary-policy shocks, and pure monetary policy shocks, the authors find that the lion&amp;rsquo;s share of the unpredictable rate increase was a systematic response to non-monetary policy shocks. Monetary policy shocks contributed about 100 basis points of the unexpected change in the federal funds rate by 2023:Q2 (out of roughly 4.99 percentage points of cumulative actual funds rate).&lt;/p&gt;
&lt;p&gt;In the Dovish Fed counterfactual—where the response of the federal funds rate to contemporaneous inflation is halved for the first quarter of 2022—the economy would have marginally overheated, with inflation running persistently above 5 percent. In the Hawkish Fed counterfactual—where the response to inflation is doubled—inflation would have quickly declined at a small output cost: focusing on posterior medians, real GDP in 2023:Q2 would have been about 0.7 percent lower than in the data, though the lower envelope of the 68 percent probability bands indicates the output cost could have been as large as 3.1 percent.&lt;/p&gt;
&lt;p&gt;Regarding the &amp;ldquo;behind the curve&amp;rdquo; question, the model finds evidence that the Fed was accommodative in 2021 (expansionary monetary policy shocks in that period), consistent with Summers (2021b). However, monetary policy shocks contributed only about 0.6 percentage points to annualized core inflation during 2021:Q2–2021:Q4 on a cumulative basis; the larger and dominant source of the unexpected inflation surge was non-monetary policy shocks. A comparison of the RC-SVAR with a constant-parameter SVAR identified only by Restriction 1 (Uhlig 2005) shows substantively different conclusions: the constant-parameter model attributes the unexpected increase in the federal funds rate to shocks that affect money growth and credit spreads, without a clear connection to the real economy, whereas the RC-SVAR links the rate increases to shocks that made the economy run hotter.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the fundamental theoretical flaw in existing algorithms for time-varying SVARs identified with sign restrictions, and why does it matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Existing algorithms (e.g., Baumeister and Peersman 2013; Bognanni 2018; Debortoli, Galí and Gambetti 2020) draw orthogonal matrices period-by-period from the uniform distribution restricted to those matrices satisfying the sign restrictions at each t. This construction implicitly defines a marginal density for the orthogonal matrices conditional on the reduced-form parameters that is not uniform: it is proportional to the reciprocal of the volume of the sign-restriction-satisfying subset of the orthogonal group, which depends on the reduced-form parameters. Consequently, the prior over structural parameters implied by these algorithms does not assign equal density to observationally equivalent sequences of structural parameters, violating Proposition 2&amp;rsquo;s necessary and sufficient condition. The resulting posteriors are therefore not correctly targeted to the desired posterior, meaning inference is distorted in a way that cannot be corrected by importance reweighting without prohibitive computation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What does Proposition 1 establish, and how does it generalize the constant-parameter case?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 1 proves that two sequences of time-varying structural parameters are observationally equivalent if and only if there exists a sequence of orthogonal matrices such that one sequence is obtained from the other by post-multiplying each period&amp;rsquo;s structural parameters by the corresponding orthogonal matrix. This directly mirrors the constant-parameter result in Rubio-Ramírez, Waggoner and Zha (2010) and Uhlig (2005), where a single orthogonal matrix produces observational equivalence. The extension to sequences is non-trivial because the law of motion couples parameter draws across time, but the likelihood&amp;rsquo;s separability across periods preserves the period-by-period orthogonal rotation structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is Proposition 2, and what is its practical implication for constructing valid priors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 2 states that the prior over time-varying structural parameters satisfies the rotation-invariance condition (Equation 3) if and only if the induced prior over the time-varying orthogonal reduced-form parameters does not depend on the sequence of orthogonal matrices—equivalently, the prior over (Qt) is uniform over the product of orthogonal groups and is independent of the reduced-form parameters (Bt, Σt). The practical implication is constructive: any prior over time-varying reduced-form parameters (Bt, Σt), combined with an independent uniform prior over sequences of orthogonal matrices, automatically produces a rotation-invariant SVAR. This means that widely-used priors for reduced-form time-varying VARs (Primiceri 2005, Bognanni 2018, the new RC prior) can all be adapted for structural analysis without modification, as long as the orthogonal matrices are drawn uniformly and independently of the reduced-form parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Why do models with heteroskedastic structural shocks (identification via heteroskedasticity) not belong to the class of rotation-invariant SVARs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In models identified through heteroskedasticity, the time-varying structural parameters take the form (A Ψt^{-1/2}, F Ψt^{-1/2}), where Ψt is a time-varying diagonal matrix. For any permissible sequence, post-multiplying by a non-diagonal orthogonal matrix at one period produces a sequence where the ratio of structural parameters across consecutive periods is not diagonal, which violates the permissibility constraint of those models. Thus, the class of rotation-invariant SVARs and models identified through heteroskedasticity are mutually exclusive when the heteroskedastic specification has constant impulse responses up to scale—a restriction that the authors note has been criticized as a potential weakness of the heteroskedasticity-based approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why is the Random Correlations SVAR (RC-SVAR) chosen as the baseline, and how does it compare to alternatives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The RC-SVAR uses the Archakov and Hansen (2021) parametrization of correlation matrices to define a prior over time-varying reduced-form parameters that is order-invariant (unlike Primiceri 2005, which produces n! different elements depending on variable ordering) and avoids the highly restrictive structure of Bognanni&amp;rsquo;s (2018) discounted Wishart model, which significantly underperforms in marginal likelihood. For the empirical applications considered, Arias, Rubio-Ramírez and Shin (2023) show the RC-SVAR generally achieves a higher log-predictive score than most orderings of the Primiceri (2005) model, motivating its use as the baseline. The theoretical results apply to any member of the rotation-invariant class, so the algorithm is not specific to the RC-SVAR.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why are time-varying sign restrictions important, and how are they implemented in the monetary policy application?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Time-varying sign restrictions allow researchers to impose identification restrictions only in periods where those restrictions are economically appropriate, adhering to the principle &amp;ldquo;If you know it, impose it; if you do not know it, do not impose it&amp;rdquo; (Uhlig 2017). In the monetary policy application, Restriction 2 (which constrains the contemporaneous elasticities in the policy rule to plausible ranges, following Arias, Caldara and Rubio-Ramírez 2019) is not imposed during three exceptional periods: 1979:Q4–1982:Q4 (non-borrowed reserves targeting under Volcker), 2009:Q1–2015:Q3 (quantitative easing following the Great Recession), and 2020:Q2–2021:Q4 (QE and effective zero lower bound during COVID-19). Restriction 1 (sign restrictions on impulse responses to a monetary policy shock, following Uhlig 2005) is imposed throughout the entire sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What do the estimated contemporaneous elasticities reveal about how monetary policy has changed over time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model estimates show substantial time variation. The contemporaneous elasticity of the federal funds rate to output growth exhibits three peaks: during Arthur Burns&amp;rsquo;s chairmanship in 1974 (capturing the sharp rate cut during the 1974–1975 recession), during Volcker&amp;rsquo;s chairmanship in 1983–1984 (when annualized real GDP growth averaged 6.8 percent), and during Greenspan&amp;rsquo;s tenure in 2001 (when the federal funds rate fell from 6.4 percent in December 2000 to 1.8 percent by end-2001). Outside these peaks, the elasticity averaged about 0.1, implying a 0.1 percentage point rise in the annualized federal funds rate per 1 percentage point increase in annualized GDP growth. The elasticity to inflation averaged about 0.3 percentage points per 1 percentage point rise in annualized core inflation, with a range from above 0.5 in the early 1970s and early Volcker years down to about 0.15 during Yellen&amp;rsquo;s tenure. The elasticity to the credit spread moved from about −1.4 at the beginning of Burns&amp;rsquo;s tenure to −2.2 at the end of Nixon&amp;rsquo;s presidency, then declined through the mid-1970s to the Great Recession, and stood at about −1 by mid-2023.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the exact decomposition of the 2022–2023 tightening cycle into predictable, systematic non-monetary, and monetary policy shock components?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 1 from the paper shows the federal funds rate decomposition. In 2022:Q2, the predictable component was 0.27 percentage points, the unpredictable component due to systematic response to non-monetary shocks was 0.24 pp, and the unpredictable component due to monetary policy shocks was 0.26 pp, summing to 0.77 pp. By 2023:Q2, these were 1.70 pp (predictable), 2.25 pp (systematic/non-monetary), and 1.04 pp (MP shocks), totaling 4.99 pp. Thus, at the tightening cycle&amp;rsquo;s end in 2023:Q2, the systematic response to non-monetary shocks accounted for about two-thirds of the unpredictable component (2.25 / (2.25 + 1.04) ≈ 68 percent), consistent with the broader literature finding that most variation in policy instruments is driven by the systematic component of policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the Hawkish and Dovish Fed counterfactuals work, and what do they imply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Hawkish (Dovish) counterfactual replaces the estimated contemporaneous response to inflation in the policy rule with one that is twice (half) as large as the estimated response for the first quarter of 2022, then simulates history forward from 2022:Q2 under the modified rule. Under the Dovish Fed, the economy would have marginally overheated with output rising above CBO potential GDP estimates, and inflation would have run persistently above 5 percent. Under the Hawkish Fed, posterior medians show inflation quickly declining at a cost of about 0.7 percent of real GDP in 2023:Q2 relative to the data; the lower envelope of the 68 percent probability bands shows the output cost could have been as large as 3.1 percent. A parallel set of counterfactuals, designed to be robust to the Lucas critique by working through one-time monetary policy shocks rather than changes to the reaction function, yields broadly similar results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the comparison with Romer and Romer (2023a) reveal about the model&amp;rsquo;s monetary policy shock series?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Romer and Romer (2023a) identify a contractionary monetary policy shock in July 2022 (2022:Q3) using a narrative approach. The RC-SVAR&amp;rsquo;s estimated monetary policy shock series is broadly consistent with this finding: the model detects a contractionary shock in 2022:Q3 and, like Romer and Romer, also finds some evidence of a contractionary shock in 2022:Q2 (though they characterized it as &amp;ldquo;signs but not definitive evidence&amp;rdquo;). Beyond the Romer-Romer estimation window, the RC-SVAR additionally finds evidence of an expansionary monetary policy shock in 2023:Q1, when the Fed decelerated the pace of rate increases from 50 to 25 basis points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the RC-SVAR&amp;rsquo;s inference on the 2022–2023 tightening cycle differ from that of a constant-parameter SVAR identified only with Restriction 1?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two salient differences emerge. First, through the lens of the constant-parameter SVAR, monetary policy shocks contribute insignificantly to unexpected output growth between 2022:Q2 and 2023:Q2; in fact, the posterior median output response to a contractionary monetary policy shock is positive in that model (consistent with Uhlig 2005&amp;rsquo;s finding), implying that the positive monetary policy shocks needed to explain the rate increase would propel rather than reduce output. In the RC-SVAR, the posterior median output response to a contractionary shock is negative, so contractionary monetary policy shocks worked to decelerate output against a backdrop of non-monetary shocks that made the economy run hotter. Second, in the constant-parameter SVAR, non-monetary policy shocks that drive the unexpected increase in the federal funds rate do not propagate through output or inflation, whereas in the RC-SVAR they do—yielding a much more coherent macroeconomic narrative for the tightening cycle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What does the model find about whether the Fed was behind the curve in 2021, and what were the consequences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model&amp;rsquo;s 2021:Q1 forecasts predicted the federal funds rate would reach about 0.6 percent by end-2021, consistent with a view that rate normalization was already warranted. The actual federal funds rate remained at its effective lower bound through 2021:Q4, and the shock decomposition shows that the cumulative unexpected change in the funds rate during 2021:Q2–2021:Q4 was driven by expansionary monetary policy shocks—supporting the view that monetary policy was accommodative and the FOMC fell behind the curve. However, monetary policy shocks contributed only about 0.6 percentage points (annualized) to the unexpected increase in core inflation during this period; the dominant and larger source of the inflation surge was non-monetary policy shocks. The model therefore finds that the delay in tightening was not the primary driver of the 2021 inflation surge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Do time-varying sign restrictions materially affect inference, as demonstrated in Section 6.8?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Comparing the baseline identification scheme (Restrictions 1 and 2, with Restriction 2 not imposed during exceptional periods) against an alternative scheme that imposes both restrictions throughout the entire sample reveals differences in the estimated monetary policy shocks, particularly in 2021:Q4. Under the alternative scheme, there was an expansionary monetary policy shock in 2021:Q4, while the baseline finds the shock was nearly centered around zero. Additionally, for 2021:Q2, the alternative scheme implies the contemporaneous output response to an expansionary monetary policy shock is more likely to have been positive, whereas the baseline scheme yields a different posterior distribution for this response. These differences illustrate that imposing or omitting restrictions in specific periods affects inference about structural shocks and impulse responses at economically important junctures.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Rotation-Invariant Time-Varying SVAR:&lt;/strong&gt; A class of time-varying SVAR models whose prior over sequences of structural parameters satisfies: for every permissible sequence of structural parameters and every sequence of orthogonal matrices, the orthogonally-rotated sequence is also permissible and receives the same prior density. This ensures the prior does not break the observational equivalence among structural parameter sequences related by orthogonal rotation, so that identification comes solely from the imposed restrictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Observational Equivalence in Time-Varying SVARs:&lt;/strong&gt; Two sequences of time-varying structural parameters are observationally equivalent if and only if there exists a sequence of orthogonal matrices such that one sequence equals the other sequence post-multiplied period-by-period by the corresponding orthogonal matrix. This definition extends Rothenberg&amp;rsquo;s (1971) concept to the time-varying setting and directly implies the rotation-invariance restriction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Random Correlations SVAR (RC-SVAR):&lt;/strong&gt; A specific member of the rotation-invariant class constructed by using the Archakov and Hansen (2021) parametrization of correlation matrices to define the prior over time-varying reduced-form parameters, combined with a uniform prior over sequences of orthogonal matrices. The prior is order-invariant and, for the empirical applications considered, generally achieves higher log-predictive scores than the workhorse Primiceri (2005) model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time-Varying Sign Restrictions:&lt;/strong&gt; Sign restrictions imposed only on selected time periods rather than uniformly across the sample, implemented by allowing the restriction function St() to differ across t (including the possibility that no restriction is imposed at some t). This allows researchers to tailor identification to periods in which the theoretical or institutional knowledge motivating the restriction is deemed applicable—e.g., imposing policy-rule contemporaneous restrictions only when the federal funds rate is the primary policy instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Particle Gibbs with Ancestor Sampling (PGAS):&lt;/strong&gt; The sequential Monte Carlo method (from Lindsten, Jordan and Schon 2014) used in the paper&amp;rsquo;s Algorithm 3 to draw the sequence of structural parameters At from its conditional posterior given the sign restrictions. PGAS conditions on the previous Gibbs draw of the structural parameter sequence to ensure an invariant distribution, which is the key property that makes the Gibbs sampler valid for drawing from the correct target posterior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Systematic Component of Monetary Policy:&lt;/strong&gt; In the paper&amp;rsquo;s structural monetary policy equation, the linear combination of contemporaneous endogenous variables (output growth, inflation, money growth, credit spread) that enters the federal funds rate equation, weighted by the contemporaneous elasticities ψ. It represents the portion of interest rate variation that is a predictable, rule-based response to economic conditions, as distinguished from the monetary policy shock (the residual).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Contemporaneous Elasticity:&lt;/strong&gt; The coefficient ψi,t in the monetary policy equation measuring the response of the federal funds rate to a one-unit contemporaneous change in variable i at time t, defined directly in terms of the structural parameter matrix At. The paper&amp;rsquo;s time-varying framework allows these elasticities to evolve over the sample, revealing historically distinct episodes of how aggressively the Fed responded to output growth, inflation, money growth, and credit spreads.&lt;/p&gt;</description></item><item><title>Input Sourcing under Climate Risk: Evidence from U.S. Manufacturing Firms</title><link>https://macropaperwarehouse.com/papers/input-sourcing-under-climate-risk-evidence-from-u.s.-manufacturing-firms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/input-sourcing-under-climate-risk-evidence-from-u.s.-manufacturing-firms/</guid><description>&lt;p&gt;Blaum, Esposito, and Heise study how supply chain risk — specifically, the risk of unexpected shipping delays caused by ocean weather conditions — affects U.S. manufacturing firms&amp;rsquo; import sourcing decisions. The paper asks three related questions: Do weather-induced shipping delays harm firm performance? Do firms adapt their sourcing strategies ex ante in response to shipping time risk? And what are the aggregate welfare costs of heightened supply chain risk from climate change, geopolitical tensions, and port congestion?&lt;/p&gt;
&lt;p&gt;The empirical foundation is the U.S. Census Bureau&amp;rsquo;s Longitudinal Firm Trade Transactions Database (LFTTD), covering the universe of U.S. import transactions from 1992 to 2016, merged with the Longitudinal Business Database and Annual Survey of Manufacturers for firm-level outcomes. For ocean shipments, the authors reconstruct vessel routes using vessel names, foreign port stops, and U.S. ports of entry, then map those routes to hourly wave height and direction data from NOAA&amp;rsquo;s WaveWatch III model at 0.5-degree resolution across more than 40,000 distinct maritime routes (period: 2011–2016 for weather data).&lt;/p&gt;
&lt;p&gt;The identification strategy proceeds in two steps. First, observed shipping times are regressed on a rich set of fixed effects — supplier, product, route-month, vessel, buyer, relationship status — plus controls for shipping charges and weight, to strip out anticipated determinants of delivery time. Second, the residuals are projected onto realized wave height and direction along the vessel&amp;rsquo;s route to isolate the weather-induced, unexpected component of shipping time variation. The identifying assumption is that realized wave conditions along the entire multi-week ocean crossing are not predictable by importers at the time orders are placed, beyond seasonal patterns absorbed by route-month fixed effects. This assumption is supported by the literature on weather forecasting, which finds accuracy degrades sharply beyond seven days.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s first empirical result concerns the consequences of weather-induced delays. Defining an extreme delay as a weather-induced shipping time above the 95th percentile for a given product-route, the authors estimate that a one standard deviation increase in the share of input costs that are weather-delayed (2.66 percentage points) reduces firm sales by 6.5%, profits by 3.5%, and employment by 1.0% within the same year. These effects are estimated from panel regressions for 2011–2016, with importer, product, and year fixed effects. The magnitudes indicate that firms are typically unable to fully hedge supply chain disruptions through insurance or financial instruments.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s second empirical result concerns ex ante adaptation. Risk exposure is measured as the standard deviation of weather-induced shipping times over three-year rolling windows for each supplier-route-product combination, then aggregated to the importer-product-year level using pre-determined import shares as weights (Bartik shift-share). Moving from the 25th to the 75th percentile of this shipping risk distribution increases the number of routes used by 7.7% and the number of foreign suppliers by 4.9%, while reducing total import value by 5.1%, route concentration (HHI) by 4.6%, and supplier concentration (HHI) by 3.2%. The risk effect on imports is estimated conditional on average shipping time, indicating that uncertainty exerts an additional, independent negative effect on import demand beyond the level of delays.&lt;/p&gt;
&lt;p&gt;To rationalize these findings, the authors build a quantitative general equilibrium model of importing with firm heterogeneity. Firms source domestic and foreign inputs; foreign input quality is reduced when delivery is late, and firms face uncertainty about shipping times when placing orders. Risk-neutral firms nonetheless face a concavity in expected revenues from monopolistic competition, so higher variance in input quality reduces expected profits. Firms can diversify by adding foreign suppliers (at a per-supplier fixed cost), and a key theoretical result is that a mean-preserving spread in supplier quality variance increases the optimal number of suppliers but, because the extensive-margin elasticity is less than one, total import value necessarily falls.&lt;/p&gt;
&lt;p&gt;The calibrated model is used to evaluate three counterfactual scenarios. Ocean wave height volatility increased by 0.34% per year on average between 2011 and 2023; projecting this trend forward 50 years generates a climate change scenario. The Houthi attacks in the Red Sea caused rerouting that raised both the mean and variance of navigation time. Post-Covid port congestion (2021–2022) increased the variance of port waiting times. Across all three scenarios, U.S. real income falls by 0.4% to 1.33%, driven by firms substituting toward more expensive domestic inputs as they reduce exposure to risky foreign sourcing.&lt;/p&gt;
&lt;p&gt;The sample scope is U.S. manufacturing importers using ocean shipping during 2011–2016 for the main empirical results (weather data period), with an extended robustness sample of 1992–2016 using residualized shipping time volatility. The study covers 43,080 origin-destination port pairs, 401,700 unique vessels, and approximately 35.8 million seaborne transactions.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s core research question?
A: The paper asks how supply chain risk — specifically, the risk of unexpected delays in ocean shipping caused by weather conditions — affects U.S. manufacturing firms&amp;rsquo; import sourcing decisions and aggregate welfare. It examines both the disruption effects of realized delays and the ex ante adaptation of sourcing strategies to risk exposure, then quantifies aggregate costs through a calibrated general equilibrium model.&lt;/p&gt;
&lt;p&gt;Q: What data sources underpin the empirical analysis?
A: The primary dataset is the LFTTD, which covers the universe of U.S. import transactions from 1992 to 2016, recording importer and exporter identities, HS-10 product codes, values, quantities, shipping dates, vessel names, and port pairs. This is merged with the Longitudinal Business Database for employment and industry, and with Census of Manufactures and Annual Survey of Manufacturers for sales, material costs, and payroll. Weather data come from NOAA&amp;rsquo;s WaveWatch III model at hourly, 0.5-degree resolution for 2011–2016. Ocean routes are constructed using Eurostat&amp;rsquo;s SeaRoute program, covering over 40,000 distinct routes across approximately 10,500 route segments.&lt;/p&gt;
&lt;p&gt;Q: How do the authors isolate the unexpected component of shipping time variation?
A: They use a two-step residualization. In step one, observed log shipping times are regressed on supplier, product, route-month, vessel, buyer, and relationship-status fixed effects, plus controls for log shipping charges and log weight; the residuals capture variation not explained by anticipated factors. In step two, these residuals are projected onto realized average wave height and relative wave direction along the vessel&amp;rsquo;s route to extract the weather-induced component. The identifying assumption is that importers cannot forecast realized wave conditions beyond seasonal patterns when placing orders that initiate multi-week ocean crossings, consistent with evidence that weather forecasts lose accuracy beyond seven days and that ocean wave height is particularly hard to predict.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated effects of weather-induced shipping delays on firm performance?
A: A one standard deviation increase in the share of input costs that are weather-delayed (2.66 percentage points) reduces firm sales by 6.5%, profits by 3.5%, and employment by 1.0% within the same year. Using a broader measure of residualized shipping time delays (not restricted to the weather-induced component) produces similar results: a one standard deviation increase reduces sales by 6%, profits by 3.2%, and employment by 0.9%. These effects are estimated from panel regressions for 2011–2016 with importer, product, and year fixed effects.&lt;/p&gt;
&lt;p&gt;Q: How do firms adjust their sourcing strategies in response to higher shipping time risk?
A: Moving from the 25th to the 75th percentile of the shipping risk distribution (a 61 log-point increase) raises the number of routes used by 7.7% and the number of foreign suppliers by 4.9%, while reducing route HHI by 4.6%, supplier HHI by 3.2%, and total import value by 5.1%. The margin of route diversification is larger than supplier diversification, consistent with shipping risk being determined primarily at the route level. Higher risk also increases the likelihood of switching to air freight by 1.0% over the same interquartile range.&lt;/p&gt;
&lt;p&gt;Q: Does the risk effect on imports operate independently of the level of shipping times?
A: Yes. The regressions of total import demand on risk exposure control for average shipping time, and the coefficient on risk remains negative and significant after this control. This indicates that the variance of shipping times has an independent negative effect on import demand beyond the first-moment effect of longer average delays.&lt;/p&gt;
&lt;p&gt;Q: What is the theoretical mechanism through which shipping time risk reduces import demand?
A: In the model, firms are risk-neutral but face monopolistically competitive output markets, which introduces curvature in the revenue function. Higher variance in input quality (stemming from unpredictable shipping times) reduces expected revenues even for risk-neutral firms. Firms can diversify by adding foreign suppliers at a per-supplier fixed cost, which reduces variance in average input quality. However, the elasticity of the optimal number of suppliers with respect to quality variance is less than one, so total import expenditure necessarily falls as variance rises — diversification is incomplete and firms substitute toward domestic inputs.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 1 state about the extensive margin response to risk?
A: Proposition 1 establishes that, under the condition that shipping time risk is small relative to expected revenues, a mean-preserving spread in the variance of supplier quality increases the optimal number of foreign suppliers. However, the elasticity of the optimal number of suppliers with respect to quality variance is strictly less than one, which implies that total import value necessarily falls whenever quality variance increases, regardless of the extensive margin diversification response.&lt;/p&gt;
&lt;p&gt;Q: How is the calibration structured and what moments does it target?
A: The model features firm heterogeneity in both productivity and shipping time risk (variance of delivery times). The calibration targets three sets of moments: the estimated effect of shipping time risk on the extensive margin of importing (number of suppliers), the negative association between firm sales and average shipping times (which disciplines the timeliness elasticity parameter tau), and the joint distribution of firm size and risk observed in the data — specifically, the empirical finding that larger importers are matched with safer (lower-risk) foreign suppliers, with a correlation of -0.12. The calibrated model replicates the key moments of shipping time risk and import demand.&lt;/p&gt;
&lt;p&gt;Q: What are the three counterfactual scenarios and their aggregate welfare costs?
A: (1) Climate change: ocean wave height volatility increased by 0.34% per year on average between 2011 and 2023; projecting this trend forward 50 years and passing the resulting increase in shipping time variance through the model. (2) Red Sea/Houthi attacks: re-routing around the Suez Canal raises both the mean and variance of navigation time. (3) Post-Covid port congestion: greater variability in port waiting times during 2021–2022. Across all three scenarios, U.S. real income falls by 0.4% to 1.33%, driven by firms substituting from cheaper foreign inputs toward more expensive domestic production to reduce risk exposure.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the shift-share (Bartik) instrument in the risk exposure measure?
A: The exposure measure aggregates supplier-route-product level risk (standard deviation of weather-induced shipping times over three-year rolling windows) to the importer-product-year level using pre-determined import shares from the prior three years as weights. Using lagged shares rather than contemporaneous shares ensures that the weights are not endogenous to current sourcing decisions. This construction is standard in the Bartik shift-share literature and helps isolate variation in risk that is plausibly exogenous to the firm&amp;rsquo;s current sourcing choices.&lt;/p&gt;
&lt;p&gt;Q: How do the authors handle the endogeneity concern that firms may select into riskier routes?
A: The weather-induced component of shipping time variation is by construction driven by realized ocean conditions that are unpredictable at the time orders are placed. The residualization removes all fixed-effect variation associated with route, season, vessel, supplier, and buyer characteristics. Additionally, the shift-share construction uses pre-determined weights, so risk exposure does not mechanically reflect current sourcing decisions. The authors also show robustness using the longer 1992–2016 sample with residualized (rather than weather-specific) shipping time volatility, obtaining qualitatively and quantitatively similar results.&lt;/p&gt;
&lt;p&gt;Q: What does the paper contribute relative to the literature on shipping times and trade?
A: Prior work by Evans and Harrigan (2005) and Hummels and Schaur (2010, 2013) focused on the level of shipping times (the first moment) as a trade cost. This paper is the first to systematically study the variance of shipping times (the second moment) as an independent determinant of import demand and sourcing structure, both empirically and theoretically. The authors show that uncertainty around delivery times has negative effects on trade that are separate from the effects of longer average delays.&lt;/p&gt;
&lt;p&gt;Q: What are the robustness checks reported for the main empirical results?
A: For the effects of risk on sourcing behavior, the authors show that using residualized shipping time volatility over the longer 1992–2016 sample (rather than the weather-induced measure over 2011–2016) produces similar results: moving from the 25th to the 75th percentile increases routes by 6.6%, suppliers by 3.7%, decreases route HHI by 3.9%, and supplier HHI by 2.5%, while reducing total imports by 10.5%. For the effects of delays on firm performance, applying the same specification with residualized (not weather-induced) delay shares yields coefficients on sales, profits, and employment that are very close to the baseline estimates.&lt;/p&gt;
&lt;p&gt;Q: What are the welfare implications for firms that cannot hedge through financial markets?
A: The large negative effects of weather-induced delays on sales, profits, and employment — and the finding that firms respond by ex ante restructuring their supply chains rather than relying on insurance — indicate that financial hedging instruments are largely unavailable or insufficient for managing input delivery risk. This motivates the model&amp;rsquo;s assumption that firms must manage risk through sourcing diversification, which is costly because of per-supplier fixed costs and because it ultimately requires substituting toward more expensive domestic inputs.&lt;/p&gt;
&lt;p&gt;Weather-induced unexpected shipping time: The component of shipping time variation explained by realized ocean wave height and direction along the vessel&amp;rsquo;s route, after removing all variation attributable to anticipated factors (route, season, vessel, supplier, buyer characteristics, shipping charges, weight). Interpreted as unexpected because multi-week ocean crossings begin before accurate weather forecasts are available.&lt;/p&gt;
&lt;p&gt;Shipping time risk: Measured as the standard deviation of weather-induced residualized shipping times over three-year rolling windows for each foreign supplier-route-product combination. This captures the second moment (variance) of delivery time uncertainty, distinct from the first moment (average shipping time level).&lt;/p&gt;
&lt;p&gt;Shift-share risk exposure: An importer-product-year level risk measure constructed as a weighted average of supplier-route-product level risk, using pre-determined import shares from the prior three years as weights. This Bartik-style construction ensures exposure weights are not endogenous to current sourcing decisions.&lt;/p&gt;
&lt;p&gt;Timeliness elasticity (tau): A structural parameter in the model governing how rapidly input quality degrades when delivery is later than expected. Specifically, when a shipment arrives di days late, quality is reduced by the factor exp(-tau*(di - E[di])). Calibrated to match the observed negative association between firm sales and average shipping times in the data.&lt;/p&gt;
&lt;p&gt;Extensive margin diversification: The response of firms to higher shipping time risk by increasing the number of foreign suppliers and shipping routes used for a given product, rather than increasing the volume sourced from existing suppliers. In the model and data, this margin is the primary channel through which firms hedge delivery risk.&lt;/p&gt;
&lt;p&gt;Mean-preserving spread condition: The theoretical condition (Proposition 1) under which higher variance in supplier quality increases the optimal number of foreign suppliers. The condition requires that shipping time risk be small relative to expected revenues, so that the diversification benefit of adding suppliers (reducing variance in average quality) dominates the revenue-reducing effect of higher variance.&lt;/p&gt;
&lt;p&gt;Per-supplier fixed cost: A fixed cost in the model that must be paid for each foreign supplier relationship maintained. This cost limits the extent of diversification, ensuring that firms cannot fully eliminate shipping time risk by adding arbitrarily many suppliers, and that higher risk raises (rather than eliminates) per-unit sourcing costs.&lt;/p&gt;</description></item><item><title>Insurer Risk and Public Risk-Sharing: Quantifying the Value of Reinsurance</title><link>https://macropaperwarehouse.com/papers/insurer-risk-and-public-risk-sharing-quantifying-the-value-of-reinsurance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/insurer-risk-and-public-risk-sharing-quantifying-the-value-of-reinsurance/</guid><description>&lt;p&gt;Kim and Li study how publicly provided reinsurance affects insurer behavior and market outcomes in health insurance markets where firms face substantial cost uncertainty. The central question is whether standard expected-profit models—which predict that reinsurance reducing only cost volatility (not expected cost) should leave prices unchanged—miss an important mechanism: insurers internalizing the implicit financial cost of bearing claims uncertainty through &amp;ldquo;risk charges.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;The paper develops a stylized monopoly-insurer model in which the insurer&amp;rsquo;s objective includes both expected claims cost and a risk charge term L(S), where S is a risk measure (e.g., standard deviation of total claims). This yields a first-order condition in which effective marginal cost includes both standard expected claims cost and a marginal risk charge. The model predicts that public reinsurance acts through two distinct channels: (1) a cost subsidy—reimbursing a share of high-cost claims reduces expected cost; and (2) risk protection—reducing the variance of claims lowers the risk charge and thus effective marginal cost. When both channels operate, the model predicts pass-through of public reinsurance to premiums can exceed unity, in contrast to the standard less-than-one pass-through under market power.&lt;/p&gt;
&lt;p&gt;Empirically, the authors use three primary data sources for the U.S. individual health insurance exchange market. NAIC Schedule S filings (2014–2023) provide transaction-level private reinsurance contracts, including ceded premiums, realized claims, and financial solvency measures. CMS Public Use Files and MLR reports provide plan-level premiums, enrollment, and claims. The Colorado All Payer Claims Database (CO APCD, 2014–2022) and Connect for Health Colorado administrative records (2015–2021) provide individual-level claims and insurance choices for structural analysis.&lt;/p&gt;
&lt;p&gt;Descriptive evidence establishes that 62% of exchange insurers purchase private reinsurance despite average reinsurance markups of 1.54 (reinsurance margin of 0.54), and that smaller, less financially solvent insurers are disproportionate buyers—consistent with risk charges driving demand for risk protection even at above-actuarially-fair prices.&lt;/p&gt;
&lt;p&gt;An event study exploiting staggered adoption of state-level public reinsurance programs finds that public reinsurance reduces premiums by approximately 14.5% on average (27% in Colorado Tiers 1–2, 46% in Tier 3), with a pass-through rate of 1.3—significantly greater than one (p = 0.037 one-sided). Public reinsurance reduces the probability of purchasing private reinsurance by 26 percentage points (a 42% reduction from baseline) and per-member private reinsurance expenditures by $19.5 (a 68% reduction from baseline). Premium and private reinsurance effects are larger for financially constrained insurers (RBC ratio below 3). No significant effects are found on insurer entry/exit, total medical expenses (ruling out moral hazard), or private reinsurance markups.&lt;/p&gt;
&lt;p&gt;The structural model, estimated on the Colorado exchange for 2017–2020, finds that the risk charge coefficient for regional insurers averages rho = 0.25, implying regional insurers face 9.8% higher effective costs than national insurers due to risk charges and private reinsurance expenses. Risk charges account for at least half the premium-cost wedge for small regional insurers. Counterfactual decomposition of Colorado&amp;rsquo;s program shows the direct cost subsidy accounts for approximately 75% of equilibrium price reductions; risk protection and competition effects together account for the remaining 25%. In a bang-for-buck comparison, public reinsurance dominates premium subsidies of equal government expenditure by approximately 20–30%, because reinsurance uniquely reduces risk charges and enhances competition by reducing smaller regional insurers&amp;rsquo; cost disadvantage.&lt;/p&gt;
&lt;p&gt;Q: What is the core theoretical innovation of the paper?
A: The paper adds a risk charge term L(S) to the standard expected-profit objective, where S is a risk measure of the insurer&amp;rsquo;s cost distribution. This makes the insurer behave &amp;ldquo;as if risk averse,&amp;rdquo; with effective marginal cost including both expected claims cost and a marginal risk charge that decreases with insured pool size due to risk pooling. When rho = 0, the model collapses to the standard monopoly case; when rho &amp;gt; 0, cost uncertainty directly inflates prices and creates a novel role for reinsurance even when reinsurance is actuarially fair priced.&lt;/p&gt;
&lt;p&gt;Q: What are the two distinct mechanisms through which public reinsurance affects insurer pricing?
A: The first is a cost subsidy: by reimbursing a portion of high-cost claims without requiring an actuarially fair premium upfront, public reinsurance lowers the insurer&amp;rsquo;s net expected cost. The second is risk protection: by providing ex-post payments for extreme health shocks, reinsurance reduces the variance of claims costs, lowering the risk charge component of effective marginal cost. Together, these channels can produce pass-through exceeding unity even under imperfect competition, where standard cost-subsidy pass-through is typically below one.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 1 say about actuarially fair reinsurance (theta = 1)?
A: Proposition 1(i) states that actuarially fair reinsurance—which does not alter net expected cost—still lowers the insurer&amp;rsquo;s price if and only if the insurer faces a risk charge (rho &amp;gt; 0). An insurer without risk charges is entirely unaffected by actuarially fair reinsurance. This result isolates the risk-protection channel as theoretically distinct from cost subsidization and establishes that pass-through exceeding one requires risk charges to be operative.&lt;/p&gt;
&lt;p&gt;Q: Why would an insurer purchase costly private reinsurance (theta &amp;gt; 1)?
A: Proposition 1(iii) shows that an insurer with no risk charge would never purchase private reinsurance with theta &amp;gt; 1, since it increases net expected cost with no offsetting benefit. An insurer facing a risk charge (rho &amp;gt; 0) may purchase private reinsurance because the risk-protection benefit—the reduction in cost variance and thus the risk charge—can outweigh the net cost increase. The paper documents that 62% of exchange insurers buy private reinsurance at an average markup of 1.54 (reinsurance margin 0.54), with smaller and financially weaker insurers more likely to purchase, consistent with this mechanism.&lt;/p&gt;
&lt;p&gt;Q: How does the paper establish empirically that insurers face and internalize cost uncertainty?
A: Three lines of evidence are presented. First, the CO APCD shows the claims distribution has a long right tail: the top 5% (1%) of consumers account for 68% (38%) of total expenses, and 2.5% of consumers exceed the $30,000 reinsurance threshold. Second, simulations show that with 1,000 enrollees, the probability that realized claims exceed expected costs by 25% is approximately 7%; even at 10,000 enrollees there is a 17% probability of exceeding expected costs by 5%. Third, in over 24% of insurer-year observations premium revenue falls short of realized claims costs, and the within-firm standard deviation of the claims-to-premium ratio is 0.15.&lt;/p&gt;
&lt;p&gt;Q: What are the event study findings on premiums?
A: Using staggered introduction of state-level public reinsurance programs, the event study finds premiums fell by 14.5% on average following program adoption. In Colorado specifically, Tiers 1 and 2 experienced 27% decreases and Tier 3 (highest reinsurance generosity) experienced a 46% decrease. The implied pass-through rate for 2020 is 1.3, meaning for every dollar the government spent on reinsurance, health insurance premiums fell by $1.30. A one-sided t-test rejects pass-through equal to one at p = 0.037.&lt;/p&gt;
&lt;p&gt;Q: What are the event study findings on private reinsurance?
A: Public reinsurance reduces the probability that an insurer purchases private reinsurance by 26 percentage points, a 42% decline from the pre-program baseline. Average per-member private reinsurance expenditures fall by $19.5, a 68% reduction from baseline. The substitution away from private reinsurance is consistent with the model prediction that public reinsurance displaces the demand for risk protection previously met by private markets, and reinforces the interpretation that risk management is a key driver of private reinsurance demand.&lt;/p&gt;
&lt;p&gt;Q: Do financially constrained insurers respond differently to public reinsurance?
A: Yes. The premium-reduction effect is significantly larger for insurers with RBC ratios below 3 (an additional interaction effect of -0.161 log points on top of the baseline -0.135). The reduction in per-member private reinsurance expenditures is also significantly larger for insurers with significant prior private reinsurance purchases (-$108.8 vs. baseline of -$19.5). This heterogeneity supports the hypothesis that the risk protection channel is more valuable for financially constrained insurers who face higher implicit costs of bearing risk.&lt;/p&gt;
&lt;p&gt;Q: Does public reinsurance affect insurer entry/exit, moral hazard, or private reinsurance markups?
A: The event study finds no statistically significant effect on market entry, total monthly medical expenses per enrollee, the probability that individual expenses exceed the reinsurance threshold (ruling out insurer moral hazard), or private reinsurance markups paid by primary insurers. These null results support the interpretation that premium reductions reflect reduced cost uncertainty rather than cost containment distortions, and that the competitive structure of the private reinsurance market is not directly altered by public programs.&lt;/p&gt;
&lt;p&gt;Q: What are the structural estimates of risk charges?
A: The estimated risk charge coefficient for regional insurers averages rho = 0.25. This implies that regional insurers incur, on average, 9.8% higher effective costs than national insurers (who are assumed not to face risk charges due to scale and diversification), stemming from both direct risk charges and private reinsurance expenses required to manage risk. Risk charges account for at least half the observed wedge between premiums and marginal claims costs for small regional insurers.&lt;/p&gt;
&lt;p&gt;Q: How does the structural model decompose the impact of Colorado&amp;rsquo;s reinsurance program?
A: Counterfactual analysis decomposes the equilibrium price reduction into three channels. The direct cost subsidy effect—reimbursing a share of high-cost claims between the $30,000 attachment point and $400,000 cap—accounts for approximately 75% of the price reduction. The risk protection effect (reduction in risk charges from lower portfolio variance) and the competition effect (smaller regional insurers facing lower cost disadvantages and competing more aggressively with national insurers) together account for the remaining 25% of the equilibrium price reduction.&lt;/p&gt;
&lt;p&gt;Q: How does public reinsurance compare to premium subsidies in bang-for-buck terms?
A: For equal government expenditure, public reinsurance is estimated to be approximately 20–30% more cost-effective than premium subsidies at reducing premiums. The advantage stems from two sources: reinsurance reduces risk charges, shifting down the marginal cost curve for regional insurers in a way demand-side premium subsidies do not; and reinsurance enhances competition by reducing the cost disadvantage of smaller regional insurers relative to national ones. The dominant effect is risk reduction rather than markup inflation, making reinsurance the more efficient instrument when the degree of financial risk is considerable.&lt;/p&gt;
&lt;p&gt;Q: What is the role of market size in risk charges, and why does this create a competitive asymmetry?
A: The model shows that the marginal risk charge decreases as the insured population grows (risk pooling), with marginal standard deviation equal to sigma_0 / (2*sqrt(q)), which vanishes as q approaches infinity. This implies that larger national insurers, covering very large populations, effectively face no risk charges, while smaller regional insurers face meaningful marginal risk charges. This size-asymmetry is the fundamental reason why public reinsurance disproportionately benefits smaller insurers—by reducing their risk charges, it narrows the cost gap with national insurers and intensifies competition.&lt;/p&gt;
&lt;p&gt;Q: What scope conditions apply to the structural findings?
A: The structural estimates are based on the Colorado individual health insurance exchange, covering years 2017–2020, chosen to avoid unsatisfactory early data quality and to net out systematic pandemic effects. The model assumes national insurers do not face risk charges in the baseline specification, and that aggregate (correlated) risk is not the primary driver during the sample period. Results are robust to staggered-treatment corrections (Callaway-Sant&amp;rsquo;Anna 2021; Borusyak et al. 2024), alternative outcome measures (benchmark premiums, Silver plan averages), alternative aggregation levels, and sensitivity analyses allowing for insurer entry/exit, correlated risks, moral hazard, and alternative risk charge functional forms.&lt;/p&gt;
&lt;p&gt;Q: What are the broader policy implications of the framework?
A: The framework applies to any market where firms face substantial cost uncertainty and internalize financial risk, including property and casualty insurance, flood insurance, wildfire insurance, and government loan guarantee programs. The analysis suggests that ignoring the risk protection channel causes policymakers to underestimate the effectiveness of public reinsurance relative to demand-side subsidies. Supply-side risk-sharing policies are particularly important for markets with small, financially constrained firms, where cost uncertainty most severely distorts pricing and competition, and where the competitive benefits of risk reduction are largest.&lt;/p&gt;
&lt;p&gt;Risk Charge: An additional cost term in the insurer&amp;rsquo;s objective function representing the implicit financial cost of bearing claims uncertainty, formalized as L(S) where S is a risk measure of total cost. Risk charges make the insurer behave &amp;ldquo;as if risk averse,&amp;rdquo; raising effective marginal cost above expected claims cost. In the baseline model the risk charge equals rho times the standard deviation of total claims.&lt;/p&gt;
&lt;p&gt;Risk Charge Coefficient (rho): The parameter governing the insurer&amp;rsquo;s marginal cost of financial risk, estimated structurally at an average of 0.25 for regional insurers in Colorado. It can be interpreted as either a direct risk-aversion parameter, the marginal cost of regulatory capital, or a reduced-form representation of financial and regulatory frictions that make bearing cost uncertainty costly.&lt;/p&gt;
&lt;p&gt;Risk Protection Channel: The mechanism through which reinsurance (public or private) reduces claims cost variance and thereby lowers the insurer&amp;rsquo;s risk charge, distinct from the cost-subsidy channel. The risk protection channel is operative even for actuarially fair reinsurance (theta = 1) and is responsible for pass-through rates exceeding unity under public reinsurance programs.&lt;/p&gt;
&lt;p&gt;Cost Subsidy Channel: The mechanism through which subsidized public reinsurance (theta less than 1) lowers the insurer&amp;rsquo;s net expected claims cost by reimbursing a share of high-cost claims without charging an actuarially fair premium. This channel operates regardless of whether the insurer faces risk charges and is the primary channel in standard models.&lt;/p&gt;
&lt;p&gt;Pass-Through Rate: The ratio of premium reduction to government expenditure on reinsurance. In standard models with market power, pass-through of cost subsidies is typically below one; the paper documents a pass-through rate of 1.3 in Colorado (p = 0.037 for the null of pass-through equal to one), attributing the excess to the risk protection channel reducing both expected cost and cost uncertainty simultaneously.&lt;/p&gt;
&lt;p&gt;Stop-Loss Reinsurance: A contract structure in which the reinsurer reimburses the primary insurer for individual claims costs exceeding a deductible (attachment point) kappa up to a cap. In Colorado&amp;rsquo;s program the attachment point is $30,000 and the cap is $400,000, with government coinsurance rates of 40–80% depending on county tier. More generous reinsurance corresponds to lower kappa; full reinsurance is kappa = 0.&lt;/p&gt;
&lt;p&gt;Risk-Based Capital (RBC) Ratio: The ratio of capital surplus (assets minus liabilities) to required risk-based capital, used by NAIC as a measure of insurer solvency. NAIC scrutinizes companies with RBC ratios below 200%; the paper uses RBC ratio below 3 as a proxy for financial constraint in heterogeneity analysis, finding larger premium and private reinsurance responses among constrained insurers.&lt;/p&gt;
&lt;p&gt;Tail-End Risk: The risk arising from the possibility that a small fraction of enrollees incurs extremely high medical costs, concentrated in the right tail of the claims distribution. In Colorado, the top 5% of consumers account for 68% of total expenses; tail-end risk is especially severe for small insurers with fewer than 10,000–100,000 enrollees and is the primary motivation for private reinsurance purchases even at above-actuarially-fair prices.&lt;/p&gt;</description></item><item><title>Insuring Peace: Index-Based Livestock Insurance, Droughts, and Conflict</title><link>https://macropaperwarehouse.com/papers/insuring-peace-index-based-livestock-insurance-droughts-and-conflict/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/insuring-peace-index-based-livestock-insurance-droughts-and-conflict/</guid><description>&lt;p&gt;This paper provides quasi-experimental evidence that Index-Based Livestock Insurance (IBLI) — a remote-sensing-triggered, automated payout scheme for pastoralists — substantially reduces drought-induced conflict in Kenya over the 2001–2020 period.&lt;/p&gt;
&lt;p&gt;The research question is whether a market-based financial instrument can mitigate the causal chain running from drought shocks to violent conflict between nomadic pastoralists and sedentary farmers and other land users. The authors motivate the study by documenting that droughts force pastoralists out of their traditional grazing grounds and into mixed-land-use areas (farms, ranches, urban settlements, nature reserves), where miscoordination with other land users escalates into violence. A case study of the Samburu-Laikipia-Isiolo-Meru region in central Kenya — drawing on georeferenced survey data from Lengoiboni et al. (2010) and ACLED conflict events — validates this spatial mechanism: during droughts, roughly 60–90% of non-pastoral land users report encounters with pastoralists, and conflicts accumulate precisely where drought migration routes cross into non-pastoral land.&lt;/p&gt;
&lt;p&gt;The empirical design combines two sources of variation: (1) plausibly exogenous changes in rainfall deficits at the 0.1 × 0.1-degree grid-cell level (roughly 10 × 10 km), derived from NASA GPM satellite data; and (2) the staggered, five-wave rollout of IBLI across 146 insurance districts in Kenya from 2010 onward, which the authors argue was driven primarily by technical challenges rather than pre-existing conflict or drought patterns. The unit of observation is 94,300 cell-periods. Because conflicts due to pastoralist drought migration occur in the neighborhood of affected areas rather than within them, both drought and IBLI coverage are measured as inverse-distance-weighted averages over surrounding cells. The estimating equation is a linear probability model with cell and period fixed effects, interacting neighborhood rainfall deficit with neighborhood IBLI coverage; the coefficient on this interaction term (delta3) is the parameter of interest.&lt;/p&gt;
&lt;p&gt;The main finding is that a one-standard-deviation increase in neighborhood IBLI coverage reduces the semi-elasticity of neighborhood rainfall deficit on conflict probability by approximately 23%. In absolute terms, a one-percentage-point increase in the rainfall deficit raises the probability of conflict by 6.92 percentage points at average IBLI coverage; with one additional standard deviation of neighborhood IBLI, that same deficit raises conflict probability by only 5.34 percentage points — a reduction of 1.58 percentage points against a baseline conflict probability of roughly 2.5%.&lt;/p&gt;
&lt;p&gt;Scope conditions: the effect is estimated for Kenya specifically, over a pastoralist-heavy population of approximately 8.8 million out of 53 million Kenyans, during 2001–2020. The conflict-mitigating effect is approximately four times larger in mixed-land-use areas (nine times when rollout-cluster-times-period fixed effects are included), consistent with the theoretical expectation that IBLI matters most where pastoralists are most likely to encounter other land users during drought migration.&lt;/p&gt;
&lt;p&gt;Two mechanisms are identified. First, IBLI reduces migratory pressure: when pastoral homelands have IBLI coverage, the distance between the ethnic homeland centroid and conflict events involving that group decreases, indicating reduced drought migration. Second, IBLI smooths incomes — corroborated with Afrobarometer geo-coded data — raising the opportunity cost of fighting. An instrumental-variable specification finds that actual IBLI payouts in the neighborhood reduce conflict probability by approximately 150% relative to the baseline risk.&lt;/p&gt;
&lt;p&gt;A cost-effectiveness analysis finds that even using conservative World Health Organization or World Bank estimates of the value of statistical life, IBLI delivers fatality savings of between 10 and 22 cents per dollar spent on government subsidies for the program, making it a cost-effective complement to political and institutional conflict-mitigation approaches.&lt;/p&gt;
&lt;p&gt;Q: What is the core causal mechanism linking droughts to conflict that IBLI interrupts?&lt;/p&gt;
&lt;p&gt;A: Droughts deplete forage in pastoralists&amp;rsquo; traditional grazing grounds, forcing them to migrate into mixed-land-use areas — farms, ranches, urban settlements, and nature reserves — where encounters with other land users are more likely to escalate into violence. Without insurance, pastoralists hold excess livestock as precautionary savings, amplifying the extent of necessary migration during dry periods. IBLI payouts allow pastoralists to purchase forage locally, reducing migration distance and intensity, and also smooth income, raising the opportunity cost of engaging in violence.&lt;/p&gt;
&lt;p&gt;Q: How does IBLI work technically, and why does it overcome problems of traditional livestock insurance?&lt;/p&gt;
&lt;p&gt;A: IBLI uses satellite remote sensing to calculate whether a district-specific drought threshold has been crossed; if so, automated payments are triggered immediately without requiring direct loss assessment or field inspections. This design eliminates moral hazard and adverse selection problems inherent in traditional indemnity insurance, reduces monitoring costs, and enables fast delivery via mobile payment platforms such as MPESA even to remote households. The Kenyan government rebranded the program as the Kenyan Livestock Insurance Program (KLIP) in 2015 and fully subsidizes coverage for up to five tropical livestock units per household.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the main conflict-mitigation result?&lt;/p&gt;
&lt;p&gt;A: A one-standard-deviation increase in neighborhood IBLI coverage reduces the semi-elasticity of the neighborhood rainfall deficit on conflict probability by approximately 23% (delta3/delta1 = -0.0158/0.0692). In absolute terms, this translates to a reduction from a 6.92 percentage-point increase in conflict probability per one-percentage-point rainfall deficit to a 5.34 percentage-point increase — a decline of 1.58 percentage points against a mean conflict probability of roughly 2.5%.&lt;/p&gt;
&lt;p&gt;Q: Why do the authors use a neighborhood rather than cell-level treatment measure?&lt;/p&gt;
&lt;p&gt;A: Drought-induced pastoralist conflicts occur primarily not in the pastoral home areas themselves but in neighboring regions where drought migration routes cross into non-pastoral land. The case study documents this pattern directly: ACLED conflict events accumulate where migration routes from Namelok, Lodungokwe, and Ngaremara communities intersect urban or agricultural areas, not within the pastoral zones. The neighborhood approach, using inverse-distance-weighted averages, captures both the probability of migration from surrounding cells and the declining probability of migration with distance.&lt;/p&gt;
&lt;p&gt;Q: What is the main identification concern and how do the authors address it?&lt;/p&gt;
&lt;p&gt;A: The main concern is that the timing of the IBLI rollout is endogenously determined — areas with a higher latent drought-conflict elasticity might receive coverage earlier or later, biasing the interaction coefficient. The authors show that the pre-treatment drought-conflict elasticity has no systematic correlation with either IBLI eligibility or the timing of coverage receipt. Placebo tests interacting the neighborhood rainfall deficit with pre-treatment eligibility or eventual coverage indicators yield positive, statistically insignificant coefficients, suggesting any bias would run in the direction of underestimating the mitigation effect. A permutation test randomly reassigning IBLI coverage across the six rollout clusters finds the actual point estimate is in the bottom 2.2% of the simulated distribution, indicating it is unlikely to arise from cluster-level confounders.&lt;/p&gt;
&lt;p&gt;Q: How do the authors rule out that other programs — cash transfers or development aid — explain the result?&lt;/p&gt;
&lt;p&gt;A: The authors control for cell-level and neighborhood-level coverage of Kenya&amp;rsquo;s Hunger Safety Net Programme (HSNP), which provides unconditional cash transfers to vulnerable households and covers most IBLI-eligible areas, as well as for World Bank agricultural aid projects. Across these specifications, the estimated conflict mitigation ranges from -19.16% to -42.24%, with the baseline estimate of -22.79% remaining robust, indicating neither HSNP nor development aid is a plausible alternative explanation.&lt;/p&gt;
&lt;p&gt;Q: What is the alternative identification strategy using within-rollout-cluster variation?&lt;/p&gt;
&lt;p&gt;A: The authors exploit pre-determined (1984 government land-use map) variation in mixed-land-use status across cells within the same IBLI rollout cluster-period, including rollout-cluster-times-period fixed effects that absorb any omitted variable related to the potentially endogenous rollout steps. The conflict-mitigating effect of IBLI is approximately four times larger in mixed-land-use cells, and approximately nine times larger in the most restrictive specification with rollout-cluster-times-period fixed effects, consistent with the prediction that IBLI matters most where pastoralists encounter other land users.&lt;/p&gt;
&lt;p&gt;Q: How do the authors establish the migratory pressure mechanism?&lt;/p&gt;
&lt;p&gt;A: Following Eberle et al. (2023), the authors match conflict actors to ethnic homelands using Murdock (1967) boundaries and test whether IBLI coverage in a homeland reduces the distance between the homeland centroid and conflict events involving that group. They find that it does, indicating that IBLI coverage reduces the spatial range of pastoralist drought migration and thus the probability of conflict-generating encounters with other land users.&lt;/p&gt;
&lt;p&gt;Q: How do the authors establish the income-smoothing mechanism?&lt;/p&gt;
&lt;p&gt;A: Using geo-coded Afrobarometer survey data, the authors show that IBLI coverage is associated with higher reported incomes among pastoralist households, consistent with Jensen et al. (2017). Higher incomes raise the opportunity cost of fighting (following Grossman, 1991), contributing to the overall conflict-mitigating effect alongside reduced migratory pressure.&lt;/p&gt;
&lt;p&gt;Q: What does the instrumental variable specification find?&lt;/p&gt;
&lt;p&gt;A: The authors instrument inverse-distance-weighted IBLI payouts in the neighborhood with the interaction of neighborhood rainfall deficit and neighborhood IBLI coverage. The first stage confirms that rainfall deficits trigger payouts conditional on coverage. The second stage finds that the occurrence of payouts in the neighborhood reduces the probability of conflict by approximately 150% relative to the baseline risk, corroborating the reduced-form results.&lt;/p&gt;
&lt;p&gt;Q: How do the authors assess cost-effectiveness?&lt;/p&gt;
&lt;p&gt;A: The authors predict plausible drought-induced conflict fatalities in Kenya over the pre-treatment period and calculate yearly lives saved from the main estimates, then compare the monetary value of saved lives to government subsidy expenditures on IBLI. Using conservative VSL estimates from the WHO and World Bank, IBLI delivers between 10 and 22 cents of pure fatality savings per dollar of public subsidy expenditure.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results to alternative drought and conflict measures?&lt;/p&gt;
&lt;p&gt;A: Results are qualitatively similar using an Aridity Index or Dry Matter Productivity (DMP) as drought proxies instead of rainfall deficit. The estimated interaction effect maintains a t-statistic above two for spatial decay functions ranging from distance^-0.5 to distance^-1.5 and for Conley standard error cutoffs from 200 km up to 400 km. Results also hold when restricting to conflict events not involving the government, or to battles, riots, and violence against civilians only, and when excluding the pre-IBLI period (2000–2009) entirely.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications regarding scalability?&lt;/p&gt;
&lt;p&gt;A: Pastoralism covers 43% of the African landmass across 36 countries, supporting approximately 268 million people (FAO, 2018). The World Bank and private equity were planning to invest close to 900 million dollars in East African pastoralist programs over 2023–2027. The authors argue that IBLI&amp;rsquo;s cost structure — high fixed costs of technology and setup but low marginal costs of expansion — gives it a scalability advantage over cash transfer programs or public works schemes that require sustained state capacity. Market-based IBLI complements rather than substitutes for political and institutional reforms.&lt;/p&gt;
&lt;p&gt;Index-Based Livestock Insurance (IBLI): A financial instrument that uses satellite remote sensing to automatically trigger preemptive cash payouts to pastoralists when a pre-determined district-specific drought threshold is crossed, bypassing direct loss assessment and thereby eliminating moral hazard and adverse selection problems inherent in traditional indemnity insurance.&lt;/p&gt;
&lt;p&gt;Drought-conflict semi-elasticity: The percentage-point change in the probability of conflict associated with a one-percentage-point increase in the rainfall deficit; the paper&amp;rsquo;s main outcome quantity, estimated at 6.92 percentage points at mean IBLI coverage, reduced by 23% for a one-standard-deviation increase in neighborhood IBLI coverage.&lt;/p&gt;
&lt;p&gt;Neighborhood approach: An empirical strategy that measures both drought severity and IBLI coverage as inverse-distance-weighted averages over all surrounding grid cells, reflecting the authors&amp;rsquo; finding that pastoralist drought-migration generates conflicts not in the pastoral home area but in neighboring mixed-land-use zones where migration routes intersect other land users.&lt;/p&gt;
&lt;p&gt;Migratory pressure: The mechanism by which drought forces pastoralists — who hold excess livestock as precautionary savings in the absence of insurance — to migrate farther from traditional grazing grounds into mixed-land-use areas, increasing the probability of encounters and violent miscoordination with farmers, urban dwellers, and protected-area managers.&lt;/p&gt;
&lt;p&gt;Mixed land use: Areas, designated using a 1984 Kenyan government land-use map, where pastoral grazing zones are proximate to farms, ranches, urban settlements, or nature reserves; the paper identifies these as the locations with the highest expected treatment intensity, where IBLI coverage reduces drought-induced conflict approximately four to nine times more than elsewhere.&lt;/p&gt;
&lt;p&gt;Tropical Livestock Unit (TLU): The standard unit of account for IBLI contracts in Kenya; one TLU corresponds to one head of cattle or ten goats or sheep; the Kenyan government fully subsidizes IBLI for up to five TLUs per household.&lt;/p&gt;
&lt;p&gt;Rollout-cluster-times-period fixed effects: A restrictive set of fixed effects included in the alternative identification strategy that absorbs all omitted variables varying at the level of the six IBLI spatial rollout clusters over time, allowing the authors to identify the conflict-mitigating effect purely from within-cluster variation in mixed-land-use exposure.&lt;/p&gt;</description></item><item><title>Intergenerational Impacts of Secondary Education: Experimental Evidence from Ghana</title><link>https://macropaperwarehouse.com/papers/intergenerational-impacts-of-secondary-education-experimental-evidence-from-ghana/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/intergenerational-impacts-of-secondary-education-experimental-evidence-from-ghana/</guid><description>&lt;p&gt;This paper provides experimental evidence on the intergenerational impacts of secondary education subsidies in a low-income context, leveraging a randomized controlled trial (RCT) conducted in rural Ghana with a 15-year longitudinal follow-up. The study exploits a 2008 scholarship lottery in which 682 students — drawn from 2,064 rural youth who had been admitted to public senior high school but had not enrolled due to financial constraints — were randomly selected to receive four-year secondary school scholarships covering full tuition and fees. Scholarship receipt increased senior high school completion by 27–28 percentage points for both men and women (from 39.8% to 67.2% for women; from 49.7% to 77.9% for men), and raised average years of education by 1.33 years.&lt;/p&gt;
&lt;p&gt;The central research question is whether secondary education subsidies generate intergenerational benefits — specifically, whether children of scholarship recipients have better survival and cognitive development outcomes — and what mechanisms drive any such effects.&lt;/p&gt;
&lt;p&gt;For female scholarship recipients, the scholarship significantly altered fertility timing and partnership. By 2013, female recipients were 6.9 percentage points less likely to have ever been pregnant (on a control-group base of 48.3%), with the decline driven almost entirely by a 7 percentage point (17%) reduction in unwanted pregnancies. Though total fertility eventually caught up by 2022, recipients were still less likely to be married or cohabiting as of 2019 and were significantly more likely to have a partner with tertiary education.&lt;/p&gt;
&lt;p&gt;Children of female scholarship recipients experienced substantially lower mortality. Among control-group female respondents, 3.5% of children died before age one and 4.0% before age three. These rates fell to 1.7% (p=0.028) and 2.2% (p=0.065) respectively among children of female recipients — a roughly 45–51% reduction in under-one and under-three mortality.&lt;/p&gt;
&lt;p&gt;Child cognitive development gains emerge only once children reach school age. Children of female recipients show no significant cognitive score differences at 18 months, 2.5 years, or 3.5 years, but score 0.238 standard deviations higher at age five (p=0.005) and 0.252 standard deviations higher at age seven (p=0.035). Effects span language, math and numeracy, spatial reasoning, and executive function, but not socio-cognitive development. These effect sizes fall between the 75th and 80th percentile of RCT-based educational intervention effect sizes in low- and middle-income countries.&lt;/p&gt;
&lt;p&gt;The primary mechanism is not higher income or greater monetary investment in children. The study finds no significant treatment effect on household SES index (0.107 SDs, p=0.103), no impact on formal schooling inputs, and no difference in parental aspirations or knowledge of child stimulation&amp;rsquo;s importance. Instead, more-educated mothers seek more prenatal care, engage in more preventive health behaviors, and — critically — spend more time interacting with their children in stimulating ways. Day-long LENA (Language Environment Analysis) recordings at 18 months confirm 20% more adult-child conversational turns per minute (effect size 0.068, p=0.005) and 17% more child vocalizations per minute (effect size 0.32, p=0.014) for children of female recipients.&lt;/p&gt;
&lt;p&gt;For male scholarship recipients, no analogous intergenerational benefits appear. Their partners are not more educated (in fact slightly less educated on tertiary rates), their children show no mortality improvement, and cognitive scores are if anything negative at age five (point estimate -0.22, p=0.069). The absence of effects is attributed to male scholarship recipients having caregivers — overwhelmingly mothers — with no more education than in the control group, and to children of male recipients being 8.7 percentage points less likely to live with their father.&lt;/p&gt;
&lt;p&gt;A cost-benefit analysis finds internal rates of return (IRR) of 27%–76% for a female-only means-tested scholarship program and 20%–51% for a mixed-gender program. The cost per under-three death averted ($15,184 for female-only) places the scholarship program within the range of the 10th-percentile most cost-effective WHO-recommended child health interventions.&lt;/p&gt;
&lt;p&gt;Scope conditions: the study estimates effects for students who qualified for senior high school but faced binding financial constraints in rural Ghana in 2008 — a population that is well-prepared academically but economically disadvantaged. Results may not generalize to students who would not have qualified for secondary school or to contexts where financial barriers are not binding.&lt;/p&gt;
&lt;p&gt;Q: What was the experimental design and who was in the study sample?
A: In 2008, 2,064 rural Ghanaian students who had been admitted to senior high school (SHS) but had not enrolled — typically due to inability to pay fees — were sampled. After a baseline survey, 682 were randomly selected (approximately one-third) by lottery to receive a four-year scholarship covering full tuition and fees for a day (non-boarding) student, stratified by district, school, gender, and exam-year cohort. The two-thirds comparison group received no scholarship. Students were on average 17 years old at baseline and just over 31 at the last follow-up in Spring 2023.&lt;/p&gt;
&lt;p&gt;Q: How large was the scholarship&amp;rsquo;s effect on educational attainment?
A: Scholarship receipt raised SHS completion from 39.8% to 67.2% among women (a 69% increase) and from 49.7% to 77.9% among men (a 57% increase). Overall, the scholarship led to an average of 1.33 more years of education. For women only, it also significantly raised tertiary education: by 2023, scholarship receipt increased tertiary completion by 10.8 percentage points for women, but had no significant tertiary effect for men.&lt;/p&gt;
&lt;p&gt;Q: What were the effects on fertility and family formation for female scholarship recipients?
A: By 2013, female recipients were 6.9 percentage points less likely to have ever been pregnant (base: 48.3% in control), driven almost entirely by a 7 percentage point (17%) reduction in unwanted pregnancies. By 2019, recipients were still 6 percentage points less likely to have started childbearing and had 0.152 fewer children on average (p=0.065). Total fertility eventually caught up by 2022. By 2016, female recipients were 12.1 percentage points (24% of control mean) less likely to have ever lived with a partner, and by 2019 were 6.2 percentage points less likely to be married or cohabiting. Conditional on having a partner, they were significantly more likely to have a partner who completed tertiary education (p=0.071).&lt;/p&gt;
&lt;p&gt;Q: What were the effects on fertility and family formation for male scholarship recipients?
A: Male recipients showed few changes in fertility or marriage behavior. They were 7.8 percentage points (30% of control mean) more likely to still be living with their parents as of 2019. Their partners were not more educated; in the cognitive games subsample, treatment actually reduced the share of partners with tertiary education by 3.6 percentage points from a control base of 4.3%.&lt;/p&gt;
&lt;p&gt;Q: What were the child mortality results for children of female scholarship recipients?
A: Among children of female control respondents, 3.5% died before age one and 4.0% before age three. These fell to 1.7% (p=0.028) and 2.2% (p=0.065), respectively, among children of female recipients — approximately a halving of under-one and under-three mortality. These point estimates are robust to varying the covariates (linear vs. fixed effects for birth year, dropping or adding controls). After multiple-hypothesis testing adjustment using the Romano-Wolf step-down procedure, the p-value for survived-to-one rises from 0.028 to 0.119.&lt;/p&gt;
&lt;p&gt;Q: What were the child mortality results for children of male scholarship recipients?
A: The estimated effects for children of male recipients were smaller and statistically insignificant: a 1.4 percentage point increase in survived-to-one (p=0.161) and 0.9 percentage points in survived-to-three (p=0.549). These estimates are not significantly different from those for female recipients. Results were sensitive to sample perturbations given the smaller sample: only 26 of 1,016 children of male respondents died before age one.&lt;/p&gt;
&lt;p&gt;Q: What child cognitive development gains did children of female scholarship recipients show, and at what ages?
A: No significant differences emerged at 18 months (-0.066 SDs, p=0.489), 2.5 years (-0.024 SDs, p=0.850), or 3.5 years (0.026 SDs, p=0.736). Significant gains appeared at age five (0.238 SDs, p=0.005) and age seven (0.252 SDs, p=0.035). Effects span language (0.15 SDs at five; 0.27 SDs at seven), math and numeracy (0.15 SDs; 0.26 SDs), spatial reasoning (0.20 SDs; 0.12 SDs), and executive function (0.25 SDs; 0.20 SDs), but not socio-cognitive development. These effect sizes fall between the 75th and 80th percentile of educational RCT effect sizes in low- and middle-income countries.&lt;/p&gt;
&lt;p&gt;Q: What cognitive development effects did children of male scholarship recipients show?
A: No significant positive effects emerged at any age. Point estimates were negative at all ages except 18 months, and marginally significantly negative at age five (-0.22 SDs, p=0.069). The difference in treatment effects between children of male and female recipients is statistically significant at age five (p=0.005).&lt;/p&gt;
&lt;p&gt;Q: Why do cognitive gains appear only at age five and not earlier?
A: The authors offer three interpretations: first, that the cognitive tests for younger children are noisier instruments (cross-sectional and longitudinal correlations within domains are much lower for 1.5-year tests than 5-year tests); second, that impacts on cognitive development may take time to materialize; third, that marginal survivors in the treatment group may start with a cognitive deficit (e.g., surviving a cerebral malaria episode), and maternal education effects require time to overcome this initial handicap. Gains concentrate on skills underlying literacy and numeracy, consistent with more educated mothers bridging home and school environments.&lt;/p&gt;
&lt;p&gt;Q: What is the primary mechanism driving intergenerational effects?
A: The primary mechanism is changes in parenting behaviors, not income. Female recipients do not invest more money in children (no significant difference in SES index or child investment index). Instead, they seek more prenatal care, engage in significantly more preventive health behaviors, and interact more with their children in cognitively stimulating ways. Day-long LENA recordings at 18 months show 20% more conversational turns per minute (effect size 0.068, p=0.005) and 17% more child vocalizations per minute (effect size 0.32, p=0.014). Caregiver reports confirm more playing, singing, and doing simple mathematics with children.&lt;/p&gt;
&lt;p&gt;Q: Does the income effect of scholarship receipt explain the child outcomes?
A: No. Duflo et al. (2024) find no significant earnings impacts until 2019 or later, meaning children tested at ages five and seven by 2023 largely grew up before their mothers&amp;rsquo; earnings improved. The household SES index shows only a 0.107 SD gain (p=0.103), indistinguishable from the effect for children of male recipients. There is also no evidence of a quality-quantity trade-off: caregivers of scholarship recipients do not have fewer children to care for.&lt;/p&gt;
&lt;p&gt;Q: Does the increase in maternal age at birth explain the child mortality reduction?
A: It is not the primary driver. Maternal age at birth increases by only 0.349 years on average (p=0.142) for children of female recipients, and 0.64 years for first-born children (p=0.040). Point estimates on mortality for first-born children are somewhat smaller than for the full sample, suggesting maternal age is not the main channel. Moreover, maternal age at birth falls for children of male recipients yet their survival point estimates are positive, which further argues against maternal age as the primary mechanism.&lt;/p&gt;
&lt;p&gt;Q: How does the education of the primary caregiver mediate the results?
A: For 84% of children in the sample, the primary caregiver is the child&amp;rsquo;s mother. Children of female scholarship recipients have caregivers who are 25 percentage points more likely to have completed secondary school and 5 percentage points more likely to have completed tertiary education. Children of male scholarship recipients have caregivers with no more education than the control group, because the recipients&amp;rsquo; partners — the typical caregivers — are not more educated. Treatment effects for female recipients are not altered when father&amp;rsquo;s education is added as a control, confirming maternal education as the main driver.&lt;/p&gt;
&lt;p&gt;Q: What threat to validity arises from co-residence of the father?
A: Children of male scholarship recipients are 8.7 percentage points less likely to live with their father (p=0.024), compared to no such effect for children of female recipients (92% of whom live with their scholarship-recipient mother). LENA recordings show negative treatment effects for children of male recipients — fewer adult words and conversational turns — consistent with father absence mechanically reducing auditory engagement and possibly leaving single mothers less time to verbally interact with each child.&lt;/p&gt;
&lt;p&gt;Q: How are multiple-hypothesis testing concerns addressed?
A: The pre-analysis plan pre-specified child survival and child cognitive development as primary outcomes. The authors apply the Romano-Wolf step-down procedure for multiple hypothesis testing adjustment. After adjustment, the p-value for survived-to-one for children of female recipients rises from 0.028 to 0.119; the cognitive development effects at age five and seven remain significant.&lt;/p&gt;
&lt;p&gt;Q: How does the study address potential sample selection bias in the child outcomes sample?
A: The authors use entropy balancing (Hainmueller, 2012) to reweight observations so that baseline (2008) characteristics are balanced between treatment and control within the subsample of recipients who had children. Results are qualitatively unchanged for both female and male recipients. The authors also note that children of female recipients are younger on average (4.71 months, p=0.067), which is why the study collects data at fixed age windows (14-22 months, 2.5 years, 3.5 years, 5 years, 7 years) rather than in a single cross-sectional wave.&lt;/p&gt;
&lt;p&gt;Q: What is the cost-effectiveness and cost-benefit result for secondary school scholarships?
A: Social costs are estimated at $585 per recipient for a mixed-gender program and $505 for a female-only program (combining school fees, materials, and foregone wages). The cost per under-three death averted is $23,582 for mixed-gender and $15,184 for female-only — placing the female-only program within the range of the 10th-percentile most cost-effective WHO-recommended child health interventions. The IRR is 27%–76% for a female-only means-tested scholarship program and 20%–51% for a mixed-gender program. These are likely conservative, as they exclude welfare gains from avoiding unwanted pregnancies, greater female agency, and recipient health benefits.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the experiment and to what population do findings generalize?
A: The study estimates ITT effects for students in rural Ghana who qualified for SHS on exam performance but faced binding financial constraints in 2008 — a population that is academically prepared but economically disadvantaged. Results do not directly apply to students who would not have qualified, to contexts without binding financial barriers, or to settings where secondary school quality or the marriage market differs substantially. The study also cannot yet observe complete fertility, since scholarship-lottery participants were only 31 years old on average at last follow-up.&lt;/p&gt;
&lt;p&gt;LENA (Language Environment Analysis): A day-long recording device worn by a child that uses speech recognition software to generate count-based metrics — adult word count, adult-child conversational turns, and child vocalizations per minute — providing an objective measure of the child&amp;rsquo;s auditory environment and caregiver engagement quality without reliance on self-report.&lt;/p&gt;
&lt;p&gt;IRT Score (Item Response Theory Score): A latent-trait measure of child cognitive ability estimated from a one-parameter logistic model applied to binary correct/incorrect responses across cognitive game questions, assigned a difficulty level to each question and a latent ability to each child, then standardized. Used as the primary cognitive development outcome across age windows.&lt;/p&gt;
&lt;p&gt;Incarceration Effect: The hypothesis that education delays fertility mechanically only while students are in school (analogous to incarceration preventing activity), with no persistent effect once they exit. The authors rule this out by showing that the fertility gap between female treatment and control groups persists well after the majority of scholarship recipients have graduated.&lt;/p&gt;
&lt;p&gt;Quality-Quantity Trade-off (Becker 1991): The economic framework predicting that more educated parents, facing higher opportunity costs of children and lower costs of investing in child quality, will have fewer but better-invested-in children. The authors find delayed and reduced fertility but do not find that recipients have fewer children to care for in the cognitive assessment sample, suggesting the child quality gains operate primarily through parenting practices rather than resource concentration.&lt;/p&gt;
&lt;p&gt;Intent-to-Treat (ITT) Effect: The treatment effect estimated by comparing all lottery winners to all losers regardless of whether winners actually enrolled, which captures the effect of the scholarship offer (including compliance costs). The cost-benefit analysis uses ITT estimates, so the cost of subsidizing inframarginal students who would have attended anyway is incorporated.&lt;/p&gt;
&lt;p&gt;Entropy Balancing: A reweighting procedure (Hainmueller, 2012) that assigns weights to observations in the control group so that the weighted distribution of baseline covariates matches that of the treatment group, used to assess whether imbalances in the subsample of participants who had children drive the results. The authors apply this as a robustness check for both mortality and cognitive development outcomes.&lt;/p&gt;
&lt;p&gt;Unwanted Pregnancy: A pregnancy reported by the respondent as unplanned at the time of conception, which the authors use to distinguish fertility reduction from a change in desired fertility versus a reduction in unintended out-of-wedlock pregnancies. The scholarship&amp;rsquo;s early fertility impact is almost entirely a reduction in unwanted pregnancies (7 percentage point decline, 17% reduction).&lt;/p&gt;</description></item><item><title>International Trade Responses to Labor Market Regulations</title><link>https://macropaperwarehouse.com/papers/international-trade-responses-to-labor-market-regulations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/international-trade-responses-to-labor-market-regulations/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; This paper asks whether differences in labor market regulations — specifically payroll taxes and minimum wages — shape countries&amp;rsquo; comparative advantage in the cross-border provision of labor-intensive services. The question has broad policy relevance: if lower labor standards confer a systematic trade advantage, countries may face pressure to race to the bottom in labor protections, and political support for economic integration may erode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Identification.&lt;/strong&gt; The paper exploits the EU &amp;ldquo;posting policy,&amp;rdquo; a large trade program established in 1959 that allows firms in one EU member state to temporarily send their employees to perform service contracts in another member state. In 2017, posting accounted for roughly one-third of all within-EU trade in services (approximately 2% of EU GDP), involving about 2 million workers (in full-time equivalents) in 2019. The setting is analytically attractive because competing foreign and domestic firms serve the same customers at the same physical location using shared capital, holding most determinants of comparative advantage constant while labor market regulations vary by the firm&amp;rsquo;s country of origin.&lt;/p&gt;
&lt;p&gt;Under posting rules, payroll taxes are generally origin-based (exporting firms pay their home country&amp;rsquo;s tax rate) but become destination-based when contracts exceed a regulatory duration threshold (12 months pre-2010, 24 months from 2010–2020, 18 months from 2020 onward). Minimum wages are destination-based: foreign firms must match the importing country&amp;rsquo;s statutory minimum wage floor when it exceeds the workers&amp;rsquo; home-country wage level. This generates the paper&amp;rsquo;s key identifying variation — payroll taxes and minimum wages vary across countries, over time, and within countries across sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The author uses administrative A1 social security forms filed for every EU posting contract from 2007–2018, collected from 25 EU member states, supplemented by micro-level national posting registries in Belgium (LIMOSA), France (SIPSI), and Luxembourg (matched employer-employee data). Labor cost data (wages, payroll tax rates, minimum wages) come from Eurostat and the OECD Taxing Wages Dataset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in three steps. First, it documents steady-state cross-sectional correlations between bilateral posting flows and labor cost differentials. Second, it estimates difference-in-differences (DiD) elasticities from four quasi-natural experiments. Third, it estimates a theory-consistent gravity model using all sources of variation across 25 EU countries from 2009–2018.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Steady-state correlation:&lt;/em&gt; A strong negative relationship exists between bilateral posting flows and labor cost differentials, with a cross-sectional elasticity of approximately –0.58 (SE 0.08). In sharp contrast, the relationship between bilateral goods trade and labor cost differentials is weak and if anything marginally positive (point estimate +0.13), confirming that labor cost differences are a distinctive driver of trade specifically in labor-intensive services rather than goods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Belgian tax shift (2016–2018):&lt;/em&gt; When Belgium cut employers&amp;rsquo; social security contributions from 33% to 25%, imports of posting services into Belgium slowed relative to France (a neighboring control country on parallel pre-reform trends). The reduced-form elasticity of posting imports with respect to the payroll tax rate is 1.45 (SE 0.3).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Luxembourg EU regulation reform (2010):&lt;/em&gt; A new EU regulation required temporary employment agencies in border regions to pay destination-based payroll taxes, raising statutory rates faced by Luxembourgish exporters from 15% to 44%. Posting exports from Luxembourg&amp;rsquo;s temporary employment sector fell by 40% relative to the pre-reform level and relative to the domestic (control) sector, while the sheltered road transportation sector showed no response. The reduced-form elasticity with respect to the statutory payroll tax rate is –1.55 (SE 0.24), and the triple-difference estimate is –1.37 (SE 0.08).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Bunching at duration thresholds:&lt;/em&gt; The distribution of posting contract lengths in France (which has the EU&amp;rsquo;s highest payroll taxes) shows a sharp spike just below the 24-month payroll tax threshold. When the threshold was moved to 18 months in 2020, excess mass migrated to the new threshold, confirming that bunching reflects behavioral responses to the tax notch rather than reference-point effects. This documents that payroll tax differentials shape not only the quantity (extensive margin) but also the length (intensive margin) of posting contracts.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;German minimum wage reform (2015):&lt;/em&gt; Germany&amp;rsquo;s introduction of a national minimum wage of €8.50 per hour — which was already binding on construction workers through a sectoral minimum, but not on foreign firms providing non-construction services — caused postings to Germany in manufacturing to fall by approximately 60% relative to the construction (control) sector. The reduced-form elasticity is –1.34 (SE 0.43). Heterogeneity analysis shows that export declines were monotonically larger for low-wage origin countries where the new minimum wage was binding, and placebo estimates using Germany&amp;rsquo;s high-wage neighboring countries (where minimum wage requirements did not change) are statistically indistinguishable from zero.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Gravity estimates:&lt;/em&gt; The preferred specification (PPML with origin-year, destination-year, and pair fixed effects, exploiting bilateral variation in minimum wage bindingness across origin countries) yields a model-implied trade elasticity θ of –1.2 (SE 0.2). The range across specifications is –1.2 to –2.4. These estimates are smaller than the goods trade elasticity (typically estimated around 5) and below the medium-run reduced-form elasticities from the DiD case studies, consistent with short-run gravity estimates capturing only partial adjustment while DiD designs measure longer-run equilibrium responses.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Policy Counterfactual.&lt;/strong&gt; The paper&amp;rsquo;s estimates imply that the Bolkestein Directive — which proposed exempting foreign firms from all destination-country labor regulations — would have doubled exports of physical services from Eastern European countries (upper bound), as their cost advantage would have been dramatically amplified by removal of minimum wage requirements. Counterpart to this export boom, average posted workers&amp;rsquo; wages would have fallen by approximately 16%, since workers would lose their entitlement to destination-country minimum wages. The paper documents that the Bolkestein controversy — sparked by the &amp;ldquo;Polish plumber&amp;rdquo; debate in early 2005 — coincided with a sharp and persistent drop in French voter support for the EU constitutional treaty, which was subsequently rejected.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results apply specifically to trade in physical (labor-intensive) services traded via temporary worker posting within the EU, where productivity differences across countries for these tasks are plausibly small (Balassa-Samuelson), making institutional factors a primary driver of wage differences. The paper estimates intent-to-treat effects, assuming perfect compliance by exporting firms. The paper does not perform a comprehensive welfare analysis covering consumer price effects or general equilibrium wage and trade-balance responses.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the EU posting policy and why does it provide an unusually clean setting for identifying the causal effect of labor regulations on trade?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The EU posting policy, established in 1959, allows firms in one EU member state to temporarily send employees to perform service contracts in another member state. The policy keeps most determinants of comparative advantage constant — competing foreign and domestic firms serve the same customers at the same physical location using shared capital — while labor market regulations vary by the firm&amp;rsquo;s country of origin. Productivity differences for physical services across countries are also plausibly limited (Balassa-Samuelson), making institutional wage differences the primary cost driver. Enforcement is facilitated by the on-site nature of the service, and administrative A1 forms create a direct measure of the number of workers involved in cross-border transactions without a minimum reporting threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the three sources of labor cost differences the paper identifies and quantifies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Foreign firms competing for posting contracts face different costs through three channels: (i) equilibrium gross wages differ across origin countries, reflecting both productivity differences and institutional/information frictions that allow wage discrimination between posted and domestic workers; (ii) payroll tax rates are origin-based and differ substantially across countries (for example, France&amp;rsquo;s employer payroll tax is approximately 40% versus approximately 15% for Luxembourg before the 2010 reform); and (iii) destination-specific minimum wages impose a &amp;ldquo;posting allowance&amp;rdquo; on firms from countries with lower wages, equal to the shortfall between the firm&amp;rsquo;s home-country wage and the importing country&amp;rsquo;s minimum wage floor. Micro-level wage data from France confirm that most posted workers from low-wage countries are paid exactly at the French minimum wage, demonstrating the bindingness of the third channel, while French workers performing the same tasks receive wages near the French average (approximately €21.1 per hour versus a minimum wage of approximately €10 per hour in 2018).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What does the cross-sectional evidence show about the relationship between labor cost differentials and posting flows, and how does this compare to goods trade?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Bilateral posting flows and bilateral labor cost differentials have a tight negative cross-sectional relationship with an estimated elasticity of –0.58 (SE 0.08), indicating that countries export more posting services when their labor costs are substantially below those of the destination country. The same exercise applied to bilateral goods trade yields a coefficient of +0.13 (SE 0.07) — weak and marginally positive — consistent with goods trade being driven by capital, technology, and scale rather than labor cost differentials. The gap confirms that labor cost differences are a distinctive comparative advantage mechanism for labor-intensive services but not for less labor-intensive goods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What does the Belgian tax shift reform demonstrate, and how is identification established?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Belgium cut employer social security contributions from 33% to 25% between 2016 and 2018 in a revenue-neutral reform (financed by VAT, excise duties, and dividend taxes). The DiD compares posting imports into Belgium with those into France (a neighboring, similarly sized importer on parallel pre-reform trends). Belgium and France imported posting services at similar rates before 2015; Belgian imports slowed immediately after the reform while French imports continued growing. The reduced-form elasticity of posting flows with respect to the destination payroll tax rate is 1.45 (SE 0.3). The elasticity with respect to total labor cost is 3.7 (SE 0.7). No discernible response is detected for trade in manufacturing goods, providing a within-reform placebo. A synthetic control using all available importing countries yields a smaller elasticity of 0.6 (SE 0.22).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the Luxembourg EU regulation reform (2010) improve on the Belgian case for identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The 2010 EU regulation required temporary employment agencies in border regions to pay destination-based (rather than origin-based) payroll taxes, raising statutory rates for Luxembourgish exporters from 15% to 44%. Unlike the Belgian reform, this created within-country variation: the same Luxembourgish firms were exposed in the temporary employment sector but not in road transportation (which received a 10-year exemption). This within-exporter, cross-sector design controls for all Luxembourg-wide demand or supply shocks. Posting exports by the temporary employment sector fell 40% relative to pre-reform levels and relative to the domestic (control) sector, while road transportation posting showed zero response. The monthly data confirm the drop occurred in the exact month following the regulation with no anticipation. The triple-difference elasticity (with respect to the payroll tax rate) is –1.37 (SE 0.08).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What does the bunching evidence at payroll tax duration thresholds add to the DiD findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When posting contracts exceed a regulatory duration threshold (24 months during 2010–2020, then 18 months from July 2020), payroll taxes become destination-based. Because France has the highest payroll tax in the EU, all exporting firms face strong incentives to avoid crossing the threshold. The distribution of posting contract lengths in France shows sharp excess mass just below 24 months in 2017. When the threshold moved to 18 months in 2020, the excess mass migrated to the new threshold while diminishing at the old one, confirming that bunching is tax-motivated rather than driven by a reference-point at 24 months. This establishes that labor tax differentials shape not only the quantity of posting contracts (extensive margin) but also their length (intensive margin).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the main findings from the German minimum wage reform, and how do the heterogeneity tests strengthen identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Germany&amp;rsquo;s January 2015 introduction of a national minimum wage of €8.50 per hour (preceded by a sectoral minimum in meat processing in August 2014) raised wage costs for foreign firms providing non-construction services, but not for construction firms already covered by a higher sectoral minimum. Postings to Germany in manufacturing fell by approximately 60% relative to the construction (control) sector, implying a reduced-form elasticity of –1.34 (SE 0.43). Two heterogeneity tests reinforce identification: (i) within the treated German sector, posting declines are monotonically increasing in the degree to which the new minimum wage is binding in the origin country, with Luxembourg (where the minimum is non-binding) showing no statistically significant effect; (ii) the same industry-by-country comparison in Germany&amp;rsquo;s high-wage neighboring countries (which did not change minimum wage rules) yields placebo estimates statistically indistinguishable from zero. The reform raised wages for German workers by an average of 6% (and up to 10% for most affected workers) but automatically raised wages for posted workers by an average of 40%, doubling them for workers from the poorest sending countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the gravity model estimates compare to the reduced-form DiD estimates, and what explains the difference?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Across gravity specifications, model-implied elasticities range from –0.75 to –2.4. The preferred specification — PPML with pair fixed effects, destination-year fixed effects, and origin-year fixed effects — yields θ = –1.2 (SE 0.2). These estimates are systematically below the medium-run reduced-form DiD estimates because: (a) the gravity model uses nationwide average tax and minimum wage measures that introduce measurement error relative to the sector-specific reforms in the case studies; and (b) the gravity model captures year-to-year (short-run) adjustments, while the DiD designs compare outcomes several years before and after the reform, picking up longer-run equilibrium reallocation. The finding that responses grow over time mirrors evidence on dynamic adjustment in goods trade (Boehm, Levchenko and Pandalai-Nayar, 2023), and contradicts the conventional belief that fiscal devaluations boost exports only in the short run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the gravity model reveal about trade in goods as a function of posting-specific wage costs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the same gravity specification is applied to bilateral goods trade rather than posting flows, posting-specific wage costs have a positive — not negative — coefficient on goods trade. This is inconsistent with a model where unobserved shocks affect all exports symmetrically, and instead suggests a small substitution effect: as the cost to import labor services rises (due to tighter posting regulations), countries substitute toward importing goods. For some activities (such as meat processing), importing finished goods is a partial substitute for importing labor services to produce on-site.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the Bolkestein Directive counterfactual implications, and how do they connect to the political economy evidence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Bolkestein Directive (proposed 2005) would have enforced a &amp;ldquo;country of origin principle,&amp;rdquo; exempting foreign posting firms from destination-country minimum wages. Using the preferred lower-bound elasticity from the gravity model (column 5, θ = –1.2) and an upper bound averaging gravity and DiD estimates, the paper predicts this would have at least doubled exports of labor services from Eastern European countries. Tax revenues collected on posted workers in origin countries would also double. However, average posted workers&amp;rsquo; wages would fall by approximately 16%, as workers would lose their entitlement to destination-country minimum wages. The paper documents that the Bolkestein controversy — introduced to the EU Parliament in March 2005 and popularized via the &amp;ldquo;Polish plumber&amp;rdquo; trope — coincided with a sharp and permanent drop in French voter support for the EU constitutional treaty, which was subsequently rejected in referendum. This is consistent with Rodrik&amp;rsquo;s (1998) hypothesis that voters withdraw support for economic integration when comparative advantage appears to be based on institutional choices that conflict with importing countries&amp;rsquo; social norms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper handle the incidence of payroll taxes — does the canonical result that payroll taxes are fully passed through to workers hold in this context?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The canonical competitive labor market model predicts full pass-through of payroll taxes to workers&amp;rsquo; net wages, leaving firms&amp;rsquo; labor costs unchanged. The paper finds substantial trade responses to payroll tax reforms, inconsistent with full pass-through. Nominal rigidities — including binding minimum wages that constrain downward wage adjustment — help rationalize incomplete pass-through in the EU context. The paper estimates elasticities both with respect to statutory tax rates (the reduced-form, making no incidence assumption) and with respect to total wage costs (instrumented with the reform, allowing for gross wage responses). Wage data from Belgium show no distinguishable wage response to the Belgian tax cut, suggesting the incidence fell largely on firms&amp;rsquo; costs rather than workers&amp;rsquo; wages in that episode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What do the destination-based taxation counterfactual (tax cooperation proposal) calculations show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A proposal to shift all posting payroll taxation to destination-based rates would decrease posting exports from Eastern European countries by between 10% and 25%. Despite the volume reduction, total taxes collected on posted workers would still increase under this reform even when the upper-bound elasticity (approximately –3.7 with respect to total wage cost) is used, because a 1% increase in the payroll tax rate translates to a much smaller proportional increase in total wage cost.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Posted workers / posting policy:&lt;/strong&gt; Employees temporarily sent by their employer (the &amp;ldquo;exporting firm&amp;rdquo;) to perform a service contract in another EU member state. Posted workers maintain their employment contract with the firm in the origin country but physically work in the destination country. This creates a setting where competing domestic and foreign firms serve the same customers at the same location under different labor regulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Posting allowance:&lt;/strong&gt; The additional wage component that exporting firms must pay to posted workers to satisfy the destination country&amp;rsquo;s minimum legal wage when that minimum exceeds the firm&amp;rsquo;s home-country wage level. The posting allowance is zero when the exporting country&amp;rsquo;s average wage already exceeds the destination minimum wage; it can be large for low-wage origin countries. The allowance enters directly into firms&amp;rsquo; labor costs and is the minimum-wage channel of the paper&amp;rsquo;s labor cost formula.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Origin-based vs. destination-based payroll taxation:&lt;/strong&gt; Under posting, payroll taxes are normally assessed in the country where the exporting firm is registered (origin-based), creating tax rate differentials between competing firms in the same job site. EU regulations convert payroll taxes to destination-based when posting contracts exceed a duration threshold, eliminating the tax advantage of lower-tax origin countries for those contracts. The 2010 EU regulation additionally imposed destination-based taxation on border-region temporary employment agencies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trade elasticity for physical services (θ):&lt;/strong&gt; The structural parameter from the Eaton-Kortum (2002) gravity model that governs the elasticity of bilateral posting flows with respect to changes in firms&amp;rsquo; total wage costs when exporting services from country i to country j. The paper&amp;rsquo;s preferred estimate is –1.2 (from gravity estimation) to approximately –1.3 to –1.5 (from reduced-form DiD designs), substantially smaller in absolute value than the goods trade elasticity (typically estimated around 5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social standards as comparative advantage:&lt;/strong&gt; The paper uses &amp;ldquo;standards&amp;rdquo; to refer to countries&amp;rsquo; domestic policy choices about payroll taxes (which finance social insurance programs) and minimum wages (which set worker protection floors). The paper demonstrates that these regulatory choices — distinct from productivity differences, factor abundance, or technology — create measurable cost advantages that shape specialization in labor-intensive service sectors. This is in contrast to &amp;ldquo;benign&amp;rdquo; sources of comparative advantage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bolkestein Directive / country of origin principle:&lt;/strong&gt; A 2005 EU legislative proposal that would have required posting firms to operate under the laws of their home country when supplying services in other EU member states, eliminating the hard core of destination-country regulations (including minimum wages) that the 1996 Posted Workers Directive had imposed on foreign firms. The proposal was withdrawn after a wave of protests and its association with a sharp fall in French support for the EU constitutional treaty.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bunching / notch at duration threshold:&lt;/strong&gt; A behavioral response in which exporting firms strategically keep posting contract lengths below the duration threshold that triggers destination-based payroll taxation, generating an excess mass in the distribution of contract lengths just below the threshold. The paper uses this bunching, together with the movement of the threshold from 24 to 18 months in 2020, as additional evidence that payroll tax differentials affect the intensive margin of posting.&lt;/p&gt;</description></item><item><title>Investing in Influence: Investors, Portfolio Firms, and Political Giving</title><link>https://macropaperwarehouse.com/papers/investing-in-influence-investors-portfolio-firms-and-political-giving/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/investing-in-influence-investors-portfolio-firms-and-political-giving/</guid><description>&lt;p&gt;This paper investigates whether institutional investors influence the political activities of their portfolio firms, using political action committee (PAC) giving as a window into the broader question of whether institutional investors can leverage their concentrated ownership to extract benefits from portfolio firms for their own interests rather than those of their clients.&lt;/p&gt;
&lt;p&gt;The sample covers 574 institutional investors (those with at least $100 million in assets under management, i.e., 13-F filers) matched to 2,456 portfolio firms that had PACs, over the period 1980–2018. The primary source of variation is the first acquisition by an institutional investor of at least one percent of a portfolio firm&amp;rsquo;s outstanding shares, yielding 68,387 large acquisition events. PAC giving data come from FEC records matched by name to investor and firm entities. The main regression specification examines how the relationship between investor and firm PAC contributions to the same congressional district changes after such an acquisition, using a saturated set of fixed effects including firm × investor, firm × congressional district, firm × election cycle, investor × congressional district, investor × election cycle, and district × election cycle.&lt;/p&gt;
&lt;p&gt;The central finding is that, following a large block purchase, a firm&amp;rsquo;s PAC giving mirrors more closely that of the acquiring investment management company. In the preferred specification (column 8 of Table 2), the probability that a portfolio firm gives to a politician supported by its investor&amp;rsquo;s PAC increases by 31 percent after an acquisition. Using a cosine similarity measure of investor-firm PAC giving, the mean similarity of 0.10 at the acquisition cycle rises by 0.02–0.03 (a 20–30 percent increase) by the fourth post-acquisition election cycle.&lt;/p&gt;
&lt;p&gt;A key identification concern is that acquisitions may be driven by shared political preferences rather than representing a causal effect. To address this, the authors exploit stock index inclusions as exogenous shifters of institutional investor block purchases: when a firm is added to an index for the first time, passive indexers are compelled to rebalance toward that firm regardless of political alignment. Restricting to 5,601 index-inclusion acquisitions by passive investors, the authors find near-identical effect sizes (beta1 = 0.0132 in column 8 versus 0.0135 in the full sample), and an event study shows no pre-trend in giving convergence for the index subsample, in contrast to a slight pre-trend in the full sample. Divestment events exhibit the symmetric negative pattern: the interaction of post-divestment and investor PAC giving falls by between -0.074 and -0.058 across specifications.&lt;/p&gt;
&lt;p&gt;The authors argue that investors drive the convergence rather than portfolio firms adjusting investor preferences. Around acquisition dates, firms exhibit a larger drop in between-election-cycle cosine similarity than investors do. In a difference-in-differences comparison of the acquisition period relative to the preceding period, the difference in stability between investors and firms is 0.075 (significant at the 1 percent level), indicating that firms shift their giving more than investors. Investors obtaining a board seat at the portfolio firm amplifies the effect: in the preferred specification, the board-seat interaction is more than twice as large as the acquisition-alone interaction.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis provides evidence that the convergence reflects investors&amp;rsquo; partisan tastes rather than coordinated profit-maximizing political strategy. Acquisitions by more partisan investors (those whose giving is more skewed toward one party) produce a convergence coefficient roughly twice as large (0.020) as less partisan investors (0.010). Private fund families show more than twice the convergence effect of publicly owned fund families. The partisan composition of firm giving also shifts: a firm acquired by an investor giving exclusively to Republicans sees its Republican share increase by 2.8 percentage points relative to a baseline of 47.4 percent (a 5.9 percent increase).&lt;/p&gt;
&lt;p&gt;Finally, higher overall institutional ownership is associated with an increase in total PAC giving at the firm level, and this expanded giving does not go disproportionately to politicians on committees overseeing issues the firm actively lobbies — suggesting the ownership-driven increment in political spending is non-strategic from the firm&amp;rsquo;s profit standpoint and likely serves investors&amp;rsquo; own interests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the central research question and why does it matter?&lt;/strong&gt;
The paper asks whether institutional investors influence the political giving of portfolio firms, motivated by the broader concern that the rise of institutional ownership — from 6 percent of U.S. public equities in 1950 to 65 percent in 2017 — concentrates not only economic but also political power in the hands of a small number of asset managers. This matters because if investors shape firms&amp;rsquo; PAC giving to serve investors&amp;rsquo; own preferences rather than firms&amp;rsquo; profit interests, it represents a misuse of corporate resources and a potential amplification of a small group&amp;rsquo;s political voice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What data are used and how is the sample constructed?&lt;/strong&gt;
The analysis draws on 13-F filings (investors with at least $100M AUM) from Thomson-Reuters, matched to FEC PAC records via fuzzy and manual name matching. The resulting sample contains 574 investors with PACs and 2,456 portfolio firms with PACs, spanning 1980–2018. The Cartesian product of investor-firm pairs is restricted to those connected by at least one large acquisition event (defined as first acquisition of at least 1 percent of outstanding shares), yielding 68,387 such events. PAC contributions are measured at the investor- and firm-congressional-district-election-cycle level, linked to House of Representatives winners using MIT Election Data files.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the baseline regression and what does it find?&lt;/strong&gt;
The baseline regression (equation 1) interacts Log Investor PAC with a Post indicator (equal to 1 after the first large acquisition and while the stake is maintained) at the investor-firm-congressional-district-election-cycle level, with a saturated set of fixed effects. The coefficient on the interaction (beta1) is positive and highly significant (p &amp;lt; 0.001) across all eight specifications, ranging from 0.013 to 0.032. In the preferred specification, the increase in giving similarity is 31 percent relative to the pre-acquisition baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do the authors establish causality and rule out endogenous acquisitions?&lt;/strong&gt;
The primary identification strategy uses first-time inclusions of firms in stock indices (approximately 1,000 indices tracked in the sample) as exogenous shifters: passive indexers must rebalance toward the included firm regardless of political alignment. This subsample of 5,601 index-inclusion acquisitions produces near-identical coefficient estimates (0.0132 versus 0.0135 in the full sample), and the event study for this subsample shows no pre-trend in giving convergence, unlike the slight pre-trend in the full sample. Equality of the two coefficients cannot be rejected at standard significance levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What evidence shows it is firms adjusting to investors rather than the reverse?&lt;/strong&gt;
The authors compute between-election-cycle cosine similarity separately for investors and firms around acquisitions. On average, investors exhibit more stable giving than firms at acquisition dates (Cos(xi,t, xi,t+1) &amp;gt; Cos(xf,t, xf,t+1)). The difference-in-differences estimate — comparing the acquisition period to the preceding period — is 0.075 (significant at 1 percent), indicating a relatively larger break in firm giving. Over a two-cycle window, the difference-in-differences estimate is 0.083, again indicating convergence is driven by firms shifting toward investors rather than the reverse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What role does board representation play?&lt;/strong&gt;
In approximately 5 percent of acquisitions in the sample, the investor obtains a board seat. In specifications that include both the acquisition effect (Post × Log Investor PAC) and a board-membership interaction (Board × Log Investor PAC), both terms are positive and significant at the 1 percent level. In the preferred specification, the board-seat interaction is more than twice as large as the acquisition-alone interaction, indicating that a direct governance channel — board representation — substantially amplifies the convergence in political giving.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the divestment analysis show?&lt;/strong&gt;
Symmetric to the acquisition results, divestment events (where an investor exits a stake of at least 1 percent held for at least one election cycle) are associated with a decline in investor-firm PAC giving correlation. Post-divestment interaction coefficients range from -0.074 to -0.058 across specifications, and an event study confirms the correlation falls sharply after the divestment cycle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does investor partisanship affect the magnitude of influence?&lt;/strong&gt;
Yes. Classifying investors as &amp;ldquo;More Partisan&amp;rdquo; (above-mean absolute deviation from 50/50 party split) versus &amp;ldquo;Less Partisan,&amp;rdquo; the interaction coefficient for More Partisan investors (0.020) is roughly twice that of Less Partisan investors (0.010). After a large acquisition by a fully Republican-giving investor, the acquired firm&amp;rsquo;s giving to that politician increases by 23.5 percent; the comparable figure for a Less Partisan investor is 7.6 percent. This pattern holds in both the full sample and the index-inclusion subsample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do private versus public fund families differ in their influence?&lt;/strong&gt;
Private fund families (e.g., Vanguard, Fidelity) show more than twice the convergence coefficient of publicly owned fund families (e.g., BlackRock, State Street, Invesco). The authors attribute this to private fund managers facing less outside scrutiny, allowing their giving to more readily reflect the preferences of owners and managers. Private investors also show greater partisan polarization: the 10th–90th percentile Republican-giving range for private investors is 6.3–100 percent, versus 21.7–88.3 percent for public investors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does increased institutional ownership expand overall firm PAC spending?&lt;/strong&gt;
Yes. In firm-year level regressions, institutional ownership is a positive and significant predictor of total firm PAC giving (significant at at least the 5 percent level in both cross-sectional and firm-fixed-effects specifications). Total corporate political expenditure by sample firms increased by nearly a factor of six over 1980–2018. The authors note that while many factors contribute, increased institutional ownership may be at least partly responsible for this expansion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does the additional giving driven by institutional ownership go to strategically important politicians for the firm?&lt;/strong&gt;
No. Regressions relating institutional ownership to giving to politicians on congressional committees overseeing issues the firm actively lobbies (a standard measure of politicians&amp;rsquo; strategic importance to firms) yield near-zero and statistically weak point estimates. In the preferred firm-fixed-effects specification, the share of total PAC giving devoted to such strategically relevant politicians is negatively associated with institutional ownership at marginal significance (p &amp;lt; 0.10), consistent with the interpretation that ownership-driven incremental political spending is non-strategic from the firm&amp;rsquo;s own profit perspective and expands total giving rather than displacing strategic giving.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the policy and legal implications?&lt;/strong&gt;
The authors flag three concerns: (i) the ownership-driven increment in political spending may represent a misuse of corporate resources that does not serve portfolio firm shareholders; (ii) it may constitute an illegal activity, since using a firm&amp;rsquo;s PAC to reimburse or proxy for an investor&amp;rsquo;s own political preferences can run afoul of campaign finance law; and (iii) it is a channel through which unequal resources amplify the political voice of a small number of fund managers at the expense of dispersed ultimate investors who are likely unaware of and do not sanction these contributions. The findings challenge the Supreme Court&amp;rsquo;s premise in Citizens United that corporate political speech reflects shareholder profit maximization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PAC comovement (investor-firm giving similarity):&lt;/strong&gt; The increase in the probability that a portfolio firm&amp;rsquo;s PAC donates to a politician also supported by an acquiring investor&amp;rsquo;s PAC, measured as the interaction coefficient between Log Investor PAC and a Post-acquisition indicator in the baseline regression. In the preferred specification this represents a 31 percent increase relative to the pre-acquisition baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cosine similarity (cross-time and cross-entity):&lt;/strong&gt; A measure defined as the Euclidean dot product between two vectors of PAC giving (either the same entity across adjacent election cycles, or investor versus firm in the same cycle), taking values between 0 and 1, where 1 indicates identical giving patterns. Used both to confirm convergence post-acquisition and to attribute that convergence to firm rather than investor adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Index-inclusion acquisition:&lt;/strong&gt; A large block purchase that results from a firm being added for the first time to a stock index tracked by a passive institutional investor, used as an exogenous shifter of investor stakes that is orthogonal to investor-firm political alignment. There are 5,601 such events in the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Partisanship (investor):&lt;/strong&gt; Classified as &amp;ldquo;More Partisan&amp;rdquo; if an investor&amp;rsquo;s absolute deviation from a 50/50 party split in PAC donations is above the sample mean. More partisan investors produce roughly twice the convergence effect on portfolio firm giving compared to less partisan investors, used as evidence that personal political preferences rather than profit-maximizing business strategy drive the convergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post indicator (Postift):&lt;/strong&gt; A binary variable equal to 1 for all election cycles following an investor&amp;rsquo;s first acquisition of at least 1 percent of a portfolio firm&amp;rsquo;s outstanding shares, and remaining 1 as long as the investor holds any stake in the firm. The key source of temporal variation in the baseline regression.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Strategically important politicians:&lt;/strong&gt; Members of Congress sitting on committees that oversee issues on which a firm actively lobbies, identified by crosswalking lobbying reports from the Senate Office of Public Records to relevant committee jurisdictions. Used to test whether ownership-driven political giving displaces or supplements firm-profit-motivated giving.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Board seat channel:&lt;/strong&gt; The mechanism through which investor influence on firm political giving is amplified when the investor obtains representation on the portfolio firm&amp;rsquo;s board of directors (present in approximately 5 percent of acquisitions). The board interaction coefficient is more than twice the acquisition-alone coefficient in the preferred specification.&lt;/p&gt;</description></item><item><title>Jackknife Standard Errors for Clustered Regression</title><link>https://macropaperwarehouse.com/papers/jackknife-standard-errors-for-clustered-regression/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/jackknife-standard-errors-for-clustered-regression/</guid><description>&lt;p&gt;Hansen (2025) makes a theoretical case for replacing the conventional cluster-robust variance estimator (CRVE) and heteroskedasticity-consistent (HC) standard errors with a specific jackknife variance estimator, V5, in linear regression with heteroskedastic and/or cluster-dependent observations.&lt;/p&gt;
&lt;p&gt;The paper identifies two fundamental problems with conventional CRVE1 and CRVE2 estimators. First, these estimators can be fully downward biased: Theorem 2 establishes that the infimum of E[v̂1²]/v² and E[v̂2²]/v² over all admissible regressor and covariance matrix configurations equals zero, meaning expected variance can be arbitrarily close to zero relative to the true variance. This pathology arises from extreme regressor leverage — specifically when one cluster dominates the sample — and holds even under homoskedasticity and clusterwise invertibility. Second, Theorem 5 shows that confidence intervals constructed from CRVE1 and CRVE2 standard errors have worst-case coverage probability equal to zero for any finite critical value c, making them unable to achieve any target coverage level uniformly over regression designs.&lt;/p&gt;
&lt;p&gt;Crucially, Hansen shows that even the conventional jackknife estimators V3 and V4, which are already in use (e.g., via Stata&amp;rsquo;s vce(jackknife) option), share these pathologies when clusterwise noninvertibility is present. Clusterwise noninvertibility occurs when deleting a single cluster renders the regressor matrix singular — as in regressions with cluster-level fixed effects, a single treated cluster, or sparse dummy variables. Stata&amp;rsquo;s existing fix of simply dropping noninvertible clusters is shown to be insufficient: under clusterwise noninvertibility, the infimum of E[v̂3²]/v² and E[v̂4²]/v² over the broader model class equals zero (Theorem 2, equations 19–20), and the corresponding confidence intervals also achieve worst-case coverage of zero.&lt;/p&gt;
&lt;p&gt;The proposed estimator V5 resolves these problems through three modifications to the conventional jackknife: (1) it uses a generalized (Moore-Penrose) inverse rather than dropping noninvertible clusters, ensuring all clusters are included; (2) it centers at the full-sample estimator β̂ rather than the mean of delete-one estimates; and (3) it omits the (G−1)/G degrees-of-freedom correction. Theorem 1 proves that E[V̂5] ≥ V in the positive semidefinite sense for all sample sizes, regressor matrices, and covariance structures — the estimator is never downward biased. Theorem 3 then shows that jackknife-based confidence intervals C̃5(c) have coverage probability bounded below by the Cauchy distribution for any c ≥ 1. With the conventional critical value c = 1.96, this guarantees finite-sample coverage of at least 70% and test size of at most 30%, regardless of regression design or error variance structure.&lt;/p&gt;
&lt;p&gt;To improve upon the conservative Cauchy bound in practice, the paper proposes a Satterthwaite adjusted t approximation for the jackknife t-ratio. The adjustment derives degrees of freedom K and a scale factor a from the eigenvalue structure of a design-dependent matrix D. Theorem 7 shows that a → 1 and K → ∞ as n → ∞ under mild regularity conditions (no single cluster dominates). Simulation evidence across six regression designs — varying regressor distributions (Normal, LogNormal with cluster dependence, sparse Dummy) and error structures (clustered normal, heteroskedastic) — with G ∈ {6, 12, 40, 100} clusters confirms that the Satterthwaite jackknife interval achieves coverage rates uniformly above 93% at the nominal 95% level even with G = 6, while CRVE1 intervals fall as low as 57% coverage in the LogNormal/heteroskedastic design. The empirical application extends Meng, Qian, and Yared (2015) on Chinese TV access and redistribution preferences, finding that the jackknife standard error for the TV access coefficient exceeds the CRVE1 standard error and the Satterthwaite interval is wider, affecting conclusions about statistical significance.&lt;/p&gt;
&lt;p&gt;The theory holds under Assumptions 1–4: correctly specified linear regression with zero conditional mean errors, full rank X, finite second moments, arbitrary cluster sizes and within-cluster covariance structure, and (for Theorem 3) normal errors. Results hold for fixed k and G, arbitrary n, and allow clusterwise noninvertibility subject to Assumption 3 (inference targets the well-identified regressors).&lt;/p&gt;
&lt;p&gt;Q: What is the central claim of the paper?
A: Conventional CRVE and HC variance estimators should be replaced by the jackknife estimator V5 in all linear regression contexts with heteroskedastic or clustered errors. V5 is never downward biased (its expectation weakly exceeds the true variance matrix), whereas CRVE1 and CRVE2 can be arbitrarily downward biased. The Satterthwaite-adjusted V5 confidence interval has excellent finite-sample coverage.&lt;/p&gt;
&lt;p&gt;Q: What is the worst-case bias of CRVE1?
A: The infimum of E[v̂1²]/v² over all admissible regressor matrices and covariance matrices equals zero (Theorem 2, equation 15). This means that for some data-generating process, the expected CRVE1 variance estimate is arbitrarily close to zero relative to the true variance — full downward bias. Importantly, this pathology holds even under homoskedasticity (Σ = Iₙ) and clusterwise invertibility; it is driven entirely by extreme regressor leverage.&lt;/p&gt;
&lt;p&gt;Q: Why is CRVE2 also fully downward biased, and how does its failure differ from CRVE1&amp;rsquo;s?
A: Theorem 2 (equation 16) shows that the infimum of E[v̂2²]/v² over F* also equals zero. The difference is that the proof for CRVE2 requires non-i.i.d. errors, meaning CRVE2&amp;rsquo;s failure requires manipulation of the covariance matrices in addition to extreme leverage, whereas CRVE1 can fail under i.i.d. errors from leverage alone.&lt;/p&gt;
&lt;p&gt;Q: What is clusterwise noninvertibility and why does it matter?
A: Clusterwise noninvertibility occurs when deleting a single cluster renders the regressor design matrix X&amp;rsquo;X − Xg&amp;rsquo;Xg singular. This happens in regressions with cluster-level fixed effects, with a cluster-level treatment indicator when only one cluster is treated, or with sparse dummy variables. The paper shows that the conventional jackknife estimators V3 and V4 become fully downward biased (infimum of expectation ratio equals zero) under clusterwise noninvertibility, even though Stata&amp;rsquo;s existing fix of dropping noninvertible clusters was explicitly designed to handle this case.&lt;/p&gt;
&lt;p&gt;Q: What is the key innovation in V5 that makes it robust to clusterwise noninvertibility?
A: V5 uses the Moore-Penrose generalized inverse in the delete-one-cluster estimator β̂₋g, ensuring all G clusters are included in the sum rather than discarding noninvertible clusters. It also centers at the full-sample β̂ rather than the mean β̄ of delete-one estimates, and omits the (G−1)/G degrees-of-freedom correction. The paper shows these three differences together imply V̂5 ≻ V̂4 ≻ V̂3 in the positive semidefinite ordering.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 1 establish about V5?
A: Theorem 1 proves E[V̂5] ≥ V in the positive semidefinite sense for all sample sizes, all regressor matrices, all covariance matrices, and under clusterwise noninvertibility. This conservative property holds without any assumption on cluster sizes, regressor leverage, within-cluster correlation, or heteroskedasticity beyond Assumption 1 (correct specification and finite second moments). The infimum of E[v̂5²]/v² equals 1 (equation 21), meaning the inequality is sharp.&lt;/p&gt;
&lt;p&gt;Q: What does the Cauchy distribution bound say, and how useful is it in practice?
A: Theorem 3 shows that for any c ≥ 1, the jackknife confidence interval C̃5(c) has coverage probability at least P[|ζ| ≤ c] where ζ is Cauchy. With c = 1.96, this guarantees coverage of at least 70% and test size of at most 30% uniformly over all regression designs and error structures (under normality). The bound is not tight in typical applications — actual coverage is much higher — but it provides the first generally applicable uniform guarantee for clustered/heteroskedastic regression. The Cauchy critical value at 5% is 12.7, far too large for practical use, so the bound is more useful as a theoretical guarantee than as a practical inference tool.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 5 establish about confidence intervals from CRVE1–CRVE4?
A: Under normality, the worst-case coverage probability of confidence intervals constructed from any of the four estimators v̂1 through v̂4 equals zero for any finite critical value c (equations 26–29). For v̂1 and v̂2, this holds over the clusterwise-invertible model class F*; for v̂3 and v̂4 it holds over the broader class F allowing noninvertibility. Zero worst-case coverage cannot be fixed by enlarging c, since the result holds for all finite c. This is not an impossibility result in the Bahadur-Savage sense; it is a statement that specific commonly-used intervals fail, while V5-based intervals succeed.&lt;/p&gt;
&lt;p&gt;Q: What is the Satterthwaite approximation and how is it implemented?
A: The Satterthwaite adjustment replaces the jackknife t-ratio&amp;rsquo;s exact finite-sample distribution — a ratio of a normal to the square root of a weighted sum of chi-squares — with a scaled t distribution with K degrees of freedom, where K and a scale factor a are matched by moment conditions on the eigenvalues of a design matrix D. The confidence interval is θ̂ ± v̂5 · t^{1−α/2}_K / a, and the p-value uses a Student t or F distribution with the same K and scale. These quantities can be computed without explicit eigendecomposition using trace formulas (equations 38–39), which are preferred computationally when G &amp;gt; k.&lt;/p&gt;
&lt;p&gt;Q: What do the simulations show about coverage rates?
A: Across six designs (three regressor types × two error types) and G ∈ {6, 12, 40, 100}, CRVE1 falls as low as 57% coverage in the LogNormal regressor/heteroskedastic error design with G = 6. CRVE2 has somewhat better but still substantially undercovering intervals. The conventional jackknife interval undercovers (as low as 85%) in leveraged/heteroskedastic designs. The Satterthwaite jackknife interval achieves coverage uniformly exceeding 93% across all designs, though it can be excessively conservative (100%) in some cases. All simulation estimates have standard errors less than 0.003 (20,000 replications).&lt;/p&gt;
&lt;p&gt;Q: Does the Satterthwaite adjustment vanish in large balanced samples?
A: Yes. Theorem 7 shows that if the design matrix is uniformly non-singular and no single cluster dominates (maxg ||Xg||² = o(n)), then a → 1 and K → ∞ as n → ∞. Consequently, the Satterthwaite interval converges to the standard normal interval in well-balanced large samples.&lt;/p&gt;
&lt;p&gt;Q: How does V5 relate to the classical HC3 estimator?
A: Under independent sampling (no clustering, ng = 1), V5 reduces to the HC3 estimator of Andrews (1991) and Davidson and MacKinnon (1993), which uses the Moore-Penrose inverse. The conventional jackknife V3/V4 reduce to the HC3 of MacKinnon and White (1985). The paper&amp;rsquo;s results thus provide a formal theoretical basis for the longstanding recommendation (by Efron-Stein 1981, MacKinnon-White 1985, Andrews 1991, and others) to use HC3/jackknife standard errors.&lt;/p&gt;
&lt;p&gt;Q: What is the practical recommendation for empirical researchers?
A: Replace all CRVE1/CRVE2/HC standard errors with V5, computed via the Moore-Penrose generalized inverse including all clusters. Report V5-based standard errors (which are never downward biased) alongside Satterthwaite-adjusted confidence intervals and p-values using equations (30)–(31). The adjustment parameters a and K differ per coefficient and must be computed separately for each. The paper advises against reporting a/v̂5 as an &amp;ldquo;adjusted standard error&amp;rdquo; since that quantity loses the never-downward-biased property.&lt;/p&gt;
&lt;p&gt;Q: What is the empirical application and what does it find?
A: The paper extends Meng, Qian, and Yared (2015), which studies the effect of TV access on demand for redistribution in China using provincial household survey data (30 provinces, multiple years), and Canay, Santos, and Shaikh (2021), who found CRVE1 standard errors may be unreliable in that setting. Applying V5, the jackknife standard error for the TV access coefficient exceeds the CRVE1 standard error, the Satterthwaite interval is wider than the conventional interval, and conclusions about statistical significance are affected.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions and limitations?
A: The bias results (Theorems 1–2) require only correct specification (zero conditional mean) and finite second moments. The Cauchy bound (Theorem 3) additionally requires normal errors; whether a similar bound holds without normality or in G → ∞ asymptotics is left open. The Satterthwaite adjustment applies only to inference on real-valued (scalar) parameters and does not extend to joint hypothesis tests. Assumption 3 limits inference to &amp;ldquo;well-identified&amp;rdquo; regressors (those whose leave-cluster-out coefficients are uniquely defined after partialling out controls).&lt;/p&gt;
&lt;p&gt;V5 (jackknife variance estimator): The paper&amp;rsquo;s proposed estimator, defined in equation (10) as the sum over all G clusters of outer products of (β̂₋g − β̂), where β̂₋g uses the Moore-Penrose generalized inverse. Unlike conventional jackknife estimators, V5 includes all clusters (no dropping), centers at the full-sample β̂, and omits the (G−1)/G correction. Its key property is E[V̂5] ≥ V for all regression designs.&lt;/p&gt;
&lt;p&gt;Never-downward-biased (conservative) estimator: A variance estimator whose expectation is weakly greater than the true variance in the positive semidefinite sense, for all admissible regressor matrices and covariance structures. V5 has this property; CRVE1, CRVE2, and conventional jackknife estimators do not.&lt;/p&gt;
&lt;p&gt;Full downward bias: The worst-case property that the infimum of E[v̂²]/v² equals zero over the model class — meaning the expected variance estimate can be arbitrarily close to zero relative to the true variance. CRVE1 is fully downward biased under clusterwise invertibility alone; CRVE2 requires non-i.i.d. errors; conventional jackknife estimators become fully downward biased under clusterwise noninvertibility.&lt;/p&gt;
&lt;p&gt;Clusterwise noninvertibility: The condition where deleting a single cluster g renders the matrix X&amp;rsquo;X − Xg&amp;rsquo;Xg singular, so the standard delete-one-cluster estimator β̂₋g is undefined. This occurs in regressions with cluster-level fixed effects, a single treated cluster, or sparse dummy variables. V5 handles this via the Moore-Penrose generalized inverse; Stata&amp;rsquo;s existing fix of dropping such clusters is shown to be non-robust.&lt;/p&gt;
&lt;p&gt;Cauchy distribution bound: Theorem 3&amp;rsquo;s result that the jackknife confidence interval C̃5(c) has coverage probability at least P[|ζ| ≤ c] for all c ≥ 1, uniformly over all regression designs and error variances (under normality). With c = 1.96, this gives a guaranteed coverage floor of 70%. This is the first generally applicable uniform coverage guarantee for clustered/heteroskedastic regression.&lt;/p&gt;
&lt;p&gt;Satterthwaite adjusted t approximation: A data-dependent distributional approximation for the jackknife t-ratio that approximates the denominator&amp;rsquo;s weighted chi-square distribution by a scaled chi-square with K degrees of freedom, where K and scale factor a are computed from trace formulas involving the design matrix. The resulting confidence interval θ̂ ± v̂5 · t^{1−α/2}_K / a converges to the standard normal interval in well-balanced large samples.&lt;/p&gt;
&lt;p&gt;Regressor leverage: The degree to which variation in a coefficient of interest is concentrated in a small number of clusters. High leverage (when one cluster dominates the regressor of interest) is the mechanism by which CRVE1/CRVE2 achieve worst-case downward bias even under homoskedasticity.&lt;/p&gt;</description></item><item><title>Joined at the Hip: Monetary and Fiscal Policy in a Liquidity-Dependent World</title><link>https://macropaperwarehouse.com/papers/joined-at-the-hip-monetary-and-fiscal-policy-in-a-liquidity-dependent-world/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/joined-at-the-hip-monetary-and-fiscal-policy-in-a-liquidity-dependent-world/</guid><description>&lt;h2 id="layer-1--what-this-paper-finds-and-why-it-matters"&gt;Layer 1 — What this paper finds and why it matters&lt;/h2&gt;
&lt;p&gt;Calvo and Velasco study an economy where both money and government bonds provide liquidity services, and they show that this shared role implies bond-financed fiscal expansions can be neutral or contractionary — not merely less effective than hoped. The mechanism turns on a fundamental asymmetry: the price of money in terms of goods is pinned down by sticky prices, whereas the price of long-term bonds is free to jump immediately in response to expected changes in bond supply. When the government announces a future bond-financed transfer to households, bond prices fall right away, compressing total liquidity before a single new bond is actually issued; the liquidity-in-advance constraint then forces aggregate demand and output down, producing a recession that precedes and is qualitatively separable from any subsequent boom. The paper maps four distinct timing cases — unanticipated permanent, anticipated permanent, unanticipated transitory flow, and unanticipated temporary stock — and shows each has a different (and sometimes opposite) short-run sign for output. To prevent these contractionary liquidity effects, the central bank must cut the interest rate on money and expand the money supply in ways that are precisely coordinated with the timing of the bond helicopter drop; in this sense fiscal and monetary authorities are, the authors conclude, joined at the hip. The paper also distinguishes this result from standard fiscal-dominance stories: the monetary authority is not compelled to finance the deficit but to stabilize bond prices in order to protect aggregate demand.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on working paper (LSE Research Online accepted version, December 2025). AI-assisted, human review pending. See the linked original for authoritative claims.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-2--in-depth"&gt;Layer 2 — In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-central-question-and-how-does-the-paper-differ-from-the-standard-new-keynesian-framework"&gt;Q1. What is the central question and how does the paper differ from the standard New Keynesian framework?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The central question is whether bond-financed government transfers raise, lower, or leave unchanged aggregate demand and output when bonds provide liquidity services.&lt;/strong&gt; Standard Keynesian and New Keynesian treatments focus on whether expansionary fiscal policy crowds out private investment through higher interest rates, or amplifies demand when the zero lower bound binds. Calvo and Velasco instead focus on the liquidity channel: because long-term bond prices are free to jump on news about future bond supply, increases in expected bond issuance can immediately reduce the market value of outstanding bonds, compressing total liquidity in private portfolios and thereby reducing consumption and output even before any new bond is issued. They call this a &amp;ldquo;non-standard&amp;rdquo; result and note that, by contrast, the price of money is insulated from such anticipatory jumps by sticky goods prices.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-model-structure"&gt;Q2. What is the model structure?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper uses a bare-bones, continuous-time, closed-economy model with a single infinitely lived household, one consumption good, and two assets in positive net supply: money (equated with central-bank reserves) and a long-term government bond (a perpetuity paying a coupon).&lt;/strong&gt; The key friction is a liquidity-in-advance constraint — households must hold sufficient liquidity (a weighted combination of real money balances and the real market value of bonds) to consume. The supply side is a standard Calvo (1983) Phillips curve. Policy instruments are the nominal interest rate on money, the nominal money supply, the nominal bond supply, and the bond coupon; the price of long-term bonds is endogenous. Commercial banks are abstracted away: money is effectively a CBDC. The paper notes that all main results also go through under a money-in-the-utility-function specification, provided the elasticity of substitution between consumption and liquidity is sufficiently low.&lt;/p&gt;
&lt;h3 id="q3-what-does-liquidity-mean-in-the-papers-own-sense-and-why-does-the-bond-price-matter-for-it"&gt;Q3. What does &amp;ldquo;liquidity&amp;rdquo; mean in the paper&amp;rsquo;s own sense, and why does the bond price matter for it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Liquidity is defined as a CES-weighted sum of real money holdings and the real market value of bond holdings, where the market value of bonds equals the bond price times the real quantity outstanding.&lt;/strong&gt; Because the bond price is free to jump, the market value of bonds (and therefore total liquidity) can change instantaneously in response to news, even when neither the nominal money stock nor the nominal bond stock has yet changed. Money does not share this vulnerability: its &amp;ldquo;price&amp;rdquo; in terms of goods is fixed in the short run by nominal price stickiness. This asymmetry — sticky price of money, flexible price of bonds — is the paper&amp;rsquo;s central mechanism. The authors attribute the stickiness insight to Keynes&amp;rsquo;s General Theory (the &amp;ldquo;price theory of money&amp;rdquo; as labelled by Calvo 2012).&lt;/p&gt;
&lt;h3 id="q4-what-happens-when-the-bond-supply-rises-unexpectedly-and-permanently"&gt;Q4. What happens when the bond supply rises unexpectedly and permanently?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An unanticipated and permanent step increase in the nominal (and, on impact, real) supply of long-term bonds is neutral: consumption and output are unchanged.&lt;/strong&gt; Bond prices fall immediately so that the total market value of bonds outstanding — and therefore total liquidity — is the same as before. The analogy drawn is to an unanticipated permanent increase in the money supply under fully flexible prices, which also has no real effects. The coupon must rise proportionally so that the return on bonds remains at its steady-state level. The paper notes that neutrality may not hold if bond holdings are distributed non-uniformly (e.g., concentrated in financial intermediaries that use bonds as repo collateral), because the drop in bond prices could trigger runs on those institutions.&lt;/p&gt;
&lt;h3 id="q5-what-happens-when-a-permanent-bond-supply-increase-is-anticipated-in-advance"&gt;Q5. What happens when a permanent bond-supply increase is anticipated in advance?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An anticipated and permanent future step increase in nominal bond supply causes a recession during the announcement-to-implementation interval, before any new bond has been issued.&lt;/strong&gt; Because arbitrage prevents an anticipated capital loss on bonds, the bond price cannot jump down at the implementation date T. Instead it must fall gradually starting at announcement date 0, reaching its new (lower) steady-state level exactly at T. This declining bond price reduces the market value of bonds and thereby compresses total liquidity throughout the interval [0, T), generating deflation and a negative output gap over that entire period. A naïve observer who notes an output boom just as the government begins to issue bonds at T would incorrectly conclude the policy is expansionary, when in fact the boom is the recovery from the pre-implementation recession.&lt;/p&gt;
&lt;h3 id="q6-what-happens-when-the-fiscal-authority-issues-bonds-at-a-constant-rate-for-a-finite-period-transitory-flow"&gt;Q6. What happens when the fiscal authority issues bonds at a constant rate for a finite period (transitory flow)?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An unanticipated, transitory, constant-rate bond issuance over an interval [0, T) also has a recessionary impact on impact and during the issuance period.&lt;/strong&gt; Bond prices fall faster than the nominal bond stock accumulates, so the total market value of bonds declines and liquidity is compressed. The Calvo-Phillips equation evaluated with negative and rising inflation implies a negative output gap throughout the early part of the episode. A boom follows after bond issuance ends — not because &amp;ldquo;confidence is restored&amp;rdquo; or fiscal sustainability has improved, but because the boom is mechanically part of the same liquidity-adjustment cycle as the earlier recession.&lt;/p&gt;
&lt;h3 id="q7-what-happens-under-an-unanticipated-but-temporary-step-increase-in-the-bond-stock"&gt;Q7. What happens under an unanticipated but temporary step increase in the bond stock?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An unanticipated but temporary step increase in bond supply — one that will be reversed at a known future date T — is expansionary on impact.&lt;/strong&gt; Because the price of bonds cannot be anticipated to jump at T, the bond price must rise from its impact level back to the initial steady state by T. On impact, the bond price falls but by less than the increase in nominal bond supply, so the market value of bonds rises and total liquidity increases, pushing aggregate demand and output above their natural rates. The initial boom is thus followed by a recession around the time bond supply is cut back, which the authors note could generate political pressure to extend the &amp;ldquo;expansionary&amp;rdquo; fiscal policy.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-common-mechanism-linking-the-contractionary-cases"&gt;Q8. What is the common mechanism linking the contractionary cases?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In both contractionary cases (anticipated permanent and unanticipated transitory flow), the bond price falls more rapidly than the bond stock rises, so the total market value of bonds declines, compressing liquidity.&lt;/strong&gt; From the model&amp;rsquo;s liquidity identity (equation 18 in the paper), total liquidity depends on real money balances (fixed on impact) plus a weight on the relative position of bonds to money. When that relative position (captured by the variable s_t in the model) falls, total liquidity falls. The liquidity-in-advance constraint then directly constrains consumption and output downward. Deflation is the only endogenous mechanism to rebuild real liquidity, but it works gradually and involves a protracted recession.&lt;/p&gt;
&lt;h3 id="q9-what-monetary-policy-does-the-paper-prescribe-to-neutralize-the-contractionary-effects"&gt;Q9. What monetary policy does the paper prescribe to neutralize the contractionary effects?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;To avoid the contractionary liquidity effects of anticipated bond helicopter drops, the central bank must cut the interest rate on money and expand the money supply in a manner whose precise time profile depends on the timing of the fiscal shock.&lt;/strong&gt; For an anticipated permanent bond-supply increase, the required monetary response involves gradually expanding the nominal money supply between announcement and implementation, followed by a discrete step decrease in nominal (and real) money at exactly the moment bond supply jumps up. This coordinated monetary expansion offsets the bond-price-driven compression of liquidity. The paper confirms this formally in Section IV (not fully extracted in the source text), with the conclusion that avoiding unwanted contractionary effects requires coupling fiscal bond issuance with specific, coordinated monetary actions.&lt;/p&gt;
&lt;h3 id="q10-how-does-the-paper-relate-to-fiscal-dominance--and-how-does-it-differ"&gt;Q10. How does the paper relate to fiscal dominance — and how does it differ?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper identifies a novel form of fiscal dominance in which monetary policy is compelled not to monetize the fiscal deficit but to stabilize government bond prices in order to protect aggregate demand and inflation.&lt;/strong&gt; Traditional fiscal dominance (common in emerging markets) forces the central bank to print money to finance the deficit. Here, the mechanism is different: expected bond issuance drives down bond prices and compresses liquidity, so the central bank must intervene in bond markets — effectively buying newly issued bonds — to prevent deflationary recessions. An outside observer could mistake this for traditional monetization. The paper frames the Federal Reserve&amp;rsquo;s $1 trillion Treasury purchase program from mid-March 2020 onward as consistent with this bond-price-stabilization logic, citing Vissing-Jorgensen (2021) on the causal role of Fed purchases in driving down yields through acute liquidity provision.&lt;/p&gt;
&lt;h3 id="q11-what-is-the-scope-of-the-non-standard-results"&gt;Q11. What is the scope of the non-standard results?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The non-standard (neutral or contractionary) results apply specifically to bond-financed increases in government transfers to the private sector; money-financed fiscal expansion and bond-financed government consumption changes are not the focus and do not share these properties in the model.&lt;/strong&gt; The authors explicitly note this caveat. However, they argue the exercise is policy-relevant because much of the fiscal response to both the 2008 Global Financial Crisis and the Covid-19 crisis took the form of sharp increases in government transfers financed by bond issuance. The model also assumes lump-sum taxes, so in the absence of liquidity effects Ricardian equivalence would obtain; all non-neutralities are driven entirely by the liquidity channel.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Liquidity-in-advance constraint&lt;/strong&gt; : An analog of a cash-in-advance constraint in which the household must hold a weighted sum of real money balances and the real market value of bonds sufficient to finance current consumption; it always binds in the model&amp;rsquo;s equilibrium, so liquidity directly pins down output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price theory of money&lt;/strong&gt; : The proposition (attributed to Keynes and labelled by Calvo 2012) that money is highly liquid partly because the nominal goods-price level is sticky, fixing the price of money in terms of goods; this insulates the real value of money from the anticipatory jumps that affect bond prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond helicopter drop&lt;/strong&gt; : A government transfer to households financed by issuing long-term bonds (perpetuities), with no change in taxes or money supply; the term &amp;ldquo;helicopter drop of bonds&amp;rdquo; is used by the authors to parallel Friedman&amp;rsquo;s helicopter money but with bonds as the instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond-price stabilization (non-traditional fiscal dominance)&lt;/strong&gt; : The authors&amp;rsquo; term for a situation in which expected fiscal bond issuance compresses bond-market liquidity and forces the central bank to expand money supply and cut the interest rate on money in order to stabilize bond prices and prevent contractionary effects, even though the central bank is not formally required to finance the deficit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;s_t (bond-to-money relative position)&lt;/strong&gt; : A model variable defined as the log-deviation from steady state of the ratio of the real market value of bonds to real money balances; it captures the relative contribution of bonds to total portfolio liquidity and is the key endogenous state variable linking bond-price dynamics to aggregate demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calvo-Phillips curve&lt;/strong&gt; : The standard Calvo (1983) staggered-pricing supply side, used here to generate the inflation-output gap trade-off; in the paper&amp;rsquo;s notation, inflation dynamics satisfy π̇_t = δπ_t − κ(y_t − ȳ), where output gaps are driven by liquidity shortfalls rather than standard demand shocks.&lt;/p&gt;</description></item><item><title>Jumpstarting an International Currency</title><link>https://macropaperwarehouse.com/papers/jumpstarting-an-international-currency/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/jumpstarting-an-international-currency/</guid><description>&lt;p&gt;This paper asks how a currency achieves international status — moving from zero to positive cross-border use — and whether deliberate central bank policy can accelerate that transition. The authors focus on the People&amp;rsquo;s Bank of China (PBoC) swap lines signed between 2009 and 2018, which extended RMB-denominated lender-of-last-resort credit to foreign central banks for the stated purpose of supporting RMB-denominated trade finance and settlement.&lt;/p&gt;
&lt;p&gt;The empirical analysis combines two datasets. The first covers every RMB swap line agreement the PBoC signed with a foreign central bank (38 countries by 2018), compiled from PBoC news releases and validated against counterparty communications, treated as a staggered binary absorbing treatment. The second is monthly SWIFT data on cross-border payment message values (October 2010 – October 2018), disaggregated by currency and message type (payment orders MT103/MT202 and trade-finance messages MT400/MT700). The working sample, after excluding financial centre hubs, sanctioned countries, pre-sample treated countries, and small economies, covers 114 countries with 11,058 observations, of which 21 are treated during the sample period.&lt;/p&gt;
&lt;p&gt;The main identification strategy is a staggered difference-in-differences design using the imputation estimator of Borusyak et al. (2024), with controls for bilateral trade with China, Chinese economic policy variables (RMB clearing bank presence, AIIB membership, infrastructure investment flows, UN voting alignment), and regional RMB adoption trends. The authors are explicit that conditional independence is not guaranteed and characterize results as documenting an association.&lt;/p&gt;
&lt;p&gt;At the extensive margin, signing a swap line is associated with an approximately 14 percentage point increase in the probability that a country uses the RMB for international payments in a given month (baseline column: 11%, rising to approximately 14% with controls and approximately 20% when anticipation effects are accounted for by shifting treatment timing six months earlier). At the intensive margin — using ln(1 + RMB payments) and Poisson specifications — RMB usage is between 250% and 440% higher in treated countries following the policy. The effect concentrates within the first 12 months of signing and persists without reversion. The effect is present in payments not involving China as a counterparty, is not explained by Belt and Road Initiative membership, and does not extend to bilateral trade volumes with China.&lt;/p&gt;
&lt;p&gt;Four mechanisms from the paper&amp;rsquo;s theoretical model are tested and supported. First, swap lines reduce offshore RMB borrowing costs by an estimated 115 basis points on average (rising to 205 basis points for emerging market currencies). Second, the 2015–16 RMB crisis — in which the PBoC drained offshore liquidity to defend the exchange rate peg, sharply raising private RMB borrowing costs — caused a significant decline in RMB use among countries without a swap line but not among those with one, consistent with the model&amp;rsquo;s prediction that swap lines cap the right tail of borrowing cost distributions. Third, effects are concentrated in trade-finance SWIFT messages, stronger in countries with above-median trade shares with China, and increasing in intermediate import intensity and working capital reliance. Fourth, the RMB gains displace existing international currencies — the USD share falls by approximately 8 percentage points and the EUR share by approximately 2.5 percentage points — rather than displacing local currencies, as the model predicts. There are also geographic spillovers: a neighboring country signing a swap line is associated with a 10% increase in RMB use even for countries that did not sign.&lt;/p&gt;
&lt;p&gt;The theoretical framework models import-export firms that choose simultaneously the currency of trade finance and the currency of sales invoicing. Sticky prices create a complementarity between these two choices. A swap line truncates the right tail of the borrowing cost distribution (first-order stochastic dominance), which can push firms above a threshold into using the rising currency for both liabilities and invoicing. The model predicts threshold behavior — a currency either jumpstarts or does not — and explains why only a small number of currencies ever achieve international status.&lt;/p&gt;
&lt;p&gt;Q: What are the PBoC swap lines and how do they mechanically affect firms?
A: A PBoC swap line is a renewable 3-year agreement between the PBoC and a foreign central bank that allows the foreign central bank to borrow RMB and on-lend it domestically to support RMB-denominated trade finance. Like other central bank lending facilities, they place a ceiling on interest rates, thereby truncating the right tail of the distribution of RMB borrowing costs faced by commercial banks and their firm customers. The key insurance property holds even when lines are not actively drawn upon, because their existence caps tail risk.&lt;/p&gt;
&lt;p&gt;Q: What is the extensive margin finding for swap lines and RMB payments?
A: Signing a swap line is associated with an approximately 11% increase in the probability that a country uses the RMB for cross-border payments in a given month without controls, rising to approximately 14% with the full set of controls, and to approximately 20% when treatment timing is shifted six months earlier to account for anticipation effects. The event study shows the effect concentrates within 12 months of signing and does not revert.&lt;/p&gt;
&lt;p&gt;Q: What is the intensive margin finding?
A: Using ln(1 + RMB payments) and Poisson specifications — preferred because Mongolia is an outlier and payment value volatility is increasing in payment level — treated countries have RMB payment values between 250% and 440% higher than control countries after signing. The RMB share of payments rises by 0.13 percentage points on average, compounding to approximately 0.3 percentage points in years 3–4, or roughly one-fifth of the overall rise in RMB payments over the full sample period.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address the concern that swap lines are signed precisely when economic integration with China is deepening?
A: They include a comprehensive set of controls: bilateral export and import values to/from China, the ratio of Chinese trade to GDP, China trade agreement status, RMB clearing bank presence, AIIB membership, infrastructure investment flows, and UN voting alignment. They also show separately that (i) the effect is present in RMB payments not involving China as a counterparty, (ii) Belt and Road Initiative membership does not account for the effect, and (iii) there is no increase in bilateral trade with China following swap line signing. The authors nonetheless characterize results as documenting an association, not establishing causation.&lt;/p&gt;
&lt;p&gt;Q: Do swap lines actually reduce RMB borrowing costs as the model requires?
A: Yes. Using the same staggered difference-in-differences methodology, signing a swap agreement is associated with a 115 basis point fall in offshore RMB borrowing rates on average. For emerging market currency comparators the effect rises to 205 basis points. The event study shows an immediate and sustained reduction with no detectable pre-trend.&lt;/p&gt;
&lt;p&gt;Q: What does the 2015–16 RMB crisis reveal about the mechanism?
A: In August 2015 the PBoC adjusted its RMB-USD central parity rate, triggering a 3% depreciation over two days and subsequent offshore liquidity drainage that raised both the level and volatility of offshore RMB borrowing costs until approximately April 2017. This shock was primarily financial rather than reflecting a Chinese economic slowdown. Countries without a swap line experienced a sharp decline in RMB payment usage in 2015Q4, while countries with a swap line — whose right-tail borrowing costs were capped — did not, consistent with the model&amp;rsquo;s prediction that the lines insulate against tail risk shocks.&lt;/p&gt;
&lt;p&gt;Q: Are the effects concentrated in trade finance as the model predicts?
A: Yes. Restricting the analysis to SWIFT trade-finance message types (MT400 and MT700), the coefficient estimates are similar in magnitude to those for all payments. Effects on the trade finance extensive margin are concentrated among countries with above-median trade shares with China. The effects are also increasing in countries&amp;rsquo; intermediate import intensity and in the degree to which export industries rely on working capital.&lt;/p&gt;
&lt;p&gt;Q: Which currencies does the RMB displace and which does it not displace?
A: The swap line is associated with a 14 percentage point rise in the RMB share of payments to and from China. Decomposing this: the USD share falls by approximately 8 percentage points, the EUR share by approximately 2.5 percentage points, the combined GBP/JPY/CHF share by approximately 0.5 percentage points, and other currencies by approximately 3 percentage points. The local currency of the country receiving the swap line does not show a statistically significant decline, consistent with the model&amp;rsquo;s prediction that the RMB competes primarily with existing international vehicle currencies rather than with domestic currencies.&lt;/p&gt;
&lt;p&gt;Q: Are there geographic spillovers from swap lines?
A: Yes. A neighboring country (defined as countries within 1,000 km, or the nearest five if fewer than five are within that distance) signing a swap line is associated with a 10% increase in RMB payments for the non-signatory neighbor. The authors attribute this to supply chain linkages: firms importing RMB-invoiced inputs from a swap-line country face an incentive to adopt RMB for their own downstream transactions.&lt;/p&gt;
&lt;p&gt;Q: What does the model predict about which currencies can ever become international?
A: The model identifies three thresholds a currency must pass. First, exchange rate variance must be sufficiently low; most currencies fail this condition. Second, the right tail of borrowing costs in that currency must not be too high; skewed distributions fail the threshold condition in Proposition 2. Third, the currency-issuing country must be large enough as an export market or intermediate input source to generate the complementarity factor Psi that makes adopting the currency worthwhile. Most currencies fail on multiple dimensions, explaining why so few achieve international status.&lt;/p&gt;
&lt;p&gt;Q: How do sticky prices create the complementarity between trade finance currency and invoicing currency in the model?
A: Firms set prices in advance before exchange rates and borrowing costs are realized. If a firm borrows in currency r to finance imported inputs but prices its exports in currency d, cost and revenue shocks are mismatched, creating profit volatility. Nominal price stickiness means firms cannot adjust prices ex post to maintain constant markups. This makes it optimal to align the currency of liabilities (trade finance) with the currency of export invoicing, creating a complementarity that amplifies the effect of a reduction in r-currency borrowing costs on invoicing currency choice.&lt;/p&gt;
&lt;p&gt;Q: How do the authors handle the potential bias from heterogeneous treatment effects in the staggered difference-in-differences design?
A: They use the imputation estimator of Borusyak et al. (2024), which is robust to heterogeneous treatment effects across cohorts, clustering standard errors at the country level and averaging treatment effects by cohort. They also verify results using the synthetic difference-in-differences estimator of Arkhangelsky et al. (2021), which reweights observations to equalize pre-treatment trends, and show results are robust across both two-way fixed effects and these more modern estimators.&lt;/p&gt;
&lt;p&gt;Q: What historical parallel do the authors draw and what does it imply for the RMB&amp;rsquo;s future?
A: The paper draws a parallel with the USD&amp;rsquo;s displacement of pound sterling in trade finance in the decade following the Federal Reserve&amp;rsquo;s creation in 1913 and the establishment of bankers&amp;rsquo; acceptances. That transition was supported by World War I&amp;rsquo;s damage to the UK economy and rapid US economic growth. The authors conclude that RMB internationalization will require not only continued policy support but also favorable economic fundamentals including sound monetary policy and deeper capital markets.&lt;/p&gt;
&lt;p&gt;Q: How does the PBoC&amp;rsquo;s swap line program differ from Federal Reserve and ECB swap lines?
A: PBoC lines differ in four key respects: they have longer maturities (3-year renewable agreements vs. shorter-term Fed/ECB lines); they involve a large and diverse set of mostly developing countries rather than a handful of advanced economies; they target trade finance in a context of limited RMB cross-border banking rather than addressing foreign-bank dollar funding shortfalls caused by dollar dominance; and they were designed to initiate internationalization rather than to respond to an existing dominant currency&amp;rsquo;s liquidity stresses. The aggregate notional limit of approximately RMB 3 trillion is nonetheless comparable in scale to the USD 600 billion of peak drawings from Fed swap lines.&lt;/p&gt;
&lt;p&gt;International currency jumpstart: The process by which a currency moves from zero to positive international use, as opposed to the better-studied phenomenon of a currency achieving dominance. The paper distinguishes jumpstart (initial adoption) from dominance (widespread adoption), arguing that different mechanisms govern each stage.&lt;/p&gt;
&lt;p&gt;PBoC swap lines: Renewable 3-year agreements between the People&amp;rsquo;s Bank of China and foreign central banks enabling the latter to borrow RMB and on-lend it domestically for RMB-denominated trade finance. In the paper&amp;rsquo;s framework, they function as an extension of the lender of last resort function abroad, placing a ceiling on offshore RMB borrowing costs and truncating the right tail of the borrowing cost distribution.&lt;/p&gt;
&lt;p&gt;Trade finance currency complementarity: The paper&amp;rsquo;s central mechanism — the alignment incentive between the currency of a firm&amp;rsquo;s liabilities (working capital / trade finance for imported inputs) and the currency of its export invoicing. Sticky prices create this complementarity because misaligned currency choices expose firms to uninsurable profit volatility.&lt;/p&gt;
&lt;p&gt;Borrowing cost distribution truncation: The mechanism by which a swap line affects firm behavior — not by lowering average costs but by capping the right tail of the distribution of possible RMB borrowing rates. The model requires first-order stochastic dominance of the post-swap-line distribution over the pre-swap-line distribution.&lt;/p&gt;
&lt;p&gt;Threshold condition for currency adoption: Derived from the model&amp;rsquo;s Proposition 2, the condition on the expected concave function of borrowing costs relative to an adjusted interest rate differential that must be satisfied for a firm to choose r-currency credit over d-currency credit. The complementarity factor Psi, which increases with the size of the rising-currency market, enters this threshold.&lt;/p&gt;
&lt;p&gt;Extensive vs. intensive margin of currency use: The extensive margin refers to whether a country uses the RMB at all in a given month (1(Rpayment &amp;gt; 0)); the intensive margin refers to the share of payments denominated in RMB or the log value of RMB payments. The paper finds the swap lines affect both margins, with the extensive margin effect appearing immediately and stabilizing after 12 months.&lt;/p&gt;
&lt;p&gt;Vehicle currency displacement: The paper&amp;rsquo;s empirical finding that RMB adoption displaces existing international vehicle currencies (USD, EUR) rather than local currencies. This is a prediction of the model: firms adopting RMB for trade finance were previously using an existing international currency, not their domestic currency, for that purpose.&lt;/p&gt;</description></item><item><title>Labor Market Competition and the Assimilation of Immigrants</title><link>https://macropaperwarehouse.com/papers/labor-market-competition-and-the-assimilation-of-immigrants/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/labor-market-competition-and-the-assimilation-of-immigrants/</guid><description>&lt;h2 id="labor-market-competition-and-the-assimilation-of-immigrants"&gt;Labor Market Competition and the Assimilation of Immigrants&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;Why have immigrant-native wage gaps widened substantially across arrival cohorts in the United States since the 1960s, and why has the speed of wage convergence slowed? The paper argues that the existing literature, which attributes these trends entirely to declining immigrant cohort quality, omits a critical general-equilibrium channel: labor market competition arising from imperfect substitutability between immigrants and natives. The paper quantifies how much of the observed deterioration in wage assimilation profiles can be attributed to (i) increasing immigrant cohort sizes raising labor market competition, (ii) secular shifts in relative skill demand, and (iii) genuine changes in immigrant cohort quality.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The analysis uses U.S. Census microdata for 1970, 1980, 1990, and 2000, combined with American Community Survey (ACS) data pooled for 2009–2011 (labeled 2010) and 2018–2019 (labeled 2020), all drawn from IPUMS-USA. The sample covers individuals aged 25–64 who are employed in the civilian sector, not self-employed, not in group quarters, and report positive earnings. Immigrant cohort sizes grew from approximately 800,000 individuals in the 1960s cohort to 2.3 million in the 1980s cohort and 4.6 million in the 2000s cohort.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a constant elasticity of substitution (CES) production function in which workers supply two types of skills: &amp;ldquo;general&amp;rdquo; skills portable across countries and &amp;ldquo;specific&amp;rdquo; skills particular to the host country (including language proficiency and knowledge of cultural and institutional environment). Immigrants arrive with the same general skills as observationally equivalent natives but only a fraction of their specific skills; they accumulate specific skills over time. Because immigrants disproportionately supply general skills upon arrival, increasing immigrant inflows raise the relative supply of general skills, depress the relative price of general skills, and thereby widen the immigrant-native wage gap. This mechanism operates only when immigrants and natives are imperfect substitutes (elasticity of substitution σ &amp;lt; ∞).&lt;/p&gt;
&lt;p&gt;The model is estimated in two steps using nonlinear least squares (NLS). First, productivity factor parameters are estimated from native wages year by year, with state dummies identifying state-level skill prices. Second, specific skill accumulation parameters and the elasticity of substitution σ are jointly identified from immigrant wage differences across labor markets (defined as U.S. states) and over time. The demand shift parameter δ_t, which captures changes in the relative demand for specific skills (e.g., technology that favors communication over manual tasks), enters as a linear time trend in the baseline specification.&lt;/p&gt;
&lt;h3 id="main-findings-with-quantitative-magnitudes"&gt;Main Findings with Quantitative Magnitudes&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Competition effect:&lt;/strong&gt; Immigration-induced increases in labor market competition explain 14.2, 43.9, and 40.8 percent of the increase in the initial wage gap of the 1970s, 1980s, and 1990s cohorts relative to the 1960s cohort, respectively. Averaged across all years spent in the United States, the competition effect alone accounts for 14.1, 22.4, and 20.4 percent — approximately one fifth overall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Competition plus demand effect:&lt;/strong&gt; Adding secular shifts in relative skill demand raises these figures to 24.8, 68.3, and 109.5 percent at arrival and 21.2, 33.6, and 36.4 percent averaged across years — approximately one third overall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Elasticity of substitution:&lt;/strong&gt; The baseline estimate of σ (elasticity of substitution between general and specific skills) is 0.020 (s.e. 0.002), implying an inverse elasticity of approximately 50.5. The relative supply of general skills increased by 1.67 log points between 1970 and 2020, producing a predicted increase in the relative price of specific skills of approximately 59.6 log points. The demand shift trend is estimated at 1.3 log points per year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cohort quality:&lt;/strong&gt; Once competition and demand effects are netted out, the remaining deterioration in assimilation profiles is entirely attributable to observable changes in immigrants&amp;rsquo; educational attainment and country-of-origin composition. Conditional on these two observable characteristics, unobservable skill quality improved across cohorts (consistent with English language proficiency trends), reversing the conventional narrative of declining cohort quality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skills gap at arrival:&lt;/strong&gt; The 1960s cohort faced a specific skills gap of approximately 52.4 percent relative to native equivalents; this narrowed to 41.8 percent for the 1970s cohort, 35.6 percent for the 1980s cohort, and 17.6 percent for the 1990s cohort, conditional on origin and education. After 20–30 years, all cohorts reach 83.7–92.0 percent of their native counterparts&amp;rsquo; specific skill levels.&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope Conditions&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;The analysis focuses on employed men in the main text (women are analyzed in an Online Appendix, showing qualitatively similar but quantitatively smaller patterns).&lt;/li&gt;
&lt;li&gt;Labor markets are defined at the U.S. state level in the baseline; robustness checks use state-education and state-gender cells.&lt;/li&gt;
&lt;li&gt;The decomposition covers the period from the 1960s to the 1990s arrival cohorts.&lt;/li&gt;
&lt;li&gt;Results are robust to corrections for selective outmigration, undercounting of undocumented immigrants, immigrant network effects, alternative demand shift specifications, alternative labor market definitions, and endogenous immigrant location choice (using shift-share instruments in the spirit of Card, 2001).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core theoretical mechanism by which increasing immigrant inflows widen the immigrant-native wage gap?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because immigrants disproportionately supply general (country-portable) skills upon arrival, while natives disproportionately supply specific (host-country) skills, an increase in immigrant inflows raises the ratio of general to specific skills in the economy. Under imperfect substitutability (σ &amp;lt; ∞), this lowers the relative price of general skills and raises the relative price of specific skills, thereby widening the wage gap between immigrants (who earn predominantly from general skills) and natives (who earn more from specific skills). The effect is larger in the early years after arrival when immigrants&amp;rsquo; specific skill endowment s is small, and diminishes as immigrants accumulate specific skills over time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the paper model immigrants&amp;rsquo; skill accumulation, and how do accumulation profiles differ across groups?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Immigrants&amp;rsquo; specific skill endowment s(·) upon arrival and over time is modeled as a flexible polynomial in years since migration, interacted with dummies for region of origin, education, cohort of entry, and potential experience abroad. Mexican high school dropouts (the reference group) are estimated to arrive with approximately 80 percent of the specific skills of equivalent natives. Immigrants from Latin America, Asia, and other regions arrive with lower specific skills than Western immigrants, who arrive near native parity. Higher-educated immigrants arrive relatively less similar to equivalently educated natives than low-educated immigrants, reflecting the greater importance of language-intensive skills in high-skill occupations. Conditional on origin and education, more recent cohorts arrive with narrower specific skill deficits: the 1990s cohort faces a gap of 17.6 percent at arrival compared to 52.4 percent for the 1960s cohort.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the estimated technology parameters, and how are they interpreted?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The elasticity of substitution between general and specific skills is estimated at σ = 0.020 (s.e. 0.002), with a confidence interval of [0.017, 0.024]. This implies an inverse elasticity of approximately 50.5, meaning a one percent increase in the relative supply of general skills raises the relative price of specific skills by about 50.5 percent. The implied elasticity of substitution between natives and immigrants (evaluated at market-level averages) is approximately 0.013 in 1990, 0.020 in 2000, and 0.025 in 2010 — in the same range as the Ottaviano and Peri (2012) benchmark of 0.034 (s.e. 0.008). The demand shift trend is estimated at δ̃ = 0.013 (s.e. 0.001) log points per year, reflecting secular increases in the relative demand for specific (host-country) skills.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the paper identify the elasticity of substitution σ and the skill accumulation parameters separately?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The estimation proceeds in two steps. First, productivity factor parameters (returns to education and experience) are estimated from native wage regressions, with state-year dummies absorbing state-specific skill prices. Second, skill accumulation parameters θ are identified from wage differences between immigrants with different characteristics working in the same labor market, while σ and the demand shift δ̃ are identified from variation in immigrant wage gaps across states (which have different immigrant population shares) and over time. Specifically, states with higher immigrant shares display lower relative prices of general skills, providing the identifying variation for σ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the quantitative magnitudes of the competition effect for specific cohorts at different time horizons?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: At the time of arrival, the competition effect explains 14.2 percent (1970s cohort), 43.9 percent (1980s cohort), and 40.8 percent (1990s cohort) of the increase in initial wage gaps relative to the 1960s cohort. After 10 years, these figures are 17.1, 22.7, and 22.2 percent respectively. After 20 years, they are 12.2, 16.9, and 16.2 percent. After 30 years, 10.9, 15.3, and 13.7 percent. The declining share across years reflects the fact that as immigrants accumulate specific skills, their wages become less sensitive to equilibrium skill prices. Averaged across all years since migration, the competition effect accounts for 14.1, 22.4, and 20.4 percent for the three cohorts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does labor market competition affect the speed of wage assimilation, and does it prevent full convergence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The effect on assimilation speed is theoretically ambiguous and depends on whether future cohorts are larger or smaller than the reference cohort, and whether immigrants fully converge to native skill levels. In the stylized examples, a one-time permanent increase in competition raises both the initial wage gap and the speed of subsequent convergence (since the gap between immigrant and native skill levels is larger and therefore more responsive to changes in skill prices). However, continuous inflows of increasingly large cohorts counteract this speedup by continuously shifting the wage profile downward — the &amp;ldquo;dynamic competition effect.&amp;rdquo; For immigrants who fully converge (s → 1), competition delays but does not prevent convergence; for those who only partially converge (s → &amp;lt; 1), competition permanently widens the long-run wage gap. Quantitatively, the paper finds the effect on assimilation speed to be small in the full-sample decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What do the illustrative examples for specific immigrant groups reveal about heterogeneous competition effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For a Mexican male high school dropout (1960s cohort skills), facing the same competition level as the 1990s cohort would widen the initial wage gap by 10.2 log points; facing 2010 competition levels would widen it by 21.1 log points. However, because this group fully converges (s → 1), the effect dissipates entirely after approximately 25 years, and long-run wage assimilation is not prevented. For a Latin American male high school graduate who only partially converges (s → &amp;lt; 1), facing 1990s competition would widen the initial gap by 17.4 log points and leave a 3.8 log-point larger long-run wage gap. For a Western college graduate who arrives near native skill parity, competition effects are negligible throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the changes in absolute wage gaps documented in the baseline data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The 1960s cohort arrived with an initial wage gap of approximately 17.2 log points relative to natives. The 1970s cohort arrived with a gap of 30.1 log points, the 1980s cohort 29.2 log points, and the 1990s cohort 20.8 log points. Under the no-competition counterfactual, these initial gaps narrow to 13.6, 24.7, 20.3, and 15.7 log points respectively. Removing both competition and demand effects further narrows them to 13.7, 23.4, 17.5, and 13.3 log points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the paper find about the role of observable versus unobservable immigrant quality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Once competition and demand effects are accounted for, all remaining cohort differences in assimilation profiles are attributable to observable changes in immigrants&amp;rsquo; educational attainment and country-of-origin composition. Conditional on these two observable characteristics, immigrants in more recent cohorts display higher levels of unobservable skills (smaller specific skill deficits conditional on origin and education), consistent with rising English language proficiency across cohorts. This reverses the standard interpretation that unobservable immigrant quality has declined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do aggregate skill supplies and relative skill prices evolve over the sample period?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Between 1970 and 2020, the total supply of general skills from immigrants grew by a factor of 16.3, while the supply of specific skills grew by a factor of 15.0. The resulting increase in the relative supply of general skills caused the relative price of general skills to fall from 0.89 to 0.38. Accounting for growing relative demand for specific skills (the δ_t trend), the ratio of relative skill prices fell further to 0.20 by 2020. At the state level, relative prices of general skills are well below 0.3 in high-immigration states like California, Florida, and New York, and approach 1.0 in states with low immigrant shares.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Are the results robust to selective outmigration, undocumented immigrants, and alternative specifications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Yes. Across twelve robustness checks covering selective outmigration corrections (using Borjas and Bratsberg 1996 or Rho and Sanders 2021 outmigration rates, and synthetic cohort reweighting), undocumented immigrant undercounting corrections, immigrant network controls (share and stock of compatriots in the same state), alternative demand shift specifications (quadratic and time dummies), alternative labor market definitions (state-education and state-gender cells), and endogenous immigrant location choice (GMM with shift-share instruments), the estimated elasticity of substitution σ ranges from 0.017 to 0.033 and the average competition effects remain stable. Averaged across all robustness checks, competition effects are 1.3 log points (1960s cohort), 3.0 log points (1970s), 5.2 log points (1980s), and 4.3 log points (1990s), compared to baseline values of 1.4, 3.1, 5.5, and 4.6 log points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the policy implications highlighted by the authors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: First, since assimilation and competition effects are intertwined, the wage impact of immigration on natives is intrinsically dynamic: newly arrived immigrants initially compete relatively little with natives but increasingly substitute for them as their specific skills grow. Second, labor market competition may reduce immigrants&amp;rsquo; incentives to invest in host-country-specific skills, a channel not modeled in most existing structural models. Third, dispersal policies (such as those used during refugee crises) that reallocate immigrants across regions will affect local skill price ratios and therefore alter wage assimilation trajectories — a potentially unintended consequence of geographic allocation policies.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;General skills:&lt;/strong&gt; Skills that are portable across countries and can be used productively in any labor market. In the paper&amp;rsquo;s framework, general skills are those required for tasks (such as manual or physical labor) that are similar across national contexts. Upon arrival, immigrants are assumed to supply the same amount of general skills as observationally equivalent natives, making immigrants&amp;rsquo; relative supply of general skills high at arrival.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skills (host-country-specific skills):&lt;/strong&gt; Skills particular to the host country, including language proficiency (English in the U.S. context) as well as familiarity with the institutional and cultural environment. Immigrants arrive with only a fraction s of the specific skills of comparable natives; this fraction evolves over time as immigrants spend time in the host country. The level of specific skills governs how substitutable a given immigrant worker is with native workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor market competition effect:&lt;/strong&gt; The mechanism by which increasing immigrant inflows affect relative wages through equilibrium changes in skill prices rather than through individual skill accumulation. When immigrants and natives are imperfect substitutes, rising immigrant inflows raise the relative supply of general skills, depress the relative price of general skills, and widen the immigrant-native wage gap. This effect is larger for recently arrived immigrants (small s) and diminishes as immigrants assimilate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic competition effect:&lt;/strong&gt; The combined effect on a given cohort&amp;rsquo;s observed assimilation profile of continuous, growing immigrant inflows over its time in the country. Unlike a one-time permanent increase in competition (which would raise both the initial gap and assimilation speed), continuously growing inflows both widen the initial gap and exert a continuous downward shift on the cohort&amp;rsquo;s wage profile, with an ambiguous net effect on the speed of convergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand shift (δ_t):&lt;/strong&gt; A time-varying parameter in the CES production function capturing secular changes in the relative demand for specific versus general skills beyond what is explained by standard skill-biased technological change. A positive trend in δ_t (estimated at 1.3 log points per year in the baseline) reflects technological change that favors communication-intensive (specific-skill-intensive) tasks over manual (general-skill-intensive) tasks, and amplifies the competition effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Elasticity of substitution between general and specific skills (σ):&lt;/strong&gt; The key technology parameter governing the degree of imperfect substitutability between natives and immigrants in equilibrium. Estimated at σ = 0.020 in the baseline. When σ = ∞, immigrants and natives are perfect substitutes and labor market competition has no effect on relative wages. As σ decreases, the competition effect on relative wages becomes stronger for a given change in relative skill supplies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Specific skill accumulation function s(·):&lt;/strong&gt; A flexible parametric function of years since migration, interacted with region of origin, education level, cohort of entry, and potential experience at arrival, that governs the rate at which immigrants acquire host-country-specific skills over time. The intercept of s(·) at arrival (relative to a native s = 1) measures the initial specific skill deficit; the polynomial in years since migration captures how quickly this deficit closes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage assimilation profile:&lt;/strong&gt; The trajectory of the immigrant-native log wage gap as a function of years spent in the host country, conditional on a cohort of arrival. The paper distinguishes between changes in the level of the profile (the initial wage gap) and changes in its slope (the speed of convergence), and decomposes both dimensions into competition effects, demand effects, and cohort quality effects.&lt;/p&gt;</description></item><item><title>Labor Market Shocks and Monetary Policy</title><link>https://macropaperwarehouse.com/papers/labor-market-shocks-and-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/labor-market-shocks-and-monetary-policy/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks two related questions: (1) How much, and through which channels, do employer-to-employer (EE) worker transitions affect macroeconomic outcomes — particularly inflation? (2) What is the optimal monetary policy within a class of Taylor rules when EE flows are taken explicitly into account?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation.&lt;/strong&gt; Standard monetary policy frameworks condition on the unemployment rate as the primary labor market slack measure and underemphasize the &amp;ldquo;quality&amp;rdquo; dimension of employment. The paper documents a striking empirical pattern: the 2016–2019 recovery and the 2021–2022 recovery from COVID-19 featured nearly identical declines in the unemployment rate, yet exhibited dramatically different EE rate dynamics and inflation outcomes. During 2016–2019, the EE rate remained flat despite a roughly 25 percent decline in the unemployment rate from trend. During 2021–2022, the EE rate rose by around 8 percent above trend over a comparable unemployment decline. Correspondingly, unit labor cost (ULC) growth reached approximately 6 percent during the COVID-19 recovery when unemployment fell below 4 percent, compared with only about 2 percent ULC growth in the 2016–2019 period at similar unemployment levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors develop a Heterogeneous Agent New Keynesian (HANK) model with a frictional labor market featuring on-the-job search (OJS). Workers are heterogeneous in wealth (mutual fund shares), human capital, match-specific productivity, and endogenous piece-rate wages. Human capital stochastically appreciates when employed and depreciates when unemployed, capturing scarring effects and job-stayer wage growth. Wage determination follows a Bertrand competition protocol based on flow output: workers switch to higher-productivity matches and extract the full surplus from the new firm, while outside offers from lower-productivity firms can still trigger rebargaining with the incumbent firm and raise the piece rate without a job switch. Three vertically integrated sectors — labor services, intermediate goods, and final goods — are linked so that the real price of labor services pl is the real marginal cost for intermediate firms and the sole driver of inflation in the New Keynesian Phillips curve (absent aggregate productivity shocks). The economy is subject to AR(1) shocks to the discount rate β (demand), aggregate labor productivity z (supply), and OJS efficiency ν (the relative search efficiency of employed workers). The model is solved using the Sequence-Space Jacobian (SSJ) method, extended to handle discretized worker distributions as direct inputs to equilibrium conditions.&lt;/p&gt;
&lt;p&gt;The model is calibrated to U.S. pre-Great Recession data (2004–2006), targeting the fraction of hand-to-mouth individuals (16 percent of SIPP sample), unemployment rate (5.1 percent), EU separation rate (3.8 percent quarterly), EE rate (2 percent quarterly from LEHD), earnings drop upon job loss (35 percent), wage growth of job switchers (9 percent), and the labor share (0.67). Shock processes are estimated by minimizing deviations from empirical correlations and standard deviations of output, unemployment, EE rate, and inflation over 1995:Q3–2008:Q4.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — positive analysis.&lt;/strong&gt; Shocks to OJS efficiency account for 43.1 percent of fluctuations in inflation in the variance decomposition, and 78.7 percent of fluctuations in the EE rate. The mechanism: a higher OJS efficiency lowers the expected match value EJ for labor services firms through three channels — (i) a compositional shift toward employed job seekers who extract the entire match surplus, (ii) shorter expected match duration as workers face higher poaching probabilities, and (iii) more frequent wage rebargaining where outside offers bid up wages without accompanying productivity gains. To maintain the free-entry condition, the real price of labor services pl must rise, increasing the real marginal cost and inflation. This direct labor market effect explains 139 percent of the total increase in pl; general equilibrium effects through reduced tightness θ — which raises expected match values by making vacancies easier to fill and workers less likely to be poached — offset −42 percent; the remainder (3 percent) comes from real rate changes driven by the monetary policy reaction.&lt;/p&gt;
&lt;p&gt;In two historical simulations, muted OJS efficiency during 2016–2019 generated approximately 0.23 percentage points lower annualized inflation at the peak relative to a counterfactual economy with the same unemployment path but an endogenously rising EE rate. Conversely, elevated OJS efficiency during 2021–2022 generated approximately 0.56 percentage points higher annualized inflation compared to the flat-EE-rate counterfactual. The paper notes that strong worker mobility accounts for roughly 10 percent of the approximately 6 percentage point total rise in annual inflation during the COVID-19 recovery episode.&lt;/p&gt;
&lt;p&gt;An important cross-model comparison shows that the Representative Agent New Keynesian (RANK) version of the model overestimates the decline in demand, output, and labor market tightness upon a positive OJS shock, and underestimates the rise in real rate, marginal cost, and inflation. Household heterogeneity is therefore quantitatively important: hand-to-mouth households&amp;rsquo; demand responds directly to labor income increases from job switches, mitigating the demand decline and amplifying inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings — normative analysis.&lt;/strong&gt; The optimal monetary policy within an augmented Taylor rule — adding an EE gap term ΦEE(EEt − EE*) alongside the standard inflation and unemployment gap terms — prescribes Φ*_u = −3.18 and Φ*_EE = 2.22 (with Φπ fixed at 1.5). This yields a 78.7 percent reduction in the central bank loss relative to the baseline Taylor rule. A policy that ignores EE dynamics and optimizes only the unemployment gap coefficient (finding Φu = −2.71, ΦEE = 0) produces a 12 percent larger central bank loss than the full optimal policy. In terms of welfare, the optimal policy delivers 0.16 percent additional lifetime consumption equivalent in the aggregate. Workers at the bottom of the match quality distribution gain the most (0.24 percent), as do the unemployed (0.20 percent), while those at the top of the wealth distribution gain the least due to larger share price fluctuations under the more aggressive policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results are derived conditional on a dual-mandate central bank objective (variance of inflation and output gaps), within a class of Taylor-type rules (not fully optimal Ramsey policy), under first-order approximation around a non-stochastic steady state. The historical simulations abstract from supply shocks active in the normative exercises and assume the economy starts from steady state in 2016.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the OJS efficiency shock, and how does it differ from a standard demand or supply shock?&lt;/strong&gt;
An OJS efficiency shock is modeled as a time-varying shift in νt, the relative job search efficiency of employed workers compared with unemployed workers. Unlike demand shocks (discount rate β innovations) and productivity shocks (aggregate z innovations), which move inflation and unemployment in opposite directions under standard New Keynesian logic (divine coincidence), OJS efficiency shocks move inflation and unemployment in the same direction: a positive OJS shock raises inflation while also raising unemployment (because the higher real rate induced by the central bank&amp;rsquo;s reaction reduces demand and employment). This makes OJS shocks behave like cost-push shocks and introduces a genuine policy trade-off for a dual-mandate central bank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the three mechanisms through which higher OJS efficiency raises the real price of labor services, and what is the quantitative contribution of each?&lt;/strong&gt;
The decomposition (Figure 8) shows that the direct effect of ν on EJ — encompassing the composition channel (more employed job seekers who extract the full surplus), the match-duration channel (shorter expected match lives), and the wage rebargaining channel (outside offers raise wages without productivity gains) — explains 139 percent of the total increase in pl. The general equilibrium reduction in labor market tightness θ, which raises EJ and partially offsets the cost increase, explains −42 percent in total: −18 percent through increased supply of labor services L (productivity-enhancing job switches improve the match distribution) and −24 percent through reduced output Y (lower aggregate demand). Real rate effects account for the remaining 3 percent net (8 percent from the inflation channel and −5 percent from the unemployment channel). Labor market effects in total therefore explain 97 percent of the marginal cost increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Does the positive relationship between EE rates and inflation require wage increases upon job switches?&lt;/strong&gt;
No. The paper demonstrates (Section 2.4.2, Figure 3) that even when the piece rate for workers hired from unemployment is set to α = 0.95 (so that outside offers have negligible wage effects), a positive OJS efficiency shock still generates a decline in output and a rise in inflation in both the RANK and TANK models. Quantitatively, the inflation response is similar across the baseline and near-zero composition-channel specifications, confirming that the shorter expected match duration is the primary driver of the increase in the real price of labor services. The match duration channel operates independently of wage increases: firms anticipate shorter matches and require a higher flow price to break even on vacancy costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does household heterogeneity change the quantitative effects of OJS shocks relative to the RANK benchmark?&lt;/strong&gt;
Under a constant real rate, in the RANK model a higher OJS efficiency increases the real price of labor services and inflation but has no effect on aggregate demand or output (because higher labor income for the PIH household is exactly offset by lower firm profits). In the TANK model, hand-to-mouth households consume their entire labor income, so the rise in labor income from job switches directly boosts their demand, raising output and tightness and further amplifying inflation. Under an endogenous real rate, the RANK model overestimates the decline in demand and output, and underestimates the rise in real rate and inflation, compared with the TANK model. The TANK model requires a substantially larger equilibrium real rate increase to contain inflation because HtM households&amp;rsquo; demand is less elastic to the real rate than PIH households'.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How are aggregate shock processes estimated, and what share of inflation variance do OJS shocks explain?&lt;/strong&gt;
The six AR(1) parameters governing β, z, and ν (three persistence parameters ρj and three standard deviations σj) are estimated by minimizing the sum of squared deviations between model-generated and empirical moments: the autocorrelation of output; correlations of the unemployment rate, EE rate, and inflation with output; and standard deviations of output, unemployment rate, EE rate, and inflation. Data cover 1995:Q3–2008:Q4. Estimated values are ρβ = 0.909, ρz = 0.332, ρν = 0.936 and σβ = 0.001, σz = 0.002, σν = 0.003. The variance decomposition (Table 4) assigns 43.1 percent of inflation variance to OJS efficiency shocks ν, 52.0 percent to demand shocks β, and 4.9 percent to productivity shocks z.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the &amp;ldquo;missing inflation&amp;rdquo; during 2016–2019 quantified, and what is the counterfactual?&lt;/strong&gt;
The exercise simulates two economies both replicating the same unemployment path — a 15 percent decline in unemployment relative to its 5.2 percent steady state, spread linearly over 16 quarters, followed by mean reversion. The first economy uses only positive demand shocks, which generate an endogenously rising EE rate consistent with the historical unemployment-EE correlation. The second economy additionally introduces negative OJS efficiency shocks to keep the EE rate unchanged, as observed in the data during 2016–2019. Annualized inflation in the second economy is 0.23 percentage points lower at the peak (16 quarters after the shock), implying that had the EE rate risen normally, inflation would have been around 2 percent in 2019 rather than the observed 1.8 percent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How is the inflationary role of elevated EE transitions during 2021–2022 quantified?&lt;/strong&gt;
Using the same unemployment path as the 2016–2019 exercise, the COVID-19 recovery economy combines positive demand shocks with positive OJS efficiency shocks to replicate the observed 0.16 percentage point (8 percent above trend) increase in the EE rate. Comparing this economy to the flat-EE-rate economy from the prior exercise, the elevated EE rate generates 0.56 percentage points higher annualized inflation. Because annual inflation rose approximately 6 percentage points in the data during this episode, the model attributes roughly 10 percent of the total inflation increase to strong worker mobility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the optimal Taylor rule coefficients when EE dynamics are included, and what is the welfare cost of ignoring them?&lt;/strong&gt;
The optimal policy over the augmented Taylor rule it = i* + Φπ(πt − π*) + Φu(ut − u*) + ΦEE(EEt − EE*), with Φπ fixed at 1.5 and a dual-mandate loss function W = var(πt − π*) + 0.25·var(Yt − Y*), prescribes Φ*_u = −3.18 and Φ*_EE = 2.22. This reduces the central bank loss by 78.7 percent relative to the baseline rule (Φu = −0.25, ΦEE = 0). If the EE gap term is excluded and only the unemployment gap coefficient is re-optimized (finding Φu = −2.71), the central bank loss is 12 percent higher than under the full optimal policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the optimal policy affect macroeconomic volatility, and who gains most from it?&lt;/strong&gt;
Table 5 shows that the optimal policy substantially reduces volatility of inflation (standard deviation falls from 0.0013 to 0.0011), output (0.0059 to 0.0020), consumption (0.0059 to 0.0020), unemployment (0.0047 to 0.0013), labor market tightness (0.0600 to 0.0175), and the real marginal cost pl (0.0203 to 0.0081), at the cost of higher real rate volatility (0.0019 to 0.0033) and share price volatility (0.1975 to 0.3051). In terms of welfare (Table 6), the unemployed gain 0.20 percent in lifetime consumption equivalents (versus 0.15 percent for the employed), workers at the bottom quintile of match quality gain 0.24 percent (versus 0.16 percent at the top), and wealth-poor individuals in the bottom share quintile gain 0.23 percent (versus 0.11 percent at the top, whose gains are eroded by larger share price fluctuations).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the model extend the SSJ computational method, and why is this extension necessary?&lt;/strong&gt;
The standard SSJ method of Auclert, Bardoczy, Rognlie, and Straub (2021) handles settings where only scalar aggregates enter equilibrium conditions in sequence space. In this model, the discretized distributions of employed workers µE(h, x) and unemployed workers µU(h) at the job search stage enter directly into the expected match value EJ (because human capital and current match productivity determine output and wage levels upon new contacts), and the distribution λE(h, x, α) at the production stage enters into labor services firm profits ΓS. The authors treat worker distributions as histograms and compute Jacobians for each mass point, combining the SSJ method with Reiter (2009)-style projection. This substantially increases computation time but remains feasible, extending the SSJ method to multi-stage models with search frictions where endogenous distributions are state variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What are the three sources of wage growth in the HANK model, and what is their relevance for inflation dynamics?&lt;/strong&gt;
First, human capital h stochastically appreciates during employment (at rate πE = 0.018 per quarter, calibrated to annual job-stayer wage growth of approximately 2 percent), raising wages through a higher piece-rate base. Second, job switches to higher-productivity matches yield wage increases as the worker extracts the full surplus from the new firm (the new piece rate equals x/x&amp;rsquo;, the ratio of old to new match productivity). Third, outside offers with productivity x&amp;rsquo; satisfying αx &amp;lt; x&amp;rsquo; &amp;lt; x — not good enough to trigger a switch but better than the current bargaining threat — cause the incumbent firm to raise the piece rate to x&amp;rsquo;/x via rebargaining, increasing wages without a job change. The second and third channels are the ones directly affected by OJS efficiency shocks and are inflationary: they raise labor costs beyond productivity gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Why do OJS shocks have a shorter match duration channel even without wage increases?&lt;/strong&gt;
When OJS efficiency ν rises, each employed worker faces a higher probability νtf(θt) of contacting another firm each period. Even if wages do not change upon contact (as in the α = 0.95 robustness exercise), a labor services firm posting a vacancy expects that any match it forms will be shorter-lived: the worker is more likely to be poached in the future. This shortens the expected present discounted value of the match for the firm, reducing EJ. To satisfy the free-entry condition (expected profit = vacancy cost κ), the price of labor services pl must rise, increasing the real marginal cost and inflation. Figure 3 confirms a nearly identical inflationary response under α = 0.95 as under the baseline, isolating this match-duration mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;OJS efficiency shock (νt shock).&lt;/strong&gt; A time-varying shift in the relative job search efficiency of employed workers compared with unemployed workers. Modeled as an AR(1) process for νt (estimated persistence ρν = 0.936). An increase in νt raises the probability that employed workers contact outside firms each period, boosting the EE rate. In the model, this acts as a cost-push shock: it raises inflation and unemployment simultaneously, breaking divine coincidence and creating a policy trade-off for a dual-mandate central bank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expected match value (EJt).&lt;/strong&gt; The ex-ante expected value to a labor services firm of a filled vacancy, conditional on contacting a worker, defined as a weighted average of match values J across the pool of job seekers (unemployed and employed). The free-entry condition Vt = κ/q(θt) = EJt pins down the real price of labor services pl: when EJt declines (due to shorter match durations or compositional shifts toward high-surplus-extracting workers), pl must rise to maintain zero expected profit for vacancy posters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Composition channel.&lt;/strong&gt; The mechanism by which a rise in OJS efficiency shifts the composition of the job-seeker pool toward employed workers, who (under Bertrand competition) extract the entire flow surplus of a new match and receive wage equal to plF(h,x). Since firms receive zero rent from poached workers, an increase in the fraction of employed in the applicant pool lowers EJt and requires a compensatory increase in pl.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Match duration channel.&lt;/strong&gt; When OJS efficiency ν rises, each existing match faces a higher probability of dissolution because the worker is more likely to be poached. The reduced expected match duration lowers the present discounted value of a match for the firm (even holding wages fixed), reducing EJt and raising pl. Demonstrated as the primary driver of inflation in the α = 0.95 robustness exercise where wage increases upon job switches are near zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piece-rate α (endogenous).&lt;/strong&gt; The share of match output F(h,x) that the worker receives as wage, determined through Bertrand competition on flow output following Postel-Vinay and Robin (2002). A worker hired from unemployment starts at α = x̄/x&amp;rsquo; (where x̄ is the lowest match productivity). Job switches to higher-x&amp;rsquo; firms reset α = x/x&amp;rsquo;. Rebargaining upon a credible outside offer from a firm with αx &amp;lt; x̃ &amp;lt; x raises α to x̃/x. The piece rate endogenizes wage dynamics for switchers, stayers, and job losers, allowing the model to discipline these moments in the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Divine coincidence (and its breakdown under OJS shocks).&lt;/strong&gt; In standard New Keynesian models, demand and productivity shocks move inflation and unemployment gaps in opposite directions, so stabilizing inflation also stabilizes the output gap. OJS efficiency shocks break this property: they generate simultaneous increases in inflation and unemployment, introducing a genuine trade-off between the two mandates and making EE-augmented Taylor rules welfare-improving relative to rules that respond only to unemployment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sequence-Space Jacobian (SSJ) method with distributed worker states.&lt;/strong&gt; An extension of the Auclert, Bardoczy, Rognlie, and Straub (2021) computational method to settings where discretized distributions of workers (µE(h,x) and µU(h)) enter directly into equilibrium conditions — specifically into the free-entry condition via EJt and into firm profits. The authors treat distributions as histograms and compute Jacobians for each mass point, combining SSJ with Reiter (2009)-style projection to efficiently solve for transitional dynamics under aggregate uncertainty.&lt;/p&gt;</description></item><item><title>Latent Heterogeneity in the Marginal Propensity to Consume</title><link>https://macropaperwarehouse.com/papers/latent-heterogeneity-in-the-marginal-propensity-to-consume/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/latent-heterogeneity-in-the-marginal-propensity-to-consume/</guid><description>&lt;p&gt;Lewis, Melcangi, and Pilossoph estimate the unconditional distribution of the marginal propensity to consume (MPC) using the 2008 Economic Stimulus Act (ESA) rebate payments, deploying Gaussian mixture linear regression (GMLR) — a clustering regression approach — rather than the standard practice of interacting the rebate with observable household characteristics. The key methodological departure is that households are assigned to groups not by any presupposed observable, but by how well estimated group-specific MPCs describe each household&amp;rsquo;s actual consumption response; this allows recovery of the full unconditional MPC distribution, including heterogeneity driven by latent (unobservable) factors.&lt;/p&gt;
&lt;p&gt;Data come from the 2008 Consumer Expenditure Survey (CEX), which contains household-level expenditure data and supplemental questions on ESA payments. Identification exploits the quasi-random timing of rebate receipt, determined by the last two digits of recipients&amp;rsquo; Social Security Numbers, following the design of Parker, Souleles, Johnson, and McClelland (2013). The specification is updated following Borusyak et al. (2024) to avoid &amp;ldquo;forbidden comparisons&amp;rdquo; in staggered treatment settings. The number of groups G is selected by BIC, which selects G = 3 for total expenditures, confirmed by K-fold cross-validation.&lt;/p&gt;
&lt;p&gt;The main finding is substantial MPC heterogeneity. For total expenditures, the three estimated group-level MPCs are 0.04, 0.23, and 1.33, with population shares of 30%, 48%, and 23% respectively. The implied aggregate (share-weighted average) MPC is 0.42, compared to 0.24 in the homogeneous Parker et al. (2013) specification estimated on the same data. Splitting by consumption category: for nondurables, two groups have MPCs of 0.09 and 0.18, with roughly equal population shares, and the lower bound of 0.09 is statistically distinguishable from zero — evidence against strict adherence to the Permanent Income Hypothesis even among the lowest-MPC group. For durables, the MPC distribution is dichotomous: about 29% of households have a durable MPC statistically indistinguishable from zero, while 21% have an MPC of 0.67. The cross-good correlation between household-level nondurable and durable predicted MPCs is only 0.13, ruling out strong substitution but indicating weak complementarity.&lt;/p&gt;
&lt;p&gt;Turning to observable determinants, the paper finds that many household characteristics are individually correlated with estimated MPCs — including homeownership, mortgage status, income, and the average propensity to consume (APC) — despite the fact that the same dataset and similar identification strategies previously yielded insignificant relationships. Homeowners have significantly higher MPCs than renters; households with a mortgage have even higher MPCs than outright homeowners. In salary income, households in the top tercile spend 0.17 more per rebate dollar than the baseline group; households in the top tercile of non-salary income spend 0.19 more. However, in joint regressions, only two characteristics remain robustly and positively correlated with MPCs: total income (both salary and non-salary components) and the APC. The APC relationship is particularly notable: a one-percentage-point higher prior spending rate is associated with 0.19 additional cents spent per rebate dollar in the full multivariate specification.&lt;/p&gt;
&lt;p&gt;The paper identifies three groups in the joint income-APC space: &amp;ldquo;poor savers&amp;rdquo; (low income, low APC, lowest MPCs), an intermediate group (high income or high APC but not both), and &amp;ldquo;rich spenders&amp;rdquo; (high income and high APC, highest MPCs). The &amp;ldquo;rich spender&amp;rdquo; group has received little prior attention in consumption-savings models.&lt;/p&gt;
&lt;p&gt;Critically, observable characteristics jointly explain at most 8% of MPC variation (adjusted R-squared from a measurement-error correction). With 92% of MPC heterogeneity unexplained by standard observables, the authors conclude that a substantial share of variation reflects latent household traits — plausibly heterogeneity in discount rates or intertemporal elasticities of substitution. This finding also limits the practical scope for government targeting of fiscal transfers: because observable characteristics predict little MPC variation, any targeting strategy can exploit only a small fraction of the overall distribution.&lt;/p&gt;
&lt;p&gt;Scope conditions: results apply to household expenditure responses (marginal propensities to spend, not to consume in the strict sense) within one quarter of rebate receipt. The income-MPC positive correlation is confined to households within the income range eligible for the 2008 ESA (phased out above $150,000 for joint filers). The sample excludes the top and bottom 1.5% of consumption changes as outliers.&lt;/p&gt;
&lt;p&gt;Q: What is the core methodological innovation of this paper?
A: The paper applies Gaussian mixture linear regression (GMLR) to the 2008 tax rebate setting, jointly estimating group-level MPCs and household group membership probabilities without imposing any prior restriction on which observable characteristics drive heterogeneity. Because groups are determined by how well group-specific MPCs explain consumption patterns rather than by presupposed observables, the method recovers the full unconditional distribution of MPCs, including latent heterogeneity. This contrasts with sample-splitting approaches that can only recover co-variation with chosen characteristics.&lt;/p&gt;
&lt;p&gt;Q: What are the three group-level MPCs for total expenditures, and what shares of the population do they represent?
A: The three estimated MPCs are 0.04 (30% of households), 0.23 (48%), and 1.33 (23%), all with precisely estimated group shares (standard errors of 0.01). The largest MPC of 1.33 is statistically significant at the 1% level. The lowest MPC of 0.04 is not statistically different from zero even under the more favorable conditional standard errors that treat group assignment as known.&lt;/p&gt;
&lt;p&gt;Q: How does the average MPC implied by the GMLR distribution compare to the homogeneous specification?
A: The share-weighted average MPC from the three-group GMLR is 0.42, compared to 0.24 from the homogeneous (G=1) specification on the same data and identification strategy. This gap arises partly because the homogeneous estimate averages across households with very heterogeneous responses, and partly because the distribution has a right-skewed tail with a meaningful mass at MPC above 1.&lt;/p&gt;
&lt;p&gt;Q: What are the MPC distributions for nondurable and durable goods separately?
A: For nondurables, BIC selects two groups with MPCs of 0.09 and 0.18 and roughly equal population shares (48% and 52%); crucially, the lower bound of 0.09 is statistically distinguishable from zero at the 5% level, providing evidence that no household strictly follows the Permanent Income Hypothesis for nondurables. For durables, BIC selects three groups: MPCs of 0.03 (not distinguishable from zero, 29% of households), 0.15 (50%), and 0.67 (21%), reflecting the discrete, lumpy nature of durable goods purchases.&lt;/p&gt;
&lt;p&gt;Q: How correlated are nondurable and durable MPCs at the household level?
A: The correlation between household-level posterior predicted MPCs for nondurables and durables is 0.13, statistically significant at the 1% level. This rules out substitution between goods categories, but the positive complementarity is quantitatively small. The authors interpret this as possibly reflecting a small share of &amp;ldquo;spender&amp;rdquo; types who adjust multiple consumption categories in response to transitory income shocks.&lt;/p&gt;
&lt;p&gt;Q: Which observable characteristics are individually correlated with MPCs?
A: Homeowners have significantly higher MPCs than renters; households with a mortgage display even greater MPCs than outright homeowners. Both salary and non-salary income are positively correlated: households in the top tercile of salary income have MPCs about 0.13 higher than the omitted group, and top-tercile non-salary income households have MPCs about 0.015 higher (though the latter is individually less precisely estimated). The average propensity to consume (APC) is significantly positively correlated with the MPC, with a coefficient of 0.075 in univariate regression and 0.166 in the full joint specification.&lt;/p&gt;
&lt;p&gt;Q: Which observable characteristics remain significant in the joint (multivariate) regression?
A: When all household characteristics are included jointly, only income (both salary and non-salary components) and the APC remain robustly and positively correlated with MPCs. Top-tercile salary income is associated with 0.112 higher MPCs and top-tercile non-salary income with 0.049 higher MPCs, while the APC coefficient rises to 0.166 (from 0.075 univariate). Homeownership, age, education, and most demographic controls become statistically insignificant in the joint specification.&lt;/p&gt;
&lt;p&gt;Q: What fraction of MPC variation is explained by observable characteristics?
A: The adjusted R-squared from the full multivariate regression of predicted MPCs on all observable characteristics is approximately 6%. After a measurement-error correction proposed in Supplement A.6 to account for noise in estimated posterior MPCs, the corrected R-squared rises to 8%. Either way, the vast majority — over 90% — of MPC heterogeneity is unexplained by standard observables, implicating latent household traits such as heterogeneous discount rates or intertemporal elasticities of substitution.&lt;/p&gt;
&lt;p&gt;Q: How does the extent of MPC heterogeneity recovered by GMLR compare to sample-splitting on observables?
A: Table 4 shows that splitting by age terciles yields MPC estimates ranging from 0.13 to 0.34; splitting by total income yields a range of 0.18 to 0.45; splitting by the APC yields 0.06 to 0.21. All of these ranges are far narrower than the GMLR-recovered range of 0.04 to 1.33. The authors argue that sample-splitting on individual observables, which are noisy and correlated with only a portion of MPC heterogeneity, systematically understates the true extent of heterogeneity.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;rich spender&amp;rdquo; finding and why is it theoretically notable?
A: Households with both high total income and a high prior average propensity to consume have the largest MPCs. This &amp;ldquo;rich spender&amp;rdquo; group is poorly accommodated by standard consumption-savings models: the canonical one-asset incomplete markets model typically predicts a negative MPC-APC correlation conditional on income, and the two-asset Kaplan-Violante (2014) model can generate wealthy hand-to-mouth households with high income and high MPCs, but not necessarily high APCs. Preference heterogeneity — e.g., heterogeneous intertemporal elasticities of substitution as in Aguiar, Boar, and Bils (2019) — can rationalize the positive income-APC-MPC nexus.&lt;/p&gt;
&lt;p&gt;Q: What explains the positive income-MPC correlation, and how does the paper relate it to the prior literature?
A: The paper notes that this positive correlation is consistent with Kueng (2018), who finds higher spending propensities among high-income recipients of Alaska Permanent Fund payments, and rationalizes it via near-rationality or mental accounting: when a rebate is small relative to income, the perceived cost of deviating from consumption smoothing is low. The authors also note that low-income households still exhibit large absolute MPCs, suggesting sizable deviations from consumption smoothing at the bottom of the income distribution, even if relatively lower than for high-income households.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for targeting fiscal transfers?
A: The paper finds that the 2008 ESA increased spending for all households in partial equilibrium (minimum group MPC of 0.04, nondurable lower bound 0.09, all statistically positive or near-positive). Among observable characteristics, targeting relatively higher-income households (including retirees and entrepreneurs via non-salary income) would maximize aggregate consumption effects. However, since observables explain only 8% of MPC variation, any targeting strategy can exploit only a small fraction of the overall heterogeneity; the government faces fundamental limits on feasible targeting. This also implies a tension between stimulus and distributional/insurance motives for transfer programs.&lt;/p&gt;
&lt;p&gt;Q: How does the paper confirm that recovered heterogeneity is not spurious?
A: The authors generate 250 Monte Carlo samples from the estimated homogeneous model, impose G=3, and re-run the GMLR and observable regressions; they find significant relationships with observable characteristics in virtually none of these samples. Additionally, applying the BIC to homogeneous Monte Carlo samples, the BIC selects G=1 in all 250 samples, confirming that the selected G=3 in actual data reflects genuine heterogeneity rather than overfitting.&lt;/p&gt;
&lt;p&gt;Q: How does GMLR compare to quantile regression for recovering the MPC distribution?
A: Quantile regression (as used by Misra and Surico (2014) on the same data) recovers relationships at percentiles of the overall conditional distribution of consumption changes, so the ranking of households is driven by all sources of variation in consumption, not just the rebate response. If factors unrelated to the rebate dominate the conditional distribution, MPC heterogeneity will be underestimated in the presence of noise. The authors illustrate this formally in Supplement B and note that Misra and Surico (2014) find a substantial share of MPCs at or below zero for nondurables, in contrast to the GMLR lower bound of 0.09 that is statistically positive.&lt;/p&gt;
&lt;p&gt;Q: What do the longer-run (lagged) MPC estimates show?
A: The specification includes up to two lags of rebate indicators, allowing measurement of spending responses in subsequent quarters after rebate receipt. The paper reports these results (Section 4.4) but the text provided does not fully detail them; the heterogeneous structure is maintained across horizons.&lt;/p&gt;
&lt;p&gt;Gaussian Mixture Linear Regression (GMLR): A probabilistic clustering regression approach that jointly estimates group-specific regression coefficients (here, MPCs) and population group shares by maximizing an expected log-likelihood via the EM algorithm. Households receive continuous posterior weights (gamma_{jg}) reflecting uncertainty about their group membership rather than binary hard assignment, with identification from a Gaussianity assumption on within-group errors.&lt;/p&gt;
&lt;p&gt;Unconditional MPC Distribution: The full marginal distribution of MPCs across all households in the population, capturing heterogeneity from both observable and latent (unobservable) sources. Contrasted in the paper with the conditional distributions recovered by sample-splitting on observables, which by construction can only reflect co-variation with the chosen splitting variable.&lt;/p&gt;
&lt;p&gt;Posterior Predicted MPC: For each household, the expectation of the group-specific MPC weighted by the household&amp;rsquo;s posterior group membership probabilities (lambda-tilde_{0,j} = sum_g gamma_{jg} lambda_{0g}). This object is the optimal (MSE-minimizing) individual-level MPC prediction and is the relevant input for targeted fiscal policy design.&lt;/p&gt;
&lt;p&gt;Latent Heterogeneity: MPC variation that cannot be attributed to any observable household characteristic and is instead driven by unobserved traits — plausibly heterogeneous discount rates, intertemporal elasticities of substitution, or other preference parameters. Operationalized as the share of MPC variance unexplained by observable regressors (approximately 92% in this paper).&lt;/p&gt;
&lt;p&gt;Rich Spenders: A group identified jointly in the APC-income space: households with both high total income and a high average propensity to consume, displaying the largest marginal propensities to consume out of the rebate. This group is not well-accommodated by standard one-asset or two-asset incomplete markets models under homogeneous preferences.&lt;/p&gt;
&lt;p&gt;Average Propensity to Consume (APC): Defined empirically as average lagged consumption expenditures divided by total income, intended to capture persistent preference heterogeneity — a &amp;ldquo;spender type&amp;rdquo; — by measuring how much of income a household habitually spends before receiving the rebate. A one-percentage-point higher APC is associated with 0.19 additional cents spent per rebate dollar in the full multivariate specification.&lt;/p&gt;
&lt;p&gt;Forbidden Comparisons: A bias identified by Borusyak et al. (2024) in event-study designs with staggered treatment, arising when newly treated units are compared to previously treated units rather than true controls. The paper addresses this by regressing consumption changes on rebate receipt indicators (iota_{jl}) directly rather than on rebate amounts, and including lagged rebate indicators to account for persistent effects.&lt;/p&gt;</description></item><item><title>Lender concentration of external debts and sudden stops</title><link>https://macropaperwarehouse.com/papers/lender-concentration-of-external-debts-and-sudden-stops/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/lender-concentration-of-external-debts-and-sudden-stops/</guid><description>&lt;h1 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h1&gt;
&lt;h2 id="research-question"&gt;Research Question&lt;/h2&gt;
&lt;p&gt;This paper studies how the lender structure of external debt — specifically, the degree to which a borrowing country&amp;rsquo;s external debt is concentrated among a small number of large lenders — affects open economies&amp;rsquo; credit conditions, borrowing behavior, and the severity of sudden stops.&lt;/p&gt;
&lt;h2 id="core-mechanism"&gt;Core Mechanism&lt;/h2&gt;
&lt;p&gt;The paper argues that the pecuniary externality arising from collateral foreclosure can be internalized not only by borrowers (as in the standard Bianchi 2011 framework) but also by lenders. When a large lender holds a substantial share of total loans, it has an incentive to foreclose only partially on seized collateral. Selling foreclosed collateral injects asset supply and depresses the collateral price; a sufficiently large lender internalizes this price impact and therefore restrains foreclosure. Atomistic lenders, by contrast, take the collateral price as given and sell all seized collateral (foreclosure rate = 1). Consequently, concentrating external debt in fewer, larger lenders supports a higher collateral price during financial downturns. This higher collateral price raises borrowing capacity, weakens borrowers&amp;rsquo; precautionary saving motive, and causes them to overborrow relative to the social optimum.&lt;/p&gt;
&lt;h2 id="empirical-evidence"&gt;Empirical Evidence&lt;/h2&gt;
&lt;p&gt;Using FFIEC 009a data — quarterly exposure of individual U.S. banks to the external debts of other countries, covering 2003Q1–2022Q2 — the paper documents two new empirical facts. First, lender concentration of emerging countries&amp;rsquo; external debt has been considerably higher than that of advanced countries since the Global Financial Crisis. The average difference in the mean top-3 lender concentration (LTop3) between emerging and advanced economies is 0.11 (= 0.93 − 0.82), with a t-statistic of 13.87. Second, higher lender concentration alleviates sudden stop events in terms of both current account reversal and the decline in asset price proxies. In a difference-in-differences specification interacting sudden stop indicators with lagged lender concentration, the coefficient on the interaction term is negative and statistically significant across all concentration measures. A one-standard-deviation increase in LTop3 (7.2 percentage points) results in a 2.6 percentage point reduction in current account-to-GDP reversal during sudden stops, constituting 7.5% of the overall sudden stop increase. Lender concentration also mitigates real effective exchange rate depreciation during sudden stops, consistent with the mechanism operating through the collateral price channel. Results hold when controlling for rollover risk motives.&lt;/p&gt;
&lt;h2 id="model"&gt;Model&lt;/h2&gt;
&lt;p&gt;The model extends a standard small open economy DSGE framework (Bianchi 2011) by introducing one large lender who holds share eta of total loans and internalizes the pecuniary externality of collateral foreclosure, alongside atomistic lenders who hold share (1 − eta) and take the collateral price as given. When tradable endowment falls short of debt obligations (foreclosure state), lenders optimally choose their foreclosure rate: atomistic lenders set foreclosure rate = 1 (sell all seized collateral), while the large lender sets foreclosure rate &amp;lt; 1 (partial foreclosure to maintain the collateral price). Higher lender concentration (larger eta) leads to lower aggregate foreclosure, less collateral sold, a higher nontradable goods price, a higher borrowing capacity, more tradable consumption, and a weaker precautionary saving motive — generating overborrowing relative to the social planner&amp;rsquo;s allocation.&lt;/p&gt;
&lt;p&gt;Two channels through which concentration affects overborrowing are identified: (1) a debt capacity channel, whereby concentration raises the nontradable price in foreclosure states and thereby increases borrowing capacity; and (2) an amplification channel, whereby concentration steepens the decline in nontradable price per unit fall in tradable consumption, amplifying the pecuniary externality that the social planner internalizes.&lt;/p&gt;
&lt;h2 id="quantitative-results-calibrated-to-argentina"&gt;Quantitative Results (Calibrated to Argentina)&lt;/h2&gt;
&lt;p&gt;In the competitive equilibrium, agents encounter foreclosure with probability 2%, and the large lender sells two-thirds of seized collateral. The social planner&amp;rsquo;s allocation eliminates foreclosure entirely. The social planner&amp;rsquo;s allocation can be implemented via a state-dependent debt tax; the implied consumption-equivalent welfare gain is 0.78%. The pecuniary externality internalized by lenders is estimated to equal two-thirds of the externality internalized by borrowers. Overborrowing is increasing in lender concentration.&lt;/p&gt;
&lt;h2 id="optimal-lender-structure"&gt;Optimal Lender Structure&lt;/h2&gt;
&lt;p&gt;When lender countries optimally choose their lender structure, they select further concentration relative to the baseline in order to gain higher foreclosure repayment. Under optimal lender structure, domestic agents consume and borrow more and encounter sudden stops with higher probability, but completely avoid foreclosure events. Borrower welfare improves by 0.1% in consumption-equivalent terms relative to the baseline competitive equilibrium. The paper concludes that managing lender structure benefits both sides of the international credit market, and notes that policies targeting creditor coordination — such as collective action clauses — may be insufficient to fully correct the efficiency implications of lender structure.&lt;/p&gt;
&lt;h2 id="key-implication"&gt;Key Implication&lt;/h2&gt;
&lt;p&gt;Because lender concentration alleviates crisis severity, emerging economies (which are documented to have substantially more concentrated lender structures than advanced economies) face a reduced precautionary saving motive and therefore tend to overborrow more than advanced economies, compounding their vulnerability to sudden stops.&lt;/p&gt;
&lt;h1 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h1&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the paper&amp;rsquo;s central departure from the Bianchi (2011) sudden stops framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The standard Bianchi (2011) model features atomistic lenders who take the collateral price as given, so the pecuniary externality of collateral fire-sales is internalized only by the borrower&amp;rsquo;s social planner. This paper introduces a large lender who holds a non-trivial share eta of total loans and therefore internalizes the price impact of selling foreclosed collateral. This creates a second source of pecuniary externality internalization — on the lender side — that is absent from the canonical framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why do atomistic lenders sell all seized collateral, while the large lender does not?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Atomistic lenders take the collateral price as given and therefore face no downside from selling their entire share of seized collateral — they cannot individually affect the price. The large lender, holding share eta of total loans, recognizes that selling a large quantity of collateral depresses the nontradable goods price, which reduces the value of any remaining collateral claims. It therefore optimally sets foreclosure rate &amp;lt; 1, retaining some seized collateral to support the equilibrium price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the two channels through which lender concentration amplifies overborrowing, and how do they differ?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The debt capacity channel operates in foreclosure states: higher concentration reduces foreclosure, raises the nontradable price, and increases the collateral value that backs borrowing. This directly expands the borrowing capacity available to agents and weakens their precautionary saving motive. The amplification channel operates through the slope of the nontradable price response: greater concentration steepens the decline in the nontradable price per unit fall in tradable consumption, which amplifies the pecuniary externality that the social planner internalizes. The two channels reinforce each other in driving overborrowing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What empirical dataset is used, and what does it measure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper uses FFIEC 009a data, which records the quarterly exposure of individual U.S. banks to the external debts of other countries, covering 2003Q1–2022Q2. From these data, the paper constructs lender concentration measures — including LTop3, the combined share of the top three lenders — at the borrowing-country level for each quarter.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the quantitative magnitude of the lender concentration gap between emerging and advanced economies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The average difference in mean top-3 lender concentration (LTop3) between emerging countries and advanced countries is 0.11 (= 0.93 − 0.82), and this difference is highly statistically significant, with a t-statistic of 13.87. This gap emerged and persisted notably since the Global Financial Crisis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does lender concentration affect sudden stop severity in the empirical specification, and how large is the effect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper estimates a difference-in-differences specification in which current account reversal (and other sudden stop outcome variables) is regressed on a sudden stop indicator, lagged lender concentration, and their interaction, with country and time fixed effects. The coefficient on the interaction term is negative and statistically significant across all concentration measures. A one-standard-deviation increase in LTop3 (7.2 percentage points) reduces current account-to-GDP reversal by 2.6 percentage points, which corresponds to 7.5% of the overall increase in the current account during a sudden stop episode.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Does higher lender concentration also mitigate exchange rate and asset price pressures during sudden stops?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Lender concentration is also found to mitigate real effective exchange rate depreciation during sudden stops, which is consistent with the model&amp;rsquo;s proposed mechanism: higher concentration supports the collateral (nontradable goods) price, which in turn limits the depreciation of the real exchange rate. The paper reports results on asset price proxy declines as well.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the welfare cost of overborrowing under the baseline calibration to Argentina?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The social planner&amp;rsquo;s allocation, implemented by a state-dependent debt tax, delivers a consumption-equivalent welfare gain of 0.78% relative to the competitive equilibrium. This measures the efficiency cost of overborrowing under the calibrated model in which the large lender sells two-thirds of seized collateral and competitive equilibrium agents encounter foreclosure with probability 2%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How large is the lender-side pecuniary externality relative to the borrower-side externality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under the baseline calibration, the pecuniary externality internalized by lenders is estimated to be two-thirds of the externality internalized by borrowers. This is described as a &amp;ldquo;plausible parameterization,&amp;rdquo; meaning that lender-side internalization of the externality is quantitatively substantial relative to the classic borrower-side effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the optimal lender structure exercise find, and what does it imply for welfare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When lender countries are allowed to optimally choose lender structure, they select a more concentrated structure than the baseline in order to maximize foreclosure repayment. Under this optimal structure, domestic (borrowing-country) agents consume and borrow more, face sudden stops with higher probability, but completely avoid foreclosure events. Borrower welfare improves by 0.1% in consumption-equivalent terms relative to the baseline competitive equilibrium. This implies that concentrating lender structure can be mutually beneficial for both sides of the international credit market.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Why might collective action clauses be insufficient to correct the efficiency implications of lender structure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Collective action clauses are policies designed to improve creditor coordination in sovereign debt restructuring. The paper argues that the efficiency distortions arising from lender structure go beyond pure coordination failures: because a concentrated lender structure generates welfare-relevant pecuniary externalities through the collateral price channel — affecting overborrowing and crisis severity — addressing creditor coordination alone is insufficient to fully resolve these inefficiencies.&lt;/p&gt;
&lt;h1 id="key-concepts"&gt;Key Concepts&lt;/h1&gt;
&lt;p&gt;&lt;strong&gt;Lender concentration (LTop3):&lt;/strong&gt; The combined loan share held by the top three lenders in a borrowing country&amp;rsquo;s external debt. Measured using FFIEC 009a data. Used as the primary empirical proxy for the degree to which external debt is concentrated in a few large creditors rather than dispersed among many atomistic lenders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pecuniary externality (lender-side):&lt;/strong&gt; The price impact that a large lender imposes on the collateral market when selling foreclosed assets. Unlike in the standard Bianchi (2011) framework where only borrowers (via the social planner) internalize this externality, a sufficiently large lender also internalizes it by restraining collateral sales to support the collateral price.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Foreclosure rate (zeta):&lt;/strong&gt; The fraction of seized collateral that a lender sells after foreclosure. Atomistic lenders set zeta = 1 (sell everything); the large lender sets zeta &amp;lt; 1 (partial foreclosure) to prevent collateral price depression. The aggregate foreclosure rate is a weighted average across lender types.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overborrowing:&lt;/strong&gt; Borrowing in excess of the social planner&amp;rsquo;s optimal level, arising because competitive equilibrium agents do not internalize the pecuniary externality of their borrowing on the collateral price. In this model, overborrowing is increasing in lender concentration because a more concentrated lender structure supports a higher collateral price, reducing precautionary saving.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sudden stop:&lt;/strong&gt; An abrupt reversal of capital inflows to an emerging economy, typically associated with a sharp current account reversal, real exchange rate depreciation, and a decline in asset prices. In the model, sudden stops are associated with foreclosure states in which tradable endowment falls short of debt obligations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt capacity channel:&lt;/strong&gt; The mechanism by which higher lender concentration raises the nontradable goods price in foreclosure states, thereby increasing the collateral value and expanding agents&amp;rsquo; borrowing capacity, which weakens the precautionary saving motive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Amplification channel:&lt;/strong&gt; The mechanism by which higher lender concentration steepens the slope of the nontradable price response to a fall in tradable consumption, amplifying the magnitude of the pecuniary externality that the social planner internalizes and thus increasing the social planner&amp;rsquo;s incentive to restrict borrowing.&lt;/p&gt;</description></item><item><title>Leveraging Virtual Contact and Social Networks to Foster Interethnic Harmony</title><link>https://macropaperwarehouse.com/papers/leveraging-virtual-contact-and-social-networks-to-foster-interethnic-harmony/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/leveraging-virtual-contact-and-social-networks-to-foster-interethnic-harmony/</guid><description>&lt;p&gt;This paper investigates whether virtual contact — exposure to an outgroup through a documentary film — can promote interethnic harmony, and whether targeting network-central individuals amplifies effects on untreated community members. The study addresses a context of deep, historically rooted discrimination: the Santal ethnic minority in northwestern Bangladesh have faced colonial-era land dispossession, ongoing violence, labor market discrimination, and structural exclusion by the Bengali ethnic majority. The Santals are the second-largest ethnic-minority group in Bangladesh; in the study villages, their share ranges from 13% to 83% of the population.&lt;/p&gt;
&lt;p&gt;The authors conducted a cluster-randomized field experiment across 121 multiethnic villages in the Rajshahi and Naogaon districts of Bangladesh, involving over 3,300 households. Villages were randomly assigned to three arms: a random treatment arm (RR, 40 villages, N=562 Bengalis) in which approximately 14 randomly selected ethnic-majority households per village watched a 45-minute documentary film (&amp;ldquo;Ami Santal&amp;rdquo; / &amp;ldquo;I Am Santal&amp;rdquo;) portraying Santal culture, economic hardships, and aspirations; a central treatment arm (41 villages) in which approximately 7 randomly selected Bengalis (RC) and 7 network-central Bengalis identified via a diffusion-centrality nomination exercise (CC) watched the same film; and a control arm (40 villages) in which households watched a placebo documentary on flower farming. The documentary, costing approximately $13 per participant, was screened individually at participants&amp;rsquo; homes on tablets. Data were collected at baseline (September–October 2022), first end line approximately 3 months post-screening (February–March 2023), and a casual-work field experiment second end line approximately 4.5–5 months post-screening (April–May 2023). Outcomes were measured via lab-in-the-field experiments (dictator game, solidarity game), an experimentally validated interethnic trust survey item (Falk et al. 2018), self-reported behaviors, administrative police complaint data, and facial emotion detection during screening.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, treated Bengalis in the central arm (RC) gave 14.7% more in the dictator game (p &amp;lt; .01) and exhibited 21.7% greater trust toward Santals (p &amp;lt; .01) compared to controls; RR participants showed a 7.1% increase in solidarity game giving (p &amp;lt; .10) and 11.8% greater trust (p &amp;lt; .01). Effects on reducing negative stereotypes and discriminatory opinions were not statistically significant, suggesting that affective components of prejudice are more responsive to the intervention than cognitive components. About 82% of treated Bengalis reported acquiring new information about Santals, primarily regarding occupational struggles, educational aspirations, and economic potential. Facial expression analysis using emotion-detection software found sadness to be significantly more prevalent among viewers (p &amp;lt; .05), particularly among network-central participants, consistent with an empathetic response.&lt;/p&gt;
&lt;p&gt;Second, untreated Bengalis in the central arm — who never watched the documentary — showed 20.9% higher altruism (p &amp;lt; .10), 27.3% higher solidarity (p &amp;lt; .05), and 8.1% higher trust (p &amp;lt; .05) toward Santals relative to controls. No significant effects on untreated Bengalis were found in the random arm. Untreated Santals in both arms exhibited greater trust toward Bengalis (11% increase in random arm, p &amp;lt; .05; 21.7% increase in central arm, p &amp;lt; .01) and higher subjective well-being (p &amp;lt; .01 in both arms). Village-level administrative data show a significant reduction in Bengali police complaints against Santals post-intervention (p &amp;lt; .05), but only in the central arm.&lt;/p&gt;
&lt;p&gt;Third, in the casual-work field experiment, multiethnic pairs jointly produced paper bags under piece-rate compensation. Overall productivity increased approximately 5% (p &amp;lt; .05) in the central arm only. Both Bengali and Santal workers increased productivity specifically in the finisher role — the most critical role for determining earnings — in the central arm. The authors interpret Bengali productivity gains as reflecting increased prosociality toward Santal co-workers, and Santal productivity gains as reflecting conformism or peer pressure in response to Bengali effort. The scope of all effects is limited to multiethnic villages in northwestern Bangladesh, a context of historically severe and ongoing majority-minority inequality; the intervention deliberately did not challenge the socioeconomic hierarchy of the villages.&lt;/p&gt;
&lt;p&gt;Q: What was the documentary film&amp;rsquo;s content and design rationale?
A: The 45-minute film &amp;ldquo;Ami Santal&amp;rdquo; featured three narrative layers: Santal culture (rituals, cuisine, the Baha festival), economic hardships (housing, water access, low incomes, labor market struggles, educational barriers), and aspirational stories of Santals who achieved success. All stories were narrated by non-actor local Santals, filmed outside the study region, and deliberately avoided attributing blame to Bengalis. The film was designed under the supervision of anthropologists at the University of Rajshahi to maintain ethnographic authenticity and a non-moralistic, observational tone (moral judgment language was much lower than in comparison Bangladeshi documentaries and general films, per LIWC-22 analysis).&lt;/p&gt;
&lt;p&gt;Q: How were network-central individuals identified and why might targeting them matter?
A: In central-arm villages, enumerators surveyed approximately 18–20 randomly selected passers-by at village markets and asked them to nominate the 15 people most effective at disseminating information. The seven most consistently and highly ranked individuals per village were selected as network-central (CC). These individuals were expected to have high diffusion centrality — meaning information they receive spreads widely — so targeting them with the documentary could shift attitudes and behavior among untreated community members through persuasion, visibility, credibility, or diffusion (the paper cannot separately identify which mechanism operates).&lt;/p&gt;
&lt;p&gt;Q: What were the primary behavioral effects on treated Bengalis (the ethnic majority who watched the film)?
A: Randomly selected participants in the central arm (RC) gave 14.7% more in the dictator game (p &amp;lt; .01) and 8% more in the solidarity game (not statistically significant), and exhibited 21.7% greater trust toward Santals (p &amp;lt; .01), all relative to controls. In the random arm (RR), participants showed a 6.4% increase in dictator game giving (not statistically significant), a 7.1% increase in solidarity game giving (p &amp;lt; .10), and 11.8% greater trust toward Santals (p &amp;lt; .01). Effects on self-reported behaviors — interethnic friendships, social interactions, amount charged to minorities for water — were not statistically significant.&lt;/p&gt;
&lt;p&gt;Q: Did the intervention change Bengali stereotypes or discriminatory opinions toward Santals?
A: No. Despite treated Bengalis acquiring substantial new information (approximately 82% reported learning new things, primarily about Santal occupational struggles and educational aspirations), the authors find no significant effects on the stereotypes index or the discriminatory-opinions index among treated Bengalis. They propose two explanations: cognitive components of prejudice (stereotypes) are harder to change through indirect contact than affective components (emotions, prosocial behavior), consistent with Tropp and Pettigrew (2005) and Turner, Crisp, and Lambert (2007); and a single documentary may be insufficient to counter deeply ingrained generational biases due to resistance to change.&lt;/p&gt;
&lt;p&gt;Q: What emotional responses did the documentary elicit, and how was this measured?
A: Field assistants took candid photographs of participants&amp;rsquo; faces at a random point during the screening; these were analyzed using Emotimeter software (machine learning-based emotion detection) that assigns scores across seven emotion categories summing to 100%. Sadness was significantly more prevalent among documentary viewers compared to placebo viewers (p &amp;lt; .05), particularly among network-central participants (CC). The authors interpret this as consistent with an empathetic response to the film&amp;rsquo;s content about Santal hardships, and connect it to increased prosocial behavior via emotion-regulation mechanisms (alleviating sadness through prosocial action).&lt;/p&gt;
&lt;p&gt;Q: What were the spillover effects on untreated Bengalis in the central arm?
A: Untreated Bengalis in central-arm villages — who never watched the documentary — showed 20.9% higher altruism (p &amp;lt; .10), 27.3% higher solidarity (p &amp;lt; .05), and 8.1% higher trust toward Santals (p &amp;lt; .05) relative to controls. By contrast, untreated Bengalis in random-arm villages showed no statistically significant effects on any of these outcomes. The authors attribute the central-arm spillovers to the presence of network-central individuals being treated in those villages, though whether these patterns reflect persuasion, visibility, credibility, or information diffusion cannot be separately identified.&lt;/p&gt;
&lt;p&gt;Q: How did the intervention affect the Santal ethnic minority (who never watched the documentary)?
A: Untreated Santals in both arms exhibited greater trust toward Bengalis: an 11% increase in the random arm (p &amp;lt; .05) and a 21.7% increase in the central arm (p &amp;lt; .01) compared to controls. Santals in both arms also reported higher subjective well-being (p &amp;lt; .01). A weakly significant increase in food security was observed among Santals in the central arm (p &amp;lt; .10), possibly reflecting increased material support from Bengalis. No statistically significant effects were found on Santal altruism or solidarity.&lt;/p&gt;
&lt;p&gt;Q: What did the village-level administrative complaint data show?
A: Using data collected from two police stations covering all 121 villages, the authors find a significant reduction in Bengali complaints against Santals post-intervention in the central arm (p &amp;lt; .05). No significant reduction was found in Santals&amp;rsquo; complaints against Bengalis (p &amp;gt; .10) in any arm. Data from village counselors&amp;rsquo; offices (shalish arbitration complaints) showed no significant change in any arm. The distinction matters because police complaints involve more serious, violent matters, while village-counselor complaints involve routine arbitration.&lt;/p&gt;
&lt;p&gt;Q: How was the casual-work field experiment designed, and what did it find?
A: Approximately 4.5 months after the documentary screenings, 720 participants (360 Bengalis, 360 Santals) drawn equally from the three study arms were paired into multiethnic dyads to jointly produce paper bags for a local supplier under piece-rate compensation, with earnings split equally. One worker was randomly assigned the preparer role and the other the finisher role; roles were switched halfway through the three-hour session. The paper finds an approximately 5% overall productivity increase (p &amp;lt; .05) in the central arm only, concentrated in the finisher role (the role most critical for final output). Bengalis and Santals both increased productivity specifically as finishers in the central arm.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms explain the productivity effects in the casual-work experiment?
A: For Bengali finishers, the productivity gain is interpreted as prosocial behavior: treated Bengalis who showed greater altruism toward Santals worked harder to increase the earnings of their Santal co-workers. For Santal finishers, the productivity gain is interpreted as conformism or peer pressure: Santals increased effort more when they worked as finisher after swapping roles (i.e., after observing Bengalis&amp;rsquo; higher effort as finisher first), suggesting responsiveness to the higher productivity of Bengalis rather than an independent prosocial motivation. The authors present a simple theoretical model to formalize these interpretations, citing Rotemberg (1994) on prosocial effort and Kandel and Lazear (1992) and Mas and Moretti (2009) on peer pressure mechanisms.&lt;/p&gt;
&lt;p&gt;Q: Why was virtual rather than direct contact used in this intervention?
A: The authors argue that encouraging direct contact between Bengalis and Santals in this setting carries specific risks: the unequal status of the groups may generate anxiety during interactions, potentially limiting engagement or provoking backlash. By contrast, the documentary provides an indirect, low-cost ($13 per participant) form of contact that presents Santal lives without disrupting the socioeconomic hierarchy of the villages and without attributing blame to Bengalis. The film&amp;rsquo;s entertaining veneer and emotional storytelling make it more scalable and logistically feasible in contexts where direct contact is socially difficult or impractical.&lt;/p&gt;
&lt;p&gt;Q: What are the primary limitations acknowledged by the authors?
A: The authors acknowledge that the study&amp;rsquo;s sampling protocol relied on a door-to-door skip procedure without systematic records of approached households, raising the possibility of convenience or snowball-type recruitment and potential deviations from random sampling — this is reflected in some imbalances in baseline characteristics across arms. CC-control comparisons are explicitly descriptive (not causal) because network-central individuals were selected on centrality. Differential attrition was found among untreated Santals (both treatment arms had significantly lower attrition than control, p &amp;lt; .05), which could bias estimates for that subgroup. The authors cannot separately identify the mechanisms (persuasion, visibility, credibility, diffusion) underlying spillover effects in central villages.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of this study?
A: The findings suggest that media-based virtual contact interventions are a low-cost, scalable tool for improving interethnic prosociality even in contexts of deep-rooted discrimination where direct contact may be socially impractical. Targeting network-central individuals — identified via a simple nomination exercise requiring no pre-existing network data — amplifies village-wide effects, including among untreated community members and the minority group itself. The productivity gains in multiethnic work teams imply that improved interethnic relations can have tangible economic consequences beyond attitudinal change. However, the null effects on stereotypes and discriminatory opinions suggest that single documentary interventions may not be sufficient to alter deep-seated cognitive biases, and more intensive or repeated interventions may be needed to achieve durable attitude change.&lt;/p&gt;
&lt;p&gt;Virtual contact: Indirect exposure to an ethnic outgroup through a documentary film, as distinct from direct intergroup contact; posited to influence majority-group attitudes and behavior by increasing empathy and identification with the outgroup without requiring face-to-face interaction.&lt;/p&gt;
&lt;p&gt;Diffusion centrality: A network measure of how effectively an individual can spread information through a community, operationalized via a nomination exercise in which community members identify those best positioned to disseminate information; used to select the seven highest-ranked individuals per village for targeted treatment.&lt;/p&gt;
&lt;p&gt;Prosociality (altruism and solidarity): Measured using incentivized lab-in-the-field games — the dictator game (unilateral allocation of an endowment to a passive outgroup recipient) and the solidarity game (precommitted transfers to an outgroup member who may incur a random loss) — capturing willingness to benefit non-coethnic others at personal cost.&lt;/p&gt;
&lt;p&gt;Affective versus cognitive components of prejudice: A distinction between emotional aspects of prejudice (feelings, empathy) — which the authors find to be more responsive to the documentary intervention — and cognitive aspects (negative stereotypes, discriminatory opinions) — which show no significant change despite new information acquisition.&lt;/p&gt;
&lt;p&gt;Spillover effects (untreated individuals): Changes in behavior or attitudes among community members who did not directly receive the intervention (did not watch the documentary), attributed to the influence of treated individuals in their village, particularly network-central individuals in the central arm.&lt;/p&gt;
&lt;p&gt;Piece-rate casual-work field experiment: A second end line in which multiethnic pairs of Bengali and Santal workers jointly produced paper bags for a local supplier, with individual earnings determined by joint piece-rate output; designed to measure whether improved interethnic attitudes translated into higher workplace productivity in ethnically mixed teams.&lt;/p&gt;
&lt;p&gt;Source text origin: The provenance classification of the text used to generate a paper summary (full PDF, open-access HTML, or abstract only); the paper&amp;rsquo;s pipeline rules impose a hard block on abstract-only summarization.&lt;/p&gt;</description></item><item><title>Life-Cycle Wages and Human Capital Investments: Selection and Missing Data</title><link>https://macropaperwarehouse.com/papers/life-cycle-wages-and-human-capital-investments-selection-and-missing-data/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/life-cycle-wages-and-human-capital-investments-selection-and-missing-data/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 &amp;ndash; Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how wage inequalities build up over the life cycle when individual wage trajectories are plagued by interruptions in private-sector participation, and when the standard Missing At Random (MAR) assumption used to handle those gaps may be violated. Specifically, it asks: what is the causal effect of career interruptions on both the level and the dispersion of wages after twenty years of potential experience, and does endogeneity of those interruptions matter for the dispersion result?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Sample&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses the 2011 DADS Grand Format-EDP panel, a French administrative dataset merging social security records (DADS) and census extracts (EDP). The working sample covers males who entered the private sector between 1985 and 1992, aged 16-30 at entry, and observed through 2011. The authors require at least 15 years of observed private-sector wages, yielding a working sample of 7,004 males and 137,315 person-year observations. Education is grouped into four levels (high-school dropouts, high-school graduates, some college, college graduates). Participation outside the private sector &amp;ndash; including public-sector employment, self-employment, unemployment, and non-employment &amp;ndash; constitutes the &amp;ldquo;alternative sector&amp;rdquo; and generates missing wage observations. On average, cumulative duration outside the private sector is 3.7 years, and the average number of interruptions is 1.44.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper builds on a structural Ben Porath (1967) human capital model extended to two sectors (private sector and an alternative sector), yielding a reduced-form log-wage equation with five individual-specific coefficients: an intercept (initial human capital), a linear trend in potential experience (growth rate), a curvature term in potential experience (Mincer concavity), the cumulative years of interruptions, and a curvature term in interruptions. Because parameters are individual-specific, the wage equation is a random-coefficient model estimated with a fixed-effects approach.&lt;/p&gt;
&lt;p&gt;Selection into the private sector is addressed not by a standard MAR assumption but by a weaker &amp;ldquo;Missing At Random Conditionally On Factors&amp;rdquo; (MARCOF) assumption. Sector-preference shocks, human capital prices, and depreciation rates are each decomposed into a common factor (time-varying) and an individual factor loading, plus a residual that is mean-independent of factors and loadings. Conditional on factors and factor loadings, wage residuals and sector choices are independent, making covariates &amp;ndash; including the interruption variables &amp;ndash; exogenous. The preferred specification includes two unobserved factors, selected by four of six Bai-Ng (2002) information criteria.&lt;/p&gt;
&lt;p&gt;Estimation proceeds via an Expectation-Maximization (EM) algorithm adapted from Bai (2009) and Song (2013), with initial values from Moon and Weidner (2018)&amp;rsquo;s nuclear-norm convex estimator. Because individual parameters converge at rate sqrt(T) and summary statistics of their distributions suffer from incidental-parameter bias, the authors use bias-correction methods from Jochmans and Weidner (2019) for quantiles and inter-decile ranges, and from Arellano and Bonhomme (2012) for variances. Monte Carlo experiments confirm that variances remain poorly corrected even when T &amp;gt; 20, so the paper focuses on inter-decile ranges as the dispersion measure.&lt;/p&gt;
&lt;p&gt;Counterfactual &amp;ldquo;average structural functions&amp;rdquo; (Blundell and Powell, 2003) are constructed by holding individual parameters fixed and manipulating the history of interruptions. These compare four scenarios: the observed benchmark, the counterfactual with no interruptions (potential wage), the counterfactual with no current-period selection, and both combined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Downward bias from omitting interruptions and factors.&lt;/em&gt; Omitting interruption variables and unobserved factors strongly downward biases estimated returns to experience after 20 years. Most of this bias is attributable to interruptions rather than to the interactive factor effects: selectivity is mainly captured through the interruption channel, not through residual factor structure.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Effect on mean wages.&lt;/em&gt; Potential experience increases log wages by approximately 65% over 20 years, consistent with cross-country evidence from homogeneous Mincer equations. The average cost of interruptions after 20 years is approximately 10% of log wages. Reassigning interruptions to the beginning of the working life has a persistent negative effect on mean log wages that never fully recovers over 20 years, while reassigning them to the end increases mean wages above the no-interruption benchmark at every experience level.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Effect on wage dispersion &amp;ndash; a new stylized fact.&lt;/em&gt; Interruptions decrease, not increase, the inter-decile range of log wages after 20 years. After 20 years, with an average interruption duration of 2.47 years, interruptions decrease the inter-decile range by 0.52 log points (approximately 38%). This compression operates differentially: the 90th percentile falls by 0.34 and the 10th percentile rises by 0.18.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Endogeneity explains the dispersion compression.&lt;/em&gt; When years of interruption are randomly reassigned across time (holding total interruption years fixed), the inter-decile range diverges upward from the observed benchmark after about 5 years. This shows that the dispersion-reducing effect of actual interruptions is due to the endogenous timing of those interruptions &amp;ndash; specifically to the negative correlation between the timing of interruptions and potential log wages &amp;ndash; rather than to the correlation between the structural coefficients on interruptions and potential wages (which is also negative, with a Spearman rank correlation of -0.32 between eta_i1 and eta_i3). Endogenously chosen interruptions smooth inequality over time.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;em&gt;Current-period selection is negligible.&lt;/em&gt; Current-period selection into private-sector employment has no statistically significant effect on median, mean, variance, or inter-decile range of wages at any experience level, as confirmed by the small inter-decile range of the interactive factor component.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to cohorts of French males entering the private sector between 1985 and 1992, restricted to those with at least 15 observed private-sector years. The French context is distinctive: wage inequality in the working population was stable over 1985-2011, driven in part by minimum wage policy and payroll tax exemptions for lower-skilled workers, in contrast to rising inequality in the United States and Germany. Results on timing of interruptions (eta_i3 and eta_i4) are identified only for individuals with at least two interruptions followed by re-entry (roughly those with K_T &amp;gt;= 2). The paper does not analyze female wages.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 &amp;ndash; Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the structural model and how does it generate a reduced-form wage equation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is a Ben Porath (1967) two-sector human capital model in which individuals divide time between investing in human capital and earning wages in either the private sector (e) or an alternative sector (n). Human capital accumulation in each sector has a sector-specific return rate (rho^s) and depreciation (lambda^s_t). Period utility is log income minus a quadratic investment cost, plus a sector preference shock. Solving the dynamic program backwards (because of log-linearity) yields closed-form optimal investments that are linear in the individual-specific terminal value of human capital (kappa). The resulting log-wage equation (Proposition 5) is a function of five terms: an intercept (eta_i0), a linear trend in potential experience t (eta_i1), a geometric curvature term beta^{-t} (eta_i2), cumulative years of interruptions x^(3)_it (eta_i3), and a curvature in interruptions x^(4)_it (eta_i4), all with individual-specific coefficients. This provides a tractable random-coefficient structure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the MARCOF assumption and why is it weaker than MAR?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;MARCOF &amp;ndash; Missing At Random Conditionally On Factors &amp;ndash; posits that sector-preference shocks, human capital prices, and depreciation rates each follow factor structures: a common time-varying factor (phi_t) multiplied by an individual loading (theta_i) plus an i.i.d. residual. The residuals are assumed mean-independent of factors and loadings, and independent over time. Under standard MAR, missingness is assumed independent of outcomes conditional on observables alone. Under MARCOF, residuals in the wage equation and the sector choice equation are independent conditional on (unobserved) factors and factor loadings. This is weaker than MAR because it allows the unobservable determinants of wages and participation to share common factors, accommodating the high persistence observed in human capital stocks (20-year lag correlation of 0.28, far above the geometric decay benchmark of 0.024).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How are the individual-specific parameters identified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under exogenous selection (or, under MARCOF, conditional on factors), identification of eta_i0, eta_i1, and eta_i2 requires variation in potential experience within the individual&amp;rsquo;s time series. Identification of eta_i3 and eta_i4 separately requires individuals to experience at least two spells out of the private sector each followed by re-entry (at least four transitions, so K_T &amp;gt;= 2). An individual with only one interruption spell generates proportional variation in x^(3) and x^(4), so only a linear combination of eta_i3 and eta_i4 is identified. The &amp;ldquo;flat spot&amp;rdquo; approach &amp;ndash; using the observed fact that individuals aged 50-55 have stopped investing in human capital &amp;ndash; separately identifies time, cohort, and age effects and provides the restriction that factors are orthogonal to the level, trend, and curvature in potential experience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What do the distributions of estimated individual-specific coefficients look like?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Focusing on the main (two-factor) specification with bias correction: the median of the growth parameter eta_i1 is positive (consistent with rising wages with experience) and the median of the curvature parameter eta_i2 is negative (consistent with concavity). However, heterogeneity is substantial: the 90th percentile of eta_i1 is 6.2 times the median, and the first quartile of eta_i1 is negative (implying declining potential wages for a non-negligible share). For the interruption coefficients eta_i3 (year of interruptions) and eta_i4 (curvature), bias-corrected medians are close to zero in the sub-sample with &amp;gt;=2 interruptions, but dispersion is large and symmetric around zero. Bias correction reduces the 90th percentile of eta_i1 by approximately 20% and reduces the absolute 10th percentile of eta_i3 by approximately 27%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How important are interruptions relative to potential experience and factors in explaining wage variation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A wage decomposition using inter-decile ranges (preferred over variance due to bias) shows that the potential experience component is the largest contributor to wage dispersion, followed by the interruption component (described as &amp;ldquo;sizable&amp;rdquo;), while factors play a minor role. Crucially, the potential experience and interruption components are highly negatively rank-correlated: the Spearman rank correlation between the growth coefficient eta_i1 and the interruption coefficient eta_i3 is -0.32. This negative correlation is central to understanding why interruptions compress dispersion rather than expanding it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the finding on the effect of interruptions on mean wages, and what does the timing experiment show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;After 20 years, the average cost of interruptions (relative to a counterfactual of no interruptions) is approximately 10% of log wages. The timing of interruptions matters: reassigning interruptions to the beginning of the working life causes a persistent loss in mean log wages that does not fully recover over the 20-year horizon, while reassigning them to the end raises mean log wages above the no-interruption level at every experience level. For median wages, the early-interruption loss is eventually recovered (median log wages do catch up), but the mean does not catch up. These asymmetries are consistent with early interruptions having a larger negative effect on human capital accumulation due to the geometric structure of investment returns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the key finding on wage dispersion and what explains it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Interruptions compress the inter-decile range of log wages by 0.52 log points (approximately 38%) after 20 years, with average interruption duration of 2.47 years. This compression is asymmetric: the 90th percentile of wages falls by 0.34 and the 10th percentile rises by 0.18. The dispersion-reducing effect is established by comparing the benchmark (observed interruptions) to the counterfactual of no interruptions. When interruptions are instead randomly reassigned across time (holding total interruption duration fixed), the inter-decile range diverges upward from the benchmark starting around 5 years of experience. This demonstrates that the compression is due to the endogenous timing of interruptions &amp;ndash; individuals who have high potential wages tend to time their interruptions in ways that reduce the measured spread of actual wages &amp;ndash; rather than to the negative structural coefficient (eta_i3 &amp;lt; 0 for high-wage workers on average).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the paper handle the incidental parameter problem for distributional statistics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because individual parameters are estimated at rate sqrt(T) and the panel is unbalanced (some individuals observed for as few as 15 years while the model has up to 7 individual parameters), standard distributional statistics like the variance suffer from substantial incidental parameter bias. Monte Carlo experiments show that bias-corrected variance estimates remain strongly biased even at T &amp;gt; 20. Inter-decile ranges are better behaved and the Jochmans and Weidner (2019) bias-correction procedure reduces their bias satisfactorily. This is why the paper reports inter-decile ranges as its primary dispersion measure rather than variances. The bias in corrected inter-decile ranges is at most approximately 10% of the uncorrected estimate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the paper show about the MAR assumption in the context of this data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The results directly challenge the MAR assumption that is standard in the life-cycle earnings literature. Under MAR, interruptions would be treated as random conditional on observables, and their endogeneity would be ignored. The paper shows that treating interruptions as endogenous (through the MARCOF + structural model approach) substantially changes estimated returns to experience (there is a strong downward bias when interruptions and factors are omitted) and reverses the sign of the effect of interruptions on dispersion (under exogenous interruptions, randomly reassigned, dispersion would be higher than observed; the actual compression is an artifact of endogenous timing). The conclusion is that MAR assumptions produce systematically misleading pictures of life-cycle wage inequality dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the robustness and external validity considerations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The working sample excludes individuals observed fewer than 15 years. A robustness exercise compares the subsample observed 10-14 years to a censored version of the 20+ subsample with matched marginal distributions of observation counts. Median profiles for the uncensored and censored 20+ samples are similar, and inter-decile ranges are slightly more dispersed in the censored sample only for potential experience greater than 7. However, the 10-14 year sample shows substantially different patterns &amp;ndash; larger median gaps between benchmark and no-interruption cases, and a larger inter-decile range &amp;ndash; consistent with lower private-sector returns to human capital for that group. The authors conclude that selection into the 15+ working sample matters, and results are explicitly restricted to that working sample. The French context (stable aggregate wage inequality, minimum wage policy) limits direct comparability to countries with rising inequality.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;MARCOF (Missing At Random Conditionally On Factors):&lt;/strong&gt; The paper&amp;rsquo;s central identifying assumption, weaker than standard MAR. It posits that sector-preference shocks, human capital prices, and depreciation rates follow factor structures (common time-varying factor x individual loading + i.i.d. residual), and that residuals are mean-independent of factors, loadings, and their own histories. Conditional on factors and loadings, wage residuals and sector-choice residuals are independent, making selection exogenous.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Interactive effects / factor structure for selection:&lt;/strong&gt; An approach in which unobserved confounders are modeled as a bilinear product of time-varying common factors (phi_t) and individual factor loadings (theta_i). This allows flexible correlation between wage processes and participation choices without requiring exclusion restrictions or instrumental variables. The paper&amp;rsquo;s preferred specification uses two unobserved factors identified by Bai-Ng information criteria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average structural functions:&lt;/strong&gt; Objects defined by Blundell and Powell (2003) that integrate counterfactual outcomes (wages evaluated at a manipulated interruption history) over the distribution of individual-specific parameters. They allow estimation of the causal impact of a change in interruption timing or presence while holding individual structural parameters fixed, under identification conditions analogous to those of Chernozhukov et al. (2013).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Individual-specific coefficients (random coefficients):&lt;/strong&gt; The five parameters (eta_i0, eta_i1, eta_i2, eta_i3, eta_i4) governing each individual&amp;rsquo;s wage equation, with structural interpretations: initial log human capital, return to potential experience, curvature (Mincer concavity), effect of cumulative interruption years, and curvature in interruptions. Their individual-specificity is the source of the incidental parameter problem for distributional statistics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Flat spot approach:&lt;/strong&gt; An identification device (from Heckman, Lochner, and Taber, 1998; Bowlus and Robinson, 2012) that uses median wages of workers aged 50-55 &amp;ndash; who are assumed to have stopped investing in human capital &amp;ndash; as consistent estimates of human capital prices by education group and year. This separates the volume of human capital from its price, and provides the restriction identifying the level, trend, and curvature factors from the time-varying unobserved factors phi_t.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Interruption variables x^(3) and x^(4):&lt;/strong&gt; Reduced-form variables derived from the structural model summarizing the history of private-sector participation gaps. x^(3)_it is the cumulative number of periods spent in the alternative sector prior to date t; x^(4)_it is a geometric-weighted version of those interruptions that reflects the timing (early vs. late) through the discount factor beta. They enter the wage equation with individual-specific coefficients that are identified only for workers with at least two complete interruption spells.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mincer dip:&lt;/strong&gt; A U-shaped profile in wage variance (or inter-decile range) over potential experience, predicted by the Ben Porath model because high-return workers invest more at the start of their careers (reducing current wages), causing their wage profile to cross below then above low-return workers. Estimated in this paper at approximately 5 years of potential experience under the main specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incidental parameter bias in distributional statistics:&lt;/strong&gt; The bias that arises when estimating moments or quantiles of the distribution of individual-specific parameters that converge at rate sqrt(T) rather than sqrt(N). The paper shows through Monte Carlo experiments that variance estimates remain substantially biased even after Arellano-Bonhomme (2012) correction when T &amp;gt;= 20, while inter-decile ranges corrected by Jochmans-Weidner (2019) are more reliable.&lt;/p&gt;</description></item><item><title>Life-cycle worker flows and cross-country differences in aggregate employment</title><link>https://macropaperwarehouse.com/papers/life-cycle-worker-flows-and-cross-country-differences-in-aggregate-employment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/life-cycle-worker-flows-and-cross-country-differences-in-aggregate-employment/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; The paper asks: what are the sources of cross-country differences in aggregate employment across European economies, and which types of worker flows — between employment (E), unemployment (U), and nonparticipation (N) — drive those differences? The authors pay particular attention to heterogeneity by gender and age, motivated by the observation that cross-country employment dispersion is concentrated among women, youth, and older workers, and that a large portion of the dispersion is traceable to differences in labor force participation rather than unemployment rates alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The empirical analysis draws on microdata from the EU Statistics on Income and Living Conditions (EU-SILC), an annual survey covering 32 European countries for 2004–2019. Germany is covered using the German Socio-Economic Panel (GSOEP, 2003–2018) because GSOEP longitudinal coverage begins earlier. The combined sample contains 7,064,306 individual-year observations for 2,221,672 individuals. Labor force status is recorded monthly via a retrospective calendar; transition probabilities are estimated at the quarterly frequency after correcting for measurement error (a &amp;ldquo;de-NUN-ification&amp;rdquo; procedure following Elsby et al. [2015]) and time-aggregation bias (Shimer [2012]).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology — empirical.&lt;/strong&gt; Six quarterly transition probabilities among E, U, and N are estimated by gender and single year of age (16–65). The life-cycle profile of each probability is extracted nonparametrically by regressing age-time cells on age and time dummies, removing business-cycle variation. To decompose cross-country employment differences into contributions of the six transition rates while handling the path-dependence of the decomposition (6! = 720 possible orderings), the authors apply the Shapley-Owen decomposition, which assigns to each transition rate its average marginal contribution across all orderings. An initial first-pass decomposition allocates the aggregate employment gap between any two countries into three parts: demographics, initial conditions (distribution across E, U, N at age 16), and transition probabilities. Transition probabilities account for 93–105% of the cross-country variance in aggregate employment, while demographics and initial conditions together explain less than 10%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology — structural model.&lt;/strong&gt; The authors build a life-cycle Diamond-Mortensen-Pissarides (DMP) model with three labor market states, calibrated separately by gender and country for France, Germany, Italy, Spain, and the U.K. — the five largest economies in the sample. A key feature is that all primitives (technology, search and matching) are age-independent; life-cycle variation in worker flows arises endogenously from the finite retirement horizon and from two search margins: (i) an &lt;em&gt;intensive margin&lt;/em&gt; — variable search intensity &lt;em&gt;s&lt;/em&gt; in [0,1] chosen optimally each period — and (ii) an &lt;em&gt;extensive margin&lt;/em&gt; — the endogenous labor force participation decision modeled as a discrete choice with i.i.d. extreme-value utility shocks. The model also incorporates permanent match quality (an experience good revealed stochastically with probability alpha per period following Jovanovic [1979]), transitory match-quality shocks (persistent AR(1) process), exogenous job-destruction shocks (per-period probability delta), a two-tier UI system, a two-tier EPL system capturing temporary vs. permanent contracts, and proportional value-added and social-security taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main empirical findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;For male workers, employment-to-unemployment (EU) transitions account for approximately half of the cross-country variance in aggregate male employment across all 32 countries, rising to about three-quarters when looking at the five largest economies, and exceeding 85% for prime-age males (ages 25–54). Transitions in the reverse direction (UE) explain less than 30% of the variance across all 32 countries and play almost no role among the five largest economies. The labor force participation margin (combining NE and EN transitions) explains a non-negligible 25–30% of the aggregate male employment gap.&lt;/li&gt;
&lt;li&gt;For female workers, at least half of the cross-country variance in employment is explained by participation-related flows, primarily transitions from nonparticipation to employment (NE). In the full 32-country sample, NE alone explains 65% of the variance in female employment rates across all ages (16–65). Its role is somewhat smaller in the five largest economies, where EN transitions also play a larger role. Crucially, the sum of NE and EN variance contributions for women is at least as large as the sum of UE and EU contributions, underlining the indispensability of a three-state model.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Main quantitative (model-based) findings.&lt;/strong&gt;
The model decomposes cross-country employment differences into technology (the distribution of permanent match quality, job-separation risk delta, and information frictions alpha), search parameters (vacancy costs, non-work utility, search-cost parameters), and policies (UI generosity, firing costs, taxes). The total employment variance across the five economies and two gender groups is 0.36 percentage points squared. Technology differences over-explain this variance (contribution of 0.65), while policies play almost no role (contribution of -0.04) and search frictions have a negative variance contribution (-0.25). The negative sign of search and policy contributions reflects the negative cross-country correlation between these factors and technology: countries with high employment rates (e.g., France) tend to have more generous UI and higher taxes, which the model attributes to compensating technology advantages. For individual countries: France is about 4.4 percentage points above the cross-country benchmark, driven by technology and partly offset by the highest replacement ratios and labor tax rates in the sample (67% and 56%, respectively). Spain is about 7 percentage points below the benchmark, driven by the lowest measured labor productivity (78% of Germany&amp;rsquo;s level) and the highest employment outflow rates (~4–5% per quarter vs. ~2% in France).&lt;/p&gt;
&lt;p&gt;The channels through which technology affects employment are predominantly the &lt;em&gt;employment inflows&lt;/em&gt;, not outflows. The exogenous job-separation risk delta affects aggregate employment mostly through its impact on expected duration of future employment spells, which reduces search incentives and job-finding rates from both unemployment and nonparticipation, and lowers labor force attachment. Similarly, mean permanent match quality (mu_x) and labor taxes (tau_ss) operate mainly through the inflow margin. Technology effects are amplified by search effort margins, particularly for women and youth: women face higher non-work utility (interpreted as labor-market frictions or opportunity costs), implying a lower employment surplus and therefore a higher surplus elasticity; for young workers, the long remaining horizon amplifies the effect of technology variations on discounted lifetime earnings, generating relatively higher search-effort responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; The analysis is confined to European countries. The structural decomposition covers only the five largest European economies. The authors acknowledge that parameters labeled as &amp;ldquo;job-separation risk&amp;rdquo; may also capture employment protection and temporary contracts not explicitly modeled, or non-monetary quit motives, so the attribution to &amp;ldquo;technology&amp;rdquo; should be interpreted with that caveat in mind. The model operates in a complete-markets, no-savings environment without on-the-job search.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What fraction of cross-country employment variance is explained by transition probabilities vs. demographics and initial conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the full 32-country sample, transition probabilities account for 94.7% of the cross-country variance in aggregate male employment and 99.9% for female employment. In the five largest economies, the corresponding figures are 93.5% (men) and 104.9% (women) — the slight excess above 100% reflects the negative contribution of initial conditions for women. Demographics and initial conditions together explain less than 10% of the variance, with somewhat larger demographic effects in Baltic and Eastern European countries, plausibly due to emigration-driven changes in age composition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: For male workers, which specific transition probability dominates the cross-country employment variance, and how does this vary by age and across country groupings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: EU (employment-to-unemployment) transitions account for approximately 51% of the cross-country variance in aggregate male employment (ages 16–65) across all 32 countries, rising to 77% in the five largest economies, and to 89% for prime-age males (ages 25–54) in the same group. By contrast, UE (job-finding from unemployment) explains at most 29% across all 32 countries and virtually nothing in the five largest economies. For prime-age men, EU remains dominant throughout; toward the end of the working life, EN (employment-to-nonparticipation) transitions become the main driver as workers move into retirement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: For female workers, what is the primary driver of cross-country employment variance, and does the pattern differ from men?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For women, transitions from nonparticipation to employment (NE) explain 65% of the cross-country variance in female employment across all ages in the 32-country sample. This dominance is more concentrated at ages 20–30, when participation entry is particularly heterogeneous across countries, likely reflecting fertility and child-rearing patterns. The sum of NE and EN contributions for women equals or exceeds the combined UE and EU contributions in both country groupings, demonstrating a fundamentally different demographic structure of employment differences for women relative to men.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the model generate life-cycle variation in transition rates despite having age-independent primitives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model produces age-varying transition rates through two mechanisms operating on age-independent fundamentals. First, variable search intensity declines as workers age because the remaining time to retirement shortens, reducing the expected lifetime returns to job search — the &amp;ldquo;horizon effect&amp;rdquo; (Cheron et al. [2011, 2013]). This mechanism explains virtually all of the life-cycle variation in the NE job-finding rate and an overwhelmingly large share of the variation in the UE rate, as shown by counterfactual exercises that fix search intensity at its life-cycle average. Second, information frictions about permanent match quality generate declining separation rates over the working life: young workers disproportionately hold matches with unrevealed quality and thus face higher reallocation risk upon quality revelation; as workers age, their employment share shifts toward matches with revealed quality, which have lower separation rates due to sorting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the structural decomposition (Table 7) reveal about the role of technology vs. policies in explaining cross-country employment differences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The variance decomposition in Table 7 shows that technology parameters (permanent match-quality distribution, job-separation risk delta, and match-quality revelation probability alpha) account for a variance contribution of 0.65 (against total employment variance of 0.36), over-explaining the cross-country dispersion. Labor market policies (UI benefits, firing costs, taxes) have a near-zero variance contribution of -0.04. Search parameters contribute -0.25. The result that policies explain little does not mean they have no level effect: in simple comparative statics, the model predicts that more generous UI and higher labor taxes lower employment. However, in the cross-country calibration, countries with higher employment rates tend to have more interventionist policies, so the cross-country correlation between policies and technology masks individual policy effects at the variance level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do technology effects propagate to employment differences through worker flows, and why is the inflow channel dominant?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 8 decomposes employment elasticities with respect to delta (job-separation risk), mu_x (mean log permanent match quality), and tau_ss (social security tax rate) into contributions from (i) the NE job-finding rate, (ii) the share of nonemployed in the labor force (labor force attachment, u-tilde), (iii) the differential between UE and NE rates, and (iv) the employment outflow rate (pEO). At the aggregate level, the separation risk delta has an employment elasticity of -0.28, of which the outflow contribution (dpEO = -0.08) is smaller in absolute magnitude than the sum of inflow contributions (dpNE = -0.06, du-tilde = -0.07, dpDelta = -0.06). Mean match quality mu_x has an employment elasticity of 0.53, primarily mediated through inflows. The mechanism is that changes in delta or mu_x alter expected lifetime earnings, which in turn change search incentives and participation decisions, generating correlated movements in job-finding rates and labor force attachment that amplify the employment impact beyond what a simple outflow change would imply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why do women and youth show larger search-effort responses to technology variations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For women, the calibrated non-work utility yo is higher in all five countries than for men (interpreting this as extra costs and wedges on the returns to working), which implies a smaller employment surplus. A smaller surplus generates a higher elasticity of surplus with respect to parameter changes, and since search intensity and participation decisions depend on expected surplus, women exhibit larger employment elasticities to technology variations. The aggregate employment elasticity of delta is -0.39 for women vs. -0.19 for men; for mu_x, it is 0.78 for women vs. 0.33 for men. For youth (ages 20–29), the long remaining horizon amplifies the effect of technology changes on discounted expected lifetime earnings, which in turn amplifies participation incentives: the labor force attachment channel (du-tilde) contributes -0.13 for youth compared to -0.07 at the aggregate, while dE = -0.31 for youth vs. -0.28 aggregate for delta.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the quantitative role of individual technology sub-components (match quality, job-separation risk, information frictions)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Panel B of Table 7 breaks down technology into three sub-components. Match quality (mean mu_x and variance sigma^2_x) and job-separation risk (delta) are the key drivers; the match-quality revelation probability (alpha, &amp;ldquo;match revelation&amp;rdquo;) plays almost no independent role (variance contribution approximately 0.00). For France, the primary positive technology contributor is mean match quality (consistent with France&amp;rsquo;s labor productivity slightly above the German benchmark). For Germany and the U.K., the low job-separation risk is the primary positive contributor. For Spain, the high job-separation risk — calibrated to match Spain&amp;rsquo;s employment outflow rate of around 4–5% per quarter versus 2% in France — is the main negative contributor, reflecting the widespread prevalence of temporary contracts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What role do labor market policies play at the country-specific level, even though they explain little cross-country variance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Panel C of Table 7 shows that employment protection legislation plays almost no role for any country. Labor taxes are quantitatively important: they explain the relatively high employment rate in the U.K. (the country with the lowest social security contribution rate, about 20%), contributing positively. In France, where labor taxes exceed 50% of the average wage, the policy contribution is strongly negative, roughly offsetting the large positive technology contribution. UI benefits lower aggregate employment — Italy, with calibrated UI benefits lower than France&amp;rsquo;s, has a smaller employment gap vis-a-vis the benchmark partly because of this. The finding that policies explain little variance while having large individual-country effects is explained by the negative cross-country correlation: countries with generous policies also tend to have favorable technology, so policy and technology contributions partially offset each other in the variance decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the model fit untargeted moments, particularly the empirical Shapley-Owen variance decomposition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model is calibrated to aggregate transition rates by gender, and to moments describing labor productivity, vacancy rates, and policy targets. Despite having age-independent primitives, the calibrated model captures the empirical life-cycle profiles of transition rates as untargeted moments: declining NE and UE rates with age, rising EN rates near retirement, and the hump-shaped patterns. More stringently, the model replicates the empirical Shapley-Owen variance decomposition: it correctly predicts that EU separations account for most of the employment variance for men, and that NE inflows are relatively more important for women and youth. A notable limitation is that the model overshoots the UN (unemployment-to-nonparticipation) transition rate for a significant share of data points — but the authors note that flows between U and N play almost no role in cross-country employment variance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the &amp;ldquo;horizon effect&amp;rdquo; and how does it operate in this model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The horizon effect, coined by Cheron et al. [2011, 2013] in a two-state (E/U) DMP model, refers to the phenomenon that as workers approach retirement, the expected returns to job search fall because the remaining period of employment is shorter. This reduces search intensity from both unemployment and nonparticipation, lowering job-finding rates, and in the present model also affects the match-acceptance probability: workers near retirement find it optimal to remain in unemployment to collect UI benefits rather than accept a job offer, further reducing the UE rate. The current paper generalizes this effect to a three-state setting by incorporating the labor force participation margin alongside search intensity, generating plausible declining job-finding rates and increasing EN rates at older ages from age-independent parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the paper handle the gender dimension in the model calibration?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model assumes that men and women share the same production and matching technology parameters within a country (A, cv, delta, alpha, mu_x, sigma^2_x, sigma^2_z), but allows the search-cost and non-work-utility parameters (ceu, cnu, cu, kappa_u, kappa_n, yo) to differ by gender. The gender-specific search parameters are identified from the gender-specific transition rates: for example, kappa_u (marginal search cost in unemployment) for women is inferred from the female UE transition rate, relative to the normalization for men. The non-work utility yo is consistently higher for women in all five countries, rationalizing lower female employment through a lower employment surplus. This generates a higher surplus elasticity for women, which in turn explains why women&amp;rsquo;s employment is more responsive to technology variations across countries.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shapley-Owen Decomposition.&lt;/strong&gt; A method from cooperative game theory (Shapley [1953], Owen [1977]) used here to decompose cross-country differences in employment into contributions of individual worker-flow transition rates (or structural parameters). It computes the marginal contribution of each component averaged over all 6! = 720 orderings of the six transition rates, yielding a unique, symmetric, exact decomposition that sums to the total employment gap. Unlike sequential decompositions, it is path-independent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive Margin of Search Effort.&lt;/strong&gt; The binary labor force participation decision: whether a nonemployed worker enters the unemployment state (and thus accesses the superior search technology at a flow cost) or remains in nonparticipation. In the paper&amp;rsquo;s model, this is captured as a discrete choice between states U and N, governed by i.i.d. extreme-value utility shocks, yielding a closed-form logit participation probability.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive Margin of Search Effort.&lt;/strong&gt; The continuous choice of search intensity s in [0,1] by nonemployed workers (both unemployed and nonparticipants), which scales the probability of meeting a vacancy per period. The optimal intensity equates the marginal cost of search (convex in s) to the marginal benefit (the expected surplus from meeting a firm times the contact rate). Search intensity declines with age because the remaining working life shortens, reducing the discounted value of a job.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Permanent Match Quality (x).&lt;/strong&gt; A time-invariant, match-specific productivity component drawn from a log-normal distribution upon meeting a firm, but initially unobserved by both worker and firm (an experience good). With per-period probability alpha, the quality is revealed; prior to revelation, the parties form expectations over the distribution. Revelation triggers reallocation of bad matches, generating a negative relation between job tenure and separation probability (following Jovanovic [1979]).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Horizon Effect.&lt;/strong&gt; The mechanism by which workers reduce search effort as they approach retirement because the expected present value of future employment spells shortens. In this paper the concept, coined by Cheron et al. [2011, 2013] in a two-state DMP setting, is extended to include the labor force participation margin: near-retirement workers not only search less intensively but also become more likely to choose nonparticipation (or to remain unemployed to collect benefits rather than accept a job), generating the observed life-cycle decline in job-finding rates from age-independent parameters.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technology Parameters (theta).&lt;/strong&gt; In the paper&amp;rsquo;s structural decomposition, &amp;ldquo;technology&amp;rdquo; refers specifically to the vector (mu_x, sigma^2_x, alpha, delta) — the mean and variance of log permanent match quality, the match-quality revelation probability, and the exogenous job-destruction probability. These are contrasted with search-cost parameters (phi) and policy parameters (psi). The label &amp;ldquo;technology&amp;rdquo; is acknowledged to potentially also capture employment protection and quit motives not explicitly modeled.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Life-Cycle DMP Model.&lt;/strong&gt; A finite-horizon version of the Diamond-Mortensen-Pissarides search-and-matching framework in which workers live for J periods, all primitives are age-independent, and life-cycle variation in worker flows arises endogenously from the interaction of the finite horizon with search intensity, labor force participation, and match-learning mechanisms. The model distinguishes three labor market states (E, U, N) and uses Nash bargaining to split the employment surplus.&lt;/p&gt;</description></item><item><title>Linking Social and Personal Preferences: Theory and Experiment</title><link>https://macropaperwarehouse.com/papers/linking-social-and-personal-preferences-theory-and-experiment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/linking-social-and-personal-preferences-theory-and-experiment/</guid><description>&lt;p&gt;This paper asks whether an individual&amp;rsquo;s attitude toward risk in the personal domain (choices affecting only oneself) can be linked to that same individual&amp;rsquo;s attitude toward risk in the social domain (choices affecting both oneself and others). The authors provide a theoretical answer in the form of necessary and sufficient conditions, and then test those conditions experimentally.&lt;/p&gt;
&lt;p&gt;The formal model posits a decision maker (DM) with a preference relation over lotteries on a set of social states, where a distinguished subset of states are personal (consequences for the DM alone). The authors assume preferences satisfy Completeness, Transitivity, Continuity, and State Monotonicity — the last being equivalent to respect for First-Order Stochastic Dominance (FOSD), a condition weaker than the Expected Utility Independence Axiom and satisfied by virtually all extant decision theories including Weighted Expected Utility, Rank-Dependent Utility, and Prospect Theory. The key theoretical result (Theorem 1) establishes that the full preference relation over all social lotteries can be uniquely deduced from the partial observations of (i) riskless social choices and (ii) risky personal choices if and only if the DM finds every social state indifferent to some personal state. When this condition fails, there exist social lotteries whose ranking cannot be recovered from the partial data.&lt;/p&gt;
&lt;p&gt;For two empirically relevant preference types, this condition generates directly testable predictions: for selfish subjects (who allocate nothing to others in deterministic social choices), risky personal preferences must coincide with risky social preferences; for impartial subjects (who treat self and other symmetrically in deterministic social choices), riskless social preferences must coincide with risky social preferences.&lt;/p&gt;
&lt;p&gt;The experiment was conducted at the University of Bergen and NHH Norwegian School of Economics with 276 undergraduate subjects. Each subject faced 50 budget-line choice problems in each of three domains: Personal Risk (equiprobable binary lotteries over own payoffs only), Social Choice (deterministic splits between self and an anonymous other), and Social Risk (equiprobable binary lotteries over symmetric payout pairs for self and other). The graphical interface of Choi et al. (2007b) was used throughout. One randomly selected decision per domain was paid out; each token was worth 1.2 NOK (approximately 0.2 USD), with average earnings of approximately 270 NOK.&lt;/p&gt;
&lt;p&gt;Within-domain consistency, measured by the Critical Cost Efficiency Index (CCEI), is high: mean CCEIs are 0.959, 0.952, and 0.902 in the Personal Risk, Social Choice, and Social Risk domains respectively. At the CCEI &amp;gt; 0.90 threshold, 89.9%, 85.9%, and 69.9% of subjects pass in the three domains. Using a 0.95 share-to-self threshold, 103 subjects (37.3%) are classified as selfish; using revealed-preference criteria at the 5% significance level, 33 subjects (12.0%) are classified as impartial.&lt;/p&gt;
&lt;p&gt;Testing is done via an individual-level nonparametric permutation test that draws 10,000 random data sets per subject and compares simulated CCEI distributions to actual cross-domain CCEIs, with Bonferroni correction. At the 1% significance level, the null that Personal Risk and Social Risk preferences coincide is rejected for only 5.9%–9.3% of selfish subjects (varying by classification threshold), compared with 14.7%–16.3% rejection rates for non-selfish subjects. For impartial subjects at the 1% level, the null that Social Choice and Social Risk preferences coincide is rejected for 0.0%–11.1%, compared with 19.8%–26.8% for non-impartial subjects. The theory&amp;rsquo;s predictions are thus supported for a large majority of both selfish and impartial subjects.&lt;/p&gt;
&lt;p&gt;A theoretical extension (Theorem 2) shows that if one additionally observes comparisons between social states and personal lotteries, unique deduction of the full preference relation requires that preferences in both personal and social domains satisfy Expected Utility (Independence Axiom) and that every social state is indifferent to some personal lottery — a strictly stronger set of conditions.&lt;/p&gt;
&lt;p&gt;Q: What is the central theoretical question and why does it matter?
A: The paper asks whether preferences over risky social choices (lotteries over outcomes for self and others) can be deduced from observing only riskless social choices and risky personal choices. This matters because people frequently observe or predict the risky social choices of leaders and representatives, but may have access only to those leaders&amp;rsquo; personal risk-taking behavior and their expressed social preferences under certainty.&lt;/p&gt;
&lt;p&gt;Q: What is the main theoretical result (Theorem 1)?
A: Under Completeness, Transitivity, Continuity, and State Monotonicity, the unique extension of the partial preference relation (over social states and personal lotteries) to the full domain of social lotteries exists if and only if every social state is indifferent to some personal state. When this condition is not met, multiple distinct preference relations can extend the partial observations, making deduction impossible.&lt;/p&gt;
&lt;p&gt;Q: What is State Monotonicity and how does it relate to standard axioms?
A: State Monotonicity requires that if each social state in one lottery dominates the corresponding state in another lottery, then the first lottery is weakly preferred. The paper shows this is equivalent to respect for First-Order Stochastic Dominance (FOSD) given the other axioms, and is strictly weaker than the von Neumann–Morgenstern Independence Axiom. It is satisfied by Weighted Expected Utility, Rank-Dependent Utility, and Prospect Theory, making it a broadly applicable assumption.&lt;/p&gt;
&lt;p&gt;Q: What are the testable predictions for selfish subjects?
A: Proposition 2 establishes that if a subject&amp;rsquo;s Social Choice preferences are selfish — meaning any bundle (x, y) is indifferent to (0, y), so the subject is indifferent between keeping x for self and giving it to other — then preferences in the Personal Risk domain must coincide with preferences in the Social Risk domain. In the experiment, selfish subjects are those allocating more than 95% of tokens to themselves in the Social Choice domain (103 of 276 subjects, or 37.3%).&lt;/p&gt;
&lt;p&gt;Q: What are the testable predictions for impartial subjects?
A: Proposition 3 establishes that if a subject&amp;rsquo;s Social Choice preferences are symmetric — meaning (x, y) is indifferent to (y, x) for all pairs — then preferences in the Social Choice domain must coincide with preferences in the Social Risk domain, implying risk neutrality toward social lotteries. The intuition is that such a subject treats self and other identically, so risky splits are evaluated by expected value alone. In the experiment, 33 subjects (12.0%) are classified as impartial by the revealed-preference criterion at the 5% significance level.&lt;/p&gt;
&lt;p&gt;Q: How does the experiment measure within-domain rationality?
A: Choices within each domain are evaluated using the Critical Cost Efficiency Index (CCEI, following Afriat 1967), which measures how much a budget constraint must be relaxed to remove all GARP violations. Mean CCEIs are 0.959 (Personal Risk), 0.952 (Social Choice), and 0.902 (Social Risk). At the CCEI &amp;gt; 0.90 threshold, 248 subjects (89.9%), 237 (85.9%), and 193 (69.9%) pass in the three domains respectively, compared to a simulated mean CCEI of only 0.585 for subjects randomizing uniformly.&lt;/p&gt;
&lt;p&gt;Q: How does the cross-domain test work and why is it nonparametric?
A: The test uses individual-level permutation inference: under the null that preferences in domains I and J are identical, any 50-element subset drawn from the pooled 100 choices should satisfy GARP as well as the actual domain-specific choices. For each subject, 10,000 such random draws are generated, their CCEI scores are computed, and the distribution is compared to the actual cross-domain CCEI with Bonferroni correction. The test makes no functional form assumptions about utility and accommodates the observed within-domain errors without parametric error modeling.&lt;/p&gt;
&lt;p&gt;Q: What are the rejection rates for the selfish-subject prediction?
A: At the 1% significance level, the null that Personal Risk and Social Risk preferences coincide is rejected for only 5.9%–9.3% of selfish subjects (range across four classification thresholds from 0.99 to 0.90 share-to-self), compared to 14.7%–16.3% for non-selfish subjects. At the 5% level, rejection rates rise to 20.4%–25.6% for selfish and 22.4%–31.8% for non-selfish subjects.&lt;/p&gt;
&lt;p&gt;Q: What are the rejection rates for the impartial-subject prediction?
A: At the 1% significance level, the null that Social Choice and Social Risk preferences coincide is rejected for 0.0%–11.1% of impartial subjects (range depending on threshold and classification method), compared to 19.8%–26.8% for non-impartial subjects. At the 5% and 10% levels, rejection rates for impartial subjects range from 0.0% to 22.2%.&lt;/p&gt;
&lt;p&gt;Q: Does the theory predict how risk aversion should map across domains for non-selfish, non-impartial subjects?
A: The theory does not directly produce testable cross-domain predictions for subjects who are neither selfish nor impartial without additional parametric assumptions, because the specific personal-state equivalent of each social state depends on the form of preferences. The paper restricts its nonparametric tests to the two polar cases where the equivalence mapping is determinate from social choice behavior alone.&lt;/p&gt;
&lt;p&gt;Q: What is the extended result (Theorem 2) and what stronger conditions does it require?
A: When one additionally observes comparisons between social states and personal lotteries (not just within each domain separately), unique deduction of the full preference relation is possible if and only if preferences in both the personal and social domains are consistent with an Expected Utility representation and every social state is indifferent to some personal lottery. This requires the Independence Axiom — a strictly stronger condition than State Monotonicity — highlighting that the main Theorem 1 result exploits the weaker observational structure.&lt;/p&gt;
&lt;p&gt;Q: What is the distribution of social preferences in the sample?
A: Of 276 subjects, 103 (37.3%) are classified as selfish at the 0.95 share-to-self threshold. Only 6 subjects (2.2%) kept fewer than 0.45 of tokens on average, making purely altruistic subjects rare. In the Personal Risk domain, 41 subjects (14.9%) allocated more than 95% to the cheaper account (consistent with risk neutrality), while 9 (3.3%) allocated fewer than 55% (consistent with infinite risk aversion). In the Social Risk domain, 30 subjects (10.9%) are consistent with utilitarianism in money and 9 (3.3%) with Rawlsianism in money.&lt;/p&gt;
&lt;p&gt;Q: How does the Social Risk domain compare to the Personal Risk and Social Choice domains in terms of rationality scores?
A: The Social Risk domain shows lower consistency than the other two: mean CCEI is 0.902 versus 0.959 and 0.952, and only 69.9% of subjects exceed the 0.90 threshold versus 89.9% and 85.9%. The CCEI distribution is shifted left for Social Risk, suggesting the novel combined dimension of social and risky choice introduces more decision complexity or error.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship to the prior experimental literature on social and risk preferences?
A: The Personal Risk domain replicates the symmetric risk experiment of Choi et al. (2007a), and the Social Choice domain replicates the linear two-person dictator experiment of Fisman et al. (2007). The Social Risk domain is new to this paper. The theoretical framework connects to Saito (2013) on social preferences under risk, and to the preference extension literature of Grant et al. (1992) and Nishimura et al. (2017).&lt;/p&gt;
&lt;p&gt;State Monotonicity: The axiom requiring that if each social state in one lottery weakly dominates the corresponding social state in another lottery, the first lottery is weakly preferred. The paper proves this is equivalent to respect for First-Order Stochastic Dominance given Completeness, Transitivity, and Continuity, and distinguishes it from the stronger Independence Axiom by noting that Independence compares lotteries over lotteries while State Monotonicity only compares lotteries over states.&lt;/p&gt;
&lt;p&gt;Selfish preferences (in the paper&amp;rsquo;s sense): Preferences in the Social Choice domain such that (x, y) is indifferent to (0, y) for all bundles — the subject is indifferent between receiving x themselves versus giving x to the other person. Operationally measured as allocating more than a threshold share (e.g., 95%) of tokens to self across Social Choice decisions.&lt;/p&gt;
&lt;p&gt;Impartial preferences (in the paper&amp;rsquo;s sense): Preferences in the Social Choice domain such that (x, y) is indifferent to (y, x) for all bundles — the subject treats self and other symmetrically. Operationally identified by the revealed preference criterion that choices in the Social Choice domain satisfy GARP and are consistent with symmetric treatment.&lt;/p&gt;
&lt;p&gt;Unique extension (deducibility): The property that there exists exactly one complete preference relation over all social lotteries that is consistent with the axioms and agrees with the observed partial relation over social states and personal lotteries. Theorem 1 identifies the necessary and sufficient condition for unique extension under State Monotonicity.&lt;/p&gt;
&lt;p&gt;Personal state indifference condition: The condition that for every social state omega in Omega minus P, there exists some personal state in P to which the DM is indifferent. This is the necessary and sufficient condition in Theorem 1 for deducibility of the full preference relation. Interpreted as: for every proposed social allocation, there exists a &amp;ldquo;bribe&amp;rdquo; — a personal allocation with nothing for others — that the DM finds equally desirable.&lt;/p&gt;
&lt;p&gt;Critical Cost Efficiency Index (CCEI): A measure of how much budget constraints must be scaled down to eliminate all GARP violations in a dataset of choices from budget lines (following Afriat 1967). A CCEI of 1 indicates perfect rationality; the paper uses 0.90 as a practical threshold. Mean values are 0.959, 0.952, and 0.902 in the Personal Risk, Social Choice, and Social Risk domains respectively.&lt;/p&gt;
&lt;p&gt;Nonparametric permutation test: The individual-level test used to assess consistency across choice domains. Under the null that preferences are identical in domains I and J, any random 50-element draw from the pooled 100 choices should achieve CCEI scores no worse than the actual domain scores. The test draws 10,000 permuted datasets per subject and uses the Bonferroni correction for multiple comparisons, making no assumptions about the functional form of utility.&lt;/p&gt;</description></item><item><title>Liquidity Traps, Prudential Policies, and International Spillovers</title><link>https://macropaperwarehouse.com/papers/liquidity-traps-prudential-policies-and-international-spillovers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/liquidity-traps-prudential-policies-and-international-spillovers/</guid><description>&lt;p&gt;The paper develops a tractable open-economy New Keynesian model with nominal rigidities and an occasionally binding zero lower bound (ZLB) to study how monetary policy and macroprudential policy (modeled as a tax on capital flows) jointly transmit to output, capital flows, and the exchange rate, and what this implies for international spillovers and global welfare. An analytical decomposition identifies three transmission channels — intertemporal substitution, expenditure switching, and aggregate income — and the calibration finds that capital controls operate almost entirely through intertemporal substitution (about 95%), whereas expenditure switching accounts for roughly a quarter to a third of the effect of monetary policy. On the normative side, the authors show that, absent capital controls, monetary policy faces a tradeoff between stabilizing output today and curbing capital flows to lower the likelihood of a future liquidity trap, but that &amp;rsquo;leaning against the wind&amp;rsquo; (pre-emptively raising rates) is not necessarily optimal and can be counterproductive when tradables and non-tradables are highly substitutable. Quantitatively, adding capital controls lowers the average unemployment rate conditional on a liquidity trap from about 6% to about 1.5% and cuts the unconditional welfare cost of liquidity traps from about 0.4% to about 0.1% of permanent consumption, with an average ex-ante tax on inflows of about 0.2% and an average ex-post tax on outflows of about -0.05%. Finally, contrary to &amp;lsquo;currency war&amp;rsquo; concerns, the authors argue that capital controls are not beggar-thy-neighbor: a country can use them to insulate itself from adverse foreign-policy spillovers (which operate through the world real interest rate), and coordination is beneficial only during a liquidity trap and works by stimulating rather than restricting flows. All results hold within their small-open-economy model under its calibration.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-model-and-which-policies-does-it-study"&gt;Q1. What is the model, and which policies does it study?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper studies an infinite-horizon small open economy with nominal rigidities and an occasionally binding zero lower bound on the nominal interest rate, in which the government has two instruments — the nominal interest rate (monetary policy) and a tax on capital flows (macroprudential policy).&lt;/strong&gt; The economy has a tradable final good and a non-tradable good with sticky prices, and features aggregate demand externalities. The authors use this setting to ask three questions: how interrelated are the transmission channels of the two policies; how should monetary policy be used jointly with macroprudential policy; and what happens to global welfare when many countries adopt prudential policies simultaneously.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-three-transmission-channels-and-how-much-does-each-matter"&gt;Q2. What are the three transmission channels, and how much does each matter?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;An analytical decomposition (extending Kaplan, Moll and Violante 2018 and Auclert 2019 to an open economy) identifies three channels — intertemporal substitution, expenditure switching, and aggregate income — and the calibration shows monetary policy and capital controls operate through very different channels.&lt;/strong&gt; The intertemporal substitution channel accounts for about 95% of the effect of capital controls, while expenditure switching (operating through exchange-rate depreciation that shifts demand toward non-tradables) accounts for a substantial share of the effect of monetary policy — the paper states &amp;lsquo;about one-third&amp;rsquo; in its introduction and &amp;lsquo;about one-quarter&amp;rsquo; in its conclusion. The expenditure-switching channel and the role of the exchange rate are what distinguish the open-economy decomposition from its closed-economy antecedents.&lt;/p&gt;
&lt;h3 id="q3-do-open-capital-markets-amplify-or-dampen-monetary-policy"&gt;Q3. Do open capital markets amplify or dampen monetary policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Capital flows may either amplify or attenuate the output effects of monetary policy, depending on the relative sizes of the elasticity of substitution over time and the elasticity across sectors.&lt;/strong&gt; If the intertemporal elasticity exceeds the intratemporal one, an open capital account amplifies monetary policy (a monetary expansion raises total consumption more than output, so households borrow from abroad); the result reverses when the intratemporal elasticity is larger, in which case a closed capital account produces the larger output expansion.&lt;/p&gt;
&lt;h3 id="q4-is-leaning-against-the-wind-the-optimal-prudential-use-of-monetary-policy"&gt;Q4. Is &amp;rsquo;leaning against the wind&amp;rsquo; the optimal prudential use of monetary policy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Contrary to a widespread policy view, leaning against the wind is not necessarily optimal: when the elasticity of substitution across sectors is higher than across time, raising the interest rate ahead of a liquidity trap can be counterproductive.&lt;/strong&gt; In that case a rate hike generates a large negative expenditure-switching effect and a sharp income drop while only modestly reducing consumption, so in general equilibrium it leads to capital inflows and more external debt — exacerbating the aggregate demand externality and making a future contraction more likely. The implication is that a prudential monetary policy may require lowering, not raising, the interest rate ahead of a liquidity trap.&lt;/p&gt;
&lt;h3 id="q5-how-should-monetary-and-macroprudential-policy-be-combined-and-how-pre-emptively"&gt;Q5. How should monetary and macroprudential policy be combined, and how pre-emptively?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When capital controls are available, the central bank uses monetary policy to stabilize output and uses the capital-flow tax to manage flows, with the macroprudential tax on debt positive only if the ZLB is likely to bind next period; monetary policy, by contrast, must be used prudentially even when the ZLB binds only in some distant future.&lt;/strong&gt; Because monetary policy is a blunter instrument, it has to be used more pre-emptively than capital controls. The authors also show the central bank may restrict outflows during a liquidity trap when that trap is either temporary or very severe.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-quantitative-welfare-and-unemployment-gains-from-capital-controls"&gt;Q6. What are the quantitative welfare and unemployment gains from capital controls?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Adding capital controls substantially improves macroeconomic stabilization: average unemployment conditional on a liquidity trap falls from about 6% to about 1.5%, and the unconditional welfare cost of liquidity traps falls from about 0.4% to about 0.1% of permanent consumption — more than a fourfold reduction.&lt;/strong&gt; The average ex-ante prudential tax on inflows is about 0.2% and the average ex-post tax on outflows is about -0.05%. The authors also note that, with capital controls, liquidity traps are less frequent and less severe but — perhaps surprisingly — tend to last longer.&lt;/p&gt;
&lt;h3 id="q7-are-capital-controls-beggar-thy-neighbor-and-how-do-international-spillovers-work"&gt;Q7. Are capital controls beggar-thy-neighbor, and how do international spillovers work?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors argue that, contrary to emerging policy concerns, capital controls are not beggar-thy-neighbor and can enhance global macroeconomic stability; international spillovers operate through the world real interest rate, and a country can use capital controls to insulate itself from adverse foreign policies.&lt;/strong&gt; In their multi-country extension, a country can remain insulated from negative spillovers of a change in the foreign monetary stance through capital controls, which can help prevent the outbreak of a currency war.&lt;/p&gt;
&lt;h3 id="q8-when-is-international-policy-coordination-desirable"&gt;Q8. When is international policy coordination desirable?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The authors provide conditions under which a regime of uncoordinated capital controls can dominate laissez-faire, and they find that coordination is desirable only during a liquidity trap — where, notably, it calls for stimulating capital flows rather than preventing them.&lt;/strong&gt; This stands against the view that uncoordinated capital-control policies necessarily produce a global paradox of thrift.&lt;/p&gt;
&lt;h3 id="q9-how-do-these-results-differ-from-prior-open-economy-liquidity-trap-models"&gt;Q9. How do these results differ from prior open-economy liquidity-trap models?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper&amp;rsquo;s more benign view of spillovers contrasts with contributions such as Caballero, Farhi and Gourinchas (2021), Eggertsson et al. (2016), and Fornaro and Romei (2019), and the authors trace the difference to two features of their model: positive liquidity and the presence of ex-post capital controls.&lt;/strong&gt; Because goods subject to nominal rigidities are consumed only domestically, foreign policies that favor savings (lowering the world interest rate) raise demand for domestic goods through asset markets and can be stabilizing at the ZLB; and ex-post controls let the central bank actively manage flows during a trap to offset adverse spillovers.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;aggregate demand externality&lt;/strong&gt; : the externality (as in Schmitt-Grohe and Uribe 2016 and Farhi and Werning 2016) by which an individual agent&amp;rsquo;s borrowing raises external debt and, given nominal rigidities and the ZLB, makes the economy more vulnerable to a future demand-driven contraction; it is the market failure that prudential policy targets in this model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;expenditure switching channel&lt;/strong&gt; : the open-economy transmission channel through which an exchange-rate depreciation makes non-tradables relatively cheaper, shifting demand toward domestically produced goods; the paper finds it accounts for a substantial share (roughly a quarter to a third) of monetary policy&amp;rsquo;s effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;intertemporal substitution channel&lt;/strong&gt; : the channel through which a change in the intertemporal price shifts consumption between present and future; it accounts for about 95% of the effect of capital controls in the calibration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;liquidity trap / occasionally binding ZLB&lt;/strong&gt; : a state in which the zero lower bound on the nominal interest rate binds, so conventional monetary policy cannot stabilize output; the risk of entering such a state in the future is what makes pre-emptive prudential policy valuable here.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;capital controls (prudential tax on flows)&lt;/strong&gt; : the macroprudential instrument in the model — a tax on capital inflows (ex ante) or outflows (ex post) — used to manage the level and timing of capital flows and to insulate the economy from foreign spillovers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;beggar-thy-neighbor&lt;/strong&gt; : a policy that improves one country&amp;rsquo;s outcomes at others&amp;rsquo; expense; the paper argues capital controls are, contrary to common concern, not beggar-thy-neighbor in its setting and can raise global stability.&lt;/p&gt;</description></item><item><title>Lives Versus Livelihoods: The Impact of the Great Recession on Mortality and Welfare</title><link>https://macropaperwarehouse.com/papers/lives-versus-livelihoods-the-impact-of-the-great-recession-on-mortality-and-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/lives-versus-livelihoods-the-impact-of-the-great-recession-on-mortality-and-welfare/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does the Great Recession reduce or increase mortality, and what are the welfare implications of incorporating recession-induced mortality changes into standard macroeconomic welfare frameworks?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Identification.&lt;/strong&gt; The authors exploit spatial variation in the severity of the 2007–2009 Great Recession across 741 U.S. Commuting Zones (CZs), following the empirical design of Yagan (2019). The primary shock variable is the percentage-point change in the CZ unemployment rate between 2007 and 2009. The key identifying assumption is that no concurrent shocks to mortality coincide with the timing and geographic pattern of the Great Recession shock. Pre-trend evidence supports this: CZs subsequently harder hit experienced a slight relative &lt;em&gt;increase&lt;/em&gt; in mortality before 2007, which is the opposite sign from the main effect, supporting the validity of the design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; Mortality data come from CDC restricted-use death certificate microdata (2003–2016) covering the universe of U.S. deaths, combined with SEER population denominators. A 20 percent random sample of Medicare enrollees aged 65–99 provides an individual-level panel that directly addresses concerns about endogenous migration. The main outcome is the log age-adjusted CZ mortality rate; economic indicators come from BLS, BEA, and FHFA; air pollution data from the EPA AQS monitor network (PM2.5); morbidity from the BRFSS; nursing home characteristics from federal certification inspections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Mortality Finding.&lt;/strong&gt; A one-percentage-point increase in the local unemployment rate between 2007 and 2009 is associated with a 0.50 percent decline (SE = 0.15) in the annual age-adjusted mortality rate in 2007–2009, and a 0.58 percent decline (SE = 0.34) in 2010–2016; the two periods are statistically indistinguishable (p = 0.78). Because the national average unemployment rate rose by 4.6 percentage points, the Great Recession on average reduced the annual age-adjusted mortality rate by approximately 2.3 percent, with effects persisting for at least 10 years. The authors note this is equivalent to approximately two years of secular mortality improvement at the pre-recession trend pace of 1.1 percent per year. For a 55-year-old, the estimates imply that 1 in 25 gained an extra year of life from a shock of this magnitude.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Cause of Death.&lt;/strong&gt; Mortality declines appear across most major causes. Cardiovascular disease (34 percent of 2006 deaths) declines by 0.65 percent per percentage-point unemployment increase (SE = 0.21) and accounts for approximately 48 percent of the total estimated mortality reduction. Motor vehicle mortality falls by 1.7 percent (SE = 0.56) and liver disease by 1.1 percent (SE = 0.43). Suicides show a statistically significant 1.7 percent decline (SE = 0.5) in the 2010–2016 period. The notable exception is cancer (the second-largest cause of death), for which the estimated effect is a precise null of 0.02 percent (SE = 0.11). The null cancer result is interpreted as a specification check: if mortality declines were spurious (e.g., driven by population mismeasurement), cancer mortality should also decline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by Demographics.&lt;/strong&gt; Recession-induced mortality declines are similar in percentage terms across gender and race/ethnicity, and statistically equi-proportional across age groups (p-value for equality across 25–64 versus 65+: 0.76). Because mortality is heavily concentrated in the elderly, those aged 65 and over account for approximately 74.3 percent of averted deaths, roughly proportional to their 72.5 percent share of 2006 mortality. The most striking heterogeneity is by education: the entire mortality decline is concentrated among the approximately 52 percent of the population with a high school degree or less. The estimated 2007-2016 effect is −1.3 percent per percentage-point unemployment increase (SE = 0.56) for those with high school or less, compared to +0.34 percent (SE = 0.68) for those with more than high school (statistically distinguishable at p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms.&lt;/strong&gt; The authors distinguish internal effects (own reduced employment or consumption improving health) from external effects (externalities from reduced aggregate economic activity, holding own employment/consumption fixed). Evidence strongly favors external effects as the primary driver. Three-quarters of averted deaths accrue to the elderly, who experienced no direct income effects from the labor market shock. Moreover, the timing pattern—an immediate mortality drop that does not grow over time—is inconsistent with health-behavior channels (e.g., smoking cessation, improved diet) that would build up gradually. Direct tests find no statistically significant impact on self-reported health behaviors (smoking, drinking, exercise) and no impact on healthcare use among Medicare enrollees.&lt;/p&gt;
&lt;p&gt;Among external channels, neither reduced spread of infectious disease nor improved nursing home staffing receives empirical support. Reduced air pollution (PM2.5) is identified as a quantitatively important channel. A one-percentage-point increase in CZ unemployment is associated with a 0.16 µg/m³ decline in PM2.5 (SE = 0.04), a 1.3 percent decline relative to the 2006 national average of 12 µg/m³. A mediation analysis (controlling for the PM2.5 shock) attenuates the estimated mortality effect by 37 percent, from −0.52 percent to −0.33 percent per percentage-point unemployment increase. Back-of-the-envelope calculations combining the PM2.5 decline with external estimates of PM2.5-mortality elasticities suggest pollution can explain 17 to 35 percent of total recession-induced mortality declines.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lag Structure.&lt;/strong&gt; Exploiting variation in the speed of post-recession labor market recovery (measured by 2010–2016 EPOP ratio changes) conditional on the initial shock, the authors find that mortality reductions persist in areas that have fully recovered economically by 2016, suggesting lagged mortality effects of the initial economic downturn beyond what contemporaneous economic conditions alone explain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare Analysis.&lt;/strong&gt; The authors extend the Krebs (2007) consumption-based welfare cost-of-recessions model to incorporate endogenous mortality. For a 45-year-old with γ = 2 and a value of a statistical life-year (VSLY) of $250k (five times annual consumption), accounting for endogenous mortality reduces the willingness to pay to avoid all future recessions from 2.00 percent of average annual consumption to 0.91 percent—a reduction of approximately 55 percent. Starting around age 55, recessions become welfare-improving on net. For the Great Recession specifically, at age 55 endogenous mortality reduces the welfare cost by approximately 25 percent (from 2.39 to 1.80 percent of average annual consumption). Because mortality declines are concentrated among those with high school or less, accounting for endogenous mortality also substantially mitigates—and at older ages reverses—the finding that the Great Recession was more costly for the less educated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Caveats.&lt;/strong&gt; (i) The design captures only differential local effects, not nationwide impacts (e.g., stock market collapse, nationwide malaise). (ii) Mortality impacts may not generalize to milder recessions, though the relationship appears approximately linear in shock size. (iii) The analysis excludes morbidity, though limited evidence suggests morbidity is also pro-cyclical and roughly equi-proportional across ages. (iv) The welfare analysis begins at age 35 and does not account for longer-run mortality costs of recession entry for younger cohorts.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the baseline empirical specification, and why does the design exploit cross-sectional variation rather than time-series panel regressions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The estimating equation regresses the log age-adjusted CZ mortality rate on an interaction of the CZ-level Great Recession shock (2007–2009 unemployment change) with year indicators, plus CZ and year fixed effects, weighted by 2006 CZ population. The authors prefer this to the standard two-way fixed effects panel approach (area and year FE with contemporaneous unemployment rate) for three reasons: (1) it directly identifies the full dynamic lag structure of the shock rather than imposing contemporaneity; (2) exploiting a single spatially differentiated shock reduces risk of confounding from other concurrent area-level shocks; (3) the panel can be linked to individual-level Medicare data, allowing explicit control for endogenous migration, which the existing literature cannot do.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the paper address the concern that mortality rate declines might simply reflect unmeasured population outflows from hard-hit areas rather than genuine reductions in deaths?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors offer two main responses. First, cancer mortality shows a precise null effect despite being the second-leading cause of death; if unmeasured population losses were driving the results, cancer deaths should decline proportionally. Second, using the Medicare individual-level panel, they fix each enrollee&amp;rsquo;s location at their 2003 CZ and find a statistically significant mortality decline of 0.35 percent per percentage-point unemployment increase in the reduced-form (2007–2009 period). A control function approach that instruments current-year location with 2003 location yields an estimate of −0.37 percent (SE = 0.17), similar to the baseline −0.50 percent from the aggregate specification, confirming that migration bias is not the primary driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How long do the mortality reductions from the Great Recession persist, and does the paper identify whether these are contemporaneous or lagged effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The 2007–2009 period estimate is −0.50 percent per percentage-point unemployment increase and the 2010–2016 period estimate is −0.58 percent, and these are statistically indistinguishable (p = 0.78). To identify whether persistence reflects ongoing economic effects or true lagged mortality effects, the authors compare CZs with above- vs. below-median 2010–2016 EPOP recovery (conditional on initial shock decile). Both groups show similar 2010–2016 mortality declines despite the above-median recovery CZs having returned to pre-recession employment levels by 2016. This finding is consistent with lagged mortality effects of the initial economic downturn that persist independently of current economic conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Are mortality reductions concentrated among individuals already near death (&amp;ldquo;harvesting&amp;rdquo;), or do they represent meaningful longevity gains?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use a Medicare auxiliary model to predict counterfactual remaining life expectancy for each enrollee based on age, demographics, and chronic conditions. The marginal life saved has only about 6 percent lower counterfactual remaining life expectancy than a typical decedent of the same age, and this difference is statistically insignificant. Because effects persist over 10 years (not just days or weeks), short-run mortality displacement (harvesting) is not the operative concern. The 6 percent difference is also small enough that the authors do not adjust their welfare analysis for it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the educational gradient in mortality impacts, and is it explained by age composition or other confounders?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mortality declines are entirely concentrated among those with a high school degree or less: the 2007–2016 estimate is −1.3 percent per percentage-point unemployment increase (SE = 0.56) for this group versus +0.34 percent (SE = 0.68) for those with more than high school, distinguishable at p &amp;lt; 0.01. This gradient holds within age groups (confirmed in Appendix analysis), and further disaggregation shows no mortality declines for those with some college or college-or-more separately. In Medicare data, the elderly mortality effect is concentrated among the approximately 12 percent enrolled in Medicaid (a proxy for low income), reinforcing the socioeconomic concentration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What evidence rules out improved health behaviors (increased exercise, reduced smoking, reduced alcohol) as the main mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two types of evidence argue against this channel. First, three-quarters of averted deaths are among the elderly, who experienced no direct income or employment effects from the local labor market shock and would not plausibly change their health behaviors in response to someone else losing employment. Second, the mortality decline is immediate in 2007 and flat through 2016 rather than growing over time; smoking cessation, for example, takes 10–15 years to accumulate mortality effects. Direct tests of behavioral outcomes from BRFSS find no statistically significant impact on smoking, drinking, exercise, or flu vaccination rates, individually or pooled. The pooled average treatment effect on six morbidity measures is statistically significant and negative (suggesting morbidity improvements), but behavioral covariates show no movement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the evidence for and against improved nursing home care as a mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior literature (Stevens et al. 2015; Konetzka et al. 2018; Antwi and Bowblis 2018) documents that recessions increase nursing home staffing and reduce nursing home deaths in earlier decades. However, the authors find no evidence for this channel in the Great Recession context. Estimated mortality impacts are virtually identical (approximately 0.5 percent per percentage-point unemployment increase) for the 7 percent of the elderly in nursing home care and the 93 percent not in nursing home care. Direct measures of nursing home staffing (direct-care staff hours per resident-day, highly skilled nurses ratio) show no statistically significant change in harder-hit areas: the point estimate for direct-care hours is −0.11 percent (SE = 0.22) in 2007–2009. Nursing home occupancy rates and resident characteristics also show no significant changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How is the quantitative importance of the air pollution channel estimated, and what are the two complementary approaches used?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Approach 1 (back-of-the-envelope): The authors combine their estimate that a one-percentage-point unemployment increase reduces PM2.5 by 0.16 µg/m³ with external estimates from Deryugina et al. (2019) of PM2.5&amp;rsquo;s effect on elderly daily mortality, rescaled to annual exposure. This calculation implies pollution explains 17–35 percent of total recession-induced mortality declines, depending on which Deryugina et al. mortality estimates are used. Approach 2 (mediation analysis): Adding the county-level PM2.5 shock as an additional control in the mortality regression attenuates the Great Recession mortality coefficient from −0.52 percent to −0.33 percent per percentage-point unemployment increase—a 37 percent attenuation. Both approaches are suggestive rather than definitive, as the mediation analysis requires the strong assumption that the recession shock and PM2.5 shock are conditionally independent of other unmeasured mediators.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the specific calibration parameters in the welfare model and how does the paper set the mortality decline parameter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors extend Krebs (2007)&amp;rsquo;s income process calibration (pH = 0.03, pL = 0.05, dH = 0.09, dL = 0.21, g = 0.02, σ = 0.01, πH = 0.5) and use 2007 SSA life tables for age-specific mortality rates in normal times. The recession mortality parameter is set to dm = −0.015 for all ages, derived from a 3.1 percentage-point unemployment increase in a typical recession multiplied by the estimated 0.5 percent mortality decline per percentage-point. VSLY values are parameterized at two, five, or eight times annual consumption ($100k, $250k, or $400k at $50k annual consumption). Risk aversion γ takes values 1.5, 2, and 2.5. For the Great Recession-specific exercise, dmA = −0.023 (4.6 × 0.5 percent), dmHS = −0.037, and dmC = 0.0006.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does accounting for endogenous mortality change the distributional welfare analysis of the Great Recession by education group?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under exogenous mortality, the welfare cost of the Great Recession at age 35 is 2.89 percent of average annual consumption for those with high school or less versus 1.23 percent for those with more than high school—the less educated bear roughly twice the burden. Under endogenous mortality, the mortality declines are concentrated entirely among the less educated (dmHS = −0.037 vs. dmC ≈ 0), so accounting for mortality disproportionately offsets welfare losses for that group. By around age 65, the welfare costs of the Great Recession converge across education groups, and after age 65, the less educated bear &lt;em&gt;lower&lt;/em&gt; welfare costs than the more educated, reversing the exogenous-mortality ranking. This result depends on the same education differential in mortality impacts that drives the main empirical finding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What robustness checks demonstrate that the baseline mortality estimates are not driven by geographic or functional-form choices?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline CZ-level estimate of −0.50 percent (SE = 0.15) is replicated almost exactly at the state level (−0.62, SE = 0.25) and county level (−0.49, SE = 0.10). A Poisson regression yields −0.45 percent (SE = 0.14). Dropping the top/bottom decile of CZs by shock size yields −0.46 percent (SE = 0.16). Adding Census-division-by-year fixed effects attenuates the estimate slightly to −0.38 percent (SE = 0.14) but retains statistical significance. Dropping CZs with high fracking activity and dropping the ten most populous CZs both produce estimates similar to baseline. Quartile regressions show monotone mortality reductions across quartiles of the unemployment shock, consistent with approximate linearity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What does the expert survey reveal about prior beliefs, and how does the paper&amp;rsquo;s finding compare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a spring 2023 survey of over 300 experts, 50 percent predicted the Great Recession would &lt;em&gt;increase&lt;/em&gt; mortality and only 27 percent predicted a decrease. Of those predicting a decrease, 93 percent gave a magnitude larger (in absolute value) than the paper&amp;rsquo;s negative point estimate of 0.50 percent per percentage-point unemployment increase, and 82 percent gave a prediction larger than the upper bound of the 95 percent confidence interval. This illustrates that the paper&amp;rsquo;s finding—mortality is meaningfully pro-cyclical during the Great Recession—was highly surprising to the empirical and policy economics community.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Pro-cyclical mortality&lt;/strong&gt;: The phenomenon whereby mortality rates fall during economic downturns and rise during expansions. The paper documents this for the Great Recession using a spatial identification strategy, in contrast to the time-series correlation that had weakened in the two decades before the Great Recession. The term &amp;ldquo;pro-cyclical&amp;rdquo; means mortality moves in the same direction as the business cycle (up in booms, down in recessions), implying recessions are associated with fewer deaths.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal vs. external effects (of recessions on mortality)&lt;/strong&gt;: The paper distinguishes internal effects—whereby an individual&amp;rsquo;s own reduced employment or consumption affects her own mortality—from external effects, which are changes in mortality from reduced aggregate economic activity that hold constant one&amp;rsquo;s own employment and consumption. This distinction has direct welfare implications: external effects (e.g., less pollution from lower industrial output) are genuine welfare improvements for people who did not lose income, while internal effects of behavioral change are mitigated by the envelope theorem if behavior is privately optimal.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Commuting Zone (CZ) shock&lt;/strong&gt;: The paper&amp;rsquo;s primary treatment variable, defined as the percentage-point change in the CZ unemployment rate between 2007 and 2009. CZs are aggregations of counties (741 total) designed to approximate local labor markets. The median CZ experienced a 4.6-percentage-point increase, with substantial variation ranging from roughly 2.9 points (bottom quartile) to 6.7 points (top quartile).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Value of a Statistical Life-Year (VSLY)&lt;/strong&gt;: The dollar value placed on one additional year of life in expectation, used in the welfare calibration. In the paper&amp;rsquo;s framework it equals VSLY = bcγ − c/(γ−1), where b is a preference parameter governing the marginal utility of life-years. Results are reported for VSLYs of $100k, $250k, and $400k corresponding to two, five, and eight times average annual consumption of $50k, following Hall and Jones (2007).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous mortality in welfare analysis&lt;/strong&gt;: The paper&amp;rsquo;s central theoretical contribution is augmenting the Krebs (2007) welfare cost-of-recessions framework to allow mortality to vary with the aggregate state of the economy. When mortality is endogenously lower in recessions, the willingness to pay to eliminate recession risk falls—and at high enough VSLY or old enough ages, recessions become welfare-improving because the mortality benefit outweighs the consumption cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mortality displacement (harvesting)&lt;/strong&gt;: The possibility that short-run mortality declines merely reflect the premature death of already-frail individuals being slightly delayed, without meaningful longevity gains. The paper argues this is not the operative concern given 10-year persistence and uses auxiliary Medicare models to show marginal lives saved have only 6 percent shorter counterfactual life expectancy than average decedents of the same age.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PM2.5 mediation analysis&lt;/strong&gt;: An empirical approach in which the county-level change in fine particulate matter (PM2.5, in µg/m³) between 2006 and 2010 is added as a covariate in the mortality regression. Under the assumption that the recession shock and the PM2.5 shock are conditionally independent of other unmeasured mediators, the attenuation in the recession-mortality coefficient when controlling for PM2.5 identifies the share of the mortality effect operating through the pollution channel. A 37 percent attenuation is found in the 2007–2009 period.&lt;/p&gt;</description></item><item><title>Local Projection-Based Inference under General Conditions</title><link>https://macropaperwarehouse.com/papers/local-projection-based-inference-under-general-conditions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/local-projection-based-inference-under-general-conditions/</guid><description>&lt;p&gt;This paper develops a uniform asymptotic theory for local projection (LP) regression under general conditions, addressing a gap in the literature where existing results required restrictive assumptions about lag order, data persistence, and shock processes. The research question is: how can one conduct valid statistical inference on impulse responses from LP regressions when the true lag order is unknown (possibly infinite), data exhibit arbitrary persistence including unit roots and near-unit roots, horizons are allowed to grow with sample size, and shocks follow general conditionally heteroskedastic martingale difference sequences (MDS)?&lt;/p&gt;
&lt;p&gt;The paper works within a VAR(infinity) data-generating process framework, where the vector autoregression may have an unknown and potentially infinite number of lags. The LP regression truncates this at a chosen model order p, with the truncation bias controlled by tail decay conditions on the VAR coefficients. The theoretical framework accommodates a class of VARMA models as a specific illustration, showing that Assumptions 1 and 2 hold for VARMA(q+1, r) processes when the model lag order p diverges at least as fast as log n.&lt;/p&gt;
&lt;p&gt;The main theoretical result (Theorem 1) establishes uniform asymptotic normality of the LP estimator, simultaneously over: the coefficient parameter space A, model lag orders p in [p_low, p_high], horizons h in [1, h_bar], and configurations of the linear combination vector gamma (covering both individual and cumulated impulse responses). The convergence rate is pi_1(h; gamma)^{-1/2} n^{1/2}, which depends on persistence level and horizon. For an AR(1) process, the individual response rate is (sum_{i=0}^{h-1} a_1^{2i})^{-1/2} n^{1/2} and the cumulative response rate is h^{-3/2} n^{1/2}, which is slower.&lt;/p&gt;
&lt;p&gt;The paper makes two principal contributions. First, LP is shown to be semiparametrically efficient when the controlled lag order diverges. Under classical assumptions (homoskedastic MDS shocks, stationarity, fixed horizon), the LP estimator achieves the same asymptotic distribution as the VAR-implied iterative estimator, and reaches the semiparametric efficiency bound of Chamberlain (1987) under the conditional moment restriction model. Under Gaussianity, LP is asymptotically Cramer-Rao efficient. This extends Plagborg-Moller and Wolf (2021) from distributional equivalence of estimands to equivalence of asymptotic distributions. The commonly held view that LP is inefficient relative to VAR-implied methods holds only under finite small-order VAR models; with a diverging lag order, the efficiency gain from the parsimonious VAR structure vanishes. The alternative LP estimator of Lusompa (2022), shown to be more efficient than standard LP under a known AR(1) model, is likewise shown (Proposition 2) to be asymptotically equivalent to standard LP when a sufficiently large lag order is used (p_u/sqrt(n) -&amp;gt; 0 and sqrt(n)(1-|rho|)^{p_u} -&amp;gt; 0).&lt;/p&gt;
&lt;p&gt;Second, two new standard errors are proposed, neither involving HAR-type correction or bandwidth selection. SE_1 is a White-style heteroskedasticity-robust standard error applied after partialling out controls; it is uniformly consistent under a zero fourth cumulant condition on shocks (e.g., zero excess kurtosis with conditional homoskedasticity), but not for general MDS shocks. SE_2, the paper&amp;rsquo;s main methodological contribution, constructs the variance estimator using martingale-transformed scores: the LP residual Delta_t is projected onto forward residuals (Delta_{t+1}, &amp;hellip;, Delta_{t+h-1}) to partial out serial dependence, recovering the true MDS error xi_{1t}(h; gamma) asymptotically. SE_2 is uniformly consistent for general MDS shocks (Proposition 4) and, under a finite-order VAR DGP, requires only p = p_true lags (rather than p &amp;gt;= p_true + 1 required by SE_1 and HAR-type methods).&lt;/p&gt;
&lt;p&gt;Simulations using univariate ARMA(1,1) models with rho in {0, 0.5, 0.95, 1} and theta in {-0.5, 0, 0.5}, and bivariate VAR(1) models, confirm that SE_2-based 95% confidence intervals maintain coverage close to the nominal level across all cases including unit roots, while SE_1 shows degraded coverage under conditional heteroskedasticity (GARCH). Both outperform MOPM for cumulated responses at longer horizons.&lt;/p&gt;
&lt;p&gt;Scope conditions: the framework accommodates data with unit roots and near-unit roots but not explosive roots or integration of order greater than one (for which differencing is prescribed before applying the LP). The growing-horizon rate condition p^2 h^2 / n -&amp;gt; 0 becomes binding as h grows, requiring h and p to grow at comparable rates or p more slowly. The results are for the VAR framework and do not directly apply to structural (SVAR) identification without additional assumptions.&lt;/p&gt;
&lt;p&gt;Q: What is the central inferential problem that motivates this paper?&lt;/p&gt;
&lt;p&gt;A: Applied macroeconomists estimating impulse responses via LP regressions face a trilemma: the true lag order is unknown and may be infinite, data may be highly persistent or integrated, and shocks may be conditionally heteroskedastic. Existing uniform validity results (chiefly Montiel Olea and Plagborg-Møller 2021) assume a finite and known model order and require mean-independent shocks, leaving inference potentially invalid when these conditions fail. The paper constructs a theory and inference procedures that remain valid simultaneously over all these dimensions.&lt;/p&gt;
&lt;p&gt;Q: What is the VAR(infinity) data-generating process assumed, and what are the key restrictions on it?&lt;/p&gt;
&lt;p&gt;A: The DGP is yt = sum_{j=1}^{infinity} a_j y_{t-j} + u_t, where u_t is serially uncorrelated. Assumption 1 bounds the impulse responses uniformly over the parameter space (ruling out explosive roots and integration of order greater than one). Assumption 2 imposes that the tail coefficients a_j decay fast enough that the truncation bias is asymptotically negligible: the rate condition requires sqrt(n) * p * sum_{j=1}^{infinity} j |a_{p+j}| -&amp;gt; 0, implying p must diverge for infinite-order processes. For VARMA models, p need only diverge as slowly as log n.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 1 establish, and what is the convergence rate?&lt;/p&gt;
&lt;p&gt;A: Theorem 1 establishes uniform asymptotic normality of the LP estimator, with the supremum taken jointly over the coefficient space A, lag orders p in [p_low, p_high], horizons h in [1, h_bar], and the linear combination vector gamma. The convergence rate is pi_1(h; gamma)^{-1/2} n^{1/2}, where pi_1(h; gamma) = sum_{i=1}^{h} |phi_{1i}|^2 captures persistence and horizon effects. For an AR(1) process, the individual response rate is (sum_{i=0}^{h-1} a_1^{2i})^{-1/2} n^{1/2} and the cumulative response rate is the slower h^{-3/2} n^{1/2}.&lt;/p&gt;
&lt;p&gt;Q: In what sense is LP semiparametrically efficient, and under what assumptions?&lt;/p&gt;
&lt;p&gt;A: Under classical assumptions — homoskedastic MDS shocks, stationarity, and fixed horizon — when the controlled lag order p diverges at the appropriate rate, the LP estimator reaches the semiparametric efficiency bound of Chamberlain (1987) under the conditional moment restriction model E(yt - sum a_j y_{t-j} | ys, s &amp;lt;= t-1) = 0. It achieves the same asymptotic distribution as the VAR-implied estimator, which itself has the same distribution as the LP estimator under these conditions (established by extending Lutkepohl 1990). Under Gaussianity, LP is asymptotically Cramer-Rao efficient.&lt;/p&gt;
&lt;p&gt;Q: Why does the efficiency advantage of VAR-implied methods over LP vanish with a large lag order?&lt;/p&gt;
&lt;p&gt;A: Under a finite, small-order VAR model, imposing the functional relationship between all impulse responses and a small set of VAR slope parameters — analogous to dimension reduction in a factor model — yields an efficiency gain for the iterative VAR-implied estimator. However, as the model lag order grows, the number of parameters to estimate grows correspondingly, eroding the dimension-reduction benefit. With a diverging lag order, the extraction of common parameters through a parsimonious model no longer tightens the asymptotic variance of the VAR-implied estimator relative to the direct LP estimator.&lt;/p&gt;
&lt;p&gt;Q: How does SE_2 avoid the need for HAR (heteroskedasticity and autocorrelation robust) bandwidth selection?&lt;/p&gt;
&lt;p&gt;A: The LP regression error Delta_t(h; gamma) is serially correlated for h &amp;gt;= 2 (it contains MA terms of order h-1), which would normally require HAR correction. SE_2 avoids this by constructing the variance estimator from the martingale-transformed score: the LP residual Delta_t is regressed on the forward residuals (Delta_{t+1}, &amp;hellip;, Delta_{t+h-1}) and the fitted residual hat{xi}&lt;em&gt;{1t} is used in place of Delta_t. Asymptotically, hat{xi}&lt;/em&gt;{1t} recovers the true LP(infinity) error xi_{1t}(h; gamma) = sum_{i=1}^{h} phi&amp;rsquo;&lt;em&gt;{1i} u&lt;/em&gt;{t+i}, which is a MDS with respect to {u_t, u_{t-1}, &amp;hellip;}. Since MDS sums have a martingale structure, their variance can be estimated as a simple sum of squares without bandwidth selection.&lt;/p&gt;
&lt;p&gt;Q: Under what condition is SE_1 uniformly consistent, and when does it fail?&lt;/p&gt;
&lt;p&gt;A: SE_1 is the standard White heteroskedasticity-robust variance estimator applied to the partialled-out score. It is uniformly consistent under the zero fourth cumulant condition on shocks — that is, when u_t has zero excess kurtosis and is conditionally homoskedastic. This condition fails for general MDS shocks (e.g., GARCH-type shocks), because the cross-moment Cov((tau&amp;rsquo;w_0)^2, (tau&amp;rsquo;w_k)^2) does not vanish in general. Simulation results confirm that SE_1-based confidence intervals show degraded coverage under GARCH shocks, while SE_2 maintains coverage.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship between this paper and Montiel Olea and Plagborg-Møller (2021)?&lt;/p&gt;
&lt;p&gt;A: Montiel Olea and Plagborg-Møller (2021) (MOPM) established uniform validity of LP inference under a finite-order, known VAR model and required mean-independent (not merely MDS) shocks. The current paper extends MOPM in five dimensions: it allows an unknown and potentially infinite true lag order; allows the controlled lag order to diverge; develops new asymptotic theory for general MDS shocks; proposes SE_2 whose consistency does not require mean-independent shocks; and unifies inference for both individual and cumulated impulse responses. The lag-augmented LP regression of MOPM (setting p = p_true + 1) is a special case of the framework here.&lt;/p&gt;
&lt;p&gt;Q: What does the paper show about the alternative LP estimator of Lusompa (2022)?&lt;/p&gt;
&lt;p&gt;A: Lusompa (2022) showed that, under a known AR(1) model with the true lag order, an alternative LP estimator that exploits the serial dependence structure of the LP error is asymptotically more efficient than standard LP across horizons. Proposition 2 of the current paper shows this efficiency gain does not survive when a sufficiently large lag order is used for the preliminary VAR used to compute the transformation. Specifically, when p_u/sqrt(n) -&amp;gt; 0 and sqrt(n)(1-|rho|)^{p_u} -&amp;gt; 0, the alternative and standard LP estimators are asymptotically equivalent: sqrt(n)[tilde{beta}_1(h) - beta_1(h)] - sqrt(n)[hat{beta}_1(h) - beta_1(h)] = o_p(1). The discrepancy arises from estimation errors in the preliminary residuals entering the asymptotic distribution.&lt;/p&gt;
&lt;p&gt;Q: What are the rate conditions on the lag order p and horizon h, and how do they compare to VAR-implied methods?&lt;/p&gt;
&lt;p&gt;A: Under a fixed horizon, the condition p^2/n -&amp;gt; 0 suffices for LP, which is weaker than the p^3/n -&amp;gt; 0 typically required for VAR-implied methods (the stricter condition arises because VAR-implied methods must estimate all p slope matrices jointly, while LP treats all but the first as nuisance). Under growing horizons (h -&amp;gt; infinity), the rate condition is p^2 h^2/n -&amp;gt; 0, and the analysis shows p = O(h) is sometimes optimal — p and h should grow at the same rate or p more slowly. By contrast, VAR-implied methods require p = o(n^{1/3}/h^{2/3}) under growing horizons.&lt;/p&gt;
&lt;p&gt;Q: What is the lag order flexibility advantage of SE_2 under a finite-order VAR DGP?&lt;/p&gt;
&lt;p&gt;A: When the true DGP is a finite-order VAR(p_true), SE_2 achieves consistent inference using exactly p = p_true lags — the exact order. In contrast, SE_1 and HAR-type standard errors require p &amp;gt;= p_true + 1 (at least one extra lag) because at p = p_true the LP residuals Delta_t(h; gamma) contain MA terms of order h-1 that create serial dependence. SE_2&amp;rsquo;s martingale transformation handles this serial dependence directly, without requiring the extra lag to purge it.&lt;/p&gt;
&lt;p&gt;Q: What scope conditions limit the paper&amp;rsquo;s framework?&lt;/p&gt;
&lt;p&gt;A: The framework rules out explosive roots (violating the uniform impulse response bound in Assumption 1) and integration of order two or higher (violating Assumption 1(iii)). For I(2) variables, the prescribed solution is to take differences before applying the LP, and then use the cumulated response (gamma = gamma_CIR) to recover original level responses. The growing-horizon results require the tension condition h_bar * p^2 / n -&amp;gt; 0 (for gamma with ||gamma||_1 = O(1)), implying a binding tradeoff between the range of allowed horizons and the range of allowed lag orders. Results do not directly extend to structural identification without additional assumptions.&lt;/p&gt;
&lt;p&gt;Local Projection (LP) regression: A direct regression of the outcome h periods ahead on current and lagged endogenous variables, as in Jorda (2005). The LP estimator of the horizon-h impulse response is the OLS coefficient on the current endogenous variable in this regression, with p-1 lags included as controls. It estimates impulse responses directly for each horizon without imposing the recursive structure of a VAR model.&lt;/p&gt;
&lt;p&gt;Uniform asymptotic validity: A distributional approximation (here, standard normal) that holds simultaneously over a parameter space A, a range of model lag orders [p_low, p_high], a range of horizons [1, h_bar], and specifications of the linear combination vector gamma — not merely pointwise for fixed parameter values. Uniformity is the operative concept ensuring finite-sample reliability across empirically relevant configurations.&lt;/p&gt;
&lt;p&gt;Semiparametric efficiency: In the paper&amp;rsquo;s usage, the LP estimator achieves the efficiency bound of Chamberlain (1987) for the semiparametric conditional moment restriction model E(yt - sum a_j y_{t-j} | ys, s &amp;lt;= t-1) = 0 when the controlled lag order diverges. Under Gaussianity, this coincides with Cramer-Rao efficiency. The key result is that the efficiency loss of LP relative to VAR-implied methods — well-documented under finite small-order VAR — is asymptotically negligible once the lag order diverges.&lt;/p&gt;
&lt;p&gt;Martingale difference sequence (MDS) shocks: The shock process u_t satisfying E(u_t | u_s, s &amp;lt;= t-1) = 0 almost surely — a condition weaker than mean independence (E(u_t | u_s, s &amp;lt;= t-1) = 0 for all functions of past shocks). MDS shocks include GARCH and stochastic volatility processes. The paper&amp;rsquo;s SE_2 is designed to be consistent for general MDS shocks, while SE_1 and MOPM require the stronger mean-independence condition.&lt;/p&gt;
&lt;p&gt;SE_2 (martingale-transformed standard error): The paper&amp;rsquo;s proposed standard error, constructed by first regressing LP residuals Delta_t on their forward values (Delta_{t+1}, &amp;hellip;, Delta_{t+h-1}) to partial out serial dependence, then using the residual hat{xi}&lt;em&gt;{1t} in the variance estimator as a simple sum of squares. SE_2 is uniformly consistent for general MDS shocks and requires no bandwidth selection, because the residual hat{xi}&lt;/em&gt;{1t} asymptotically recovers the MDS LP(infinity) error xi_{1t}(h; gamma).&lt;/p&gt;
&lt;p&gt;VAR(infinity) model: A vector autoregression yt = sum_{j=1}^{infinity} a_j y_{t-j} + u_t with potentially infinitely many lags. The paper&amp;rsquo;s framework treats the true lag order as unknown and possibly infinite, requiring the controlled lag order p in the LP regression to diverge (at a rate constrained by Assumption 2) so that truncation bias becomes asymptotically negligible. VARMA processes are a special case shown to satisfy the paper&amp;rsquo;s assumptions.&lt;/p&gt;
&lt;p&gt;Cumulated impulse response: The linear combination beta_1(h; gamma_CIR) = sum_{j=1}^{h} beta_1(j), corresponding to gamma = (1, &amp;hellip;, 1)&amp;rsquo;. Cumulated responses exhibit slower convergence rates than individual responses — h^{-3/2} n^{1/2} versus (sum_{i=0}^{h-1} a_1^{2i})^{-1/2} n^{1/2} for an AR(1) — and are especially relevant when the response variable is in differences and the researcher seeks level responses of the original variable.&lt;/p&gt;</description></item><item><title>Making the Invisible Hand Visible: Managers and Worker Allocation</title><link>https://macropaperwarehouse.com/papers/making-the-invisible-hand-visible-managers-and-worker-allocation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/making-the-invisible-hand-visible-managers-and-worker-allocation/</guid><description>&lt;p&gt;This paper asks why managers matter for firm performance, and specifically whether managers improve productivity by matching workers to better-suited jobs inside firms rather than through supervision, motivation, or selection out of the firm. The setting is the internal labor market of a large private consumer goods multinational enterprise (MNE) operating in more than 100 countries, with annual turnover exceeding EUR 50 billion. The data cover the universe of white-collar workers and managers at the firm — 200,000 workers and 30,000 managers observed monthly over 11 years (January 2011 to December 2021) — linked to payroll, performance ratings, organizational chart, digital platform activity, employee surveys, and an independent sales productivity series for field sales workers in 15 countries.&lt;/p&gt;
&lt;p&gt;The paper confronts two identification challenges. First, the author constructs a measure of manager quality — &amp;ldquo;high flyers&amp;rdquo; — defined as managers who were promoted to the first managerial work level (WL2) by age 30. This threshold yields 26.2% of managers classified as high flyers. The measure is defined entirely ex ante, before the manager ever supervises the worker under study, which addresses reverse causality. It is validated against ex post performance metrics including future salary growth, probability of promotion to WL3, performance ratings, and anonymous subordinate feedback. Second, to identify causal effects of manager quality on workers, the author exploits the firm&amp;rsquo;s long-standing policy of rotating WL2 managers laterally across teams as part of their career development, a practice implemented for several decades. Using an event-study design centered on the worker&amp;rsquo;s first manager transition, the author compares workers who transition from a low-flyer to a high-flyer manager (LtoH) against workers who transition from one low-flyer to a different low-flyer (LtoL), netting out the effect of the transition itself. Pre-event parallel trends are confirmed empirically.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. Gaining a high-flyer manager causes substantial reallocation of workers within the firm through lateral job transfers: seven years after the manager transition event, cumulative lateral moves are 40% higher for workers who gained a high-flyer manager relative to those who gained another low-flyer. These lateral moves are not confined to a single organizational margin — transfers rise within-team, across teams in the same function, and across functions — and they involve meaningfully larger shifts in task content, as measured by angular separation across O*NET cognitive, routine, and social task intensity dimensions, with cumulative task distance becoming statistically distinguishable from zero approximately seven quarters post-transition. These gains in lateral mobility translate into persistent wage growth: seven years after the manager transition, workers supervised by a high-flyer earn salaries 13% higher than the comparison group, with divergence beginning only after the transition date. Using independent sales bonus data, three years after gaining a high-flyer manager workers&amp;rsquo; sales productivity increases by 0.347 standard deviations, ruling out the interpretation that wage gains merely reflect manager favoritism rather than genuine productivity improvement. Establishment-level data further show that sites with a higher share of workers under high-flyer managers display higher output per worker and lower operational costs per unit.&lt;/p&gt;
&lt;p&gt;Effects are asymmetric: gaining a good manager has large positive effects, but losing one (comparing HtoL with HtoH transitions) produces no corresponding negative effects, implying that a single exposure to a high-flyer manager generates durable benefits that survive a subsequent downgrade in manager quality. A mediation analysis finds that 64% of the salary gain is explained by lateral job changes, though the author notes this understates the full allocation channel because it excludes vertical transfers and the gains from remaining well-matched in the current role. These findings hold under multiple robustness checks including restricting to new hires, using the Sun and Abraham (2021) interaction-weighted estimator, varying the age threshold for high-flyer classification, using a tenure-based alternative, and placebo tests with randomly assigned manager types.&lt;/p&gt;
&lt;p&gt;The scope conditions are specific to white-collar workers at a large, organizationally homogeneous consumer goods multinational. All workers hold college degrees, mean firm tenure is 8.5 years, team sizes average five workers, and the firm has the same organizational structure across all countries, functions, and years.&lt;/p&gt;
&lt;p&gt;Q: How does the paper define &amp;ldquo;high flyer&amp;rdquo; managers and what share of managers receive this classification?
A: High flyers are managers who achieved the first managerial work level (WL2) by age 30, a threshold derived from continuous age estimates constructed from 10-year age bands in the personnel records. This definition yields 26.2% of managers classified as high flyers. The measure is time-invariant and defined ex ante relative to any interaction with the workers whose outcomes are studied.&lt;/p&gt;
&lt;p&gt;Q: What validates the high-flyer measure as capturing genuine managerial ability rather than noise?
A: The high-flyer classification is significantly positively correlated with multiple ex post performance metrics recorded after the manager&amp;rsquo;s own promotion: future salary growth, probability of subsequent promotion to WL3 (director level), annual performance ratings, and anonymous upward feedback scores from subordinates on leadership. High flyers are also 14.5 percentage points less likely to be mid-career recruits, suggesting they are internally developed talent rather than external hires.&lt;/p&gt;
&lt;p&gt;Q: What is the source of identifying variation and how does the event-study design address endogeneity?
A: The firm has operated a decades-long policy of rotating WL2 managers laterally across teams to broaden their experience and to screen candidates for promotion to WL3. These rotations are asserted by firm executives and HR representatives to be orthogonal to worker and team characteristics. The author verifies this empirically by showing that a wide range of team characteristics measured over the two years before a transition — including team performance, inequality, transfer rates, and team diversity — cannot predict the type of incoming manager. The event-study design compares workers who receive a high-flyer replacement (LtoH) against workers who receive another low-flyer replacement (LtoL), netting out any generic effect of a managerial change, and confirms parallel pre-trends.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of gaining a high-flyer manager on lateral job mobility?
A: Seven years after the manager transition, workers assigned to a high-flyer manager exhibit lateral moves that are 40% higher relative to workers assigned to another low-flyer. These lateral moves occur across all organizational margins: within the same team, across teams within the same function (the largest contributor), and across functions. Beyond frequency, lateral moves under high-flyer managers also involve larger task-content shifts, with cumulative task distance (measured using O*NET cognitive, routine, and social task dimensions via angular separation) becoming statistically distinguishable from zero approximately seven quarters after the transition.&lt;/p&gt;
&lt;p&gt;Q: What is the wage effect of gaining a high-flyer manager and when does it materialize?
A: Workers who transition from a low-flyer to a high-flyer manager earn a salary 13% higher than workers who transition to another low-flyer, measured seven years after the transition event. The divergence begins only after the transition date, consistent with the pre-event parallel trends assumption, and accumulates gradually rather than appearing as an immediate jump.&lt;/p&gt;
&lt;p&gt;Q: Does the wage gain reflect genuine productivity improvement or simply managerial favoritism in pay decisions?
A: The author uses an independent sales bonus series — based on monthly targets set by supply chain demand planning teams, not by managers — for 5,604 field sales workers in 15 countries from 2018 to 2021. Three years after gaining a high-flyer manager, workers&amp;rsquo; sales productivity increases by 0.347 standard deviations. This confirms that pay gains correspond to actual productivity improvement rather than inflated ratings for unchanged performance.&lt;/p&gt;
&lt;p&gt;Q: How much of the wage gain is attributable to the lateral reallocation channel specifically?
A: A mediation analysis attributes 64% of the 13% salary gain to lateral job changes. The author cautions that this is a lower bound because the mediation excludes vertical transfers (which mechanically raise salary) and does not capture gains for workers who remain in their current job because it represents a good match rather than requiring reallocation.&lt;/p&gt;
&lt;p&gt;Q: Are the effects symmetric — does losing a high-flyer manager reverse the gains?
A: No. Comparing workers who transition from a high-flyer to a low-flyer manager (HtoL) against workers who transition from a high-flyer to another high-flyer (HtoH) reveals no corresponding negative effects. The gains from a single prior exposure to a high-flyer manager are persistent and are not undone by a subsequent low-quality manager. The author interprets this as evidence that a good match, once created, endures independently of the manager who created it.&lt;/p&gt;
&lt;p&gt;Q: Does gaining a high-flyer manager raise the rate of worker exit from the firm?
A: No. There is no statistically detectable effect on either voluntary exits (quits) or involuntary exits (layoffs), with null results that are not masked by heterogeneity across high- and low-performing workers. This rules out the interpretation that high-flyer managers improve measured outcomes of retained workers by selecting out underperformers.&lt;/p&gt;
&lt;p&gt;Q: Do workers move into roles connected to their high-flyer manager&amp;rsquo;s prior network or follow their manager when the manager moves?
A: No. There is no evidence that workers move into roles connected to the high-flyer manager&amp;rsquo;s prior colleagues; if anything, subordinates of high-flyer managers are less likely to make such moves. Workers also do not follow their high-flyer managers when those managers subsequently rotate to a different team. These findings rule out favoritism, social network access, and information-advantage explanations as primary drivers.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rule out on-the-job teaching (human capital transmission) as the primary mechanism?
A: If high-flyer managers improved worker outcomes primarily by teaching workers to be more productive in their current job, the prediction would be reduced lateral mobility (workers become too productive to leave their current role). The observed pattern — substantially higher rates of lateral reallocation under high-flyer managers — is the opposite of this prediction, making teaching as the dominant channel unlikely.&lt;/p&gt;
&lt;p&gt;Q: What does the manager behavior evidence show about how high flyers spend their time?
A: Time-use data from a random sample of approximately 600 WL2 managers in 2019 show that high-flyer managers spend 19% more time in one-on-one meetings with subordinates and engage more in communication and multitasking activities relative to low-flyer managers. Their skill profiles also differ: high flyers are more likely to have strengths in strategy and talent management rather than project management, consistent with a more coordination-intensive and people-development-oriented style.&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is there in who benefits from high-flyer managers?
A: Effects are larger when managers and workers are in the same physical office (proximity facilitates talent assessment), when the organizational unit has a more diverse set of job roles (more matching opportunities), and for younger workers who are still discovering their comparative advantages. Critically, benefits are not concentrated among high-baseline performers: workers with low initial pay growth experience gains comparable to those of high performers, suggesting high-flyer managers uncover and deploy hidden talent broadly rather than accelerating only already-visible stars.&lt;/p&gt;
&lt;p&gt;Q: Does high-flyer management aggregate to establishment-level productivity?
A: Yes. Establishments where a higher share of workers are supervised by high-flyer managers show higher output per worker (tons per FTE) and lower operational costs per unit of output (operational costs per ton), measured using establishment-year data across approximately 150 sites globally over 2019-2021. This is consistent with the individual-level allocation mechanism producing aggregate productivity gains.&lt;/p&gt;
&lt;p&gt;Q: What are the organizational design implications of the asymmetric effects?
A: Because the gains from a single exposure to a high-flyer manager persist even after a subsequent manager downgrade, firms do not need each worker to be continuously supervised by a high-flyer. It is sufficient to rotate high-flyer managers across teams so that each worker receives at least one exposure. This makes the allocation mechanism resource-neutral relative to hiring, firing, or formal training programs.&lt;/p&gt;
&lt;p&gt;High flyer (paper&amp;rsquo;s definition): A manager who achieved the first managerial work level (WL2) at the firm by age 30 — a time-invariant, ex ante classification representing the firm&amp;rsquo;s revealed-preference assessment of leadership potential, validated against subsequent salary growth, promotion probability, performance ratings, and subordinate feedback. Constitutes 26.2% of managers in the sample.&lt;/p&gt;
&lt;p&gt;Internal labor market (paper&amp;rsquo;s usage): The system within the firm through which workers are allocated to jobs via lateral transfers and vertical promotions, mediated by managers rather than by external price mechanisms; the institutional context within which manager-worker matching produces wage growth and productivity gains.&lt;/p&gt;
&lt;p&gt;Lateral transfer (paper&amp;rsquo;s usage): A horizontal reallocation of a worker to a different job title, team, subfunction, or function at the same work level, as distinct from a vertical promotion. Captured monthly in personnel records; operationalized as moves involving changes in task content measured by O*NET task distances.&lt;/p&gt;
&lt;p&gt;Task distance (paper&amp;rsquo;s usage): The angular separation between origin and destination occupations across three O*NET task dimensions (cognitive, routine, and social intensity), ranging from zero (identical task profiles) to one (completely distinct profiles), used to characterize the substantive scope of lateral moves induced by high-flyer managers.&lt;/p&gt;
&lt;p&gt;Manager rotation (paper&amp;rsquo;s usage): The firm&amp;rsquo;s longstanding policy of reassigning WL2 managers laterally across teams within a subfunction, designed to broaden managerial experience and screen for promotion to WL3; treated in the empirical strategy as generating plausibly exogenous variation in the manager type each worker encounters.&lt;/p&gt;
&lt;p&gt;Allocation mechanism (paper&amp;rsquo;s usage): The process by which managers discover workers&amp;rsquo; specific skills and match them to specialized jobs inside the firm, operating through lateral reallocation rather than through hiring, firing, or on-the-job training; identified in the paper as the primary channel through which high-flyer managers generate persistent wage and productivity gains.&lt;/p&gt;
&lt;p&gt;Asymmetric persistence (paper&amp;rsquo;s usage): The empirical pattern in which the gains from gaining a high-flyer manager are large and durable, while losing a high-flyer manager (transitioning to a low-flyer) produces no corresponding negative effects on the outcomes of previously well-matched workers, implying that good matches, once formed, survive a change in manager quality.&lt;/p&gt;</description></item><item><title>Manager Pay Inequality and Market Power</title><link>https://macropaperwarehouse.com/papers/manager-pay-inequality-and-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/manager-pay-inequality-and-market-power/</guid><description>&lt;p&gt;This paper asks whether managers are paid for market power. Bao, De Loecker, and Eeckhout build a general equilibrium model in which firms compete oligopolistically in goods markets (following Atkeson and Burstein 2008) while managers are allocated to firms through a competitive matching market (following Gabaix and Landier 2008 and Tervio 2008). The model identifies two distinct channels through which market power and firm size jointly determine executive compensation: a market power channel, whereby a more productive firm charges a higher markup given its output level, and a firm size channel, whereby higher total factor productivity expands output given markups. Because manager ability and firm type are complementary inputs into TFP, assortative matching arises: high-ability managers sort into high-type firms, amplifying both productivity dispersion and markup dispersion across firms.&lt;/p&gt;
&lt;p&gt;The authors estimate the model year-by-year using Simulated Method of Moments on Compustat data covering 1994 to 2019, targeting ten moments including the average salary share, markup distribution, employment, and manager compensation levels. Firm-level markups are estimated using the production approach of De Loecker, Eeckhout, and Unger (2020). The ExecuComp variable TDC1 — encompassing salary, bonus, restricted stock grants, and option grant values — measures manager pay. Finance, insurance, and real estate sectors (SIC 6000–6799) are excluded.&lt;/p&gt;
&lt;p&gt;Main findings: market power accounts for on average 45.8% of total manager pay over the sample period, rising from 38.0% in 1994 to 48.8% in 2019. Over the full period, average CEO compensation (net of reservation utility) roughly doubled, from approximately $2.94 million to $6.43 million. Of the $3.49 million cumulative increase, $2.02 million (57.8%) is attributed to rising market power, with the remainder ($1.47 million) due to the firm size channel. The market power channel&amp;rsquo;s dominance is concentrated among top managers: for the highest-ranked managers in 2019, 80.3% of pay is attributable to market power, and nearly all of their pay growth since 1994 stems from the market power channel. For lower-ranked managers, pay is determined primarily by the firm size channel and has been roughly flat over the period.&lt;/p&gt;
&lt;p&gt;Within the market power channel, changes in technology — specifically increasing dispersion in firm-level TFP — are the dominant factor, contributing $1.33 million (65.9% of total market power channel growth). The increasing importance of manager ability (rising parameter alpha) contributes an additional $1.14 million through the market power channel. Within the firm size channel, TFP change accounts for 70.1% ($1.03 million) of growth, but the large effects from rising alpha and rising complementarity (gamma) are substantially offset by increasing dispersion in firm type. Structural estimates confirm that the average number of firms per market declines from 4.40 to 3.15, and firm-type dispersion (sigma_z) rises from 0.51 to 0.77, both consistent with rising market power over the period.&lt;/p&gt;
&lt;p&gt;A counterfactual economy with no market power — firms priced at marginal cost — would yield a social welfare gain of 58.4% on average. The welfare cost of market power in 1994 could be offset by a 33.8% TFP increase; by 2019 the required TFP offset had risen to 51.7%. Without any market power, even the most talented managers would earn only their reservation utility, because firms earn zero profits regardless of productivity, eliminating the complementarity-driven matching surplus that makes top managers valuable. This confirms that superstar manager pay is intrinsically tied to the existence of market power in goods markets, not solely to firm size.&lt;/p&gt;
&lt;p&gt;Scope conditions: the model applies to publicly listed US firms covered by Compustat and ExecuComp. The mechanism relies on Cournot competition within oligopolistic markets, assortative matching between managers and firms, and complementarity between manager ability and firm type (elasticity of substitution gamma estimated to be negative throughout the sample). The findings on market power share apply to CEOs specifically; the authors argue the same logic extends to all managerial positions with span-of-control over other workers, which encompasses roughly one-fifth of the workforce.&lt;/p&gt;
&lt;p&gt;Q: What are the two channels through which manager pay is determined in the model, and how do they differ mechanically?
A: The market power channel captures how a given level of TFP translates into higher markups — more productive firms charge more above marginal cost — thereby increasing profits per unit of output. The firm size channel captures how higher TFP expands the quantity of output a firm produces, increasing total profits through scale rather than through price-cost margin. Both channels raise profits and thus the marginal product of managers, but they operate through distinct economic mechanisms: one through pricing power and the other through productive scale.&lt;/p&gt;
&lt;p&gt;Q: What is the empirical magnitude of the market power channel&amp;rsquo;s contribution to manager pay levels and growth?
A: Market power accounts for an average of 45.8% of total manager pay over 1994–2019, rising monotonically from 38.0% in 1994 to 48.8% in 2019. For the total pay increase of $3.49 million over the period, $2.02 million (57.8%) is due to the increase in market power, with the remaining $1.47 million attributable to the firm size channel.&lt;/p&gt;
&lt;p&gt;Q: How does the market power channel&amp;rsquo;s importance vary across the manager ability distribution?
A: For the highest-ranked managers, 80.3% of total pay in 2019 is attributable to market power, and nearly all of their pay growth since 1994 runs through the market power channel. For the lowest-ranked managers, pay is almost entirely explained by the firm size channel and has been approximately flat over the period. This heterogeneity arises because top managers sort into high-markup firms through assortative matching, making their compensation disproportionately dependent on those firms&amp;rsquo; market power.&lt;/p&gt;
&lt;p&gt;Q: How does the model generate assortative matching between manager ability and firm type?
A: Manager ability and firm type are complementary inputs into TFP (the CES aggregator with elasticity of substitution gamma less than one), which makes the matching output supermodular. In a frictionless matching market with transferable utility, supermodularity guarantees that high-ability managers match with high-type firms in equilibrium (Proposition 1). This positive assortative matching then amplifies productivity and markup dispersion, since the most productive firms become even more productive and gain larger market shares.&lt;/p&gt;
&lt;p&gt;Q: What structural changes drive the rising importance of market power in manager pay over time?
A: The dominant factor within the market power channel is changes in technology, specifically increasing firm-type dispersion (sigma_z rising from 0.51 to 0.77), which contributes $1.33 million or 65.9% of market power channel growth. The rising importance of manager ability (alpha, the weight on manager ability relative to firm type in the TFP aggregator) contributes another $1.14 million. The number of firms per market declines from an average of 4.40 to 3.15, further reducing competitive pressure and amplifying the markup premium for high-productivity firms.&lt;/p&gt;
&lt;p&gt;Q: What does the counterfactual with no market power (first-best pricing) imply for manager pay and social welfare?
A: Without market power, firms price at marginal cost and earn zero profits regardless of productivity, which eliminates the surplus from manager-firm matching. All managers would earn only their reservation utility, which is negligible relative to actual compensation. Social welfare would increase by 58.4% on average. The efficiency cost of market power — measured as the TFP increase needed to offset welfare losses — rose from 33.8% in 1994 to 51.7% in 2019, indicating a worsening welfare distortion over the period.&lt;/p&gt;
&lt;p&gt;Q: How are markups measured, and what is their trend in the data?
A: Markups are not directly observable and are estimated using the production approach of De Loecker, Eeckhout, and Unger (2020), which recovers firm-level price-cost margins from production data without requiring price data. Average markups in the Compustat sample rose from 1.53 in 1994 to 1.78 in 2019. The reduced-form elasticity of manager pay with respect to markups (controlling for firm characteristics, year, and firm fixed effects) increased substantially: in 2019 a one-percent increase in firm-level markup raises manager pay by 0.41 percent, which is 70.1% larger than the effect estimated in 1994.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle the identification challenges inherent in regressing manager pay on markups?
A: The reduced-form regression (with firm fixed effects, year effects, and interactions of year dummies with markups) documents a robust positive correlation but cannot establish causality due to reverse causality and omitted-variable bias. The paper addresses this by embedding the markup-manager pay relationship in a structural model where both are jointly determined by primitives — technology, market structure, and manager ability — and estimating those primitives via Simulated Method of Moments. The quantitative decomposition into market power and firm size channels derives from the model structure rather than from identifying variation in an instrumental variables sense.&lt;/p&gt;
&lt;p&gt;Q: What do the matching model estimates reveal about manager-firm complementarity over time?
A: The estimated elasticity of substitution between manager ability and firm type (gamma) is negative throughout the sample, confirming complementarity. Gamma was relatively stable before declining sharply from -2.22 in 2014 to -3.55 in 2019, indicating that manager ability and firm type became substantially more complementary in the latter part of the sample. The importance-of-manager parameter alpha is small (consistent with Gabaix and Landier 2008) but generally increasing, suggesting managers play an expanding role in determining firm-level TFP over time.&lt;/p&gt;
&lt;p&gt;Q: What are the broader macroeconomic and distributional implications of the findings?
A: Because approximately one-fifth of workers supervise other workers, the market-power-driven premium in managerial pay has implications beyond CEO compensation for the shape of the earnings distribution. The rise in top-1-percent income is identified as an efficiency concern, not just an equity concern: the best managers are hired by high-markup firms where they generate profits for shareholders but disproportionately little additional social value. Assortative matching between top managers and top firms widens the productivity gap between competitors, increasing market power and deadweight loss — the social return to managerial talent is therefore below the private return in equilibrium.&lt;/p&gt;
&lt;p&gt;Market Power Channel: The component of manager pay attributable to how a firm&amp;rsquo;s TFP raises its markup — the ratio of output price to marginal cost — given the level of output. Distinct from the firm size channel; operates through pricing power rather than scale.&lt;/p&gt;
&lt;p&gt;Firm Size Channel: The component of manager pay attributable to how a firm&amp;rsquo;s TFP expands output quantity given markups. Increasing output scale raises total profits and thus the marginal product of the manager even absent any change in price-cost margins.&lt;/p&gt;
&lt;p&gt;Assortative Matching: The equilibrium allocation of high-ability managers to high-type firms, arising because manager ability and firm type are complementary inputs into TFP (supermodular matching output). Matching is determined in a frictionless market with transferable utility.&lt;/p&gt;
&lt;p&gt;Markup: The ratio of output price to marginal cost, equal to the inverse of the price elasticity of demand under the nested CES preference structure. Endogenously determined by the firm&amp;rsquo;s sales share within its oligopolistic market and the elasticities of substitution within markets (eta) and across markets (theta).&lt;/p&gt;
&lt;p&gt;Manager-Firm Complementarity: The property that manager ability and firm type are imperfect substitutes with elasticity of substitution gamma less than one in the TFP aggregator. Complementarity is the necessary condition for positive assortative matching and for the supermodularity of matching surplus.&lt;/p&gt;
&lt;p&gt;Span of Control (Lucas 1978): The mechanism by which a manager raises the productivity of all workers under supervision, so that a more able manager generates a proportionally larger productivity gain the larger the firm. Provides the microfoundation for why firm size amplifies the value of manager ability.&lt;/p&gt;
&lt;p&gt;Market Structure: The number of firms in each oligopolistic sub-market (Ij), which varies across markets and over time. Together with the distribution of firm-level TFP within a market, market structure determines how much competitive pressure limits markup extraction. Average firms per market declines from 4.40 to 3.15 over 1994–2019.&lt;/p&gt;</description></item><item><title>Manipulation-Robust Prediction</title><link>https://macropaperwarehouse.com/papers/manipulation-robust-prediction/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/manipulation-robust-prediction/</guid><description>&lt;p&gt;This paper addresses the problem of algorithmic manipulation: when consequential decisions are encoded in machine learning algorithms, individuals strategically alter their behavior to achieve desired outcomes, undermining the predictive validity of the algorithm. The authors develop a &amp;ldquo;strategy-robust&amp;rdquo; approach to training decision rules that explicitly models the incentives and costs of manipulation, producing rules that remain stable even when fully transparent. They then deploy and evaluate this approach in a large field experiment in Kenya — the first real-world implementation and evaluation of such a strategy-robust empirical decision rule.&lt;/p&gt;
&lt;p&gt;The theoretical framework considers a policymaker who observes training data with features x_i and optimal decisions y_i, and wishes to estimate a decision rule to apply to new instances where behavior may be manipulated. While the standard approach (OLS or LASSO) selects a rule optimal for the training distribution, the strategy-robust approach models how individuals will adjust behavior in response to the incentive structure implied by any given rule. Under linear decision rules and quadratic manipulation costs, each individual shifts behavior by C_i^{-1} * beta away from their &amp;ldquo;bliss level,&amp;rdquo; where C_i captures individual- and behavior-specific manipulation costs. The strategy-robust estimator finds the rule that minimizes prediction error in the counterfactual world where people manipulate — a &amp;ldquo;Stackelberg&amp;rdquo; solution that commits the policymaker to a rule while anticipating equilibrium behavioral responses. Unlike LASSO, which penalizes all features equally without regard to their manipulability, the strategy-robust approach attenuates the weight on features that are both easily manipulated and subject to manipulation noise.&lt;/p&gt;
&lt;p&gt;The empirical setting is a smartphone app (&amp;ldquo;Smart Sensing&amp;rdquo;) deployed to 1,557 participants in Nairobi, Kenya, in collaboration with the Busara Center. The app passively collected over 1,000 behavioral indicators (calls, texts, app usage, mobility, etc.) and delivered weekly financial &amp;ldquo;challenges&amp;rdquo; that rewarded participants based on decision rules randomly assigned to them. Average weekly payouts were calibrated to approximate typical digital credit loan amounts in Kenya at the time (approximately $4.80). The experiment has two phases: a training phase using control (beta = 0) and simple single-behavior incentive rules to estimate manipulation cost parameters via GMM, and an implementation phase using complex multi-feature decision rules to compare strategy-robust versus LASSO classifiers.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, participants demonstrably manipulate behavior: a joint F-test that incentive diagonals all equal zero is rejected with p &amp;lt; 0.001. The number of texts sent was 49 times more responsive to incentives than the number of people called during the workday. Outgoing communications are cheaper to manipulate than incoming, and simple behaviors (e.g., average talk time) more manipulable than complex ones (e.g., standard deviation of talk time). Individuals who self-report higher tech skills find manipulation 9% easier on average, and the 90th percentile of gaming ability finds manipulation twice as easy as the 10th percentile.&lt;/p&gt;
&lt;p&gt;Second, in the implementation phase, strategy-robust decision rules outperform LASSO when the decision rule is made transparent to participants. Across all pooled outcomes, strategy-robust rules reduce RMSE by 11% (p = 0.024) relative to LASSO under transparency. For the single income-prediction outcome alone, the improvement is 5% ($0.19 RMSE reduction) but not statistically significant (p = 0.507).&lt;/p&gt;
&lt;p&gt;Third, the framework enables estimation of the &amp;ldquo;cost of transparency.&amp;rdquo; Making naive LASSO rules transparent lowers performance by 23%. Switching to strategy-robust rules under full transparency reduces that performance decline to 9.2% — a 60% reduction in the cost of transparency. The model predicts this cost to be 9.8%, close to the implemented value of 11.3%.&lt;/p&gt;
&lt;p&gt;The scope of the findings is bounded by the linear model with quadratic manipulation costs, a particular population of Kenyan smartphone users, and financial incentive magnitudes comparable to small digital credit loans. The mechanism relies on experimentally estimating manipulation cost parameters, though the authors also show that expert elicitation provides a correlated but noisier substitute (correlation 0.30 with experimental estimates).&lt;/p&gt;
&lt;p&gt;Q: What is the core market failure the paper addresses, and why do standard fixes fail?&lt;/p&gt;
&lt;p&gt;A: Standard machine learning training assumes the relationship between observed features and outcomes is stable, but implementing a consequential decision rule creates incentives for individuals to manipulate the features on which the rule is based (Goodhart&amp;rsquo;s Law; Lucas critique). The two common industry responses — restricting to &amp;ldquo;stable&amp;rdquo; predictors and keeping rules secret — are inadequate: restricting predictors amounts to a dogmatic prior that manipulation costs are either infinite or zero, while secrecy is increasingly at odds with demands for algorithmic transparency and fails anyway when sophisticated actors reverse-engineer the rule. Periodic retraining treats manipulation as generic covariate shift, can produce non-converging oscillations, and requires observing mistakes before learning from them.&lt;/p&gt;
&lt;p&gt;Q: How does the strategy-robust estimator differ from OLS and LASSO?&lt;/p&gt;
&lt;p&gt;A: OLS maximizes fit within the unincentivized training sample but ignores that implementing beta will shift behavior; LASSO adds a regularization penalty but still assumes behavior remains fixed at bliss levels and so penalizes all features equally regardless of manipulability. The strategy-robust estimator replaces each individual&amp;rsquo;s observed behavior x_i with their anticipated counterfactual behavior x_tilde_i(beta) = x_i + C_i^{-1} * beta, and finds the beta that minimizes prediction error in this manipulated distribution — a Stackelberg equilibrium. It attenuates features that are easily manipulated or subject to high manipulation noise, shifting weight toward harder-to-manipulate features even when the latter are less predictive in the training data.&lt;/p&gt;
&lt;p&gt;Q: What are the three ways the strategy-robust estimator differs from standard estimators?&lt;/p&gt;
&lt;p&gt;A: First, it anticipates level shifts in behavior: behaviors respond to beta, so observed training behaviors are replaced by counterfactual manipulated behaviors. Second, it accounts for signaling and noise: when manipulation ability correlates with the outcome of interest, manipulation can be informative about type (as in Spence 1973), but unobserved heterogeneity in gaming ability that is unrelated to outcomes introduces noise that attenuates coefficients on manipulable behaviors. Third, it achieves subgame perfection by anticipating how behaviors would respond to off-path deviations in beta, rather than assuming behaviors are fixed when beta deviates — yielding a Stackelberg rather than a one-step best-response solution.&lt;/p&gt;
&lt;p&gt;Q: How were manipulation cost parameters estimated in the Kenya experiment?&lt;/p&gt;
&lt;p&gt;A: In the training phase, each participant was randomly assigned to simple single-behavior incentive rules (e.g., &amp;ldquo;earn 12 Ksh. per incoming call this week, up to 250 Ksh.&amp;rdquo;) or control rules (beta = 0). This random variation in per-behavior incentives identifies how sensitive each behavior vector is to incentives, enabling GMM estimation of individual and behavior-specific cost parameters C and the heterogeneity scaling parameter omega. Off-diagonal elements of C were regularized to zero due to noisy estimation; diagonal elements used LASSO penalization with lambda = 1.0 set by cross-validation. Observable heterogeneity was allowed to vary with self-reported tech skills, which explained the most variation in preliminary analysis.&lt;/p&gt;
&lt;p&gt;Q: What patterns were found in manipulation costs across behaviors?&lt;/p&gt;
&lt;p&gt;A: Outgoing communications are cheaper to manipulate than incoming communications. Text messages, being relatively cheap to send, are more manipulable than calls. Simple behaviors such as average call duration are more manipulable than complex behaviors such as the standard deviation of talk time. Cross-behavior elasticities exist but are mostly noisy: 94.5% of off-diagonal incentive effects are not statistically significant (p &amp;lt; 0.05), 3.6% are significantly positive, and 1.8% are significantly negative.&lt;/p&gt;
&lt;p&gt;Q: How large is heterogeneity in gaming ability, and what predicts it?&lt;/p&gt;
&lt;p&gt;A: Individuals who self-report advanced or higher tech skills find it on average 9% easier to manipulate behaviors. Including unobserved heterogeneity, the 90th percentile of gaming ability finds manipulation twice as easy as the 10th percentile. Much of the heterogeneity arises from unobservables not captured by observables in the model.&lt;/p&gt;
&lt;p&gt;Q: What happened when the naive LASSO rule was made transparent versus when the strategy-robust rule was made transparent?&lt;/p&gt;
&lt;p&gt;A: Under the transparent treatment, participants received the full coefficients of the decision rule plus access to an interactive earnings calculator. Making naive LASSO rules transparent lowered performance by 23% relative to the opaque naive rule (RMSE $3.780 versus $4.641 in pooled outcomes). Switching to strategy-robust rules under full transparency reduced the performance decline to 9.2% — corresponding to a 60% reduction in the cost of transparency. The model predicted this cost to be 9.8%, which is close to the implemented value of 11.3%.&lt;/p&gt;
&lt;p&gt;Q: What does the reduced-form evidence on behavior change under complex decision rules show?&lt;/p&gt;
&lt;p&gt;A: Under the opaque treatment, participant behavior responses to complex decision rules were largely statistically insignificant and often in the wrong direction — 38.5% of estimated behavioral effects are in the same direction as the incentivized behavior. Under the transparent treatment, 75.4% of point-estimated effects are in the same direction as the incentive, confirming that transparency is a prerequisite for meaningful manipulation in this setting.&lt;/p&gt;
&lt;p&gt;Q: How does the paper compare strategy-robust estimation to iterative retraining?&lt;/p&gt;
&lt;p&gt;A: Simulation results show that iterative retraining of a naive LASSO model approaches the performance of the strategy-robust method after approximately 4 iterations. However, simulated performance of iterative retraining then begins to deteriorate; for the intelligence outcome, performance eventually falls below baseline performance before any retraining began. This illustrates that myopic best responses can produce non-convergent or suboptimal dynamics, while the strategy-robust approach finds the equilibrium rule directly.&lt;/p&gt;
&lt;p&gt;Q: How does the paper compare strategy-robust estimation to the &amp;ldquo;intuitive&amp;rdquo; approach of simply excluding highly manipulable features?&lt;/p&gt;
&lt;p&gt;A: The intuitive approach of excluding features above a manipulability threshold reduces predicted manipulability but also discards useful predictors. In some cases, the exclusions leave LASSO with no behaviors predictive enough to include, reducing performance. The strategy-robust approach can extract signal even from manipulable behaviors by adjusting their weights to account for manipulation noise, and outperforms the intuitive exclusion approach in the simulations reported in the Supplemental Appendix.&lt;/p&gt;
&lt;p&gt;Q: Can manipulation costs be estimated without an experiment?&lt;/p&gt;
&lt;p&gt;A: The authors briefly explore expert elicitation as a nonexperimental alternative: 171 individuals were surveyed to predict how Kenyans would manipulate phone behaviors when incentivized. Experts generally predicted lower costs (more manipulability) than observed experimentally, but the correlation between expert predictions and experimental estimates is 0.30. Using expert-elicited costs to train the strategy-robust model improved simulated performance substantially for one focal outcome and had an inconsequential negative effect for the other. Costs can also potentially be estimated from market prices and first principles when a structural model of underlying manipulations is available.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s interpretation of its results through the lens of the Lucas critique?&lt;/p&gt;
&lt;p&gt;A: The paper frames its contribution as a machine learning interpretation of Lucas (1976): just as implementing an economic policy changes the behavioral relationships on which the policy was calibrated, implementing a predictive decision rule beta changes the distribution of the very features the rule is based on. The key insight is that this counterfactual world has predictable structure — including a feature in the model tends to induce manipulation in that feature of a magnitude directly related to beta — so counterfactual fit can be estimated and rules can be optimized to perform well in the equilibrium they induce.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for algorithmic transparency?&lt;/p&gt;
&lt;p&gt;A: The framework allows a policymaker to quantify and reduce the performance cost of transparency. The estimated equilibrium cost of transparency is roughly 10% when using strategy-robust rules, substantially less than the approximately 23% cost of making naive rules transparent. This means that strategy-robust rules can be disclosed — satisfying demands for a &amp;ldquo;right to explanation&amp;rdquo; under regulations such as GDPR — while losing far less performance than opaque naive rules would lose if disclosed.&lt;/p&gt;
&lt;p&gt;Strategy-robust decision rule: A decision rule trained to anticipate that individuals will manipulate the features on which it is based, by replacing observed training behaviors with anticipated counterfactual manipulated behaviors in the loss function. It yields a Stackelberg equilibrium in which the policymaker commits to a rule while correctly forecasting the equilibrium behavioral response.&lt;/p&gt;
&lt;p&gt;Manipulation costs (C_i): Individual- and behavior-specific quadratic costs that determine how far an individual shifts behavior from their bliss level in response to the incentive implied by a decision rule&amp;rsquo;s coefficient vector beta. Higher costs imply less behavioral response; costs are parameterized to allow separable heterogeneity by person and by behavior.&lt;/p&gt;
&lt;p&gt;Bliss level (x_i): An individual&amp;rsquo;s unincentivized behavior — the behavior they would exhibit absent any decision rule (i.e., when beta = 0). Estimated from control periods in the experiment.&lt;/p&gt;
&lt;p&gt;Gaming ability (gamma_i): Individual-level scaling factor for manipulation costs; a higher value means lower costs and easier manipulation. Modeled as a function of observable characteristics (e.g., self-reported tech skills) and unobservable heterogeneity.&lt;/p&gt;
&lt;p&gt;Counterfactual fit: Predictive fit evaluated in the counterfactual state of the world where the decision rule is implemented and agents manipulate their features in response. The strategy-robust approach maximizes counterfactual fit, sacrificing within-sample fit (as measured on unmanipulated training data) to improve performance in deployment.&lt;/p&gt;
&lt;p&gt;Cost of transparency: The reduction in predictive performance of a decision rule when its coefficients are disclosed to the individuals being evaluated. In the experiment, disclosure reduces performance of naive LASSO rules by 23% and strategy-robust rules by 9.2%, implying strategy-robust rules reduce the cost of transparency by 60%.&lt;/p&gt;
&lt;p&gt;Stackelberg equilibrium: The solution concept in which the policymaker (leader) commits to a decision rule, correctly anticipating the best-response behavior of individuals (followers), rather than taking behavior as fixed or updating myopically. The strategy-robust estimator implements this equilibrium concept.&lt;/p&gt;
&lt;p&gt;Performative prediction: The broader phenomenon, drawing on Perdomo et al. (2020), whereby a decision rule changes the distribution of the data it is applied to. The paper&amp;rsquo;s strategy-robust approach is an empirically estimable solution within this framework.&lt;/p&gt;</description></item><item><title>Marginal Propensity to Consume and Personal Characteristics: Evidence from Bank Transaction Data and Survey</title><link>https://macropaperwarehouse.com/papers/marginal-propensity-to-consume-and-personal-characteristics-evidence-from-bank-transaction-data-and-survey/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/marginal-propensity-to-consume-and-personal-characteristics-evidence-from-bank-transaction-data-and-survey/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; This paper asks whether heterogeneity in the marginal propensity to consume (MPC) stems from &lt;em&gt;temporary circumstances&lt;/em&gt; (e.g., transient wealth shocks that tighten liquidity) or &lt;em&gt;persistent personal characteristics&lt;/em&gt; (e.g., high time discount rates or strong risk aversion that permanently shape saving behavior). Because liquidity constraints are endogenous — they can reflect either bad luck or impatient preferences — disentangling these two sources requires independently measured individual characteristics, which are not available in standard transaction datasets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting.&lt;/strong&gt; The study combines two data sources drawn from Mizuho Bank, one of Japan&amp;rsquo;s three largest banks (approximately 24 million individual accounts). First, weekly bank account transaction data for January 2019 to November 2022 covering all outflows (ATM withdrawals, credit card debits, utility payments, interbank transfers) for the approximately 5,282 survey respondents. Second, a bespoke survey conducted in November–December 2022 among 400,000 randomly selected salary-receiving account holders (response rate 1.32%, yielding 5,282 usable observations). The survey elicits the Arrow–Pratt measure of absolute risk aversion, quantitative time discount rates for one-week, one-year, and ten-year horizons, self-reported liquidity constraints, homeownership, education, age, and gender, among other variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three Income Shocks.&lt;/strong&gt; MPC is estimated against three distinct income events: (1) the Japanese government&amp;rsquo;s Special Cash Payments (SCP) — a 100,000 JPY (approximately 800 USD) per-person lump-sum transfer during COVID-19, likely transitory, unexpected, and nearly randomly timed across municipalities due to administrative bottlenecks; (2) regular salary receipts (recurring, expected in both timing and amount); and (3) semi-annual bonus payments (received twice yearly, with timing known in advance but amount largely unknown — intermediate between SCP and salary in terms of expectedness).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation Strategy.&lt;/strong&gt; A two-way fixed effects regression with event-study leads and lags (windows of five weeks before and after each income event) is used to estimate consumption responses. Individual and week fixed effects absorb time-invariant heterogeneity and aggregate shocks (including COVID-19 emergency declarations). Standard errors are clustered at the individual level. For heterogeneity analysis, the income shock variable is interacted with individual characteristics from the survey (treated as proxies for persistent characteristics) and with time-varying log wealth and a liquidity constraint dummy (wealth below one-twelfth of annual income, proxying temporary circumstances).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Average MPC.&lt;/strong&gt; Across all three income types, the on-impact MPC (week of receipt) is approximately 0.2: specifically γ₀ = 0.23 for the SCP (significant at 5%), 0.20 for salary, and 0.22 for bonus. When estimated jointly in a single regression, coefficients are γ_SCP = 0.21, γ_salary = 0.19, and γ_bonus = 0.21. This uniformity holds despite the sharply different properties of these shocks (transitory-unexpected vs. regular-expected vs. semi-known).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Heterogeneity.&lt;/strong&gt; Significant heterogeneity in MPC is found primarily in the bonus subsample, where statistical power is greatest. The following cross-term coefficients are significant at the 5% level in the multivariate specification: (a) &lt;em&gt;liquidity constraint dummy&lt;/em&gt; — positive and significant, indicating that individuals temporarily below one month&amp;rsquo;s income in deposits spend a larger fraction of their bonus, with a one standard deviation increase raising MPC by 0.094 (9.4 percentage points); (b) &lt;em&gt;time discount rate&lt;/em&gt; (quantitative measure) — positive and significant, with a one standard deviation increase in impatience raising MPC by 0.084; (c) &lt;em&gt;risk aversion&lt;/em&gt; (quantitative Arrow–Pratt measure) — positive and significant, conditional on controlling for wealth and liquidity, with a one standard deviation increase raising MPC by 0.031; (d) &lt;em&gt;education&lt;/em&gt; — negative and significant irrespective of wealth/liquidity controls, with a one standard deviation increase in education reducing MPC by 0.041.&lt;/p&gt;
&lt;p&gt;These magnitude estimates are sizable relative to the baseline MPC of approximately 0.2. For SCP and salary shocks, cross-term coefficients are uniformly insignificant at the 5% level, which the author attributes partly to smaller sample sizes and shorter observation windows for the SCP subsample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The sample consists of Mizuho Bank account holders who receive salary payments directly into their Mizuho account, overrepresenting metropolitan areas and salaried workers relative to the national census. Wealth at Mizuho captures only deposits at that institution and excludes securities accounts, postal savings, and intra-household transfers. Age and gender do not yield significant cross-term coefficients in any specification; the self-reported survey measure of liquidity constraints (ability to cover one month&amp;rsquo;s income by drawing on savings, assets, or borrowing) is also insignificant, in contrast to the transaction-based liquidity constraint dummy.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. Why is separating temporary circumstances from persistent characteristics important for MPC estimation?&lt;/strong&gt;
Liquidity constraints — the standard proximate predictor of high MPC — are endogenous. An individual may be liquidity-constrained because of a temporary adverse income shock (bad luck) or because of persistently high impatience (high time discount rate) that leads to chronically low saving. If policy evaluation treats all constrained households symmetrically, it conflates these two very different channels. The paper follows Jappelli and Pistaferri (2020), Gelman (2021), and Aguiar, Bils, and Boar (2021) in arguing that both channels matter and that their relative contributions need empirical separation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. Why are Japanese bonuses particularly well-suited to identifying MPC heterogeneity?&lt;/strong&gt;
Bonuses are paid semi-annually to most regular employees in Japan (accounting for roughly 15–30% of annual income), with timing known in advance but amount largely unknown until receipt. This intermediate nature — partially anticipated in timing but uncertain in magnitude — provides meaningful variation in consumption responses across individuals while maintaining a clean event-study design. The bonus subsample (3,722 individuals who received a bonus at least once) is also large enough to detect cross-term effects that are statistically insignificant in the SCP subsample (2,446 individuals) and in the salary analysis, likely due to greater statistical power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. How is the Arrow–Pratt measure of risk aversion constructed from the survey?&lt;/strong&gt;
Respondents are asked whether they would purchase a lottery ticket at prize value Z = 100,000 JPY and price p = 10,000 JPY for varying winning probabilities α. The threshold α at which a respondent switches from accepting to rejecting identifies their risk attitude. The absolute risk aversion σ = −U&amp;rsquo;&amp;rsquo;/U&amp;rsquo; is then calculated as (αZ² − 2αZp + p²) / (2(αZ − p)). This yields σ ranging from −4.5 (when α = 0.01, i.e., risk-loving) to 0.891 (when α = 1, i.e., refusing to buy even at a 90% win probability). Risk neutrality corresponds to σ = 0 (at α = 0.1).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. How are time discount rates measured, and what is the range?&lt;/strong&gt;
Respondents are asked the minimum amount X they would require to wait one week, one year, or ten years to receive a payment instead of receiving 100,000 JPY one week from now (using a one-week anchor to address hyperbolic discounting). The discount rate is calculated as r = X/100,000. The range is 0.01 (X = 100 JPY) to 100 (X = 10,000,000 JPY, i.e., would not wait even for 1,100,000 JPY in ten years). The unweighted average across one-week, one-year, and ten-year horizons is used as the composite discount rate in the multivariate specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What is the transaction-based liquidity constraint dummy, and how does it differ from the survey-based measure?&lt;/strong&gt;
The transaction-based dummy equals one if end-of-month deposits at Mizuho Bank (the previous month) are below one-twelfth of the individual&amp;rsquo;s annual income — i.e., if the individual holds less than one month&amp;rsquo;s equivalent income in liquid deposits. This is a time-varying measure. The survey-based measure asks respondents to self-report whether they could cover one month&amp;rsquo;s income by drawing on savings, selling assets, or borrowing. The transaction-based measure is significant at the 5% level in the bonus and salary heterogeneity regressions, while the survey-based measure is insignificant, indicating that the precise definition and data source of the liquidity constraint measure matters materially for detecting its effect on MPC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What are the estimated on-impact MPC values for each income shock, and how stable are they across robustness checks?&lt;/strong&gt;
The point estimates from the event-study regression (γ₀) are: 0.23 for SCP in the baseline sample (SCP recipients in 2020, N = 2,446 individuals), 0.20 for salary (all 5,282 survey respondents), and 0.22 for bonus (3,722 bonus recipients). In a robustness specification restricting to only year-2020 data for the SCP, γ₀ = 0.235; using cash withdrawals from ATMs as a proxy for consumption instead of total outflows, γ₀ = 0.162 for SCP. In a joint regression including all three income types simultaneously, γ_SCP = 0.21, γ_salary = 0.19, and γ_bonus = 0.21. The SCP MPC for the smaller second-wave subsample (200 individuals, 2021–22) is 0.104 and insignificant, consistent with insufficient statistical power rather than a structural difference.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. Why is the similarity in MPC across the three shock types potentially surprising, and what does the paper say about it?&lt;/strong&gt;
Standard theory predicts divergent MPCs: transitory unexpected windfalls (SCP) should have a higher MPC than permanent salary changes under the permanent income hypothesis, while Ricardian equivalence might reduce the MPC to fiscal transfers like the SCP if households anticipate future tax increases. The paper finds the MPCs are approximately equal (around 0.2 across all three types), and if anything the SCP MPC is slightly higher than the salary MPC. The paper acknowledges this uniformity without offering a structural explanation, using it primarily as a robustness check on the baseline estimate rather than a substantive puzzle to resolve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. Which personal characteristics are significantly associated with higher MPC, and in which income shock samples?&lt;/strong&gt;
In the multivariate heterogeneity regression, significant cross-term coefficients at the 5% level are found exclusively in the bonus subsample (columns 5–6 of Table 6): the quantitative risk aversion measure (positive, coefficient 0.042–0.049), the quantitative discount rate (positive, coefficient 0.004), and education (negative, coefficient −0.034 to −0.037). The liquidity constraint dummy (transaction-based) is also positive and significant for bonuses. In the univariate robustness regressions (Table 7), the own-house dummy is negative and significant at 5% for bonuses (controlled and uncontrolled); discount rates for one-week and ten-year horizons are positive and significant at 5% for bonuses; risk aversion A (direct self-report) is negative and significant at 5% for SCPs in the uncontrolled specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. Do age and gender matter for MPC heterogeneity?&lt;/strong&gt;
No. In all specifications across all three income shock types, the cross-term coefficients on age and the male dummy are uniformly insignificant at the 5% level. The lack of significance for age and gender is noted as a notable result, since both are commonly used demographic proxies in heterogeneous agent models that assume they reflect economically meaningful differences in consumption behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. How does the paper quantify the economic magnitude of each significant heterogeneity factor?&lt;/strong&gt;
Table 8 reports the product of each cross-term coefficient and the standard deviation of the corresponding variable. For the bonus subsample: a one standard deviation increase in the liquidity constraint dummy raises MPC by 0.094 (9.4 percentage points); a one standard deviation increase in the discount rate raises MPC by 0.084; a one standard deviation increase in risk aversion raises MPC by 0.031; and a one standard deviation increase in education reduces MPC by 0.041. All four magnitudes are described as sizable relative to the baseline MPC of approximately 0.2 (20%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. Why does the paper focus on bonuses for the heterogeneity analysis rather than the SCP?&lt;/strong&gt;
The SCP events provide cleaner identification of transitory, exogenous income shocks (near-random timing due to municipal administrative bottlenecks, as documented by Kubota, Onishi, and Toyama 2021), but the subsample of SCP recipients is smaller (2,446 in 2020, 200 in the second wave), reducing statistical power for detecting heterogeneity in cross-term coefficients. The salary sample is large (5,282 individuals) but salaries are expected, recurring, and may partially update permanent income, complicating interpretation of cross-term estimates. Bonuses offer a balance: a relatively large subsample (3,722) and a partially unexpected income component, making them the most informative sample for heterogeneity analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. What are the main caveats and limitations the paper identifies?&lt;/strong&gt;
Four caveats are noted. First, the personal characteristics from the survey — including time discount rates and risk aversion — are treated as exogenous, but they may themselves be endogenous to economic circumstances or short-term conditions at the time of the survey. Second, only Mizuho Bank deposits are observed; financial assets at other institutions (securities, postal savings) are missing, meaning the liquidity constraint measure understates true wealth for some respondents. Third, the sample is tilted toward metropolitan salaried workers and toward wealthier individuals compared to the full Mizuho customer base (median log wealth of 7.4 vs. 5.9 in Kubota et al. 2021). Fourth, the multiple-testing problem is acknowledged: with many cross-term tests conducted, some rejections of the null at the 5% level may be spurious.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Marginal Propensity to Consume (MPC, on-impact).&lt;/strong&gt; In this paper, MPC is operationalized as the coefficient γ₀ from the two-way fixed effects event-study regression — specifically, the fraction of an income shock spent during the &lt;em&gt;same week&lt;/em&gt; the shock is received, estimated from total bank account outflows. This is a weekly, within-account measure, not a lifetime or annual consumption response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Arrow–Pratt Absolute Risk Aversion (σ).&lt;/strong&gt; A quantitative measure of risk preferences computed from the paper&amp;rsquo;s survey by eliciting the probability threshold α at which a respondent is indifferent between buying and not buying a lottery with prize Z = 100,000 JPY and price p = 10,000 JPY. Calculated as σ = (αZ² − 2αZp + p²) / (2(αZ − p)). Ranges from −4.5 to 0.891 in the sample, with σ = 0 indicating risk neutrality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Time Discount Rate (r).&lt;/strong&gt; Measured by asking respondents the minimum additional amount X (beyond 100,000 JPY) they would require to delay receipt by one week, one year, or ten years, with r = X/100,000. The paper uses the unweighted average of three horizon-specific rates as a composite measure. Ranges from 0.01 to 100 in the sample. Used as a proxy for impatience or myopia — a persistent personal characteristic.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Liquidity Constraint Dummy (transaction-based).&lt;/strong&gt; A time-varying binary indicator that equals one if individual i&amp;rsquo;s end-of-month Mizuho Bank deposit balance in month t−1 is below one-twelfth of annual income at t−1 — i.e., less than one month&amp;rsquo;s equivalent income in liquid deposits. Distinguished in the paper from a survey-based self-report of liquidity constraints, which is found to be insignificant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Special Cash Payment (SCP).&lt;/strong&gt; The Japanese government&amp;rsquo;s COVID-19 pandemic transfer program, providing 100,000 JPY (approximately 800 USD) per person in 2020 (universal) and 100,000 JPY per child in 2021–22 (restricted to households with children under 18 and income below 9.6 million JPY annually). Used in this paper as a transitory, salient, and largely unexpected income shock because municipal administrative bottlenecks made the exact timing unpredictable and nearly random across households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-Way Fixed Effects Event-Study Regression.&lt;/strong&gt; The paper&amp;rsquo;s primary estimator, which includes individual fixed effects (controlling for time-invariant person-level heterogeneity) and week fixed effects (absorbing aggregate shocks such as COVID-19 emergency declarations and seasonal patterns). Event-study leads and lags (k = −5 to +5 weeks around each income receipt) allow pre-trend testing and tracing of the dynamic consumption response. Normalized to γ_{−1} = 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;MPC Heterogeneity Cross-Term.&lt;/strong&gt; A regression augmentation (equation 3 in the paper) in which the contemporaneous income shock X⁰_{it} is interacted with individual characteristic Z_{it}. The coefficient δ on this cross-term identifies how the MPC varies with Z — the marginal effect of characteristic Z on the MPC. Persistent characteristics (e.g., risk aversion, discount rate, education from the survey) and temporary circumstances (e.g., log wealth, liquidity constraint dummy from transaction data) are included as separate Z variables.&lt;/p&gt;</description></item><item><title>Marginal Returns to Public Universities</title><link>https://macropaperwarehouse.com/papers/marginal-returns-to-public-universities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/marginal-returns-to-public-universities/</guid><description>&lt;p&gt;This paper asks whether enrolling in an American public university generates positive net returns for marginal students — those who barely qualify for admission — and whether those returns justify public expenditures. The question is policy-relevant because marginal students have weak academic preparation, face high dropout risk, and the net returns to expanding admission margins are theoretically ambiguous.&lt;/p&gt;
&lt;p&gt;The author assembles administrative records spanning all 35 public universities in Texas, covering the universe of Texas public high school graduates from 2004–2014 (approximately 2.7 million students). Texas public universities collectively enroll over 10 percent of all American public university students. The data link high school records (test scores, demographics, coursework, attendance, disciplinary infractions) to college application and admission records, postsecondary enrollment and degree completion records, financial aid packages, institutional expenditure data from IPEDS, and quarterly earnings records from the Texas Workforce Commission unemployment insurance system.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits hundreds of decentralized SAT/ACT score cutoffs in university admissions — varying across schools and application years — that generate sharp discontinuities in admission probability. A fuzzy regression discontinuity design compares applicants just above versus just below each cutoff. On average, crossing a cutoff raises the probability of admission by 27 percentage points and the probability of enrolling at the target university by 15 percentage points. Density tests and pre-college covariate balance validate the smoothness assumptions. The typical cutoff complier is more disadvantaged than the average college applicant but comparable to the average Texas high school graduate.&lt;/p&gt;
&lt;p&gt;Roughly half of cutoff compliers would fall back to another, typically less selective, four-year institution if rejected; 43 percent would fall back to a two-year community college; and only about 6 percent would forgo higher education entirely. The pooled estimates therefore blend intensive-margin effects (more selective versus less selective four-year college) with extensive-margin effects (four-year college versus community college or no college).&lt;/p&gt;
&lt;p&gt;Main causal findings for enrollment compliers: the typical marginally admitted student completes approximately one additional year of credits in the four-year sector and becomes 12 percentage points more likely to ever earn a bachelor&amp;rsquo;s degree from any institution. About half of the additional four-year credits are offset by 15 fewer credits in the two-year sector, and associate degree or certificate completion falls by 7 percentage points. All bachelor&amp;rsquo;s degree gains are in non-STEM fields; STEM degree completion shows no detectable increase. Compliers become about 3 percentage points more likely to hold a graduate degree by 10 years out.&lt;/p&gt;
&lt;p&gt;On earnings, admitted compliers earn less than rejected counterparts in the first five years due to continued enrollment. Year six is the crossover point; by years 8–12, compliers earn a stable 8.6 percent earnings premium in log terms (8.2 percent in dollar ratio terms, representing a LATE of $3,339 against an untreated complier mean of $40,829), with earnings ranks rising approximately 4 percentiles from a base near the 50th percentile.&lt;/p&gt;
&lt;p&gt;Marginally admitted students pay no additional net tuition on average: $4,600 in additional gross tuition is nearly fully offset by grant aid, though they take on $5,300 more in student loans. Society incurs approximately $10,000 in additional educational expenditures per complier. Internal rates of return are 26 percent for students, 16 percent for society, and 7 percent for the government budget. At a 3 percent discount rate, the lifetime net present value of enrolling the typical marginal applicant is approximately $80,000 — $70,000 accruing to the student and $10,000 to taxpayers.&lt;/p&gt;
&lt;p&gt;Earnings gains are similar across institutions of varying selectivity, but significantly smaller for low-income compliers, who spend more time enrolled, complete fewer degrees, and major in less lucrative fields. A bounding method shows that extensive-margin compliers (those who would otherwise not attend any four-year college) experience larger effects than intensive-margin compliers.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question and why is credible evidence scarce?
A: The paper asks whether enrolling marginal students in American public universities generates positive net returns — private, social, and fiscal — and what drives heterogeneity in those returns. Credible evidence is scarce because most existing work is correlational and fails to account for selection bias: individuals with more college education may have had pre-existing advantages, confounding college&amp;rsquo;s causal effect with systematic sorting into it. Even if average returns are positive, the policy-relevant question is whether the marginal student — who has weak preparation and high dropout risk — represents a good investment.&lt;/p&gt;
&lt;p&gt;Q: What is the regression discontinuity design, and what does the first stage look like?
A: The author infers hundreds of decentralized SAT/ACT score cutoffs across approximately 700 application cells (combinations of university, year, GPA quartile, and test type) by searching for the score value with the largest discontinuity in admission and enrollment within each cell. This procedure delivers a superconsistent estimator of each cell&amp;rsquo;s true cutoff. Pooled across all cells, crossing a cutoff raises the probability of admission by 27 percentage points and the probability of enrollment at the target university by a precisely estimated 15 percentage points. The density of applicants and a rich set of pre-college characteristics run smoothly through the cutoffs, supporting the exclusion restriction.&lt;/p&gt;
&lt;p&gt;Q: Who are the cutoff compliers, and are they representative of any broader population?
A: Compliers — applicants who enroll in the target university if and only if they barely cross its cutoff — comprise approximately 15 percent of marginal applicants. In observable characteristics, compliers are roughly representative of the broader population of marginal applicants at the cutoff. They are significantly more disadvantaged than the average public university applicant, but broadly comparable to the average Texas public high school graduate in terms of academic preparation and family income.&lt;/p&gt;
&lt;p&gt;Q: What are the next-best alternatives for marginal applicants who are rejected?
A: Approximately 47 percent of compliers would fall back to another Texas four-year college (mostly public), 43 percent to a two-year community college, and approximately 9 percent would not enroll in any Texas institution. National Student Clearinghouse data for the 2008–2014 cohorts confirm that only 4 percent of untreated compliers attend a college outside the THECB universe, meaning approximately 6 percent of all compliers truly forgo higher education altogether if rejected. The empirically relevant extensive margin is therefore between the four-year sector and the two-year sector, not between college and no college.&lt;/p&gt;
&lt;p&gt;Q: How does cutoff crossing change the institutional characteristics a complier experiences?
A: Compliers are propelled into substantially better-resourced environments: the average math test score of college peers rises by half a standard deviation; peers are 12 percentage points less likely to have been low-income; gross tuition rises by $2,400 (a 42 percent increase over the untreated complier mean of $5,700); educational spending per student rises by $3,200 (43 percent over the untreated mean); peers&amp;rsquo; 10-year BA completion rate rises by 28 percentage points; and peer mean earnings 8–12 years after college entry are $6,700 higher.&lt;/p&gt;
&lt;p&gt;Q: What are the educational attainment effects?
A: Cutoff crossing causes compliers to complete approximately 28 additional credits at any four-year institution (roughly one full year of a four-year program) and increases the probability of ever earning a bachelor&amp;rsquo;s degree by 12 percentage points, raising the completion rate from approximately 40 percent to just above 50 percent. About 15 fewer two-year sector credits are offset against the four-year gains, and associate degree or certificate completion falls by 7 percentage points. All bachelor&amp;rsquo;s degree gains are in non-STEM fields; there is no detectable increase in STEM degrees. Graduate degree completion rises by approximately 3 percentage points by 10 years out.&lt;/p&gt;
&lt;p&gt;Q: What is the earnings trajectory, and when does the premium materialize?
A: Admitted compliers earn less than rejected counterparts in the first five years after application because they remain enrolled longer. Year six is the crossover point. By years 8–12, the earnings premium stabilizes at approximately 8.6 percent in log terms and 8.2 percent in dollar ratio terms (a LATE of $3,339 against an untreated complier mean of $40,829). Earnings rank rises by approximately 4 percentiles from a base near the 50th percentile. These results are robust across sandwich earnings, all-quarters-with-earnings, and zero-imputed specifications.&lt;/p&gt;
&lt;p&gt;Q: What does the cost-benefit analysis show?
A: Marginally admitted students pay no additional net tuition on average: $4,600 in additional gross tuition is nearly fully offset by additional grant aid. They do borrow $5,300 more in student loans, likely financing higher room, board, and consumption costs at four-year colleges. From society&amp;rsquo;s perspective, compliers generate approximately $10,000 in additional educational expenditures. Cumulative undiscounted earnings benefits surpass costs after 8 years for students, 11 years for society, and 19 years for taxpayers. At a 3 percent discount rate, the lifetime net present value is approximately $80,000 total — $70,000 accruing to the student and $10,000 to taxpayers — with internal rates of return of 26 percent for students, 16 percent for society, and 7 percent for the government budget.&lt;/p&gt;
&lt;p&gt;Q: Does selectivity of the admitting institution predict larger earnings returns?
A: No. Compliers at more selective institutions experience substantially larger increases in peer quality than those at less selective institutions, but they are also less likely to be on the extensive margin of four-year enrollment and experience smaller BA attainment gains. These factors roughly offset, producing no systematic difference in earnings gains across institutions of varying selectivity. More selective institutions also impose no additional cumulative cost on society, while compliers actually pay slightly less in additional net tuition at more selective schools.&lt;/p&gt;
&lt;p&gt;Q: How does the commonly used measure of college value-added (mean peer earnings) compare to actual complier returns?
A: Mean peer earnings overpredicts actual value-added for marginal students by a factor of two: compliers attend an institution with $6,700 higher average peer earnings as a result of admission but gain only $3,300 themselves. The measure also overpredicts the earnings return to selectivity by a factor of three: a 100-SAT-point increase in target school selectivity predicts $3,000 higher peer earnings but only a statistically insignificant $900 higher gain in the complier&amp;rsquo;s own earnings.&lt;/p&gt;
&lt;p&gt;Q: How do earnings returns differ by family income?
A: Compliers from low-income families experience significantly smaller earnings gains compared to higher-income compliers. The gap is not explained by differential changes in college quality induced by admission. Instead, low-income compliers gain fewer degrees despite spending more time in college and major in less lucrative fields, consistent with related findings in the literature on family income gaps in degree completion and major choice.&lt;/p&gt;
&lt;p&gt;Q: How do earnings returns differ by gender and by race?
A: Female and male compliers eventually earn similar log earnings and earnings rank gains, but women reach their gains more quickly — likely because men take longer to finish college. White and Asian compliers experience similar earnings gains and BA completion improvements as Black and Hispanic compliers, despite white and Asian students experiencing larger increases in college selectivity and spending per student as a result of admission.&lt;/p&gt;
&lt;p&gt;Q: What is the method for separating intensive- and extensive-margin effects?
A: The two complier types are not directly distinguishable in the data. The author first uses an endogenous but strong stratification variable — having at least one other Texas public university admission offer — to identify some mean potential outcomes for each type. He then imposes an empirically-informed rank assumption to bound the remaining unknown mean potential outcomes, delivering tightly informative upper and lower bounds on each margin&amp;rsquo;s effects without requiring full nonparametric identification. The results show that pooled effects are driven by larger returns for extensive-margin compliers who would not have attended any four-year college, with smaller contributions from intensive-margin compliers shifting between four-year institutions.&lt;/p&gt;
&lt;p&gt;Q: How do this paper&amp;rsquo;s earnings estimates compare to prior studies, and what explains the differences?
A: This paper&amp;rsquo;s 8 percent earnings gain is smaller than the 17–26 percent reported in prior studies (Zimmerman 2014: 22%; Kozakowski 2023: 26%; Smith, Goodman, and Hurwitz 2025: 17%; Bleemer 2024: 21%; Hoekstra 2009: 20%). The differences are likely explained by the much larger educational attainment and institutional quality gains induced by those studies&amp;rsquo; natural experiments: in Zimmerman (2014), enrollment compliers gain roughly three additional years of four-year education versus one year in this paper; in Bleemer (2024), compliers experience roughly $30,000 more in institutional spending per student versus approximately $3,000 in this paper.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions for these results?
A: The results pertain to marginal applicants to Texas public universities (excluding UT-Austin, which uses holistic admission with no detectable SAT/ACT cutoffs) from the 2004–2014 high school graduation cohorts. The identified effects are local average treatment effects for compliers — applicants who would enroll in the target university if and only if they barely crossed its admission cutoff — and do not represent effects for always-takers or infra-marginal students. Earnings are measured only for Texas-based workers covered by the state unemployment insurance system, which captures an estimated 90 percent of the civilian labor force.&lt;/p&gt;
&lt;p&gt;Cutoff complier: An applicant who enrolls in their target university if and only if their SAT/ACT score barely exceeds that university&amp;rsquo;s admission cutoff. Compliers are the population whose behavior — and thus whose treatment effects — are identified by the fuzzy RD design. They comprise approximately 15 percent of marginal applicants and are more disadvantaged than the average public university applicant but broadly comparable to the average high school graduate.&lt;/p&gt;
&lt;p&gt;Extensive versus intensive margin: The extensive margin refers to the contrast between attending any four-year college versus falling back to a two-year community college or no college. The intensive margin refers to the contrast between attending a more selective versus a less selective four-year institution. Approximately half of cutoff compliers are on each margin; the paper treats them as economically distinct parameters requiring separate identification.&lt;/p&gt;
&lt;p&gt;Fuzzy regression discontinuity (RD) design: An identification strategy that uses the discontinuous jump in admission probability at a test score cutoff as an instrument for enrollment, recovering the LATE for compliers via the ratio of the reduced-form discontinuity in outcomes to the first-stage discontinuity in enrollment. &amp;ldquo;Fuzzy&amp;rdquo; refers to the fact that crossing the cutoff changes admission and enrollment probabilities with a discrete jump rather than with certainty.&lt;/p&gt;
&lt;p&gt;Internal rate of return (IRR): The discount rate at which the net present value of an investment equals zero — here, the discount rate equating the discounted stream of earnings benefits to the discounted stream of costs. The paper estimates IRRs separately for students (26 percent), society (16 percent), and the government budget (7 percent), reflecting different cost and benefit definitions from each perspective.&lt;/p&gt;
&lt;p&gt;Rank assumption (bounding method): An empirically-informed assumption about the ordering of mean potential outcomes across latent complier types (extensive vs. intensive margin) that, combined with partial identification from a strong endogenous stratification variable, yields tight upper and lower bounds on each margin&amp;rsquo;s causal effects without requiring full nonparametric identification.&lt;/p&gt;
&lt;p&gt;Net tuition: Gross tuition charges minus grant aid. For the typical marginal complier, gross tuition rises by $4,600 but is nearly fully offset by additional grant aid, yielding approximately zero additional net tuition cost — meaning the private financial cost of attending a public university for marginal students is effectively zero on net, though they take on $5,300 more in student loans to finance room, board, and consumption.&lt;/p&gt;
&lt;p&gt;Sandwich earnings measure: A procedure applied to quarterly state earnings data that retains only quarters with positive earnings sandwiched between other quarters with positive earnings, discarding high-variance transition quarters between employment spells. Annualized by multiplying the quarterly average by four; used to reduce noise from entry and exit transitions in administrative earnings records.&lt;/p&gt;</description></item><item><title>Market Segmentation through Information</title><link>https://macropaperwarehouse.com/papers/market-segmentation-through-information/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/market-segmentation-through-information/</guid><description>&lt;p&gt;This paper asks what market outcomes an information designer — modeled as an internet platform that knows consumers&amp;rsquo; preferences — can achieve by choosing what information to disclose to competing oligopolistic firms who then make personalized price offers. The model features n firms each producing a single differentiated product at zero cost, a continuum of consumers with unit demand and multidimensional valuations (one per product), and a designer who commits to a mapping from consumer types to joint distributions over messages sent to firms before they play a simultaneous pricing game. The designer&amp;rsquo;s objective spans the full range from maximizing producer surplus to maximizing consumer surplus.&lt;/p&gt;
&lt;p&gt;The paper establishes two main results. First, under a necessary and sufficient condition called Aggregate Incentive Compatibility (AIC), the designer can implement full surplus extraction by firms — the producer-optimal outcome — in which every consumer buys her most preferred product at a price exactly equal to her valuation for it, capturing 100% of available surplus for producers. The AIC condition requires, for each firm i and each candidate deviation price p_hat_i, that the infra-marginal losses firm i would bear on its natural customers (those in Ei who value i most) from lowering price to p_hat_i must be weakly greater than the maximum business-stealing profit available from consumers who prefer other products but have valuation for i above p_hat_i. The condition is easier to satisfy when consumer preferences are more polarized, i.e., when consumers have stronger relative preferences for their most-preferred product. When firms offer homogeneous products the condition fails everywhere and no information structure can generate any producer surplus — Bertrand competition drives all profits to zero under any signal structure.&lt;/p&gt;
&lt;p&gt;Second, the paper characterizes the consumer-optimal information structure, which achieves the maximum possible consumer surplus across all equilibria induced by any information structure. The upper bound on consumer surplus is CS* = (total surplus) minus sum_i Pi*_i, where Pi*_i is the profit firm i can guarantee itself by ignoring the designer&amp;rsquo;s signal and setting the best uniform price assuming all rivals price at zero. This bound is tight: the designer can implement it by publicly partitioning consumers into groups by most-preferred product, inducing rival firms to price at marginal cost (zero) for consumers who prefer another firm&amp;rsquo;s product, and then applying the Bergemann-Brooks-Morris (2015) extremal segmentation within each firm&amp;rsquo;s natural customer set to preserve each firm&amp;rsquo;s guarantee profit while achieving efficiency.&lt;/p&gt;
&lt;p&gt;The illustrative two-firm example shows the quantitative stakes concretely. With no information disclosure, firms charge 4/5 and total producer surplus is about 76% of total surplus S*, consumer surplus is just under 10% of S*, and some consumers are excluded. With full disclosure, producer surplus rises to about 81% of S* and consumer surplus to 19%. The producer-optimal information structure (Case 3) achieves 100% of S* as producer surplus by pooling consumers who prefer different products into the same message submarket, giving each firm an incentive to price for its highest-valuing customers and ignore the others. The consumer-optimal information structure (Case 4) brings producer surplus down to about 57% of S* — its guaranteed lower bound — and delivers roughly 43% of S* to consumers, an outcome unattainable by full disclosure alone.&lt;/p&gt;
&lt;p&gt;Both producer-optimal and consumer-optimal outcomes are efficient: all consumers buy their most-preferred product in both cases. The paper further characterizes the full efficient frontier between consumer- and producer-optimal outcomes, showing that mixing the consumer-optimal and full-information structures (or consumer-optimal, full-information, and producer-optimal structures when the latter is implementable) spans every point on the frontier.&lt;/p&gt;
&lt;p&gt;The model assumes firms will price-discriminate if they can, that the designer has full knowledge of consumer types, and that the game is played once. The core results extend to continuous type distributions as shown in Online Appendix B.2. The analysis is restricted to a monopoly platform; competition among platforms is left for future work.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does the two-benchmark comparison used by antitrust authorities miss important possibilities?&lt;/p&gt;
&lt;p&gt;A: The paper asks what market outcomes — combinations of consumer and producer surplus — an information designer (a platform) can achieve by choosing among all possible information structures, not just the two benchmarks of no-information and full-information. Antitrust analysis that compares only those two cases misses a vast middle ground: an intermediary can package information in ways that, for instance, implement perfect collusion (extracting all surplus as producer surplus) while appearing to use privacy-protective technologies, or can intensify competition well beyond the full-information benchmark to benefit consumers.&lt;/p&gt;
&lt;p&gt;Q: What is the producer-optimal information structure and when does it exist?&lt;/p&gt;
&lt;p&gt;A: A producer-optimal information structure is one that induces an equilibrium in which every consumer buys her most-preferred product at a price exactly equal to her valuation — full surplus extraction. It exists if and only if, for every firm i and every candidate deviation price p_hat_i, the Aggregate Incentive Compatibility (AIC) condition holds: the aggregate infra-marginal losses firm i would suffer on its natural customers Ei from lowering price to p_hat_i must be at least as large as the maximum business-stealing profit from consumers outside Ei who have valuation for i weakly above p_hat_i. This is a condition on the distribution of consumer valuations, not on the information structure per se.&lt;/p&gt;
&lt;p&gt;Q: What is the economic mechanism behind the producer-optimal structure — how does pooling consumers implement full surplus extraction?&lt;/p&gt;
&lt;p&gt;A: The designer assigns consumers who prefer product A to the same message submarket as consumers who prefer another product but have a lower valuation for A. Firm A is then price-recommended its highest-valuing customers&amp;rsquo; willingness to pay. The presence of the &amp;ldquo;outside&amp;rdquo; consumers in the same message makes it unprofitable for firm A to deviate downward to capture them, because the infra-marginal loss on the natural customers exceeds the additional revenue. Simultaneously, the rival firm cannot identify and undercut for A&amp;rsquo;s natural customers because the messages do not allow it to distinguish them. The result is that each firm plays a niche strategy, setting price equal to the valuation of its highest-type natural customers and excluding the others from its offer.&lt;/p&gt;
&lt;p&gt;Q: When does polarization of consumer preferences help achieve the producer-optimal outcome?&lt;/p&gt;
&lt;p&gt;A: Proposition 1 states that if a producer-optimal information structure exists under distribution f, it also exists under any distribution f_tilde that is more polarized than f — where more polarized means the mass of consumers who prefer i and have valuation above any threshold for i increases, and the mass of consumers who prefer j but have valuation above that threshold for i decreases. Intuitively, polarization slackens the Firm IC constraints because it reduces the business-stealing temptation: fewer consumers with high cross-product valuations are available for firm i to capture by undercutting. Concrete continuous-distribution examples include: uniform over the unit square (producer-optimal always exists), Hotelling anti-correlated values (exists everywhere), and truncated normal with mean 1/2 — producer-optimal is feasible for all standard deviations sigma &amp;gt; 0.15.&lt;/p&gt;
&lt;p&gt;Q: Why does the producer-optimal outcome fail entirely when products are homogeneous?&lt;/p&gt;
&lt;p&gt;A: Proposition 2 states that when all consumer types have equal valuations across products (the support of f lies on the diagonal of V^n), then for any information structure and any induced equilibrium, every consumer buys at price zero and all firms earn zero profit. The logic extends the standard Bertrand undercutting argument: with homogeneous products, any positive price a firm charges is undercut by a rival who can always profitably steal demand, and this applies to any posterior distribution induced by any signal realization. Even private signals cannot prevent this outcome because no signal realization can give a firm a non-contestable position.&lt;/p&gt;
&lt;p&gt;Q: How is the consumer-optimal information structure constructed, and what is its key economic logic?&lt;/p&gt;
&lt;p&gt;A: Theorem 2 shows the consumer-optimal structure has three layers. First, consumers are partitioned into n groups by most-preferred product (Ei). Second, firms j not equal to i are induced — by publicly revealing which group a consumer belongs to — to set price zero for consumers outside their group, because competing for those consumers is hopeless when their preferred firm is identified. Third, within each Ei, consumers are further partitioned into submarkets using the Bergemann-Brooks-Morris (2015) extremal segmentation applied to residual valuations (theta_i minus the maximum of competing valuations), ensuring firm i earns exactly its guarantee profit Pi*_i. By holding each firm down to its guarantee profit, the residual goes to consumers, maximizing CS.&lt;/p&gt;
&lt;p&gt;Q: What is the guarantee profit Pi*_i and how does it bound consumer surplus?&lt;/p&gt;
&lt;p&gt;A: Pi*&lt;em&gt;i is the maximum profit firm i can achieve by ignoring all designer signals and setting a single uniform price to all consumers, against the worst-case scenario in which all other firms price at zero. Formally, Pi*&lt;em&gt;i = max&lt;/em&gt;{pi} sum&lt;/em&gt;{theta in Ei: theta_i - pi &amp;gt;= max_{j not equal i} theta_j} pi * f(theta). Since firm i can always achieve Pi*_i regardless of the information structure (by simply ignoring signals), no information structure can push firm i&amp;rsquo;s profit below Pi*_i. The sum of these guarantee profits across all firms provides a lower bound on total producer surplus — and therefore an upper bound on consumer surplus — achievable by any information structure.&lt;/p&gt;
&lt;p&gt;Q: In the two-firm numerical example, what is the quantitative comparison across the four cases?&lt;/p&gt;
&lt;p&gt;A: Total available surplus S* = 0.84. Under no information (Case 1): producer surplus approximately 76% of S*, consumer surplus just under 10% of S*, and consumers of types (3/5, 2/5) and (2/5, 3/5) do not trade. Under full disclosure (Case 2): producer surplus approximately 81% of S*, consumer surplus 19% of S*, efficient. Under the producer-optimal structure (Case 3): producer surplus = 100% of S* (all surplus extracted), consumer surplus = 0%, efficient. Under the consumer-optimal structure (Case 4): producer surplus approximately 57% of S*, consumer surplus approximately 43% of S*, efficient. All cases except Case 1 are efficient; the no-information case excludes some consumers from trading.&lt;/p&gt;
&lt;p&gt;Q: Is the full-information disclosure structure consumer-optimal?&lt;/p&gt;
&lt;p&gt;A: Not in general. Proposition 3 states that full information is consumer-optimal if and only if all consumers in Ei have identical residual valuations (theta_i minus their second-best alternative) — a condition that generically fails. When residual valuations within Ei are heterogeneous, the designer can do strictly better for consumers by applying the extremal segmentation within each Ei rather than revealing full information, which would allow firms to price-discriminate on individual residual valuations and extract more surplus.&lt;/p&gt;
&lt;p&gt;Q: Can the designer trace out the entire efficient frontier between consumer- and producer-optimal outcomes?&lt;/p&gt;
&lt;p&gt;A: Yes, under two conditions. First, by mixing the consumer-optimal structure (point A) with the full-information structure (point B) using fractions lambda and 1-lambda respectively, the designer can implement any point on the efficient frontier between A and B. Second, when the producer-optimal outcome (point C) is also implementable, mixing the full-information structure with the producer-optimal structure by applying them to fractions lambda and 1-lambda of the consumer population respectively spans every point between B and C. The key insight is that the AIC condition, if it holds for f, also holds for any rescaled sub-distribution of f (it is scale-invariant), so the producer-optimal sub-problem remains feasible.&lt;/p&gt;
&lt;p&gt;Q: What are the regulatory implications of the analysis?&lt;/p&gt;
&lt;p&gt;A: The paper identifies a fundamental tension: banning information use sacrifices efficiency (some consumers excluded, wrong products purchased), but unrestricted use permits platforms to implement perfect collusion through information design. Critically, the paper shows that privacy-enhancing technologies that pool consumers into cohorts — like Google&amp;rsquo;s Privacy Sandbox — are equally consistent with the producer-optimal (collusive) and consumer-optimal (competitive) structures; the two differ only in the principle by which consumers are grouped. The paper suggests regulators could mandate that consumers in the same cohort share the same most-preferred product and that information be disclosed symmetrically across firms — the defining features of the consumer-optimal structure. This would block the producer-optimal grouping (which mixes consumers with different most-preferred products) while preserving efficiency.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to and extend Bergemann, Brooks, and Morris (2015)?&lt;/p&gt;
&lt;p&gt;A: Bergemann, Brooks, and Morris (2015) characterize achievable consumer and producer surplus outcomes when a designer discloses information to a single monopolist who can price-discriminate. The present paper extends this to oligopoly, where competition between firms creates both additional constraints (firms may undercut each other) and additional instruments (the designer can play firms against each other). The consumer-optimal construction directly applies the BBM (2015) extremal segmentation within each firm&amp;rsquo;s natural customer set Ei, but the outer layer — using public revelation of group membership to induce rival firms to price at zero — is new and arises specifically from the oligopoly setting.&lt;/p&gt;
&lt;p&gt;Information designer: An entity (modeled as a platform) that observes the full joint distribution of consumer valuations over all products and commits, before firms price, to a mapping from consumer types to joint distributions over messages sent to competing firms; the designer can be interpreted as an internet intermediary choosing how to package and share consumer data.&lt;/p&gt;
&lt;p&gt;Aggregate Incentive Compatibility (AIC): The necessary and sufficient condition on the distribution of consumer valuations for the existence of a producer-optimal information structure; for each firm i and each candidate deviation price p_hat_i, the aggregate infra-marginal losses firm i would incur on its natural customers by lowering price to p_hat_i must weakly exceed the maximum revenue firm i could gain by attracting consumers who prefer rival products but have valuation for i above p_hat_i.&lt;/p&gt;
&lt;p&gt;Producer-optimal information structure: An information structure that induces an equilibrium in which every consumer buys her most-preferred product at a price exactly equal to her full valuation for it, extracting 100% of available surplus as producer surplus — the outcome equivalent to the firms&amp;rsquo; fully collusive joint surplus maximum.&lt;/p&gt;
&lt;p&gt;Consumer-optimal information structure: An information structure that achieves the maximum consumer surplus attainable across all equilibria induced by any information structure, holding each firm to its guarantee profit Pi*_i (the best uniform-price profit the firm can secure by ignoring all signals) and allocating all residual surplus to consumers while maintaining allocative efficiency.&lt;/p&gt;
&lt;p&gt;Guarantee profit (Pi*&lt;em&gt;i): The maximum profit firm i can secure unilaterally by ignoring the designer&amp;rsquo;s signal and setting an optimal uniform price, computed against the worst case in which all rival firms price at zero; it equals max&lt;/em&gt;{pi} times the sum of f(theta) over all types in Ei for which theta_i minus pi exceeds all rival valuations.&lt;/p&gt;
&lt;p&gt;Polarization of preferences: A stochastic dominance condition under which, relative to a baseline distribution, the mass of consumers who prefer product i and have high valuations for it increases while the mass of consumers who prefer rival products but have high valuations for i decreases; higher polarization weakens the Firm IC constraints and makes the producer-optimal outcome easier to implement (Proposition 1).&lt;/p&gt;
&lt;p&gt;Separation and Consistency: Two structural properties any producer-optimal information structure must satisfy: Separation requires that the messages firm i sends to different consumers in Ei who have distinct valuations for i are disjoint in support; Consistency requires that every message firm i can send to any consumer type is contained in the union of messages firm i sends to consumers in Ei, preventing firm i from ever inferring that a consumer prefers a rival&amp;rsquo;s product.&lt;/p&gt;</description></item><item><title>Markov-Perfect Equilibria in Differential Games—With an Application to Climate Policy</title><link>https://macropaperwarehouse.com/papers/markov-perfect-equilibria-in-differential-gameswith-an-application-to-climate-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/markov-perfect-equilibria-in-differential-gameswith-an-application-to-climate-policy/</guid><description>&lt;p&gt;This paper by Jaakkola and Wagener addresses a long-standing open problem in the theory of differential games: how to make Markov-perfect equilibria (MPE) well-defined when best-response policy functions are generically discontinuous in the state variable. The paper&amp;rsquo;s primary contribution is methodological — it introduces discontinuous Markovian strategies into differential games and proves that, under this extension, (i) payoffs can always be computed and (ii) unique best responses exist for almost all strategy profiles of opponents. The authors then apply this framework to derive the entire set of symmetric MPE in a canonical non-cooperative climate mitigation model (van der Ploeg and de Zeeuw, 1992), finding welfare results that are quantitatively large and policy-relevant.&lt;/p&gt;
&lt;p&gt;The technical difficulty the paper resolves is that discontinuous policy functions can cause the ordinary differential equation governing state dynamics to lack classical solutions, making payoffs undefined. Prior literature responded either by restricting strategies to continuous functions — which rules out many natural best responses and imposes an unjustified constraint on the strategy space — or by allowing discontinuities only in &amp;ldquo;admissible&amp;rdquo; profiles, which makes each player&amp;rsquo;s strategy set depend on opponents&amp;rsquo; choices and thus violates the basic structure of non-cooperative game theory. The authors&amp;rsquo; solution is to adopt Filippov solutions (differential inclusions that convexify dynamics at discontinuities), so that a well-defined state trajectory and payoff exist for every strategy profile, not just admissible ones.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s three main theorems cover existence (Theorem 1), characterization (Theorem 2), and symmetric equilibrium conditions (Theorem 3). Theorem 1 establishes that, given any fixed set of potential jump points, the best-response correspondence maps almost all opponent strategy profiles to a unique Markovian best response — &amp;ldquo;almost all&amp;rdquo; in the sense of prevalence on infinite-dimensional function spaces. Theorem 2 provides necessary and sufficient conditions for a strategy to be a best response: it must satisfy the maximum principle where the value function is differentiable, value discontinuities may only occur at jump points of opponents&amp;rsquo; strategies where the player cannot unilaterally push the state back to the low-stock side, and the value at any such interface must exceed the static optimum. Theorem 3 translates these into conditions for symmetric Nash equilibrium.&lt;/p&gt;
&lt;p&gt;Applied to the van der Ploeg–de Zeeuw climate model — N symmetric countries choosing emissions a_i, with carbon stock x evolving as x-dot = sum(a_i) - delta&lt;em&gt;x, and flow utility u(x, a_i) = a_i - (1/2)a_i^2 - dx — the paper characterizes the complete set of symmetric MPE. The unique continuous globally defined equilibrium (the linear MPE, previously established by Rowat 2007) is shown to be weakly Pareto-dominated by every other MPE with a continuous value function. The best equilibria feature discontinuous strategies that act like stock-conditioned trigger strategies: when the carbon stock falls below a target steady state x&lt;/em&gt;, players respond with a discrete upward jump in emissions to rapidly return the economy to x*; when carbon rises above x*, players increase emissions only gradually, creating a threat of drifting to a higher-pollution steady state that disciplines deviations. In a calibrated example with N=10, delta=0.02, rho=0.02, and damage parameter d=0.5, the linear equilibrium steady state is approximately 2.5 times the first-best level, while the best continuous-value MPE steady state is approximately 1.2 times the first-best level. Choosing the best equilibrium rather than the linear equilibrium closes between 50 and 100 percent of the welfare gap to the first-best outcome, depending on initial conditions. The paper also identifies particularly bad equilibria involving value-function discontinuities — coordination failures in which no single country can unilaterally stop the carbon stock from rising past a threshold — that can yield welfare outcomes worse than the linear equilibrium at high carbon levels.&lt;/p&gt;
&lt;p&gt;The scope of the methodological results covers differential games with a single state variable and strategies that are real-analytic except at finitely many points. Extension to multiple state variables is left for future work. The climate application is restricted to the symmetric linear-quadratic van der Ploeg–de Zeeuw framework, chosen to facilitate comparison with prior literature.&lt;/p&gt;
&lt;p&gt;Q: What is the fundamental technical problem with MPE in differential games that this paper resolves?&lt;/p&gt;
&lt;p&gt;A: In differential games with Markovian strategies, best-response policy functions are generically discontinuous in the state variable. Discontinuous right-hand sides in the state dynamics ODE can prevent existence or uniqueness of classical solutions, making payoffs undefined for some strategy profiles. Prior literature either restricted attention to continuous strategies (causing non-existence of best responses to many profiles) or defined &amp;ldquo;admissible&amp;rdquo; strategy sets that depend on opponents&amp;rsquo; choices (violating non-cooperative game theory structure). This paper resolves both problems for the single-state-variable case.&lt;/p&gt;
&lt;p&gt;Q: How does the paper make payoffs well-defined under discontinuous strategies?&lt;/p&gt;
&lt;p&gt;A: The paper adopts Filippov solutions — differential inclusions that replace the dynamics at a discontinuity point with a convex hull of the left and right limits. At a &amp;ldquo;push-push&amp;rdquo; discontinuity (where dynamics push the state toward the jump point from both sides), the Filippov solution remains at the jump point and flow payoffs are a weighted average of left and right actions. This ensures a well-defined trajectory and payoff for every strategy profile, not just &amp;ldquo;admissible&amp;rdquo; ones, restoring the standard non-cooperative game-theoretic structure.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 1 establish, and what does &amp;ldquo;almost all&amp;rdquo; mean in this context?&lt;/p&gt;
&lt;p&gt;A: Theorem 1 establishes that, for any fixed collection of jump points, each player has a unique Markovian best response to almost every profile of opponents&amp;rsquo; strategies. &amp;ldquo;Almost all&amp;rdquo; is in the sense of prevalence on infinite-dimensional function spaces (following Hunt, Sauer, and Yorke 1992): the set of profiles for which a unique best response fails to exist is shy (measure-zero analog in infinite dimensions) and nowhere dense. This resolves the long-standing open problem of making MPE well-founded in differential games.&lt;/p&gt;
&lt;p&gt;Q: What are the necessary and sufficient conditions for a best response given by Theorem 2?&lt;/p&gt;
&lt;p&gt;A: A strategy phi_i is the best response to opponents&amp;rsquo; profile if and only if: (i) at all points where the value function is differentiable, the strategy satisfies the maximum principle; (ii) the value function is decreasing in the state (monotonicity); (iii) value discontinuities may occur only at opponents&amp;rsquo; jump points where player i cannot unilaterally move the state back to the low-stock region; (iv) at any such interface, the value must be at least as large as the static optimum u(x, a_i)/rho; and (v) the value is differentiable at push-push steady states. These conditions extend the standard maximum principle with local requirements that restrict which discontinuities are possible.&lt;/p&gt;
&lt;p&gt;Q: What is the van der Ploeg–de Zeeuw model and why is it used here?&lt;/p&gt;
&lt;p&gt;A: The van der Ploeg–de Zeeuw (1992) model has N symmetric countries choosing emissions a_i, with carbon stock evolving as x-dot = sum(a_i) - delta*x, and flow utility u(x, a_i) = a_i - (1/2)a_i^2 - dx. It is linear-quadratic, so a linear MPE exists and is analytically tractable, and prior literature (Dockner and Long 1993; Rowat 2007; Dockner and Wagener 2014) has studied it extensively. The paper uses it as a benchmark to demonstrate that the new methods yield novel and economically important results for even well-understood models.&lt;/p&gt;
&lt;p&gt;Q: What is the linear equilibrium and why does it produce poor welfare outcomes?&lt;/p&gt;
&lt;p&gt;A: The linear equilibrium phi_L(x) = alpha + beta*x, with beta negative, is the unique continuous globally defined MPE (Rowat 2007). In it, emissions decrease with the carbon stock because each player anticipates that opponents will also reduce emissions when carbon is high. This strategic substitutability creates adverse dynamic free-riding: players try to exploit the fact that high carbon stock will cause opponents to cut back, so each has an incentive to emit more when carbon is low. In the calibrated example, the linear equilibrium steady state is approximately 2.5 times the first-best level.&lt;/p&gt;
&lt;p&gt;Q: What do the best equilibria look like, and why do they achieve high welfare?&lt;/p&gt;
&lt;p&gt;A: The best equilibria feature a target steady state x* near the first-best level and a discontinuous upward jump in emissions when carbon falls slightly below x*. This threat rapidly returns any carbon reduction back to x*, eliminating the strategic incentive to free-ride on others&amp;rsquo; reductions. When carbon rises above x*, emissions increase only slightly, causing the economy to drift slowly toward a higher-pollution steady state — the threat of this bad outcome disciplines overshooting. This mechanism is analogous to a trigger strategy but is conditioned on the stock level rather than on past actions, making it compatible with Markovian strategies.&lt;/p&gt;
&lt;p&gt;Q: How large are the welfare gains from the best equilibrium relative to the linear equilibrium?&lt;/p&gt;
&lt;p&gt;A: In the calibrated example with N=10, delta=0.02, rho=0.02, and d=0.5, the best continuous-value MPE steady state is approximately 1.2 times the first-best level, compared to 2.5 times for the linear equilibrium. Choosing the best equilibrium closes between 50 and 100 percent of the welfare gap between the linear equilibrium and the first-best outcome, depending on initial conditions. The paper characterizes this as a quantitatively large, first-order welfare improvement.&lt;/p&gt;
&lt;p&gt;Q: What are &amp;ldquo;coordination failure&amp;rdquo; equilibria and when do they arise?&lt;/p&gt;
&lt;p&gt;A: Coordination failure equilibria feature discontinuities not only in the strategy (emission rate) but also in the value function itself. They arise when no single country can unilaterally prevent the carbon stock from rising past a threshold — formally, when N * a_max &amp;lt; delta * x at the discontinuity point. In such cases, if opponents are emitting heavily, no individual country can stop atmospheric carbon from rising even if it emits nothing, making heavy emission a best response. All players following this logic simultaneously produce a self-fulfilling collapse to high emissions. At high carbon levels these equilibria can yield welfare outcomes worse than the linear equilibrium.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s main policy implication for climate negotiations?&lt;/p&gt;
&lt;p&gt;A: The paper argues that international climate negotiations should be understood as a coordination problem over which of many MPE is played, rather than as bargaining over a limited cooperative surplus in a dynamic prisoners&amp;rsquo; dilemma. Since the best equilibria are self-enforcing (they are Nash equilibria, not cooperative solutions), they do not require external enforcement. The paper suggests effective agreements may involve threshold-based commitments — sharp decarbonisation if a carbon target is met, but acceptance of a substantially higher stabilisation target (e.g., 2.5 degrees C rather than 2 degrees C) if the first target is missed — to create the discontinuous strategic incentives that support good equilibria.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle the previously identified &amp;ldquo;local MPE&amp;rdquo; that could not be extended to the entire state space?&lt;/p&gt;
&lt;p&gt;A: Prior work (Dockner and Long 1993; Rubio and Casino 2002; Dockner and Wagener 2014) constructed nonlinear equilibria that were only locally defined, and the validity of such equilibria was questioned (Rowat 2007; Bernhard 2024) because they were undefined on the full state space. The present paper&amp;rsquo;s framework allows discontinuous strategies, so these locally defined equilibria can be extended into globally defined, discontinuous MPE. Most previously discovered equilibria are shown to be nested within the larger set of all symmetric MPE identified here.&lt;/p&gt;
&lt;p&gt;Q: What mathematical tools are used to prove the main results?&lt;/p&gt;
&lt;p&gt;A: The proofs rely on the theory of viscosity solutions to Hamilton-Jacobi-Bellman equations (Bardi and Capuzzo-Dolcetta 2008), building on and extending results of Barles, Briani, and Chasseigne (2013, 2014) on optimal control with discontinuous dynamics. A key departure from Barles et al. is that the paper cannot assume controllability of the dynamics near discontinuities without imposing undue restrictions on opponents&amp;rsquo; strategies. The application of these results to a fixed-point condition of the best-response correspondence to construct MPE conditions is described as entirely novel.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions and limitations of the methodological results?&lt;/p&gt;
&lt;p&gt;A: The main results (Theorems 1–3) apply to differential games with a single state variable and strategies that are real-analytic except at finitely many points with one-sided derivatives everywhere. The climate application is further restricted to the symmetric linear-quadratic van der Ploeg–de Zeeuw framework. Extension to multiple state variables is acknowledged as future work. The welfare calibration results are specific to the parameter values N=10, delta=0.02, rho=0.02, d=0.5.&lt;/p&gt;
&lt;p&gt;Markov-perfect equilibrium (MPE): A Nash equilibrium in Markovian strategies, where each player&amp;rsquo;s strategy conditions only on the current state variable and not on the history of play. The paper makes this concept well-founded in differential games by allowing discontinuous strategies, ensuring payoffs can be computed for all strategy profiles and unique best responses exist almost everywhere.&lt;/p&gt;
&lt;p&gt;Filippov solution: A solution concept for ordinary differential equations with discontinuous right-hand sides, which replaces the dynamics at a discontinuity point with a convex hull of the left and right limits. Used in this paper to define well-specified state trajectories and payoffs even when players&amp;rsquo; strategies have jumps, eliminating the need to restrict strategy sets to &amp;ldquo;admissible&amp;rdquo; profiles.&lt;/p&gt;
&lt;p&gt;Discontinuous Markovian strategy: A policy function phi: X -&amp;gt; A that maps the state to an action and is real-analytic except at finitely many points, with well-defined one-sided derivatives everywhere. The key innovation of the paper — allowing such strategies makes differential games well-behaved as standard non-cooperative games while capturing the generically discontinuous nature of optimal policy functions.&lt;/p&gt;
&lt;p&gt;Push-push steady state: A steady state at a discontinuity point of a strategy where the dynamics push the state toward that point from both sides. Under Filippov solutions the state remains at such a point, with flow payoffs being a weighted average of left and right actions. Theorem 2 requires the value function to be differentiable at these points in equilibrium.&lt;/p&gt;
&lt;p&gt;Coordination failure equilibrium: An MPE featuring discontinuities in both the strategy and the value function, arising when no single player can unilaterally move the state across a threshold. At high carbon levels, if opponents emit heavily, individual emission cuts are ineffective; heavy emission becomes a best response for all, sustaining a self-fulfilling high-emission outcome. These equilibria can yield welfare outcomes worse than the linear equilibrium.&lt;/p&gt;
&lt;p&gt;Linear equilibrium: The unique continuous globally defined symmetric MPE in the van der Ploeg–de Zeeuw model, characterized by emissions decreasing linearly in the carbon stock. It involves adverse strategic substitutability — each player reduces emissions in response to high carbon because opponents do likewise — and is weakly Pareto-dominated by every MPE with a continuous value function.&lt;/p&gt;
&lt;p&gt;Skiba point: A state at which the optimal policy is discontinuous because the value function has distinct left and right derivatives, corresponding to the boundary between two basins of attraction with different long-run outcomes. In this paper, the steady state of a best equilibrium is a Skiba-type point: below it, emissions jump up to return rapidly to the target; above it, emissions increase only gradually.&lt;/p&gt;</description></item><item><title>Markups Across Space and Time</title><link>https://macropaperwarehouse.com/papers/markups-across-space-and-time/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/markups-across-space-and-time/</guid><description>&lt;p&gt;Anderson, Rebelo, and Wong study the behavior of markups in the retail sector across regions and over time, using a combination of firm-level Compustat data and product-level scanner data from two large retailers — one operating over 100 stores across U.S. states (quarterly data from 2006 Q1 to 2009 Q3, covering roughly 3.6 million SKU-store pairs across 79 product categories) and one operating hundreds of stores across Canadian provinces (quarterly data from 2016 Q1 to 2018 Q4, covering 15.6 million item-store pairs across 41 product groups). Markups are measured using gross margins — sales minus cost of goods sold as a fraction of sales — computed at the product level using the replacement cost for every item. This measurement approach is appropriate for retail because cost of goods sold accounts for over 80 percent of total retail firm costs, making it a reliable proxy for marginal cost. The replacement cost data, available at the store level, is the cost used by managers in actual pricing decisions, distinguishing these datasets from typical scanner data that contain only average costs.&lt;/p&gt;
&lt;p&gt;The paper documents five main facts. First, markups are remarkably stable over time and display a mild procyclical pattern. At the aggregate level, gross margins are roughly acyclical or mildly procyclical while sales and cost of goods sold are highly procyclical. The elasticity of gross margins with respect to real GDP is statistically insignificant at both the aggregate and firm level. The conditional response of gross margins to high-frequency monetary policy shocks and oil price shocks is also statistically insignificant, while net operating profit margins fall significantly in response to both shocks. Operating profit margins are 3.4 times more volatile than gross margins at a quarterly frequency, and sales and costs are roughly 2.6 times more volatile.&lt;/p&gt;
&lt;p&gt;Second, there is large regional dispersion in gross margins. A variance decomposition shows that the regional variance of gross margins (0.103) is substantially larger than the time-series variance (0.013), with a near-zero covariance between the two components. Third, regions with higher incomes and more expensive houses have higher markups — gross margins are positively correlated with log household income and log median house value in both the U.S. and Canadian data.&lt;/p&gt;
&lt;p&gt;Fourth, these higher regional markups do not result from less intense competition or regional differences in marginal costs. Gross margins are uncorrelated with the Herfindahl index (a measure of competition) and with a rural dummy (a proxy for higher transportation costs). The cyclicality of markups is acyclical or mildly procyclical regardless of whether the underlying product costs are themselves acyclical, procyclical, or countercyclical.&lt;/p&gt;
&lt;p&gt;Fifth, and most distinctively, regional variation in markups arises from differences in assortment composition across regions rather than from deviations from uniform pricing. A decomposition of regional gross margin variance confirms that the dominant component is the term capturing differences in product assortment across markets; the term capturing differences in gross margins for the same item — which would be nonzero under geographic price discrimination — accounts for very little of the regional variation. When the same item is available in different regions, the retailer charges a uniform price, consistent with Della Vigna and Gentzkow (2019).&lt;/p&gt;
&lt;p&gt;To rationalize these five facts, the authors propose a model with non-homothetic, quadratic preferences (following Melitz and Ottaviano 2008). In the model, higher-productivity regions choose higher-quality goods, which have less elastic demand and therefore higher markups. The markup is procyclical with respect to productivity shocks (A) but acyclical with respect to labor supply shocks (N), so a mixture of both types of shocks produces mildly procyclical markups. The model generates uniform pricing across regions for the homogeneous good, with regional markup differences arising through quality and assortment selection rather than price discrimination.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure markups, and why is this approach appropriate for retail?
A: Markups are measured as gross margins — (sales minus cost of goods sold) divided by sales — computed at the product level using the replacement cost for every item. This is appropriate for retail because cost of goods sold is the predominant variable cost, accounting for over 80 percent of total retail firm costs. The replacement cost is the marginal cost concept used by managers in pricing decisions and is available at the store level rather than as a national average.&lt;/p&gt;
&lt;p&gt;Q: What is the cyclical behavior of gross margins at the aggregate retail level?
A: Gross margins are roughly acyclical or mildly procyclical. Sales and cost of goods sold are highly procyclical, suggesting that the business cycle primarily affects quantities sold rather than markups. Operating profit margins are 3.4 times more volatile than gross margins at a quarterly frequency, while sales and costs are roughly 2.6 times more volatile.&lt;/p&gt;
&lt;p&gt;Q: What is the conditional response of gross margins to monetary policy and oil price shocks?
A: The response of gross margins to both high-frequency monetary policy shocks (identified from Federal Funds futures data) and oil price shocks (identified via the Ramey-Vine 2010 VAR approach) is statistically insignificant. In contrast, net operating profit margins fall in a statistically significant manner in response to both types of shocks, indicating that fixed cost absorption rather than markup adjustment drives profit volatility.&lt;/p&gt;
&lt;p&gt;Q: How large is the regional dispersion in gross margins relative to their time-series variation?
A: The variance decomposition shows that the regional variance of gross margins is 0.103, compared to a time-series variance of only 0.013, with a covariance term close to zero. The vast majority of gross margin variation is therefore cross-sectional rather than time-series.&lt;/p&gt;
&lt;p&gt;Q: What variables explain the regional variation in gross margins?
A: In the U.S. data, gross margins are positively correlated with log household income and log median house value. Gross margins are uncorrelated with the Herfindahl index (a competition measure) and with the rural county dummy (a transportation cost proxy). Canadian data confirms the positive correlation between gross margins and both log household income and log median house value.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism through which higher-income regions have higher markups?
A: Regional markup differences are driven by assortment composition differences, not price discrimination. When the same item is sold in multiple regions, it sells at a uniform price. Higher-income regions carry different (higher-quality, higher-margin) products. The correlation between unique items sold and regional household income is 0.42 for the Canadian retailer and 0.17 for the U.S. retailer.&lt;/p&gt;
&lt;p&gt;Q: How is the variance of regional gross margins decomposed into assortment versus pricing components?
A: The variance decomposition separates total regional gross margin variance into: (1) a term for differences in gross margins for the same item across regions (would be nonzero with geographic price discrimination), (2) a term for differences in assortment composition holding gross margins fixed, and (3) an interaction term plus covariance terms. The dominant term is the assortment composition component; the same-item price difference term accounts for very little of the regional variation.&lt;/p&gt;
&lt;p&gt;Q: Does the acyclicality of gross margins hold for products with procyclical costs?
A: Yes. The authors divide products into those with acyclical, procyclical, and countercyclical costs and show (Table 7) that gross margins are acyclical or mildly procyclical for all three groups in both the U.S. and Canadian data. This implies that retailer pricing behavior contributes to price inertia even for products whose wholesale costs move with the cycle.&lt;/p&gt;
&lt;p&gt;Q: What fraction of gross margin changes are active versus passive?
A: In the U.S. data, 91 percent of margin changes are active (resulting from price changes, regardless of whether replacement cost has changed); 9 percent are passive (replacement cost changes with no price change). In the Canadian data, 93 percent of changes are active. Both the probability of active margin changes and the size of margin changes are acyclical with respect to unemployment and local house prices.&lt;/p&gt;
&lt;p&gt;Q: How does the Hall approach compare to gross-margin-based markup estimates?
A: When the Hall approach is implemented using output elasticities (deflating sales by a product-level price deflator to obtain quantity), the resulting markup estimates are very close to those from gross margins — the ratio is 1.014 for the U.S. firm and 0.991 for the Canadian firm. However, when revenue elasticities are used instead of output elasticities (the common practice in the literature due to data limitations), the implied markup is 14 percent lower for the U.S. firm and 13 percent lower for the Canadian firm, confirming the bias documented by Bond et al. (2020).&lt;/p&gt;
&lt;p&gt;Q: What are the key features of the theoretical model and what facts does it explain?
A: The model uses non-homothetic quadratic preferences (Melitz-Ottaviano form) in which demand elasticity falls as consumption quality rises. Higher-productivity regions optimally consume higher-quality varieties, which face less elastic demand and hence carry higher markups. The markup is procyclical in productivity (A) with an elasticity less than one (incomplete cost passthrough) and acyclical in labor supply (N), so a mixture of shocks generates mild procyclicality. Uniform pricing across regions for the homogeneous good holds by construction, and regional markup differences arise through quality-assortment selection.&lt;/p&gt;
&lt;p&gt;Q: Which existing macroeconomic models are consistent with the time-series evidence, and which are not?
A: The evidence is inconsistent with models featuring countercyclical markups (Rotemberg-Woodford 1992 imperfect competition, Ravn-Schmitt-Grohe-Uribe deep habits, Jaimovich-Floetotto entry-exit, and standard New Keynesian models with sticky prices and procyclical marginal costs). The time-series evidence is consistent with models featuring sticky retail prices and acyclical marginal costs (Nakamura-Steinsson 2010, Coibion-Gorodnichenko-Hong 2015) and models with price and wage rigidities at the manufacturing level (Erceg-Henderson-Levin 2000, Christiano-Eichenbaum-Evans 2005). Mildly procyclical search models (Alessandria 2009) are also consistent when procyclicality is mild.&lt;/p&gt;
&lt;p&gt;Q: Which existing trade and regional models are consistent or inconsistent with the regional evidence?
A: The spatial price discrimination models of Greenhut-Greenhut (1975) and Thisse-Vives (1988), which predict higher markups in less competitive regions, are inconsistent with the data. The Bertoletti-Etro (2017) non-homothetic model predicts that regional markup variation is driven by deviations from uniform pricing, which is also inconsistent. The Fajgelbaum-Grossman-Helpman (2011) model predicts countercyclical markups when costs are procyclical, contradicting the time-series results. Most existing macroeconomic models rely on homothetic preferences, predicting markups independent of regional income, inconsistent with the regional facts.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions on the measurement approach?
A: Gross margins are valid proxies for markups only in the retail sector, where cost of goods sold is the dominant variable cost (over 80 percent of total costs). In manufacturing, where labor and other costs represent a larger fraction of total variable costs, gross margins would not be a reliable markup measure. The product-level scanner data cover the 2006-2009 period for the U.S. and 2016-2018 for Canada; the U.S. sample includes a recession while the Canadian sample covers a moderate expansion.&lt;/p&gt;
&lt;p&gt;Gross margin as markup proxy: The ratio of (sales minus cost of goods sold) to sales, computed at the product level using the replacement cost for each item at each store and time period. Used as a proxy for the price-cost markup because cost of goods sold is the dominant variable cost in retail (over 80 percent of total costs), and the replacement cost is the marginal cost concept managers use in pricing decisions.&lt;/p&gt;
&lt;p&gt;Replacement cost: The cost at which the retailer would replenish a unit of inventory at current prices, available at the store level in the scanner datasets. Distinct from average historical cost and used here as a direct proxy for marginal cost, eliminating one of the main sources of markup mismeasurement in prior empirical work.&lt;/p&gt;
&lt;p&gt;Assortment composition: The set of products stocked and the expenditure weights of those products within a region. The paper&amp;rsquo;s central mechanism for regional markup variation — higher-income regions carry different (higher-quality, higher-margin) goods rather than charging different prices for the same goods.&lt;/p&gt;
&lt;p&gt;Uniform pricing: The practice of charging identical prices for the same item across different geographic regions. Confirmed empirically in both the U.S. and Canadian scanner datasets, and embedded structurally in the theoretical model for the homogeneous good.&lt;/p&gt;
&lt;p&gt;Active versus passive margin changes: A decomposition of gross margin changes into active changes (arising from retailer price decisions, irrespective of cost changes) and passive changes (arising when replacement cost changes but the retailer holds price fixed). Ninety-one percent of U.S. margin changes and 93 percent of Canadian changes are active.&lt;/p&gt;
&lt;p&gt;Non-homothetic quadratic preferences: The utility specification (following Melitz and Ottaviano 2008) in which the absolute value of the own-price demand elasticity falls as quality consumption rises. This property implies that higher-quality goods carry higher markups and that richer regions, which demand higher quality, have higher average markups — the key mechanism linking income to markups in the model.&lt;/p&gt;
&lt;p&gt;Hall approach to markup estimation: A production-function-based method in which the markup equals the output elasticity with respect to a variable input divided by that input&amp;rsquo;s cost share in revenue. The paper shows this yields estimates close to gross-margin estimates when implemented with true output quantities, but produces markups roughly 13-14 percent lower when revenue is substituted for output (a common approximation), confirming the Bond et al. 2020 bias.&lt;/p&gt;</description></item><item><title>Markups: A Search-Theoretic Perspective</title><link>https://macropaperwarehouse.com/papers/markups-a-search-theoretic-perspective/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/markups-a-search-theoretic-perspective/</guid><description>&lt;h2 id="what-this-paper-finds--and-why-it-matters"&gt;What this paper finds — and why it matters&lt;/h2&gt;
&lt;p&gt;Across macroeconomics, market power is almost always modelled with the Dixit–Stiglitz (1977) monopolistic-competition framework, in which a seller&amp;rsquo;s markup is pinned down by how substitutable buyers perceive its variety to be. This paper instead derives a closed-form formula for the equilibrium distribution of markups in the &lt;strong&gt;search-theoretic&lt;/strong&gt; model of imperfect competition of Butters (1977), Varian (1980) and Burdett–Judd (1983), where a seller has market power not because its good lacks substitutes but because search and information frictions leave some buyers unable to reach the cheapest seller. In this model markups are strictly positive even though all sellers&amp;rsquo; varieties are &lt;em&gt;perfect&lt;/em&gt; substitutes, are dispersed even when all sellers operate the &lt;em&gt;same&lt;/em&gt; technology, and — once sellers differ in marginal cost — can be increasing, decreasing, or constant in a seller&amp;rsquo;s size; yet the equilibrium is efficient. Menzio proves an &amp;ldquo;anything-goes&amp;rdquo; result: any twice-differentiable markup function can arise as an equilibrium for an appropriate choice of parameters, so a Dixit–Stiglitz model can always reproduce the search model&amp;rsquo;s markups — but only with reduced-form buyer preferences that depend on the search model&amp;rsquo;s deep parameters and are therefore unstable to policy changes (a Lucas-critique problem), and that would (incorrectly) read those markups as symptoms of inefficiency and a case for corrective subsidies. The paper&amp;rsquo;s central and deliberately modest claim is a cautionary one for macroeconomics: because two well-established models can both match observed markups yet imply opposite conclusions about welfare, optimal policy, and counterfactuals, markup data &lt;em&gt;alone&lt;/em&gt; cannot identify the macroeconomic consequences of market power — one also needs evidence on the &lt;em&gt;origin&lt;/em&gt; of that market power. The results are theoretical (unit demand, constant returns to scale, a Poisson contact process); the sharp comparative statics are derived for a log-uniform cost distribution, and the same logic extends to labor-market &lt;em&gt;markdowns&lt;/em&gt; in the Burdett–Mortensen (1998) model.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-two-theories-of-market-power-does-the-paper-compare-and-how-do-they-differ-at-root"&gt;Q1. What two theories of market power does the paper compare, and how do they differ at root?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper contrasts the Dixit–Stiglitz (1977) monopolistic-competition framework, in which market power comes from product differentiation, with the search-theoretic framework of Butters (1977), Varian (1980) and Burdett–Judd (1983), in which market power comes from buyers&amp;rsquo; limited choice sets.&lt;/strong&gt; In Dixit–Stiglitz, &amp;ldquo;every seller is a monopolist of its own product variety,&amp;rdquo; and the size of markups &amp;ldquo;is determined by the substitutability of different varieties in the buyers&amp;rsquo; utility function.&amp;rdquo; In the search-theoretic framework, by contrast, &amp;ldquo;a seller has market power not because it carries a good that has no perfect substitutes, but because (some) buyers do not have every seller in their choice set due to informational frictions … or physical frictions,&amp;rdquo; so markups are instead &amp;ldquo;determined by the distribution of the size of buyers&amp;rsquo; choice sets.&amp;rdquo; Menzio motivates the second view with retail examples (e.g., the same bottle of Heinz ketchup sold at many stores at different markups), where it strains credulity that buyers see one store&amp;rsquo;s bottle as a poor substitute for the identical bottle elsewhere.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-equilibrium-markup-formula-when-all-sellers-are-identical"&gt;Q2. What is the equilibrium markup formula when all sellers are identical?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;With homogeneous sellers, a seller at quantile x of the price distribution charges a gross markup μ(x) = 1 + (u/c − 1)·e^(−λ(1−x)), the product of a monopoly markup and a rank-dependent discount factor.&lt;/strong&gt; Here u is the buyer&amp;rsquo;s valuation, c the common marginal cost, and λ the Poisson coefficient for the number of sellers a buyer contacts — &amp;ldquo;the average number of sellers with which a buyer is in contact, and, in this sense, … a measure of the extent of competition in the market.&amp;rdquo; The term u/c − 1 is &amp;ldquo;the net markup for a monopolist.&amp;rdquo; The discount factor e^(−λ(1−x)) &amp;ldquo;is equal to 1 for the seller at the top of the price distribution&amp;rdquo; (no discounting) and falls to its minimum e^(−λ) for the seller at the bottom; a higher λ makes markups decline more steeply down the price ranking. The equilibrium price distribution and its support are derived in closed form (F(p) and the lowest price p_ℓ = c + e^(−λ)(u − c)), and the equilibrium is shown to exist, be unique, and be efficient (Proposition 1).&lt;/p&gt;
&lt;h3 id="q3-why-are-markups-positive-and-dispersed-even-when-goods-are-perfect-substitutes-and-technology-is-identical"&gt;Q3. Why are markups positive and dispersed even when goods are perfect substitutes and technology is identical?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Markups are positive because search frictions leave some buyers &amp;ldquo;captive&amp;rdquo; — in contact with only one seller — which forces equilibrium profits, and hence prices, strictly above marginal cost; markups are dispersed for the same reason there is price dispersion in these models — non-captive buyers prevent any mass point in the price distribution.&lt;/strong&gt; As Menzio puts it, &amp;ldquo;sellers meet a positive measure of buyers that are captive, in the sense that these buyers cannot purchase from any other seller,&amp;rdquo; so &amp;ldquo;prices must be strictly above marginal cost&amp;rdquo;; simultaneously, the positive measure of non-captive buyers &amp;ldquo;implies that the price distribution cannot have any mass points above marginal cost.&amp;rdquo; The two facts together require sellers to post different prices and therefore charge different markups, despite identical goods and identical technology.&lt;/p&gt;
&lt;h3 id="q4-in-the-homogeneous-seller-case-how-do-markups-relate-to-a-sellers-price-and-size"&gt;Q4. In the homogeneous-seller case, how do markups relate to a seller&amp;rsquo;s price and size?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;With identical sellers, markups are increasing in a seller&amp;rsquo;s price and decreasing in a seller&amp;rsquo;s size.&lt;/strong&gt; Because μ(x) and the posted price p(x) both rise with rank x while quantity sold q(x) = bλ·e(−λx) falls with x, &amp;ldquo;markups are increasing in the seller&amp;rsquo;s price&amp;rdquo; and &amp;ldquo;decreasing in the seller&amp;rsquo;s size.&amp;rdquo; Menzio notes this is the opposite of &amp;ldquo;Marshall&amp;rsquo;s second law of demand,&amp;rdquo; and that it implies larger sellers face a higher elasticity of demand. He stresses this counterfactual pattern (empirically, larger firms tend to charge &lt;em&gt;higher&lt;/em&gt; markups) is exactly why the paper goes on to add cost heterogeneity.&lt;/p&gt;
&lt;h3 id="q5-what-changes-when-sellers-differ-in-marginal-cost"&gt;Q5. What changes when sellers differ in marginal cost?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;With heterogeneous marginal costs, the markup formula gains an extra term reflecting that higher-ranked (higher-cost) firms put less competitive pressure on a seller, and equilibrium markups need no longer be decreasing in size — they can be increasing, decreasing, or hump-shaped.&lt;/strong&gt; A seller&amp;rsquo;s price is a strictly increasing function of its cost (Lemma 3), so its rank in the price distribution equals its rank in the cost distribution. The generalized markup (eq. 3.22) adds, to the monopoly-times-discount term, &amp;ldquo;the additional markup that the seller can charge because the firms ranked above it in the price distribution produce at higher marginal cost,&amp;rdquo; with the excess cost of nearer-ranked firms weighted more heavily. Using a phase-diagram (nullcline) analysis, Menzio shows the markup function μ(x) can be strictly increasing, strictly decreasing, or hump-shaped in rank depending on parameters. The heterogeneous-cost equilibrium is again shown to exist, be unique, and be efficient (Proposition 2).&lt;/p&gt;
&lt;h3 id="q6-what-is-the-anything-goes-theorem-and-why-does-it-matter"&gt;Q6. What is the &amp;ldquo;anything-goes&amp;rdquo; theorem, and why does it matter?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Menzio proves (Theorem 3) that any twice-continuously-differentiable markup function μ&lt;/em&gt;(x) &amp;gt; 1 can be generated as an equilibrium of the search-theoretic model, given an appropriate contact intensity λ and cost distribution c(x).&lt;/em&gt;* Concretely, for any target markup schedule there is a λ and a quantile cost function c(x) (given in closed form) that deliver it as the equilibrium outcome. The consequence is sharp: &amp;ldquo;the search-theoretic model of market power can rationalize any pattern of markups observed in the data,&amp;rdquo; so &amp;ldquo;markup data cannot be used to reject the search-theoretic model.&amp;rdquo; Combined with the fact that the Dixit–Stiglitz model can reproduce the same markups, both theories are consistent with any markup evidence — which is the crux of the paper&amp;rsquo;s identification argument.&lt;/p&gt;
&lt;h3 id="q7-can-a-dixitstiglitz-model-reproduce-these-markups-and-at-what-cost"&gt;Q7. Can a Dixit–Stiglitz model reproduce these markups, and at what cost?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes — a Dixit–Stiglitz model can always reproduce the search model&amp;rsquo;s markups, but only with reduced-form buyer preferences that depend on the search model&amp;rsquo;s deep parameters (λ, u, c, b) and are therefore unstable.&lt;/strong&gt; Menzio constructs the buyer utility function v(q) (its marginal utility solves a differential equation, eq. 2.24) that makes a Dixit–Stiglitz seller choose the same markups and quantities as in the search model. That reduced-form utility has v&amp;rsquo;(q) decreasing (so varieties look like imperfect substitutes, rationalizing positive markups) and an elasticity of demand that rises with q (rationalizing markups that fall with size). Critically, &amp;ldquo;the reduced-form utility function depends on the parameters of the search-theoretic model&amp;rdquo; and so &amp;ldquo;is unstable, in the sense that changes in the environment and counterfactual experiments lead to changes in the reduced-form utility function&amp;rdquo; — meaning any policy or counterfactual exercise that holds these preferences fixed &amp;ldquo;would not produce valid predictions,&amp;rdquo; i.e., is subject to the Lucas critique.&lt;/p&gt;
&lt;h3 id="q8-why-would-reading-these-markups-through-the-dixitstiglitz-lens-give-the-wrong-welfare-and-policy-conclusions"&gt;Q8. Why would reading these markups through the Dixit–Stiglitz lens give the wrong welfare and policy conclusions?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Because in Dixit–Stiglitz positive and heterogeneous markups signal inefficiency and call for subsidies, whereas the search-theoretic equilibrium that generated those very markups is efficient.&lt;/strong&gt; Through the Dixit–Stiglitz lens, positive net markups imply &amp;ldquo;sellers produce an inefficiently small quantity,&amp;rdquo; and heterogeneous markups imply misallocation across sellers, leading an analyst to &amp;ldquo;recommend the introduction of consumption subsidies&amp;rdquo; and &amp;ldquo;finely-tuned production subsidies that reallocate inputs and consumption from low to high-markup sellers.&amp;rdquo; &amp;ldquo;None of these welfare and policy implications are, however, correct, since the equilibrium of the search-theoretic model … is efficient.&amp;rdquo; The root of the error is the demand curve&amp;rsquo;s interpretation: the quantity q(p) − q(c) a seller does not sell is, in Dixit–Stiglitz, lost gains from trade (an inefficiency), but in the search model it is &amp;ldquo;equally valuable trades that the buyers make with other sellers,&amp;rdquo; and so is not an inefficiency.&lt;/p&gt;
&lt;h3 id="q9-what-determines-the-level-and-shape-of-the-markup-distribution"&gt;Q9. What determines the level and shape of the markup distribution?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;For a log-uniform cost distribution (Theorem 4), markups decrease with the extent of competition λ, increase with the buyers&amp;rsquo; valuation u, decrease with the highest marginal cost c_h, and increase with the rate κ at which marginal costs decline across sellers; the sign of the markup–size relationship flips at parameter thresholds.&lt;/strong&gt; Specifically, the markup function is strictly decreasing in rank x (markups rising with size) when competition is weak (λ below a cutoff λ*), constant when λ = λ*, and strictly increasing in x (markups falling with size) when λ &amp;gt; λ*; analogous thresholds u* and κ* govern the slope&amp;rsquo;s sign as u and κ vary. The intuition: when λ is low, sellers rarely compete for the same buyers and low-cost sellers face little pressure, so markups are high and higher for low-cost (large) sellers; when λ is high, low-cost sellers are pushed toward marginal-cost pricing while high-cost sellers — facing no pressure from above — retain markups near u/c_h. Menzio notes the monotone-level results (markups decreasing in λ and c_h, increasing in u and in κ(x) = c&amp;rsquo;(x)/c(x)) generalize beyond the log-uniform family to arbitrary cost distributions, while the slope-sign results are stated for the log-uniform case.&lt;/p&gt;
&lt;h3 id="q10-what-is-the-bottom-line-claim-for-macroeconomics"&gt;Q10. What is the bottom-line claim for macroeconomics?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Markup data alone are insufficient to draw conclusions about the welfare, policy, and counterfactual consequences of market power; identifying those consequences requires evidence on the &lt;em&gt;source&lt;/em&gt; of market power — product differentiation versus search/information frictions.&lt;/strong&gt; The paper frames this as &amp;ldquo;a cautionary note to the macroeconomic literature that uses the Dixit–Stiglitz framework to model market power and markups&amp;rdquo; — a literature spanning monetary policy (e.g., Blanchard–Kiyotaki 1985; Christiano, Eichenbaum and Evans 2005; Golosov and Lucas 2007), misallocation and aggregate TFP (Hsieh and Klenow 2009), and the gains from trade (Krugman; Melitz 2003). In Dixit–Stiglitz estimations, markup heterogeneity is &amp;ldquo;quantitatively important&amp;rdquo; for the welfare cost of inflation in sticky-price models (Galí 1995), the gains from trade (Dhingra and Morrow 2019), and the cost of market power (Boar and Midrigan 2024); Menzio&amp;rsquo;s point is that &amp;ldquo;neither the level nor the dispersion of markups observed in the data are necessarily symptomatic of any inefficiency.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q11-does-the-paper-claim-the-search-theoretic-model-is-the-correct-one"&gt;Q11. Does the paper claim the search-theoretic model is the correct one?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;No — the paper explicitly does not argue that the search-theoretic model is closer to the truth than monopolistic competition; it makes the &amp;ldquo;more modest, but not unimportant&amp;rdquo; claim that two sensible, well-established models fit the same markup data yet imply very different welfare, policy, and counterfactual conclusions.&lt;/strong&gt; Menzio notes both theories &amp;ldquo;are likely to be overly simplified descriptions of the world,&amp;rdquo; and that the existence of still other models generating the same markups &amp;ldquo;only strengthens&amp;rdquo; the point. The constructive takeaway he poses is an empirical identification question: &amp;ldquo;How much of the downward sloping demand curve facing a seller is due to the heterogeneity in buyer&amp;rsquo;s outside options and how much is it due to preferences?&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q12-does-the-argument-extend-beyond-product-markets"&gt;Q12. Does the argument extend beyond product markets?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes — the same logic applies to the labor market: in the Burdett–Mortensen (1998) search model one can derive a closed-form formula for equilibrium &lt;em&gt;markdowns&lt;/em&gt; that are positive even when employers are perfect substitutes to workers, are heterogeneous even with identical technology, and may be increasing, decreasing, or constant in firm size, with the equilibrium again efficient.&lt;/strong&gt; Menzio concludes that &amp;ldquo;the same caution that I recommend using when interpreting markups should be applied to the interpretation of markdown data.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q13-what-are-the-scope-conditions-and-what-does-the-paper-not-do"&gt;Q13. What are the scope conditions, and what does the paper not do?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The results are theoretical, derived under unit buyer demand, constant returns to scale, and a Poisson process for the number of sellers each buyer contacts; the closed-form comparative statics of Theorem 4 assume a log-uniform marginal-cost distribution; and the paper offers no empirical calibration or estimation.&lt;/strong&gt; Menzio notes the efficiency result depends on the model&amp;rsquo;s assumptions — relaxing unit demand or adding externalities could make the equilibrium inefficient — but argues this does not weaken the core identification point. A companion paper (Menzio 2024b, NBER WP 33253) shows the efficiency of the search-theoretic equilibrium extends to a general-equilibrium setting with endogenous firm entry. The paper&amp;rsquo;s contribution is an analytical characterization and a cautionary/identification argument, not a quantitative welfare estimate.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Search-theoretic model of imperfect competition&lt;/strong&gt; : The Butters (1977)/Varian (1980)/Burdett–Judd (1983) framework in which sellers carry identical (perfectly substitutable) goods, and market power arises because buyers contact only a random subset of sellers — so some buyers are &amp;ldquo;captive&amp;rdquo; to a single seller. Markups are determined by the distribution of buyers&amp;rsquo; choice-set sizes, not by preferences over differentiated varieties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dixit–Stiglitz monopolistic competition&lt;/strong&gt; : Any model in which each seller is the sole producer (monopolist) of its own variety, sets its price, and is too small to affect the aggregate; the size of markups is governed by the substitutability of varieties in buyers&amp;rsquo; utility (CES, VES, translog, or Kimball preferences all qualify in the paper&amp;rsquo;s usage).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gross / net markup&lt;/strong&gt; : The gross markup μ is the ratio of a seller&amp;rsquo;s posted price to its marginal cost (p/c); the net markup is μ − 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Captive vs. non-captive buyers&lt;/strong&gt; : A captive buyer is in contact with only one seller and so cannot shop around (the source of strictly positive markups); a non-captive buyer is in contact with several sellers and buys from the cheapest (the source of price dispersion and the absence of mass points in the price distribution).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;λ (extent of competition)&lt;/strong&gt; : The coefficient of the Poisson distribution governing how many sellers a buyer contacts — equivalently the average number of contacts per buyer; higher λ means more competition and lower markups.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reduced-form preferences / Lucas critique&lt;/strong&gt; : The buyer utility function a Dixit–Stiglitz modeller would infer to rationalize the search model&amp;rsquo;s markups; because it depends on the search model&amp;rsquo;s deep parameters (λ, u, c, b), it shifts whenever the environment or policy changes, so counterfactuals computed holding it fixed are invalid.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency (of the equilibrium)&lt;/strong&gt; : The search-theoretic equilibrium maximizes the sum of buyer and seller payoffs — every contacted buyer buys (since valuation u exceeds cost c) and, with heterogeneous costs, buys from the lowest-cost contacted seller — so the positive, dispersed markups are &lt;em&gt;not&lt;/em&gt; symptoms of any inefficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markdown&lt;/strong&gt; : The labor-market analogue of a markup — the gap between a worker&amp;rsquo;s marginal product and the wage — which in the Burdett–Mortensen (1998) search model has the same qualitative properties (positive, heterogeneous, size-dependent, efficient) as product-market markups here.&lt;/p&gt;</description></item><item><title>Marriage, Fertility, and Cultural Integration in Italy</title><link>https://macropaperwarehouse.com/papers/marriage-fertility-and-cultural-integration-in-italy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/marriage-fertility-and-cultural-integration-in-italy/</guid><description>&lt;p&gt;Bisin and Tura study the cultural integration of immigrants in Italy by estimating a structural model of marital matching embedded with intra-household decisions — fertility, socialization of children, and divorce — along cultural-ethnic lines. The central research question is how to decompose the demand for integration (from immigrants) and the supply of cultural acceptance (from natives) in explaining the pace and heterogeneity of cultural convergence.&lt;/p&gt;
&lt;p&gt;The empirical analysis exploits administrative individual-level data from ISTAT&amp;rsquo;s ADELE Laboratory covering the universe of marriages formed in Italy from 1995 to 2012 and the universe of births and separations over the same period. After matching marriage, birth, and separation records, the final sample comprises more than 4 million marriages, representing 92.6% of all marriages celebrated in Italy over the period. Seven cultural-ethnic groups are studied: Italian (majority), Europe-EU15, Other Europe, North Africa–Middle East, Sub-Saharan Africa, East Asia, and Latin America. The model is a transferable-utility (TU) frictionless marriage market in which the joint marital surplus depends on a systematic component — itself the outcome of a collective household decision problem — and an idiosyncratic component capturing unobserved individual heterogeneity (following Choo and Siow, 2006). Parameters are estimated via method of moments, with identification drawing on cross-sectional variation across ethnic-group pairings and across Italy&amp;rsquo;s 20 administrative regions. Cultural socialization is proxied by language transmission (whether Italian is spoken at home with children).&lt;/p&gt;
&lt;p&gt;The data confirm strong positive assortative mating along cultural-ethnic lines, with particularly high homogamy rates for Sub-Saharan African and East Asian minorities. Homogamous minority households show notably lower rates of Italian-language use at home — for East Asian parents, 20% in a homogamous marriage versus 92% in a heterogamous marriage. Heterogamous marriages have higher separation rates (7.5% for mixed families with at least one Italian spouse versus 6.4% for homogamous Italian couples) and lower fertility.&lt;/p&gt;
&lt;p&gt;The estimated cultural intolerance parameters — measuring the psychological value a parent places on socializing a child to his/her own ethnic identity relative to a child acquiring a different identity — are strictly positive, asymmetric across directions, and highly heterogeneous across groups. North Africa–Middle East immigrants exhibit the highest minority intolerance (estimated at 97.85), more than six times that of Europe-EU15 immigrants (6.69). Latin America (93.13), Sub-Saharan Africa (87.08), and East Asia (81.22) also show high intolerance. On the native side, Italian intolerance is highest toward Sub-Saharan African immigrants (78.23) and lowest toward Europe-EU15 immigrants.&lt;/p&gt;
&lt;p&gt;Long-run simulations over successive generations show that all minorities eventually converge to the Italian majority along the language dimension, but at heterogeneous rates. Seventy-five percent of second-generation immigrants speak Italian at home with their children (one-generation integration rate). Europe-EU15 and Other Europe minorities converge almost completely within a single generation. Latin America shows the slowest path, with only 70% integration after four generations. East Asia and Sub-Saharan Africa also integrate more slowly, driven respectively by high fertility rates and strong selection into homogamous marriages.&lt;/p&gt;
&lt;p&gt;A counterintuitive counterfactual result is central to the paper: if Italian cultural intolerance were reduced to zero (full acceptance), cultural integration of minorities would slow by 15 percentage points over a generation (from 93% to 78% by the third generation). The mechanism is that greater native acceptance enables immigrants to sustain their own language even within heterogamous (mixed) marriages, increasing demand for such marriages and raising minority fertility, thereby preserving cultural distinctiveness.&lt;/p&gt;
&lt;p&gt;Finally, doubling immigration inflows while holding population shares constant reduces third-generation integration from 93% to 86% (a 7-percentage-point reduction). Effects are concentrated among Sub-Saharan African (20-percentage-point reduction) and East Asian (6-percentage-point reduction) minorities, with little impact on European and North African minorities. When inflows are reweighted toward Sub-Saharan African and East Asian groups, integration losses for those minorities range from 20 to 60 percentage points by the third generation.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s core methodological contribution?
A: The paper embeds a collective household decision problem — covering fertility, socialization, and divorce — within a transferable-utility frictionless marriage matching framework. This allows marital utility to emerge endogenously from intra-household decisions rather than being specified exogenously. The key innovation is that socialization incentives and technologies differ systematically between homogamous and heterogamous marriages, and these differences feed back into marital matching and long-run cultural dynamics.&lt;/p&gt;
&lt;p&gt;Q: What does &amp;ldquo;cultural intolerance&amp;rdquo; mean in this model, and how is it identified?
A: Cultural intolerance is the psychological value a parent obtains from socializing a child to his/her own ethnic identity, relative to having a child adopt a different cultural-ethnic identity. It is the main parameter driving socialization effort and resistance to cultural integration. Identification relies on two sources of cross-sectional variation: differences in matching patterns, fertility, separation, and socialization rates across cultural-ethnic group pairings, and exogenous variation in the ethnic composition of the regional population across Italy&amp;rsquo;s 20 administrative regions.&lt;/p&gt;
&lt;p&gt;Q: How heterogeneous are the estimated cultural intolerance parameters across minority groups?
A: The parameters are highly heterogeneous. North Africa–Middle East immigrants have the highest estimated minority intolerance (97.85), more than six times the EU15 estimate (6.69). Latin America (93.13), Sub-Saharan Africa (87.08), and East Asia (81.22) are also substantially higher than EU15. The matrix is asymmetric: Italian intolerance toward Sub-Saharan Africans (78.23) is higher than toward North Africans (67.88), even though those two groups show comparable minority intolerance levels.&lt;/p&gt;
&lt;p&gt;Q: What are the three mechanisms beyond intolerance parameters that explain heterogeneous integration dynamics?
A: First, selection into homogamous marriages: Sub-Saharan Africa&amp;rsquo;s particularly strong selection into homogamy gives those households access to superior coordinated socialization technology, sustaining cultural heterogeneity despite similar intolerance levels to other groups. Second, fertility rates: East Asian minorities have particularly high estimated fertility, which amplifies the transmission of their cultural identity across generations. Third, socialization effectiveness in heterogamous marriages: Latin American immigrants are uniquely able to socialize children to their own language even when married to native Italians, making their integration the slowest despite being in many mixed marriages.&lt;/p&gt;
&lt;p&gt;Q: What is the counterintuitive result about Italian cultural intolerance and integration speed?
A: Lowering Italian cultural intolerance to zero would reduce minority integration by 15 percentage points over one generation, with third-generation integration falling from 93% to 78%. The intuition is that higher native acceptance enables immigrants to maintain their own language more effectively within heterogamous marriages, which in turn increases immigrant demand for intermarriage with natives and raises minority fertility — both of which slow cultural convergence rather than accelerating it.&lt;/p&gt;
&lt;p&gt;Q: How do divorce dynamics differ between homogamous and heterogamous households?
A: Heterogamous households exhibit higher separation rates than culturally homogeneous unions: 7.5% for mixed families with at least one Italian spouse versus 6.4% for homogamous Italian couples. In the model, divorce by heterogamous households can be a strategic choice by mothers with high cultural intolerance, since custody grants single mothers greater unilateral control over socialization. Divorce probabilities are decreasing in the number of children for both family types. Interestingly, heterogamous households invest more in socialization when divorced than when married, because the high-intolerance parent can act without spousal opposition.&lt;/p&gt;
&lt;p&gt;Q: How well does the model fit the data?
A: The raw correlation between predicted and observed gains to marriage is 0.84. The correlation between predicted and observed foreign-language socialization rates is 0.83, for both homogamous and heterogamous families. The dataset covers 92.5% of all marriages in Italy from 1995 to 2012, representing over 4 million marriages matched with birth and separation records at a 98.5% one-to-one match rate.&lt;/p&gt;
&lt;p&gt;Q: What happens to cultural integration when immigration inflows are doubled with an overweighting of North Africa–Middle East, Sub-Saharan Africa, and East Asian immigrants?
A: North Africa–Middle East immigrants reduce third-generation convergence by only 4 percentage points. By contrast, East Asian and Sub-Saharan African minorities produce integration losses ranging from 20 to 60 percentage points by the third generation. This wide range reflects how the interaction between high fertility, strong homogamy selection, and effective socialization in heterogamous marriages amplifies cultural persistence when these groups constitute a larger share of inflows.&lt;/p&gt;
&lt;p&gt;Q: What is the one-generation cultural integration rate, and which groups diverge most from it?
A: Seventy-five percent of second-generation immigrants speak Italian at home with their children, constituting the one-generation baseline integration rate. Europe-EU15 and Other Europe minorities converge almost completely within one generation, as does North Africa–Middle East. Latin America diverges most sharply downward, with only 70% integration even after four generations, and shows a partial retreat from integration in the first generation. Sub-Saharan Africa and East Asia also fall below the 75% one-generation benchmark.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to the debate on native labor market effects of immigration?
A: The paper notes that sizeable negative labor market effects of immigration on natives are far from well-documented in the empirical literature, with results ranging from negative wage effects (Borjas) to positive or heterogeneous effects (Card, Ottaviano-Peri, Dustmann et al.). The authors therefore focus on the cultural externalities channel, which they argue better explains voter opposition to immigration, and study cultural integration structurally rather than examining wage outcomes.&lt;/p&gt;
&lt;p&gt;Cultural intolerance: The psychological value a parent obtains from socializing a child to his/her own ethnic identity, relative to having a child adopt a different cultural-ethnic identity. It is specific to the household type (homogamous vs. heterogamous) and is the primary parameter measuring the strength of a group&amp;rsquo;s resistance to cultural integration.&lt;/p&gt;
&lt;p&gt;Cultural socialization / language transmission: The costly investments parents make to transmit their own cultural-ethnic traits to children. In the empirical model, socialization is proxied by whether a parent speaks his/her own non-Italian language at home with children. Socialization technologies are more efficient in homogamous (same-ethnicity) marriages than heterogamous ones.&lt;/p&gt;
&lt;p&gt;Homogamous vs. heterogamous marriage: A homogamous marriage is one in which both spouses share the same cultural-ethnic identity; a heterogamous marriage is one in which spouses differ. The distinction is load-bearing throughout the model: homogamous households have coordinated socialization incentives and superior technology, higher fertility, and lower separation rates.&lt;/p&gt;
&lt;p&gt;Transferable utility (TU) matching: A marriage market framework in which utility is transferable between spouses, so that the equilibrium allocation maximizes aggregate marital surplus and equilibrium transfers are determined by outside options. The model is frictionless, meaning matching is driven purely by preferences over the characteristics of potential spouses.&lt;/p&gt;
&lt;p&gt;Cultural integration (language dimension): In the paper&amp;rsquo;s long-run simulations, cultural integration is defined as the share of second- (or later-) generation immigrants who speak Italian at home with their own children. It is the empirical outcome used to track convergence to the majoritarian culture across generations.&lt;/p&gt;
&lt;p&gt;Assortative mating along cultural-ethnic lines: The tendency for individuals to match with spouses of the same cultural-ethnic group. The paper finds positive assortative mating for all groups, with particularly strong homogamy for Sub-Saharan African and East Asian minorities, and explains it as the equilibrium outcome of the TU matching model given cultural intolerance preferences.&lt;/p&gt;
&lt;p&gt;Socialization technology asymmetry: The model&amp;rsquo;s assumption that homogamous married parents hold a more efficient socialization technology than heterogamous parents, but that divorced heterogamous households invest more in socialization than married heterogamous ones, because the high-intolerance parent can act unilaterally without spousal opposition.&lt;/p&gt;</description></item><item><title>Measuring and Mitigating Racial Disparities in Tax Audits</title><link>https://macropaperwarehouse.com/papers/measuring-and-mitigating-racial-disparities-in-tax-audits/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/measuring-and-mitigating-racial-disparities-in-tax-audits/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Do Black taxpayers face higher IRS audit rates than non-Black taxpayers, despite race-blind audit selection? And if so, why — and what would mitigation look like?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The authors use comprehensive administrative microdata covering approximately 148 million individual income tax returns and 780,627 operational audits for tax year 2014, supplemented with 71,878 research audits from the IRS National Research Program (NRP) pooled over 2010-2014. Because neither the researchers nor the IRS observe taxpayer race, the authors employ Bayesian Improved First Name Surname Geocoding (BIFSG), which imputes the probability that a taxpayer is Black from first name, surname, and Census Block Group. They develop a novel partial identification strategy: two estimators (a probabilistic estimator and a linear estimator) that, under conditions verified using a matched North Carolina voter-registration dataset containing self-reported race, asymptotically bound the true racial audit disparity from below and above respectively. To address the selective labels problem — underreporting is observable only for audited returns — the authors combine operational audit data with NRP random-sample audits to simulate counterfactual audit selection algorithms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Magnitude of the disparity.&lt;/em&gt; The probabilistic estimator implies a racial audit disparity of 0.81 percentage points; the linear estimator implies 1.34 percentage points. Against a base audit rate of 0.54% for the overall U.S. population in 2014, these bounds imply that Black taxpayers are audited at between 2.9 and 4.7 times the rate of non-Black taxpayers.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Role of the EITC.&lt;/em&gt; The disparity is concentrated among EITC claimants. The estimated disparity within the EITC population is 1.96 to 2.90 percentage points, compared to only 0.10 to 0.18 percentage points among non-EITC claimants. In relative terms, Black EITC claimants are audited at 2.9 to 4.4 times the rate of non-Black EITC claimants. A formal decomposition attributes 70-73% of the overall disparity to higher audit rates among Black EITC claimants, 20-21% to racial differences in EITC claiming rates, and 7-8% to differential audit rates among non-EITC filers. Within EITC claimants, 78.5% of the observed audit disparity is attributable to the Dependent Database (DDb) program.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Source of the disparity — algorithmic objective.&lt;/em&gt; Using counterfactual audit selection algorithms estimated on NRP data, the authors find that allocating EITC audits to maximize detected total underreporting (from any source) would produce audit rates of 0.74% for Black EITC claimants versus 1.63% for non-Black EITC claimants — reversing the disparity. In contrast, the status quo, which prioritizes detecting overclaimed refundable credits, yields 3.00% for Black claimants versus 1.04% for non-Black claimants. The primary driver is a difference in the types of noncompliance that are more prevalent by race: dependent-claiming errors are more common among Black EITC claimants (dependent error rate of 26.6% vs. 16.3% for non-Black), while the highest underreporting via business income underreporting is disproportionately concentrated among non-Black EITC claimants. An algorithm focused on refundable credit overclaims implicitly targets dependent errors and therefore selects Black taxpayers at higher rates.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Prediction model bias.&lt;/em&gt; Even conditional on the refundable-credit objective, the status quo disparity (1.96 p.p.) exceeds the disparity that would arise under an oracle that uses actual rather than predicted refundable credit overclaims (1.08 p.p.), suggesting that prediction errors are unevenly distributed by race. The refundable credit prediction algorithm generates a disparity of 1.75 p.p., approximately 60% larger than the oracle. The authors find suggestive evidence of missingness in birth certificate data (paternal information is disproportionately missing for children claimed on Black taxpayers&amp;rsquo; returns) and differential predictive accuracy in the DDb risk score across race.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Operational consequences.&lt;/em&gt; Switching the objective from refundable credit overclaims to total underreporting would shift the composition of audited returns from predominantly dependent-eligibility issues (80% of refundable credit oracle-selected returns contain a dependent error) toward business income (86% of total-underreporting oracle-selected returns have business income underreporting). EITC returns with substantial business income (gross receipts above $25,000) cost on average $369.70 to audit versus $23.09 for other EITC returns. Holding the audit rate fixed, the switch would raise average examination costs by nearly an order of magnitude, while also increasing detected underreporting (mean adjustment of $22,578 per return under the total underreporting oracle versus $9,595 under the refundable credit oracle).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results pertain primarily to tax year 2014. The paper finds similar patterns for tax years 2010, 2012, 2016, and 2018. The analysis covers Black versus non-Black taxpayers; disparities for other racial and ethnic groups are not the focus. The selective labels identification strategy relies on the NRP random-audit sample and the bounding conditions verified in the North Carolina matched data.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. Why can&amp;rsquo;t the disparity be attributed simply to Black taxpayers being more likely to claim the EITC, combined with EITC claimants facing higher audit rates generally?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors test this directly by estimating racial audit disparities separately within EITC claimants and non-claimants. If differential EITC claiming rates were the full explanation, the within-EITC disparity would be close to zero. Instead, the disparity among EITC claimants (1.96-2.90 p.p.) is larger in absolute terms than the overall disparity (0.81-1.34 p.p.), indicating that Black EITC claimants face substantially higher audit rates than non-Black EITC claimants even holding EITC claimant status fixed. The formal decomposition attributes 70-73% of the overall disparity to differential audit rates within the EITC claimant population, not to differential claiming rates across the population.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How does the partial identification strategy work, and what are its key identifying assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors derive two estimators of the racial audit disparity that use BIFSG-imputed race probabilities rather than observed race. The probabilistic estimator weights each taxpayer&amp;rsquo;s contribution by their estimated probability of being Black; it is downward-biased when there is a positive residual covariance between audits and true race after conditioning on imputed race (E[Cov(Y,B|b)] &amp;gt; 0). The linear estimator regresses audit status on imputed race probability; it is upward-biased when there is a positive residual covariance between audits and imputed race after conditioning on true race (E[Cov(Y,b|B)] &amp;gt; 0). When both covariance terms are positive, the probabilistic and linear estimates bound the true disparity from below and above. The authors verify both conditions are positive and statistically significant (p &amp;lt; 0.01) in the matched North Carolina dataset, for the full population and the EITC population specifically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. Does the racial audit disparity within EITC claimants disappear when comparing taxpayers with similar levels of underreporting?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The authors use NRP data to estimate audit rates by race within each underreporting decile among EITC claimants. Within every decile of the underreporting distribution, the estimated audit rate for Black taxpayers exceeds that for non-Black taxpayers. An oracle algorithm that selects returns in descending order of actual underreporting produces an audit rate of 0.74% for Black EITC claimants and 1.63% for non-Black EITC claimants — the opposite of the status quo pattern (3.00% for Black, 1.04% for non-Black). This rules out total-dollar underreporting as the primary driver of the observed disparity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. Why does focusing audit selection on refundable credit overclaims specifically lead to higher audit rates for Black taxpayers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms operate simultaneously. First, EITC eligibility is linked to children, so detecting erroneously claimed dependents generates large refundable credit adjustments. The dependent error rate is higher among Black EITC claimants than non-Black EITC claimants (26.6% vs. 16.3% in the probabilistic estimate, or 30.8% vs. 15.4% in the linear estimate). Second, the highest-dollar noncompliance via underreported business income is disproportionately concentrated among non-Black EITC claimants: among EITC claimants in the top 1% of business income underreporting, the probabilistic estimate shows 0.05% are Black compared to 0.21% non-Black. An algorithm aimed at refundable credit overclaims implicitly targets dependent errors and therefore selects Black taxpayers at higher rates; one aimed at total underreporting would prioritize business income underreporting instead and therefore select non-Black taxpayers at higher rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. How do the simulated algorithms compare to the actual IRS algorithms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors cannot directly replicate the IRS&amp;rsquo;s confidential DDb algorithm, but they provide three pieces of evidence that their refundable credit prediction algorithm is a reasonable proxy. First, public governmental documents describe DDb&amp;rsquo;s stated goal as identifying taxpayers who do not meet refundable credit eligibility requirements. Second, when selecting audits based on predicted refundable credit overclaims using largely the same features available to IRS, the authors generate a disparity (1.75 p.p.) close to the status quo disparity (1.96 p.p.). Third, operational audits of EITC returns are strongly associated with their predicted refundable credit overclaims measure but show a much weaker association with predicted total underreporting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What does the status quo disparity exceeding the refundable credit oracle disparity reveal about prediction model design?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The status quo disparity (1.96 p.p.) is approximately 80% larger than the disparity that would arise if the IRS were perfectly informed about actual refundable credit overclaims and selected accordingly (oracle disparity: 1.08 p.p.). The refundable credit prediction algorithm generates a disparity of 1.75 p.p., approximately 60% larger than the oracle. This gap between the oracle and prediction disparity is consistent with prediction errors being distributed unevenly by race. The authors find that birth certificates of children claimed on Black taxpayers&amp;rsquo; returns are substantially more likely to be missing paternal identity information, which may reduce the predictive accuracy of the DDb model for this population. They provide suggestive evidence that modifying the predictive features used could reduce the disparity without substantially degrading credit overclaim detection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. What are the downstream operational consequences of switching the algorithmic objective?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Switching from refundable credit overclaims to total underreporting would shift audited issues from dependent eligibility (80% of refundable credit oracle-selected returns have a dependent error) toward business income (86% of total underreporting oracle-selected returns have business income underreporting). Auditing business income returns is substantially more resource-intensive: $369.70 per return on average for returns with gross receipts above $25,000, versus $23.09 for other EITC returns. Holding the current EITC audit rate fixed, the share of audited returns with substantial business income would rise from 3% to 93%, raising total examination costs by nearly an order of magnitude. However, because total detected underreporting per audited return would also rise substantially (mean of $22,578 vs. $9,595), the increase in detected noncompliance would exceed the increase in audit costs, and the qualitative pattern persists even when accounting for higher per-return costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. Is the disparity consistent across years, and is it driven by a particular audit type?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors find comparable audit disparities for tax years 2010, 2012, 2016, and 2018, confirming the 2014 results are not year-specific. The disparity is concentrated in correspondence audits: the estimated disparity in correspondence audit rates is 0.804-1.328 p.p. for the full population, while the disparity in field/office audit rates is only 0.010-0.016 p.p. The disparity is present in both pre-refund and post-refund audits, though pre-refund audits show a larger disparity even among correspondence audits alone. Among EITC claimants, the correspondence audit channel is nearly entirely responsible for the group-level disparity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. What heterogeneity exists within EITC claimants?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The disparity is especially pronounced among unmarried male EITC claimants with dependents: among this subgroup, the audit rate for Black men exceeds the audit rate for non-Black men by more than 4 percentage points, and both are an order of magnitude above the overall U.S. population audit rate. Disparities are smaller among joint filers, unmarried women, and unmarried men without dependents, though the ratio of Black to non-Black audit rates remains substantial across all subgroups. The concentration of the disparity among unmarried men with dependents is consistent with the role of dependent-claiming errors, which are more likely to arise in family structures characterized by nonmarital cohabitation — a pattern more prevalent among Black Americans due to lower marriage rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. Can the disparity be attributed to disparate treatment — i.e., race-conscious selection?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors rule out disparate treatment for the EITC population. The DDb audit selection process for EITC returns is automated (no manual review), and IRS does not use race or geography as an input into audit selection. The disparity is therefore the product of disparate impact: race-neutral selection criteria interact with racially correlated patterns of tax return characteristics to produce differential audit rates. For higher-income non-EITC taxpayers, where audit selection may involve human classifiers, the authors cannot rule out disparate treatment.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Audit Disparity (D).&lt;/strong&gt; Defined in the paper as D = E[Y|B=1] - E[Y|B=0], the difference in audit rates between Black taxpayers (B=1) and non-Black taxpayers (B=0). This is a group-level difference in selection rates, not conditional on any other characteristic, and is the primary estimand throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Probabilistic Disparity Estimator.&lt;/strong&gt; An estimator that calculates group-specific audit rates by weighting each taxpayer&amp;rsquo;s contribution by their BIFSG-imputed probability of being Black (or non-Black). It is shown to be downward-biased when E[Cov(Y,B|b)] &amp;gt; 0, i.e., when there is residual positive association between true race and audits after conditioning on imputed race.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Linear Disparity Estimator.&lt;/strong&gt; An estimator based on regressing audit status (Y) on BIFSG-imputed race probability (b). It is shown to be upward-biased when E[Cov(Y,b|B)] &amp;gt; 0, i.e., when imputed race probability predicts audits even after conditioning on true race. Together, the probabilistic and linear estimators form bounds on the true disparity under conditions verified empirically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;BIFSG (Bayesian Improved First Name Surname Geocoding).&lt;/strong&gt; A probabilistic race imputation method that uses Bayes rule under a conditional independence assumption (first name, surname, and geography are independent given race) to compute Pr[Black | first name, surname, Census Block Group]. Applied here to all 148 million tax returns; calibrated and validated against matched North Carolina voter registration data with self-reported race.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Selective Labels Problem.&lt;/strong&gt; The problem that noncompliance (underreporting) is observed only for returns selected for audit, not for the full filing population. In this paper it means the IRS cannot directly observe the underreporting distribution for unaudited returns. The authors address this using NRP random-audit data, which allows estimation of the unaudited underreporting distribution and construction of counterfactual selection algorithms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Algorithmic Objective.&lt;/strong&gt; The paper distinguishes between (1) the prediction component of audit selection — which model to use to forecast noncompliance — and (2) the objective component — what type of noncompliance to predict and pursue (overclaimed refundable credits versus total underreporting from any source). The paper finds that the objective, not just prediction error, is an independent driver of the racial audit disparity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dependent Database (DDb) Program.&lt;/strong&gt; The IRS&amp;rsquo;s primary EITC audit selection program, responsible for approximately 75% of audited EITC returns in 2014. DDb flags returns based on rules, heuristics, and proprietary risk scores, with the stated goal of identifying taxpayers who do not meet refundable credit eligibility requirements. Selection through DDb is fully automated, without human classifier review.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;National Research Program (NRP).&lt;/strong&gt; A stratified random sample audit program through which the IRS conducts near-line-by-line examinations of a small fraction of the filing population each year (approximately 2% of audited returns in 2014). The paper pools 71,878 NRP audits from 2010-2014 to identify the distribution of underreporting in the full EITC filing population and to estimate counterfactual selection algorithms.&lt;/p&gt;</description></item><item><title>Merger Effects and Antitrust Enforcement: Evidence from US Consumer Packaged Goods</title><link>https://macropaperwarehouse.com/papers/merger-effects-and-antitrust-enforcement-evidence-from-us-consumer-packaged-goods/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/merger-effects-and-antitrust-enforcement-evidence-from-us-consumer-packaged-goods/</guid><description>&lt;p&gt;This paper by Bhattacharya, Illanes, and Stillerman makes two contributions to the debate over US antitrust enforcement stringency. First, it documents the price, quantity, and assortment effects of a comprehensive set of consummated mergers in US consumer packaged goods (CPG). Second, it develops and estimates a model of agency enforcement decisions to quantify antitrust stringency and simulate counterfactual outcomes under stricter regimes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and scope.&lt;/strong&gt; The analysis covers 129 product markets across 47 transactions in US CPG from 2006 to 2017, using the NielsenIQ Retail Scanner Dataset (covering 35,000–50,000 stores and 2.6–4.5 million UPCs). The sample is restricted to all deals valued at $280 million or more where both the acquirer and target sold products in at least one overlapping product market-DMA. Geographic markets are NielsenIQ designated market areas (DMAs). The sample is defined to avoid selection bias from studying only mergers that attracted press attention or were litigation targets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification strategy.&lt;/strong&gt; The empirical approach is a before-after event study within geography and product. For each merger, a brand-specific linear time trend is estimated from the 36 months prior to the merger announcement, controlling for UPC-DMA fixed effects, month-of-year fixed effects, input cost indices, and log median household income. Post-merger outcomes (24 months after completion) are measured as deviations from the extrapolated pre-merger trend. The identifying assumption is that secular demand and cost trends are gradual and well-captured by a linear trend. Pre-trend placebo tests show no significant departures from trend in the pre-period, and randomized-date placebos confirm that the linear trend is a better predictor of post-period outcomes under random merger dates than under actual merger dates, supporting the interpretation that observed post-period departures reflect merger effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price effects.&lt;/strong&gt; The average price effect of consummated CPG mergers is small: across specifications, estimates range from -0.6% to 1.0%, with a baseline mean of 0.3%. However, heterogeneity is substantial. The standard deviation of merger-level price effects is 4.0–7.5 percentage points. In the baseline specification, the first quartile of price effects is -2.1% and the third quartile is 3.7%. Merging and non-merging party price changes are positively correlated (correlation = 0.49), consistent with strategic complementarity. Thirty-six percent of mergers lead both groups to lower prices; 36% lead both groups to raise prices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantity and assortment effects.&lt;/strong&gt; Total quantities fall on average by 0.4–1.0% across specifications, with 60% of mergers producing quantity reductions. Merging parties exhibit a larger average quantity decline of 6.4%. Mergers also lead to a 2.7% average reduction in the number of stores served by merging parties, a 2.2% reduction in the number of brands sold in a DMA by merging parties, and a 3.2% reduction for non-merging parties. Brands with less than 5% of the merged entity&amp;rsquo;s sales are 6 percentage points more likely to be dropped post-merger.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enforcement model.&lt;/strong&gt; To interpret these outcomes relative to enforcement, the authors develop a model in which the agency receives a noisy signal of a merger&amp;rsquo;s price effect and challenges the merger if the posterior mean exceeds a threshold that is decreasing in deal size. They estimate the model by maximum likelihood using data on enforcement actions (6 mergers receiving remedies, 4 withdrawn under antitrust pressure) and realized price changes. The estimated sales-weighted average threshold is 4.8–6.3%: agencies act as if they challenge CPG mergers only when they expect a price increase exceeding this level. The posterior standard deviation of the agency&amp;rsquo;s assessment is 2.5–3.2 pp (aggregate prices) to 4.1–4.8 pp (merging-party prices).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual stringency.&lt;/strong&gt; Tightening the threshold from approximately 6.1% to 2.5% would roughly quadruple the challenge probability (from 0.075 to 0.30), reduce aggregate price changes of consummated mergers by approximately 1.4 pp, and lower the share of allowed anti-competitive mergers from roughly 50% to 35%. Critically, type I errors (blocking pro-competitive mergers) remain negligible at thresholds down to approximately 3%; at 0% threshold only 10% of blocked mergers would be type I errors. The primary cost of tighter enforcement is a significantly larger agency workload, not an increase in blocked pro-competitive mergers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; Results pertain specifically to large CPG mergers (deal size ≥ $280 million) sold through US retail outlets, 2006–2017. Findings on structural presumptions show DHHI and merging share have predictive value for price changes, but structural metrics alone explain less than 10% of the variance in price effects (adjusted R-squared never exceeds 10% even with third-order interactions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the average price effect of consummated CPG mergers and how should it be interpreted?&lt;/strong&gt;
A: Across specifications, the average price effect is between -0.6% and 1.0%, with a baseline mean of 0.3%. This small average does not imply that enforcement is strict: Carlton (2009) shows that with perfect foresight, the largest observed price change — not the average — would indicate stringency. Because agencies face uncertainty, the distribution of realized price changes reflects both inframarginal approved mergers and the noise in agency forecasts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How large is the heterogeneity in merger price effects?&lt;/strong&gt;
A: The standard deviation of merger-level price effects is 4.0–7.5 percentage points across specifications. In the baseline, the first quartile of price effects is -2.1% and the third quartile is 3.7% for all parties combined. Merging parties specifically show a first quartile of -3.2% and third quartile of 3.7%, meaning a full quarter of mergers raise merging-party prices by more than 3.7%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do merging and non-merging party prices co-move?&lt;/strong&gt;
A: Price changes for merging and non-merging parties are positively correlated (correlation = 0.49, s.e. = 0.08), consistent with strategic complementarity in pricing. Thirty-six percent of mergers lead both groups to lower prices, 36% lead both to raise prices, 13% cause merging parties to lower while non-merging parties raise, and 15% cause the reverse. The timing evidence shows merging-party prices begin changing upon merger completion, with rivals following suit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens to quantities following mergers?&lt;/strong&gt;
A: Total quantities fall on average between 0.4% and 1.0% across specifications, with 60% of mergers producing quantity reductions. Merging parties bear the bulk of quantity adjustment, with an average quantity decline of 6.4% and a standard deviation and interquartile range both around 30 pp. Non-merging party quantity changes are much less variable. The correlation between merging and non-merging party quantity changes is 0.36 (s.e. 0.08), which is positive — at odds with theoretical predictions from demand systems with the &amp;ldquo;type aggregation property&amp;rdquo; (Nocke and Schutz, 2018, 2024), where mergers should produce negatively correlated quantity changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What non-price competitive responses do mergers trigger?&lt;/strong&gt;
A: Merging parties reduce the number of stores they serve by 2.7% on average, though in 38% of mergers store networks expand. Both merging and non-merging parties reduce product portfolios: merging parties drop the number of brands in a DMA by 2.2% on average and non-merging parties by 3.2%. Brands most likely to be dropped are those with less than 5% of the merged entity&amp;rsquo;s sales (6 pp more likely to be dropped), brands in small DMAs, and brands with small DMA shares.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Do the Merger Guidelines&amp;rsquo; structural presumptions (HHI, DHHI, merging share) predict price effects?&lt;/strong&gt;
A: DHHI and merging share have statistically significant but quantitatively modest predictive power. A 100-point increase in average DHHI is associated with a 0.2 pp increase in merging-party price changes and 0.3 pp for non-merging parties. Price effects are significantly larger when merging share exceeds 30%. However, structural metrics alone explain very little variance: adjusted R-squared never exceeds 10% even with third-order interactions of HHI, DHHI, merging share, private label share, and market size. Within-merger, DHHI is positively correlated with local price changes, and markets with DHHI above 200 exhibit significantly higher price effects than those below.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do the authors model antitrust enforcement and identify its stringency?&lt;/strong&gt;
A: The agency observes a noisy signal of a merger&amp;rsquo;s price effect, forms a posterior distribution combining a normally distributed prior (mean X&amp;rsquo;beta, standard deviation sigma_p*) with a normally distributed signal error (standard deviation sigma_epsilon), and challenges the merger if the posterior mean exceeds a threshold that is decreasing in deal size. The model is estimated by maximum likelihood: for approved mergers, the realized price change is observed; for withdrawn/remedied mergers, the posterior mean must have exceeded the threshold. Six mergers (from four deals) received remedies for horizontal market power concerns and four mergers (from two deals) were withdrawn under antitrust pressure, forming the challenged set.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the estimated enforcement threshold and how does it vary across mergers?&lt;/strong&gt;
A: The sales-weighted average threshold is 4.8–6.3% using aggregate price changes and 6.6–7.8% using merging-party price changes. The threshold is lower for larger mergers: a 10% increase in merging-party sales is associated with an approximately 0.06 pp decrease in the threshold. The first quartile of thresholds across mergers is 4.5–5.6% and the third quartile is 5.6–6.9%, reflecting that the agencies apply stricter standards to larger deals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How accurate are the agencies&amp;rsquo; forecasts of merger price effects?&lt;/strong&gt;
A: Using only the prior (structural characteristics), the agency&amp;rsquo;s accuracy in classifying mergers as anti-competitive versus pro-competitive is 56% (s.e. 3 pp). Adding the signal increases accuracy to 83% (s.e. 9 pp). The correlation between the prior mean and the true price change is 0.29 (s.e. 0.08); the correlation between the posterior mean and the true price change is 0.85 (s.e. 0.15). The posterior standard deviation is 2.5–3.2 pp for aggregate price changes and 4.1–4.8 pp for merging-party price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What would happen under stricter antitrust enforcement?&lt;/strong&gt;
A: Tightening the average threshold from 6.1% to 2.5% would raise the challenge probability from approximately 0.075 to 0.30 — roughly quadrupling it — and would reduce aggregate price changes of consummated mergers by approximately 1.4 pp (from roughly 0.2% to -1.2%). Moving to a 0% threshold would result in challenges to 57% of mergers, with 60–70% of consummated mergers then causing price decreases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How large are type I and type II errors at the current and counterfactual thresholds?&lt;/strong&gt;
A: At the current threshold (~6.1%), approximately 50% of allowed mergers are type II errors (anti-competitive mergers that should have been challenged). Type I errors (pro-competitive mergers wrongly blocked) are negligible at the current threshold and only become non-trivial starting around a 3% threshold. At a 2.5% threshold, the type II error share falls to 35%; at a 0% threshold, to 16%, while type I errors reach 10% of blocked mergers. The primary trade-off of stricter enforcement is therefore a larger agency workload, not an increase in blocking pro-competitive mergers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What identification strategy is used and how is it validated?&lt;/strong&gt;
A: The strategy is a within-product, within-geography before-after comparison using a brand-specific linear pre-merger trend as the counterfactual. Validation proceeds through three checks: (1) coefficient plots from an extended event study show no significant pre-trends after controlling for the linear trend; (2) a plot of brand trends against estimated price effects shows little explanatory power (statistically significant negative correlation but small magnitude, not consistent with results being driven by trend extrapolation); (3) placebo tests randomizing merger dates within the same markets yield a distribution centered at zero, narrower than the true distribution, and a significantly higher mean squared prediction error in the post-period, confirming that the linear trend is a better predictor under randomly assigned merger dates than under true dates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why do the authors not use alternative control group approaches?&lt;/strong&gt;
A: Non-merging firms in the same market are rejected as controls because they may strategically respond to the merger. Synthetic controls using similar-industry untreated markets are rejected because deals often treat multiple similar markets (ruling out natural donors) and estimates prove sensitive to individual donors. Geographic controls (markets where merging parties have small shares) are rejected because they omit all 39 national mergers, untreated markets are not randomly selected, and regional pricing by non-merging parties could propagate effects into untreated regions, biasing estimates toward zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Merger retrospective.&lt;/strong&gt; In this paper&amp;rsquo;s usage, an ex-post empirical study of the price, quantity, and assortment effects of a consummated merger, using pre-merger trends as the counterfactual, as opposed to forward-looking merger simulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enforcement stringency.&lt;/strong&gt; The marginal price increase at which the antitrust agency would expect to challenge a merger. Measured here as the sales-weighted average posterior-mean threshold: the value above which the agency acts as if it would propose a remedy, estimated at 4.8–6.3% for US CPG mergers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Type I error (antitrust).&lt;/strong&gt; The mistake of challenging (blocking) a merger that would have reduced prices (a pro-competitive merger). In the model, this occurs when an adverse signal causes the agency to block a merger whose true price effect is below the threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Type II error (antitrust).&lt;/strong&gt; The mistake of allowing a merger that increases prices (an anti-competitive merger). In the model, this occurs when a favorable signal causes the agency to approve a merger whose true price effect is above the threshold. Estimated at approximately 50% of allowed mergers at the current enforcement threshold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural presumptions.&lt;/strong&gt; The HHI-based rules in the 2010 and 2023 Merger Guidelines that create a presumption of competitive harm when DHHI exceeds specified thresholds (e.g., DHHI &amp;gt; 200 and post-merger HHI &amp;gt; 2,500 for the &amp;ldquo;red zone&amp;rdquo;). The paper finds DHHI and merging share have statistically significant but low explanatory power (adjusted R-squared below 10%) for actual price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Prior and signal (in the enforcement model).&lt;/strong&gt; The agency&amp;rsquo;s prior is a normal distribution over the merger&amp;rsquo;s true price effect, parameterized by structural characteristics (HHI, DHHI). The signal is a noisy draw centered on the true price effect, capturing information gathered through due diligence (e.g., evidence of efficiencies). The posterior mean — combining prior and signal — determines whether the agency challenges the merger.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Product market-deal pair (merger).&lt;/strong&gt; The unit of observation in the empirical analysis: a specific NielsenIQ product module (e.g., soluble coffee) within a specific acquisition transaction (e.g., a food conglomerate merger). The sample contains 129 such pairs across 47 deals.&lt;/p&gt;</description></item><item><title>Micro MPCs and Macro Counterfactuals: The Case of the 2008 Rebates</title><link>https://macropaperwarehouse.com/papers/micro-mpcs-and-macro-counterfactuals-the-case-of-the-2008-rebates/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/micro-mpcs-and-macro-counterfactuals-the-case-of-the-2008-rebates/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Do the high marginal propensities to consume (MPCs) estimated in the leading household studies of the 2008 U.S. tax rebates—particularly Parker et al. (2013), which found MPCs of 50–90 percent within three months—imply plausible macroeconomic counterfactuals? And if not, what combination of micro-level bias corrections and general equilibrium forces reconciles the micro evidence with aggregate data?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting.&lt;/strong&gt; The 2008 Economic Stimulus Act distributed approximately $100 billion in tax rebates, totaling eleven percent of January 2008 monthly disposable income. Among the 85 percent of households receiving a check, the average amount was $1,000. Rebates were distributed primarily from April through July 2008, with nearly half delivered in May alone. The timing of receipt was determined by the last two digits of Social Security numbers, providing quasi-random variation exploited by the household-level literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The paper proceeds in two halves. In the first, the authors construct macro counterfactuals by calibrating a standard medium-scale two-good, two-agent New Keynesian (TANK) model with the micro MPCs from the literature and simulating what aggregate consumption would have been absent the rebate. The model contains life-cycle permanent income households and hand-to-mouth households whose dynamic spending propensities are calibrated directly to match the household-level estimates. General equilibrium effects—including Keynesian income multipliers, real interest rate movements, and changes in the relative price of durable goods—are incorporated. Counterfactual consumption paths are constructed by subtracting model-simulated deviations from steady state from actual NIPA consumption data.&lt;/p&gt;
&lt;p&gt;In the second half, the authors revisit both the micro estimates and the macro model. On the micro side, they identify three upward biases in standard two-way fixed effects (TWFE) estimates applied to CEX data: (1) omitted variable bias from excluding the lagged rebate indicator; (2) &amp;ldquo;forbidden comparisons&amp;rdquo; bias arising from comparing cohorts with heterogeneous treatment effects, following Borusyak et al. (2022) and Sun and Abraham (2020); and (3) a rebate reporting bias in which households are systematically more likely to report receiving the rebate in the month that coincides with large expenditure increases, causing spurious positive correlation between reported receipt and contemporaneous spending. On the macro side, the baseline model is modified to incorporate an upward-sloping supply curve for durable goods (calibrated to a supply elasticity of 5, midway between House and Shapiro (2008) and Goolsbee (1998)), replacing the baseline assumption of frictionless conversion between nondurable and durable intermediates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings with quantitative magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Implausibility of baseline counterfactuals.&lt;/em&gt; When calibrated to Parker et al.&amp;rsquo;s (2013) micro MPC of 0.9, the baseline model implies that real PCE absent the rebate would have collapsed by 6.0 percent from April through July 2008—a decline exceeded historically only by the Covid-19 lockdowns. Even the more modest micro MPC of 0.5 implies a 2.7 percent three-month PCE decline, comparable only to the 1980 Volcker disinflation with credit controls. For motor vehicle expenditures, the counterfactual drops range from 38 percent (micro MPC = 0.3) to 67 percent (micro MPC = 0.9)—larger than any historical experience, including the 30 percent Covid decline. Contemporaneous professional forecasters (Federal Reserve Greenbooks, Survey of Professional Forecasters, Goldman Sachs) predicted at most small consumption declines in summer 2008. Even the authors&amp;rsquo; own pessimistic forecast model—incorporating actual oil price paths and a Lehman Brothers bankruptcy dummy—implies that the cumulative difference between actual and forecast consumption attributable to the rebate was at most $20 billion out of $100 billion in rebates, for an implied GE-MPC of at most 0.2.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Bias correction in micro MPC estimates.&lt;/em&gt; Applying all three bias corrections to CEX data (the preferred specification with lagged rebate indicator, cohort-level treatment effects, and lagged expenditure controls), the estimated three-month MPC falls from 0.50 to 0.28 in the full sample and from 0.82 to 0.34 in the rebate-recipients-only sample, with both rounding to approximately 0.3. The Borusyak-Jaravel-Spiess (BJS) imputation method yields an MPC of 0.20 in the full sample and 0.37 in the rebate-only sample, consistent with the OLS corrections.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Composition of spending.&lt;/em&gt; In the preferred corrected specification, essentially all of the total expenditure MPC of 0.3 is accounted for by motor vehicle spending: the MPC on motor vehicles is 0.30 in the full sample and 0.26 in the rebate-only sample, while the MPC on all other expenditures is −0.02 (full sample) and 0.08 (rebate-only sample).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;General equilibrium dampening via inelastic durable supply.&lt;/em&gt; In the model with a calibrated durable supply elasticity of 5, rebate-induced demand for motor vehicles raises the relative vehicle price by approximately 1.1 percent in July 2008. This price increase crowds out durable expenditure by optimizing households through intertemporal substitution. At the preferred micro MPC of 0.3, the general equilibrium MPC (GE-MPC) for total PCE is only 0.07, well below the 0.3 micro estimate. At a micro MPC of 0.5, the GE-MPC is 0.22. The combination of the bias-corrected micro MPC and dampening general equilibrium forces implies a general equilibrium consumption multiplier below 0.2 for the 2008 rebates.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Importance of durable goods composition for HANK models.&lt;/em&gt; A model that abstracts from durable goods and calibrates the full expenditure micro MPC to nondurable spending predicts a GE-MPC of 0.36 when the micro MPC is 0.30—five times larger than the 0.07 implied by the model with durable goods. This contrast illustrates that the distribution of spending across nondurable and durable goods is a key determinant of the aggregate fiscal multiplier, in addition to heterogeneity in wealth and income emphasized by the existing HANK literature.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the central empirical puzzle the paper addresses?&lt;/strong&gt;
A. The leading household studies of the 2008 rebates estimate very high three-month MPCs (50–90 percent). When these estimates are plugged into a standard New Keynesian model to construct counterfactual consumption paths absent the rebate, the model implies that PCE would have collapsed by 2.7–6.0 percent from April through July 2008 and then sharply recovered just as Lehman Brothers failed in September. No contemporaneous forecaster or narrative evidence suggests such extreme, short-lived macroeconomic stress was present. The Lehman collapse itself caused only a 1.1 percent three-month PCE decline—smaller than all three counterfactual declines implied by micro MPCs of 0.3, 0.5, or 0.9.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. What are the features of the TANK model used to construct the counterfactuals?&lt;/strong&gt;
A. The model is a two-good (nondurable and durable), two-agent (optimizing life-cycle and hand-to-mouth) New Keynesian model calibrated at monthly frequency, building on Ramey (2021) and Galí et al. (2007). Intermediate goods can, in the baseline, be frictionlessly converted into either nondurable or durable goods (implying a fixed relative price of one). Durable goods (interpreted as motor vehicles) enter household utility, with optimizing households facing a Calvo-type adjustment friction motivated by Evans and Ramey (1992) calculation costs. The fraction of hand-to-mouth consumers and their dynamic propensities to spend are calibrated directly to match the micro MPC estimates from the household literature. The model incorporates a Calvo-style price-adjustment structure for nondurables, sticky wages set by unions, capital with adjustment costs and variable utilization, and an inertial monetary policy rule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. How does the model translate micro MPCs into macro counterfactuals, and why does it amplify rather than dampen the micro estimates in the baseline?&lt;/strong&gt;
A. The model&amp;rsquo;s GE-MPC equals the micro MPC&amp;rsquo;s direct demand effect plus Keynesian income multiplier effects. Because the rebate is highly transitory, there is little movement in the real interest rate (the Phillips curve is flat and monetary policy is inertial), so the dominant general equilibrium force is the income multiplier. This amplifies, rather than dampens, the micro MPCs. As a result, the GE counterfactuals exhibit even sharper V-shapes than the pure micro counterfactuals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. What narrative and forecast evidence do the authors use to argue the baseline counterfactuals are implausible?&lt;/strong&gt;
A. Contemporary forecasts from the Federal Reserve Greenbooks, the Survey of Professional Forecasters, and Goldman Sachs all predicted at most small consumption declines in summer 2008—Goldman Sachs forecast only −0.125 percent (not annualized) per quarter in Q2–Q3 2008. The authors also construct their own &amp;ldquo;pessimistic&amp;rdquo; time-series forecast that incorporates actual oil price paths (which rose from $98 to $140 per barrel by July 2008) and a Lehman Brothers bankruptcy dummy; even this forecast lies above all three model counterfactuals in summer 2008 and displays no V-shape. Furthermore, the cumulative difference between actual PCE and the pessimistic forecast over April–October 2008 totals only $20 billion—implying a GE-MPC of at most 0.2 even if the entire gap were attributed to the rebate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What is the first bias in standard TWFE estimates of the MPC, and how large is its effect?&lt;/strong&gt;
A. The first bias is omitted variable bias from excluding the lagged rebate indicator. In a first-differenced panel regression, lagged treatment enters the error term. Because current treatment reduces the probability of past treatment, current and lagged treatment are negatively correlated, and omitting the lag inflates the OLS estimate of the contemporaneous effect. Including a lagged rebate indicator reduces the contemporaneous spending response by $40 in the full CEX sample (from $470 to $434) and by approximately $237 in the rebate-only sample (from $764 to $527).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What is the &amp;ldquo;forbidden comparisons&amp;rdquo; bias and how is it corrected?&lt;/strong&gt;
A. When treatment effects are heterogeneous across cohorts (e.g., the June rebate cohort has a larger MPC than the September cohort), standard homogeneous TWFE estimates use later-treated cohorts as control groups for earlier-treated cohorts even after accounting for average mean-reversion. Because the mean-reversion of the earlier (larger-effect) cohort is larger than that of the later cohort, this comparison is contaminated, inflating the estimate. The authors correct for this by allowing cohort-specific treatment effects, following Sun and Abraham (2020). This reduces the contemporaneous effect by a further $90 in the full sample; in the rebate-only sample the correction raises the estimate slightly (by $70) because later treatment effects are larger in that sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. What is the rebate reporting bias and what mechanism underlies it?&lt;/strong&gt;
A. The rebate reporting bias arises because households in the CEX are systematically more likely to report receiving the rebate in the interview month that coincides with high expenditure. Although the true timing of rebate checks is determined by Social Security number last-digits (and is thus random), the reported timing may reflect recall issues: households more readily remember and report receiving the rebate when it was accompanied by a large purchase. The empirical signature is a statistically significant negative effect of future rebate receipt on current expenditure (−$863 in the full sample, −$575 in the rebate-only sample at the 10% level), indicating that rebate reporters had unusually low spending in the period prior to reporting receipt. Controlling for lagged expenditure and income decile fixed effects corrects for this bias, reducing the three-month MPC in the full sample from 0.37 to 0.28.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What are the authors&amp;rsquo; preferred bias-corrected MPC estimates, and how do they compare across specifications and estimators?&lt;/strong&gt;
A. After correcting for all three biases (preferred specification, column 4 of Table 3), the implied three-month MPC is 0.28 in the full sample and 0.34 in the rebate-only sample, both approximately 0.3. The Borusyak-Jaravel-Spiess imputation method, which imposes weaker assumptions and overcomes the first two biases by construction, yields an MPC of 0.20 (full sample) and 0.37 (rebate-only sample), with an average consistent with the OLS-corrected estimates. Both methods point to an MPC around 0.3, substantially below the 0.5–0.9 range from the baseline Parker et al. (2013) approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. How is almost all of the total expenditure MPC concentrated in motor vehicles?&lt;/strong&gt;
A. After bias correction, the MPC on motor vehicles is 0.30 in the full sample and 0.26 in the rebate-only sample. The MPC on all other PCE is −0.02 (full sample) and 0.08 (rebate-only sample), neither statistically significant. This concentration in durables is consistent with Adams et al. (2009) and Aaronson et al. (2012), and is corroborated by CEX vehicle-expenditure data showing a car-purchase response concentrated in the three months surrounding receipt of the rebate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. How does introducing an upward-sloping supply curve for durable goods change the model&amp;rsquo;s general equilibrium predictions?&lt;/strong&gt;
A. In the modified model, durable goods producers face a production externality (or fixed factor) that makes the short-run supply of motor vehicles upward-sloping, with supply elasticity calibrated to 5. When rebate recipients increase demand for motor vehicles, the relative price of motor vehicles rises by approximately 1.1 percent in July 2008 (consistent with the observed 1.5 percent spike in the BLS new vehicle price index relative to core CPI around the rebate distribution). This price increase induces optimizing households to intertemporally substitute away from durable goods. Because durable demand is highly price-elastic (long-run elasticity of −1 to −15 depending on the study), even a modest relative price increase generates substantial crowding out of durable expenditure by non-recipients.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. What are the GE-MPC estimates in the modified model with less elastic durable supply, and how do they decompose?&lt;/strong&gt;
A. At the preferred micro MPC of 0.3, the GE-MPC for total PCE is 0.07—general equilibrium forces dampen the micro effect. At micro MPC of 0.5, GE-MPC is 0.22 (modest dampening). At micro MPC of 0.9, the GE-MPC rises to 1.42 (amplification). Decomposing by good type at micro MPC of 0.3: the GE-MPC on motor vehicles is 0.09 and the GE-MPC on nondurables is −0.03. The dampening is concentrated almost entirely in durable expenditure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. How sensitive are the GE-MPC results to the calibration of durable demand elasticity?&lt;/strong&gt;
A. The baseline calibration uses a long-run vehicle demand elasticity of −15, based on household-level evidence from Bachmann et al. (2021). When the authors instead use the lower-bound estimate of −6.4 from Baker et al. (2019), the GE-MPC at micro MPC of 0.3 rises from 0.07 to 0.12. Even at this lower demand elasticity there is substantial crowding out in general equilibrium, so the qualitative conclusion is robust.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13. Why does a nondurables-only model with the same overall MPC substantially overstate the fiscal multiplier?&lt;/strong&gt;
A. When abstracting from durable goods and calibrating a nondurable MPC of 0.30 (to match the overall expenditure MPC), the model predicts a GE-MPC of 0.36—five times larger than the 0.07 from the two-good model. This occurs because nondurable demand is far less price-elastic than durable demand, and the nearly-flat Phillips curve makes nondurable supply very elastic, so there is no relative-price-driven crowding out channel. The comparison illustrates that the distribution of spending across nondurable and durable goods is a quantitatively important determinant of the fiscal multiplier, independent of the level of the MPC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14. What evidence is provided that the control group in the household regressions is itself affected by the rebate in general equilibrium?&lt;/strong&gt;
A. Figure 9 in the paper plots motor vehicle spending per household by rebate-receipt status using CEX data. When rebate recipients begin reporting receipt in June 2008, motor vehicle expenditure in the rebate group rises while simultaneously falling in the never-rebate group. This pattern is consistent with the model&amp;rsquo;s prediction that the rebate-induced rise in relative motor vehicle prices crowds out purchases by non-recipient households. This general equilibrium spillover means the difference-in-differences micro MPC estimate remains valid as a micro estimate (the symmetric crowding out does not affect the treated-versus-control difference), but the aggregate GE-MPC is less than the micro MPC.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15. How do the authors verify that their preferred corrected specification recovers true MPCs?&lt;/strong&gt;
A. In Appendix C.6 the authors simulate household-level data from the modified Section 5 model and apply both the original Parker et al. (2013) specification (Equation 1) and their preferred corrected specification (Equation 5). The Parker et al. specification produces upward-biased MPC estimates in the simulated data, consistent with Kaplan and Violante&amp;rsquo;s (2014) theoretical argument. The preferred corrected specification recovers the true MPCs from the model, validating the correction methodology.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;GE-MPC (General Equilibrium Marginal Propensity to Consume).&lt;/strong&gt; The paper&amp;rsquo;s term for the aggregate increase in total consumer spending per dollar of tax rebate, incorporating both the direct micro-level demand effect of the rebate on hand-to-mouth households&amp;rsquo; consumption and the induced macroeconomic income effects from Keynesian multipliers and relative price changes. Distinct from the micro MPC, which captures only the household-level spending response before any general equilibrium feedbacks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Micro MPC.&lt;/strong&gt; The causal effect of receiving a temporary lump-sum transfer on a household&amp;rsquo;s own consumer expenditure, expressed as a fraction of the transfer amount, estimated from household panel data via difference-in-differences event studies. In the paper&amp;rsquo;s usage, this is a partial equilibrium concept that excludes any impact of the policy on prices, wages, or other households&amp;rsquo; incomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Forbidden comparisons bias.&lt;/strong&gt; A form of bias in two-way fixed effects event study estimates that arises when treatment effects are heterogeneous across cohorts and later-treated units are used as control groups for earlier-treated units whose outcomes are still reverting after treatment. Named and formalized in Borusyak and Jaravel (2017) and Borusyak et al. (2022); in this paper it manifests because cohorts receiving rebates in June have systematically larger spending responses than those receiving in September, so using September recipients as a &amp;ldquo;clean&amp;rdquo; control for June reversal yields contaminated estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rebate reporting bias.&lt;/strong&gt; A bias specific to the CEX survey data in which the timing of a household&amp;rsquo;s self-reported rebate receipt is correlated with unusually high contemporaneous expenditure (and correspondingly low prior-period expenditure), likely due to recall effects. Because the true rebate timing is random but the reported timing is not, this correlation inflates the difference-in-differences estimate of the spending effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-good, two-agent New Keynesian (TANK) model.&lt;/strong&gt; A medium-scale New Keynesian model containing two types of households (optimizing life-cycle consumers and hand-to-mouth consumers who exhaust current income) and two goods (nondurables and durable goods interpreted as motor vehicles). The model is used in this paper as a framework to translate micro MPC estimates into aggregate general equilibrium counterfactuals, calibrated at monthly frequency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Durable supply elasticity.&lt;/strong&gt; The elasticity of real durable goods production with respect to the relative price of durable goods, calibrated in the paper to 5. In the baseline model, this elasticity is infinite (the relative price is fixed at one because intermediates convert frictionlessly). With a finite supply elasticity of 5, rebate-induced durable demand causes the relative vehicle price to rise, generating crowding out of optimizing households&amp;rsquo; durable expenditure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Calvo durable adjustment friction.&lt;/strong&gt; An adjustment friction imposed on optimizing households&amp;rsquo; durable goods purchases, motivated by Evans and Ramey&amp;rsquo;s (1992) calculation cost model. Only a fraction 1−θd of households reoptimize their durable stock each period (with probability drawn randomly), producing a Calvo-type reduced form. This friction limits both the extensive and intensive margins of durable adjustment and prevents unrealistically large intertemporal substitution of durable purchases in response to price changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Macro counterfactual.&lt;/strong&gt; In this paper&amp;rsquo;s usage, the simulated path of aggregate consumption that would have occurred in the absence of the 2008 tax rebate, constructed by subtracting the model-implied impulse response to the rebate from the actual observed NIPA consumption series. Plausibility of the counterfactual is assessed by comparison to contemporaneous forecasts and to historical episodes of large consumption declines.&lt;/p&gt;</description></item><item><title>Minimum Wages, Efficiency, and Welfare</title><link>https://macropaperwarehouse.com/papers/minimum-wages-efficiency-and-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/minimum-wages-efficiency-and-welfare/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; Can minimum wages improve welfare through efficiency — by correcting monopsony-driven under-employment — and, if so, by how much? What is the optimal minimum wage, and how much of the welfare gain from a higher minimum wage comes from efficiency versus redistribution?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and methodology.&lt;/strong&gt; The paper develops a tractable general equilibrium oligopsony model with heterogeneous workers (four types: non-high-school, high-school, college workers, and capital owners) and heterogeneous firms (varying in total factor productivity), embedded in a continuum of local labor markets where firms compete strategically in Cournot fashion. Firms face downward-sloping labor supply curves; their market power generates wages below the marginal revenue product of labor (markdowns). The model is calibrated to US data using the Census Longitudinal Business Database (LBD, 2014), the Bureau of Labor Statistics Current Population Survey (CPS, 2019), and the Survey of Consumer Finances (SCF). Key calibration targets include: average firm size of 22.83 workers (LBD), 29 percent of workers earning below $15/hr (CPS), labor and capital income shares, and household-level earnings and capital income ratios. The model is validated by quantitatively replicating four strands of empirical evidence: (i) reallocation effects of the German minimum wage introduction (Dustmann et al., 2021); (ii) employer spillover responses to Amazon&amp;rsquo;s voluntary $15 minimum wage (Derenoncourt et al., 2021); (iii) wage distribution compression evidence from Brazil (Engbom and Moser, 2021); and (iv) heterogeneous employment effects by market concentration (Azar et al., 2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three channels for efficiency gains.&lt;/strong&gt; The model identifies three mechanisms through which a minimum wage can improve efficiency under oligopsony: (1) a &lt;em&gt;direct effect&lt;/em&gt; in which constrained firms with monopsony markdowns increase wages and expand employment toward the competitive level (Region II firms); (2) a &lt;em&gt;spillover effect&lt;/em&gt; in which unconstrained competitor firms narrow their own markdowns in response to constrained firms&amp;rsquo; increased wages and market shares; (3) a &lt;em&gt;reallocation effect&lt;/em&gt; in which employment is shifted away from low-productivity firms (which enter Region III — constrained on labor demand) toward more productive firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings on efficiency versus redistribution.&lt;/strong&gt; Under the $15.12/hr minimum wage that maximizes social welfare under utilitarian weights (population-share weights), less than 5 percent of the welfare gains come from improved efficiency, while more than 95 percent come from redistribution. When the government is additionally given access to budget-neutral lump-sum transfers that fully address redistribution goals, the efficiency-maximizing minimum wage narrows to a range of approximately $7.50–$10.00 per hour, which is robust across social welfare weight specifications. The welfare gains attributable to efficiency alone are approximately 0.16–0.20 percent in consumption-equivalent terms, representing only about 1–2 percent of the welfare gains achievable in an economy with no labor market power at all (which would be 15.26 percent in consumption-equivalent terms under the same conditions with optimal transfers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Why efficiency gains are small.&lt;/strong&gt; Three structural reasons limit efficiency gains: (i) low-productivity firms — which are the firms most affected by a binding minimum wage in Region II — have endogenously narrow markdowns even absent a minimum wage, because they face more elastic labor supply and command small market shares; (ii) the calibrated production function has relatively flat marginal revenue product of labor schedules (decreasing returns parameter α = 0.940), so once firms enter Region III, employment rationing occurs rapidly; (iii) the large, high-productivity firms with the widest markdowns are not materially affected by the minimum wages of their small, low-wage competitors because those competitors have small market shares — making spillovers quantitatively negligible even though the model matches empirical cross-employer wage elasticities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Optimal minimum wages under alternative frameworks.&lt;/strong&gt; Without transfers and under utilitarian weights, the optimal minimum wage is $15.12. Without transfers but under Negishi weights (which rationalize the observed competitive equilibrium and load approximately 62 percent of weight on college workers and owners versus their 35 percent population share), the optimal is $6.97. Under a 97 percent weight on high-school graduates, the optimal rises to $18.32. With optimal lump-sum transfers, the optimal collapses to $7.76–$10.11 regardless of social welfare weights — a range robust across Frisch elasticity variants (ϕ ∈ {0.30, 0.62, 0.86}), regional decompositions (low, medium, and high income US states), short-run capital-fixed scenarios (where the optimum declines by approximately $1 under utilitarian weights), and the removal of household heterogeneity entirely (which yields an optimum of $7.74).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distributional proxies versus welfare.&lt;/strong&gt; Wage inequality (college–non-college log wage premium, cross-sectional variance of log wages) and the labor income share are monotonically improving as the minimum wage rises, even as welfare is hump-shaped and eventually declining. A rise in the minimum wage from $7.50 to $15 reduces the college–non-college log wage premium from 0.53 to 0.43 (roughly one-fifth), reduces the cross-sectional variance of log wages by nearly half, and raises the aggregate labor income share by approximately 3 percentage points — all while welfare (under utilitarian weights with no transfers) reaches its maximum at $15.12 and then declines. These standard proxies therefore do not reliably indicate welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; All results are long-run steady-state comparisons unless otherwise noted. Results assume no price passthrough and a unit elasticity of substitution between capital and labor. The paper abstracts from capital–labor substitution responses and occupational choice. The redistribution channel quantified here is specific to the utilitarian welfare criterion and to the existing distribution of capital and profit income, in which owners (6 percent of households) earn 92 percent of dividends.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the three regions of firm behavior in response to a binding minimum wage, and what are their efficiency implications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: A firm can be in one of three regions. In Region I the minimum wage is not binding: the firm pays its optimal monopsony wage and employment is inelastically below the competitive level. In Region II the minimum wage binds and exceeds the firm&amp;rsquo;s optimal monopsony wage, but labor supply at the minimum wage still falls short of labor demand: employment and efficiency improve as the shadow markdown narrows. In Region III the minimum wage exceeds the competitive wage, so unconstrained labor supply would exceed demand: the firm rations employment and the rationing constraint binds, reducing efficiency. At the boundary of Region II and Region III, the shadow markdown equals one and the firm is at its efficient employment level. Only a firm-specific minimum wage targeting each firm&amp;rsquo;s competitive wage could deliver economy-wide efficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the paper define and use &amp;ldquo;shadow wages&amp;rdquo; to characterize equilibrium?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The shadow wage for a firm is the effective wage that rationalizes equilibrium employment given rationing constraints. Formally, when a firm rations employment (Region III), households act as if facing a shadow wage equal to the actual minimum wage multiplied by a rationing factor p &amp;lt; 1 (the Lagrange multiplier on the rationing constraint, normalized as a fraction). Shadow wages aggregate across firms into market- and type-level shadow wages via CES aggregation. The key insight is that shadow wages, not observed wages, are allocative: aggregate labor supply for each worker type is determined by the type-level shadow wage, not by the minimum wage that firms actually pay. This allows the paper to express aggregate efficiency via two wedges — the aggregate shadow markdown (capturing average market power) and a misallocation term — without tracking all firm-specific constraints individually.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the two aggregate efficiency wedges and how do they behave as the minimum wage rises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The two wedges are: (i) the aggregate shadow markdown µ̃, which is a productivity-weighted average of firm-level shadow markdowns and measures the extent to which aggregate wages fall short of marginal revenue products; and (ii) the misallocation term ω, which measures whether employment is allocated toward more productive firms and equals one when all shadow markdowns are identical. As the minimum wage rises from zero, µ̃ initially narrows (improving efficiency) because firms in Region II expand toward their competitive employment level and constrained firms&amp;rsquo; market shares rise, tightening the residual labor supply of unconstrained competitors and narrowing their markdowns. But as the minimum wage rises further, Region III rationing causes shadow markdowns to widen rapidly — first for low-productivity firms and then progressively for more productive ones — so µ̃ turns back downward. The misallocation term ω first improves as low-productivity firms are pushed out, but then worsens because rationing at intermediate-productivity firms redirects employment from high- to medium-productivity firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What does the model validation exercise on the German minimum wage (DLSUB 2021) show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper calibrates the model to the German context by setting a minimum wage of $8.95/hr equivalent to 48 percent of the pre-reform median wage — matching Germany&amp;rsquo;s 8.50 euro introduction in 2015, where 15 percent of workers earned below the threshold. The model produces employment effects that are slightly positive (consistent with empirical findings of no disemployment), average wage increases consistent with both constrained and unconstrained firms raising wages, a negative elasticity of the number of operating firms with respect to minimum wage exposure (correctly signed, moderately smaller than data), and a positive elasticity of average firm size with respect to exposure (slightly larger than the data). The reallocation direction — small unproductive firms shrinking and workers moving to larger, more productive firms — matches the data qualitatively and within the range of data estimates across specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the Amazon spillover replication (DNWT 2021) show, and what does it imply about the minimum wage spillover channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Derenoncourt et al. (2021) estimate a cross-employer wage elasticity of 0.26: when Amazon raised wages by approximately 18.1 percent, competitors raised wages by 4.7 percent on average. The model replicates this by treating Amazon as the largest (or second-largest) firm in each market, exogenously narrowing its markdown by a fraction ζ calibrated to deliver an 18.1 percent wage increase. Competitors in the model raise wages through the strategic interaction mechanism: Amazon&amp;rsquo;s higher wage and market share tightens competitors&amp;rsquo; residual supply curves, inducing them to narrow their own markdowns. The model matches the 0.26 cross-employer elasticity when Amazon is the largest firm in markets with at least 36 competitors, or the second-largest in markets with at least 12. Critically, the authors note that this empirical evidence concerns responses to a &lt;em&gt;large&lt;/em&gt; firm raising wages; for minimum wages the question is whether &lt;em&gt;large&lt;/em&gt; firms respond to their small wage competitors, which the model shows they do not substantially, because small firms have negligible market shares.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper separate efficiency from redistribution, and what is the key methodological innovation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper gives the government access to budget-neutral, unrestricted lump-sum transfers across households in addition to the minimum wage. With transfers available, the government can use them to meet any redistributive objective encoded in arbitrary social welfare weights. Whatever is left for the minimum wage to do must be purely efficiency-improving. The paper shows (via aggregation theorems) that optimal lump-sum transfers can be computed in closed form for any social welfare weights, and that the social welfare maximizing allocation subject to transfers can be decentralized by transfers that sum to zero across households. Under this framework, the efficiency-maximizing minimum wage lies between $7.50 and $10.00 per hour regardless of whether utilitarian, Negishi, or 97 percent high-school-weighted social welfare functions are used — collapsing the original $0–$31 range to a tight interval.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How are Negishi weights computed, and why are they important for interpreting the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The Negishi weights are the social welfare weights under which a planner would choose the observed competitive equilibrium with zero lump-sum transfers. They are computed by inverting the planner&amp;rsquo;s first-order conditions: for the competitive equilibrium to be optimal under some set of weights, the implied consumption ratios must match observed data. The calibrated Negishi weights assign a combined weight of approximately 62 percent to college workers and owners, who constitute only 35 percent of the population. This means the competitive equilibrium is disproportionately aligned with higher-income households. A utilitarian planner, which weights households by population shares, therefore sees large scope for redistribution toward non-college workers — which is exactly why the utilitarian-optimal minimum wage is $15.12 and why 94 percent of its welfare gains come from redistribution rather than efficiency.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the quantitative welfare gains from the efficiency-maximizing minimum wage, and how small are they relative to the potential gains from eliminating monopsony?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: With optimal lump-sum transfers, the welfare gains from the efficiency-maximizing minimum wage are approximately 0.16–0.20 percent in consumption-equivalent terms, robust across social welfare weight specifications, Frisch elasticity variations, and regional decompositions. The welfare gains associated with an economy in which all firms&amp;rsquo; markdowns are set to one (no labor market power at all), also evaluated with optimal transfers, are 15.26 percent in consumption-equivalent terms. The efficiency-maximizing minimum wage therefore recovers approximately 1–2 percent of the potential welfare gains from eliminating monopsony. Equivalently, the efficiency gains correspond to roughly a 0.1 percent increase in TFP. These gains are small despite the model matching all empirical evidence on the channels through which efficiency gains could occur.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do employment effects of minimum wages vary by market concentration, and why?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In concentrated markets (upper tercile of HHI), firms have larger monopsony markdowns, so a binding minimum wage pushes them into Region II — where employment expands — over a wider range of minimum wage values before entering Region III. This produces large, positive employment effects in concentrated markets. In less concentrated markets, firms already have narrow markdowns (they are closer to competitive), so even small minimum wage increases push them into Region III, where employment contracts. The model replicates the statistically significant positive effects in high-concentration markets and negative effects in low-concentration markets documented by Azar et al. (2019), for initial minimum wages below approximately $8/hr. At higher initial minimum wages, however, even high-concentration markets exhibit negative employment effects as more firms enter Region III.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the robustness exercise for Mississippi reveal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Mississippi has the lowest per capita income in the US, and a $15 minimum wage would bind for 41.3 percent of its workers (versus 29.4 percent nationally). Despite this, the model finds that Mississippi would benefit from a $15 federal minimum wage under utilitarian weights, and the Mississippi-specific optimal minimum wage is $14.89 — nearly identical to the national optimum. The reason is an offsetting compositional effect: while Mississippi has lower average wages (pushing toward a lower optimal), it has a larger share of high-school graduates (63 percent versus 52.8 percent nationally) who prefer higher minimum wages (around $17 in the model). These two forces wash out, producing a stable optimal close to the national figure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What happens to common empirical proxies for inequality and worker power as the minimum wage rises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The college–non-college log wage premium declines from 0.53 to 0.43 (a fall of roughly one-fifth) as the minimum wage rises from $7.50 to $15. The cross-sectional variance of log wages falls by nearly half over this range, driven equally by declining within- and between-type inequality. The aggregate labor income share rises by approximately 3 percentage points, and the share of output created in non-high-school jobs paid to non-high-school workers rises by 7 percentage points. All of these proxies are monotonically improving in the minimum wage throughout, even as aggregate welfare under the model&amp;rsquo;s social welfare function is hump-shaped and declining past the optimum. The paper concludes that observations of declining inequality or a rising labor share are consistent with falling welfare, so these proxies cannot serve as reliable welfare indicators.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the short-run (fixed-capital) analysis differ from the long-run baseline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the short run, capital at each firm is fixed at the type-specific level chosen under a zero minimum wage. This creates sharper decreasing returns in labor (parameter γα rather than α̃), overhead costs that can make operation unprofitable, and a narrower range of minimum wages over which firms remain in Region II. The result is that firms in the short run enter Region III at lower minimum wages than in the long run, limiting the range of efficiency gains. Quantitatively, the efficiency-maximizing optimal minimum wage declines by approximately $1 under utilitarian weights (from about $10 to about $9 in the short-run exercise) and by only about $0.20 under Negishi weights. The robustness conclusion is that the difference between short- and long-run optimal minimum wages is modest, and the main finding that efficiency gains are small is preserved.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Shadow wage (w̃ᵢⱼ):&lt;/strong&gt; The effective wage that rationalizes a firm&amp;rsquo;s equilibrium employment in the presence of a minimum wage. When labor is rationed at firm ij (Region III), the shadow wage equals the actual minimum wage multiplied by a rationing factor pᵢⱼ &amp;lt; 1, where pᵢⱼ is derived from the Lagrange multiplier on the household&amp;rsquo;s rationing constraint. The shadow wage is allocative — it determines labor supply decisions — while the observed minimum wage wage is not. When the rationing constraint is slack (Regions I and II), the shadow wage coincides with the observed wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shadow markdown (µ̃ᵢⱼ):&lt;/strong&gt; The ratio of a firm&amp;rsquo;s shadow wage to its marginal revenue product of labor. In Region I (unconstrained), this equals the standard monopsony markdown. In Region II (constrained, on the labor supply curve), the shadow markdown narrows as the minimum wage increases, moving the firm toward its efficient employment level. In Region III (constrained, on the labor demand curve), the shadow markdown equals the rationing multiplier pᵢⱼ and widens, reflecting efficiency losses from rationing. An aggregate shadow markdown µ̃ is computed as a productivity-weighted average of firm-level shadow markdowns across all firms in the economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation wedge (ω):&lt;/strong&gt; A productivity-weighted measure of how well employment is allocated across firms. In an efficient allocation with identical shadow markdowns, ω = 1. When high-productivity firms have wider markdowns than low-productivity firms (the baseline oligopsony outcome), ω &amp;lt; 1 because employment is directed away from productive firms. A minimum wage can improve ω by shrinking low-productivity firms but worsens it when high-productivity firms enter Region III and are over-rationed relative to medium-productivity firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Oligopsony with Cournot competition:&lt;/strong&gt; The specific form of labor market power in this model. In each local labor market (defined as a NAICS 3-digit industry × commuting zone cell), a finite number of firms compete strategically in employment quantities, taking their competitors&amp;rsquo; employment levels as given (Cournot assumption). Each firm has an upward-sloping labor supply curve derived from nested CES household preferences, and exercises a markdown on the marginal revenue product of labor. This differs from monopsony (one firm) or perfect competition (infinitely many firms), and generates both direct effects and spillover effects of minimum wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negishi weights:&lt;/strong&gt; The vector of social welfare weights under which the observed competitive equilibrium allocation would be the solution to a social planner&amp;rsquo;s problem with zero lump-sum transfers. In this model, the calibrated Negishi weights assign roughly 62 percent combined weight to college workers and owners (who constitute only 35 percent of the population), reflecting the fact that the market equilibrium allocates a disproportionate share of consumption to high-income households. The Negishi weights are used both to identify the gap between market outcomes and utilitarian objectives (motivating redistribution) and as one alternative normative benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency-maximizing minimum wage:&lt;/strong&gt; The minimum wage that maximizes social welfare when the government additionally has access to budget-neutral lump-sum transfers across households. Because transfers can be optimized to handle any redistributive objective encoded in any arbitrary social welfare weights, the minimum wage under this framework serves solely to improve productive efficiency. In the calibrated model, the efficiency-maximizing minimum wage is approximately $7.50–$10.00 per hour, robust to social welfare weight specifications, Frisch elasticity variations (ϕ ∈ {0.30, 0.86}), and regional income differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rationing constraint (n̄ᵢⱼₖ):&lt;/strong&gt; A firm-specific, type-specific upper bound on the labor a household may supply to a firm in equilibrium. These constraints are taken as given by households and determined in equilibrium by firms&amp;rsquo; labor demand decisions. When the minimum wage is above the firm&amp;rsquo;s competitive wage (Region III), the firm&amp;rsquo;s labor demand is less than what households would want to supply at that wage, so the rationing constraint binds. The binding rationing constraint generates the shadow wage discount (pᵢⱼ &amp;lt; 1) and is the mechanism by which high minimum wages reduce efficiency in the model.&lt;/p&gt;</description></item><item><title>Mis(sed) Diagnosis: Physician Decision Making and ADHD</title><link>https://macropaperwarehouse.com/papers/missed-diagnosis-physician-decision-making-and-adhd/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/missed-diagnosis-physician-decision-making-and-adhd/</guid><description>&lt;p&gt;This paper develops and estimates a structural model of ADHD diagnosis to decompose the mechanisms driving the observed 2.3:1 male-to-female diagnostic difference in the United States. The research question is: to what extent does the large gender gap in ADHD diagnosis reflect true differences in symptom prevalence, versus patient-side utilization costs, versus physician decision-making under uncertainty? The setting is particularly well-suited to this question because DSM-V diagnostic guidelines for ADHD are explicitly gender-neutral, making any gender difference in physician thresholds a detectable deviation from uniform clinical rules.&lt;/p&gt;
&lt;p&gt;The data come from de-identified electronic health records from a large Arizona healthcare system covering January 2014 through September 2017. The sample encompasses 36,193 unique encounters for approximately 11,070 pediatric patients. The raw male-to-female diagnostic ratio in the data is 2.32:1 (7.2% of males vs. 3.1% of females receive a clinical ADHD diagnosis). This gap persists after controlling for demographics, general healthcare utilization, and mental health utilization in reduced-form regressions, motivating the structural approach.&lt;/p&gt;
&lt;p&gt;Because two key variables — whether a patient received a behavioral assessment (Qi) and the ADHD match signal observed by the physician (xi) — are not directly recorded in the EHR, the author constructs them from clinical doctor note text. A random forest machine learning classifier trained on labeled appointments predicts behavioral assessment take-up for unlabeled encounters; approximately 20.8% of children are predicted to have received a behavioral assessment (23.2% of males vs. 18.3% of females). The ADHD match signal is constructed via an adjusted Bag-of-Words cosine similarity measure comparing each patient&amp;rsquo;s aggregated note text to the DSM-V symptom list, rescaled to [0,1]. The average signal is 0.319 overall, with males averaging 0.326 and females 0.311.&lt;/p&gt;
&lt;p&gt;The structural model has three stages. First, patients/caregivers decide whether to schedule a behavioral assessment, a function of underlying latent ADHD risk (vi) and mental healthcare utilization costs (ci). Second, conditional on assessment, the physician receives a noisy signal of vi and updates beliefs via Bayesian learning; signal quality ρ governs diagnostic uncertainty. Third, the physician diagnoses ADHD if posterior risk exceeds a gender-specific diagnostic threshold τ. Population mean ADHD risk (μ) is identified using regression-adjusted initial primary care provider referral rates as a quasi-exogenous cost-shifter — patients of high-referral-rate providers select into assessment less selectively, so their observed signals approach population mean risk. This extrapolation approach follows Arnold et al. (2022).&lt;/p&gt;
&lt;p&gt;The structural parameter estimates reveal that male and female children have similar but slightly different mean ADHD risk (μm = 0.290 vs. μf = 0.262) and similar mean utilization costs (cm = 0.116 vs. cf = 0.109). The most striking differences are in physician parameters: signal quality is lower for male patients (ρm = 0.479 vs. ρf = 0.552), indicating higher diagnostic uncertainty for boys; and diagnostic thresholds are substantially lower for male patients (τm = 0.257 vs. τf = 0.312), meaning physicians are willing to diagnose ADHD in boys with lower posterior risk.&lt;/p&gt;
&lt;p&gt;Counterfactual decomposition simulations attribute approximately 20–25% of the 2.32:1 diagnostic gap to underlying differences in ADHD risk, approximately 20% to differences in selection into behavioral assessments, and the remaining majority — approximately 55–60% — to physician decision-making. Within physician decision-making, differences in diagnostic thresholds alone account for roughly two-thirds of the overall diagnostic gap.&lt;/p&gt;
&lt;p&gt;The paper offers economic rationales for why gender-specific thresholds may be consistent with physician rationality despite uniform guidelines: higher diagnostic uncertainty for boys justifies lower thresholds under Bayesian updating; hyperactive/impulsive symptoms predominant in boys impose larger classroom externalities (Aizer, 2008); and female patients show higher rates of internalizing co-morbidities (anxiety, depression) that may reduce the marginal benefit of an additional ADHD diagnosis. A type-specific threshold extension finds that for male patients the threshold for hyperactive/impulsive symptoms is significantly lower than for inattentive symptoms, consistent with salience of externally disruptive behaviors. These rationalizations do not vindicate the gap as fully guideline-consistent, but suggest physicians may be responding to real heterogeneity in external costs and co-morbidity patterns.&lt;/p&gt;
&lt;p&gt;Q: What is the main research question and why is ADHD a useful setting?
A: The paper asks what mechanisms produce the 2.3:1 male-to-female ADHD diagnostic difference: true symptom prevalence, patient utilization costs, or physician decision-making. ADHD is well-suited because (1) clinical guidelines (DSM-V) are explicitly gender-neutral and require the same symptom count threshold regardless of sex; (2) diagnosis is based on subjective behavioral assessment rather than objective testing, creating substantial physician discretion; and (3) both missed and excess diagnosis carry meaningful costs — missed diagnosis limits educational accommodations; excess diagnosis exposes children to Schedule II controlled substances.&lt;/p&gt;
&lt;p&gt;Q: What data does the paper use and what are the key descriptive facts?
A: The data are de-identified electronic health records from a large Arizona healthcare system, 2014–2017, covering 36,193 encounters for 11,070 pediatric patients aged 5 and above. Overall ADHD diagnosis rate is 5.2%, with males at 7.2% and females at 3.1%, a 2.32:1 ratio that matches national levels. Approximately 49.5% of the sample is Hispanic, which the author notes contributes to a below-national-average overall diagnosis rate. The gender diagnostic gap persists even after controlling for demographics, general healthcare utilization, and mental health utilization in reduced-form regressions.&lt;/p&gt;
&lt;p&gt;Q: How does the paper construct the behavioral assessment indicator (Qi) and the ADHD match signal (xi)?
A: Qi is constructed using a random forest classifier trained on doctor notes from appointments where assessment status is known with near-certainty (ADHD diagnosis or DSM-V comorbid diagnosis = positive; non-mental-health diagnosis code for patients with no mental health history = negative). The classifier uses 41 features including note length and top-20 word frequencies for each label class. xi is constructed via an adjusted Bag-of-Words cosine similarity between each patient&amp;rsquo;s combined behavioral assessment notes and the DSM-V symptom list, separately for inattentive and hyperactive/impulsive sub-types, taking xi = max{xi1, xi2}. The average xi is 0.319 (males 0.326, females 0.311) in the behavioral assessment subsample.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy for recovering population mean ADHD risk (μ)?
A: Because xi is observed only for endogenously selected patients, the observed sample mean overestimates population mean risk. The author uses regression-adjusted referral rates of each patient&amp;rsquo;s initial primary care provider (IPCP) as a quasi-exogenous cost-shifter satisfying (a) relevance — IPCP referral intensity lowers patient scheduling costs — and (b) independence from patient ADHD risk vi, since IPCPs are typically chosen before behavioral symptoms develop and only 28% of IPCPs in the sample ever diagnose ADHD themselves. Population mean risk is then recovered by extrapolating the relationship between IPCP referral propensity and average observed xi to propensity = 1, following Arnold et al. (2022). The maximum observed IPCP referral propensity is only about 0.75, so the estimate requires extrapolation beyond the observed support.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated structural parameters and what do they imply?
A: Mean ADHD risk is μm = 0.290 vs. μf = 0.262 — males have modestly higher underlying risk. Mean utilization costs are cm = 0.116 vs. cf = 0.109 — nearly identical across genders. Signal quality (diagnostic certainty) is lower for males: ρm = 0.479 vs. ρf = 0.552, indicating physicians face more diagnostic uncertainty when assessing boys. Most importantly, diagnostic thresholds are lower for males: τm = 0.257 vs. τf = 0.312, meaning physicians diagnose ADHD in boys at a lower required posterior risk level, consistent with viewing missed diagnosis as relatively more costly for male patients.&lt;/p&gt;
&lt;p&gt;Q: How much of the 2.32:1 diagnostic gap can be attributed to each mechanism?
A: Counterfactual simulations decompose the gap as follows: differences in underlying ADHD risk distribution account for approximately 20–25% of the diagnostic difference; differences in selection into behavioral assessments (utilization costs operating through assessment rates) account for approximately 20%; and physician decision-making differences account for the remaining majority, approximately 55–60%. Within physician factors, differences in diagnostic thresholds (τm &amp;lt; τf) are the single largest contributor, explaining roughly two-thirds of the overall male/female diagnostic gap.&lt;/p&gt;
&lt;p&gt;Q: What do the type-specific threshold estimates reveal?
A: When the baseline model is extended to allow separate diagnostic thresholds for inattentive vs. hyperactive/impulsive symptom sub-types, male patients show significantly lower thresholds for hyperactive/impulsive symptoms relative to inattentive symptoms (τ^HI_m &amp;lt; τ^Inatt_m). This is consistent with the hypothesis that more externally salient and disruptive symptoms carry larger classroom externalities, which physicians may implicitly factor into diagnosis decisions (following Aizer, 2008). For female patients, the threshold differences across symptom types are smaller and less statistically significant.&lt;/p&gt;
&lt;p&gt;Q: What economic rationales does the paper offer for gender-specific diagnostic thresholds despite uniform guidelines?
A: Three mechanisms are identified. First, higher diagnostic uncertainty for males (lower ρm) implies that under symmetric costs, Bayesian-rational physicians should set lower thresholds when the signal is noisier — this alone partially rationalizes the threshold gap. Second, hyperactive/impulsive symptoms predominant in boys impose greater externalities on classroom peers (Aizer, 2008), increasing the social benefit of diagnosis for boys on the margin. Third, females show substantially higher rates of co-morbid internalizing conditions (anxiety, depression) whose treatment may mitigate ADHD-related behaviors or whose interaction with stimulant medication makes the marginal ADHD diagnosis less beneficial for girls (Currie et al., 2014). These factors together suggest physicians may be responding to genuine heterogeneity in net diagnosis benefits, even if their behavior deviates from gender-neutral clinical guidelines.&lt;/p&gt;
&lt;p&gt;Q: What share of the 2.3:1 national diagnostic gap is consistent with genuine symptom prevalence differences?
A: Simulations indicate that only about 20–25% of the 2.32:1 male/female diagnostic difference can be explained by the underlying difference in ADHD risk distributions. The majority — roughly 75–80% — reflects factors beyond true prevalence: selection into care and, most substantially, physician decision-making differences including both signal quality and diagnostic thresholds.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications?
A: The findings suggest that targeted interventions in physician awareness and clinical training are likely more effective than generic awareness campaigns, since the dominant driver of the diagnostic gap is physician threshold-setting rather than symptom prevalence. Structured decision support tools or updated training that make physicians aware of gender-specific diagnostic patterns could reduce medically unwarranted diagnostic differences. Policies targeting patient-side access barriers (the ~20% explained by selection) remain relevant but secondary. The roughly 20–25% of the gap attributable to genuine symptom prevalence differences is, by construction, guideline-consistent and should not be targeted for elimination.&lt;/p&gt;
&lt;p&gt;Q: What are the methodological contributions?
A: The paper makes three methodological contributions. First, it develops a structural model of mental health diagnosis that explicitly incorporates endogenous patient selection — a feature absent from standard physician decision-making models — which is shown empirically important. Second, it applies machine learning and NLP to clinical doctor note text to construct key unobserved clinical variables (behavioral assessment indicator and ADHD match signal) that are unavailable as structured data in EHRs. Third, the identification of population mean health risk uses a quasi-exogenous variation approach (IPCP referral rates) analogous to Arnold et al. (2022)&amp;rsquo;s method for measuring racial discrimination in bail decisions, adapted here to a continuous health risk setting with endogenous selection.&lt;/p&gt;
&lt;p&gt;Diagnostic threshold (τ_θ): The gender-specific posterior ADHD risk level above which a physician chooses to diagnose ADHD. Set ex-ante, it reflects the physician&amp;rsquo;s perceived tradeoff between the costs of over-diagnosis (misdiagnosis) and under-diagnosis (missed diagnosis). A lower threshold implies the physician views missed diagnosis as relatively more costly for that patient group. By construction, uniform clinical guidelines imply a single threshold independent of patient gender.&lt;/p&gt;
&lt;p&gt;ADHD match signal (x_i): A physician-observed, noisy signal of a patient&amp;rsquo;s true latent ADHD risk (v_i), observed only conditional on the patient receiving a behavioral assessment. In estimation, it is proxied via a cosine similarity measure between the patient&amp;rsquo;s aggregated clinical doctor note text and the DSM-V symptom list, constructed separately for inattentive and hyperactive/impulsive sub-types.&lt;/p&gt;
&lt;p&gt;Signal quality / diagnostic uncertainty (ρ_θ): The correlation between the physician&amp;rsquo;s observed ADHD match signal and the patient&amp;rsquo;s true ADHD risk. Higher ρ means the physician&amp;rsquo;s signal is more informative and diagnostic uncertainty is lower. In the Bayesian updating framework, higher ρ implies the physician places more weight on the observed signal relative to the prior.&lt;/p&gt;
&lt;p&gt;Mental healthcare utilization cost (c_i): The composite of all patient/caregiver factors that affect the decision to schedule a behavioral assessment net of child symptom level. Includes non-monetary barriers such as time constraints, distance, stigma, and information from primary care providers during wellness visits; does not include monetary out-of-pocket costs since insurance typically covers behavioral assessments.&lt;/p&gt;
&lt;p&gt;Initial Primary Care Provider (IPCP) referral rate: The regression-adjusted share of a given PCP&amp;rsquo;s patients who ultimately receive a behavioral assessment at some point in the sample. Used as a quasi-exogenous cost-shifter that influences patient scheduling costs without being correlated with patient ADHD risk, enabling identification of population mean ADHD risk via extrapolation.&lt;/p&gt;
&lt;p&gt;Latent ADHD risk (v_i): An unobserved continuous measure of a child&amp;rsquo;s underlying ADHD-related behavioral symptoms, drawn from a gender-specific normal distribution N(μ_θ, σ²_θ). A child&amp;rsquo;s true ADHD status is Si = 1(v_i &amp;gt; v̄), where v̄ is the DSM-V minimum symptom threshold, defined identically for boys and girls.&lt;/p&gt;
&lt;p&gt;Adjusted Bag-of-Words (BOW) cosine similarity: The NLP method used to construct the ADHD match signal proxy. Patient notes are tokenized into uni-grams and bi-grams after preprocessing (spell check, abbreviation replacement, part-of-speech tagging, synonym replacement), and tf-idf weighted. The cosine similarity between the resulting document vector and the DSM-V symptom text vector is computed separately for each ADHD sub-type and rescaled to [0,1].&lt;/p&gt;</description></item><item><title>Misspecified Expectations among Professional Forecasters</title><link>https://macropaperwarehouse.com/papers/misspecified-expectations-among-professional-forecasters/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/misspecified-expectations-among-professional-forecasters/</guid><description>&lt;p&gt;Analyzing panel data from the U.S. Survey of Professional Forecasters (SPF, 1992Q1–2019Q4, 77 forecasters, 1,520 forecaster-quarter observations), Julio Ortiz finds that a &amp;ldquo;misspecified expectations&amp;rdquo; model — in which forecasters perceive an AR(2) data-generating process to be an AR(1), causing them to misperceive its underlying persistence — tends to outperform a noisy-information rational benchmark and two leading non-FIRE alternatives (overconfident and diagnostic expectations) when fit to forecast errors and revisions. The models are estimated by maximum likelihood and ranked using forecast-encompassing weights; for the baseline real GDP growth case, misspecified expectations earns the largest encompassing weight (0.539 vs. 0.462 for diagnostic, ~0 for rational and overconfident) and the highest log-likelihood. Across 14 macroeconomic variables, misspecified expectations provides the best fit for most series both in-sample and out-of-sample, though diagnostic expectations fits better for some (e.g., GDP deflator, industrial production, real residential investment) and rational expectations fits the unemployment rate best. The author argues misspecified expectations succeeds in part because its bias enters both the prediction and updating equations, producing overreaction to new information plus overextrapolation across horizons, which makes forecast errors longer-lived; he concludes it can serve as a &amp;ldquo;suitable approach&amp;rdquo; / useful benchmark to model professional-forecaster expectation formation, while emphasizing the results are specific to the context of professional forecasting and may not carry over to household or firm expectations.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-question-does-the-paper-address"&gt;Q1. What question does the paper address?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper undertakes a formal comparison of competing non-FIRE theories of expectation formation to move toward establishing a benchmark non-FIRE model in the context of professional forecasting.&lt;/strong&gt; Ortiz motivates this with the observation that survey forecast errors are predictably correlated with real-time information — a violation of full-information rational expectations (FIRE) — but that, as noted in Reis (2020), the literature &amp;ldquo;has not yet settled on a benchmark non-FIRE model.&amp;rdquo; The paper offers &amp;ldquo;a partial answer to this question.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q2-what-models-are-compared"&gt;Q2. What models are compared?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Four models are estimated: a noisy-information rational expectations baseline plus three biased non-FIRE models — overconfident expectations (Daniel et al., 1998), diagnostic expectations (Bordalo et al., 2020), and misspecified expectations (in the spirit of Fuster et al., 2010).&lt;/strong&gt; All are embedded in a common noisy-information environment where the latent variable is unobservable and forecasters update via a Kalman filter from a noisy private signal. Overconfidence has forecasters misperceive their signal noise as smaller than it is; diagnostic expectations introduces a representativeness distortion ϕ &amp;gt; 0 generating overreaction to recent news; misspecified expectations has forecasters treat an AR(2) process as an AR(1).&lt;/p&gt;
&lt;h3 id="q3-what-exactly-is-misspecified-expectations-in-this-paper"&gt;Q3. What exactly is &amp;ldquo;misspecified expectations&amp;rdquo; in this paper?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Misspecified expectations is a model in which the underlying state follows an AR(2) process but forecasters treat it as an AR(1), so they misperceive the true persistence of the data-generating process.&lt;/strong&gt; The author notes this version is &amp;ldquo;closest to natural expectations as modeled in Fuster et al. (2010),&amp;rdquo; with forecasters neglecting longer lags. Importantly, forecasters still understand the information structure. If the perceived persistence loads excessively onto the first lag, forecasters overextrapolate. The author flags three technical differences from Fuster et al. (2010): he does not model an AR(2) in levels with AR(1)-in-growth-rates forecasting; the perceived persistence is estimated from the data rather than defined as a function of the true autocorrelation parameters; and he does not define expectations as a weighted average of rational and naive AR(1) expectations.&lt;/p&gt;
&lt;h3 id="q4-what-data-and-sample-are-used"&gt;Q4. What data and sample are used?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The estimation uses U.S. SPF panel data from 1992Q1 to 2019Q4, yielding 77 unique forecasters and 1,520 forecaster-quarter observations for the baseline.&lt;/strong&gt; The 1992 start is chosen to avoid spanning different regimes and because the survey redefined output from GNP to GDP in 1992. The procedure requires unbroken observation sequences, so only each forecaster&amp;rsquo;s longest spell is kept, with a minimum spell length of eight quarters (because entry/exit may be non-random, per Engelberg et al., 2011). Real GDP growth is the baseline variable; 13 other macroeconomic variables are also estimated. Real-time forecast errors (not errors based on revised figures) are used, following the literature.&lt;/p&gt;
&lt;h3 id="q5-how-are-the-models-estimated-and-compared"&gt;Q5. How are the models estimated and compared?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The models are estimated via a three-step maximum likelihood procedure, and their relative fit is compared using forecast-encompassing weights (West, 2001; Harvey et al., 1998; West, 2006), supplemented by AIC and a Vuong (1989) non-nested likelihood-ratio test.&lt;/strong&gt; Step 1 estimates the fundamental process parameters (ρ₁, ρ₂, σ_w) from the macro time series and fixes them across models; step 2 estimates the signal-noise dispersion σ_v from the rational model and calibrates it across the other three; step 3 estimates each bias parameter (α_v, ϕ, ρ̂) by MLE on SPF data. This keeps fundamental and information parameters consistent across biased models so they are evaluated solely on the biases they generate, and makes identification transparent (notably, σ_v and α_v cannot be jointly identified in the overconfidence model). Encompassing weights are obtained from a constrained linear regression of realizations on model-based one-quarter-ahead forecasts, with weights summing to 1.&lt;/p&gt;
&lt;h3 id="q6-what-are-the-baseline-real-gdp-growth-results"&gt;Q6. What are the baseline real GDP growth results?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;For real GDP growth, the misspecified expectations model produces the highest log-likelihood and the largest encompassing weight, 0.539, versus 0.462 for diagnostic expectations and approximately 0.000 for both rational and overconfident expectations.&lt;/strong&gt; The fundamental process estimates imply relatively low persistence (first-order autocorrelation ρ₁ ≈ 0.434, second-order ρ₂ ≈ −0.006). The estimated bias parameters are: overconfidence ≈ 0.72, diagnosticity ≈ 0.23, and perceived persistence ρ̂ ≈ 0.564. Because ρ̂ ≈ 0.56 exceeds the estimated ρ₁ ≈ 0.43, the misspecified model implies forecasters overestimate the first-order autocorrelation and neglect the partial reversal in the second lag, generating overreactions. The signal-to-noise ratio implied by the estimated private noise dispersion is σ_w/σ_v ≈ 1.09. AIC rankings (and BIC) do not change the ordering relative to the maximized likelihoods.&lt;/p&gt;
&lt;h3 id="q7-does-the-result-hold-across-other-macroeconomic-variables"&gt;Q7. Does the result hold across other macroeconomic variables?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Across the 14 SPF macroeconomic variables, misspecified expectations provides the best in-sample fit for most series, but not all.&lt;/strong&gt; Diagnostic expectations registers larger encompassing weights for certain series — the GDP deflator (0.771), industrial production (1.000), and real residential investment (0.624). Rational expectations provides the best fit for the unemployment rate (0.745) and housing starts (in-sample). For the bulk of the remaining variables (e.g., CPI 0.859, payroll employment 1.000, real consumption 0.777, real federal spending 1.000, real GDP 0.539, real nonresidential investment 1.000, real state/local spending 1.000, 3-month Treasury bill 0.713, 10-year bond 0.746), misspecified expectations carries the largest weight. Overconfident expectations &amp;ldquo;does not yield particularly large encompassing weights for any variable.&amp;rdquo;&lt;/p&gt;
&lt;h3 id="q8-why-does-misspecified-expectations-fit-better-and-for-which-variables-especially"&gt;Q8. Why does misspecified expectations fit better, and for which variables especially?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The author finds that, among variables exhibiting overreactions, misspecified expectations tends to offer a better fit for less persistent series, because the scope for it to generate overreaction (ρ̂ − ρ₁) is greater when ρ₁ is low.&lt;/strong&gt; Unlike the alternatives, the persistence bias ρ̂ − ρ₁ can be positive or negative, allowing the model to account for both overreacting and underreacting variables; the alternative models cannot generate forecaster-level underreaction. Figure 2 plots the encompassing weight on misspecified expectations against the sum of autoregressive coefficients and suggests (with some exceptions) that less persistent variables have higher weight on misspecified expectations.&lt;/p&gt;
&lt;h3 id="q9-does-the-model-perform-out-of-sample"&gt;Q9. Does the model perform out of sample?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The misspecified expectations model also provides a better out-of-sample fit for more of the variables, estimated on 1992Q1–2005Q4 and evaluated on the latter half of the sample.&lt;/strong&gt; However, out of sample diagnostic expectations now outperforms for the GDP deflator (0.987), industrial production (0.959), payroll employment (0.813), and real federal government expenditures (0.591); overconfident expectations outperforms for the 10-year government bond (0.653); and rational expectations outperforms for housing starts (0.502) and the unemployment rate (1.000). The author cautions that these results do not imply forecasters could improve their forecasts in real time, because the MLE observations include contemporaneous individual and consensus forecast errors that are not known to forecasters when they issue forecasts; for the same reason, the results are &amp;ldquo;not inconsistent with&amp;rdquo; Eva and Winkler (2023) on the poor out-of-sample performance of error-predictability regressions.&lt;/p&gt;
&lt;h3 id="q10-could-the-apparent-advantage-of-misspecified-expectations-just-reflect-learning"&gt;Q10. Could the apparent advantage of misspecified expectations just reflect learning?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The author argues that learning about the data-generating process does not appear to drive the relative model rankings in favor of misspecified expectations, based on two exercises.&lt;/strong&gt; First, using the full pre-COVID sample (1968Q4–2019Q4) over 25-year rolling windows (three-year roll), the misspecified model outperforms diagnostic expectations in six of ten sub-samples and all models in five of ten, while diagnostic expectations wins four of ten — patterns that &amp;ldquo;do not indicate that learning over time favors misspecified expectations.&amp;rdquo; Second, splitting forecasters by &amp;ldquo;age&amp;rdquo;/tenure (a proxy for experience), misspecified expectations outperforms the others among experienced (above-median age) forecasters (encompassing weight 0.766, with overconfidence 0.234) and is dominant among inexperienced ones (1.000). The author concedes learning &amp;ldquo;is likely reflected in professional forecasts&amp;rdquo; but does not appear to drive the rankings.&lt;/p&gt;
&lt;h3 id="q11-what-additional-moments-does-misspecified-expectations-match"&gt;Q11. What additional moments does misspecified expectations match?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Beyond overall fit, the author shows in the appendix that misspecified expectations matches five features of the data — overreaction, underreaction, overshooting, persistent disagreement, and updating behavior — and is the only model generating delayed overshooting.&lt;/strong&gt; All three non-rational models generate individual-level overreaction (Bordalo et al., 2020 errors-on-revisions regression) and aggregate underreaction (Coibion-Gorodnichenko, 2015 consensus regression). But when simulating impulse responses, &amp;ldquo;only the misspecified expectations model generates a sign switch in the forecast error,&amp;rdquo; indicating delayed overshooting (Angeletos et al., 2020). The author reports &amp;ldquo;stronger evidence&amp;rdquo; favoring misspecified expectations on two further moments: it better generates persistent disagreement across horizons, and it better matches the relative weights forecasters place on priors versus news — because its bias also enters the prediction equation (not just the update equation), producing longer-lived errors.&lt;/p&gt;
&lt;h3 id="q12-what-are-the-scope-conditions-and-limitations-the-author-stresses"&gt;Q12. What are the scope conditions and limitations the author stresses?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The author emphasizes that the results are specific to the context of professional forecasting and that the relative model rankings &amp;ldquo;may be different&amp;rdquo; for household or firm expectations, or for micro-level expectations rather than aggregate forecasts.&lt;/strong&gt; He notes professional forecasters are arguably the most well-informed agents, so the literature has treated their predictions as informative about a lower bound on economy-wide information frictions and biases. The paper abstracts away from learning in the model setup and from theories that generate only underreaction. Models excluded from the comparison (e.g., imperfect memory, multi-frequency forecasting, asymmetric attention, learning) are set aside mainly because they cannot be flexibly nested into the common setting and would introduce additional parameters posing identification challenges.&lt;/p&gt;
&lt;h3 id="q13-what-does-the-author-conclude-and-recommend"&gt;Q13. What does the author conclude and recommend?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Ortiz concludes that misspecified expectations &amp;ldquo;can serve as a suitable approach&amp;rdquo; / useful benchmark to model expectation formation among professional forecasters for a variety of macroeconomic aggregates, while framing this as only &amp;ldquo;a partial answer&amp;rdquo; to the search for a non-FIRE benchmark.&lt;/strong&gt; He highlights a practical advantage: embedding this form of misspecified expectations into a quantitative model &amp;ldquo;only requires introducing two parameters into an otherwise standard model.&amp;rdquo; He also notes misspecification can arise either from a behavioral bias or because adopting parsimonious forecasting models is optimal (Branch and Evans, 2006; Pfajfar, 2013). A promising avenue for future research is whether evidence favors misspecified expectations in other settings.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;dl&gt;
&lt;dt&gt;&lt;strong&gt;Full-information rational expectations (FIRE)&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;The benchmark in which forecast errors are uncorrelated with any information in the forecaster&amp;rsquo;s time-t information set; the orthogonality conditions it implies &amp;ldquo;tend to be violated in the data,&amp;rdquo; motivating non-FIRE models.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Misspecified expectations&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;The paper&amp;rsquo;s focal bias — the true state follows an AR(2) process, xₜ = ρ₁xₜ₋₁ + ρ₂xₜ₋₂ + wₜ, but forecasters treat it as an AR(1), xₜ = ρ̂xₜ₋₁ + uₜ, misperceiving its persistence; forecasters retain the correct information structure. The bias enters both the predict and update equations.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Persistence bias (ρ̂ − ρ₁)&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;The gap between perceived AR(1) persistence and true first-order autocorrelation; positive values generate overextrapolation/overreaction, negative values generate underreaction, and its overreaction scope is larger when ρ₁ is low.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Overconfident expectations&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;Forecasters misperceive their private signal noise as smaller (σ̃_v = α_v σ_v, α_v ∈ [0,1]) than it truly is, placing excessive weight on new private information.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Diagnostic expectations&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;A representativeness-based distortion (Bordalo et al., 2020; Gennaioli-Shleifer, 2010) in which, with diagnosticity ϕ &amp;gt; 0, forecasters overweight outcomes representative relative to a &amp;ldquo;no news&amp;rdquo; reference scenario, generating overreaction to recent news.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Encompassing weight&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;The model-comparison metric — a weight wₖ from a constrained linear regression of realized one-quarter-ahead values on competing models&amp;rsquo; forecasts, with weights summing to one; a larger weight indicates a better-fitting model.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Delayed overshooting&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;The Angeletos et al. (2020) pattern of initial underreaction followed by later overreaction to a shock; in this paper, only misspecified expectations produces the sign switch in the forecast-error impulse response that signals it.&lt;/dd&gt;
&lt;dt&gt;&lt;strong&gt;Overreaction vs. underreaction&lt;/strong&gt;&lt;/dt&gt;
&lt;dd&gt;Individual-level overreaction is measured via the Bordalo et al. (2020) errors-on-revisions regression; aggregate/consensus-level underreaction via the Coibion-Gorodnichenko (2015) regression — the data exhibit both, and a successful non-FIRE model must reproduce both.&lt;/dd&gt;
&lt;/dl&gt;</description></item><item><title>Mixing It Up: Inflation at Risk</title><link>https://macropaperwarehouse.com/papers/mixing-it-up-inflation-at-risk/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/mixing-it-up-inflation-at-risk/</guid><description>&lt;p&gt;This paper introduces a Bayesian Gaussian mixture density regression framework that estimates the complete forecast distribution of inflation — not just selected quantiles — and decomposes the entire risk outlook into contributions from individual economic predictors. The methodology accommodates multimodality, skewness, and fat tails without parametric restrictions, and allows construction of risk measures calibrated to the central bank&amp;rsquo;s own loss function rather than generic percentile-based measures. Applied to the recent U.S. inflation surge, the framework finds that post-pandemic inflation risk was primarily driven by the recovery of the U.S. business cycle and surging commodity prices, while adjustments in monetary policy contributed negatively — partially mitigating the increase in right-tail inflation risk — and credit spreads also offset some risk. The Gaussian mixture structure enables fast MCMC estimation and produces well-calibrated density forecasts across a range of macroeconomic variables.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-key-methodological-contribution-relative-to-existing-inflation-at-risk-approaches"&gt;Q1. What is the key methodological contribution relative to existing inflation-at-risk approaches?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Existing approaches to macroeconomic at-risk measures focus on specific quantiles of the forecast distribution — typically the 5th or 25th percentile — discarding information contained in the rest of the distribution; this paper redirects attention to the full forecast distribution while retaining the nonparametric flexibility of quantile regression.&lt;/strong&gt; The Gaussian mixture density regression estimates a conditional distribution that is a weighted mixture of Gaussians, capturing multimodality, asymmetry, and fat tails simultaneously. The key innovation is decomposability: each predictor&amp;rsquo;s contribution to any region of the forecast distribution can be quantified, enabling a driver-level accounting of what generates tail risk in any given period.&lt;/p&gt;
&lt;h3 id="q2-what-does-the-us-application-reveal-about-the-inflation-surge"&gt;Q2. What does the U.S. application reveal about the inflation surge?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The framework attributes the increase in right-tail U.S. inflation risk during 2021–2023 primarily to surging commodity prices and the recovery of the domestic business cycle, while monetary policy tightening contributed negatively — its effect partially offset the upward pressure from commodity and cycle drivers.&lt;/strong&gt; Credit spreads also partially mitigated the risk. The decomposition implies that the dominant drivers of inflation risk were supply-side and aggregate-demand factors, and that monetary policy, when it tightened, reduced the right-tail risk as intended — providing quantitative support for the interpretation that policy was reactive but directionally correct.&lt;/p&gt;
&lt;h3 id="q3-how-does-the-framework-construct-policy-relevant-risk-measures"&gt;Q3. How does the framework construct policy-relevant risk measures?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The framework allows weighting probability mass over the forecast distribution by any user-specified loss function, including asymmetric central bank preferences, yielding risk measures that integrate the full distributional information in proportion to the policymaker&amp;rsquo;s actual valuation of different inflation outcomes.&lt;/strong&gt; A central bank that penalizes above-target inflation more heavily than below-target inflation (consistent with empirical evidence on CB loss functions) would weight the upper tail more, producing a risk statistic that is higher than a symmetric measure for the same distribution. This policy-preference-aligned risk measure could have provided a more accurate signal of the urgency of the 2021–2023 inflation risk than standard percentile measures.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;inflation at risk&lt;/strong&gt; : the quantile-based or distribution-based characterization of future inflation uncertainty; extended in this paper from a single quantile to the complete forecast distribution and its risk decomposition by driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;density regression&lt;/strong&gt; : a regression model in which the conditional distribution of the outcome — not just its mean or a specific quantile — is the object of estimation; the paper uses a Gaussian mixture density regression to capture non-standard distributional shapes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;risk decomposition&lt;/strong&gt; : the attribution of shifts in the full forecast distribution to individual predictor variables; the paper&amp;rsquo;s key tool for identifying which economic factors drive right-tail inflation risk in any period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CB-preference-aligned risk measure&lt;/strong&gt; : a summary statistic constructed by weighting probability mass over the forecast distribution by the central bank&amp;rsquo;s loss function; captures asymmetric preferences and goes beyond standard percentile measures.&lt;/p&gt;</description></item><item><title>Monetary Policy and the Drifting Natural Rate of Interest</title><link>https://macropaperwarehouse.com/papers/monetary-policy-and-the-drifting-natural-rate-of-interest/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-and-the-drifting-natural-rate-of-interest/</guid><description>&lt;p&gt;This paper analyzes how monetary policy should respond to a long-run natural interest rate that can drift permanently — following a bounded random walk with upper bound 3 percent and lower bound 0 percent — when the zero lower bound (ZLB) on nominal interest rates is a binding constraint. The central result is that the long-run neutral rate (the real policy rate consistent with stable inflation in long-run equilibrium) should fall more than one-for-one with the long-run natural rate as the latter approaches zero, because the mere risk of future ZLB episodes — even when the economy is currently away from the ZLB — imparts a persistent downward bias on inflation expectations that can only be offset by maintaining a pre-emptive expansionary bias. Quantitatively, the model implies that the neutral rate should be zero as soon as the long-run natural rate falls to 75 basis points — well above the near-zero estimates prevailing in the late 2010s — and that the ZLB would bind one-third of the time under optimal policy when the natural rate fluctuates between 0 and 3 percent. Price level targeting with a 10-basis-point upward drift closely approximates optimal commitment policy and has the advantage of not requiring knowledge of the natural rate level.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-empirical-fact-motivates-the-model"&gt;Q1. What empirical fact motivates the model?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Empirical analyses of the long-run natural rate — the real interest rate prevailing over a long-run equilibrium in which nominal rigidities are absent — consistently find that it is time-varying in a manner best described by a random walk, meaning it can drift without reverting to a constant long-run level.&lt;/strong&gt; The paper cites Holston, Laubach, and Williams (2017), Fiorentini et al. (2018), and Hamilton et al. (2016) as the main empirical references. Holston et al. (2017) place the long-run natural rate at between 0 and 1 percent in the U.S. and possibly slightly negative in the euro area as of 2016. The paper draws one central lesson: because the natural rate is time-varying and its future level is uncertain, a model with constant natural rate will give unreliable guidance for monetary policy, especially at low natural rate levels near zero.&lt;/p&gt;
&lt;h3 id="q2-what-is-the-model-and-what-are-the-key-equilibrium-concepts"&gt;Q2. What is the model and what are the key equilibrium concepts?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper embeds a new Keynesian model in which the long-run natural rate follows a bounded random walk with upper bound 3 percent and lower bound 0 percent, calibrated to post-WWII U.S. TFP data, and studies optimal monetary policy under commitment while imposing the zero lower bound.&lt;/strong&gt; A critical distinction separates two notions of the long-run equilibrium interest rate: the &amp;ldquo;long-run natural rate&amp;rdquo; (denoted ¯r) is the real rate that would prevail in flexible-price equilibrium, determined by fundamentals outside the central bank&amp;rsquo;s control; the &amp;ldquo;neutral rate&amp;rdquo; (r*) is the real policy rate consistent with stable inflation in the long run, which the central bank operationally targets. The two coincide in standard models with constant ¯r, but diverge in this paper because ZLB risk drives a wedge between them.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-main-theoretical-result"&gt;Q3. What is the main theoretical result?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Under optimal commitment, the neutral rate r&lt;/em&gt; should fall more than one-for-one with the long-run natural rate ¯r — that is, the central bank should maintain a negative gap (r&lt;/em&gt; &amp;lt; ¯r) that widens as ¯r falls toward zero — because permanent downward movements in ¯r make future ZLB binding episodes permanently more likely, creating a persistent downward bias on inflation expectations that requires pre-emptive accommodation even in periods when the ZLB is not currently binding.** This result contrasts with the existing literature on optimal commitment at the ZLB, which has emphasized forward guidance — the promise to maintain low rates even after the economy recovers from a ZLB episode — as the primary stabilization tool. The paper shows that forward guidance alone is not sufficient when ¯r can permanently drift lower, because each downward drift permanently raises the probability of future ZLB episodes, reducing the central bank&amp;rsquo;s scope for fulfilling future inflation promises.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-quantitative-implications"&gt;Q4. What are the quantitative implications?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;The model implies that the neutral rate r&lt;/em&gt; reaches zero when the long-run natural rate ¯r is at 75 basis points — a level that was well above the near-zero estimates of ¯r prevailing at the end of the 2010s — and that the ZLB binds one-third of the time under optimal policy when ¯r fluctuates between 0 and 3 percent.&lt;/em&gt;* The 75 basis-point threshold means that a central bank operating in an environment where ¯r has declined to its estimated late-2010s levels would already be constrained to a neutral rate of zero under optimal policy. The one-third ZLB frequency is higher than what would be predicted by models with constant ¯r at typical calibrations, reflecting the permanent nature of ¯r shocks and their cumulative effect on the neutral rate.&lt;/p&gt;
&lt;h3 id="q5-what-do-the-adjustment-dynamics-look-like-after-a-negative-r-shock"&gt;Q5. What do the adjustment dynamics look like after a negative ¯r shock?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Following a permanent reduction in ¯r, the real policy rate adjusts gradually rather than immediately — remaining temporarily above the new long-run neutral rate during the transition — implying that monetary policy is contractionary along the adjustment path and that a permanent decline in ¯r is followed by a temporary disinflation before the economy settles at the new r&lt;/em&gt;.&lt;/em&gt;* This history-dependence of optimal commitment policy means the central bank does not immediately jump to the new, lower r* after a ¯r shock; it moves gradually, making the short-run policy stance more contractionary than the long-run position. The temporary disinflation is consistent with the general principle of history-dependence of optimal policy under commitment.&lt;/p&gt;
&lt;h3 id="q6-what-role-does-price-level-targeting-play"&gt;Q6. What role does price level targeting play?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Price level targeting variants — particularly a rule with an optimally chosen upward drift of 10 basis points — closely approximate the economic outcomes achieved under optimal commitment policy in the model, with the practical advantage that such rules do not require the central bank to know or estimate the current level of the long-run natural rate ¯r.&lt;/strong&gt; The Eggertsson-Woodford (2003) price level target works well in models with constant ¯r by generating positive inflation expectations in the wake of deflationary ZLB episodes. Adding a small upward drift of 10 basis points strengthens this property under a drifting ¯r, because it provides additional buffer against the downward expectations bias that permanent ¯r drift generates. Under price level targeting rules, the neutral rate reaches the ZLB as soon as ¯r falls below 1 percent.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;long-run natural rate (¯r)&lt;/strong&gt; : the real interest rate prevailing over a long-run equilibrium in which nominal rigidities are absent; in this paper modelled as a bounded random walk with upper bound 3 percent and lower bound 0 percent, calibrated to post-WWII TFP data.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;neutral rate (r&lt;/em&gt;)&lt;/em&gt;* : the real policy rate consistent with stable inflation in the long run; distinct from ¯r in this paper because ZLB risk drives a negative gap (r* &amp;lt; ¯r) that widens as ¯r approaches zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;zero lower bound (ZLB)&lt;/strong&gt; : the constraint that nominal policy rates cannot fall below zero; in this model the reason that permanent reductions in ¯r create a persistent downward bias on inflation expectations even when the ZLB is not currently binding.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;expansionary bias&lt;/strong&gt; : the paper&amp;rsquo;s finding that optimal commitment policy should maintain r* &amp;lt; ¯r — a pre-emptive accommodation away from the ZLB — to offset the downward bias on inflation expectations created by the risk of future ZLB episodes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;price level targeting&lt;/strong&gt; : a monetary policy rule in which the central bank targets the price level rather than the inflation rate; shown in this paper to approximate optimal commitment policy and to have the practical advantage of not requiring knowledge of ¯r.&lt;/p&gt;</description></item><item><title>Monetary Policy, Employment Shortfalls, and the Natural Rate Hypothesis</title><link>https://macropaperwarehouse.com/papers/monetary-policy-employment-shortfalls-and-the-natural-rate-hypothesis/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monetary-policy-employment-shortfalls-and-the-natural-rate-hypothesis/</guid><description>&lt;p&gt;This paper examines optimal monetary policy under discretion when the loss function is asymmetric — placing greater weight on employment shortfalls than on equivalently sized employment strength. The model satisfies the natural rate hypothesis (NRH): monetary policy is neutral in the long run, so persistent accommodation of above-potential activity raises inflation expectations without permanently boosting employment. The central paradox the paper establishes is that an asymmetric shortfalls-oriented loss function, despite its stated goal of reducing shortfalls, exacerbates them: the mechanism runs through the NRH expectation-adjustment channel, which creates an inflationary bias structurally analogous to the Barro-Gordon result. Mandating a central bank objective that is more symmetric than the social loss function — a conservative-in-asymmetry design — lowers both the frequency of activity shortfalls and the inflationary bias. As a corollary, the analysis implies that monetary accommodation of labor market strength requires justifications beyond the asymmetric costs of shortfalls, such as permanent effects of strong labor markets on economic potential.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-the-asymmetric-loss-function-exacerbate-employment-shortfalls"&gt;Q1. How does the asymmetric loss function exacerbate employment shortfalls?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The mechanism runs through the natural rate hypothesis: under a loss function that places no weight on activity above potential, the optimal policy fully accommodates positive supply shocks by allowing above-potential output, but the NRH then raises the expectational baseline, making shortfalls more frequent as the perceived natural rate adjusts upward.&lt;/strong&gt; Because the central bank treats above-potential activity as costless, it does not resist the accumulation of above-potential output in good states; expectations of future activity then rise, effectively moving the benchmark against which shortfalls are measured, and making shortfalls a more common outcome. The asymmetric policy thus generates a self-defeating dynamic: attempts to minimize shortfalls through accommodation of strength create an expectational environment in which shortfalls are more frequent.&lt;/p&gt;
&lt;h3 id="q2-how-does-the-inflationary-bias-emerge"&gt;Q2. How does the inflationary bias emerge?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The inflationary bias is structurally analogous to the Barro-Gordon (1983) time-inconsistency result: the central bank&amp;rsquo;s asymmetric desire to reduce shortfalls leads it to ease policy more aggressively than a symmetric loss function would warrant, and this tendency transmits into persistently higher inflation through the NRH expectations-adjustment channel.&lt;/strong&gt; The classic Barro-Gordon mechanism operates through the desire to push output above its natural rate; here the analog is the desire to push activity above the shortfalls threshold. The paper&amp;rsquo;s model is constructed so that no Barro-Gordon bias exists in the baseline symmetric case, isolating the asymmetry as the sole source of the inflationary bias.&lt;/p&gt;
&lt;h3 id="q3-what-policy-prescription-follows-from-the-analysis"&gt;Q3. What policy prescription follows from the analysis?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper recommends mandating a central bank objective that is more symmetric than the social loss function, analogous to Rogoff&amp;rsquo;s (1985) conservative-central-banker result but applied to the dimension of asymmetry rather than the level of inflation aversion.&lt;/strong&gt; A mandate that requires the CB to weight above-potential and below-potential activity more equally than society does lowers both the frequency and depth of shortfalls and reduces inflationary bias, improving welfare relative to a CB that faithfully implements the asymmetric social preference. The paper further shows that optimal policy under this design does not accommodate fluctuations from aggregate demand shocks, implying that accommodation of labor market strength requires other justifications — such as permanent productivity effects — not the shortfalls-cost asymmetry alone.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;shortfalls asymmetry&lt;/strong&gt; : the specification in which the central bank&amp;rsquo;s or social loss function places greater weight on employment below its natural rate than on equivalently sized employment above it; the paper&amp;rsquo;s central object of analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;natural rate hypothesis (NRH)&lt;/strong&gt; : the assumption that monetary policy is neutral in the long run — persistent monetary accommodation does not permanently raise employment above its natural rate but does raise the price level; imposes the constraint that bounds the central bank&amp;rsquo;s ability to durably lower shortfalls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;inflationary bias&lt;/strong&gt; : the systematic tendency of a central bank operating under a shortfalls-oriented asymmetric loss function to allow above-target inflation on average; emerges in this model via the NRH expectations-adjustment channel, analogous to but distinct from the Barro-Gordon result.&lt;/p&gt;</description></item><item><title>Monopsony Makes Firms Not Only Small but Also Unproductive: Why East Germany Has Not Converged</title><link>https://macropaperwarehouse.com/papers/monopsony-makes-firms-not-only-small-but-also-unproductive-why-east-germany-has-not-converged/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/monopsony-makes-firms-not-only-small-but-also-unproductive-why-east-germany-has-not-converged/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;When employers face a trade-off between growing large and paying low wages — that is, when they have monopsony power — some productive employers will decide to acquire fewer customers, forgo sales, and remain small; these decisions have adverse consequences for aggregate labor productivity beyond the standard monopsony result that firms are too small. The paper documents that East German plants (compared to West German ones) face a steeper size-wage curve, invest less into marketing, and remain smaller, with the share of employment at plants with more than 249 employees standing at roughly 25% in East Germany versus 39% in West Germany in 2014 (and 31% versus 55% in manufacturing specifically). The steeper size-wage curve in East Germany is traceable to the historically determined underrepresentation of collective bargaining and union membership in small East German plants — a legacy of communist-era labor organization that caused union membership to collapse after reunification. The authors combine this evidence with a heterogeneous-plant model in which plants have product market power and choose how many customers to acquire subject to an upward-sloping size-wage schedule; two channels reduce aggregate productivity: a love-of-variety loss (fewer active plants means consumers bundle from a smaller variety of suppliers) and a compositional reallocation loss (labor is shifted from more productive to less productive plants, an effect exacerbated by product market power). When the model is calibrated to West Germany and the steeper East German size-wage trade-off is imposed, it predicts 10 percentage points lower aggregate labor productivity in East Germany — and for manufacturing, where East-West differences in plant size and the size-wage trade-off are particularly pronounced, the model predicts 18 percentage points lower productivity; in both cases the compression of the plant size distribution accounts for the largest share of the predicted productivity loss. The paper thus offers an explanation for why, more than thirty years after reunification, labor productivity and wages remain roughly 25% lower in the East German private sector despite uniform legal institutions across the two regions.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core mechanism by which monopsony power reduces aggregate productivity, and how does it differ from the standard &amp;ldquo;firms are too small&amp;rdquo; result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the standard monopsony account, firms face an upward-sloping labor supply curve and choose to employ fewer workers than the competitive optimum, so individual firms are below efficient scale. The paper identifies an additional, investment-distortion channel: plants must also decide how large a customer base to acquire, and doing so requires marketing expenditure as well as the labor to service additional customers — labor whose cost rises with plant size along the size-wage schedule. A steeper size-wage curve therefore makes customer acquisition more expensive at the margin, and some productive plants optimally choose to acquire fewer customers, forgo sales, and remain small. The new aggregate productivity loss stems from this distorted investment margin: plants that could generate high value added at large scale instead operate at sub-optimal customer networks, suppressing aggregate output through both a love-of-variety effect (fewer active large plants means consumers access a smaller product variety) and a misallocation effect (the compressed size distribution shifts employment toward less productive plants).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What empirical patterns do the authors document to link the East-West productivity gap to missing large plants and steeper size-wage curves?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors document three nested empirical facts using the German Structure of Earnings Survey (SES) pooled across 2006, 2010, and 2014, supplemented by administrative wage panel data (AWFP) and national accounts (VGR). First, East German labor productivity in the private non-primary sector is about 25% below West Germany&amp;rsquo;s and has not converged since roughly 1995. Second, the share of employment at large plants (&amp;gt;249 employees) is substantially smaller in the East, and this gap is present both cross-sectionally across survey years and conditionally: East German plants enter smaller and remain smaller over their life-cycles, so plant age does not explain the difference. Third, industries where missing large plants are most pronounced in East Germany relative to West Germany are also the industries with the largest East-West productivity and wage gaps — the employment-weighted correlation between the large-plant share gap and the productivity gap is 0.53 across industries. The steeper size-wage curve itself is documented using within-industry comparisons: on average the plant size elasticity of wages is one-fifth larger in East Germany, and those industries with a steeper East-West size-wage differential are also the industries with the most missing large plants and the lowest average wages in the East.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why is the steeper size-wage curve specific to East Germany, and why does it persist decades after reunification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In communist East Germany, trade unions did not have the role of representing worker interests; consequently, after reunification, union membership fell dramatically. The key institutional consequence is that collective bargaining coverage in East Germany is underrepresented specifically in small plants. Workers at small plants in East Germany are more likely to have individually rather than collectively bargained wages than their West German counterparts, whereas workers at large plants in both regions are more similarly covered. Because collective bargaining flattens the size-wage curve (larger plants pay a smaller premium over small plants&amp;rsquo; wages when both are covered by the same bargaining agreement), its absence in small East German plants produces a steeper gradient of wages with plant size in the East. This is a persistent structural feature rather than a transitional one: government policies and their enforcement are essentially uniform across regions, so the asymmetric bargaining coverage, which originates in communist-era institutional history, has not been erased by market forces or policy since 1990.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How is the model structured, and what are the three decision stages for plants?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is a static, long-run heterogeneous-plant framework that yields closed-form solutions. Within a period, plants face a three-stage decision problem. First, they decide whether to enter the market. Second, after entry, they choose how many customers to acquire, trading off additional sales revenue against marketing costs and the labor cost of servicing a larger customer base — a cost that rises with the number of customers because the upward-sloping size-wage curve means each additional worker hired requires a higher wage for all infra-marginal workers. Third, taking into account their product market power (each plant is a monopolistic competitor with its own customers), plants set prices to each customer and thereby determine how many workers they need. The size-wage schedule enters the second stage directly, so a steeper schedule reduces optimal customer acquisition across all plants, with the distortion being largest for the most productive plants (which would otherwise grow the largest).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Through what two channels does the steeper size-wage trade-off reduce aggregate labor productivity in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The first channel is a love-of-variety effect in the product market: because more productive plants acquire fewer customers and operate at smaller scale under a steeper size-wage schedule, the average consumer bundles goods from a smaller number of distinct plants, and aggregate efficiency falls through the standard CES love-of-variety mechanism. The second channel is a misallocation effect in the labor market: the steeper size-wage schedule compresses the employment distribution across plants, reallocating labor from more productive to less productive plants relative to the benchmark with a flatter schedule. The paper shows that this second channel is exacerbated by product market power, because plants with stronger pricing power respond more aggressively to the changed labor cost trade-off. In the model&amp;rsquo;s decomposition, the compression of the plant size distribution (the misallocation channel) accounts for the largest part of the predicted 10 percentage point productivity shortfall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What quantitative predictions does the model make, and how does it perform in untargeted moments?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is calibrated to two moments for West Germany: average plant size and the share of large plants (&amp;gt;249 employees). When the steeper East German size-wage trade-off is imposed without re-calibrating other parameters, the model predicts 10 percentage points lower aggregate labor productivity in East Germany — accounting for at least 10 of the roughly 25 percentage point observed gap. For the manufacturing sector alone, where East-West differences in plant size, the size-wage trade-off, and aggregate productivity are particularly pronounced, the calibrated model predicts 18 percentage points lower productivity. As an untargeted validation, the model also replicates the plant size distribution in East Germany, matching both the smaller average plant size and the relatively small number of large plants. These untargeted predictions provide additional support for the mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What alternative explanations for East Germany&amp;rsquo;s non-convergence does the paper rule out or place in context?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper addresses several confounds. In Appendix A, the authors show that East-West aggregate labor productivity differences are driven by differences in aggregate total factor productivity, not by labor quality differences, capital intensity differences, or capital quality differences — confirming within-country the finding that TFP explains a large fraction of productivity dispersion. The TFP differences are shown to be unlikely the result of greater labor market flexibility in West Germany or differences in industry composition. Appendix B shows that the East-West plant size distribution gap is not driven by differences in urbanization (West Germany has more metropolitan areas). The paper also addresses plant age: East German plants enter smaller and remain smaller at every age and across entry cohorts, ruling out the hypothesis that the size gap is purely a transitional legacy of the restructuring that destroyed many large East German plants at reunification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does this paper relate to the Heise and Porzio (2021) finding that plant productivity differences, not worker quality differences, drive the East-West wage gap?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Heise and Porzio (2021) use matched employer-employee data to document that plant productivity differences (as opposed to worker quality differences) account for most of the East-West wage differential, and they explain why low worker mobility does not remove these differences. The present paper complements this by providing an explanation for why plant productivity is lower in East Germany in the first place and why firm-level convergence does not occur: the steeper size-wage curve induced by the legacy of missing collective bargaining coverage in small East German plants distorts the investment and customer acquisition decisions of productive plants, keeping them small and unproductive. The two papers are thus complementary: Heise and Porzio take the plant productivity gap as given; Bachmann et al. endogenize it through the size-wage mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Size-wage curve:&lt;/strong&gt; The empirical relationship between plant size (measured by employment) and wages paid to workers, conditional on worker characteristics. A steeper size-wage curve means that the wage premium for working at a large plant relative to a small plant is larger. In this paper&amp;rsquo;s model, plants internalize that expanding their customer base and workforce requires paying higher wages to all workers (not just the marginal hire), making growth more costly when the size-wage curve is steeper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monopsony power (monopsonistic competition):&lt;/strong&gt; The market structure in which an individual employer faces an upward-sloping labor supply curve — i.e., it must raise wages to attract additional workers. The paper uses &amp;ldquo;monopsonistic competition&amp;rdquo; to describe a setting with many such employers, each with some wage-setting power, in contrast to oligopsony. The paper focuses on allocative effects of this power, not on normative efficiency questions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Customer capital / customer acquisition:&lt;/strong&gt; Plants must incur marketing expenses to build a customer base; each customer relationship generates a stream of sales but requires labor to service. The size of the customer network is a long-run investment decision. Under monopsonistic labor markets, the cost of expanding the customer base includes not only marketing expenses but also the higher wages that a larger workforce requires, making customer acquisition a margin that is distorted by labor market power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Love-of-variety effect:&lt;/strong&gt; A welfare loss that arises in models with monopolistic competition and CES preferences when the number of active product varieties declines. In this paper it applies to the product market: when plants remain small and acquire fewer customers, the effective number of distinct varieties consumed falls, reducing aggregate efficiency even holding plant-level productivity fixed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation / compressed size distribution:&lt;/strong&gt; A situation in which factors of production are not allocated to their highest-value uses. Here, the steeper size-wage curve induces productive plants to remain small, so labor that would otherwise be employed at high-productivity large plants is instead employed at lower-productivity small plants. The resulting compression of the plant size distribution — fewer very large plants, more mass in the middle — is both the key empirical fact and the primary quantitative driver of the predicted aggregate productivity shortfall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collective bargaining coverage:&lt;/strong&gt; The fraction of workers whose wages are set by collective agreements between employers (or employer associations) and trade unions, rather than by individual negotiation. The paper establishes that collective bargaining flattens the size-wage curve by compressing wages across plants of different sizes. The historically low collective bargaining coverage among small East German plants — a legacy of communist-era labor relations — is the institutional root cause of the steeper East German size-wage schedule.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on IZA Discussion Paper 15293. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Motivating banks to lend? Credit spillover effects of the Main Street Lending Program</title><link>https://macropaperwarehouse.com/papers/motivating-banks-to-lend-credit-spillover-effects-of-the-main-street-lending-program/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/motivating-banks-to-lend-credit-spillover-effects-of-the-main-street-lending-program/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Minoiu, Zarutskie, and Zlate ask whether participation in the Main Street Lending Program (MSLP)—a Federal Reserve emergency facility launched in mid-2020 to channel credit to small and mid-sized firms during the COVID-19 pandemic—caused banks to lend more &lt;em&gt;outside&lt;/em&gt; the program. The authors focus on credit spillover effects: did MSLP-participating banks ease standards and expand volumes on their general commercial and industrial (C&amp;amp;I) loan books, beyond the direct loans originated under the program itself?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional Context.&lt;/strong&gt; The MSLP opened for lender registration on June 15, 2020 and began accepting loan submissions on July 6, 2020, expiring December 31, 2020. Of $600 billion in available SPV capacity, only $16.05 billion was actually deployed, making overall take-up approximately 2.7% of capacity. Despite this, the program required participating banks to retain 5% of each loan&amp;rsquo;s credit risk while offloading 95% to the SPV, and charged borrowers LIBOR plus 300 bps. Registration rate among all Call Report banks was 11.7% (614 out of 5,242 banks), with participation rising steeply with bank size: from 6.5% of banks in the below-$1-billion asset group to 63.8% of banks with assets above $50 billion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology.&lt;/strong&gt; The analysis draws on multiple data sources: (a) supervisory Y-14Q H1 loan-level data covering C&amp;amp;I loans above $1 million commitments, reported by 32 bank holding companies (BHCs) that account for roughly three-quarters of total U.S. C&amp;amp;I loans; (b) Y-14Q A9 loan portfolio segment data for small business C&amp;amp;I loans (below $1 million commitments) from 22 BHCs; (c) quarterly Senior Loan Officer Opinion Survey (SLOOS) microdata for April, July, and October 2020, providing bank-level assessments of lending standard changes, loan terms, demand shifts, and stated reasons for tightening; (d) Dealscan syndicated loan originations for 262 banks (51 MSLP participants); and (e) bank balance sheet data from Call Reports, including the Ellul-Yerramilli risk management index (RMI) for 16 BHCs. The core empirical design is a difference-in-differences (DiD) comparing MSLP-participating vs. non-participating banks before (2020:Q1–Q2) and after (2020:Q3) program implementation. To address nonrandom selection, the authors instrument MSLP participation with three variables: (i) a dummy for banks that cited registration as &amp;ldquo;too burdensome&amp;rdquo; in the September 2020 supplementary SLOOS; (ii) a dummy for banks with prior experience pledging loan collateral at the Fed&amp;rsquo;s discount window; and (iii) a dummy for banks with prior experience pledging securities collateral at the discount window. Firm×quarter fixed effects absorb time-varying credit demand at the borrower level (Khwaja-Mian design), and bank×borrower fixed effects further control for relationship-specific lending patterns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Extensive Margin (Large Business Loans).&lt;/strong&gt; In the Y-14Q H1 data, MSLP banks were 30–32% more likely to renew existing loans than non-MSLP banks in 2020:Q3, with the probability of renewal 1.6–1.7 percentage points higher (against a sample average renewal rate of 5.3%). New loan originations were 22–27% more likely at MSLP banks, or 1.1–1.4 percentage points higher (against a sample average origination rate of 5.1%). 2SLS estimates are similar in magnitude to OLS, indicating selection bias is modest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Extensive Margin (Small Business Loans and Survey Data).&lt;/strong&gt; In the A9 small business segment data, MSLP lenders had 17.3% more small business loan accounts outstanding in 2020:Q3 than non-MSLP banks. In SLOOS microdata, MSLP banks were approximately 15 percentage points less likely to report tightening C&amp;amp;I lending standards in 2020:Q3 (conditional on demand controls), compared to an actual tightening rate of 37.5%. This effect is larger for small (more financially constrained) firms (16–17 percentage points) than for large firms (13–14 percentage points).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings — Intensive Margin.&lt;/strong&gt; On loan terms, MSLP banks charged spreads that were approximately 9 basis points lower on renewed/originated C&amp;amp;I loans in the Y-14Q data, and 13.5 basis points lower in the Dealscan syndicated loan sample, compared to non-MSLP banks in 2020:Q3. 2SLS estimates are somewhat larger (19–30 bps). In the Dealscan sample, MSLP banks also extended syndicated loans that were 11.2% larger (about $2.4 million more given a $22 million average loan size). Survey data confirm MSLP banks were less likely to tighten most individual loan terms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Aggregate Magnitude.&lt;/strong&gt; The authors estimate that, in the absence of the MSLP, total loan renewals and originations at Y-14Q reporting banks in 2020:Q3 would have been approximately 10% lower. Scaling to the broader banking sector, the estimated credit spillover effect is approximately $44.8 billion in C&amp;amp;I lending—nearly three times the $16.05 billion in direct MSLP loan purchases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; Survey and objective evidence both point to reduced risk aversion as the primary channel, rather than immediate balance sheet constraint relief. MSLP banks were significantly less likely to cite &amp;ldquo;reduced tolerance for risk&amp;rdquo; as a reason for tightening lending standards after the program&amp;rsquo;s introduction, while showing no differential propensity to cite capital or liquidity deterioration. Banks with higher risk management index scores (more risk-averse institutions) exhibited larger spillover effects on two of three lending margins. Indicators of immediate balance sheet tightness (excess capital cushions, cost of capital, core deposit reliance) do not predict larger spillovers, with a partial exception for lower excess capital and higher loan loss reserves — suggesting future rather than current balance sheet constraints may have played some role.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Robustness.&lt;/strong&gt; The backstop mechanism is explicitly tied to the program&amp;rsquo;s credibility period: the spillover effects are smaller in 2020:Q4, consistent with the Treasury&amp;rsquo;s November 19, 2020 announcement that the program would not be extended, which diminished its backstop role. Placebo regressions using 2018 and 2019 data find no differential lending behavior between MSLP and non-MSLP banks before the program, supporting parallel trends. Results are robust to controls for PPP participation, credit line drawdown exposure, loan loss provisioning, and bank-level loan portfolio cyclicality.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What precisely is the &amp;ldquo;spillover effect&amp;rdquo; that the paper measures, and how does it differ from the direct effect of the MSLP?&lt;/strong&gt;
A: The direct effect is the $16.05 billion in MSLP loans purchased by the SPV — credit extended specifically through the program. The spillover effect refers to changes in banks&amp;rsquo; general C&amp;amp;I lending behavior outside the program: renewals and originations of non-MSLP loans, changes in lending standards and terms for all business borrowers, and changes in small business loan volumes. The sample in the Y-14Q regression explicitly excludes MSLP loans themselves, so the estimates reflect only the indirect, broader credit effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What instruments does the paper use for MSLP participation, and why are they plausibly exogenous?&lt;/strong&gt;
A: Three IVs are employed: (1) a dummy for banks that cited program registration as &amp;ldquo;too burdensome&amp;rdquo; as a very important reason for not joining (from the September 2020 supplementary SLOOS); (2) a dummy for banks that pledged loan collateral at the Fed&amp;rsquo;s discount window in December 2019; and (3) a dummy for banks that pledged securities collateral at the discount window in the same period. The exclusion restriction argument is that (1) reflects banks&amp;rsquo; administrative capacity and prior Fed engagement rather than underlying balance sheet strength or lending appetite, and that (2) and (3) reflect familiarity with Fed collateral processes in ways that made a loan-based program easier to understand and join — without independently affecting lending standards or volumes in 2020:Q3.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How large are the spillover effects on the extensive margin of large corporate lending?&lt;/strong&gt;
A: In the Y-14Q H1 data across 32 BHCs, MSLP banks renewed loans 1.6–1.7 percentage points more frequently and originated new loans 1.1–1.4 percentage points more frequently in 2020:Q3, relative to non-MSLP banks. Against sample averages of 5.3% renewal rate and 5.1% origination rate, these translate to MSLP banks being 30–32% more likely to renew and 22–27% more likely to originate loans. The 2SLS estimates are broadly similar in magnitude, suggesting that self-selection bias in OLS is limited.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the estimated aggregate dollar spillovers from the MSLP?&lt;/strong&gt;
A: The paper calculates that, in the absence of the program, total loan renewals and originations at Y-14Q H1 MSLP banks in 2020:Q3 would have been lower by approximately $33.6 billion (derived from 44,274 bank-borrower pairs × 1.38 existing loans per pair × 3.06 percentage points of extra loan activity × $17.98 million average loan size). Scaling to all Y-14Q banks (MSLP and non-MSLP alike), the shortfall would represent roughly a 10% reduction in total 2020:Q3 loan renewals and originations. Extrapolating to the full banking sector (since Y-14Q banks cover about 75% of total C&amp;amp;I lending), and assuming similar spillover magnitudes for banks outside the sample, total MSLP spillovers amount to roughly $44.8 billion — approximately three times the $16.05 billion in direct MSLP loan purchases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the estimated effect on C&amp;amp;I lending standards using survey data?&lt;/strong&gt;
A: Using SLOOS microdata, the paper estimates that MSLP banks were approximately 15 percentage points less likely to tighten C&amp;amp;I lending standards in 2020:Q3 compared to non-MSLP banks, after controlling for demand conditions. The actual tightening rate in 2020:Q3 was 37.5%, meaning the counterfactual tightening rate absent the program would have been approximately 5 percentage points higher. In a further hypothetical where all SLOOS sample banks had participated, the counterfactual tightening rate would have been nearly 10 percentage points higher than actual.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Are spillover effects larger for small or large borrowers, and what does this imply?&lt;/strong&gt;
A: The SLOOS-based estimates show that MSLP banks were 16–17 percentage points less likely to tighten lending standards for small firms (annual sales below $50 million), compared to 13–14 percentage points less likely for large and middle-market firms — a statistically significant difference. The authors interpret this as consistent with the MSLP reducing risk aversion broadly, with the largest effect on borrowers facing greater credit constraints where uncertainty about creditworthiness was highest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What evidence supports the risk aversion (psychological backstop) mechanism over the balance sheet constraint mechanism?&lt;/strong&gt;
A: From SLOOS data, MSLP banks were significantly less likely (at the 1% level) to cite &amp;ldquo;reduced tolerance for risk&amp;rdquo; as a reason for tightening lending standards after the program&amp;rsquo;s introduction, while showing no differential likelihood of citing deteriorating capital or liquidity positions as reasons. Furthermore, splitting banks by the risk management index (RMI), the spillover effects are stronger for high-RMI (more risk-averse) banks on two of three lending outcomes. Conversely, proxies for immediate balance sheet constraints — excess capital cushions, core deposit ratios, equity issuance, and cost of capital — do not yield consistently stronger spillover effects for more constrained banks. The only partial exception is lower excess capital and higher loan loss reserves, which are associated with more loan renewals, suggesting future rather than current balance sheet constraints may have contributed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the risk management index (RMI), and how is it used here?&lt;/strong&gt;
A: The RMI is an index developed by Ellul and Yerramilli (2013) that captures the strength of a bank&amp;rsquo;s internal risk management function, constructed from variables including whether the bank has a chief risk officer (CRO), the CRO&amp;rsquo;s executive status and relative compensation, risk committee member experience, and meeting frequency. Available for 61 BHCs over 2011–2013, it is matched to 16 BHCs in the Y-14Q H1 sample and used as a pre-COVID proxy for institutional risk aversion. Banks above the median RMI show larger MSLP spillover effects on loan renewals and tightening standards, consistent with the interpretation that the MSLP reduced effective risk aversion more for banks that had higher baseline risk-consciousness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the authors address the concern that PPP participation — not MSLP participation — might drive the results?&lt;/strong&gt;
A: First, they test directly that MSLP participation does not predict outstanding PPP/federally-guaranteed loan balances (in Q2 or Q3 2020) in the A9 loan segment data, finding no correlation. Second, they add an interaction of PPP loan balances (divided by total assets) × Post to the baseline regression in Table A10 and find that while PPP lending is positively associated with loan renewals and originations, the MSLP bank × Post coefficient remains statistically significant and similar in magnitude to the baseline, ruling out PPP participation as the driver of the baseline results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What explains the low take-up of the MSLP despite its large designed capacity?&lt;/strong&gt;
A: Survey responses from the September 2020 supplementary SLOOS indicate several demand- and supply-side constraints: banks reported they could generally meet credit demand outside the program; borrower leverage limits (capped at 4–6× EBITDA depending on facility) were seen as too restrictive; the LIBOR plus 300 bps interest rate was high relative to historical pricing for eligible firms; and registration and loss-sharing arrangements were viewed as burdensome and uncertain. The paper interprets these findings as consistent with banks treating the MSLP primarily as a backstop — a facility they would activate only if economic conditions deteriorated significantly — rather than a primary lending channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper address the threat that MSLP participation reflects bank-level cyclicality in loan portfolios?&lt;/strong&gt;
A: Table 10 controls for bank-specific C&amp;amp;I loan portfolio cyclicality, measured as the correlation between each bank&amp;rsquo;s C&amp;amp;I loan growth and aggregate banking-sector C&amp;amp;I loan growth estimated over 1985:Q1–2021:Q2 using two functional forms. The MSLP bank × Post coefficient estimates remain very similar to the baseline after including these controls, ruling out the concern that MSLP participants were simply banks with naturally more procyclical or countercyclical lending patterns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What happens to the estimated spillover effects in 2020:Q4, and what does this reveal?&lt;/strong&gt;
A: The paper shows (Table A6) that extending the sample to include 2020:Q4 yields somewhat smaller estimated spillover effects than in the baseline 2020:Q3 period. The authors attribute this to the November 19, 2020 announcement by Treasury Secretary Mnuchin that the MSLP would not be extended beyond year-end, which effectively ended the program&amp;rsquo;s backstop role and — consistent with the psychological backstop mechanism — reduced banks&amp;rsquo; confidence in the program&amp;rsquo;s future availability and thus the spillover motivation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Does the paper find spillover effects on intensive margin loan terms, and how large are they?&lt;/strong&gt;
A: On loan spreads, MSLP banks charged approximately 9 basis points lower spreads on floating-rate C&amp;amp;I loans renewed or originated in 2020:Q3 in the Y-14Q data (2SLS: 19 bps), and approximately 13.5 bps lower spreads in the Dealscan syndicated loan sample (2SLS: 30 bps). The 9 bps OLS estimate implies the average spread across all LIBOR-indexed C&amp;amp;I loans in 2020:Q3 would have been approximately 4 bps higher absent the program (i.e., 0.43 × 9 bps), relative to an actual average spread of 235 bps — an effect the authors characterize as economically small. On loan size, the Dealscan evidence indicates MSLP banks extended syndicated loans that were 11.2% larger (2SLS: 25% larger).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Credit Spillover Effects:&lt;/strong&gt; As used in this paper, spillover effects refer to the impact of MSLP participation on participating banks&amp;rsquo; lending behavior &lt;em&gt;outside and beyond&lt;/em&gt; the program itself — specifically, changes in loan renewal rates, new loan origination rates, lending standards, and loan terms for non-MSLP C&amp;amp;I loans. This is distinct from the direct effect (i.e., loans originated through the MSLP proper).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Psychological Backstop:&lt;/strong&gt; The paper&amp;rsquo;s term for the mechanism by which the MSLP reduced participating banks&amp;rsquo; effective risk aversion without necessarily easing their immediate balance sheet constraints. By committing to provide lending support if conditions deteriorated, the program built banks&amp;rsquo; confidence to lend ex ante, functioning as &amp;ldquo;insurance&amp;rdquo; against bad outcomes rather than a direct funding facility. The mechanism is distinguished from balance sheet easing by the fact that constrained and unconstrained banks exhibited similar spillover effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive Margin of Lending:&lt;/strong&gt; The binary dimension of lending activity — specifically, whether a bank renews an existing loan or originates a new loan within a bank-borrower pair. In this paper, measured as the share of existing loan commitments within each bank-borrower pair that are renewed or newly originated each quarter. Contrasted with the intensive margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive Margin of Lending:&lt;/strong&gt; The quantitative dimension of existing lending relationships — specifically, the average loan size and average spread on loans renewed or originated in a given period, conditional on a loan being extended.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Senior Loan Officer Opinion Survey (SLOOS):&lt;/strong&gt; A quarterly Federal Reserve survey of senior lending officers at large U.S. banks covering self-reported changes in C&amp;amp;I lending standards, terms (including spreads, maximum loan size, maturity, covenants, collateral requirements), demand conditions, and — in supplementary editions — reasons for changing standards. Used in this paper both as an outcome variable (tightening standards) and as a control variable (changes in loan demand) and as a source of IV variation (burden of MSLP registration).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk Management Index (RMI):&lt;/strong&gt; An index developed by Ellul and Yerramilli (2013) measuring the strength of a bank&amp;rsquo;s internal risk management function, combining information on the presence and compensation of a chief risk officer, risk committee composition, and meeting frequency. Used in this paper as a pre-pandemic proxy for institutional risk aversion to test whether the MSLP disproportionately reduced risk aversion in banks with stronger risk controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Difference-in-Differences with Granular Fixed Effects:&lt;/strong&gt; The primary identification strategy, comparing changes in lending outcomes between MSLP-participating and non-participating banks before (2020:Q1–Q2) and after (2020:Q3) program implementation. The paper uses firm×quarter fixed effects following Khwaja and Mian (2008) to absorb borrower-level credit demand, and bank×borrower fixed effects following Chodorow-Reich (2013) to absorb relationship-specific supply factors — isolating the bank credit supply effect attributable to MSLP participation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Originate-and-Distribute Feature (of MSLP):&lt;/strong&gt; The MSLP&amp;rsquo;s design in which banks originate MSLP loans but sell 95% of the credit exposure to the SPV, retaining only 5%. This feature was intended to free up balance sheet capacity for further lending. The paper tests whether this channel (easing current balance sheet constraints) explains the observed spillovers, finding limited support relative to the risk aversion reduction channel.&lt;/p&gt;</description></item><item><title>Mussa Puzzle Redux</title><link>https://macropaperwarehouse.com/papers/mussa-puzzle-redux/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/mussa-puzzle-redux/</guid><description>&lt;p&gt;The Mussa (1986) puzzle is the empirical observation of a sharp, simultaneous increase in the volatility of both nominal and real exchange rates following the end of the Bretton Woods fixed exchange rate system in 1973 — a fact commonly interpreted as evidence for monetary non-neutrality. This paper resolves the puzzle by developing a model in which the dominant driver of nominal exchange rate fluctuations is a &amp;ldquo;financial shock&amp;rdquo; — a shock to the international demand for a country&amp;rsquo;s assets that is orthogonal to goods market fundamentals. Under a fixed rate, the central bank offsets financial shocks through reserve intervention, preventing them from moving the exchange rate; under a float, financial shocks freely move the nominal and real exchange rate simultaneously. The same framework also reconciles the Meese-Rogoff disconnect (exchange rates are unpredictable from macro fundamentals), the Backus-Smith puzzle, and the forward premium puzzle within a single unified model, with the financial shock accounting for the dominant share of exchange rate variance in each case.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-the-financial-shock-and-how-does-it-differ-from-standard-macro-shocks"&gt;Q1. What is the financial shock and how does it differ from standard macro shocks?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The financial shock is an orthogonal disturbance to international portfolio demand — the preference of foreign investors for holding domestic versus foreign assets — that is disconnected from productivity, monetary policy, and goods-market conditions.&lt;/strong&gt; Because it is uncorrelated with macro fundamentals, it generates exchange rate movements without corresponding movements in output, prices, or interest rate differentials, producing the observed disconnect between exchange rates and macro variables.&lt;/p&gt;
&lt;h3 id="q2-why-does-the-mussa-pattern-arise-from-regime-switching"&gt;Q2. Why does the Mussa pattern arise from regime switching?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Under a fixed rate, the central bank absorbs financial shocks via reserve intervention, sterilizing their exchange rate effects; the real exchange rate is equally insulated because the nominal rate is fixed and prices adjust slowly. Under a float, the same financial shocks freely move the nominal exchange rate, and with sticky prices this passes through to the real exchange rate.&lt;/strong&gt; The variance of the real exchange rate therefore jumps discontinuously at the regime switch, matching the sharp Mussa empirical finding without requiring any change in the shock process.&lt;/p&gt;
&lt;h3 id="q3-how-unified-is-the-resolution-across-exchange-rate-puzzles"&gt;Q3. How unified is the resolution across exchange rate puzzles?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A single model with the financial shock, sticky prices, and a standard asset pricing kernel simultaneously matches the Mussa pattern (regime-switching real volatility), the Meese-Rogoff disconnect (exchange rates unpredictable from fundamentals), the Backus-Smith puzzle (exchange rates and relative consumption uncorrelated), and the forward premium puzzle (high-interest-rate currencies appreciate).&lt;/strong&gt; The financial shock accounts for the majority of exchange rate variance in each application.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Mussa puzzle&lt;/strong&gt; : the discrete jump in real exchange rate volatility at the Bretton Woods breakdown (1973); resolved in this paper as the change in the central bank&amp;rsquo;s absorption of financial shocks between fixed and floating regimes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;financial shock&lt;/strong&gt; : a disturbance to international portfolio demand orthogonal to goods-market fundamentals; the paper&amp;rsquo;s key mechanism for exchange rate disconnect, the Mussa pattern, and several other exchange rate puzzles.&lt;/p&gt;</description></item><item><title>Narratives about the Macroeconomy</title><link>https://macropaperwarehouse.com/papers/narratives-about-the-macroeconomy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/narratives-about-the-macroeconomy/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates two related empirical questions in the context of the historic surge in US inflation in late 2021 and 2022: (1) What narratives—causal stories—do people invoke to explain why inflation increased? (2) How do those narratives shape economic expectations? A companion theoretical component asks how narrative heterogeneity affects aggregate macroeconomic outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors recruit more than 10,000 US households across five descriptive survey waves (November 2021, December 2021, January 2022, March 2022, May 2022) via Lucid, plus a separate expert survey of 111 academic economists with JEL-E publications in top journals, recruited simultaneously with the November 2021 household wave. Household samples are broadly representative of the US population in terms of gender, age, region, and income. The expert sample is highly credentialed: on average 18.6 years post-PhD, 2.7 top-five publications, and 5,534 Google Scholar citations.&lt;/p&gt;
&lt;p&gt;Narratives are elicited through open-ended questions asking respondents to explain in their own words why inflation increased. Each text response is coded by two independent, blinded research assistants as a Directed Acyclic Graph (DAG) — a network of causal nodes representing factors (demand-side: government spending, monetary policy, pent-up demand, demand shift; supply-side: supply chain disruptions, labor shortage, energy crisis; miscellaneous: pandemic, government mismanagement, price gouging, Russia-Ukraine war) connected by directed causal edges. Inter-rater reliability is high: if one coder identifies a factor, the other does so 88% of the time; for specific causal connections between factors, agreement is 77%.&lt;/p&gt;
&lt;p&gt;Three experiments study the causal effect of narratives on expectations: (1) A pent-up demand vs. energy crisis narrative provision experiment (April 2022, n=2,397 baseline, n=1,329 follow-up); (2) A monetary policy vs. energy crisis narrative provision experiment (June 2022, n=1,069 baseline, n=736 follow-up); (3) A 2×2 belief-updating experiment crossing narrative type (government spending vs. energy crisis) with information type (low vs. high government spending forecast) (April 2022, n=997).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Households&amp;rsquo; narratives are substantially coarser than experts&amp;rsquo;: expert DAGs contain on average 4.3 factors and 3.6 causal links, while household DAGs contain only 3.5 factors and 2.8 links (both differences p &amp;lt; 0.01). Households focus predominantly on supply-side explanations: 57% invoke at least one supply-side factor vs. only 32% invoking any demand-side factor. The most common household narrative factors are supply chain disruptions (30%), labor shortage (27%), and general supply-side factors (22%); the leading demand-side factor is government spending, appearing in only 17% of household narratives, while loose monetary policy appears in just 5%. By contrast, 90% of experts invoke at least one supply-side factor and 84% at least one demand-side factor, with government spending mentioned by 50% of experts and monetary policy by 38%.&lt;/p&gt;
&lt;p&gt;Among households who invoke at least one supply or demand narrative, only 34% mention both supply and demand factors; among the corresponding subsample of experts, 77% mention both. Government mismanagement—a politicized judgment of policy failure—appears in 32% of household narratives but only 1% of expert narratives. Price gouging appears in 8% of household narratives and 0% among experts.&lt;/p&gt;
&lt;p&gt;Partisan polarization is large: Democrat-leaning respondents are 26 pp more likely to attribute inflation to the pandemic as a root cause (p &amp;lt; 0.01); Republican-leaning respondents are 38 pp more likely to blame government mismanagement (p &amp;lt; 0.01), and 19 pp more likely to mention high government spending (p &amp;lt; 0.01) and 14 pp more likely to mention high energy prices (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Narratives are correlated with inflation expectations in OLS regressions controlling for demographics and survey wave fixed effects (n=2,951): households invoking government mismanagement predict 1.155 pp higher 1-year-ahead inflation (p &amp;lt; 0.01) and 0.805 pp higher 5-year-ahead inflation (p &amp;lt; 0.01). Energy crisis narratives predict 0.661 pp higher 1-year-ahead inflation (p &amp;lt; 0.01). Pent-up demand narratives predict 0.640 pp lower 5-year-ahead inflation (p &amp;lt; 0.05). Narrative variables explain approximately 10% of the out-of-sample variation in 1-year-ahead inflation expectations via LASSO, comparable to or exceeding the explanatory power of demographics and inflation experiences found in prior work.&lt;/p&gt;
&lt;p&gt;In Experiment 1 (pent-up demand vs. energy crisis), providing the pent-up demand narrative reduces 12-month inflation expectations by 0.71 pp relative to the energy crisis treatment (p &amp;lt; 0.01, in the main survey), corresponding to 24% of a standard deviation. This effect persists in the follow-up survey one day later (−0.63 pp, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;In Experiment 2 (monetary policy vs. energy crisis), the monetary policy narrative reduces 12-month inflation expectations by 0.40 pp at the time of the main survey (p &amp;lt; 0.01) and by 0.62 pp in the follow-up (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;In Experiment 3 (information updating), respondents exposed to the government spending narrative increase 12-month inflation expectations by 1.79 pp in response to a high-spending forecast (p &amp;lt; 0.01), while those exposed to the energy crisis narrative show no significant reaction (0.34 pp, p = 0.205). In IV regressions instrumenting government spending expectations with the high/low forecast treatment, a 1 pp increase in perceived government spending growth raises inflation expectations by 0.378 pp among those holding the government spending narrative (p &amp;lt; 0.01) versus only 0.051 pp among those holding the energy narrative (p = 0.184; difference p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;The New Keynesian DSGE model shows that a modest shift in perceived importance of monetary policy relative to productivity (raising ω_ν from 0.1 to 0.2, holding ω_g fixed) raises equilibrium consumption by 27 basis points and reduces equilibrium inflation by 27 basis points in the calibrated model with φ = 1.5; with a less reactive central bank (φ = 1.25), the same shift raises consumption by 30 basis points and reduces inflation by 62 basis points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All empirical results are drawn from the US context during the 2021–2022 inflation surge. The authors note that the extent of partisan polarization in US narratives may not generalize to less politically polarized countries. The test-retest correlation of narrative factors across a three-day interval is 0.63 (p &amp;lt; 0.01), indicating significant but not perfect stability. The experiment results may partly reflect that narratives were especially malleable because the inflation surge was a relatively recent and salient phenomenon at the time of data collection.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How do the authors define and operationalize &amp;ldquo;narratives&amp;rdquo;?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper defines economic narratives as causal accounts for why an economic event occurred — agents&amp;rsquo; assessments of cause-effect relationships across events. Each text response is coded as a Directed Acyclic Graph (DAG) where nodes are economic factors and directed edges represent perceived causal links. DAGs can represent both simple mono-causal accounts and complex multi-factor chains. The authors use a predefined coding scheme of 16+ factor categories spanning demand-side, supply-side, and miscellaneous nodes, with inflation as the terminal node.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the inter-rater reliability of the DAG coding, and what does it imply for the quality of the narrative data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Two independent, blinded coders annotate each response. If one coder assigns a given factor, the other does so 88% of the time; for specific causal connections between factors, agreement is 77%. Approximately 95% of assigned factors and 89% of assigned connections make it to the final coded version. At the coarser level of &amp;ldquo;any demand-side factor,&amp;rdquo; agreement rises to 94%; for &amp;ldquo;any supply-side factor,&amp;rdquo; to 93%. Test-retest reliability across a three-day interval averages a correlation of 0.63 across all narrative factors (p &amp;lt; 0.01), comparable in magnitude to the measured persistence of economic preferences in prior work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do expert and household narratives differ in their structural complexity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Expert DAGs contain on average 4.3 factors and 3.6 causal links, compared to 3.5 factors and 2.8 links for households (both p &amp;lt; 0.01). These differences persist even after controlling for response time and word count, indicating genuine differences in economic understanding rather than effort. Among agents who invoke at least one supply or demand factor, 77% of experts mention both, compared to only 34% of households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the most prevalent factors in household narratives versus expert narratives, and why does this matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Supply chain disruptions (30%), labor shortage (27%), and general supply-side factors (22%) top household narratives, while monetary policy appears in only 5% of household DAGs. Expert narratives are more balanced: 90% cite supply-side factors and 84% cite demand-side factors, with government spending mentioned by 50% and monetary policy by 38%. This matters because factors with different persistence imply different trajectories for future inflation; households&amp;rsquo; supply-side emphasis, combined with low awareness of monetary policy, shapes their inflation expectations in systematically different ways than experts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the structure of household narrative clusters, and how fragmented are they?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Agglomerative hierarchical clustering using the Jaccard distance between DAG edge lists reveals 15 optimal clusters (Silhouette criterion), of which eight have at least 30 members. Four supply-side clusters account for 55% of households: pandemic-related supply chain disruptions (20%), general supply-side causes (18%), energy crisis often attributed to government mismanagement (11%), and labor shortages attributed to the pandemic or government spending (7%). The only clear demand-side cluster—combining government spending and loose monetary policy—captures just 8%. Simple mono-causal clusters attributing inflation to the pandemic alone (15%), government mismanagement alone (11%), and price gouging alone (4%) are collectively prominent, underscoring how fragmented and often single-factor household reasoning is.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do partisan affiliations correlate with narrative content?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Republicans are 38 pp more likely than Democrats to attribute inflation to government mismanagement (p &amp;lt; 0.01), 19 pp more likely to mention high government spending (p &amp;lt; 0.01), and 14 pp more likely to mention high energy prices (p &amp;lt; 0.01). Democrats are 26 pp more likely to cite the pandemic as a root cause of inflation (p &amp;lt; 0.01) and more frequently cite pandemic-related supply chain issues and corporate greed. Government mismanagement appears in 32% of all household narratives (and is often portrayed as a root cause of spending, monetary policy, and energy prices) but in only 1% of expert narratives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How did the composition of household narratives shift over time (November 2021 to May 2022)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The energy crisis narrative rose sharply from 12% in January 2022 to 28% in March 2022, coinciding with Russia&amp;rsquo;s invasion of Ukraine in late February 2022. The Russia-Ukraine war narrative went from virtually zero before February 2022 to 28% in March 2022. By contrast, pandemic references, which climbed from 44% in November 2021 to 55% in January 2022, fell back to 47% in March 2022 and 39% in May 2022. Labor shortage references fell sharply from 32% in January 2022 to 15% in May 2022. These abrupt shifts suggest household narratives respond to major news events and, by extension, could drive rapid revisions in inflation expectations around such events.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the correlational evidence that narratives predict inflation expectations, and how large is the explanatory power?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: OLS regressions on pooled data from November 2021–January 2022 (n=2,951), controlling for survey wave fixed effects and sociodemographics, show: government mismanagement narratives predict 1.155 pp higher 1-year inflation expectations (p &amp;lt; 0.01) and 0.805 pp higher 5-year expectations (p &amp;lt; 0.01); energy crisis narratives predict 0.661 pp higher 1-year expectations (p &amp;lt; 0.01); monetary policy narratives predict 1.005 pp higher 1-year expectations (p &amp;lt; 0.01); pent-up demand narratives predict 0.640 pp lower 5-year expectations (p &amp;lt; 0.05). LASSO out-of-sample prediction using DAG factor dummies and connection dummies explains approximately 10% of variation in 1-year-ahead inflation expectations — comparable to the 10% within-sample R² found by D&amp;rsquo;Acunto et al. (2021) for grocery price exposure, and substantially above the 2–7% found by Giglio et al. (2021) for investor characteristics explaining stock return expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does Experiment 1 (pent-up demand vs. energy crisis) show about the causal effect of narratives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Providing the pent-up demand narrative (relative to the energy crisis narrative) increases the fraction of respondents invoking pent-up demand by 37.8 pp in the follow-up survey (baseline: 2.8%, p &amp;lt; 0.01) and reduces the fraction invoking the energy crisis by 7.9 pp (p &amp;lt; 0.01), establishing successful first-stage uptake. In the main survey (n=2,397), the pent-up demand treatment reduces 12-month inflation expectations by 0.71 pp relative to the energy treatment (p &amp;lt; 0.01), equivalent to 24% of a standard deviation; the effect persists at −0.63 pp in the follow-up one day later (p &amp;lt; 0.01). The energy crisis treatment has no significant effect on expectations relative to a pure control (−0.02 pp, p = 0.911), suggesting that energy crisis implications were already salient at the time.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does Experiment 2 (monetary policy vs. energy crisis) add, given it was conducted after significant Fed tightening?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The experiment was run in June 2022, when 61% of respondents were already aware the Fed had raised rates. The monetary policy narrative increases the fraction invoking monetary policy by 39 pp and reduces the energy fraction by 50 pp relative to the energy group (both p &amp;lt; 0.01). The monetary policy narrative reduces 12-month inflation expectations by 0.40 pp in the main survey (p &amp;lt; 0.01) and 0.62 pp in the follow-up (p &amp;lt; 0.01). The mechanism is that attributing past inflation to loose monetary policy — which has since been tightened — leads respondents to infer lower future inflation, consistent with the narrative about persistence of the underlying cause.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What does Experiment 3 demonstrate about how narratives filter the interpretation of new information?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the 2×2 design, all respondents first receive either a government spending narrative or an energy crisis narrative, then either a low (−4%) or high (+6%) government spending forecast from the Survey of Professional Forecasters. Among those with the government spending narrative, the high-spending forecast raises 12-month inflation expectations by 1.79 pp (p &amp;lt; 0.01); among those with the energy crisis narrative, the high-spending forecast raises inflation expectations by a non-significant 0.34 pp (p = 0.205). The IV estimate shows that a 1 pp increase in expected government spending growth raises inflation expectations by 0.378 pp for those holding the spending narrative (p &amp;lt; 0.01) vs. 0.051 pp for those holding the energy narrative (p = 0.184); this difference is highly significant (p &amp;lt; 0.01). Importantly, the first-stage effect on expected government spending growth is similar across narrative groups (4.7 pp vs. 6.8 pp, difference not significant), ruling out differential interpretation of the forecast itself as the mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How do the authors formalize narratives in the DSGE model, and what is the key mapping result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Narratives are formalized as subjective causal models (SCMs): linear mappings from N observable factors to inflation, π_t = ψ_1(i)z_{1,t} + &amp;hellip; + ψ_N(i)z_{N,t}, combined with perceived AR(1) processes for each factor. The &amp;ldquo;subjective inflation narrative&amp;rdquo; of agent i is summarized by perceived contribution shares ω_z(i). The paper&amp;rsquo;s Proposition 2 gives closed-form expressions for equilibrium inflation and consumption as functions of these perceived shares, without imposing that they be correct or identical across agents. The key result is that subjective causal models always affect equilibrium outcomes so long as the perceived persistence parameters differ across factors — the mechanism being that different narratives produce different inflation expectations, which feed back into consumption and pricing decisions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What are the quantitative implications of narrative shifts in the calibrated DSGE model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The baseline calibration uses standard New Keynesian parameters (β=0.99, γ=1, ς=5, Calvo price duration=4 quarters, φ=1.5, ρ_a=0.9, ρ_g=0.8, ρ_ν=0.5) with a scenario of a 10% productivity decline, 10% government spending increase, and policy rate 2 pp below the Taylor rule. Under rational expectations, π_t=3.68% and c_t=−11.79%. Raising the perceived importance of monetary policy in household and firm inflation narratives from ω_ν=0.1 to ω_ν=0.2 (lowering ω_a by the same amount, holding ω_g fixed) increases equilibrium consumption by 27 basis points and reduces equilibrium inflation by 27 basis points. With a less reactive central bank (φ=1.25), the same narrative shift raises consumption by 30 basis points and reduces inflation by 62 basis points. The paper notes that these effects are approximately linear in the narrative shift, meaning the directional implication holds across a wide range of narrative configurations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: How does narrative heterogeneity across households affect aggregate outcomes in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When households hold heterogeneous narratives, aggregate outcomes depend on the joint distribution of perceived factor importance (ω_z(i)) and perceived factor persistence (ρ_z(i)) across agents, rather than on average values alone. Specifically, the model shows that if households who assign higher importance to a given factor also perceive that factor as more persistent, the aggregate effect on expectations and consumption is amplified beyond what the average narrative predicts. Additionally, narrative heterogeneity generates consumption heterogeneity even when the efficient allocation requires all households to consume the same amount, representing a welfare-relevant distortion absent under rational expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: What is the practical implication for central bank communication?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Under full-information rational expectations, central bank narrative communication about the drivers of inflation is irrelevant because agents already hold the correct model. Once subjective causal models can deviate from the truth, central bank narrative provision shifts aggregate equilibrium outcomes (inflation and consumption) in a benchmark New Keynesian model. The paper argues that central banks need to measure the distribution of household narratives to know whether their communication shifts agents toward or away from the rational expectations equilibrium — moving agents in the direction of the correct narrative produces better aggregate outcomes from the central bank&amp;rsquo;s perspective, conditional on inflation being above target and output below first-best.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Economic Narrative (as used in this paper):&lt;/strong&gt; An agent&amp;rsquo;s causal account for why a given economic event occurred — specifically, an assessment of cause-effect relationships that explains the drivers of an economic outcome. Distinguished from more general notions of &amp;ldquo;story&amp;rdquo; in that causality is the core; the paper does not count descriptions of correlation or simple statements of fact as narratives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Directed Acyclic Graph (DAG) representation of narratives:&lt;/strong&gt; Each narrative is coded as a network of factor nodes connected by directed edges indicating perceived causation. Acyclicity rules out feedback loops in a respondent&amp;rsquo;s causal account. Factors with nonzero ψ(i) are included; the direction of edges indicates causal flow. This representation allows quantitative comparison across respondents via adjacency matrices or Jaccard distances between edge lists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Subjective Causal Model (SCM) of inflation:&lt;/strong&gt; The paper&amp;rsquo;s formal theoretical counterpart to a narrative: a linear mapping π_t = Σ_n ψ_n(i) z_{n,t} in which individual i assigns perceived marginal effect ψ_n(i) to each factor z_n, combined with a perceived AR(1) law of motion for each factor. The SCM does not need to be correct or shared across agents. The rational expectations equilibrium is the special case where all agents&amp;rsquo; SCMs match the true data-generating process.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Perceived contribution share (ω_z):&lt;/strong&gt; The ratio ψ_z(i)·z_t / π_t — agent i&amp;rsquo;s perceived percentage contribution of factor z to current inflation. This is the sufficient statistic for the effect of household narratives on inflation expectations and, through the NK model, on equilibrium aggregate outcomes. The aggregate distribution of ω_z(i) and perceived persistence ρ_z(i) determines the consumption Euler equation at the aggregate level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Government mismanagement (as a narrative factor):&lt;/strong&gt; A coding category that captures explicit reference to policy failure or low-quality decision-making by policymakers in a politicized sense — distinct from the economic factors of government spending or monetary policy. It represents households&amp;rsquo; attribution of inflation to the incompetence or malfeasance of officials, rather than to any specific economic mechanism. This factor appears in 32% of household narratives but only 1% of expert narratives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Narrative cluster:&lt;/strong&gt; A group of respondents whose DAGs are mutually similar (measured by Jaccard distance between edge lists) and whose typical DAG differs from other clusters. Identified via agglomerative hierarchical clustering. The paper identifies eight substantively meaningful clusters, ranging from supply-chain-focused to mono-causal pandemic or mismanagement narratives, with no single cluster capturing more than 20% of households.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Test-retest reliability of narratives:&lt;/strong&gt; The correlation between the same respondent&amp;rsquo;s narrative elicited on two occasions three days apart. The paper estimates an average correlation of 0.63 across all narrative factors (p &amp;lt; 0.01), interpreted as indicating significant stability in households&amp;rsquo; causal beliefs rather than survey noise. Comparable in magnitude to test-retest correlations of economic preferences in other studies.&lt;/p&gt;</description></item><item><title>Normal Approximation in Large Network Models</title><link>https://macropaperwarehouse.com/papers/normal-approximation-in-large-network-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/normal-approximation-in-large-network-models/</guid><description>&lt;p&gt;This paper proves a central limit theorem (CLT) for network formation models with strategic interactions and homophilous agents, addressing a foundational inferential gap in the econometrics of large networks. The setting is one where the econometrician observes a single large network — the asymptotic framework sends network size n to infinity — which is the empirically relevant case for most network datasets. The network moments of interest are averages of node-level statistics (1/n) Σ ψ_i, where ψ_i can capture degree, clustering coefficients, or subnetwork counts (triangles, k-stars) that have been used for structural inference in network formation games.&lt;/p&gt;
&lt;p&gt;The model is a pairwise-stability network formation game augmented onto a latent-space/geometric-graph structure. Each node i has an i.i.d. type (X_i, Z_i), where X_i is a continuously distributed position vector capturing homophilous attributes. Two nodes i and j form a link if a joint-surplus function V(·) exceeds zero, where V depends on the scaled distance r_n^{-1}‖X_i − X_j‖ between positions, a vector of strategic interaction statistics S_{ij} (functions of neighboring links), node attributes Z_i, Z_j, and an i.i.d. utility shock ζ_{ij}. Homophily enters as a monotonicity requirement: V is decreasing in the distance component, so dissimilar nodes are less likely to link. Sparsity is ensured by setting r_n = (κ/n)^{1/d}, which keeps expected degree asymptotically bounded.&lt;/p&gt;
&lt;p&gt;Strategic interactions enter through S_{ij}, which depends on links involving neighbors of i or j (local externalities), generating chains of cross-sectional dependence that are the central obstacle to the CLT. The paper identifies two distinct sources of dependence: (1) link interdependencies from best-response chains, where the realization of one link influences neighboring links; and (2) global coordination in equilibrium selection, where agents may condition on a common signal.&lt;/p&gt;
&lt;p&gt;The main technical contribution is adapting &amp;ldquo;stabilization&amp;rdquo; conditions from the literature on geometric graphs (Penrose and Yukich 2003, 2008) to the strategic setting. Exponential stabilization (Assumption 5) requires that the radius of stabilization R_i — the smallest neighborhood of i such that ψ_i depends only on nodes within that neighborhood — has a distribution with exponential tails. This bounds the effective dependence neighborhood and provides the weak dependence structure needed for the CLT.&lt;/p&gt;
&lt;p&gt;To verify stabilization from primitive conditions, the paper employs branching process theory. The key construct is the &amp;ldquo;strategic neighborhood&amp;rdquo; C_i^+, the component of i in the network of non-robust links D (pairs where strategic interactions can change the link outcome). The paper bounds |C_i^+| by a subcritical Galton-Watson branching process: if the mean offspring is below 1 (subcriticality, Assumption 7, stated as ‖h*‖_m &amp;lt; 1), the process is non-explosive and its size has exponential tails, yielding the required stabilization. The subcriticality condition directly restricts the strength of strategic interactions and is the network analog of the condition ‖β‖ &amp;lt; 1 in linear autoregressive models. A second condition (Assumption 8, decentralized selection) requires that equilibrium selection operates independently across disjoint strategic neighborhoods, ruling out global coordination; this holds under myopic best-response dynamics.&lt;/p&gt;
&lt;p&gt;For inference, the paper proposes a network HAC variance estimator hat_Σ_n = (1/n) Σ_i Σ_j k(d_{ij}/b_n) hat_ψ_i hat_ψ_j^T, where k(·) is a kernel, d_{ij} is the path distance in A, and b_n is a bandwidth, and a network bootstrap that resamples nodes with replacement. Both are shown to be consistent (Theorem 3). Simulation results with n up to 500, varying strategic interaction strength θ_2 from 0 to 0.5, show that the network HAC estimator achieves nominal 5% rejection rates and 95% coverage for n ≥ 500, while the bootstrap slightly over-rejects in small samples and performance degrades as θ_2 increases.&lt;/p&gt;
&lt;p&gt;The scope conditions are explicit: the CLT applies to sparse networks (expected degree bounded), undirected networks with local externalities, models admitting a pairwise-stability equilibrium, and equilibrium selection satisfying decentralization. Extensions to directed or denser networks are left for future work.&lt;/p&gt;
&lt;p&gt;Q: What is the primary research question and why does it require new theory?
A: The paper asks when sample averages of network statistics — degree, clustering, subnetwork counts — satisfy a CLT in strategic network formation models observed as a single large network. Standard CLT proofs require weakly dependent observations, but strategic interactions generate chains of link dependence of a priori unbounded length, and multiple equilibria allow global coordination, both of which can destroy asymptotic normality. Prior work (Leung 2019b; Menzel 2024) established laws of large numbers but not CLTs, which require stronger conditions.&lt;/p&gt;
&lt;p&gt;Q: What is the stabilization condition and why is it the right formulation of weak dependence?
A: Exponential stabilization (Assumption 5) requires that the radius of stabilization R_i — the smallest K such that ψ_i depends only on the K-neighborhood of i in the network — has a distribution with exponential tails: lim sup_{w→∞} w^{-η} max{log τ_{b,ε}(w), log τ_p(w)} &amp;lt; 0 for some η ∈ (0,1]. This implies that each node&amp;rsquo;s statistic depends effectively only on a bounded fraction of the network, making {ψ_i} weakly dependent. The condition is a modification of stabilization conditions from the geometric graph literature (Penrose and Yukich 2003, 2008) adapted to allow strategic interactions.&lt;/p&gt;
&lt;p&gt;Q: How does the paper connect the abstract stabilization condition to primitive model conditions?
A: The paper defines the strategic neighborhood C_i^+ as the union of one-step network neighborhoods of nodes in i&amp;rsquo;s component in the non-robust link network D (where D_{ij} = 1 iff the link A_{ij} can be switched by strategic interactions). The size |C_i^+| controls the radius of stabilization. By mapping exploration of C_i via breadth-first search onto a Galton-Watson branching process, subcriticality (mean offspring &amp;lt; 1, i.e., ‖h*‖_m &amp;lt; 1) implies that |C_i^+| has exponential tails, which yields exponential stabilization with η = 1 (Theorem 2).&lt;/p&gt;
&lt;p&gt;Q: What is the subcriticality condition and what does it restrict?
A: Subcriticality (Assumption 7) requires that the mean interaction-strength measure satisfies ‖h*‖_m &amp;lt; 1, where h* bounds the probability that a given link is non-robust as a function of node attributes. This restricts how strongly the existence of one link influences the probability of neighboring links. The authors explicitly analogize this to the condition ‖β‖ &amp;lt; 1 in linear autoregressive models: both bound the magnitude of &amp;ldquo;autoregressive&amp;rdquo; dependence below one to prevent explosive propagation of dependence.&lt;/p&gt;
&lt;p&gt;Q: What is the decentralized selection condition and what does it rule out?
A: Assumption 8 (decentralized selection) requires that the equilibrium selection mechanism operates independently across disjoint strategic neighborhoods: A_{H_l} = λ_{|H_l|}(r^{-1}T_{H_l}, ζ_{H_l}) for each disjoint strategic neighborhood H_l. This rules out global coordination where agents condition on a common signal (such as the type of a particular node) to jointly select an equilibrium. The condition is satisfied by myopic best-response dynamics and is described as the single-network analog of requiring equilibrium selection to be independent across networks under many-network asymptotics.&lt;/p&gt;
&lt;p&gt;Q: What is the structure of the CLT proof?
A: The proof has two steps. Step 1 proves a CLT for the Poissonized model where the number of nodes N_n ~ Poisson(n), leveraging results from Penrose and Yukich (2008) for geometric graphs extended to the strategic setting. Step 2 is a de-Poissonization argument that transfers the Poissonized CLT back to the fixed-n model. The abstract CLT (Theorem 1) requires Assumptions 5 and 6, and Theorem 2 establishes that Assumptions 1–8 imply Assumption 5 with η = 1.&lt;/p&gt;
&lt;p&gt;Q: How does the network HAC estimator work and what are its consistency conditions?
A: The estimator is hat_Σ_n = (1/n) Σ_i Σ_j k(d_{ij}/b_n) hat_ψ_i hat_ψ_j^T, where d_{ij} is the path distance between i and j in the observed network A, k(·) is a kernel function, b_n is a bandwidth, and hat_ψ_i = ψ_i(N_n) − (1/n) Σ_j ψ_j(N_n) is the demeaned statistic. Consistency (hat_Σ_n →^p Σ_n) is established under appropriate conditions on the bandwidth b_n (Theorem 3). The bandwidth plays the same role as in time-series HAC estimation, controlling the window over which covariances are summed.&lt;/p&gt;
&lt;p&gt;Q: What do the simulations show about finite-sample performance?
A: Using a DGP with X_i ~ U([0,1]^2), ζ_{ij} ~ N(0,1), and θ_2 varying from 0 to 0.5 to control strategic interaction strength, the network HAC estimator achieves nominal 5% rejection rates and 95% coverage at n ≥ 500 across all settings. The bootstrap slightly over-rejects in small samples. Performance of all procedures degrades as θ_2 increases (stronger strategic interactions), consistent with the theoretical condition that subcriticality must hold. These results support practical use of the inference procedures based on Theorem 1.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to prior work on CLTs for network data?
A: Kojevnikov et al. (2021) prove a CLT for node-level data conditional on the network, but this does not apply to network formation because the network is the outcome, not a conditioning variable. Leung (2019b) and Menzel (2024) prove laws of large numbers for strategic network formation but not CLTs. Kuersteiner (2019) takes a different approach using a conditional mixingale assumption. The paper&amp;rsquo;s abstract CLT extends Penrose and Yukich (2008) by modifying the stabilization condition to accommodate strategic interactions; the primitive conditions are new and use branching process tools that build on Leung (2019b).&lt;/p&gt;
&lt;p&gt;Q: What network moments can the CLT be applied to?
A: The CLT applies to any average of node statistics ψ_i that depends only on the K-neighborhood of i in the network (Assumption 4 with finite K). Explicit examples include average degree (ψ_i = Σ_j A_{ij}), average clustering coefficient, and counts of connected subnetworks such as triangles and k-stars. Subnetwork counts have been used as the basis for structural identification and estimation of network formation games (Sheng 2020), making the CLT directly applicable to inference in those models.&lt;/p&gt;
&lt;p&gt;Q: What are the scope limitations and directions for future work?
A: The CLT applies to sparse undirected networks with local externalities (Assumption 2), homophily in positions (Assumption 1), and equilibrium selection satisfying decentralization (Assumption 8). It does not cover directed networks, denser networks where expected degree grows with n, or models with global link externalities. The authors identify extending results to directed and denser networks and developing more powerful inference procedures exploiting network structure as priorities for future work.&lt;/p&gt;
&lt;p&gt;Stabilization (exponential): The condition that the radius of stabilization R_i — the smallest neighborhood of i beyond which ψ_i does not depend on further nodes — has a distribution with exponential tails (lim sup_{w→∞} w^{-η} log τ(w) &amp;lt; 0 for η ∈ (0,1]). This is the paper&amp;rsquo;s operative formulation of weak dependence for network statistics and is adapted from geometric graph theory to the strategic setting.&lt;/p&gt;
&lt;p&gt;Strategic neighborhood (C_i^+): The union of one-step neighborhoods of nodes in i&amp;rsquo;s component in the non-robust link network D. A link (i,j) is non-robust (D_{ij} = 1) if strategic interactions can change its realization — i.e., the surplus V can be positive under some interaction configurations and non-positive under others. The size of C_i^+ governs the radius of stabilization and hence the degree of cross-sectional dependence.&lt;/p&gt;
&lt;p&gt;Subcriticality (‖h*‖_m &amp;lt; 1): The condition that the mean-field interaction strength measure satisfies ‖h*‖_m &amp;lt; 1, where h* bounds the conditional probability that a link is non-robust. Subcriticality ensures that breadth-first search of the strategic neighborhood is dominated by a subcritical Galton-Watson process (mean offspring &amp;lt; 1), preventing explosive growth of the dependence neighborhood. The paper explicitly frames this as the network analog of ‖β‖ &amp;lt; 1 in autoregressive models.&lt;/p&gt;
&lt;p&gt;Decentralized selection (Assumption 8): The requirement that the equilibrium selection mechanism assigns outcomes independently across disjoint strategic neighborhoods: A_{H_l} = λ_{|H_l|}(r^{-1}T_{H_l}, ζ_{H_l}) for each disjoint H_l. This rules out global coordination — agents conditioning on a common signal to select among equilibria — while permitting local coordination within strategic neighborhoods. Satisfied by myopic best-response dynamics.&lt;/p&gt;
&lt;p&gt;Pairwise stability: The solution concept underlying the model. A network A satisfies pairwise stability under transferable utility if A_{ij} = 1{V_{ij} &amp;gt; 0}, meaning a link forms exactly when the joint surplus is positive. This is the equilibrium condition from which the strategic interaction statistics S_{ij} and non-robustness indicators D_{ij} are derived.&lt;/p&gt;
&lt;p&gt;Network HAC estimator: The variance estimator hat_Σ_n = (1/n) Σ_i Σ_j k(d_{ij}/b_n) hat_ψ_i hat_ψ_j^T, where d_{ij} is the path distance in the observed network, k(·) is a kernel, and b_n is a bandwidth. It is the network analog of heteroskedasticity- and autocorrelation-consistent (HAC) estimators in time series, using path distance in place of temporal lag distance.&lt;/p&gt;
&lt;p&gt;Homophily (in this paper&amp;rsquo;s sense): The property that the joint-surplus function V is decreasing in the first argument r_n^{-1}‖X_i − X_j‖ (scaled positional distance), so nodes that are more dissimilar in position are strictly less likely to form links. Combined with the sparsity scaling r_n = (κ/n)^{1/d}, this ensures that links decay with distance in social space and that the network remains sparse as n grows.&lt;/p&gt;</description></item><item><title>On the Nature of Entrepreneurship</title><link>https://macropaperwarehouse.com/papers/on-the-nature-of-entrepreneurship/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-the-nature-of-entrepreneurship/</guid><description>&lt;p&gt;This paper uses a novel longitudinal administrative dataset drawn from U.S. Internal Revenue Service (IRS) and Social Security Administration (SSA) records to characterize income dynamics and the determinants of entrepreneurial entry for pass-through business owners — sole proprietors, partners, and S corporation owners — who collectively account for over 50 percent of all U.S. business net income. The sample covers 2000–2015 and includes up to 1.3 billion person-year observations for individuals aged 25–65. The authors construct balanced panels using birth cohorts 1950–1975, impute education (college attainment) and skill (cognitive, interpersonal, manual) via machine-learning classifiers trained on CPS and O*NET data, and estimate life-cycle income profiles using a three-component model that separates individual fixed effects, group-specific time effects, and group-cohort-specific age effects.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central departure from prior work is coverage of the full income distribution, including the high-earning right tail that household surveys such as the CPS misrepresent due to top-coding and small samples. When the IRS and CPS samples are compared on a consistent classification basis, median self-employment income is lower in the IRS data at all ages, consistent with the survey literature&amp;rsquo;s emphasis on the &amp;ldquo;typical&amp;rdquo; self-employed individual. However, mean incomes diverge sharply: the IRS shows mean self-employment income rising from $23 thousand at age 25 to $93 thousand at age 55, whereas the CPS (with incorporated owners reclassified) shows a rise from only $41 thousand to $73 thousand. Roughly 80 percent of self-employment income in the IRS data accrues to individuals above the $100 thousand threshold, compared to 42–53 percent in the CPS. The IRS-CPS gap is dominated by the right tail and concentrated in professional services and health care. For paid-employed individuals, the IRS and CPS medians and means are close at all ages, confirming the discrepancy is specific to self-employment.&lt;/p&gt;
&lt;p&gt;The life-cycle estimation finds that individuals who have &amp;ldquo;tried self-employment&amp;rdquo; — a group earning virtually all self-employment income — start at similar average incomes to primarily paid-employed peers at age 25 but reach $134 thousand by age 55, compared with $79 thousand for paid-employed peers with the same observable characteristics. Age effects for the self-employed are 63 percent higher than for the paid-employed at age 26 and remain elevated until age 55. Time effects show dramatically greater cyclical volatility for the self-employed: income growth declined by $9,655 (2008) and $8,785 (2009) for the self-employed versus $373 and $1,583 for paid-employed in the same years, concentrated in real estate and construction.&lt;/p&gt;
&lt;p&gt;On the determinants of entry, the paper finds: (i) no evidence that house-price appreciation raises entry rates, contra collateral-constraint hypotheses; (ii) most entrants have lower asset incomes than future entrants with the same characteristics, arguing against a liquid-wealth precondition; (iii) most entrants have higher prior labor income than future entrants, consistent with entry being driven by on-the-job experience rather than fallback from low-paid work; (iv) almost all founders report positive individual tax income in their first year of operation despite negative business net income and no external debt financing. Self-employed income growth exhibits greater dispersion — a 10th-to-90th percentile range roughly 2.5 times wider than for the paid-employed — and a Kelly skewness about 0.1 higher. A standard consumption-risk model calibrated with household-finance estimates of risk aversion rationalizes the patterns if individuals are insured against the most adverse downside shocks. Entry and exit rates are stable across the sample period, including the Great Recession, and the entrepreneurship share does not decline.&lt;/p&gt;
&lt;p&gt;The subgroup congruent with non-pecuniary motivation — primarily self-employed individuals earning less than paid-employed peers with matching characteristics — comprises roughly 57 percent of primarily self-employed by count but earns only 16 percent of total self-employment income.&lt;/p&gt;
&lt;p&gt;Q1: Why do IRS and CPS data give such different pictures of self-employment income?
The CPS suffers from top-coding of high incomes and small samples that underrepresent high earners in key industries. The IRS-CPS mean income gap for the self-employed is dominated by the right tail: in the main IRS sample, individuals above the $100 thousand threshold earn roughly 80 percent of all self-employment income, versus 42 percent in the comparable CPS sample. The average income of top earners above $100 thousand is $355 thousand in the IRS versus $218 thousand in the CPS. The gap is concentrated in professional services and health care and persists across all income thresholds and sample definitions tested. No analogous discrepancy exists for paid-employed individuals, where IRS and CPS medians and means are close at all ages.&lt;/p&gt;
&lt;p&gt;Q2: What does the comparison look like at the median versus the mean?
At the median, IRS self-employment income is lower than both CPS samples at all ages, with the gap largest for younger owners and those with incorporated businesses — a pattern consistent with the survey-based &amp;ldquo;self-employment discount&amp;rdquo; narrative. At the mean, the IRS shows much higher income at older ages: by age 55, IRS mean self-employment income is $93 thousand versus $73 thousand in the CPS sample that includes reclassified incorporated-owner wages. The divergence arises because the mean is sensitive to the right tail, which the CPS systematically underrepresents.&lt;/p&gt;
&lt;p&gt;Q3: How does the paper estimate life-cycle income profiles while separating age, time, and cohort effects?
Individual income is decomposed into an individual fixed effect (permanent latent ability and preferences), a group-specific time effect (business-cycle fluctuations common to a group), and a group-cohort-specific age effect (life-cycle income growth). Identification exploits the overlapping cohort structure of the 16-year panel: age effects are assumed equal across cohort bins of size at least two, allowing time and age effects to be separately identified. The model is estimated in levels rather than logs to accommodate business losses. Groups are defined as a Cartesian product of 32,256 subgroups based on education, three skill dimensions, industry (21 two-digit NAICS codes), demographics (gender, cohort, marital status, children), and employment-status history.&lt;/p&gt;
&lt;p&gt;Q4: What are the headline life-cycle income profile findings for self- versus paid-employed?
Among the &amp;ldquo;primarily employed&amp;rdquo; group, those who have tried self-employment and those who are primarily paid-employed have similar average incomes at age 25. By age 55 the self-employed reach an estimated $134 thousand (2012 dollars) versus $79 thousand for paid-employed peers with identical observable characteristics. The estimated age effect for the self-employed is 63 percent higher than for the paid-employed at age 26 and remains higher through age 55. These gaps would widen further if incomes were adjusted upward for the BEA-estimated net misreporting rates of 46 percent for unincorporated owners and 14 percent for S corporation owners.&lt;/p&gt;
&lt;p&gt;Q5: How large is the group consistent with non-pecuniary motivation, and how much income does it earn?
The non-pecuniary subgroup — primarily self-employed individuals (at least 12 years in self-employment) who earn less on average than primarily paid-employed peers matched on gender, education, skills, and other characteristics — is numerically larger, comprising approximately 57 percent of primarily self-employed by count. However, this group earns only 16 percent of total self-employment income. Adjusting for paid-employed fringe benefits and self-employed income misreporting can change the group&amp;rsquo;s size but does not alter the finding that it accounts for a small income share. The paper concludes that non-pecuniary motives may guide occupational choice for many individuals but are not the driver of the typical dollar earned in self-employment.&lt;/p&gt;
&lt;p&gt;Q6: How does idiosyncratic income risk compare between self- and paid-employed?
Self-employed income changes are substantially more dispersed: the 10th-to-90th percentile range of income growth is roughly 2.5 times wider for the self-employed than for the paid-employed. Income changes for the self-employed are also more right-skewed, with a Kelly skewness difference of approximately 0.1. When a standard consumption-risk model — augmented with a lower bound on consumption growth to allow for external insurance — is parameterized with risk-aversion estimates from the household finance literature, the observed patterns are rationalized if individuals are insured against the most adverse downside shocks, i.e., the attractive aspect of self-employment is large potential upside with insured downside.&lt;/p&gt;
&lt;p&gt;Q7: What happened to self-employed income and exit rates during the Great Recession?
Time effects show steep income growth declines for the self-employed of -$9,655 in 2008 and -$8,785 in 2009, compared with much more modest declines of -$373 and -$1,583 for paid-employed peers. The aggregate income declines are concentrated in cyclically sensitive self-employed subgroups in real estate and construction, with their paid-employed counterparts experiencing only modest declines. Despite these large income shocks, exit rates from self-employment showed little change during the Great Recession, either in aggregate or in the cyclically sensitive sectors. Entry rates were likewise stable, and the share of entrepreneurs in the population did not decline over the full sample period.&lt;/p&gt;
&lt;p&gt;Q8: Does the evidence support collateral constraints as a binding barrier to entrepreneurial entry?
No. The paper tests the hypothesis, standard in the liquidity-constraints literature, that entry rates should be higher for homeowners experiencing house-price appreciation (which raises collateral value). The IRS data do not support this prediction. Separately, comparing asset incomes (interest, dividends, capital gains) of current entrants and future entrants with the same characteristics, the paper finds that most current entrants have lower asset incomes and less liquid wealth than those who switch later, which also argues against a liquid-wealth precondition for entry.&lt;/p&gt;
&lt;p&gt;Q9: What does prior labor income reveal about why people enter self-employment?
Current entrants have higher prior labor income than matched future entrants with the same characteristics, indicating they enter with accumulated on-the-job experience rather than being pushed into self-employment as a fallback after failure in paid work. This is consistent with self-employment being a deliberate, experience-driven career transition for most entrants rather than a last resort for low earners. The paper interprets this as positive evidence for the role of experience-based human capital in driving entrepreneurial choice.&lt;/p&gt;
&lt;p&gt;Q10: How do founders finance startup costs if most have negative business net income in early years?
Almost all founders in the sample report positive income on their personal (individual) tax form in the first year of operation, even though most report negative business net income and carry no external debt financing. This pattern suggests founders rely on personal income sources — prior savings, part-time paid employment, or spousal income — to cover startup costs rather than external debt, implying that formal credit-market financing constraints are not the primary barrier to entry for most entrants in the sample.&lt;/p&gt;
&lt;p&gt;Q11: What are the scope conditions and key limitations?
The sample covers pass-through owners (sole proprietors, partners, S corporation owners) and excludes C corporation shareholders, whose entrepreneurial income does not flow to individual returns until distributed. Income measures exclude most employer fringe benefits; capital gains are excluded from self-employment income, and the authors note their inclusion would strengthen the main findings. The analysis covers 2000–2015 for cohorts born 1950–1975, and income is reported before taxes and transfers. Baseline estimates are not adjusted for misreporting, though BEA-implied adjustments of 46 percent for unincorporated owners and 14 percent for S corporation owners would widen the income gaps further.&lt;/p&gt;
&lt;p&gt;Pass-through business owner: An individual who owns a sole proprietorship, partnership, or S corporation, such that business net income flows directly onto the owner&amp;rsquo;s personal tax return; excludes C corporation shareholders whose income appears only upon dividend or capital-gains distributions.&lt;/p&gt;
&lt;p&gt;Tried self-employment: The paper&amp;rsquo;s primary self-employed comparison group within the &amp;ldquo;primarily employed&amp;rdquo; category — individuals with any years in self-employment (including frequent switchers and those with most years in self-employment) — who collectively earn virtually all self-employment income.&lt;/p&gt;
&lt;p&gt;Group-specific age effect: The paper&amp;rsquo;s estimate of how individual income changes with age within a defined subgroup (determined by education, skill, industry, demographics, and employment history), identified by exploiting overlapping birth cohorts in the 16-year panel and separated from individual fixed effects and business-cycle time effects.&lt;/p&gt;
&lt;p&gt;Primarily employed: Individuals with at least 12 of 16 sample years in either self- or paid-employment, with at most one intermediate year of non-employment; the paper&amp;rsquo;s main analytical focus for life-cycle income comparisons.&lt;/p&gt;
&lt;p&gt;SOI Databank: The Statistics of Income Databank, a de-identified balanced panel combining SSA demographic records with IRS tax filing data for all living U.S. individuals with a Social Security number over 1996–2015; the paper&amp;rsquo;s primary data source providing Schedule C, K-1, W-2, and related filing information.&lt;/p&gt;
&lt;p&gt;Kelly skewness: A robust measure of distributional asymmetry used by the paper to characterize income growth; the paper reports that Kelly skewness of self-employed income changes exceeds that of paid-employed by approximately 0.1, indicating greater right-skewness in self-employment income dynamics.&lt;/p&gt;
&lt;p&gt;Non-pecuniary motivation subgroup: Primarily self-employed individuals who earn less on average than primarily paid-employed peers matched on observable characteristics, taken by the paper as consistent with non-wage job amenities (autonomy, flexibility) driving occupational choice; found to be 57 percent of primarily self-employed by count but earning only 16 percent of total self-employment income.&lt;/p&gt;</description></item><item><title>On the Optimal Design of a Financial Stability Fund</title><link>https://macropaperwarehouse.com/papers/on-the-optimal-design-of-a-financial-stability-fund/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/on-the-optimal-design-of-a-financial-stability-fund/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how to optimally design a Financial Stability Fund (Fund) for a union of sovereign countries that must simultaneously (i) prevent sovereign default, (ii) provide risk-sharing and consumption smoothing, (iii) respect countries&amp;rsquo; sovereignty (limited enforcement on both sides), (iv) address moral hazard from governments&amp;rsquo; non-contractable policy reform effort, and (v) never impose permanent transfers or incur undesired expected losses. The paper develops the formal theory of such a Fund and evaluates it quantitatively against an incomplete-markets economy with sovereign default (IMD), calibrated to euro area &amp;ldquo;stressed countries&amp;rdquo; (Greece, Italy, Portugal, Spain — the GIPS).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model Setup and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Fund is modeled as a long-term contract between a risk-neutral lender (the Fund) and a risk-averse, relatively impatient borrower (a small open-economy sovereign). The government maximizes lifetime utility over consumption, leisure, and effort, where effort is private information (non-contractable) and determines the distribution of future endogenous government expenditure shocks. Two-sided limited enforcement (LE) constraints govern the contract: the borrower&amp;rsquo;s constraint ensures the country never prefers autarky-with-default to staying in the Fund; the lender&amp;rsquo;s constraint ensures the Fund never prefers investing at the risk-free rate to continuing the contract. The lender&amp;rsquo;s constraint is set with Z = 0 in the benchmark, meaning the Fund never accepts any expected permanent transfers — no ex-ante or ex-post redistribution.&lt;/p&gt;
&lt;p&gt;Because LE and moral hazard (MH) constraints are forward-looking, standard dynamic programming cannot be applied directly. The paper uses recursive contracts (a Saddle-Point Functional Equation, SPFE) with a discounted relative Pareto weight x as the co-state variable. The SPFE characterizes the constrained-efficient allocation. The paper then proves two welfare theorems, providing a novel decentralization of the Fund contract as a recursive competitive equilibrium (RCE) with state-contingent long-term bonds, Pigouvian taxes on Arrow securities (budget-neutral in equilibrium), and endogenous borrowing limits.&lt;/p&gt;
&lt;p&gt;The benchmark (IMD) economy features long-term non-contingent defaultable debt modeled following Chatterjee–Eyigungor, with asymmetric default penalties and probabilistic market re-entry after default (λ = 0.264). Both economies are calibrated to GIPS data for 1980–2015 using a panel Markov regime-switching AR(1) productivity process with three regimes (crisis, intermediate, normal). Key parameters: β = 0.929, r = 2.48%, δ = 0.814, κ = 0.083, labor share α = 0.566.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Borrowing capacity&lt;/strong&gt;: The Fund supports a long-run average debt-to-GDP ratio of 191 percent, compared with 78.6 percent in the IMD economy — more than double — while eliminating default episodes entirely. At the state-level, the maximum debt capacity of the Fund ranges from roughly 99–293 percent of GDP across states, versus 1.6–184 percent in the IMD economy; capacity in bad states (low θ, high g) under the IMD falls to under 2 percent, while the Fund can absorb close to 100 percent even in the worst state.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consumption volatility&lt;/strong&gt;: The relative volatility of consumption to output falls from 139 percent in the IMD economy to 36 percent under the Fund, reflecting greatly improved risk sharing through state-contingent payments.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Primary surplus co-movement&lt;/strong&gt;: The cyclical correlation of the primary surplus with output rises from 0.23 (mildly procyclical — consistent with some consumption smoothing but limited by borrowing constraints and default risk) in the IMD to 0.94 under the Fund, enabling counter-cyclical primary deficits during crises.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Effort&lt;/strong&gt;: The long-run mean effort is 17 percent higher under the Fund than in the IMD economy in normal times, reflecting the Fund&amp;rsquo;s long-horizon incentive structure. However, during a crisis, effort is lower under the Fund than under the IMD — the Fund deems high effort in a crisis not part of the efficient allocation, in contrast to the IMD where spreads and borrowing constraints impose austerity-like discipline.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Welfare gains&lt;/strong&gt;: Starting from zero initial debt, the consumption-equivalent steady-state average welfare gain of the Fund is approximately 8.5 percent (ergodic mean-weighted), ranging from 7.0 percent in the best state (high θ, low g) to 10.3 percent in the worst state (low θ, high g). In a counterfactual crisis simulation initialized at pre-crisis GIPS levels (70 percent debt-to-GDP, 0.8 percent spread), the welfare gain rises to approximately 10.59 percent in consumption-equivalent terms, exceeding the zero-debt benchmark of 8.57 percent for the same shock state.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Welfare decomposition&lt;/strong&gt;: For the two worst-shock states examined, higher debt capacity (channel iii) and state-contingent insurance (channel iv) together account for more than 90 percent of total welfare gains — specifically, 63.65 percent and 28.10 percent for (θl, gh), and 51.92 percent and 41.39 percent for (θl, gl), respectively. The direct costs of default (output penalty and market exclusion) together contribute less than 10 percent of total gains.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Spreads&lt;/strong&gt;: The IMD economy generates positive spreads reflecting default risk. The Fund economy generates only non-positive spreads in equilibrium — negative spreads arise when the lender&amp;rsquo;s limited enforcement constraint is binding (i.e., when continuing to lend risks permanent Fund losses, so the Fund restrains the borrower). This negative spread is interpretable as a Debt Sustainability Analysis signal.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Calibration is to GIPS countries over 1980–2015. The Fund assumes full exclusivity (absorbs all sovereign debt). A follow-up paper by other authors shows similar welfare gains hold when only a minimal fraction of debt is absorbed. The benchmark sets Z = 0 (no solidarity transfers); relaxing Z &amp;lt; 0 would allow greater risk sharing. The borrower is strictly more impatient than the lender (η = β(1+r) = 0.9684 &amp;lt; 1).&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two limited enforcement (LE) constraints in the Fund contract, and what do they individually prevent?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The borrower&amp;rsquo;s LE constraint (constraint 1) ensures the country&amp;rsquo;s continuation value under the Fund always weakly exceeds its outside option V°(s) — the value of defaulting and entering incomplete markets as a defaulter. This prevents the borrower from reneging on the Fund contract. The lender&amp;rsquo;s LE constraint (constraint 3) ensures the Fund&amp;rsquo;s expected net present value of transfers never falls below Z (set to 0 in the benchmark), preventing the Fund from making permanent expected losses. Together, these two constraints define an interval [x(s), x̄(s)] for the relative Pareto weight within which both parties remain voluntarily in the contract.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does moral hazard enter the model, and what is the key assumption enabling the first-order-condition (FOC) approach?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Government effort e ∈ [0,1] is non-contractable; it shifts the distribution of future government expenditure shocks g in a first-order stochastically dominant direction (higher effort → lower expected g). The incentive compatibility constraint (ICC, constraint 2) imposes that the marginal cost of effort v′(e) equals the marginal benefit in terms of expected future utility changes. The FOC approach is validated by Assumption 1 (monotone likelihood ratio condition on the g-shock transition, and convexity of the CDF with respect to effort), which guarantees the ICC is sufficient as well as necessary. Without this assumption, the full optimization problem would need to replace the ICC, making the recursive formulation substantially more complex.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the paper achieve a recursive formulation despite forward-looking LE and MH constraints?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper uses the saddle-point Lagrangian approach (following Marcet–Marimon). Rather than tracking the full history of constraints, it introduces a discounted relative Pareto weight x ≡ [β(1+r)]^t · (µ_b,t / µ_l,t) as the sufficient co-state variable. The law of motion for x adjusts at each state realization: the borrower&amp;rsquo;s LE multiplier ν_b raises x (rewards the borrower), the lender&amp;rsquo;s LE multiplier ν_l lowers x (restrains the borrower), and the MH multiplier ρ̺ shifts x up or down depending on whether the realized g provides a positive or negative signal about effort (monotone likelihood ratio). This collapses the problem to a stationary Saddle-Point Functional Equation (SPFE) in (x, s).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the key properties of the optimal Fund allocation characterized in the paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (i) When neither LE constraint binds, consumption increases with x and is constant in s (perfect Pareto weight-determined risk sharing), labor supply is undistorted and increases in θ, and x declines over time due to borrower impatience (η &amp;lt; 1). (ii) When the borrower&amp;rsquo;s LE binds (x ≤ x̄(s)), consumption, labor, and x are pinned at x̄(s) and the borrower is prevented from receiving less. (iii) When the lender&amp;rsquo;s LE binds (x ≥ x̄(s)), the same constancy holds and the lender is prevented from being overexposed. Moral hazard introduces state-contingency in the inter-period evolution of x even when neither LE binds, via the likelihood ratio term. The paper shows that immiseration (consumption converging to zero) is prevented by the borrower&amp;rsquo;s LE constraint, even in the presence of moral hazard.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the modified inverse Euler equation in this model, and how does it differ from standard formulations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the standard pure moral hazard problem, the inverse of the marginal utility process is a positive supermartingale, leading to immiseration (consumption converging to zero) when the borrower is impatient. In this model with two-sided LE and MH, the inverse Euler equation (Lemma 4, equation 21) has the form: E_s[{1/u′(c(x′,s′))} · {(1+ν_l)/(1+ν_b)}] = η · {1/u′(c(x,s))}. The LE multipliers truncate the supermartingale whenever borrower or lender constraints bind, recurrently preventing both immiseration and permanent lender losses. The MH constraint introduces state-contingent perturbations to the path of consumption (via likelihood ratios) even between binding episodes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the novel decentralization result, and why is it theoretically significant?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper provides two welfare theorems (Propositions 1 and 2). The Second Welfare Theorem shows that any constrained-efficient Fund contract can be decentralized as a recursive competitive equilibrium with: (a) long-term state-contingent (Arrow security) assets, (b) Pigouvian state-contingent taxes τ^a(s′) on Arrow securities — which are budget-neutral in equilibrium — where 1/(1+τ^a(s′)) = 1 + χ(x,s)·u′(c(x,s))·[∂_e π(s′|s,e)/π(s′|s,e)], and (c) endogenous borrowing limits &amp;ldquo;not too tight&amp;rdquo; relative to outside options. The First Welfare Theorem shows the reverse. This decentralization is novel because it handles both limited commitment and dynamic moral hazard simultaneously — prior work handled each in isolation. The taxes internalize the full social value of effort by creating a wedge between the borrower&amp;rsquo;s and lender&amp;rsquo;s intertemporal rates of substitution, removing the need to impose the ICC directly as a constraint in the competitive equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What drives the negative spreads in the Fund economy, and how do they differ from the positive spreads in the IMD economy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the IMD economy, positive spreads reflect the probability of default: the bond price embeds an expected default discount. In the Fund economy, default is eliminated by construction. Negative spreads arise when the lender&amp;rsquo;s LE constraint is binding in some future state s′ (i.e., ν_l(x′,s′) &amp;gt; 0): this means the borrower&amp;rsquo;s Pareto weight is so high that the Fund risks permanent losses by continuing to lend. The asset price equation (45) shows the Arrow security price equals the maximum of the borrower&amp;rsquo;s discounted marginal utility valuation and the risk-free discounted return — so when the lender&amp;rsquo;s constraint binds, the price is driven by the risk-free return (q(s′|s) = π(s′|s,e)·A(s′)/(1+r)), which generates a negative implicit spread. The negative spread acts as a DSA-like signal: the Fund is better off restraining lending in those states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the calibration match the GIPS data, and what is the main misfit?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The IMD economy is calibrated to average GIPS moments over 1980–2015 using a panel Markov regime-switching AR(1) for productivity (three regimes: crisis, intermediate, normal) and a three-state government expenditure process. The model matches well: average debt/GDP of 78.57 percent (data: 78.33), average spread of 4.17 percent (data: 4.15), labor moments, relative volatility of spreads (1.74 vs. 1.67 in data), government-output correlation (0.38 matches data), and relative volatility of the primary surplus (0.97 vs. 1.00 in data). The main misfit is the average primary surplus/GDP: the model generates a positive value (consistent with stationarity and debt servicing), while the data shows a slight deficit over the sample, plausibly reflecting growth expectations. The paper notes this level misfit does not compromise its core welfare-comparison results, since what matters is the relative time-series behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the Fund compare to the IMD economy in the crisis simulation initialized at pre-2008 GIPS conditions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The economy is initialized at 70 percent debt-to-GDP and 0.8 percent spread (consistent with 2005–2007 GIPS averages), then hit with a negative productivity and high government expenditure shock. In the IMD economy, this shock generates a wave of defaults (Figure 6), sharp spread increases (spreads spike, consistent with GIPS experience of 2009–2010 where spreads reached 4.04 percent on average), and a required increase in labor supply despite low productivity. Under the Fund, no defaults occur: instead, the country runs a large primary deficit financed by the state-contingent component of the Fund contract (debt actually falls under the Fund while rising in the IMD), consumption is higher than in the IMD for approximately the first 10 periods of the crisis, and labor supply is allowed to fall (consistent with efficiency). The welfare gain in this counterfactual is approximately 10.59 percent in consumption-equivalent terms, exceeding the zero-debt-initial-condition gain of 8.57 percent for the same shock state, demonstrating that welfare gains are amplified when the Fund takes over pre-existing debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the Fund affect effort incentives differently in normal times versus crisis times?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In normal times, the Fund provides better incentives for effort: long-run average effort is 17 percent higher under the Fund than in the IMD economy. The Fund&amp;rsquo;s long-term contract links future government expenditure outcomes directly to future lifetime utility via the law of motion for x (equation 5): low g realizations shift x upward (reward the borrower), creating forward-looking incentives. In crisis times, the Fund allows effort to fall relative to the IMD economy; the IMD imposes higher effort in bad states through spread increases and effective borrowing constraints that make budget relief through effort more valuable. The paper interprets this as the efficient outcome: &amp;ldquo;austerity&amp;rdquo; (high effort during a crisis) is not part of the constrained-efficient Fund allocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the welfare decomposition methodology, and what does it reveal about channels of welfare gain?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors construct a sequence of counterfactual IMD economies. Channel (i) removes the output penalty upon default, isolating its welfare cost: contributes 6.58 percent (θl, gh) and 5.31 percent (θl, gl) of total gain. Channel (ii) additionally removes market exclusion after default (immediate return): contributes 1.67 percent and 1.38 percent respectively. Channel (iii) solves counterfactual economies with the Fund&amp;rsquo;s state-specific endogenous borrowing limits but no default allowed, quantifying the value of greater debt capacity: contributes 63.65 percent and 51.92 percent. Channel (iv) is the residual attributable to state-contingent insurance payments: contributes 28.10 percent and 41.39 percent. The decomposition reveals that in the worst state (θl, gh), debt capacity dominates (63.65 percent), while in (θl, gl) — where the low government expenditure partially offsets low productivity — state-contingent insurance is relatively more important (41.39 percent). Together, channels (iii) and (iv) exceed 90 percent of total gains in both cases examined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Why is the Fund&amp;rsquo;s decentralization unlikely to emerge from private international capital markets?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Two reasons are given. First, private international lenders typically lack the legal authority to impose state-contingent taxes (τ^a(s′)) on domestic economies; these taxes are a necessary component of the decentralization to internalize the social value of effort. Second, even if such taxes were optimal from the joint perspective of borrower and lender, the borrower has no unilateral incentive to impose them given market conditions — the taxes are only individually rational within the Fund&amp;rsquo;s constrained-efficient contract. This provides a rationale for an institutional implementation of the Fund rather than reliance on decentralized sovereign debt markets.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Financial Stability Fund (Fund)&lt;/strong&gt;: A long-term partnership contract between a risk-neutral lender (the Fund) and a risk-averse sovereign borrower, designed to provide risk-sharing and consumption smoothing through state-contingent transfers subject to two-sided limited enforcement and moral hazard constraints, without ever incurring expected permanent losses. Distinguished from standard lending by its long-term contingent structure and dual role as risk-sharing mechanism and crisis-resolution tool.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Two-sided limited enforcement (LE) constraints&lt;/strong&gt;: Forward-looking constraints in the Fund contract that prevent either party from reneging. The borrower&amp;rsquo;s LE constraint ensures the contract always delivers at least as much lifetime utility as defaulting and entering incomplete debt markets. The lender&amp;rsquo;s LE constraint (with Z = 0 in the benchmark) ensures the Fund never accumulates a negative expected net present value from its contractual obligations — i.e., no permanent transfers occur. Both constraints are binding recurrently in the long-run ergodic set.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Moral hazard (MH) / incentive compatibility constraint (ICC)&lt;/strong&gt;: The constraint arising from the fact that government policy reform effort e is non-contractable (sovereign right). The ICC requires that the marginal cost of effort v′(e) equals the marginal lifetime benefit, which depends on the likelihood ratio of future shocks with respect to effort. The Fund contract provides long-horizon performance-based rewards and punishments (via the law of motion of the relative Pareto weight x) to induce efficient effort, without imposing ex-ante austerity conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Discounted relative Pareto weight (x)&lt;/strong&gt;: The key co-state variable in the recursive formulation, defined as x_t = [β(1+r)]^t · (µ_b,t / µ_l,t), where µ_b and µ_l are the time-varying Pareto weights of borrower and lender. It captures the entire history of binding constraints and serves as the state variable summarizing the borrower&amp;rsquo;s &amp;ldquo;entitlement&amp;rdquo; in the contract. Declines over time due to borrower impatience (η = β(1+r) &amp;lt; 1), but is upward-adjusted when the borrower&amp;rsquo;s LE constraint binds, and shifts state-contingently due to MH likelihood ratios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Saddle-Point Functional Equation (SPFE)&lt;/strong&gt;: The recursive formulation of the Fund contracting problem (equation 6), analogous to Bellman&amp;rsquo;s equation but for saddle-point (min-max) problems. Required because standard dynamic programming fails when constraints are forward-looking; solved by the Marcet–Marimon recursive contract approach. The SPFE characterizes the constrained-efficient Fund allocation as a function of the co-state x and exogenous state s.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incomplete markets with default (IMD) economy&lt;/strong&gt;: The benchmark comparison economy in which the sovereign borrows via non-contingent long-term defaultable bonds (parameterized by maturity δ and coupon κ), with asymmetric output penalties upon default and probabilistic market re-entry. Calibrated to GIPS countries 1980–2015. Generates positive spreads that reflect default risk; serves as both the status quo and the source of the borrower&amp;rsquo;s outside option V°(s) in the Fund contract.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pigouvian Arrow security taxes&lt;/strong&gt;: State-contingent taxes τ^a(s′) on Arrow security holdings, defined by 1/(1+τ^a(s′)) = 1 + χ(x,s)·u′(c)·[∂_e π/π], introduced in the decentralization of the Fund contract. These taxes create a wedge between the borrower&amp;rsquo;s and lender&amp;rsquo;s intertemporal rates of substitution to internalize the full social value of non-contractable effort. Budget-neutral in equilibrium: the government&amp;rsquo;s lump-sum transfer τ(s) exactly offsets expected tax revenue.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt Sustainability Analysis (DSA) interpretation&lt;/strong&gt;: The paper interprets the lender&amp;rsquo;s LE constraint (Z = 0) as a Fund-level DSA: it sets the boundary beyond which the contract would embed permanent transfers. A negative spread in the Fund economy signals that the lender&amp;rsquo;s LE constraint is binding in some future state — a DSA warning that the Fund is better off investing at the risk-free rate rather than extending more credit.&lt;/p&gt;</description></item><item><title>Online Business Models, Digital Ads, and User Welfare</title><link>https://macropaperwarehouse.com/papers/online-business-models-digital-ads-and-user-welfare/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/online-business-models-digital-ads-and-user-welfare/</guid><description>&lt;p&gt;Acemoglu, Huttenlocher, Ozdaglar, and Siderius develop a two-sided platform model to study the welfare consequences of digital advertising as an online business model. The platform intermediates between a firm selling a horizontally differentiated product and a continuum of users who derive utility from both entertaining content and informative signals about product quality embedded in ads. Users have a two-dimensional type: a sophistication dimension (sophisticated with probability lambda, naïve with probability 1-lambda) and a product-quality dimension (high quality with prior probability q). The central departure from the standard informational-advertising literature is that sophisticated users hold the correct model of the ad signal process, while naïve users underestimate the false-positive rate — the probability that a low-quality product generates a positive ad signal (phi_0). Naïve users perceive this false-positive rate to be phi_{0,N} = omega_N * omega_P * phi_0, where omega_N &amp;lt;= 1 captures inherent naïveté and omega_P &amp;lt;= 1 captures failure to understand personalized targeting, so phi_{0,N} &amp;lt; phi_0. The equilibrium concept is Berk-Nash equilibrium (Esponda and Pouzo 2016), meaning all agents are Bayesian given their subjective model.&lt;/p&gt;
&lt;p&gt;The platform chooses ad load alpha (Poisson rate of ad displays), subscription fees, and the monetary transfer from the firm; the firm sets product price p after observing the platform&amp;rsquo;s contract. The central finding (Proposition 2) is that when the objective false-positive rate phi_0 exceeds a threshold phi-hat_0(lambda, phi_1, phi_{0,N}) — which is increasing in lambda and phi_{0,N} and decreasing in the true-positive rate phi_1 — the unique equilibrium is an advertising-based plan that fully segments the market: naïve users receive an ad load that extracts all their surplus, while sophisticated users are excluded entirely. In this regime the firm charges a strictly higher price p-hat* &amp;gt; p-bar*, where p-bar* = (beta*q + c)/2 is the monopoly price without advertising. The ad-based equilibrium emerges precisely when ads are more misleading (larger gap between phi_0 and phi_{0,N}), not when they are more informative — a comparative static the authors describe as paradoxical.&lt;/p&gt;
&lt;p&gt;Welfare consequences (Proposition 4) are unambiguous in the advertising regime: both naïve and sophisticated users are strictly worse off than the baseline without any platform. Naïve users over-purchase due to inflated posteriors from misread signals; sophisticated users are harmed through the price channel — the firm&amp;rsquo;s higher profit-maximizing price p-hat* applies to all buyers. In the fully rational benchmark (phi_{0,N} = phi_0), the unique equilibrium is subscription-based and user welfare equals the no-platform baseline (Proposition 3).&lt;/p&gt;
&lt;p&gt;These results extend to richer menus (Proposition 5), mixed subscription-plus-advertising plans (Proposition 7), and to multi-firm and multi-platform competition (Propositions 9-12). Digital ads soften Bertrand competition by generating endogenous horizontal differentiation among otherwise identical firms, so equilibrium prices can exceed marginal cost even with two competing firms. Platform competition similarly fails to restore welfare: platforms compete away subscription fees but both adopt ad-based plans targeting naïfs when phi_1 exceeds a threshold, maintaining the welfare loss.&lt;/p&gt;
&lt;p&gt;On policy, the first best (planner observes types) cannot be decentralized because naïve users prefer more ads than is socially optimal, inverting the usual self-selection constraint. The second best (planner subject to incentive-compatibility constraints) is a single pooling plan with an intermediate ad load alpha^{SB} in [alpha^{FB}_N, alpha^{FB}_S] and yields average welfare above the no-platform baseline, though below first best (Proposition 13). This second best can be decentralized with a nonlinear digital ad tax, a per-unit product subsidy, and a platform subscription subsidy (Proposition 14). A simpler flat tax on digital ad revenues — above a threshold gamma-bar &amp;lt; 1 — also improves welfare relative to the ad-based equilibrium, though it does not restore the second best (Proposition 15).&lt;/p&gt;
&lt;p&gt;Four robustness extensions are developed: endogenous manipulation (platform always chooses the most manipulative environment, lowest phi_{0,N}); naïve learning dynamics (learning raises the sophisticate share in steady state, making ad-based models less profitable but not overturning the main results); imperfect price discrimination by the firm (naïfs are unambiguously worse off, threshold for advertising equilibrium shifts down); and an added price-sensitivity dimension (the platform runs a 2x2 menu separating by both sophistication and price sensitivity, preserving the result that naïve users tolerate and receive more ads than sophisticates in every stratum).&lt;/p&gt;
&lt;p&gt;Q: What is the key asymmetry between naïve and sophisticated users that drives the main results?
A: Sophisticated users hold the correct Bayesian model of the ad signal process and thus correctly account for the false-positive rate phi_0 when updating beliefs from positive ad signals. Naïve users perceive the false-positive rate as phi_{0,N} = omega_N * omega_P * phi_0 &amp;lt; phi_0, so they treat positive signals as stronger evidence of high product quality than they actually are. Because naïve users overestimate the informativeness of ads, their (interim) subjective valuation of an ad-based plan is higher, making them more tolerant of ad loads and more willing to join platforms with heavy advertising. This asymmetry is what makes it profitable to target naïfs with high ad loads while excluding or charging subscription fees to sophisticates.&lt;/p&gt;
&lt;p&gt;Q: Why does advertising to sophisticated users generate no additional firm profit, while advertising to naïve users does?
A: Lemma 1 establishes that with linear-quadratic utility the firm extracts no surplus from advertising to sophisticates: because sophisticated agents are fully Bayesian, their expected posterior equals the prior (E_S[pi_i] = q), so expected demand after advertising is identical to demand before advertising. By contrast, Lemma 2 shows that the firm&amp;rsquo;s profit from naïve agents is positive and strictly increasing in ad load alpha, because naïve users&amp;rsquo; average demand curve drifts upward as alpha rises — their inflated perceived informativeness of ads causes them to over-update on positive signals, systematically raising their willingness to pay. The platform captures this surplus from the firm via the advertising transfer m*.&lt;/p&gt;
&lt;p&gt;Q: What is the threshold condition determining whether the equilibrium is subscription-based or advertising-based?
A: Proposition 2 identifies a threshold phi-hat_0(lambda, phi_1, phi_{0,N}) that is increasing in the sophisticate share lambda and in the naïve false-positive perception phi_{0,N}, and decreasing in the true-positive rate phi_1. When the objective false-positive rate phi_0 is below this threshold, the profit-maximizing business model is subscription-based with price P* = T - v and product price p* = p-bar* = (beta&lt;em&gt;q + c)/2. When phi_0 exceeds the threshold, the advertising model dominates: the platform sets a high ad load alpha-hat&lt;/em&gt; that makes naïve users exactly indifferent between participating and their outside option v, excludes sophisticates, and the firm charges p-hat* &amp;gt; p-bar*. The threshold falls with phi_1, meaning more informative ads expand the range of phi_0 over which the advertising equilibrium obtains.&lt;/p&gt;
&lt;p&gt;Q: How does allowing the platform to offer menus change the results relative to the baseline two-plan case?
A: Proposition 5 shows that with menus the platform can simultaneously serve both user types: sophisticates receive a subscription plan at P* = T - v and naïve users receive an ad-based plan with the same high load alpha-hat* as in the baseline. The threshold for the advertising equilibrium shifts down to phi*&lt;em&gt;0(lambda, phi_1, phi&lt;/em&gt;{0,N}) &amp;lt; phi-hat_0, so advertising business models arise for a strictly larger set of parameters. Welfare consequences are unchanged (Corollary 1): when phi_0 &amp;gt; phi*_0, both types have welfare strictly below the no-platform baseline. Proposition 6 further shows consumer welfare is monotonically decreasing in both phi_0 and phi_1: higher phi_1 (more informative true-positive signals) also reduces welfare because any surplus from greater informativeness is fully captured by the platform.&lt;/p&gt;
&lt;p&gt;Q: What is the welfare ranking across the three regimes: no platform, advertising equilibrium, and subscription equilibrium?
A: In the subscription equilibrium (regime (a) of Proposition 2 or 4), user welfare for both types equals the no-platform base case W_base(tau) — the platform captures all surplus it creates and users are no better or worse off. In the advertising equilibrium (regime (b)), both naïve and sophisticated users are strictly worse off than with no platform: W-hat*(tau) &amp;lt; W_base(tau) for both tau in {S, N}. The first-best, where a planner controls ad loads separately by type, yields W^{FB}(tau) &amp;gt; W_base(tau) for both types because informative ads can genuinely improve sophisticated users&amp;rsquo; decisions and a constrained amount improves naïve users&amp;rsquo; decisions too.&lt;/p&gt;
&lt;p&gt;Q: How does firm-level competition interact with digital advertising to affect prices and welfare?
A: Without advertising, two ex ante identical firms compete à la Bertrand and price at marginal cost (p*_1 = p*_2 = c). Proposition 9 establishes that when phi_1 &amp;gt; phi^F_1 and phi_0 &amp;gt;= phi^F_0(phi_1), the platform offers an ad-based plan and equilibrium prices p-hat*_1 and p-hat*_2 are both strictly above p-bar* — the monopoly price without advertising. The mechanism is endogenous horizontal differentiation: users who see positive ad signals for one firm&amp;rsquo;s product form higher valuations for that product, so the two products become differentiated in the eyes of consumers even though they are ex ante identical, breaking Bertrand logic. Example 1 further illustrates that advertising can be more prevalent with competition than without: a second firm&amp;rsquo;s entry can push the equilibrium from no-advertising to separating.&lt;/p&gt;
&lt;p&gt;Q: Does platform competition protect users from the welfare losses associated with digital advertising?
A: Not fully. Proposition 11 shows that with two competing platforms (M=2, N=1) and no advertising, platforms compete away both subscription fees and ad loads, and welfare reaches the fully rational benchmark. However, when phi_1 exceeds threshold phi^P_1, both platforms adopt ad-based plans targeting naïve users, charge no subscription fees, and the product price rises to p-hat*_P &amp;gt; p-bar* (Proposition 12). Competition reduces subscription fees to zero but does not eliminate the incentive to target naïfs with heavy ads, because naïve users&amp;rsquo; over-valuation of ads means they remain willing to join ad-heavy plans. The fundamental inefficiency from naïve users&amp;rsquo; misspecified model persists under platform competition.&lt;/p&gt;
&lt;p&gt;Q: Why is the first-best allocation not implementable as a decentralized equilibrium?
A: Proposition 13 explains the obstacle: the social planner would ideally offer naïve users fewer ads (alpha^{FB}_N) than sophisticated users (alpha^{FB}_S), with alpha^{FB}_N &amp;lt;= alpha^{FB}_S. However, naïve users have a higher subjective valuation for ads than sophisticates because they believe ads are more informative. If offered a menu with both options, naïve users would self-select into the plan with the higher ad load alpha^{FB}_S — the exact opposite of what the planner wants. The incentive-compatibility constraints therefore force the planner toward a single pooling plan with an intermediate ad load alpha^{SB} in [alpha^{FB}_N, alpha^{FB}_S]. Average welfare under the second best exceeds the no-platform baseline, confirming that some advertising is socially valuable, but falls short of the first best whenever alpha^{FB}_N &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;Q: How does a flat digital ad tax improve welfare, and what are its limitations?
A: Proposition 15 establishes that whenever the equilibrium features an ad-based plan, a flat tax on digital ad revenues at rate gamma &amp;gt; gamma-bar &amp;lt; 1 improves welfare by discouraging advertising-based business models and inducing the platform to shift toward subscription-based plans. The mechanism is that taxing ad revenue reduces the platform&amp;rsquo;s marginal gain from increasing ad load, making the subscription plan relatively more profitable. However, the flat tax does not achieve the second best because it operates linearly rather than targeting the nonlinear distortion: the optimal nonlinear tax-subsidy scheme (Proposition 14) requires a threshold-style ad tax at rate mu &amp;gt; mu-bar combined with a per-unit product subsidy delta* and a platform subscription subsidy eta &amp;gt; eta-bar.&lt;/p&gt;
&lt;p&gt;Q: What happens when the platform can endogenously choose how manipulative its ads are?
A: Proposition 16 shows that a profit-maximizing platform always chooses the lowest feasible phi_{0,N} = phi-bar — the most manipulative environment. Two reinforcing channels drive this: the pricing channel (lower phi_{0,N} amplifies naïve demand shifts per positive signal, so the downstream firm raises price and sales, increasing ad revenues extracted by the platform) and the participation channel (lower phi_{0,N} raises naïve users&amp;rsquo; perceived informational value of ads, relaxing their participation constraint and permitting a higher ad load alpha). Platform competition constrains the equilibrium ad load through tighter participation constraints but does not alter the choice of phi_{0,N} = phi-bar, so competition limits ad quantity but not ad manipulativeness.&lt;/p&gt;
&lt;p&gt;Q: How do naïve learning dynamics affect the main results?
A: Proposition 17 introduces a birth-death environment where exposure to disconfirming evidence gradually converts naïve agents to sophisticates. A unique steady-state sophisticate share lambda*(alpha_N, phi_0) exists; both higher ad load alpha_N and higher phi_0 accelerate the conversion of naïfs, raising future sophisticate share and reducing future ad revenues. This creates a new intertemporal trade-off that constrains the platform&amp;rsquo;s choice of ad loads relative to the static case. The key result (part ii) is that the main characterization of Proposition 7 carries through under a modified cutoff phi-tilde^{dynamic}&lt;em&gt;0 &amp;gt;= phi-tilde_0(lambda-tilde, phi_1, phi&lt;/em&gt;{0,N}), so learning dynamics make the ad-based business model less likely but do not overturn the fundamental welfare results.&lt;/p&gt;
&lt;p&gt;Q: How does imperfect price discrimination by the firm affect naïve users?
A: Proposition 18 considers a firm that observes a user&amp;rsquo;s sophistication type with probability kappa in [0,1]. With price discrimination, the firm sets type-specific prices satisfying p*_N &amp;gt;= p* &amp;gt;= p*_S, moving toward the type-specific monopoly levels. Naïfs are unambiguously worse off: when identified (with probability kappa), they face the higher price p*_N and a higher equilibrium ad load. The threshold for the advertising equilibrium also shifts down relative to the baseline, meaning advertising business models emerge for a larger parameter range when price discrimination is possible.&lt;/p&gt;
&lt;p&gt;Q: How does the paper define and measure user welfare, and why is ex post rather than interim welfare the relevant concept?
A: User welfare W(tau_i) is defined as ex post utility, which depends on the actual product quality theta_i realized after consumption, not on interim beliefs formed after viewing ads. Naïve users&amp;rsquo; interim assessment inflates expected product quality, but their ex post utility depends on whether the product is genuinely high quality for them (theta_i = 1 with probability q, theta_i = 0 with probability 1-q). Because naïve users over-purchase due to misread signals — consuming more than optimal when theta_i = 0 — their ex post utility is strictly lower than their interim expected utility, and strictly lower than the no-platform baseline in the advertising equilibrium. The ex post welfare concept is the relevant one precisely because it captures the actual material consequences of manipulation, not the subjectively perceived gains from ads.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Naïve vs. Sophisticated Users&lt;/strong&gt;: The paper&amp;rsquo;s primary user heterogeneity dimension. Sophisticated users hold the correct model of the ad signal process, setting phi_{0,S} = phi_0 (the true false-positive rate). Naïve users hold a misspecified model with phi_{0,N} = omega_N * omega_P * phi_0 &amp;lt; phi_0, underestimating the probability that a low-quality product generates a positive ad signal, due to inherent naïveté (omega_N) and failure to understand personalized targeting (omega_P).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ad Load (alpha)&lt;/strong&gt;: The Poisson rate at which ads are displayed to a user per unit time. Total ad displays follow a Poisson(alpha*T) distribution. Higher ad load means less time on entertaining content — expected entertainment time is (1-alpha)&lt;em&gt;T — and a higher probability (1 - exp(-alpha&lt;/em&gt;T)) that the user sees the ad at least once. The platform chooses alpha as its primary instrument for extracting surplus from naïve users.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;False-Positive Rate (phi_0)&lt;/strong&gt;: The objective probability that a low-quality product (theta_i = 0) generates a positive (&amp;ldquo;good&amp;rdquo;) ad signal. The gap between phi_0 (objective) and phi_{0,N} (naïve users&amp;rsquo; perceived rate) is the key parameter driving all welfare results: a larger gap implies greater de facto manipulation and a stronger incentive for the platform to adopt an advertising-based model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Berk-Nash Equilibrium&lt;/strong&gt;: The solution concept from Esponda and Pouzo (2016), used to model agents with misspecified subjective models. All agents are Bayesian conditional on their own subjective model. Sophisticates&amp;rsquo; subjective model equals the objective model (standard Bayesian), while naïfs update using the misspecified phi_{0,N}. Perfection requires sequential rationality at each information set given beliefs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;De Facto Manipulation&lt;/strong&gt;: The paper&amp;rsquo;s term for a situation in which the platform and firm exploit naïve users&amp;rsquo; misspecified model to boost demand and extract surplus, without requiring any outright deception in the formal sense. It arises because naïve users voluntarily choose high-ad-load plans (believing ads to be highly informative) and voluntarily over-purchase (having updated on what they mistakenly think are strong positive signals). The manipulation is &amp;ldquo;de facto&amp;rdquo; because it operates through the users&amp;rsquo; own rational (but misspecified) decision-making.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Separating Equilibrium&lt;/strong&gt;: An equilibrium in which naïve and sophisticated users self-select into distinct platform plans. In the advertising equilibrium, naïve users join an ad-heavy plan (extracting all their surplus via inflated willingness to pay for ads) while sophisticated users are either excluded or placed on a subscription plan. This separation is the vehicle through which the platform maximizes revenue from naïf manipulation while limiting the disciplining force of sophisticates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Allocation&lt;/strong&gt;: The welfare-maximizing allocation subject to the incentive-compatibility constraints that users self-select into plans. Because naïve users prefer more ads than sophisticated users (the inverse of what the planner desires), the second best is a single pooling plan with an intermediate ad load alpha^{SB} in [alpha^{FB}_N, alpha^{FB}_S]. This is strictly worse than the first best but achieves average welfare above the no-platform baseline, and can be decentralized with a nonlinear ad tax, product subsidy, and platform subscription subsidy.&lt;/p&gt;</description></item><item><title>Open Rule Legislative Bargaining</title><link>https://macropaperwarehouse.com/papers/open-rule-legislative-bargaining/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/open-rule-legislative-bargaining/</guid><description>&lt;p&gt;This paper revisits the open rule legislative bargaining model of Baron and Ferejohn (1989) — the dominant workhorse model in political economy for analyzing how legislatures divide a surplus — and provides a more complete characterization of its stationary equilibria. The core research question is whether the equilibrium typically cited in the literature as the &amp;ldquo;open rule equilibrium&amp;rdquo; is actually the unique equilibrium, or whether it rests on implicit and unstated assumptions that, once relaxed, reveal a much richer equilibrium set.&lt;/p&gt;
&lt;p&gt;The model features n=3 negotiators dividing a surplus normalized to one, operating under simple majority rule (2 of 3 votes required). The common discount factor is Delta in (0,1). In each period, a proposer is selected uniformly at random; under the open rule, an amender is then selected uniformly at random from the two non-proposers and may either accept or counter-propose. Sincere voting determines the outcome. The authors analyze stationary subgame perfect equilibria (SSPE), in which strategies depend only on current role, not history.&lt;/p&gt;
&lt;p&gt;The existing literature implicitly adopted what the authors call the &amp;ldquo;standard assumption&amp;rdquo;: when given the opportunity to amend, the amender proposes the same allocation she would propose as a proposer in a closed rule game. Under this assumption, the unique SSPE has the proposer receiving share 1-Delta and each of the other two negotiators receiving Delta/2 (in the Pareto-efficient equilibrium). The literature treated this as the definitive open rule solution.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s first main result is that this standard-assumption equilibrium is indeed a valid SSPE, but it is not the only one. The key mechanism generating multiplicity is the treatment of off-path behavior: what the amender does when the proposer deviates to a non-equilibrium proposal. With n=3, a deviating proposer can exploit the structure so that the amender becomes a &amp;ldquo;free&amp;rdquo; coalition member — the proposer does not need to buy the amender&amp;rsquo;s vote separately, because the amender is already included in the majority once she counter-proposes. This expands the set of credible threats and supports a continuum of additional Pareto-undominated SSPEs.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s second main result characterizes the broader equilibrium set: all Pareto-undominated SSPEs belong to a class in which the proposer offers (1-Delta) to herself and equal shares to both other negotiators. In the non-standard equilibria, the amender always amends, generating equilibrium delay — agreements are not reached immediately, and payoffs are discounted by Delta^(t-1) for each period of delay.&lt;/p&gt;
&lt;p&gt;The third main result is that among all Pareto-undominated SSPEs, the unique Pareto-efficient one is the standard-assumption equilibrium (no delay). All other equilibria involve delay and are therefore Pareto-inferior in expectation.&lt;/p&gt;
&lt;p&gt;The institutional design implication reverses a widely held view: the open rule was thought to promote more egalitarian allocations relative to the closed rule. The authors show this is not the case for Pareto-efficient equilibria. The Pareto-efficient open rule equilibrium is actually a special case of the closed rule equilibrium — the proposer captures 1-Delta and offers Delta to the coalition. More broadly, open rule bargaining tends to generate longer equilibrium delays and less egalitarian surplus allocations than previously predicted by Baron and Ferejohn. Scope conditions: the formal analysis is restricted to n=3 negotiators; generalization to larger legislatures is noted as an open direction.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;standard assumption&amp;rdquo; and why does the existing literature rely on it?&lt;/p&gt;
&lt;p&gt;A: The standard assumption holds that when an amender gets the opportunity to counter-propose, she proposes the same allocation she would choose if she were the proposer in a closed rule game. The existing open rule literature — including Baron and Ferejohn (1989), Jackson and Morelli (2004), Baron (2012), van Weelden (2013), and Austen-Smith and Banks (1999) — accepted this assumption implicitly, treating the resulting equilibrium as the unique open rule equilibrium. The assumption sidesteps the question of off-path behavior: what happens when the proposer deviates to a non-equilibrium proposal that the amender would want to amend. Because deviations are resolved within the same bargaining session under the open rule, off-path specifications are consequential.&lt;/p&gt;
&lt;p&gt;Q: What is the unique SSPE under the standard assumption, and what are its payoff implications?&lt;/p&gt;
&lt;p&gt;A: Under the standard assumption with n=3 and discount factor Delta, the unique SSPE has the proposer receiving a share of 1-Delta of the surplus and each of the other two negotiators receiving Delta/2. There is no delay: the proposal passes immediately in the period it is made. This equilibrium is Pareto-efficient relative to all other stationary equilibria identified in the paper.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism by which the equilibrium set is larger than the standard assumption predicts?&lt;/p&gt;
&lt;p&gt;A: With n=3, when a proposer deviates to a non-equilibrium proposal, the amender — who responds by counter-proposing — automatically becomes part of the passing coalition without the proposer needing to separately compensate her. This makes the amender a &amp;ldquo;free&amp;rdquo; coalition member in the deviation subgame, which changes the cost structure of deviations and expands the range of proposals the proposer can credibly make. Consequently, a wider set of strategies by the amender can be sustained as equilibrium responses, yielding a continuum of additional Pareto-undominated SSPEs beyond the standard-assumption equilibrium.&lt;/p&gt;
&lt;p&gt;Q: What do the non-standard equilibria look like in terms of proposals, delay, and payoffs?&lt;/p&gt;
&lt;p&gt;A: In the non-standard Pareto-undominated SSPEs, the proposer offers (1-Delta) to herself and equal shares (Delta/2 each) to the other two negotiators — note the proposer&amp;rsquo;s own share is the same as in the standard equilibrium, but the off-path behavior differs — and the amender always chooses to amend rather than accept. The amendment triggers a vote in which the amendment fails (or the process repeats), pushing resolution to the next period. This generates equilibrium delay: agreements take multiple periods to reach, and all payoffs are discounted by Delta^(t-1) per period of delay, making these equilibria Pareto-inferior to the no-delay equilibrium.&lt;/p&gt;
&lt;p&gt;Q: Which equilibrium is Pareto-efficient among all Pareto-undominated SSPEs, and why?&lt;/p&gt;
&lt;p&gt;A: The unique Pareto-efficient SSPE is the standard-assumption equilibrium, because it is the only one that involves no delay. All other Pareto-undominated SSPEs involve at least one period of delay, which destroys surplus through discounting (payoffs shrink by a factor of Delta per period). Since delay is costly for all negotiators and generates no compensating redistribution, any equilibrium with delay is Pareto-dominated by the no-delay equilibrium.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for the classic efficiency comparison between open and closed rules?&lt;/p&gt;
&lt;p&gt;A: The closed rule always generates an efficient outcome (no delay in SSPE). The open rule can also generate an efficient outcome — under the standard-assumption equilibrium — but uniquely admits a continuum of inefficient equilibria involving delay. Therefore the open rule is weakly dominated by the closed rule from an efficiency standpoint: at best it matches the closed rule (one efficient equilibrium), and at worst it generates costly delay. This reverses the common inference that open rule unambiguously improves outcomes.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for the classic fairness comparison between open and closed rules?&lt;/p&gt;
&lt;p&gt;A: The open rule was commonly believed to promote more egalitarian surplus divisions relative to the closed rule, which allows the proposer to extract a large share. The paper shows this view is misleading. In the Pareto-efficient open rule equilibrium, the proposer still captures 1-Delta — the same as under the closed rule — and the result is no more egalitarian. In the delay equilibria, the proposer does offer equal shares to both other negotiators, but this comes at the cost of inefficiency (delay). There is no Pareto-undominated open rule equilibrium that is both efficient and more egalitarian than the closed rule.&lt;/p&gt;
&lt;p&gt;Q: What is the class of &amp;ldquo;Pareto-undominated stationary strategies&amp;rdquo; and why does the paper focus on it?&lt;/p&gt;
&lt;p&gt;A: A stationary strategy profile is Pareto-undominated if no other stationary strategy profile gives every negotiator at least as high an expected payoff with at least one strictly better off. The paper focuses on this class to provide a tractable but principled selection criterion within the large set of SSPEs: it eliminates equilibria that are dominated from every player&amp;rsquo;s perspective, retaining only those that could plausibly arise if players coordinate on mutually beneficial outcomes. The characterization of this class reveals that equilibrium multiplicity is already substantial even after imposing this selection.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the formal results, and what is left open?&lt;/p&gt;
&lt;p&gt;A: The formal analysis is restricted to n=3 negotiators with simple majority rule (2 of 3 votes). The authors acknowledge that generalization to larger n is an important open question. The three-legislator case is the simplest non-trivial instance of the majority-rule bargaining problem, and the authors use it to isolate the mechanism cleanly. The model assumes sincere voting, a common discount factor Delta in (0,1), and stationary strategies.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to Baron and Ferejohn (1989)?&lt;/p&gt;
&lt;p&gt;A: Baron and Ferejohn (1989) originated both the closed rule and open rule bargaining frameworks and derived the standard-assumption equilibrium for the open rule. Subsequent literature (Eraslan 2002, Cho and Duggan 2003, 2009, Banks and Duggan 2000) extended various aspects of the B&amp;amp;F framework. The present paper takes the B&amp;amp;F open rule model as given but demonstrates that B&amp;amp;F&amp;rsquo;s open rule analysis was incomplete: it did not systematically address off-path behavior, and as a result the equilibrium it identified is not unique. The paper&amp;rsquo;s main contribution is to show that the B&amp;amp;F open rule predictions — more egalitarian allocations and prompt agreement — do not hold generally across the full equilibrium set.&lt;/p&gt;
&lt;p&gt;Open Rule: A bargaining protocol in which, after an initial proposal is made, a nominated amender may make a counter-proposal before a vote is taken; contrasted with the closed rule, under which the initial proposal is voted on without amendment.&lt;/p&gt;
&lt;p&gt;Closed Rule: A bargaining protocol in which a vote is taken directly on the first proposal, with no opportunity for amendment.&lt;/p&gt;
&lt;p&gt;Standard Assumption: The implicit assumption, used by Baron and Ferejohn (1989) and subsequent literature, that when the amender counter-proposes under the open rule, she proposes the same allocation she would choose as a proposer in a closed rule game; the paper shows this assumption is consequential for equilibrium uniqueness.&lt;/p&gt;
&lt;p&gt;Stationary Subgame Perfect Equilibrium (SSPE): An equilibrium concept in which each player&amp;rsquo;s strategy depends only on her current role (proposer, amender, or voter) and not on the history of play; the paper characterizes SSPEs of the open rule model.&lt;/p&gt;
&lt;p&gt;Pareto-Undominated Stationary Strategy Profile: A stationary strategy profile for which no other stationary strategy profile gives every negotiator weakly higher expected payoff with at least one strictly higher; used as a selection criterion to prune the large equilibrium set.&lt;/p&gt;
&lt;p&gt;Equilibrium Delay: The phenomenon in which agreement is not reached in the current period because the amender always counter-proposes and the counter-proposal also fails, pushing resolution to a future period and discounting payoffs; all non-standard-assumption Pareto-undominated SSPEs involve delay.&lt;/p&gt;
&lt;p&gt;Off-Path Behavior: The specification of what strategies players use following a deviation from equilibrium play; the paper shows that different specifications of off-path behavior by the amender support different equilibria, and that the existing literature was not systematic about this.&lt;/p&gt;</description></item><item><title>Optimal Decision Rules When Payoffs are Partially Identified</title><link>https://macropaperwarehouse.com/papers/optimal-decision-rules-when-payoffs-are-partially-identified/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-decision-rules-when-payoffs-are-partially-identified/</guid><description>&lt;p&gt;This paper derives asymptotically optimal statistical decision rules for discrete choice problems when the payoffs associated with some choices are only partially identified. The research question is: how should a decision maker who can bound but not point-identify a payoff-relevant parameter θ use data to make optimal policy choices?&lt;/p&gt;
&lt;p&gt;The framework separates two parameter types. The reduced-form parameter µ is point-identified and can be estimated from data. The structural parameter θ — such as the average treatment effect (ATE) in a target population — is set-identified, meaning only that θ ∈ Θ0(µ) can be established, where the identified set is indexed by µ. The decision maker confronts both ambiguity (arising from partial identification of θ given µ) and statistical uncertainty (µ must be estimated).&lt;/p&gt;
&lt;p&gt;The authors propose a hybrid optimality criterion that applies minimax reasoning to the partially-identified parameter θ — choosing actions that minimize maximum risk over Θ0(µ) — while applying average (integrated) risk minimization over µ, reflecting the asymmetric nature of the two identification problems. This asymmetric treatment follows the generalized Bayes-minimax principle of Hurwicz (1951).&lt;/p&gt;
&lt;p&gt;The optimal decision rule is implemented by computing, for each action, the maximum risk (or regret) over θ ∈ Θ0(µ) conditional on µ, then averaging this maximum risk across either (i) a bootstrap distribution for an efficient estimator µ̂, (ii) a posterior distribution for µ in parametric models, or (iii) a quasi-posterior based on a limited-information criterion in semiparametric models. The optimal action is whichever choice has the smallest average maximum risk.&lt;/p&gt;
&lt;p&gt;A central theoretical result (Theorems 1 and 4) establishes formal asymptotic optimality for both parametric and semiparametric settings: Bayes and quasi-Bayes decisions with any prior whose density is positive, bounded, and continuous are asymptotically equivalent and optimal. Critically, the optimality of these rules is asymptotically independent of the choice of prior for µ. The authors also establish a necessity result (Theorems 2 and 5): any decision rule not asymptotically equivalent to the Bayes or bootstrap rule is strictly sub-optimal.&lt;/p&gt;
&lt;p&gt;A key finding is that &amp;ldquo;plug-in&amp;rdquo; rules — which substitute an efficient point estimate µ̂ directly into the oracle decision rule — can be sub-optimal. This failure occurs generically under partial identification because the maximum risk function R(d,µ) is typically only directionally differentiable (not fully differentiable) in µ, owing to max and min operators in intersection bounds, linear program value functions, or other bound constructions. When full differentiability holds, Corollary 1 confirms plug-in rules are optimal; otherwise they are not. The empirical illustration demonstrates the practical consequence: for German male youths deciding whether to adopt a job-training program based on 14 RCT studies from Card, Kluve, and Weber (2017), the optimal rule recommends treatment (average quasi-posterior robust welfare contrast b̄n &amp;gt; 0) while the plug-in rule recommends against treatment (plug-in value b(µ̂) &amp;lt; 0). The lower bound maximum of µ̂k − C‖x0 − xk‖ is −0.3190 for the leading US study and −0.3298 for the second-best Brazilian study; because these two values are close relative to the average standard error of 0.034 across studies, the lower bound distribution is right-skewed (behaving like the maximum of two Gaussians), pushing b̄n positive even though b(µ̂) is negative.&lt;/p&gt;
&lt;p&gt;The paper extends optimality theory to semiparametric models via a least favorable parametric submodel, introduces the concept of σ-optimality for cases where the average maximum risk criterion is infinite (relevant when the dimension K of µ exceeds 1), and provides detailed implementation guides for treatment assignment under intersection bounds, IV-like estimands, and non-separable panel data, as well as for optimal pricing decisions where revealed-preference demand theory bounds counterfactual demand responses via linear programming.&lt;/p&gt;
&lt;p&gt;Scope conditions: optimality results apply to discrete action spaces, require efficient estimation of µ, require the identified set Θ0(µ) to be known as a set-valued mapping, and assume no &amp;ldquo;first-order ties&amp;rdquo; (the oracle decision is unique at µ0). The asymptotic framework is local, mimicking the finite-sample problem where µ is not known with certainty.&lt;/p&gt;
&lt;p&gt;Q: What is the core decision problem this paper addresses?&lt;/p&gt;
&lt;p&gt;A: A decision maker must choose from a finite set of actions D = {0, 1, &amp;hellip;, D}. Payoffs depend on a structural parameter θ that is only set-identified — the data can establish θ ∈ Θ0(µ) but not pin down θ exactly. The reduced-form parameter µ is point-identified and estimated from data. The decision maker faces both ambiguity (which θ in Θ0(µ) is true?) and sampling uncertainty (what is µ?). The paper asks how to construct decision rules that are optimal in large samples under this dual uncertainty.&lt;/p&gt;
&lt;p&gt;Q: What is the proposed optimality criterion, and why is it asymmetric across parameters?&lt;/p&gt;
&lt;p&gt;A: The criterion applies minimax reasoning to the partially-identified θ — the maximum risk over Θ0(µ) given µ is the relevant loss — and integrates this maximum risk over µ using Lebesgue measure on local perturbations h = √n(µ − µ0) of a fixed µ0. The asymmetry reflects the fact that θ is not updated by the data (the prior for θ is not identified), while µ can be learned efficiently from the data. Full minimax over both (θ, µ) is rarely tractable even for simple binary treatment problems; the asymmetric approach yields tractable optimal rules for a broad empirically relevant class of settings.&lt;/p&gt;
&lt;p&gt;Q: What are the Bayes, bootstrap, and quasi-Bayes implementations of the optimal rule?&lt;/p&gt;
&lt;p&gt;A: In all three cases, the decision maker computes R̄n(d) — the average maximum risk for action d — and chooses the action that minimizes it. The Bayes rule averages R(d, µ) over the posterior πn(µ|Xn) for µ using Bayes&amp;rsquo; theorem with a prior π on M. The bootstrap rule averages R(d, µ̂*) over bootstrap redraws µ̂* of the efficient estimator µ̂. The quasi-Bayes rule (for semiparametric models) uses a limited-information quasi-posterior N(µ̂, (nÎ)−1) combining a Gaussian quasi-likelihood with a prior for µ. All three implementations are asymptotically equivalent and optimal under the regularity conditions of Theorems 1 and 4.&lt;/p&gt;
&lt;p&gt;Q: What do Theorems 1 and 2 (and their semiparametric analogues Theorems 4 and 5) establish?&lt;/p&gt;
&lt;p&gt;A: Theorem 1 establishes sufficiency: Bayes decisions with any prior in the class Π are asymptotically equivalent to each other and are optimal; any rule asymptotically equivalent to such a Bayes decision is also optimal. Theorem 2 establishes necessity: any rule in the admissible class D that is not asymptotically equivalent to the Bayes rule has strictly higher average excess risk at any µ0 where asymptotic equivalence fails. Together, these theorems fully characterize the class of asymptotically optimal rules and show that the Bayes/bootstrap class is not merely sufficient but also necessary for optimality.&lt;/p&gt;
&lt;p&gt;Q: When are plug-in rules sub-optimal, and when are they optimal?&lt;/p&gt;
&lt;p&gt;A: Plug-in rules substitute an efficient point estimate µ̂ directly into the oracle decision δo(µ̂). If R(d, µ) is fully differentiable at µ0 for all oracle-optimal actions d, then the directional derivative is linear and plug-in and Bayes rules are asymptotically equivalent; Corollary 1 confirms plug-in rules are then optimal. However, under partial identification, max and min operators in bound constructions — intersection bounds, linear program value functions, revealed-preference bounds — generically induce only directional (non-linear) differentiability of R(d, µ). In these cases asymptotic equivalence can fail, and Theorem 2 implies plug-in rules are sub-optimal. Manski (2021, 2023) documents poor finite-sample performance of plug-in rules numerically; the authors&amp;rsquo; necessity result provides a general theoretical explanation under the asymptotic average risk criterion.&lt;/p&gt;
&lt;p&gt;Q: How does the treatment assignment empirical illustration demonstrate the difference between optimal and plug-in rules?&lt;/p&gt;
&lt;p&gt;A: Using data from Ishihara and Kitagawa (2021) with K = 14 RCT studies from Card, Kluve, and Weber (2017) and Lipschitz constant C = 0.25, the decision is whether to adopt a job-training program for German male youths or female youths in 2010 (GDP growth 3.48%, unemployment 9.45%). For male youths, the largest lower bound value µ̂k − C‖x0 − xk‖ is −0.3190 (US study) and the second-largest is −0.3298 (Brazilian study), separated by only 0.0108 against an average standard error of 0.034 across studies, so the lower bound distribution is right-skewed (maximum of two near-tied Gaussians). This right-skew pushes the quasi-posterior mean b̄n positive, yielding a treatment recommendation, while the plug-in value b(µ̂) is negative, yielding a non-treatment recommendation — a concrete reversal of the policy decision. For female youths, the minima and maxima are better separated, the distribution is near-Gaussian, and b̄n ≈ b(µ̂), so both rules agree on treatment.&lt;/p&gt;
&lt;p&gt;Q: What are intersection bounds and why do they generate directional differentiability?&lt;/p&gt;
&lt;p&gt;A: Intersection bounds arise when the ATE is bounded in K separate observational studies by lower bounds bL,k(µk) and upper bounds bU,k(µk). The combined identified set uses bL(µ) = max_{1≤k≤K} bL,k(µk) and bU(µ) = min_{1≤k≤K} bU,k(µk). Even if each component bound is smooth in µk, the max and min operators make bL and bU only directionally differentiable (not fully differentiable) in µ. The directional derivative is positively homogeneous of degree one but non-linear, which is the property that drives the wedge between Bayes and plug-in rules.&lt;/p&gt;
&lt;p&gt;Q: How does the paper extend to semiparametric models, and what technical tool does it use?&lt;/p&gt;
&lt;p&gt;A: In semiparametric models, the data distribution depends on both µ ∈ R^K and an infinite-dimensional nuisance parameter η. Integrating over local perturbations of η as well as µ raises measure-theoretic problems in infinite-dimensional spaces. The authors instead restrict attention to local perturbations of µ0 within a least favorable parametric submodel, which is the direction that makes the problem hardest. The quasi-posterior N(µ̂, (nÎ)−1) is then used as the averaging distribution, combining a Gaussian quasi-likelihood with a prior for µ. Theorem 4 establishes optimality and Theorem 5 establishes necessity under these semiparametric conditions, mirroring the parametric Theorems 1 and 2.&lt;/p&gt;
&lt;p&gt;Q: What is σ-optimality and why is it needed?&lt;/p&gt;
&lt;p&gt;A: When the dimension K of µ exceeds 1, the integrated average excess risk criterion R({δn}; µ0) — which integrates over Lebesgue measure on R^K — may be infinite for all decision sequences in D, making the criterion uninformative. σ-optimality approximates the improper Lebesgue prior on h by a sequence of proper priors indexed by σ, and requires that the decision rule minimize the resulting criterion for all σ. Theorem 3 shows that the limiting behavior of σ-optimal rules coincides with that of the Bayes rule δ*n(·; π), preserving the practical implementation.&lt;/p&gt;
&lt;p&gt;Q: How is the optimal pricing application structured and what role do revealed-preference bounds play?&lt;/p&gt;
&lt;p&gt;A: A monopolist observes repeated cross-sections of individual demands across B budget sets and must choose a price vector from D = O ∪ C, where O contains observed prices and C contains counterfactual prices. For observed prices, average demand is identified; for counterfactual prices, only bounds are available. Following Kitamura and Stoye (2019), the space of goods is partitioned into GARP-compatible regions, and sharp bounds on counterfactual demand are computed by solving linear programs over the mass allocated to each region subject to GARP consistency constraints. The reduced-form parameter µ collects empirical choice probabilities across observed budget-region cells, estimated consistently by sample frequencies. The optimal pricing decision averages the linear-program bound solutions across quasi-posterior draws of µ.&lt;/p&gt;
&lt;p&gt;Q: How does this approach relate to minimax and conditional Γ-minimax approaches?&lt;/p&gt;
&lt;p&gt;A: Full minimax over (θ, µ) requires strong distributional assumptions and tractable finite-sample distributions; the authors note that no minimax treatment rule exists even for binary treatment with binary outcomes and estimated bounds. Conditional Γ-minimax (DasGupta and Studden, 1989; Giacomini, Kitagawa, and Read, 2021) fixes a prior for µ and takes minimax over the set of priors for θ conditional on µ; this is closely related to the authors&amp;rsquo; approach but can be conservative when the marginal prior for µ varies. The authors&amp;rsquo; framework fixes the marginal prior for µ and takes minimax over θ ∈ Θ0(µ) conditional on µ, which is shown to arise as the equilibrium of a two-player zero-sum game where adversarial nature chooses a prior for θ ∈ Θ0(µ) conditional on µ and the available data for µ.&lt;/p&gt;
&lt;p&gt;Q: What is the technical contribution regarding directionally differentiable functions?&lt;/p&gt;
&lt;p&gt;A: Hirano and Porter (2009) derived asymptotic optimality for treatment rules under fully differentiable welfare contrasts. This paper extends that theory to settings with directional (but not full) differentiability — a generic feature whenever bounds involve max/min operators or linear program values. The key technical building block is the asymptotic distribution of the quasi-posterior mean of directionally differentiable functions (Propositions 2 and 3 in Appendix C). While Kitagawa, Montiel Olea, Payne, and Velez (2020) characterized the asymptotic behavior of the posterior distribution of such functions, this paper instead characterizes the frequentist distribution of the posterior mean — a distinct and novel contribution to the literature on asymptotics for non-smooth functions (Dümbgen, 1993; Fang and Santos, 2019).&lt;/p&gt;
&lt;p&gt;Q: What are the key scope conditions and limitations of the optimality results?&lt;/p&gt;
&lt;p&gt;A: The action space D must be finite and discrete (continuous pricing must be approximated by a grid of whole-currency units, as noted in the introduction). The identified set mapping Θ0(·) must be known. Efficient estimation of µ is required, along with a consistent estimator of its asymptotic variance for quasi-Bayes implementation. The optimality criterion assumes &amp;ldquo;no first-order ties&amp;rdquo; — the oracle decision must be unique at µ0. The framework is asymptotic (local perturbations around a fixed µ0), and the theory is designed for settings where deriving exact finite-sample optimal rules is intractable. The results do not cover the case where θ affects the data distribution (only payoffs are partially identified, not identification of µ itself).&lt;/p&gt;
&lt;p&gt;Partially-identified parameter (θ): A structural parameter — such as the ATE in a target population — about which the data can establish only set membership θ ∈ Θ0(µ), not a point value. The identified set Θ0(µ) is indexed by the point-identified reduced-form parameter µ.&lt;/p&gt;
&lt;p&gt;Oracle decision (δo(µ)): The infeasible first-best decision that minimizes maximum risk over the identified set Θ0(µ) for a known value of µ. It serves as the benchmark against which practical rules are evaluated; any data-dependent rule can only do weakly worse.&lt;/p&gt;
&lt;p&gt;Maximum risk (R(d, µ)): The supremum of risk r(d, θ, µ) = Eθ[l(d, Y, θ, µ)] over all θ ∈ Θ0(µ) conditional on µ. Under the regret criterion for binary treatment, R(0, µ) = (bU(µ))+ and R(1, µ) = −(bL(µ))−.&lt;/p&gt;
&lt;p&gt;Robust welfare contrast (b(µ)): In the treatment assignment application, b(µ) = (bU(µ))+ + (bL(µ))−, whose sign determines the oracle decision: treat if b(µ) ≥ 0. The optimal rule replaces b(µ) with its quasi-posterior mean b̄n.&lt;/p&gt;
&lt;p&gt;Directional differentiability: A function f : M → R^k is directionally differentiable at µ0 if limits of (f(µ0 + tn hn) − f(µ0))/tn exist for all sequences tn ↓ 0 and hn → h, yielding a directional derivative ḟµ0[·] that is positively homogeneous but not necessarily linear. Max/min operators and linear program value functions are generically only directionally differentiable, not fully differentiable. This property is what causes plug-in rules to fail.&lt;/p&gt;
&lt;p&gt;Quasi-posterior: In semiparametric models, a posterior-like distribution for µ formed by combining a limited-information Gaussian quasi-likelihood N(µ̂, (nÎ)−1) with a prior π, yielding πn(µ|Xn) ∝ exp(−½(µ − µ̂)T(nÎ)(µ − µ̂))π(µ). Used in place of a full Bayesian posterior when the exact likelihood of the data-generating process is unavailable.&lt;/p&gt;
&lt;p&gt;σ-optimality: An optimality concept that replaces the improper Lebesgue prior on local perturbations h ∈ R^K with a sequence of proper priors indexed by σ, used when the average excess risk criterion is infinite for K &amp;gt; 1. Theorem 3 establishes that the σ-optimal decision rule converges to the Bayes rule as σ → ∞.&lt;/p&gt;
&lt;p&gt;Plug-in rule (δplug_n): A decision rule formed by substituting an efficient point estimate µ̂ directly into the oracle decision: δplug_n = δo(µ̂). Optimal when R(d, µ) is fully differentiable (Corollary 1), but generically sub-optimal under partial identification because directional differentiability of R(d, µ) breaks the asymptotic equivalence between the plug-in and Bayes rules.&lt;/p&gt;</description></item><item><title>Optimal Fiscal Policy with Heterogeneous Agents and Capital: Overturning Chamley-Judd</title><link>https://macropaperwarehouse.com/papers/optimal-fiscal-policy-with-heterogeneous-agents-and-capital-overturning-chamley-judd/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-fiscal-policy-with-heterogeneous-agents-and-capital-overturning-chamley-judd/</guid><description>&lt;p&gt;The Chamley-Judd result (1986) states that the optimal long-run capital income tax rate is zero in representative-agent models. This paper shows that introducing heterogeneous agents — specifically, agents with uninsurable idiosyncratic income risk who use precautionary saving — overturns this result. When agents differ in their wealth and income realizations, a capital income tax serves as a form of insurance that representative-agent models cannot provide. The paper derives a tractable analytical characterization of the optimal capital tax in an Aiyagari-type heterogeneous-agent model and finds that the optimal rate lies in the range of 10–30 percent at the steady state — strictly positive, in direct contradiction to Chamley-Judd. The magnitude of the optimal tax depends on the degree of idiosyncratic risk and the availability of alternative redistribution instruments: when other redistributive tools are limited, the optimal capital tax is higher.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-why-does-heterogeneity-overturn-chamley-judd"&gt;Q1. Why does heterogeneity overturn Chamley-Judd?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In representative-agent models, all agents hold the same capital stock, so a capital tax distorts intertemporal decisions identically and the Ramsey planner finds it optimal to zero out the distortion in the long run. With heterogeneous agents and uninsurable risk, the capital tax has an additional insurance role: taxing capital income and redistributing it reduces consumption variance across agents, generating welfare gains that outweigh the intertemporal distortion costs.&lt;/strong&gt; The insurance benefit makes the optimal tax positive at the steady state because the tax-and-redistribute mechanism provides risk-sharing that incomplete markets cannot.&lt;/p&gt;
&lt;h3 id="q2-how-tractable-is-the-analytical-result"&gt;Q2. How tractable is the analytical result?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper derives closed-form expressions for the optimal tax rate as a function of the degree of idiosyncratic risk, the wealth distribution&amp;rsquo;s spread, and the available redistribution instruments, enabling comparative statics that go beyond what purely computational approaches provide.&lt;/strong&gt; This tractability distinguishes the result from earlier numerical work that demonstrated positive optimal capital taxes without clear analytical structure.&lt;/p&gt;
&lt;h3 id="q3-what-is-the-quantitative-magnitude"&gt;Q3. What is the quantitative magnitude?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The optimal steady-state capital income tax is in the range of 10–30 percent, substantially above zero but well below confiscatory rates, in the paper&amp;rsquo;s benchmark calibration matched to U.S. income and wealth inequality.&lt;/strong&gt; The range reflects the sensitivity to available redistribution instruments: the lower bound applies when the government has a rich set of redistribution tools, the upper bound when capital taxation is the only available instrument.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Chamley-Judd result&lt;/strong&gt; : the proposition that the optimal long-run capital income tax is zero in representative-agent Ramsey taxation models; overturned in this paper once heterogeneous agents with uninsurable risk are introduced.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;insurance role of capital taxation&lt;/strong&gt; : the mechanism by which a capital income tax reduces consumption inequality in a heterogeneous-agent economy, generating welfare gains that outweigh the intertemporal distortion costs and making the optimal capital tax positive.&lt;/p&gt;</description></item><item><title>Optimal Public Transportation Networks: Evidence from the World's Largest Bus Rapid Transit System in Jakarta</title><link>https://macropaperwarehouse.com/papers/optimal-public-transportation-networks-evidence-from-the-worlds-largest-bus-rapid-transit-system-in-jakarta/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-public-transportation-networks-evidence-from-the-worlds-largest-bus-rapid-transit-system-in-jakarta/</guid><description>&lt;p&gt;This paper studies how commuter preferences over wait times, travel times, and transfers should shape the design of urban bus networks, using the world&amp;rsquo;s largest Bus Rapid Transit (BRT) system — TransJakarta in Jakarta, Indonesia — as the empirical laboratory. The setting provides unusually rich identification: between January 2016 and February 2020, TransJakarta launched 93 new BRT and non-BRT feeder routes in a staggered, city-wide expansion, during which the operating bus fleet more than doubled from roughly 700 to over 1,600 vehicles. The authors combine over 500 million smart-card tap records, GPS tracking of every bus at 5–10 second intervals, and anonymized smartphone location data covering 35 million weekday trips from 2.3 million devices.&lt;/p&gt;
&lt;p&gt;The paper proceeds in three steps. First, the authors classify new route launches into three event types and estimate their causal impact on ridership via difference-in-differences. Event 1: a new direct connection between an origin-destination pair already served by transfer only, with no travel-time improvement — raises BRT ridership by 0.16 log points. Event 2: a new direct connection that also reduces travel time (by 0.29 log points on average) — raises ridership by 0.27 log points. Event 3: additional buses on an already-directly-connected pair, which increases the bus arrival rate by 0.32 log points and reduces wait times — raises ridership by 0.09 log points, implying a ridership elasticity with respect to wait times of approximately −0.29 for BRT. For non-BRT routes the implied wait-time elasticity is −1.05, raising the possibility of multiple equilibria in service levels. Crucially, none of the three event types produce detectable increases in aggregate trip volumes measured by smartphone data, implying the ridership gains reflect modal substitution toward the bus rather than trip generation.&lt;/p&gt;
&lt;p&gt;Second, the authors estimate a structural demand model. At its core is a route-choice model in which bus arrivals follow independent Poisson processes, so wait times are exponentially distributed and idiosyncratic. This formulation avoids the red-bus/blue-bus aggregation problem endemic to logit models. Commuters are also allowed to be partially inattentive to routes whose travel time exceeds the fastest available option by more than an estimated threshold. Structural parameters are recovered by classical minimum distance, matching seven reduced-form moments. Key findings: wait time is valued 2.4 times more than time on the bus for BRT routes, and 4.2 times more for non-BRT routes. There is no additional transfer penalty beyond the wait time and travel time costs of the second leg. Commuters pay significantly less attention to options with travel time more than roughly 34–44 percent above the fastest option in their choice set.&lt;/p&gt;
&lt;p&gt;Third, the authors use the estimated preference parameters to characterize optimal bus networks. Because the optimization problem is high-dimensional (418 grid cells, 1,536 possible edges, yielding on the order of 10^500 configurations) and exhibits neither global convexity nor simple complementarity, they reformulate the social planner&amp;rsquo;s problem as a discrete choice over networks with additive logit shocks — effectively sampling from a multinomial logit distribution via simulated annealing. The result: optimal networks cover approximately 66 percent of grid cells versus 42 percent under the actual TransJakarta network, and would give 91 percent of Jakarta residents bus access versus 73 percent currently. Bus frequency in the city center is somewhat lower in the optimal network. Despite commuters&amp;rsquo; high sensitivity to wait times, the current network concentrates too many buses in the city center where wait times are already short, rather than extending reach to underserved areas. Comparative statics show that doubling the wait-time cost parameter produces much more concentrated optimal networks (23 percent of origin-destination pairs connected, 41 percent fewer than baseline), while increasing the transfer penalty by the equivalent of 15 minutes of wait time raises the direct-connection share of served pairs from 12 to 16 percent.&lt;/p&gt;
&lt;p&gt;Q: What are the three event types and why are they analytically distinct?&lt;/p&gt;
&lt;p&gt;A: Event 1 is the launch of the first direct route between an origin-destination pair already connected by transfer, where the direct route is not faster than the existing transfer option; it isolates the effect of directness absent a travel-time change. Event 2 is the same but with a faster direct route (average reduction of 0.29 log points in travel time), combining directness and speed improvements. Event 3 is the launch of a new route that overlaps an existing direct route, increasing bus frequency and cutting wait times (arrival rate up 0.32 log points) without substantially changing travel time or directness. The three events together provide variation across the key dimensions — directness, speed, and frequency — needed to separately identify commuter preference parameters.&lt;/p&gt;
&lt;p&gt;Q: What are the main ridership effects and how large are they in levels?&lt;/p&gt;
&lt;p&gt;A: For BRT routes, Event 1 raises ridership by 0.16 log points (approximately 19 additional riders per week for a treated origin-destination pair with a baseline of 111 weekly riders), Event 2 by 0.27 log points (approximately 24 additional riders per week), and Event 3 by 0.09 log points (approximately 20 additional riders per week). For non-BRT routes, proportional effects are larger but level effects are similar: Event 1 yields roughly 34 additional weekly riders, Event 2 roughly 21, and Event 3 roughly 15. Event-study graphs show clear, discrete jumps in ridership at route launch with no pre-trends, and some gradual adjustment in the months following.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about aggregate trip generation versus modal substitution?&lt;/p&gt;
&lt;p&gt;A: Using smartphone location data to measure all trips regardless of mode, the authors find no statistically significant increase in aggregate trip volumes for any of the three event types. For BRT Event 1, the estimated aggregate-trip coefficient is −0.008 with a standard error of 0.051, allowing rejection at the 95 percent level of any positive impact above roughly 0.091 log points — small relative to the precise 0.11 log-point bus ridership effect in the same sample. The authors interpret this as evidence that the ridership gains over the 10-month post-event window reflect substitution from private modes (motorcycles, cars, taxis) toward TransJakarta rather than trip generation, and they use this null result to justify holding destination choices fixed in the structural model.&lt;/p&gt;
&lt;p&gt;Q: How does the model avoid the red-bus/blue-bus aggregation problem?&lt;/p&gt;
&lt;p&gt;A: The paper&amp;rsquo;s route-choice model assumes bus arrivals follow independent Poisson processes, so wait times are exponentially distributed. A key proposition (Proposition 1) proves that splitting one route into two identical routes with half the buses each produces exactly the same choice probabilities and expected utility as the original single route — because the sum of two independent Poisson processes is itself Poisson with the summed rate. Standard logit models fail this invariance because splitting a route creates two options with independent error draws, artificially inflating expected utility. The invariance property is essential for the optimal network design exercise, where the planner freely reallocates buses across routes.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated preference parameters and what do they imply about commuter behavior?&lt;/p&gt;
&lt;p&gt;A: The paper estimates that wait time is valued 2.4 times more than time on the bus for BRT routes and 4.2 times more for non-BRT routes. There is no additional transfer disutility beyond the wait time and travel time costs implied by the extra leg. Commuters become substantially inattentive to routes with travel time more than approximately 34 percent above the fastest available option (BRT threshold) or 44 percent (non-BRT). The high relative cost of waiting versus riding reflects both the discomfort of waiting at exposed non-BRT stops and the fact that TransJakarta runs without a published schedule, so commuters cannot minimize wait time by timing arrivals.&lt;/p&gt;
&lt;p&gt;Q: What explains the non-BRT wait-time elasticity exceeding −1?&lt;/p&gt;
&lt;p&gt;A: For non-BRT routes, Event 3 raises ridership by 0.450 log points while raising the bus arrival rate by 0.425 log points, yielding an implied elasticity of ridership with respect to wait times of −1.05. Because the baseline arrival rate for non-BRT treated pairs is 2–4 times lower than for BRT pairs, the absolute reduction in wait time per additional bus is much larger. An elasticity exceeding −1 in absolute value implies that adding buses on some non-BRT routes could increase ridership enough to maintain or even raise average ridership per bus — the extreme form of the Mohring effect — suggesting the possibility of a high-ridership/low-wait-time equilibrium distinct from the current low-ridership/high-wait-time one.&lt;/p&gt;
&lt;p&gt;Q: How is the optimal network characterized and what algorithm is used?&lt;/p&gt;
&lt;p&gt;A: The social planner chooses a network to maximize utilitarian welfare (average expected utility across all commuters) from the estimated demand model, plus a network-level logit shock capturing cost and other factors outside the model. This transforms the combinatorially explosive optimization into sampling from a multinomial logit distribution over networks, which the authors approximate using simulated annealing. They run the algorithm multiple times to obtain a sample of networks drawn asymptotically from the planner&amp;rsquo;s distribution, then estimate optimal network characteristics and comparative statics from sample analogs. The theoretical framework is general and, the authors note, applicable to other high-dimensional spatial planning problems where welfare differences can be computed for pairs of counterfactuals.&lt;/p&gt;
&lt;p&gt;Q: How does the optimal network differ from the current TransJakarta network?&lt;/p&gt;
&lt;p&gt;A: The typical optimal network covers approximately 66 percent of 2km grid cells versus 42 percent for the actual network, and 91 percent of Jakarta residents would have bus access versus 73 percent currently. The optimal network reduces bus frequency in the city center relative to the current network, accepting longer wait times there in order to extend reach to peripheral areas. The paper finds no tension between distributional and efficiency concerns in this setting — expanding coverage improves both aggregate welfare and access for underserved areas.&lt;/p&gt;
&lt;p&gt;Q: What do the comparative statics reveal about the sensitivity of optimal network design to preference parameters?&lt;/p&gt;
&lt;p&gt;A: Doubling the wait-time cost parameter leads to substantially more concentrated optimal networks: only 23 percent of origin-destination pairs are connected, 41 percent fewer than in the baseline optimal network. This is because higher wait-time costs make it more valuable to concentrate buses on fewer routes to achieve short headways. Increasing the transfer penalty by the equivalent of 15 minutes of wait time raises the share of connected location pairs with a direct (non-transfer) connection from 12 to 16 percent. These comparative statics link micro-level preference parameters to macro-level network topology, clarifying which parameters most influence design choices.&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate the destination imputation from tap-in-only smart card data?&lt;/p&gt;
&lt;p&gt;A: For the subset of BRT stations where tap-out is enforced (36 percent of stations), the authors estimate bivariate regressions of imputed daily ridership shares against actual observed ridership shares, obtaining R-squared of 0.85. They also show robustness by varying the grid cell size from 500 meters to 2 kilometers, finding no systematic decline in treatment effect magnitudes, which rules out large displacement effects within the network as an explanation for the results.&lt;/p&gt;
&lt;p&gt;Q: Does the response to network improvements vary by local poverty rates?&lt;/p&gt;
&lt;p&gt;A: The authors interact all six event types with an indicator for above-median poverty rate at the origin grid cell (from SMERU 2014 data), controlling for population. They find no clear pattern of heterogeneity by income level — richer and poorer areas respond similarly to service improvements. The paper notes this absence of heterogeneity as relevant context for interpreting optimal network design: the case for extending reach is not offset by a differential preference for frequency among poorer commuters.&lt;/p&gt;
&lt;p&gt;Mohring Effect: The externality arising from ridership responsiveness to wait times — more riders justify more buses, which reduce wait times for all riders, further increasing ridership. The paper estimates a BRT wait-time elasticity of −0.29, confirming the effect operates in Jakarta; for non-BRT the elasticity of −1.05 suggests the possibility of multiple equilibria in service levels.&lt;/p&gt;
&lt;p&gt;Negative Exponential Distribution Model (Daganzo 1979): The route-choice model used in the paper, in which bus arrivals on each route follow independent Poisson processes and wait times are exponentially distributed. The model is invariant to aggregation of identical routes (avoids the red-bus/blue-bus problem) and yields tractable closed-form expressions for choice probabilities and expected utility.&lt;/p&gt;
&lt;p&gt;Partial Inattention: The model feature whereby commuters assign near-zero effective arrival rates to bus options whose travel time exceeds the fastest available option by more than an estimated threshold (34–44 percent depending on route type). Captures the empirical finding that commuters in a large, complex network do not appear to consider all available options.&lt;/p&gt;
&lt;p&gt;Event Types (1, 2, 3): The paper&amp;rsquo;s taxonomy of service improvements induced by new route launches. Event 1 isolates the value of directness (new direct route, no speed gain). Event 2 combines directness and speed (new direct route that is also faster). Event 3 isolates the value of frequency (additional buses on an already-direct route, reducing wait time without changing travel time).&lt;/p&gt;
&lt;p&gt;Optimal Network Characterization via Social Planner&amp;rsquo;s Logit: The paper&amp;rsquo;s approach to the combinatorially intractable network optimization problem. The planner is modeled as making a logit discrete choice over all possible networks, with welfare from the demand model plus a network-level idiosyncratic shock. Sampling via simulated annealing yields estimates of optimal network characteristics and comparative statics without requiring identification of a single globally optimal network.&lt;/p&gt;
&lt;p&gt;Network Concentration vs. Extensiveness Tradeoff: The core design tension the paper formalizes — for a fixed bus fleet, concentrating buses on fewer routes reduces wait times on served routes but leaves more areas without coverage, while spreading buses across more routes extends reach at the cost of longer headways. The estimated preference parameters (high wait-time sensitivity) make this tradeoff non-trivial; nonetheless, the paper finds the current network is too concentrated relative to the optimum.&lt;/p&gt;</description></item><item><title>Optimal Resilience in Multitier Supply Chains</title><link>https://macropaperwarehouse.com/papers/optimal-resilience-in-multitier-supply-chains/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-resilience-in-multitier-supply-chains/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Grossman, Helpman, and Sabal ask what market failures arise in vertical supply chains with multiple production tiers, limited (non-anonymous) supply networks, arms-length transactions, and recurrent risks of disruption at every node. They then ask what government policies would be required to implement the socially efficient (first-best) allocation as a decentralized equilibrium, and — in a second-best environment where subsidies to firm-to-firm transactions are politically infeasible — how optimal policies to promote resilience and network formation differ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a general-equilibrium model of a closed economy with an arbitrary number S+1 of vertical production tiers (tier 0 through tier S). A finite measure of &amp;ldquo;lead&amp;rdquo; firms in tier S produce differentiated consumer goods under monopolistic competition using labor and a CES bundle of intermediate inputs from tier S-1 suppliers. Firms in each intermediate tier combine labor and inputs from the tier above using a Cobb-Douglas production function. Tier 0 firms produce from labor alone.&lt;/p&gt;
&lt;p&gt;Every firm faces an independent, non-zero probability of a catastrophic disruption (complete inability to produce). Firms may invest labor up front to moderate this risk — endogenous &amp;ldquo;resilience&amp;rdquo; — or may invest to forge relationships with a larger fraction of potential suppliers in the next upstream tier — endogenous &amp;ldquo;network thickness.&amp;rdquo; Each formed relationship costs k units of labor.&lt;/p&gt;
&lt;p&gt;After disruption shocks are realized, surviving firms negotiate quantities and payments bilaterally. Bargaining is sequential (beginning with lead firms negotiating with tier S-1, then tier S-1 with tier S-2, and so on to tier 0), and within each round is governed by Nash-in-Nash equilibrium (Horn and Wolinsky, 1988): each firm takes as given the outcomes of its negotiations with all other partners. The Nash surplus is split with exogenous bargaining weight β_s for the downstream buyer in the s-to-s−1 negotiation.&lt;/p&gt;
&lt;p&gt;The paper solves the planner&amp;rsquo;s direct-control problem and then characterizes the three sets of policy instruments needed to decentralize the first best: subsidies to input transactions between adjacent tiers, subsidies to investments in resilience (agility), and subsidies to network formation (redundancy). It then solves the second-best problem in which transaction subsidies are constrained to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Transaction subsidies.&lt;/em&gt; In the competitive bargaining equilibrium, each pair of firms undervalues input transactions because the upstream firm anticipates paying a marked-up price when it bargains with its own suppliers. This cascading distortion means the private marginal cost of producing a tier-s good exceeds the social marginal cost. The optimal first-best transaction subsidy on sales by tier s firms (τ*&lt;em&gt;s) equals [γ_s + (1−γ_s)μ&lt;/em&gt;{s−1}]^{−1}, where γ_s is the labor share in tier s production and μ_{s−1} is the endogenous markup factor from bargaining at the s-to-s−1 interface. This subsidy depends only on production function parameters and bargaining weights at the immediately adjacent tier. No subsidy is needed at tier 0 (the most upstream tier), and no subsidy is applied to final-good sales. Under Assumption 1 — inputs become weakly less substitutable as goods proceed downstream — the optimal purchase subsidies rise monotonically as one moves downstream along the supply chain.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Resilience subsidies (first best).&lt;/em&gt; Two offsetting forces govern the optimal subsidy to resilience investments θ*&lt;em&gt;s at intermediate tiers: (i) firms capture only the fraction (1−β&lt;/em&gt;{s+1}) of the joint surplus that their resilience creates for downstream customers, creating underinvestment; (ii) optimal transaction subsidies inflate private profitability, creating a countervailing overinvestment incentive. The net optimal first-best subsidy for intermediate-tier firms is θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s. This formula depends only on technological and bargaining parameters of tier s and the tier immediately adjacent; it does not depend on conditions elsewhere in the chain. When production parameters and bargaining weights are uniform across tiers, the first-best resilience subsidy is the same at every interior tier. If goods become strictly less substitutable downstream, the first-best subsidy for resilience declines monotonically as one moves downstream, and may turn into an optimal tax for middle tiers where the transaction subsidy is large enough to over-incentivize resilience investment. The first-best resilience subsidy always applies at both extreme ends of the chain.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Network formation subsidies (first best).&lt;/em&gt; Despite firms&amp;rsquo; private incentive to manipulate their number of upstream suppliers to improve bargaining position, the net strategic effect of network formation in general equilibrium exactly cancels the off-equilibrium spillovers to non-partners. As a result, the optimal first-best policy toward network formation at every tier is identical to the optimal policy toward resilience investment.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Second-best policies.&lt;/em&gt; When transaction subsidies are unavailable, uncorrected markups downstream from tier s depress demand for tier-s output, reducing profitability and incentives to invest in resilience below the first-best level. Second-best optimal subsidies for resilience and network formation therefore reflect production function parameters and bargaining weights throughout the entire downstream supply chain, not just at the immediately adjacent tier. Specifically, when buyer bargaining weights are non-increasing along the chain (β_{s+1} ≤ β_s for all s), the second-best subsidy to resilience falls monotonically as one moves downstream. This is the opposite pattern from what might be inferred from the first-best analysis when transaction subsidies are available: with non-increasing bargaining weights, second-best subsidies are larger for upstream producers than for downstream producers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are derived for a closed economy. Welfare is measured by the CES utility of the representative consumer over differentiated final goods. The sequential bargaining structure assumes contracts are written after disruption shocks are realized. Assumption 1 (σ_1 ≥ σ_2 ≥ … ≥ σ_S &amp;gt; ε, where σ_s is the elasticity of substitution between inputs at tier s and ε is the demand elasticity for final goods) is maintained for sharper monotonicity results on the structure of optimal subsidies across tiers.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the precise structure of the supply chain in the model, and why does the bargaining take place sequentially rather than simultaneously across all tiers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The economy has S+1 tiers. Tier 0 firms use only labor; tier s firms (s = 1,…,S−1) use labor and a CES bundle of tier s−1 inputs with elasticity of substitution σ_s &amp;gt; 1; tier S firms produce final differentiated goods using labor and tier S−1 inputs under Cobb-Douglas technology. Sequential bargaining is imposed because the vast number of simultaneous negotiations across all tiers makes a grand coalition impractical. The timing is that lead firms (tier S) first negotiate input quantities and payments with their tier S−1 suppliers; those suppliers, now contractually obligated to their downstream customers, then negotiate with tier S−2, and so on up the chain until tier 1 firms contract with tier 0 suppliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How is the markup factor defined, and what parameters determine it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The markup factor μ_s is the ratio of the payment per unit made by tier s+1 firms to the production cost of tier s firms. It equals μ_s = (1−β_{s+1}) · [σ_{s+1}/(σ_{s+1}−1)] + β_{s+1}, where β_{s+1} is the exogenous bargaining weight of the downstream (tier s+1) buyer. When the downstream firm has all bargaining power (β_{s+1} = 1), the markup equals unity (competitive outcome). When the upstream firm has all bargaining power (β_{s+1} = 0), the markup equals the standard monopoly markup σ_{s+1}/(σ_{s+1}−1). For intermediate bargaining weights, the markup is a weighted average. The markup enters the optimal transaction subsidy formula by inflating the private marginal cost of producing tier-s inputs above the social marginal cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Why are no subsidies needed for the most upstream (tier 0) transactions or for final-good sales?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For tier 0 transactions: when tier 0 and tier 1 firms bargain, the negotiations occur last sequentially and so do not affect any prior agreements. There are no downstream cascading markup effects — tier 0 firms produce from labor alone, so their private marginal cost equals their social marginal cost. The joint surplus maximization by the pair thus aligns with the planner&amp;rsquo;s objective, yielding τ*_0 = 1 (no intervention needed). For final-good sales: final producers do mark up above marginal cost under monopolistic competition, but all varieties are symmetric, so the markup affects all goods equally and does not distort relative consumption choices. Hence τ*_S = 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the two offsetting forces that determine the optimal first-best subsidy to resilience investments at an intermediate tier?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: First, a firm in tier s captures only the fraction (1−β_{s+1}) of the joint surplus that its survival creates for its downstream customers (the rest is appropriated through bargaining by those customers), leading to underinvestment relative to the social optimum. Second, the optimal transaction subsidy τ*_s &amp;lt; 1 raises the private profitability of firms in tier s above its social value, because public finances bear part of the cost of their input purchases. This inflated private profitability encourages resilience investment beyond what the planner desires. The net optimal policy is θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s, which may be a subsidy (θ*_s &amp;lt; 1) or a tax (θ*_s &amp;gt; 1) depending on which force dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why does the first-best subsidy for resilience at an intermediate tier depend only on local parameters (at tier s and its immediate neighbors), even though resilience investments generate spillovers to firms throughout the network?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When optimal transaction subsidies are in place at all tiers, a firm&amp;rsquo;s value becomes independent of the joint surplus in sales that occur between firms in tiers other than its own. That is, the positive spillovers to all firms farther upstream and downstream in a firm&amp;rsquo;s own network are exactly offset by the negative spillovers to firms in rival networks (including rival firms in the same tier). What remains after this general-equilibrium cancellation is only the benefit to the firm&amp;rsquo;s immediate downstream customers and the wedge created by the transaction subsidy. This result implies that the formula θ*&lt;em&gt;s = (1−β&lt;/em&gt;{s+1}) / τ*_s does not involve conditions at tiers other than s and s−1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why does the optimal policy for network formation (supplier link investment) equal the optimal policy for resilience investment, despite the fact that network formation also strategically improves a firm&amp;rsquo;s bargaining position?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Firms in intermediate tiers do have a private incentive to form additional supplier links specifically to improve their bargaining position vis-à-vis their upstream suppliers (by improving their outside options) and vis-à-vis their downstream customers (by the same mechanism). However, the authors show by comparing the firm&amp;rsquo;s first-order condition for link formation with the planner&amp;rsquo;s first-order condition that this strategic motivation exactly balances the offsetting general-equilibrium effects from rival firms doing the same. After this cancellation, the residual wedge between private and social incentives for network formation is identical to that for resilience investment. Hence #&lt;em&gt;_s = θ&lt;/em&gt;_s for all tiers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How do second-best policies differ from first-best policies in terms of both the magnitude of subsidies and the information required to set them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the first best, the subsidy for resilience at tier s depends only on the bargaining weight β_{s+1} and the markup factor μ_{s−1} — parameters relevant to tier s and its immediate neighbors. In the second best, when transaction subsidies are unavailable, the optimal resilience subsidy at tier s is θ†&lt;em&gt;s = J^{−1} · [1 − (cumulative distortion of all downstream tiers)] · (1−β&lt;/em&gt;{s+1}), where J captures aggregate labor-market effects of all markups throughout the chain. This formula requires knowledge of production function parameters (labor shares γ_j, markups μ_j, elasticities σ_j) for every tier j downstream from s. The second-best subsidy may be larger or smaller than the first-best subsidy; it is more likely to exceed the first-best subsidy for upstream tiers, where the cumulative downstream distortions (uncorrected markups contracting demand) produce a larger shortfall in private profitability and hence a larger underinvestment in resilience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Under what condition do second-best subsidies fall monotonically as one moves downstream, and how does this compare to the first-best pattern?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The ratio of second-best subsidies at adjacent tiers (θ†_{s−1} / θ†&lt;em&gt;s) equals [(1−β_s) / (1−β&lt;/em&gt;{s+1})] · [τ*&lt;em&gt;s]^{−1}, where τ*&lt;em&gt;s is the first-best transaction subsidy. If buyer bargaining weights are non-increasing along the chain — β&lt;/em&gt;{s+1} ≤ β_s for all s — then (1−β_s) ≤ (1−β&lt;/em&gt;{s+1}) and, combined with τ*&lt;em&gt;s ≤ 1, the second-best subsidy is larger upstream than downstream (θ†&lt;/em&gt;{s−1} ≥ θ†_s). This contrasts with the first-best policy: when parameters are uniform across tiers, first-best resilience subsidies are the same at every interior tier, while second-best subsidies are strictly larger upstream than downstream.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What role does Assumption 1 (elasticities of substitution non-increasing as goods move downstream) play in the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Assumption 1 (σ_1 ≥ σ_2 ≥ … ≥ σ_S &amp;gt; ε) ensures that the operating profit function ~v_s(η) is concave in a firm&amp;rsquo;s network size η, which in turn ensures interior solutions to the network formation problem. It also delivers sharper monotonicity results: under this assumption, if other production parameters and bargaining weights are similar across tiers, the optimal purchase subsidies rise monotonically downstream, and the optimal first-best resilience subsidies decline monotonically downstream (potentially turning into taxes at some interior tiers). The assumption reflects the realistic view that inputs become more differentiated and specialized as they approach the final consumer good.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the limitations the authors identify regarding their model, and what extensions do they suggest?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Three main limitations are identified. First, the model assumes bargaining occurs after disruption shocks are realized, ruling out contingent contracts. Pre-disruption bargaining with contingent payments could mitigate double-marginalization inefficiencies and help internalize resilience externalities, though complex network-wide contingent contracts would likely be needed for full efficiency even in the second-best environment. Second, the model assumes symmetric firms within each tier, so downstream firms cannot sort on upstream firms&amp;rsquo; observable resilience levels; if observable differences existed, downstream firms could seek out more reliable partners, partially internalizing the resilience externality. Third, the model covers only a closed economy with idiosyncratic (uncorrelated) shocks. Extensions to global supply chains, correlated (geographic) shocks, cross-country differences in wages and technologies, and optimal cooperative versus unilateral policy are identified as important directions for future research.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Resilience (agility):&lt;/strong&gt; In the paper&amp;rsquo;s usage, a firm&amp;rsquo;s endogenous investment in reducing the probability of a catastrophic disruption to its own operations. A firm in tier s hires r_s units of labor up front, which raises its survival probability φ_s(r_s), with φ&amp;rsquo;_s &amp;gt; 0 and φ&amp;rsquo;&amp;rsquo;_s &amp;lt; 0. Resilience is a relationship-specific investment in the sense that its payoff is realized only conditional on the firm surviving and then trading with its downstream customers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Network thickness (redundancy):&lt;/strong&gt; The fraction η_s of firms in the next upstream tier with whom a firm in tier s forms a supply relationship prior to the disruption shock. Forming k units of labor per link creates a thicker network that hedges against supplier disruption, increases input variety (and thus CES productivity), and improves bargaining positions vis-à-vis both upstream suppliers and downstream customers. Distinct from resilience: resilience reduces the firm&amp;rsquo;s own probability of disruption; network thickness provides substitutability across suppliers should some fail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Markup factor (μ_s):&lt;/strong&gt; The ratio of the per-unit payment made by tier s+1 firms to the production cost of tier s firms, as determined by Nash bargaining. Specifically, μ_s = (1−β_{s+1}) · [σ_{s+1}/(σ_{s+1}−1)] + β_{s+1}. The markup distorts private marginal costs above social marginal costs, causing underinvestment in transactions between firms and, transitively, in resilience and network formation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Nash-in-Nash equilibrium:&lt;/strong&gt; The bargaining solution concept used in the paper (following Horn and Wolinsky, 1988). Each pair of firms negotiates as if all other bilateral negotiations involving either party proceed at their equilibrium outcomes, both on and off the equilibrium path. This is the appropriate equilibrium concept when grand coalitions across all firms and all tiers are impractical.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sequential bargaining:&lt;/strong&gt; The specific timing structure in which negotiations proceed from the most downstream tier (lead firms bargaining with tier S−1 suppliers) sequentially upstream until tier 1 firms bargain with tier 0 suppliers. Each tier of firms, at the time they bargain with their own suppliers, are already contractually obligated to deliver specified quantities to their downstream customers. This obligation anchors the downstream firm&amp;rsquo;s outside option in any given bilateral negotiation.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;First-best transaction subsidy (τ&lt;/em&gt;_s):&lt;/em&gt;* The fraction of the cost of a tier-s input that, under the optimal policy, the downstream (tier s+1) buyer must pay. Equals [γ_s + (1−γ_s) · μ_{s−1}]^{−1} &amp;lt; 1 for all intermediate tiers, i.e., it is always a subsidy. Designed to align private marginal cost in the bilateral negotiation with the social marginal cost by offsetting the distortion introduced by anticipated markups on the upstream firm&amp;rsquo;s own inputs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-best subsidy:&lt;/strong&gt; The optimal policy toward resilience and network formation when subsidizing firm-to-firm transactions is infeasible (constrained to τ_s = 1 for all s). Unlike first-best subsidies — which depend only on local tier parameters — second-best subsidies depend on production function parameters and bargaining weights throughout the entire downstream supply chain due to the uncorrected cumulative markup distortions.&lt;/p&gt;</description></item><item><title>Optimal Taxation and Market Power</title><link>https://macropaperwarehouse.com/papers/optimal-taxation-and-market-power/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-taxation-and-market-power/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper asks whether and how optimal income taxation should change when firms have market power. The question is motivated by the documented rise in economy-wide markups since 1980, which has compressed the labor share, widened the gap between worker and entrepreneurial income, and generated allocative inefficiency through excessive pricing.&lt;/p&gt;
&lt;p&gt;The authors develop a Mirrleesian optimal taxation framework augmented with three features absent from the canonical literature: (i) oligopolistic intermediate goods markets with endogenous, variable markups, (ii) heterogeneous firm productivities, and (iii) two occupational groups—wage-earning workers and profit-earning entrepreneurs—whose abilities are private information. Entrepreneurs strategically set prices under Cournot competition, which means that the tax system affects profits both through a firm&amp;rsquo;s own behavior and through the responses of its competitors. This strategic interaction is the critical novelty relative to prior work that assumes monopolistic competition.&lt;/p&gt;
&lt;p&gt;The main theoretical contribution is the derivation of optimal tax formulas for both labor income and profit income that decompose into four named components: (i) the Mirrleesian incentive component, which reflects the standard trade-off between redistribution and labor supply distortions; (ii) the Pigouvian component, which corrects for the externality from market power by subsidizing labor and entrepreneurial effort to offset the output shortfall from high markups; (iii) the Reallocation Effect (RE), which shifts the profit tax to redirect labor inputs from low-markup firms to high-markup firms where labor is inefficiently scarce, and which emerges only under heterogeneous markups; and (iv) the Indirect Redistribution Effect (IRE), which uses changes in competitors&amp;rsquo; product prices—a channel present only under oligopolistic (not monopolistic) competition—to redistribute income between entrepreneurs.&lt;/p&gt;
&lt;p&gt;For the labor income tax, the dominant force is the Pigouvian component. As average markups rise, the Pigouvian subsidy to labor supply grows, mechanically reducing optimal labor income tax rates. The profit tax is shaped by all four components in opposing directions; the net quantitative effect is resolved empirically.&lt;/p&gt;
&lt;p&gt;The model is calibrated to match distributions of labor income (from the Current Population Survey), profits (from Compustat-based data in De Loecker, Eeckhout, and Unger 2020), and firm-level markups (also from De Loecker, Eeckhout, and Unger 2020, using the cost-minimization approach) for the US in 1980 and 2019. The cost-weighted average markup rose from 1.25 in 1980 to 1.33 in 2019, with the increase concentrated at the top of the markup distribution.&lt;/p&gt;
&lt;p&gt;The central quantitative prescription is that the optimal labor income tax rate should decline by 7.7 percentage points between 1980 and 2019 (average optimal rate falls from 22.0 percent to 14.3 percent), while the optimal profit tax rate should rise by 2.2 percentage points on average (from 58.4 percent to 60.5 percent) and by 29.1 percentage points at the top. The decline in the labor income tax is driven primarily by the rise in average markups reducing the Pigouvian component. The increase in the profit tax, especially at the top, is driven primarily by the Mirrleesian component operating through the skill gap, which rises because higher markups reduce profit elasticity. The Pigouvian and reallocation components push in the opposite direction on the profit tax, but the Mirrleesian effect dominates.&lt;/p&gt;
&lt;p&gt;The optimal profit tax structure is regressive for large, high-markup firms—reflecting the RE, which requires lower tax rates for high-markup firms to incentivize labor reallocation toward them—but less regressive in 2019 than in 1980, reflecting the distributional tightening from rising markup inequality.&lt;/p&gt;
&lt;p&gt;Robustness checks across parameter values for the social welfare curvature k, the span of control ξ, and the elasticity of substitution σ confirm that the directional results hold: labor income tax rates decrease and profit tax rates increase from 1980 to 2019 across all parameter configurations. Extensions to nonlinear sales taxes and conditioning on markups confirm that even when the planner can observe markups directly, the first-best is not achievable because markups are endogenous to entrepreneurs&amp;rsquo; unobservable decisions.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the fundamental difference between this paper&amp;rsquo;s model and prior work on optimal taxation with market power?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior work using monopolistic competition (e.g., Gürer 2021; Boar and Midrigan 2019) assumes each entrepreneur holds monopoly power in its own market, so no strategic interaction exists between firms. Under monopolistic competition, entrepreneurs price to maximize utility given competitors&amp;rsquo; choices, and the envelope theorem implies that tax changes have no first-order effect on prices or utility through the pricing channel—the Indirect Redistribution Effect (IRE) disappears. In this paper, entrepreneurs compete in Cournot oligopolistic markets with a finite number of firms I, so each firm&amp;rsquo;s pricing depends on competitors&amp;rsquo; output. A change in one firm&amp;rsquo;s output (induced by taxation) shifts competitors&amp;rsquo; prices, opening a redistribution channel through product markets that is entirely absent in monopolistic competition. Additionally, the Reallocation Effect (RE) emerges only when firm-level markups are heterogeneous, which requires oligopolistic (not perfectly competitive) markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the four components of the optimal tax formula and how does each relate to market power?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The optimal tax wedge for both labor and profit income decomposes into four components. First, the Mirrleesian component reflects the standard trade-off between redistribution and the efficiency cost of taxation; in the presence of market power, it is modified because the skill gap for entrepreneurs depends on markups through the profit elasticity. Second, the Pigouvian component corrects the externality from market power, which causes prices to exceed marginal cost and output to be inefficiently low; it implies a subsidy to both worker and entrepreneurial effort, scaled by the reciprocal of the average markup (for the labor tax) or firm-level markup (for the profit tax). Third, the Reallocation Effect (RE) applies only to the profit tax and reflects that labor should be shifted toward high-markup firms where it is inefficiently underemployed; it reduces the tax rate for firms whose markup exceeds the average. Fourth, the Indirect Redistribution Effect (IRE) captures redistribution through competitor price changes under oligopolistic interaction; it can either raise or lower the profit tax rate depending on the distribution of social welfare weights and the cross-inverse demand elasticity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What happens to the labor income tax formula as average markups rise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The labor income tax formula contains a Pigouvian component equal to the reciprocal of the employment-weighted average markup. As average markups rise, this reciprocal falls, reducing the optimal labor income tax rate. Quantitatively, the optimal average labor income tax rate declines from 22.0 percent in 1980 to 14.3 percent in 2019, a decrease of 7.7 percentage points. In a purely competitive benchmark economy, the top labor income tax rate would be around 60 percent (consistent with Saez 2001); in the calibrated model with market power, it is 34.2 percent in 1980 and 28.7 percent in 2019. The Pigouvian component accounts for essentially the entire difference because the Mirrleesian component, when calibrated to the same labor income distribution, is unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the Mirrleesian component cause the top profit tax rate to rise with market power?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Mirrleesian component of the profit tax is driven by the skill gap, defined as the proportional rate of change in the composite entrepreneur ability measure. The skill gap depends on markups through the profit elasticity: as markups rise, profit elasticity falls (since profit elasticity is approximately the reciprocal of markup minus the span-of-control parameter minus the inverse of the labor supply elasticity term), which increases the skill gap. A higher skill gap amplifies the income divergence across entrepreneur types, increasing the Mirrleesian incentive to redistribute at the top. Quantitatively, Figure 5 shows that the rise in the skill gap from 1980 to 2019 tracks almost exactly the change in the inverse of profit elasticity, confirming that markup changes—not changes in the ability distribution—are the primary driver of increased Mirrleesian pressure on top profit taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the Reallocation Effect influence the structure (progressivity) of the profit tax?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The RE term equals the ratio of the average markup to the firm-level markup minus one: RE(θe) = μ/μ(θe) − 1. For firms with markups above the average, RE is negative, reducing their optimal tax rate; for firms below the average, RE is positive, increasing it. This implies that the optimal profit tax should be regressive relative to markup (i.e., high-markup firms face lower marginal tax rates), even though the overall profit tax rises on average. This provides a novel rationale for why the profit tax schedule in practice is less progressive—or even regressive—for large firms. As markups rise across the distribution, the reallocation effect pushes down the top profit tax but does not offset the larger increase from the Mirrleesian component in the quantitative exercise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the Indirect Redistribution Effect and why does it disappear under monopolistic competition?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The IRE captures the change in entrepreneurial utility that arises because a tax reduction for one entrepreneur increases their output, which reduces the prices of substitute goods produced by competitors, thereby lowering competitors&amp;rsquo; incomes. Under oligopolistic competition with I &amp;gt; 1 firms per market, the cross-inverse demand elasticity is nonzero, so competitor prices are sensitive to any one firm&amp;rsquo;s output decision, and this redistribution channel is open. Under monopolistic competition (I = 1), each entrepreneur is the sole producer in its market; competitors&amp;rsquo; prices do not depend on the firm&amp;rsquo;s output, the cross-inverse demand elasticity is zero, and the IRE vanishes by the envelope theorem. The IRE is also absent in perfectly competitive economies. Empirical evidence for the US suggests the hazard ratio of profits is sufficiently high that the IRE generally pushes toward a lower top profit tax rate, but the Mirrleesian effect dominates in the quantitative results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the quantitative effect of rising markups on the optimal tax rates, and what drives the net change in the profit tax?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model calibrated to 1980 and 2019 US data prescribes a decline in the optimal average labor income tax rate of 7.7 percentage points (from 22.0 to 14.3 percent) and an increase in the optimal average profit tax rate of 2.2 percentage points (from 58.4 to 60.5 percent). At the top of the profit distribution, the increase is 29.1 percentage points. The net profit tax increase results from four opposing forces: the Pigouvian component falls (pushing toward lower taxes) and the RE decreases for high-markup firms (also pushing down the top rate), while the IRE and especially the Mirrleesian component rise (pushing up top rates). The Mirrleesian effect is the dominant force, driven by rising markup inequality reducing profit elasticity and widening the skill gap for top entrepreneurs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the counterfactual analysis isolate the role of markups from productivity changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The counterfactual fixes the markup distribution at its 1980 level while holding the 2019 productivity distribution constant, then solves for optimal taxes. The result is that high-profit entrepreneurs would face lower optimal tax rates under 1980 markups than under 2019 markups, while low-profit entrepreneurs would face higher rates. Decomposing the difference, the Pigouvian component and the RE are larger for high incomes under 1980 (lower) markups, making the profit tax more regressive, while the IRE and the Mirrleesian component are smaller under 1980 markups, producing a lower top rate. The increase in the Mirrleesian component due to the markup increase from 1980 to 2019 is identified as the primary reason top profit taxes rise. This isolates the markup channel from the productivity channel in accounting for changes in optimal taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the robustness analysis reveal about parameter sensitivity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main qualitative result—labor income taxes decline and profit taxes rise from 1980 to 2019—holds across a broad parameter space. The optimal profit tax rate is largely insensitive to the social welfare curvature parameter k: across k ∈ {0.77, 1, 3}, the average optimal profit tax rate is approximately 58 percent in 1980 and 61 percent in 2019. The optimal average labor income tax rate is more sensitive to k: for k = 0.7, 1, and 3, the 1980 rates are 20.3, 26.7, and 44.6 percent, and the 2019 rates are 12.5, 19.4, and 39.1 percent, respectively. Changes in the span-of-control parameter ξ and the substitution elasticity σ do not affect the labor income tax wedge schedule directly but do influence it indirectly through the markup distribution. The directional results are confirmed for all tested parameter configurations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the role of the &amp;ldquo;additivity property&amp;rdquo; from prior externality literature, and why does it fail here?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The additivity property from the Pigouvian externality literature (see Kopczuk 2003; Sandmo 1975) states that the Pigouvian correction is separable from other components of the optimal tax formula, implying that rising markups would simply decrease the optimal tax rate (since 1/μ falls). This property holds under simplifying assumptions that abstract from the general equilibrium and incentive effects of market power. In the present model, the additivity property does not hold because markups enter all four components of the optimal tax formula—not just the Pigouvian term—through the skill gap (Mirrleesian component), the RE, and the IRE. As a result, rising markups can increase the optimal profit tax rate even though the Pigouvian component falls, because the skill gap and Mirrleesian force dominate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Can the government attain the first-best by conditioning taxes on markups?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The paper demonstrates that even if the planner can observe and condition taxes on firm-level markups, the first-best is not achievable. The reason is that markups are endogenous to the entrepreneurs&amp;rsquo; unobservable decisions: an entrepreneur&amp;rsquo;s markup depends on their privately known type and chosen output. When the planner designs a mechanism that conditions on markup, the incentive constraint facing entrepreneurs remains the same as in the benchmark model, because the promise-keeping constraints are independent of the entrepreneur&amp;rsquo;s true type when markups are observable. The optimal allocation with markup-conditioned taxes is shown to be equivalent to the second-best with nonlinear sales taxes, which still falls short of the first-best.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the policy implications for the design of the profit tax schedule?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model yields three concrete prescriptions for the joint design of labor and profit income taxes in the context of rising market power. First, labor income taxes should be reduced and top profit taxes should be increased as market power rises. Second, for large, high-productivity firms the profit tax should be designed to be appropriately regressive to enhance allocative efficiency through the Reallocation Effect—this provides a new normative justification for why profit tax schedules observed in practice are often less progressive than labor income taxes. Third, while profit taxes should be regressive for large firms, the degree of regressivity should decrease as market power rises, reflecting the trade-off between efficiency and equality: higher markups increase the Mirrleesian pressure for redistribution at the top, reducing the optimal regressivity.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Mirrleesian component (of the optimal tax formula):&lt;/strong&gt; The standard incentive component of the optimal tax, capturing the trade-off between direct redistribution and the efficiency cost of taxation. In the presence of market power, this component is modified because the skill gap for entrepreneurs depends on markups through the profit elasticity: higher markups reduce profit elasticity, widen the skill gap, and amplify the Mirrleesian force toward higher top profit taxes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pigouvian component:&lt;/strong&gt; The correction in the optimal tax formula for the externality from market power. Because oligopolistic pricing causes output to be inefficiently low, the optimal tax subsidizes both worker and entrepreneurial labor supply. In the labor income tax formula, the Pigouvian component is the reciprocal of the employment-weighted average markup; in the profit tax formula, it is the reciprocal of the firm-level markup. As average markups rise, the Pigouvian component reduces the optimal labor income tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reallocation Effect (RE):&lt;/strong&gt; A component of the optimal profit tax formula that captures the efficiency gain from reallocating labor inputs from low-markup firms (where labor&amp;rsquo;s marginal product is high relative to value) to high-markup firms (where labor demand is inefficiently low). It equals the ratio of the average markup to the firm-level markup minus one. It implies a lower optimal marginal tax rate for firms with markups above the average, producing a regressive structure in the profit tax for large firms. This effect is absent under monopolistic competition (uniform markups) and in competitive markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Indirect Redistribution Effect (IRE):&lt;/strong&gt; A component of the optimal profit tax formula specific to oligopolistic competition, capturing redistribution through competitor prices. Lowering the marginal tax rate of a high-productivity entrepreneur raises their output, which reduces the prices of substitutable goods produced by their competitors, thereby lowering competitors&amp;rsquo; incomes and redistributing toward workers who benefit from lower prices. This effect is present only when the cross-inverse demand elasticity is nonzero—i.e., only under oligopolistic (Cournot) competition with multiple firms per market—and vanishes under monopolistic competition and in the limit as the number of firms grows to infinity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill gap (for entrepreneurs):&lt;/strong&gt; The proportional rate of change in the composite entrepreneur ability measure with respect to entrepreneur type, analogous to the Mirrleesian skill gap for workers. Under market power, the entrepreneur skill gap depends on the markup through the profit elasticity: as firm-level markups rise, profit elasticity falls, the skill gap increases, and the income dispersion across entrepreneurs widens, which amplifies the Mirrleesian incentive to redistribute at the top and raises the optimal top profit tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Symmetric Cournot Competitive Tax Equilibrium (SCCTE):&lt;/strong&gt; The equilibrium concept used in the paper. It is a combination of a tax system, symmetric allocation, and symmetric price system such that all agents (final goods producer, entrepreneurs of each type, workers) are optimizing, strategic interaction in the intermediate goods market is a Cournot Nash equilibrium within each granular market, and all commodity and labor markets clear. Strategic interaction is restricted to within each granular market (firms in the same market compete), so decisions across markets are taken as given.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Composite ability:&lt;/strong&gt; A combined measure of entrepreneur productivity that determines equilibrium allocations and optimal taxation in the nested-CES economy. It aggregates the entrepreneur&amp;rsquo;s raw ability (affecting output capacity) and the demand parameter (affecting the market-level markup). The markup-relevant component and the quantity-relevant component are not perfect substitutes in the composite, since equilibrium prices depend on their specific composition while equilibrium quantities depend only on their combined value.&lt;/p&gt;</description></item><item><title>Optimal Tests Following Sequential Experiments</title><link>https://macropaperwarehouse.com/papers/optimal-tests-following-sequential-experiments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/optimal-tests-following-sequential-experiments/</guid><description>&lt;p&gt;This paper addresses a practical gap in the inference literature for sequential and adaptive experiments: while the design of such experiments has been studied extensively, there is little theory characterizing which tests are optimal once the experiment concludes. Adusumilli asks what the best hypothesis test looks like after a sequential experiment — a costly sampling design, a group sequential trial, or a bandit experiment — and whether the complexity of the adaptive protocol can be reduced to a manageable set of sufficient statistics for inference purposes.&lt;/p&gt;
&lt;p&gt;The methodological core is the derivation of two Asymptotic Representation Theorems (ARTs). The first ART applies to stopping-time experiments, where the sampling rule is fixed in advance but the stopping time is fully adaptive (updated after every observation). The second ART allows the sampling rule itself to be adaptive, but requires that both the sampling and stopping decisions are updated only a finite number of times after observing batches of data. Both ARTs establish that the asymptotic power function of any test in the original sequential experiment can be matched by a test in a limit experiment in which a Gaussian process is observed for each treatment and inference is made on the drifts of those processes.&lt;/p&gt;
&lt;p&gt;The key sufficiency result is a dimension reduction: regardless of the number of batches or the complexity of the adaptive protocol, any candidate test&amp;rsquo;s asymptotic power can be reproduced by a test that depends only on a fixed, finite set of statistics. For stopping-time experiments, the sufficient statistics are the stopped value of the score process (parametric) or the efficient influence function process (non-parametric), together with the stopping time. For batched experiments with adaptive sampling, the sufficient statistics are the final allocation proportions for each treatment (q_1, q_0) and the final values of the influence function processes (x_1, x_0) — a fixed dimension of 2d+2 regardless of the number of batches. This stands in contrast to the earlier ART of Hirano and Porter (2023), whose state variables grow linearly with the number of batches.&lt;/p&gt;
&lt;p&gt;The paper then characterizes optimal tests within the limit experiment under several criteria. Under no restriction, the Neyman-Pearson lemma yields the uniformly most powerful (UMP) test for a point alternative. For testing linear combinations of the parameter vector, a further dimension reduction applies and a UMP test exists in the limit experiment, depending only on a scalar projection of the sufficient statistic. Under unbiasedness, any valid test must satisfy an orthogonality condition on the stopped process. Under an alpha-spending constraint — where the overall size alpha is pre-allocated across stages — optimal stage-specific thresholds are derived. Under a weighted average power criterion, the optimal test takes the form of a likelihood ratio statistic integrated against the weight function.&lt;/p&gt;
&lt;p&gt;Three application classes are treated with explicit optimal procedures. For horizontal boundary designs (stopping when a test statistic crosses a fixed threshold, including the SPRT and the Neyman-allocation design from Adusumilli 2022), the most powerful asymptotically unbiased test rejects when the stopping time falls below a specific quantile of its null distribution. Monte Carlo simulations show the test achieves nominal 5% size even for small n, while the standard two-sample test has actual size near 9% in the same setting. For group sequential trials (including O&amp;rsquo;Brien-Fleming designs with T=2 stages), the paper derives stage-specific critical values satisfying the alpha-spending constraint, with numerical simulations confirming the asymptotic approximation is close to nominal for small n, though accuracy degrades for larger values of the null mean. For bandit experiments run with a batched Thompson-sampling algorithm (K=2 treatments, J=10 batches), the paper constructs the power envelope and shows it is asymmetric: distinguishing (a, 0) from (0, 0) is easier than distinguishing (-a, 0) from (0, 0) for a &amp;gt; 0, because Thompson sampling directs more observations to the arm with higher estimated mean, reducing informativeness from the other arm. Simulations confirm the asymptotic approximation is accurate for as few as n=20 observations per batch (200 total).&lt;/p&gt;
&lt;p&gt;The framework covers both parametric and non-parametric models. The non-parametric setting replaces the score process with the efficient influence function process, and the asymptotic power bound translates directly. Results also apply to conditional power given the stopping time.&lt;/p&gt;
&lt;p&gt;Q: What is the core methodological contribution of the paper?
A: The paper derives two Asymptotic Representation Theorems (ARTs) showing that the asymptotic power function of any test following a sequential experiment can be matched by a test in a Gaussian-diffusion limit experiment. The first ART covers stopping-time experiments with fully adaptive stopping rules; the second covers batched experiments with adaptive sampling rules. These ARTs reduce the infinite-dimensional adaptive experiment to a tractable limit object.&lt;/p&gt;
&lt;p&gt;Q: What are the sufficient statistics for inference, and why does this matter?
A: For stopping-time experiments, the sufficient statistics are the stopped value of the score (parametric) or efficient influence function (non-parametric) process, together with the stopping time. For batched experiments with adaptive sampling over K treatments, the sufficient statistics are the final allocation fractions (q_1, q_0) and the final influence function process values (x_1, x_0), a fixed dimension of 2d+2. This matters because it establishes that all the adaptive complexity of the protocol can be discarded: a test that uses only these statistics is asymptotically as powerful as any test that uses the full sample path.&lt;/p&gt;
&lt;p&gt;Q: How does this paper extend or differ from Hirano and Porter (2023)?
A: Hirano and Porter (2023) derive an ART for batched sequential experiments whose state variables grow linearly with the number of batches, making the limit experiment increasingly complex. Adusumilli shows that only a fixed number of sufficient statistics (2d+2) are needed to match unconditional asymptotic power, irrespective of the number of batches. The paper also extends to non-parametric models, derives optimal conditional tests given stopping times, and covers fully adaptive stopping-time experiments via a different route (Le Cam 1979) that does not require the batching restriction.&lt;/p&gt;
&lt;p&gt;Q: What is the result for testing linear combinations of the parameter?
A: When the null hypothesis is H0: a^T h = 0 in the limit experiment, a further dimension reduction applies: the UMP test depends only on a scalar projection x-tilde(tau) = sigma^{-1} a^T I^{-1/2} x(tau) and the stopping time tau. Because under the null this projection is a standard Brownian motion evaluated at the stopping time, the test is pivotal and uniformly most powerful for the composite hypothesis, regardless of the nuisance components of h.&lt;/p&gt;
&lt;p&gt;Q: What is the unbiasedness condition in the limit experiment?
A: A test phi is unbiased if its power exceeds its size under all alternatives. In the Gaussian limit experiment, Proposition 2 shows that any unbiased test of H0: h=0 vs H1: h≠0 must satisfy the moment condition E_0[x(tau) phi(tau, x(tau))] = 0, which is obtained by differentiating the power function at h=0 and applying the unbiasedness constraint. This condition restricts which tests can be considered, and the optimal unbiased test is characterized within this class.&lt;/p&gt;
&lt;p&gt;Q: What is the alpha-spending criterion and what does the paper show about it?
A: Alpha-spending (introduced by Gordon Lan and DeMets, 1983) pre-allocates the total size alpha across T stages via a spending vector (alpha_1, &amp;hellip;, alpha_T) with sum equal to alpha, and requires that the conditional rejection probability at stage t not exceed alpha_t. Theorem 2 shows that for discrete stopping times, the asymptotic conditional power beta_n(h|t) converges to beta(h|t) in the limit experiment on subsequences, enabling the derivation of optimal stage-specific thresholds satisfying the spending constraint.&lt;/p&gt;
&lt;p&gt;Q: What is the key finding for horizontal boundary designs with a fixed sampling rule?
A: For experiments that stop when the influence function process first crosses a fixed threshold gamma — including the SPRT and the Neyman-allocation costly-sampling design of Adusumilli (2022) — Lemma 1 establishes that the UMP asymptotically unbiased test of H0: mu_1 = mu_0 is the test that rejects when the stopping time tau-hat falls below the alpha-quantile of its null distribution. Monte Carlo evidence shows this test achieves nominal 5% size even for small n, while a naive two-sample test ignoring the adaptive stopping rule has actual size near 9%.&lt;/p&gt;
&lt;p&gt;Q: What does the power envelope look like for Thompson-sampling bandit experiments, and why is it asymmetric?
A: For Thompson-sampling bandit experiments with K=2 arms and J=10 batches, the power envelope for testing H0: (mu_1, mu_0) = (0, 0) is asymmetric: it is easier to distinguish the alternative (a, 0) from the null than to distinguish (-a, 0) for the same a &amp;gt; 0. The mechanism is that Thompson sampling allocates more observations to the arm with the higher estimated mean, so a positive treatment effect leads to more data for treatment arm 1 and less for arm 0, making the joint test more informative in one direction than the other.&lt;/p&gt;
&lt;p&gt;Q: How accurate are the asymptotic approximations in finite samples?
A: For horizontal boundary designs, Monte Carlo simulations show size is close to nominal 5% even for small n. For group sequential trials with an O&amp;rsquo;Brien-Fleming design (T=2 stages), the approximation is close to nominal for small n but degrades for larger values of the null mean mu-bar. For Thompson-sampling bandit experiments with K=2 arms and J=10 batches, the approximation is accurate for as few as n=20 observations per batch (200 total observations).&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle non-parametric models?
A: In non-parametric settings, the sufficient statistic is the efficient influence function process x_n(t) = (sigma^{-1}/sqrt(n)) sum_{i=1}^{floor(nt)} psi(Y_i), where psi is the efficient influence function for the functional of interest and sigma^2 = E[psi^2]. Proposition 3 establishes that the asymptotic power of any test is bounded above by the power envelope in the Gaussian limit experiment indexed by this process. The non-parametric and linear-combination parametric cases share the same limit structure.&lt;/p&gt;
&lt;p&gt;Q: What are the open questions identified by the author?
A: Two main limitations are noted. First, the ART for adaptive sampling rules is established only for batched experiments; whether it extends to fully adaptive (non-batched) sampling rules without loss of power is conjectured but not formally verified. Second, for fully adaptive experiments, the alpha-spending characterization is not yet available, and the author suggests exploring invariance restrictions or conditional inference as alternative optimality criteria.&lt;/p&gt;
&lt;p&gt;Asymptotic Representation Theorem (ART): A result showing that the asymptotic power function of any test in the original sequential experiment can be matched by that of a test in a Gaussian-diffusion limit experiment; used to transfer optimality results from the limit to the original problem.&lt;/p&gt;
&lt;p&gt;Limit experiment (Gaussian diffusion): The limiting statistical model in which one observes a Gaussian process x(t) = I^{1/2} h t + W(t) for each treatment, with unknown drift vector h; inference on h in this experiment characterizes optimal tests in the original sequential experiment.&lt;/p&gt;
&lt;p&gt;Sufficient statistics (for sequential inference): The finite set of statistics that, in the limit experiment, capture all power-relevant information from the adaptive experiment: for stopping-time experiments, the stopped score/influence function process value and the stopping time; for batched adaptive experiments, the final allocation fractions (q_a) and final influence function values (x_a) for each treatment arm.&lt;/p&gt;
&lt;p&gt;Alpha-spending constraint: A strengthened size requirement in group sequential trials that pre-allocates the total Type I error alpha across stages via a spending vector (alpha_1, &amp;hellip;, alpha_T); requires that conditional rejection probability at each stage t not exceed alpha_t, and sum alpha_t = alpha.&lt;/p&gt;
&lt;p&gt;Efficient influence function process: In a non-parametric model, the partial-sum process x_n(t) = (sigma^{-1}/sqrt(n)) sum_{i=1}^{floor(nt)} psi(Y_i), where psi is the efficient influence function for the target functional; this process is the non-parametric analogue of the score process and serves as the sufficient statistic for non-parametric sequential inference.&lt;/p&gt;
&lt;p&gt;Stopping-time experiment: A sequential experiment in which the sampling rule (how to allocate observations across treatments) is fixed before the experiment begins but the stopping rule (when to terminate) is fully adaptive and updated after every observation.&lt;/p&gt;
&lt;p&gt;Power envelope: The supremum of the asymptotic power function over all tests of a given size; computed in the limit experiment via the Neyman-Pearson lemma and the Girsanov theorem, and serves as an upper bound on the power of any feasible test in the original sequential experiment.&lt;/p&gt;</description></item><item><title>Organizational Change and Reference-Dependent Preferences</title><link>https://macropaperwarehouse.com/papers/organizational-change-and-reference-dependent-preferences/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/organizational-change-and-reference-dependent-preferences/</guid><description>&lt;p&gt;Schmidt and von Wangenheim develop a dynamic model of organizational change in which workers have reference-dependent preferences — specifically loss aversion and social comparisons — to explain several empirically observed patterns that standard models cannot easily account for: organizational inertia in normal times, sudden productivity jumps during crises, persistent total factor productivity (TFP) differences across firms in the same industry, and effort and wage compression within firms.&lt;/p&gt;
&lt;p&gt;The motivating empirical puzzle is the early-1980s collapse of the Great Lakes iron ore and steel industry, which had been geographically shielded from foreign competition for over 100 years. When Brazilian competitors undercut prices, the industry responded by roughly doubling labor productivity within a few years — not through new technology or capital investment, but through organizational improvements and more efficient use of existing capital (Schmitz 2007). The broader puzzle is Syverson&amp;rsquo;s (2004) finding that at the four-digit industry level, the 90th-percentile firm has TFP 1.9 times that of the 10th-percentile firm, a gap that cannot be explained by observable input differences.&lt;/p&gt;
&lt;p&gt;The model features a principal (firm owner) bargaining with loss-averse workers (represented by a union) over organizational change — represented as a worker effort level x that adapts the firm to the state of technology θ. Workers&amp;rsquo; reference point is a convex combination of the status quo contract and their rational expectations of the agreed contract, with weight α on the status quo. Loss aversion parameter λ &amp;gt; 0 means that losses relative to the reference point are weighted more heavily than gains.&lt;/p&gt;
&lt;p&gt;The core static result (Proposition 1) is that loss aversion drives a wedge of 1 + αλ between the workers&amp;rsquo; marginal cost and the firm&amp;rsquo;s marginal benefit of organizational change. Below a threshold θ defined by ∂v(x₀,θ)/∂x = 1 + αλ, there is complete inertia: the firm does not change the effort level at all. Above θ, the firm adjusts effort, but to x(θ) &amp;lt; x^ME(θ), undershooting the materially efficient level. Higher λ or higher α both widen the inertia range and reduce the amount of implemented change (Proposition 2).&lt;/p&gt;
&lt;p&gt;A crisis — modeled as a cost shock that makes the status quo contract generate negative profits, threatening firm closure — changes workers&amp;rsquo; outside option from their current utility U₀ to the unemployment utility of zero. Workers are now willing to accept either wage cuts or effort increases to keep their jobs. Crucially, because both concessions are perceived as losses of equal size by workers, the firm prefers to increase effort rather than cut wages, since increasing effort is more productive when x &amp;lt; x^ME. The model thus provides a microfoundation for downward nominal wage rigidity: in a recession, workers make concessions through harder work rather than wage cuts.&lt;/p&gt;
&lt;p&gt;In the infinite-horizon dynamic model, workers accumulate a quasi-rent over time equal to αλ(x_{t-1} − x₀), which represents compensation paid for past effort increases. This quasi-rent is what the firm expropriates during a crisis, allowing a discontinuous jump in effort toward the materially efficient level. Firms founded at different times or hitting different idiosyncratic shocks will therefore have different effort histories and different productivity levels, generating persistent TFP differences even among firms with identical technologies. When forward-looking players anticipate the possibility of crisis, inertia in normal times actually widens further (x̃(θ) ≤ x(θ)), because firms rationally delay effort adaptation knowing it will be cheaper to implement change during a crisis.&lt;/p&gt;
&lt;p&gt;The expectations-management extension (Section 4) introduces a moral-hazard problem with a manager who chooses the probability of successful change. Because a higher probability of change raises the workers&amp;rsquo; expectation-based reference point and reduces their perceived adaptation cost, the firm&amp;rsquo;s optimization problem becomes convex when the cost of effort for management is sufficiently low relative to (1−α)λΔx. This delivers a bang-bang result: the principal induces either full implementation (p = 1) or no change (p = 0), never an interior probability. This formalizes the management-consulting advice that commitment and urgency are essential to organizational change.&lt;/p&gt;
&lt;p&gt;The social-comparisons extension (Section 5) shows that when workers compare their wages and effort to colleagues, the firm optimally compresses effort differences across workers — inducing the less productive worker to work more than efficiency requires and the more productive worker to work less. If productivity differences between workers are sufficiently small, the firm sets identical effort levels. Wage compression follows from effort compression. To avoid the cost of social comparisons entirely, it may be optimal for the firm to split into separate legal entities whose workers no longer form a common reference group — a new explanation for organizational unbundling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core mechanism by which loss aversion generates organizational inertia in normal times?&lt;/strong&gt;
A: Workers have a reference point that is a convex combination (weight α on status quo, weight 1−α on rational expectations) of their current contract and the expected new contract. Because workers perceive an effort increase above their reference effort as a loss, the firm must pay a wage premium of αλ per unit of additional effort on top of the material effort cost of 1. This raises the effective marginal cost of implementing change from 1 to 1 + αλ, so the firm only implements change when the marginal revenue of effort strictly exceeds 1 + αλ. Below the threshold technology level θ (defined by ∂v(x₀,θ)/∂x = 1 + αλ), there is complete inertia and the firm keeps x* = x₀.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does a crisis break the inertia?&lt;/strong&gt;
A: A crisis is a cost shock large enough to make the firm&amp;rsquo;s profits negative under the status quo contract, so the firm would close unless workers make concessions. Workers&amp;rsquo; outside option shifts from their accumulated utility U₀ to the unemployment utility of zero. Because wage cuts and effort increases are both perceived as losses of equal magnitude, the firm prefers to demand effort increases (which raise revenue) over wage cuts (which do not). At the margin, when workers are at zero utility, the loss-aversion terms cancel from the marginal rate of substitution, and the firm can push effort up to the materially efficient level x^ME — a discontinuous jump.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why do wages not fall during a recession in this model?&lt;/strong&gt;
A: Workers perceive both wage cuts and effort increases as losses of equal per-unit utility cost. Since increasing effort by one unit and cutting wages by one unit impose the same utility cost on workers but effort increases raise firm revenue while wage cuts do not, it is always more efficient for the firm to extract concessions through higher effort rather than lower wages. The firm therefore first drives effort to x^ME before cutting wages, and cuts wages only if the zero-utility constraint still is not binding at x^ME. This provides a microfoundation for Bewley&amp;rsquo;s (1999) observation that wages do not fall during recessions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Where does the quasi-rent exploited during a crisis come from?&lt;/strong&gt;
A: Every time the firm implements an effort increase in normal times it must compensate workers with a permanent wage increase to cover both the permanent higher effort cost (x_{t}−x_{t-1}) and the one-time behavioral adaptation cost αλ(x_{t}−x_{t-1}). Because the compensation for the adaptation cost must be spread over all future periods as a permanent payment, workers accumulate a quasi-rent that by period t equals αλ(x_{t-1}−x₀) above their initial utility U₀ = w₀−x₀. This is the rent the firm expropriates in a crisis to fund the discontinuous effort increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the dynamic model generate persistent TFP differences across firms in the same industry?&lt;/strong&gt;
A: Firms founded at different times start with different initial status-quo effort levels relative to the current technology θ. Because each firm&amp;rsquo;s path of organizational adaptation is history-dependent — inertia regions, timing of crises, and accumulated quasi-rents all depend on when the firm was founded and what idiosyncratic shocks it experienced — firms that start later (or hit crises earlier) can remain more productive than older firms for extended periods. The numerical example with v(x,θ) = θ ln(x), α = 0.5, λ = 1, δ implied parameters shows that a firm founded when θ = 7 at the materially efficient point can maintain a substantial productivity advantage over a firm founded when θ = 4 that has accumulated inertia, even though both firms have access to the same technology.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does rational anticipation of a future crisis increase or decrease inertia in normal times?&lt;/strong&gt;
A: It strictly increases inertia. When players assign probability µ &amp;gt; 0 to a crisis each period, forward-looking workers demand higher compensation for effort increases in normal times — specifically, the per-period compensation for behavioral adaptation cost rises from (1−δ)αλ to γ = (1−δ(1−µ))αλ, which is increasing in µ. Simultaneously, the firm anticipates that effort adaptation will be cheaper to achieve in a crisis and therefore delays effort increases. The result is that the inertia threshold shifts from x(θ) to x̃(θ) ≤ x(θ), a strictly wider inertia region (Proposition 6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the expectations-management result and what drives it?&lt;/strong&gt;
A: When a manager chooses the probability of successful change p at cost c(p) = (c/2)p², the wage the firm must pay workers is concave in p (equation 22): w = x₀ + p(1+λ)Δx − p²(1−α)λΔx + U₀. The concavity arises because a higher p raises the expectation-based component of the reference point, lowering workers&amp;rsquo; perceived adaptation cost. When c &amp;lt; (1−α)λΔx, this makes the principal&amp;rsquo;s profit function convex in p, so the optimum is at a corner: the principal induces either p = 1 (full implementation) or p = 0 (no change). Even when an interior solution obtains, a decrease in α (more weight on expectations) increases p. This formalizes the practitioner prescription that organizational change requires convincing everyone that change is certain and unavoidable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the effort and wage compression result under social comparisons?&lt;/strong&gt;
A: When each worker compares his situation to his colleague&amp;rsquo;s, with weight β on the peer&amp;rsquo;s wage and effort in forming the reference point, the firm must pay both workers a social-comparison premium of λβ(x₂−x₁) per unit of effort difference (Lemma 5). The firm therefore optimally compresses effort differences: it induces the less productive worker to exert effort above his efficient level and the more productive worker below his efficient level, at first-order conditions ∂v₁/∂x = 1 − 2λβ and ∂v₂/∂x = 1 + 2λβ respectively. If the productivity difference is small enough (specifically if ∂v₂(x*,θ)/∂x &amp;lt; 1 + 2λβ at the equal-effort point), the firm sets x₁* = x₂* = x*, eliminating wage inequality entirely (Proposition 8).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why might it be optimal for a firm to split into separate entities?&lt;/strong&gt;
A: Social comparisons impose costs on the firm by requiring higher wages for both workers (each receives a premium of λβ(x₂−x₁) regardless of their relative rank) and by distorting effort levels away from their efficient values. If workers employed by legally separate firms no longer treat each other as part of their reference group — because β falls to zero across firm boundaries — the firm can eliminate these comparison costs by spinning off activities into independent entities. This provides an efficiency rationale for organizational unbundling that does not rely on asset specificity or transaction costs, addressing what the authors call the &amp;ldquo;Williamson puzzle.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the implications for older workers and for social insurance policy?&lt;/strong&gt;
A: Older workers have two compounding reasons to be more resistant to organizational change: shorter remaining time horizons reduce the present value of permanent wage compensation for adaptation costs, and Gächter, Johnson, and Herrmann (2022) report that loss aversion λ increases with age, income, and wealth. Both factors raise the cost of implementing change with older workers. For social insurance, generous unemployment benefits or policies preventing layoffs (such as short-time work schemes) reduce workers&amp;rsquo; concession costs in a crisis, weakening the mechanism by which crises trigger change. The model suggests this may contribute to slower technology adoption in countries with stronger labor market protections.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What empirical facts from the existing literature does the model account for?&lt;/strong&gt;
A: The model accounts for: (1) Syverson&amp;rsquo;s (2004) finding of a 90th/10th percentile TFP ratio of 1.9 in four-digit US industries; (2) the iron ore and steel case study (Schmitz 2007) in which labor productivity doubled within a few years of a competitive shock with no new technology; (3) Bloom et al.&amp;rsquo;s (2014) correlation between more intense competition and higher TFP; (4) Holmes and Schmitz&amp;rsquo;s (2010) survey finding that competitive shocks raise industry productivity mainly through survival and improvement of existing firms; (5) Bewley&amp;rsquo;s (1999) downward nominal wage rigidity; and (6) Hjort, Li, and Sarsons (2022) on multinational firms using headquarters wages as reference points for wages in low-wage locations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Loss aversion (λ):&lt;/strong&gt; The parameter measuring the degree to which workers weight losses relative to their reference point more heavily than gains. A meta-analysis (Brown et al. 2023) across 607 empirical estimates finds an average loss aversion parameter of 1 + λ = 1.955. In this paper, λ &amp;gt; 0 means workers perceive a wage cut and an effort increase as losses, raising the effective marginal cost of organizational change by a factor of 1 + αλ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reference point (w^r, x^r):&lt;/strong&gt; The benchmark wage and effort level against which workers evaluate outcomes. Defined as a convex combination of the status quo contract (w₀, x₀) with weight α and the rational expectation of the agreed contract (w^e, x^e) with weight 1−α. Losses occur when the realized wage falls below w^r or the realized effort exceeds x^r.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Organizational inertia:&lt;/strong&gt; The firm&amp;rsquo;s failure to implement materially efficient organizational change even when doing so would increase total surplus. In the model, inertia arises because the effective marginal cost of effort to the firm is 1 + αλ rather than 1, so the firm only implements change above a threshold technology level θ. The range of inertia widens with higher λ, higher α, and higher initial effort x₀.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quasi-rent:&lt;/strong&gt; The utility accumulated by workers above their initial utility U₀ = w₀−x₀ as compensation for past effort increases. By period t it equals αλ(x_{t-1}−x₀). This quasi-rent is the source of concessions the firm can extract in a crisis: workers accept higher effort (or lower wages) in exchange for keeping their jobs rather than losing this accumulated utility through unemployment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Behaviorally efficient effort x(θ):&lt;/strong&gt; The effort level that maximizes joint surplus taking behavioral adaptation costs into account, defined by ∂v(x,θ)/∂x = 1 + (1−δ)αλ in the dynamic model. This is strictly below the materially efficient effort x^ME(θ) (defined by ∂v/∂x = 1) and strictly above the firm&amp;rsquo;s privately optimal effort in normal times.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effort compression:&lt;/strong&gt; The result under social comparisons that the principal optimally reduces the effort difference between workers relative to the efficient allocation — inducing the less productive worker to work more and the more productive worker to work less than efficiency requires. Driven by social-comparison costs λβ(x₂−x₁) that both workers receive as premiums regardless of relative rank.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expectations management:&lt;/strong&gt; The strategic use of commitment to high probability of change in order to shift workers&amp;rsquo; expectation-based reference point and reduce the perceived adaptation cost. When α is small (rational expectations dominate the reference point), making change more certain lowers the wage cost of implementation, creating a complementarity between commitment and cost reduction that produces the bang-bang result: implement with certainty or not at all.&lt;/p&gt;</description></item><item><title>Passive Quantitative Easing: Bond Supply Effects through Lower Debt Issuance</title><link>https://macropaperwarehouse.com/papers/passive-quantitative-easing-bond-supply-effects-through-lower-debt-issuance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/passive-quantitative-easing-bond-supply-effects-through-lower-debt-issuance/</guid><description>&lt;p&gt;The paper introduces the concept of &amp;ldquo;passive quantitative easing&amp;rdquo; (passive QE): a deliberate reduction in government debt issuance that lowers anticipated future bond supply and reduces long-term yields through the same supply channel as central bank asset purchases, without involving asset purchases or reserves creation. The authors develop a unified classification scheme for central bank balance sheet policies organized by their net effect on anticipated future bond supply, and show that the Danish government&amp;rsquo;s unexpected January 2015 debt halt — which removed approximately 29.9 billion DKK from the outstanding bond stock over roughly nine months — was followed by a two-day yield decline of approximately 25 basis points across the entire yield curve. Regression estimates controlling for concurrent ECB and SNB actions imply that the halt raised the safety premium on Danish bonds by 17–22 basis points and reduced the ten-year term premium by 37–70 basis points, with combined effects pointing to 54–92 basis points in lower yields relative to the counterfactual. The Danish episode ranks approximately on par with the Federal Reserve&amp;rsquo;s QE3 in the classification scheme, and the paper argues that passive QT — unexpectedly higher debt issuance — is contractionary through two additional portfolio balance channels not present in active QT and should be treated as an active policy tool rather than a neutral background condition.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-passive-qe-and-what-distinguishes-it-from-conventional-qe"&gt;Q1. What is &amp;ldquo;passive QE&amp;rdquo; and what distinguishes it from conventional QE?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper defines passive QE as a reduction in government debt issuance that lowers anticipated future bond supply, arguing this is functionally equivalent to central bank asset purchases in its effects on long-term yields, even though it involves neither asset purchases nor reserves creation.&lt;/strong&gt; The supply-side equivalence holds because what matters for term premia and safe-asset premia is the anticipated future stock of bonds available to private investors: whether the central bank withdraws bonds via outright purchases or the government simply issues fewer new ones, the anticipated future supply declines, requiring downward adjustment in the compensation investors demand for duration risk and scarcity. The distinction from active QE is therefore operational rather than economic: passive QE leaves the central bank&amp;rsquo;s balance sheet unchanged, makes no reserve injection, and requires no fiscal–monetary coordination beyond the government&amp;rsquo;s own debt management decisions.&lt;/p&gt;
&lt;h3 id="q2-how-do-the-authors-classify-central-bank-balance-sheet-policies"&gt;Q2. How do the authors classify central bank balance sheet policies?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper proposes a unified classification scheme that maps central bank balance sheet policies by their net effect on anticipated future bond supply, placing passive QE in the same stimulative category as active QE programs and ranking the Danish halt at approximately −0.0104 on this measure — nearly on par with the Federal Reserve&amp;rsquo;s QE3 at −0.0120.&lt;/strong&gt; The scheme allows cross-country and cross-program comparisons of unconventional monetary policy actions by reducing them to a common currency of anticipated supply change. The classification also distinguishes passive QT from active QT: the paper argues that passive QT (higher-than-anticipated issuance) is more contractionary than active QT of equal magnitude because higher issuance also reduces safe-asset scarcity value and shifts duration risk back to the market through two additional portfolio balance channels.&lt;/p&gt;
&lt;h3 id="q3-what-does-the-danish-debt-halt-episode-show"&gt;Q3. What does the Danish debt halt episode show?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The January 30, 2015 announcement by Denmark&amp;rsquo;s debt management office that it would halt new government bond issuance for the remainder of the year was unexpected and was followed within two trading days by a yield decline of approximately 25 basis points across the entire yield curve.&lt;/strong&gt; The halt lasted roughly nine months and reduced the outstanding Danish government bond stock by approximately 29.9 billion DKK. The reaction is interpreted as evidence that market participants immediately revised down their expectations of future bond supply, compressing the compensation required for holding duration risk and raising the relative value of the now-scarcer safe assets.&lt;/p&gt;
&lt;h3 id="q4-what-do-the-regression-estimates-imply"&gt;Q4. What do the regression estimates imply?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Controlling for the concurrent SNB and ECB announcements in January 2015, the authors&amp;rsquo; regression estimates imply that the Danish halt raised the safety premium on Danish bonds by 17–22 basis points and reduced the ten-year term premium by 37–70 basis points, pointing to a combined reduction in bond yields of 54–92 basis points relative to the counterfactual without the halt, measured over the halt period.&lt;/strong&gt; The term-premium decline is interpreted as consistent with supply-induced portfolio balance effects: fewer bonds requiring lower duration-risk compensation. The safety-premium increase is consistent with safe-asset scarcity effects: a tighter supply of high-quality government bonds raising their relative scarcity value. These two channels are identified separately in the yield decomposition and estimated to be independently significant.&lt;/p&gt;
&lt;h3 id="q5-how-does-the-paper-treat-passive-qt"&gt;Q5. How does the paper treat passive QT?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper argues that passive QT — a higher-than-anticipated level of government debt issuance — is not a neutral background condition but an active contractionary force, and potentially more contractionary than active QT of equal magnitude through two additional portfolio balance channels.&lt;/strong&gt; The argument is that higher issuance reduces safe-asset scarcity value and directly shifts duration risk from the central bank to the market, while active QT (central bank balance sheet reduction) lacks these two additional channels. This implies that fiscal authorities&amp;rsquo; debt issuance decisions carry monetary policy implications that are not captured in frameworks treating issuance as a non-monetary decision.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;passive QE&lt;/strong&gt; : a deliberate reduction in government debt issuance that lowers anticipated future bond supply and reduces long-term yields through supply effects; the paper treats it as functionally equivalent to central bank asset purchase programs despite involving no asset purchases or reserves creation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;passive QT&lt;/strong&gt; : higher-than-anticipated government debt issuance; the paper treats it as an active contractionary tool, potentially more contractionary than active QT of equal magnitude, because it triggers two additional portfolio balance channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;safety premium&lt;/strong&gt; : the premium on high-quality safe assets such as government bonds reflecting their scarcity value; in the Danish halt episode this rose as supply tightened.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;term premium&lt;/strong&gt; : the component of a long-term bond yield compensating investors for bearing duration risk; in the Danish halt episode this fell as anticipated future bond supply declined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;classification scheme&lt;/strong&gt; : the paper&amp;rsquo;s taxonomy of central bank balance sheet policies organized by their net effect on anticipated future bond supply, allowing cross-program comparisons including passive QE and passive QT.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Danish debt halt&lt;/strong&gt; : the January 30, 2015 announcement by Denmark&amp;rsquo;s debt management office of a halt to new government bond issuance for the remainder of the year, used as the natural experiment to test the passive QE hypothesis.&lt;/p&gt;</description></item><item><title>Patent Term, Innovation, and the Role of Technology Disclosure Externalities</title><link>https://macropaperwarehouse.com/papers/patent-term-innovation-and-the-role-of-technology-disclosure-externalities/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/patent-term-innovation-and-the-role-of-technology-disclosure-externalities/</guid><description>&lt;p&gt;This paper examines how anticipated changes in patent term affect R&amp;amp;D and innovation, using the U.S. ratification of the Trade-Related Aspects of Intellectual Property Rights (TRIPs) agreement in 1995 as a quasi-natural experiment. The central research question is whether and how policy anticipation shapes the short- and long-run dynamics of innovative activity, given ambiguous theoretical predictions: news of a patent term reduction could either deter innovation (by signaling lower future returns) or accelerate it (by inducing innovators to file under the more favorable existing regime before it expires).&lt;/p&gt;
&lt;p&gt;The identification strategy exploits a difference-in-differences (DiD) design using two sources of variation across 621 4-digit International Patent Classification (IPC) technological fields. The first is cross-sectional variation in field-specific pending periods — the time between patent application and grant during which monopoly rights are not fully enforceable — which determines whether TRIPs increased or reduced each field&amp;rsquo;s effective patent term (from 17 years post-grant to 20 years post-application minus the pending period). Fields with average pending periods exceeding three years faced expected reductions; those below faced extensions. On average across fields, TRIPs extended patent term by approximately 473 days (about 15 months), but approximately 45% of fields faced greater than 5% probability that individual patents would receive a term reduction. The second source is time variation from two events: a news event at the end of 1992 (when the Blair House Accord substantially reduced uncertainty about TRIPs adoption) and implementation in June 1995. The empirical sample spans 1985Q1–2000Q4 using PATSTAT patent data, augmented by firm-level R&amp;amp;D data from NBER-Compustat for 2,410 listed U.S. firms.&lt;/p&gt;
&lt;p&gt;Three main empirical facts emerge. First (Fact 1), innovation and R&amp;amp;D accelerate more during the anticipation phase (1992Q4–1995Q2) in fields with a higher probability of patent term reduction. A one-percentage-point higher reduction probability corresponds to a 1.4% larger increase in granted patent applications before implementation; a one-month shorter average patent term extension corresponds to a 2.9% larger increase. At the firm level, a one-percentage-point higher reduction probability is associated with a 1.9% increase in annual R&amp;amp;D expenditure (approximately $1.7 million), ruling out the interpretation that rising patent counts merely reflect strategic filing adjustments.&lt;/p&gt;
&lt;p&gt;Second (Fact 2), this heightened innovative activity persists for at least five years after implementation. Two years post-implementation, a one-percentage-point higher reduction probability corresponds to 1.44 additional quarterly patents (+2.7% in Poisson estimates), and a one-month shorter term extension corresponds to 3.3 more patents (+5.9%). This persistence is driven by indirect effects: the anticipation-induced burst in patenting generates additional follow-on innovation through technology disclosure externalities linked to cumulative knowledge creation. The elasticity of post-implementation innovation to news-phase innovation is estimated at approximately 2.1.&lt;/p&gt;
&lt;p&gt;Third (Fact 3), the direct effect of patent term on innovation — estimated by augmenting the DiD specification to control for field-specific innovation histories — is negative for shorter extensions and consistent with prior literature. A one-month shorter patent term extension reduces quarterly patents by 1.7%, and a one-year reduction reduces them by 20.9%. These estimates align with Budish, Roin, and Williams (2015, 2016), who find that a one-year extension of patent monopoly increases R&amp;amp;D by 7%–22% in pharmaceuticals. The identification is supported by the absence of pre-trends, by the finding that pre-news pending period distributions predict realized post-news variation with coefficients near one (0.957–1.104), and by extensive robustness checks.&lt;/p&gt;
&lt;p&gt;Q: What was the effective change in U.S. patent term under TRIPs, and why did it differ across fields?
A: TRIPs shifted patent expiry from 17 years after grant to 20 years after application date. Because monopoly rights are only fully enforceable after grant, the effective term became 20 years minus the pending period. Fields with average pending periods shorter than three years received net extensions; fields with longer average pending periods faced net reductions. Cross-field variation in pending periods arises because applications in different technical fields are reviewed by distinct USPTO technical units with different complexity and backlog levels.&lt;/p&gt;
&lt;p&gt;Q: What was the news event, and how was anticipation established?
A: The paper identifies November 1992 — when the Blair House Accord substantially reduced uncertainty about TRIPs adoption — as the news event, with formal ratification in December 1994 and implementation in June 1995. Documentary evidence confirms anticipation: U.S. business executives were involved in TRIPs negotiations from 1986; the patent term change appeared in a 1991 GATT draft; an Advisory Committee report co-signed by IBM, 3M, Motorola, and others referenced it in August 1992; and a New York Times article noted proposed changes in September 1992.&lt;/p&gt;
&lt;p&gt;Q: How is the probability of patent term reduction (PL_j) constructed, and what is its distribution?
A: PL_j is the fraction of patents in field j granted before the TRIPs news with a pending period exceeding three years, computed using PATSTAT data on U.S. patents granted between January 1990 and May 1992. Approximately 45% of fields faced a reduction probability exceeding 5%, and 15% faced a probability exceeding 10%. Even fields with an average term extension greater than one year had individual-patent reduction probabilities as high as 40%. A 10-percentage-point increase in PL_j corresponds to approximately a four-month shorter average term extension.&lt;/p&gt;
&lt;p&gt;Q: What is Fact 1 and what are its quantitative magnitudes?
A: Fact 1 states that during the news phase, innovation and R&amp;amp;D increase relatively more in fields with higher patent term reduction probability and shorter average term extension. One year after the news (two years before implementation), a one-percentage-point higher reduction probability generates 0.19 additional quarterly patents (+0.5% in Poisson estimates); a one-month shorter average extension generates 0.35 additional units (+0.8%). These effects approximately triple one year before implementation. At the firm level, a one-percentage-point higher probability is associated with a 1.9% increase in annual R&amp;amp;D (~$1.7 million) in 1993.&lt;/p&gt;
&lt;p&gt;Q: Why does news of a potential patent term reduction accelerate rather than deter innovation?
A: Innovators who anticipate a reduction in future patent protection under the new regime have strong incentives to file applications before implementation to secure the longer 17-years-from-grant term while it remains available. The acceleration is therefore consistent with innovators preferring longer protection: they rush to file under the more favorable old regime rather than curtailing innovation. Complementary analyses exploiting within-field dispersion in pending periods find that firms were particularly responsive to scenarios involving adverse policy changes, consistent with loss aversion. The dynamics of the news-phase acceleration are also consistent with an R&amp;amp;D gestation lag of approximately two years, as estimated by Pakes and Schankerman (1984).&lt;/p&gt;
&lt;p&gt;Q: What is Fact 2 and what drives the post-implementation persistence?
A: Fact 2 states that the heightened innovation in fields with higher reduction probability persists for at least five years after June 1995, even though the direct effect of a shorter patent term is innovation-reducing. Two years post-implementation, a one-percentage-point higher reduction probability corresponds to 1.44 additional quarterly patents (+2.7% Poisson) and a one-month shorter extension to 3.3 additional patents (+5.9% Poisson). The persistence is driven by technology disclosure externalities: the news-phase acceleration generates new patented knowledge that subsequent innovations build upon. Fields where new inventions rely more heavily on past innovations from the same field — proxied by backward citation intensity — display stronger post-implementation persistence.&lt;/p&gt;
&lt;p&gt;Q: How does the paper separate direct from indirect (externality-driven) post-implementation effects?
A: Following Angrist and Pischke (2009), the paper augments the baseline DiD specification to control for field-specific innovation histories via a lagged moving average of past outcomes and pre-determined field attributes interacted with quarterly fixed effects. The resulting coefficients capture the effect of patent term variation orthogonal to the news-induced innovation dynamics. The direct effect estimates are negative post-implementation (Fact 3), while the overall estimates are positive (Fact 2), confirming that the indirect externality channel outweighs the direct channel in the post-implementation period.&lt;/p&gt;
&lt;p&gt;Q: What is Fact 3 and how does its magnitude compare to prior literature?
A: Fact 3 states that, controlling for the news shock, a shorter patent term extension leads to a relative decline in innovation post-implementation. The estimated semi-elasticity is 1.7% per one-month increase in patent term and 20.9% per one-year increase. These estimates align with Budish, Roin, and Williams (2015, 2016), who find a 7%–22% increase in pharmaceutical R&amp;amp;D per one-year extension, and with Hemous et al. (2023), whose model implies a 1.2% innovation increase per one-month extension.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated elasticity of post-implementation innovation to news-phase innovation, and what does it imply?
A: Point estimates imply that one additional patent during the news phase generates approximately 5.1 additional patents post-implementation. Given average patent counts of 408.5 during the news phase and 1,000.3 post-implementation, this corresponds to a percent-to-percent elasticity of approximately 2.1. This elasticity captures the technology disclosure externality channel by which transitory accelerations in patenting generate persistent follow-on innovation.&lt;/p&gt;
&lt;p&gt;Q: Why is ignoring anticipation (as in Abrams 2009) a problem for DiD identification?
A: Anticipation inflates patenting in fields with higher reduction probability during the pre-implementation period, violating the DiD assumption that pre-implementation outcomes provide an unaffected baseline. For example, between April 1994 and March 1995, average monthly patents in field C12P (high reduction probability) were 15.1 units above pre-news levels, versus only 2.4 in field E05D (low reduction probability). Using this inflated pre-implementation level as the DiD reference baseline reverses the sign of the estimated implementation effect relative to the specification that uses the unaffected pre-news baseline.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the technology disclosure externality mechanism over alternative explanations?
A: The paper proxies technological dependence by backward citation intensity at the field level and finds that the news-phase acceleration propagates more strongly into post-implementation innovation in fields where new inventions more heavily cite prior same-field patents. Time-varying measures of technological dependence identify this channel as the primary driver of indirect post-implementation effects. Two alternative mechanisms — changes in technological competition and adjustments in patenting strategies — lack comparable empirical support. The finding is consistent with Hegde, Herkenhoff, and Zhu (2023), who document that permanent increases in knowledge diffusion speed permanently raise follow-on innovation rates.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of jointly considering anticipation and knowledge spillovers?
A: Standard patent term analyses that abstract from anticipation effects and knowledge spillovers may substantially mischaracterize full welfare implications. The paper shows that innovation-policy interventions shape both short- and long-run outcomes, and that near-term variation in innovative activity can itself drive medium- to long-term effects through technological externalities. The estimated semi-elasticities of news, direct, and indirect effects provide empirical calibration targets for normative endogenous growth models used to derive optimal patent term, complementing prior normative recommendations ranging from zero protection (Boldrin and Levine, 2013) to infinite protection (Gilbert and Shapiro, 1990).&lt;/p&gt;
&lt;p&gt;Effective patent term: The duration of legally enforceable monopoly granted by a patent, equal to 17 years after grant under the pre-TRIPs U.S. regime and 20 years after application minus the pending period under the post-TRIPs regime. Because enforcement begins only at grant, the pending period directly erodes effective protection.&lt;/p&gt;
&lt;p&gt;Patent term reduction probability (PL_j): The field-specific fraction of pre-TRIPs patents with a pending period exceeding three years, representing the probability that individual patent applications in that field obtain a net reduction in patent term under the new 20-years-from-filing rule.&lt;/p&gt;
&lt;p&gt;News effect: The incremental change in innovation or R&amp;amp;D at the time of policy announcement, induced by future anticipated changes in patent term, before the new policy enters into force. In this paper&amp;rsquo;s setting, the news effect is positive: higher reduction probability accelerates patenting as innovators rush to file under the favorable existing regime.&lt;/p&gt;
&lt;p&gt;Direct implementation effect: The component of the post-implementation change in innovation attributable to the patent term change itself, isolated by controlling for field-specific innovation histories (i.e., abstracting from the indirect effects of anticipation-induced knowledge accumulation). It is negative for shorter patent term extensions, with a semi-elasticity of 1.7% per one-month increase.&lt;/p&gt;
&lt;p&gt;Technology disclosure externality: The mechanism by which newly patented knowledge, disclosed through the patent system, enables subsequent inventors to build on prior innovations, generating follow-on inventive activity. In this paper, the transitory news-phase burst in patenting generates a persistent externality, particularly in fields with high backward citation intensity.&lt;/p&gt;
&lt;p&gt;Policy anticipation: The phenomenon whereby forward-looking agents adjust behavior in response to credible news about future policy changes before those changes take effect. In this paper, anticipation induces a pre-implementation acceleration in patenting that temporarily pushes innovation in the opposite direction from the direct long-run effect and generates persistent indirect post-implementation effects through knowledge spillovers.&lt;/p&gt;
&lt;p&gt;Pending period: The time between patent application and grant during which USPTO examines the application and during which full monopoly rights are not enforceable. Field-level heterogeneity in pending periods — arising from differences in examination complexity and USPTO unit congestion — is the source of cross-sectional identification in the DiD design.&lt;/p&gt;</description></item><item><title>Peer Effects and Rank Concerns in the Classroom</title><link>https://macropaperwarehouse.com/papers/peer-effects-and-rank-concerns-in-the-classroom/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/peer-effects-and-rank-concerns-in-the-classroom/</guid><description>&lt;p&gt;This paper investigates the mechanisms behind peer effects in the classroom using exogenous variation in study disruptions generated by the 2010 Maule mega-earthquake in Chile (magnitude 8.8, the seventh-largest ever instrumentally recorded). The central research question is why classroom peers can shape academic achievement — specifically, whether beyond production complementarities and a desire to conform, a desire to compete for classroom rank can drive peer influence on learning.&lt;/p&gt;
&lt;p&gt;The author constructs a novel dataset linking administrative and survey data from Chile&amp;rsquo;s Ministry of Education (SIMCE test scores, GPA, curriculum coverage, and school expenditure records) for two cohorts of roughly 150,000 eighth-grade students — one measured in 2009 before the earthquake, one measured in 2011 roughly 20–22 months after — to newly constructed measures of housing damage. Damage to each student&amp;rsquo;s home is built in three steps: (1) ground-shaking intensity using an established attenuation formula for the 2010 earthquake; (2) seismic vulnerability of each student&amp;rsquo;s home inferred from a latent-class-analysis model trained on census data linking housing construction materials to vulnerability classes; and (3) a combined expected &amp;ldquo;damage ratio&amp;rdquo; (fraction of home that needs to be rebuilt). Identification uses a difference-in-differences strategy that exploits the differential correlation between pre-existing seismic vulnerability and outcomes across the pre- and post-earthquake cohorts, controlling for socioeconomic composition.&lt;/p&gt;
&lt;p&gt;The main findings, holding fixed a student&amp;rsquo;s own earthquake exposure, are as follows. (1) Own home damage reduced test scores by 0.03 standard deviations (SD) per SD increase in damages (a 4.4 percentage-point increase in collapsed home fraction, approximately USD 3,600) and raised self-reported cost of study effort. GPA effects (–0.02 SD) are statistically insignificant. (2) A 1 SD increase in the mean damage among classroom peers raised test scores by 0.05 SD and GPA by 0.04 SD. School expenditure data (available for the 42% of schools in the preferential subsidy program) show schools responded by reallocating funds away from administrative activities toward educational and psychological support, accounting for this positive effect. (3) A 1 SD increase in the within-classroom standard deviation of peer damages lowered test scores and GPA by approximately 0.085 SD on average, but with sharply heterogeneous effects across the prior-achievement distribution: it lowered test scores and GPA of high-prior-achievement students by 0.08–0.11 SD and raised achievement of low-prior-achievement students, without corresponding changes in those students&amp;rsquo; GPA rank. Neither curriculum-coverage data nor school spending data show significant responses to damage dispersion, pointing to peer-to-peer interactions rather than school mediation.&lt;/p&gt;
&lt;p&gt;The null effect on GPA rank despite heterogeneous GPA effects is the pivotal empirical finding motivating the paper&amp;rsquo;s theory. The author argues that high-achieving students reduced effort in response to a less threatening competitive environment while maintaining their classroom standing — consistent with rank concerns driving effort decisions. Direct survey evidence shows a majority of students agreed they like to do better than classmates.&lt;/p&gt;
&lt;p&gt;Motivated by this evidence, the paper introduces a game-of-status model where each student chooses effort to maximize a utility function combining academic achievement and classroom GPA rank, with rank weighted by a preference parameter lambda &amp;gt; 0. The model admits a unique symmetric Bayesian Nash equilibrium. The model rationalizes all four main empirical patterns: positive mean-damage effects (school compensation); heterogeneous dispersion effects (rank competition changes the density of nearby competitors); null dispersion effects on GPA rank (simultaneous equilibrium adjustment preserves rank ordering); and the survey evidence on competitive preferences.&lt;/p&gt;
&lt;p&gt;The study is confined to Chilean public and subsidized private schools in earthquake-affected, non-coastal regions, with outcomes measured at the 8th grade. The pre/post cohort design removes schools that closed or received earthquake evacuees. Findings apply to a context where classroom rank is observable to peers (GPA) and where competitive preferences are prevalent among students.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy and why does it avoid the usual confounds in peer-effects research?
A: The paper uses a difference-in-differences estimator that exploits the differential relationship between pre-existing seismic vulnerability and outcomes across a pre-earthquake cohort (outcomes measured in 2009) and a post-earthquake cohort (outcomes measured in 2011). Because identification relies on variation in peer disruptions rather than in peer characteristics — and because students did not reallocate across classrooms or schools in response to the earthquake in the estimation sample — the strategy avoids the reflection problem and selection confounds that typically plague peer-effects identification. The identifying assumption is that the relationship between seismic vulnerability and outcomes would have been the same across cohorts absent the earthquake.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the identifying assumption?
A: The paper provides three pieces of supporting evidence. First, the fraction of students switching schools or classrooms between grades 7 and 8 is identical across the pre- and post-earthquake cohorts in the estimation sample, indicating no earthquake-induced reallocation. Second, pre-trend tests show precise zero effects of own damage, mean peer damage, and SD of peer damage on lagged (4th-grade) test scores and GPA. Third, placebo tests using students in regions unaffected by the earthquake show no significant differential relationships between seismic vulnerability measures and outcomes across cohorts.&lt;/p&gt;
&lt;p&gt;Q: How was housing damage measured, and why does this matter for identification?
A: Damage is estimated in three steps: ground-shaking intensity at the student&amp;rsquo;s town is calculated from a validated attenuation formula; seismic vulnerability of the home is predicted using a latent-class-analysis model trained on pre-earthquake census housing data and then applied to student records; and the two are combined into a damage ratio (fraction of home to be rebuilt) using structural engineering damage-grade distributions. This constructed measure is not self-reported and is determined by physical and housing-quality factors largely predetermined before the earthquake, which supports exogeneity. Coastal towns are excluded because the accompanying tsunami caused damages not captured by the damage-ratio formula, and results are robust to different definitions of coastal proximity.&lt;/p&gt;
&lt;p&gt;Q: What were the effects of damage to a student&amp;rsquo;s own home on achievement?
A: A 1 SD increase in own home damages (corresponding to a 4.4 percentage-point increase in the collapsed fraction of the home, or roughly USD 3,600) reduced test scores by 0.03 SD. GPA fell by 0.02 SD but this was not statistically significant. Survey data show that own-home damages raised students&amp;rsquo; self-reported cost of study effort, suggesting this effort channel may mediate the achievement effects. These negative effects did not vary significantly across the baseline achievement distribution.&lt;/p&gt;
&lt;p&gt;Q: What were the effects of mean peer damage on own achievement, and what mechanism explains them?
A: A 1 SD increase in mean peer home damage raised own test scores by 0.05 SD and GPA by 0.04 SD. School spending data from SEP-program schools (42% of the sample) show that schools responded to higher average student damage by reallocating expenditures away from administrative activities (recruitment of non-teaching staff, equipment purchases) toward educational support and psychological support activities. This reallocation more than offset potential negative peer-environment effects, generating positive net achievement effects that were approximately uniform across the prior-achievement distribution.&lt;/p&gt;
&lt;p&gt;Q: What were the effects of within-classroom damage dispersion on achievement, and how do they vary across students?
A: A 1 SD increase in the within-classroom standard deviation of peer damages lowered average test scores and GPA by approximately 0.085 SD. These average effects mask sharp heterogeneity: high-prior-achievement students experienced losses of 0.08–0.11 SD in test scores and GPA, while low-prior-achievement students saw gains. For some students the dispersion effect was comparable to or larger than the effect of damage to their own home.&lt;/p&gt;
&lt;p&gt;Q: Why is the null effect of damage dispersion on GPA rank theoretically important?
A: Students with high prior achievement experienced drops in GPA in classrooms with more dispersed damages, but without an accompanying drop in their GPA rank. The paper argues this is inconsistent with students passively absorbing a changed study environment: instead, students appear to have adjusted effort precisely enough to maintain their classroom standing. This equilibrium pattern — GPA changes that leave rank ordering intact — is the paper&amp;rsquo;s key empirical signature of rank-motivated competition as a mechanism for peer influence.&lt;/p&gt;
&lt;p&gt;Q: What direct survey evidence is presented on rank concerns?
A: Survey data from the post-earthquake cohort show that a majority of students agreed with the statement &amp;ldquo;I like to do better than my classmates in school,&amp;rdquo; providing direct evidence that students value classroom rank. Additionally, students with higher initial achievement reported reductions in self-reported ability to engage with course content in classrooms with more dispersed damages, consistent with these students reducing effort when the competitive environment became less threatening to their rank.&lt;/p&gt;
&lt;p&gt;Q: Do schools mediate the damage-dispersion spillovers?
A: The available data on curriculum coverage and school spending do not show statistically significant responses to within-classroom damage dispersion (as distinct from mean damage). Emergency reconstruction funds were also allocated by schools based on overall damage severity, not its within-classroom dispersion. This absence of a detectable school-mediation channel for dispersion effects strengthens the interpretation that the heterogeneous achievement effects of dispersion reflect peer-to-peer interactions rather than differential school responses.&lt;/p&gt;
&lt;p&gt;Q: How does the game-of-status model rationalize the empirical findings?
A: In the model, each student maximizes a utility function over academic achievement and GPA rank, with rank weighted by lambda &amp;gt; 0. Students choose effort simultaneously, and their cost-of-effort type is shaped by prior test scores, socioeconomic characteristics, and earthquake damage. The model admits a unique symmetric Bayesian Nash equilibrium. In this equilibrium: schools&amp;rsquo; compensating inputs in response to mean damage raise achievement uniformly (rationalizing positive mean-damage effects); changes in damage dispersion alter the density of nearby types differently for high- and low-cost-effort students, changing the marginal benefit of exerting effort to overtake competitors (rationalizing heterogeneous GPA effects); and because all students adjust effort simultaneously, the rank ordering is approximately preserved (rationalizing null rank effects).&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism by which damage dispersion produces heterogeneous effort incentives?
A: The key mechanism is that when students derive utility from rank, the marginal benefit of a unit of additional effort depends on how many competitors are &amp;ldquo;nearby&amp;rdquo; in the effort-cost distribution. When dispersion increases, the density of types just below a high-achiever (low-cost-effort student) decreases, reducing the gain from exerting more effort to maintain rank over nearby rivals; high-achievers therefore reduce effort and GPA falls. Conversely, when dispersion increases, low-achievers face a distribution where they can more effectively compete for higher ranks, raising their effort incentive and GPA.&lt;/p&gt;
&lt;p&gt;Q: How does this paper&amp;rsquo;s theory differ from prior theories of peer influence?
A: Prior theories have emphasized two mechanisms: production complementarities (peer ability directly improves own learning) and a desire to conform (students prefer to match their peers&amp;rsquo; effort or achievement). Both rationalize a linear-in-means model that captures only mean peer characteristics. This paper&amp;rsquo;s theory is the first in the peer-effects literature to rationalize why higher-order moments of the peer distribution (specifically dispersion) affect learning, through a competitive rank-concern mechanism that is parsimonious and does not require extensions to production technology or preferences beyond adding rank to the utility function.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the competitive-motive theory?
A: The theory implies that classroom composition policies affecting the dispersion of student ability — such as ability tracking, gifted programs, or reshuffling policies — can have heterogeneous and potentially perverse effects: policies that reduce ability dispersion may concentrate competitive incentives in ways that harm some students while benefiting others. Standard linear-in-means models of peer effects, which capture only mean peer characteristics, would not predict these distributional consequences. The author argues this means the competitive mechanism has been largely unexplored despite its intuitive appeal, and calls for structural estimation and policy analysis in future work.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the empirical findings?
A: The findings apply to 8th-grade students in Chilean public and private subsidized schools located in earthquake-affected, non-coastal regions, with outcomes observed approximately 20–22 months post-earthquake. The sample excludes schools that closed due to the earthquake and schools that received evacuees. The paper notes that while the theory is formulated around an earthquake shock, the competitive-motive mechanism applies whenever the dispersion of students&amp;rsquo; cost-of-effort types changes — including through classroom assignment policies or other shocks — and is not specific to the natural-disaster context.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Damage ratio&lt;/strong&gt;: The fraction of a student&amp;rsquo;s home that needs to be rebuilt, constructed by combining geocoded ground-shaking intensity (via the Astroza et al. attenuation formula for the 2010 Chilean earthquake) with the predicted seismic vulnerability class of the home (derived from a latent-class-analysis model trained on census housing data). Used as the paper&amp;rsquo;s measure of disruption to each student&amp;rsquo;s environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exogenous peer effect&lt;/strong&gt; (in the sense of Manski 1993): The reduced-form impact on a student&amp;rsquo;s outcome of a change in the distribution of an exogenous characteristic — here, earthquake damage — among classroom peers, holding fixed the student&amp;rsquo;s own characteristics. Distinguished in the paper from endogenous peer effects (best-response functions).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rank concern&lt;/strong&gt;: Students&amp;rsquo; utility derived from their position (rank) in the classroom GPA distribution, irrespective of whether that rank is formally rewarded. The paper treats rank concern as a preference parameter (lambda &amp;gt; 0 in the utility function) and identifies it as a mechanism for peer influence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Game-of-status model&lt;/strong&gt;: The paper&amp;rsquo;s theoretical framework, in which students simultaneously choose study effort to maximize utility over own academic achievement and GPA rank. The model admits a unique symmetric Bayesian Nash equilibrium. The central insight is that the density of nearby competitors in the effort-cost distribution determines the marginal benefit of effort, generating heterogeneous incentives when peer cost-of-effort types become more dispersed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effort-cost type&lt;/strong&gt;: Each student&amp;rsquo;s marginal cost of exerting study effort, shaped by prior test scores, socioeconomic characteristics, and earthquake damages to the student&amp;rsquo;s own home. The key primitive of the model that links individual disruptions to equilibrium effort choices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;SEP (Subvencion Escolar Preferencial)&lt;/strong&gt;: Chile&amp;rsquo;s preferential school subsidy program for disadvantaged students, which requires participating schools (42% of the sample) to submit detailed annual spending reports to the Ministry of Education. The paper uses these reports to identify school spending responses to mean and dispersed peer damages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Seismic vulnerability class&lt;/strong&gt;: A classification of a home&amp;rsquo;s resistance to earthquake damage based on its construction materials (exterior walls, roof, floor), assigned using a logistic latent-class-analysis model estimated on census data. Found to align strongly with household socioeconomic status, enabling prediction of housing vulnerability from administrative student records.&lt;/p&gt;</description></item><item><title>Peer Effects and the Gender Gap in Corporate Leadership</title><link>https://macropaperwarehouse.com/papers/peer-effects-and-the-gender-gap-in-corporate-leadership/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/peer-effects-and-the-gender-gap-in-corporate-leadership/</guid><description>&lt;p&gt;This paper investigates whether exposure to a larger share of female peers during an MBA program causally affects the gender gap in senior corporate leadership positions. The research question is motivated by the persistent underrepresentation of women in top management: in S&amp;amp;P 1500 companies, women hold only 6% of CEO positions despite comprising 40% of the workforce.&lt;/p&gt;
&lt;p&gt;The authors merge administrative data from a top-10 U.S. business school (graduating classes 2000–2018, excluding 2009) with public LinkedIn profile data covering full employment histories, firm-level data from multiple sources including InHerSight crowdsourced female-employee ratings, and a 2023–2024 alumni survey of female graduates. Senior management is defined as Vice President, Director, Senior Vice President, or C-level executive, identified from exact job titles in LinkedIn CVs.&lt;/p&gt;
&lt;p&gt;Identification exploits the quasi-random assignment of incoming MBA students to one of eight sections of approximately 60 students each, based on alphabetical order with balance checks on gender, undergraduate institution, and ethnicity. This assignment generates exogenous variation in the share of female section peers (mean 34%, standard deviation 4 percentage points). Randomization tests following Guryan et al. (2009) and Caeyers and Fafchamps (2021) confirm the assignment is as good as random. The estimating equation is a linear-in-means model with class, year, and class-by-year fixed effects interacted with gender, plus individual and section-level controls.&lt;/p&gt;
&lt;p&gt;The paper first documents a baseline gender gap: despite 96% of both male and female MBA graduates entering management within 15 years, women are 24% less likely than men to hold senior management positions. This gap emerges immediately after graduation, persists for at least 15 years, and is partly attributable to lower promotion rates from first-level management (43% of women in first-level management transition to senior management within five years, versus 57% of men).&lt;/p&gt;
&lt;p&gt;The main causal finding is that a 4 percentage point (1 SD) increase in the share of female MBA section peers increases the probability of a woman holding a senior management position by 8.4% (a 3.3 percentage point increase off a 39.1% baseline), equivalent to a 26% reduction in the management gender gap. There is no corresponding effect for men. The effect emerges as early as two years post-graduation, peaks around year seven, and persists through the 15-year horizon.&lt;/p&gt;
&lt;p&gt;The increase is concentrated in female-friendly firms, defined as those with above-median ratings on InHerSight metrics including maternity leave generosity, flexible work schedules, and professional support. Women with more female peers are significantly more likely to transition into female-friendly firms 6 to 10 years after graduation — a period coinciding with prime childbearing years — where they subsequently attain senior management roles. The effect on senior management in female-friendly firms is statistically distinguishable from the null effect in non-female-friendly firms (p-value = 0.03). The results are largest in male-dominated industries (consulting, tech, finance) where women face greater barriers to informal networks.&lt;/p&gt;
&lt;p&gt;A survey of 283 female MBA alumnae (10% response rate) reveals three mechanisms: (i) information sharing, especially gender-specific advice about employer policies and culture; (ii) higher ambitions and self-confidence through role modeling and emotional support; and (iii) increased perceived support from male MBA peers as female section representation rises. Corroborating the information-sharing channel, women with more female peers are more likely to work at the same firms as their female section peers, particularly when those firms are female-friendly.&lt;/p&gt;
&lt;p&gt;A counterfactual exercise shows that reallocating the existing stock of female students so that all sections have at least 34% women would yield 2 to 5 additional female senior managers per graduating class (a 2.4% to 8.4% increase), holding the total number of female students fixed.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline gender gap in senior management among MBA graduates, and how does it evolve over time?
A: Female MBA graduates are 24% less likely than male graduates to hold senior management positions in the 15 years after graduation. The gap emerges immediately after the MBA and persists for at least 15 years without closing. At year 15, 74% of men hold a senior management position compared to 59% of women.&lt;/p&gt;
&lt;p&gt;Q: How is female peer share defined and what is its distribution across sections?
A: Female peer share is the proportion of female students in an individual&amp;rsquo;s assigned MBA section of approximately 60 students, excluding the individual themselves. The average section female share is 34% with a standard deviation of 4 percentage points. The distribution ranges from 19% at the 1st percentile to 45% at the 99th percentile, with the interquartile range spanning approximately 32% to 36%.&lt;/p&gt;
&lt;p&gt;Q: What is the main causal estimate of female peers on women&amp;rsquo;s senior management probability?
A: A 4 percentage point (1 SD) increase in female section peer share increases the probability of a woman holding a senior management position by 8.4% (3.3 percentage points off a 39.1% mean), averaged across the 15 post-MBA years. This translates to a 26% reduction in the management gender gap. There is no statistically significant effect on men.&lt;/p&gt;
&lt;p&gt;Q: When does the effect of female peers emerge and how does it evolve dynamically?
A: The effect on women emerges as early as two years after MBA graduation and grows over time, peaking around seven years post-graduation. The effect is persistent across the 15-year horizon studied. Estimates become less precise toward the end of the sample period as recent cohorts contribute fewer observations.&lt;/p&gt;
&lt;p&gt;Q: How do female-friendly firms mediate the main result?
A: The main effect is entirely concentrated in female-friendly firms (those with above-median InHerSight ratings). The coefficient on female peer share is positive and significant for senior management in female-friendly firms, and statistically indistinguishable from zero in non-female-friendly firms. The difference between the two coefficients is significant at p = 0.03.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism linking female peers to female-friendly firm transitions?
A: Women with more female peers are significantly more likely to be employed at female-friendly firms 6 to 10 years after graduation, a window corresponding to prime childbearing years. This suggests female peers facilitate sorting into supportive firm environments when family-work tradeoffs become most acute. Once at female-friendly firms, women attain senior management positions at higher rates.&lt;/p&gt;
&lt;p&gt;Q: Does the increase in female senior managers reflect easier paths (smaller firms, lower pay, non-P&amp;amp;L roles)?
A: No. The effect is significant for both small (under 500 employees) and large (over 5,000 employees) firms, with no significant effect on the firm size of employment itself. There is no consistent pattern of women being promoted in firms with higher or lower average compensation. The increase in female senior managers includes those with Profit and Loss responsibilities, indicating these are substantive management positions.&lt;/p&gt;
&lt;p&gt;Q: In which industries is the effect largest, and what does this imply?
A: The effect is concentrated in male-dominated industries (consulting, tech, finance), with no significant effect in female-dominated industries (consumer goods, healthcare). The difference between coefficients is significant at the 3% level. Entry rates into male-dominated industries are not significantly affected, suggesting the mechanism is higher promotion rates within these industries rather than differential sorting into them. The authors interpret this as evidence that female MBA networks are most valuable where women face greater barriers to informal workplace networks.&lt;/p&gt;
&lt;p&gt;Q: What does the survey evidence reveal about mechanisms?
A: Among 283 survey respondents (10% response rate), three mechanisms emerge: information sharing about gender-specific employer attributes and policies; raising ambitions and self-confidence through role modeling; and increased perceived support from male MBA peers as section female share rises. Women with more female peers are also more likely to work at the same firms as their female section peers, especially female-friendly ones, consistent with referral and information-sharing channels.&lt;/p&gt;
&lt;p&gt;Q: Does the effect operate through greater attachment to the corporate pipeline (fewer career breaks, higher entry into management)?
A: No. Female peers do not significantly affect employment rates, career break incidence, entry into first-level management positions, or self-employment rates. The results thus reflect higher promotion rates from first-level management into senior management, not changes in pipeline attachment.&lt;/p&gt;
&lt;p&gt;Q: What do the randomization tests show about identification validity?
A: Two randomization tests confirm as-good-as-random assignment. Following Guryan et al. (2009), the section-level leave-out mean female share is not significantly different from zero after controlling for the class-level leave-out mean. Following Caeyers and Fafchamps (2021), after netting out the asymptotic exclusion bias, the female share coefficient is insignificant across all specifications. A simulation test (Bietenbeck 2020) finds no statistically significant difference between the actual and simulated within-class female share distributions.&lt;/p&gt;
&lt;p&gt;Q: What placebo tests are conducted and what do they show?
A: Two placebo tests are run. First, 1,000 random reassignments of students to sections within the same class show the true estimated effect for women lies outside the distribution of placebo effects, while the null effect for men lies within it. Second, estimating the main equation for up to three years before MBA enrollment finds no consistent pre-treatment effect of female share on future female graduates, supporting the identification strategy.&lt;/p&gt;
&lt;p&gt;Q: What is the counterfactual policy exercise and what does it imply?
A: Holding the total number of female students fixed, reallocating them so that all sections contain at least 34% women would yield 2 to 5 additional female senior managers per graduating class (a 2.4% to 8.4% increase). This assumes nonlinearity in the relationship and suggests meaningful gains from rebalancing section composition without increasing overall female enrollment.&lt;/p&gt;
&lt;p&gt;Q: How do the results compare to the Thomas (2021) finding that more male peers raise female MBA earnings?
A: The authors note several differences: Thomas (2021) focuses on starting earnings while this paper studies senior management positions over 15 years; the two studies use different universities and time periods; and this paper employs gender-by-cohort fixed effects to account for time trends in female labor market outcomes. The authors suggest these design and outcome differences explain the divergent findings.&lt;/p&gt;
&lt;p&gt;Section peers: Students assigned to the same MBA section of approximately 60 students who take core classes together and form the primary peer network; sections are assigned quasi-randomly based on alphabetical order with balance adjustments, generating exogenous variation in gender composition.&lt;/p&gt;
&lt;p&gt;Female-friendly firms: Firms with above-median ratings on InHerSight, a crowdsourced platform where female employees rate employers on metrics including maternity leave generosity, flexible work schedules, mentorship programs, and female representation in management; defined in this paper&amp;rsquo;s own terms as firms whose cultures and policies help women balance work-family responsibilities and support career advancement.&lt;/p&gt;
&lt;p&gt;Senior management: Positions defined as Vice President (VP), Director, Senior Vice President (SVP), or C-level executive, identified using keyword matching on exact job titles from LinkedIn CVs; distinguished from first-level management (managers and supervisors) and representing the upper rungs of the corporate management ladder.&lt;/p&gt;
&lt;p&gt;Female share (treatment variable): The proportion of female students among an individual&amp;rsquo;s section peers, excluding the individual themselves (leave-out mean); averaged 34% with a 4 percentage point standard deviation across sections, after residualizing by graduating class.&lt;/p&gt;
&lt;p&gt;Management gender gap: The 24 percentage point (24%) difference in the likelihood of female versus male MBA graduates holding senior management positions within 15 years of graduation; emerges immediately post-MBA and does not close over the observed horizon.&lt;/p&gt;
&lt;p&gt;Information sharing mechanism: The channel through which female MBA peers provide gender-specific advice and information about employer policies, culture, and female-friendliness that is otherwise difficult to observe; evidenced by the co-location of women with more female peers at the same female-friendly firms as their section peers.&lt;/p&gt;
&lt;p&gt;Exclusion bias: The systematic negative correlation between an individual&amp;rsquo;s own characteristic and her leave-out peer mean that arises mechanically when individuals cannot be their own peer under assignment without replacement; addressed via the Caeyers and Fafchamps (2021) correction in randomization tests.&lt;/p&gt;</description></item><item><title>Peer Effects in Consideration and Preferences</title><link>https://macropaperwarehouse.com/papers/peer-effects-in-consideration-and-preferences/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/peer-effects-in-consideration-and-preferences/</guid><description>&lt;p&gt;This paper develops a general nonparametric model of discrete choice in which peers influence agents through two distinct channels: (1) the set of alternatives an agent considers (consideration set effects) and (2) the agent&amp;rsquo;s preferences over those alternatives (preference effects). The framework embeds these peer mechanisms in a continuous-time Markov process where agents revise choices at Poisson alarm-clock rates. A peer is classified as a consideration peer, a preference peer, or both, and the network is encoded as two directed edge sets rather than one.&lt;/p&gt;
&lt;p&gt;The central identification challenge is recovering network structure, consideration probabilities, and preferences simultaneously, without relying on exogenous variation in covariates or the menu of available options. The paper shows this is achievable using time-series variation in the choices made by connected agents. The key insight is that consideration peers who adopt alternative v change the probability that the focal agent considers v — entering only the &amp;ldquo;consideration&amp;rdquo; term of the conditional choice probability (CCP) — while preference peers who adopt alternatives other than v change only the &amp;ldquo;conditional-on-consideration&amp;rdquo; selection probability. These cross-alternative patterns in the CCPs allow the researcher to distinguish the two channels. Once consideration-only peers are isolated, their choices serve as exclusion restrictions that mimic artificial menu variation, enabling nonparametric recovery of preferences.&lt;/p&gt;
&lt;p&gt;Identification proceeds in stages: (i) recover the full reference group of each agent from changes in CCPs; (ii) separate consideration-only peers from preference-affecting peers using cross-order effects across alternatives; (iii) distinguish preference-only peers from consideration-and-preference peers under an exclusion restriction (Assumption 4) requiring that an agent with a dual-channel peer also has at least one single-channel peer; (iv) recover consideration ratios Q(v|n+1)/Q(v|n) and then the full choice rule. The results allow arbitrary heterogeneity across agents and do not require exogenous menu variation or covariate shifters.&lt;/p&gt;
&lt;p&gt;For continuous-time data (Dataset 1), the CCPs and Poisson rates are exactly identified from the observed revision history. For discrete-time panel data (Dataset 2), identification is generic under a mild eigenvalue condition on the transition rate matrix.&lt;/p&gt;
&lt;p&gt;The empirical application studies store-opening decisions by China&amp;rsquo;s two dominant high-end tea chains — Heytea and Nayuki — across prefecture-level cities from their founding through end-2020. By that date, Nayuki had 485 stores in 57 cities and Heytea had 729 stores in 46 cities, in an industry whose total revenue grew from 42.2 to 83.1 billion yuan between 2017 and 2020. Each firm-market pair is modeled as an agent deciding whether to open a new store. The key exclusion restriction is that the cumulative store count of either firm in geographically neighboring markets shifts consideration probabilities but does not enter marginal profitability directly.&lt;/p&gt;
&lt;p&gt;Estimation via maximum likelihood yields four substantive findings: (1) Firms exhibit limited consideration — consideration probabilities for markets with no prior presence by either firm are substantially below one. (2) Stores in neighboring markets significantly raise consideration probabilities for a given market, for both own-firm and rival stores; this peer effect in consideration is described as economically large. (3) Own-market store density raises marginal profitability (density economies) while rival presence lowers it (competitive effects). (4) A full-consideration model that omits the attention stage overestimates the negative competitive effect and underestimates positive density effects.&lt;/p&gt;
&lt;p&gt;Counterfactual simulations show that removing attention constraints (full consideration) accelerates market penetration substantially: firms enter new markets earlier and achieve broader geographic coverage. Removing peer effects in consideration only — while retaining attention constraints — slows the diffusion of store openings across neighboring markets, because peer effects in consideration function as an informational cascade. Limited consideration also reduces competition by delaying rival entry into high-profitability markets, explaining a significant share of the geographic concentration in first- and second-tier cities during the early expansion phase. The paper&amp;rsquo;s scope is limited to settings with repeated, non-durable choices; it does not model forward-looking behavior or multiple equilibria, which the authors note as directions for future research.&lt;/p&gt;
&lt;p&gt;Q: What are the two peer-effect channels in the model, and how do they differ structurally?
A: A consideration peer influences whether an alternative enters the agent&amp;rsquo;s consideration set — specifically, the probability Q_a(v | n) that alternative v is considered is a function of the number n of consideration peers currently adopting v. A preference peer influences the choice rule R_a(v | y, C) — the probability that v is selected conditional on it being in the consideration set. Importantly, the paper models the two channels as affecting logically separate stages of the decision process, so the observed CCP factors into a consideration term and a conditional-selection term that respond to distinct sets of peers.&lt;/p&gt;
&lt;p&gt;Q: Why does the standard identification approach of varying menus fail here, and how does the paper substitute for it?
A: Menu variation requires the researcher to observe the same agent facing different sets of available alternatives, which is unavailable in many empirical settings. The paper replaces exogenous menu variation with endogenous variation generated by consideration-only peers: when a consideration-only peer adopts alternative v, the focal agent&amp;rsquo;s probability of considering v rises, effectively mimicking the removal of other alternatives from her consideration set. This peer-induced variation in consideration is then used to trace out the choice rule R_a over counterfactual menus without any actual menu changes.&lt;/p&gt;
&lt;p&gt;Q: How does the paper separate consideration peers from preference peers in the data?
A: The decomposition exploits an asymmetry in how the two peer types appear in the log-CCP. When a consideration peer switches to alternative v, the term ln Q_a(v | .) changes but the conditional-selection term ln D_a(v | .) remains unchanged, because the agent already considers v. Conversely, when a preference peer adopts an alternative other than v, only the conditional-selection term shifts. The paper formalizes this via cross-order effects of peers across alternatives in the CCPs (Propositions 3.1–3.3) and invokes Assumption 4 — requiring at least one single-channel peer when a dual-channel peer exists — to complete the separation.&lt;/p&gt;
&lt;p&gt;Q: What is Assumption 4 and why is it necessary?
A: Assumption 4 states that if agent a has a peer in N_CR_a (a peer affecting both consideration and preferences), then a also has at least one additional peer affecting only consideration or only preferences. Without this exclusion restriction, the consideration and preference effects of a dual-channel peer are not separately identified from each other; the single-channel peer provides the variation needed to pin down each component separately.&lt;/p&gt;
&lt;p&gt;Q: What does Proposition 2.1 establish and what does it require?
A: Proposition 2.1 establishes existence and uniqueness of an invariant equilibrium distribution mu over choice configurations, with full support. It requires Assumptions 1 (independent consideration), 2(i) (strictly positive consideration probability for every alternative), and 3(i) (strictly positive probability of selecting any non-default alternative from some reachable consideration set). The continuous-time Poisson structure ensures zero probability of simultaneous revisions, which rules out multiple equilibria in the data-generating process.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle discrete-time panel data, where only periodic snapshots of choices are observed?
A: The paper invokes results from Blevins (2017, 2026) to show that the transition rate matrix W of the continuous-time process is generically identified from the discrete-time transition matrix observed at interval Delta, provided the eigenvalues of W do not differ by integer multiples of 2&lt;em&gt;pi&lt;/em&gt;i/Delta. Once W is identified, the CCPs P and Poisson rates lambda_a are recovered. This result is described as generic, meaning it holds except on a measure-zero set of parameter values.&lt;/p&gt;
&lt;p&gt;Q: What data does the empirical application use, and what are the key sample statistics?
A: The application uses city-level store registration data sourced from the National Enterprise Credit Information Publicity System (via CnOpenData, 2021), supplemented by regional statistics from the China City Statistical Yearbook (2016–2021). The sample ends in 2020 to avoid COVID-19 demand shifts. By end-2020, Nayuki had 485 stores across 57 cities and Heytea had 729 stores across 46 cities. The high-end tea industry&amp;rsquo;s total revenue grew from 42.2 to 83.1 billion yuan between 2017 and 2020.&lt;/p&gt;
&lt;p&gt;Q: What is the key exclusion restriction in the empirical specification, and why is it plausible?
A: Stores in geographically neighboring markets (parameterized by distance bins d(m,m&amp;rsquo;)) enter the attention index pi_tilde but are excluded from the marginal profit index pi_bar. The rationale is that nearby store counts are informative signals that draw managerial attention to a market (an informational spillover) but do not directly alter the profitability of operating in that market — profitability depends on local demand, competition within the market, and own firm density, not on activity in adjacent markets. This restriction identifies the consideration-only peer channel.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about biases from ignoring limited consideration?
A: When the two-stage model (consideration + choice) is replaced by a single-stage full-consideration model, the estimated payoff parameters differ substantially. Specifically, the full-consideration model overestimates the negative effect of competition (rival presence in the same market) and underestimates the positive effect of own-store density. The intuition is that correlated entry patterns driven by shared consideration spillovers are misattributed to payoff interactions when the consideration stage is omitted.&lt;/p&gt;
&lt;p&gt;Q: What do the counterfactual simulations show about the role of limited consideration in market dynamics?
A: Three counterfactuals are compared against the baseline. Under full consideration (no attention constraints), market penetration is substantially faster — firms enter new markets earlier and achieve broader geographic coverage. Removing peer effects in consideration while retaining attention constraints slows geographic diffusion because the informational cascade that propagates entry to neighboring markets is eliminated. Limited consideration also reduces competition by delaying rival entry into high-profitability markets; markets with high potential demand remain underserved for longer. Collectively, limited consideration explains a significant portion of the geographic concentration of tea chain stores in first- and second-tier cities during the early expansion period.&lt;/p&gt;
&lt;p&gt;Q: What forms of heterogeneity does the identification allow, and what does it not require?
A: The nonparametric identification results accommodate arbitrary heterogeneity across agents in consideration mechanisms Q_a, choice rules R_a, Poisson revision rates lambda_a, and network positions. The identification requires neither exogenous covariates that shift preferences or consideration, nor variation in the set of available alternatives across observations. It relies solely on time-series variation in the choices made by connected agents, which are endogenous to the model and are themselves identified in the first stage.&lt;/p&gt;
&lt;p&gt;Q: How does the paper model history dependence, and does it change the main identification results?
A: Section 4.1 extends the model to allow consideration probabilities and choice rules to depend on the agent&amp;rsquo;s own choice history h_t in addition to the current configuration y. Proposition 4.1 states that under Assumptions 1–4 applied conditional on both y_{at} and h_t, all identification propositions from Section 3.1 remain valid. The extension also allows consideration probabilities to equal one, enabling nontrivial dynamics in consideration sets driven by past choices.&lt;/p&gt;
&lt;p&gt;Q: How is the unobservable default handled in the empirical application?
A: When the default alternative (e.g., &amp;ldquo;do not open a store&amp;rdquo;) is unobserved, the Poisson revision rate lambda_a cannot be separately identified from the CCPs without normalization. The paper normalizes lambda_a = 1 for each agent in the empirical application, treating the revision opportunity rate as fixed and recovering all remaining primitives under this normalization.&lt;/p&gt;
&lt;p&gt;Consideration set: The subset C of the full menu Y that agent a actually attends to at the moment of revision; formed before the choice rule is applied. Alternative v enters C independently with probability Q_a(v | n), where n is the number of consideration peers currently adopting v. The default alternative is always in the consideration set.&lt;/p&gt;
&lt;p&gt;Conditional choice probability (CCP): P_a(v | y), the ex-ante probability that agent a selects alternative v given choice configuration y; equal to the product of the consideration probability Q_a(v | .) and the conditional-selection probability D_a(v | .), integrated over all possible consideration sets.&lt;/p&gt;
&lt;p&gt;Choice configuration: The vector y = (y_a)_{a in A} recording the current alternative selected by every agent in the network simultaneously; the state variable of the continuous-time Markov process.&lt;/p&gt;
&lt;p&gt;Consideration-only peer: A peer a&amp;rsquo; in N_C_a \ N_R_a whose choices enter the consideration probability Q_a but not the choice rule R_a. Variation in the choices of consideration-only peers serves as an exclusion restriction that mimics artificial menu variation for identifying preferences.&lt;/p&gt;
&lt;p&gt;Preference-only peer: A peer a&amp;rsquo; in N_R_a \ N_C_a whose choices enter the choice rule R_a but not the consideration probability Q_a.&lt;/p&gt;
&lt;p&gt;Cross-order peer effect: The pattern in the CCP by which a consideration peer&amp;rsquo;s adoption of alternative v changes ln P_a(v | .) but not the conditional-selection component, while a preference peer&amp;rsquo;s adoption of a different alternative v&amp;rsquo; changes the conditional-selection component but not the consideration component; this asymmetry is the key to separating the two channels.&lt;/p&gt;
&lt;p&gt;Limited consideration: The situation in which Q_a(v | n) is strictly less than one for at least some alternatives v and peer counts n, so that the agent does not evaluate all available options before choosing; distinct from full rationality in which all alternatives are always considered.&lt;/p&gt;
&lt;p&gt;Mean attention index (pi_tilde): The latent index governing the consideration probability in the empirical specification; it depends on own and rival store counts in the same and neighboring markets and on firm fixed effects, but is excluded from the marginal profit index — constituting the empirical exclusion restriction that separates the consideration and payoff channels.&lt;/p&gt;</description></item><item><title>Permanent Capital Losses after Banking Crises</title><link>https://macropaperwarehouse.com/papers/permanent-capital-losses-after-banking-crises/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/permanent-capital-losses-after-banking-crises/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates two interrelated questions about historical banking crises: (1) whether bank losses during banking crises are primarily temporary or permanent in nature, and (2) whether policy interventions — particularly liquidity-based interventions — are effective at restoring bank capitalization after such crises. The paper positions these questions against a theoretical divide: models stressing temporary price dislocations (binding borrowing constraints, depositor fragility, information frictions) versus models in which crises reflect fundamental and permanent deterioration in the value of bank assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors construct three new historical datasets spanning 46 economies from 1870 to 2019. The first is a country-level panel of annual and monthly bank and nonfinancial equity index total returns, building on Baron, Verner, and Xiong (2021). The second is an individual-bank-level dataset covering the ten largest banks per country across 17 economies (from Jordà, Schularick, and Taylor 2017), containing equity returns, balance sheet quantities, net income decomposed into write-downs and trading income, and equity issuance within ±5-year windows around each crisis. The third is a new database of the monthly starting dates of policy interventions — extraordinary central bank liquidity support, blanket liability guarantees, and government recapitalizations — extending the databases of Laeven and Valencia (2020) and Metrick and Schmelzing (2024).&lt;/p&gt;
&lt;p&gt;Bank equity crises are identified using a real-time, data-driven indicator requiring: (1) a greater than 30% annual decline in the bank equity index and (2) the failure of a top-20 bank within the country. This definition yields 76 bank equity crises, nearly all of which overlap with prior narrative-based chronologies (Reinhart-Rogoff, JST, Laeven-Valencia), and results are robust to all alternative crisis definitions examined.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Permanent losses.&lt;/em&gt; In the year of a bank equity crisis onset, bank equity experiences average abnormal returns of -68 log-points (or -49% in arithmetic terms), while nonfinancial equity falls by -36 log-points (-30%). Over the subsequent five years, bank equity does not earn elevated returns relative to the country&amp;rsquo;s unconditional average — point estimates are consistently negative, and significantly so in years three and four after crisis onset. Bank equity does not recover to its pre-crisis level. By contrast, nonfinancial equity earns cumulative abnormal returns of roughly 30 log-points (35% arithmetic) over five years, recovering to pre-crisis trend, consistent with a discount-rate-driven decline for nonfinancial firms.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Earnings-driven, not discount-rate-driven.&lt;/em&gt; Panel regressions at both the country and individual-bank level show coefficients of roughly 1 to 2 on the relationship between the initial bank equity return in the crisis year and the subsequent five-year change in real dividends and real earnings. The initial equity decline thus predicts a roughly commensurate long-run decline in banks&amp;rsquo; dividends and earnings, inconsistent with the temporary-loss view&amp;rsquo;s prediction of discount-rate-driven declines that should subsequently reverse.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Short-run bounce-backs are modest and transient.&lt;/em&gt; At the monthly frequency, bank equity does rebound modestly from its trough — the bounce-back averages only about 30% of the initial decline, even assuming perfect market timing. This gain partially reverses after approximately twelve months, so cumulative five-year returns remain not elevated above the unconditional average.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Write-downs, not fire sales, drive losses.&lt;/em&gt; Realized book losses in the first year of crisis onset account for only about 30% of market-value losses — contrary to what fire-sale models predict. By year five, cumulative book losses reach roughly 35% of pre-crisis book equity and approximately 100% of market-value losses. Decomposing net income, write-downs track cumulative book losses closely and fully account for market-value losses by year five. Trading losses (from securities sales and asset dispositions) account for only a small share on average, though for banks in the top quartile of securities-to-assets ratios, immediate accounting losses are larger and more trading-loss-driven — consistent with fire-sale dynamics being important specifically for banks with large tradable securities portfolios.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Nonperforming loans confirm the mechanism.&lt;/em&gt; At the country level, larger bank equity declines are associated with higher peak NPL rates in the subsequent five years (adjusted R² of 0.53 excluding two outliers; 0.606 for the 2008-2010 subsample only). No analogous relationship exists for nonfinancial equity returns.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Policy interventions are insufficient.&lt;/em&gt; Liquidity-based interventions (extraordinary central bank support and blanket guarantees) implemented after bank equity crises are followed by an approximately 20% short-run rebound in bank equity, which reverses between months 12 and 36. No large or permanent increase in bank value follows. Government recapitalization programs have historically been small (averaging 24% of pre-crisis book equity and 43% of realized losses), narrow (65% classified as narrow, median of five banks recapitalized), and delayed. Banks cannot self-recapitalize through high post-crisis profitability.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Crisis type matters.&lt;/em&gt; Panic-only crises (banking panics without large bank equity declines, N=85) exhibit very different dynamics: bank equity recovers to pre-crisis levels within five years, dividends fall only temporarily, liquidity interventions produce large and permanent rebounds, and macroeconomic output losses are smaller. In 75% of bank equity crises, the bank equity decline strictly precedes the banking panic, indicating that fundamental weaknesses — not liquidity shocks escalating into solvency problems — are the primary driver. Only 19 cases (25%), labelled &amp;ldquo;mismanaged banking panics&amp;rdquo; (including the U.S. Great Depression), saw the panic precede the equity decline, mostly in the pre-1945 Gold Standard era. Early liquidity intervention is essentially a necessary condition for averting incipient crises, but it is effective only when a steep bank equity decline has not yet occurred.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How do the authors define a &amp;ldquo;bank equity crisis&amp;rdquo; and why does the definition matter for their empirical strategy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A bank equity crisis is defined as the first year when (1) the bank equity index declines by more than 30% in annual excess total returns in any year within the past five years, and (2) a top-20 bank (ranked by assets) fails within the country. This purely data-driven, real-time definition avoids the look-ahead bias inherent in narrative-based chronologies. The authors identify 76 such crises. Results are robust to using Reinhart-Rogoff, JST, Laeven-Valencia, and 30%-decline-only definitions, alleviating concerns that the differential bank versus nonfinancial equity dynamics are mechanical artifacts of the crisis identification approach.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the quantitative magnitude of the initial equity shock to banks versus nonfinancial firms at crisis onset?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the year of a bank equity crisis, the average abnormal cumulative log excess total return is -68 log-points for bank equity and -36 log-points for nonfinancial equity (corresponding to -49% and -30% in arithmetic abnormal returns, respectively). These are relative to the country&amp;rsquo;s unconditional average returns, estimated using country fixed effects in panel regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Do bank stocks earn elevated returns after banking crises, as temporary-loss models predict?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. Over the five years following crisis onset, bank equity point estimates of cumulative abnormal returns are consistently negative, and significantly so at years three and four. Bank equity does not recover to its pre-crisis level at any horizon out to five years (and Figure A.9 extends to ten years with similar conclusions). This pattern holds across advanced and emerging economies, before and after 1945, excluding the Global Financial Crisis, and across a variety of methods for computing abnormal returns. Even for surviving banks — excluding those that failed or exited — the pattern holds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do the earnings and dividend dynamics of banks versus nonfinancial firms differ after crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For banks, both real dividends per share and real earnings per share remain well below their long-term average five years after crisis onset, with no recovery visible by year five. For nonfinancial firms, dividends and earnings decline at crisis onset but rebound, though only slowly through year five. Panel regressions at both the country and individual-bank level find coefficients of approximately 1 to 2 on the relationship between the crisis-year bank equity return and the five-year-ahead change in real dividends and real earnings — indicating a roughly commensurate earnings-driven decline, not a transitory discount-rate shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the magnitude of the short-run bounce-back in bank equity, and does it represent a profit opportunity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Even with perfect knowledge of the crisis trough (which is not available in real time), the rebound in bank equity from trough to peak averages only about 30% of the initial decline. This gain partially reverses within approximately twelve months, so that cumulative five-year abnormal returns remain not elevated above the unconditional average. Trading strategies that account for risk and factor returns (market, value, size, momentum, global equity) yield even lower risk-adjusted returns, strengthening the conclusion that bank equity is not cheap at crisis troughs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do write-downs compare to trading losses in explaining the accounting losses of banks during crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Realized book losses in the first year of crisis onset account for only about 30% of market-value losses. By year five, cumulative book losses reach approximately 35% of pre-crisis book equity and roughly 100% of market-value losses. Decomposing net income, write-downs (revaluations of assets remaining on the balance sheet — loan loss provisions, impairments, goodwill write-downs) track cumulative book losses closely and fully account for market-value losses by year five. Trading losses (realized gains and losses from securities trading and all asset sales) account for only a small share of total losses on average.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Under what conditions do fire sales rather than write-downs dominate the accounting losses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For banks in the top quartile of the ratio of securities to total assets, immediate accounting losses in the first year of crisis onset are substantially larger and driven to a significant extent by trading losses rather than write-downs. The six bank equity crises with the highest securities-to-assets ratios (weighted across banks) all occurred during the 2007-2008 crisis (Belgium, France, Germany, Switzerland, the U.K., and the U.S.), when fire sales of securitized assets were significant. Banks holding mostly loans (bottom quartile of securities-to-assets) show slower-to-materialize book losses driven predominantly by write-downs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do nonperforming loan rates relate to the magnitude of bank equity declines across crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At the country level, more negative unlevered bank equity returns at crisis onset are statistically significantly associated with higher peak NPL rates over the subsequent five years. The adjusted R² for the full available sample is 0.233, rising to 0.533 after excluding two outliers (U.S. 1990, Sweden 1991). For the 2008-2010 crisis episodes only, the adjusted R² is 0.606. No analogous association between NPL rates and nonfinancial equity returns is found, suggesting the mechanism is specific to the banking sector&amp;rsquo;s asset-quality deterioration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Do liquidity-based interventions (central bank support or blanket guarantees) restore bank capitalization after bank equity crises?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. Following the implementation of liquidity-based interventions during bank equity crises, bank equity prices initially continue to decline for about two months, then rise by approximately 20%, but this gain reverses between months 12 and 36. Bank equity values remain persistently low thereafter. This is inconsistent with models in which forceful lender-of-last-resort interventions accomplish the same result as direct recapitalizations. The authors caution that interventions are not randomly assigned — deeper crises may receive stronger interventions — so the analysis cannot identify counterfactual outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the historical characteristics of government recapitalization programs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Based on a new database covering all government recapitalization programs across 17 economies since 1870, recapitalizations have historically been small (averaging 24% of pre-crisis book equity and 43% of realized market-value losses), narrow (65% classified as narrow, with a median of five banks recapitalized), and delayed. Total equity issuance (government and private combined) is only a small fraction of realized losses. Government-funded issuance accounts for about one-fourth of total bank equity issuance. The U.S. TARP after 2008 was unusual in being both broad (over 700 banks) and timely (about one month after the Lehman collapse). Japan&amp;rsquo;s crisis of the 1990s is a prominent example of extreme delay, with the first recapitalization program implemented in March 1999, nearly a decade after the real estate collapse began.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How do &amp;ldquo;panic-only crises&amp;rdquo; differ from bank equity crises in terms of equity dynamics and policy effectiveness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Panic-only crises (N=85) are banking panics without a 30% bank equity decline. They feature significant initial negative returns followed by elevated bank equity returns that bring valuations back to pre-crisis levels within five years. Dividends fall only temporarily. Liquidity interventions during panic-only crises produce a full rebound in bank equity in the month of intervention, contrasting sharply with the modest and transient response observed in bank equity crises. Panic-only crises are also associated with shallower real GDP declines and smaller bank credit contractions than bank equity crises.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: In what fraction of bank equity crises does the bank equity decline precede the banking panic, and what does this imply about the root cause?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In 57 of the 76 bank equity crises (75%), the bank equity decline strictly precedes the emergence of the banking panic. This timing implies that most bank equity crises are not liquidity shocks that evolved into solvency problems — rather, fundamental weaknesses in the banking system are already present at the early stages of the crisis. Only 19 cases (25%), called &amp;ldquo;mismanaged banking panics,&amp;rdquo; saw the panic precede the equity decline; these occurred predominantly in the pre-1945 period, often in countries on the Gold Standard with limited central bank capacity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: Under what conditions can early liquidity interventions avert an incipient banking crisis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Of 183 episodes of incipient liquidity shocks in which a prior 30% bank equity decline had not yet occurred, 126 received early liquidity interventions, of which 92 were successfully averted (approximately 50% of the original 183 episodes). The two strongest predictors of a successfully averted crisis — essentially necessary conditions — are: (1) the pre-panic bank equity decline remains below 30%, and (2) liquidity intervention occurs within one month of the panic. War outbreak and single-bank focus of the run are additional factors that substantially increase the probability of aversion. Combining the small-equity-decline and early-intervention conditions predicts averted panics with a true-positive rate of 99% (91/92), though with a 24% false-positive rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: Does cross-sectional heterogeneity at the bank level confirm the permanent-loss interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Sorting the ten largest banks by country into five bins by market-to-book (M/B) ratio at crisis onset shows monotonic relationships with five-year outcomes. The most distressed banks (M/B below 0.2) experience reduced credit growth of 26 percentage points and reduced income-to-book-equity of 87 percentage points (both cumulative over five years) relative to the healthiest banks (M/B above 0.8). The M/B ratio at crisis onset is persistently low in subsequent years, because market values crash permanently while book values are sticky (slow write-down recognition). These results hold with crisis fixed effects, meaning the patterns reflect within-crisis cross-sectional variation, not merely crisis-level heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: Do crises preceded by credit booms have worse post-crisis outcomes for banks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Crises preceded by above-median growth in the credit-to-GDP ratio (from pre-crisis trough to peak) are associated with an additional 60 log-point abnormal decline in bank equity excess total returns occurring around year three after crisis onset, persisting through year five. By contrast, crises not preceded by credit booms earn bank equity returns similar to the country&amp;rsquo;s unconditional average after the initial decline. This supports the hypothesis that credit-boom-driven crises involve unexpected future deterioration in asset quality, possibly linked to persistently negative housing returns (which do not recover to pre-crisis levels within five years after banking crises).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Bank equity crisis (paper-specific definition):&lt;/strong&gt; An episode identified in real time when two criteria are jointly met for the first time: (1) the bank equity index declines by more than 30% in annual excess total returns within any year of the past five years, and (2) a top-20 bank (ranked by total assets within the country) fails. This definition is purely data-driven and does not require any look-ahead information. It produces 76 crises across 46 economies from 1870 to 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Permanent-loss view:&lt;/strong&gt; The theoretical interpretation that banking crises primarily reflect fundamental, lasting deterioration in the value of bank assets — arising either from fire sales that permanently destroy value or (more commonly in the authors&amp;rsquo; evidence) from deterioration in asset quality (rising nonperforming loans, loan impairments). Under this view, bank equity declines are earnings-driven rather than discount-rate-driven and do not reverse even after funding and market liquidity are restored.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Temporary-loss view:&lt;/strong&gt; The theoretical interpretation that bank losses during crises are primarily due to temporary price dislocations — assets held by financial intermediaries trade at sharp discounts due to binding borrowing constraints or depositor fragility, but recover their fundamental value once central banks provide liquidity support. Under this view, bank equity should earn elevated future returns after crises, and forceful liquidity interventions should be equivalent to direct recapitalizations in restoring bank value.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Write-downs (paper-specific definition):&lt;/strong&gt; Revaluations of assets that remain on the balance sheet, reflecting expected future reductions in cash flows. They include loan loss provisions, additions to loan loss reserves, write-downs of fixed assets, and goodwill impairments. Distinguished from trading income (realized gains and losses from securities trading and all asset dispositions). Write-downs are subject to accounting discretion and are recognized slowly over multiple years after crisis onset, while equity markets price in expected total losses rapidly at crisis onset.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trading income (paper-specific definition):&lt;/strong&gt; Realized gains and losses from securities trading and all asset sales, including sales of real estate, loans, and subsidiary divisions. Unlike write-downs, trading losses must be recognized immediately (they are realized transactions), so large trading losses at crisis onset would be evidence consistent with fire-sale dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Panic-only crises:&lt;/strong&gt; Banking panics (sustained bank runs or depositor withdrawals) that do not coincide with a greater-than-30% bank equity decline. Identified as N=85 in the full sample. These episodes are characterized by temporary equity declines, full recovery within five years, large positive responses to liquidity interventions, and smaller macroeconomic output losses — consistent with the temporary-loss view.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mismanaged banking panics:&lt;/strong&gt; The minority of bank equity crises (19 cases, 25%) in which the banking panic occurred first or concurrently with the 30% bank equity decline, rather than the equity decline preceding the panic. Concentrated in the pre-1945 period, often in Gold Standard countries with limited central bank flexibility. The U.S. Great Depression is the prominent example.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Averted crisis:&lt;/strong&gt; An incipient liquidity shock to the banking sector that fully recedes within two months without any bank failures or 30% bank equity declines. Empirically, all averted crises in the sample had not yet experienced a 30% bank equity decline and all received early liquidity interventions (within one month of the incipient panic onset).&lt;/p&gt;</description></item><item><title>Pigovian Transport Pricing in Practice</title><link>https://macropaperwarehouse.com/papers/pigovian-transport-pricing-in-practice/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/pigovian-transport-pricing-in-practice/</guid><description>&lt;p&gt;This paper reports on the MOBIS experiment, a large-scale randomized controlled trial (RCT) implementing a multi-modal Pigovian transport pricing scheme in urban areas of German- and French-speaking Switzerland. The central research question is whether a first-best transport pricing scheme — one that charges users the full marginal external costs of their travel choices, varying across time, space, and mode — generates meaningful behavioral responses, and how those responses compare to a pure information intervention.&lt;/p&gt;
&lt;p&gt;The study recruited participants from urban areas, requiring them to be between 18 and 65 years old and to use a car at least two days per week. After contacting over 90,000 individuals and an initial online screening of 21,800 respondents, 3,656 participants completed the RCT. Each participant agreed to have their daily travel tracked via a smartphone app (&amp;ldquo;Catch-My-Day&amp;rdquo;) for eight weeks: four weeks of observation followed by four weeks of treatment. Assignment to treatment and control groups was fully randomized without stratification.&lt;/p&gt;
&lt;p&gt;The pricing treatment gave participants a budget equal to their observed external costs during the observation period plus a 20% buffer, from which the external costs of their actual travel were deducted in real time; any remaining balance was theirs to keep. External costs were computed across all modes using official Swiss Federal Roads Office monetization factors, including congestion (via a MATSim-based average marginal cost approach), CO2 climate costs (CHF 136.08/ton), health costs from air pollution (PM10 and NOx), and accident and physical activity effects for active and public modes. Public transport also carried a peak-hour surcharge of CHF 0.10/km for congested zone-pairs. A second &amp;ldquo;information-only&amp;rdquo; treatment provided identical information about external costs but imposed no financial charge. A control group received only weekly summaries of kilometers traveled by mode.&lt;/p&gt;
&lt;p&gt;The regression framework is a difference-in-differences specification with person, calendar-day, and day-of-study fixed effects, estimated in levels for external-cost outcomes (due to negative values from walking&amp;rsquo;s net external benefit) and via Poisson Pseudo-Maximum Likelihood for non-negative outcomes.&lt;/p&gt;
&lt;p&gt;The pricing treatment reduced total external costs by CHF 0.215 per day (p &amp;lt; 0.01), a 5.1% reduction relative to the control group. The average private cost of transport for the control group during the treatment period was CHF 25.72 per day; the external cost was CHF 4.22 per day, implying that Pigovian pricing raised total transport costs by 16.4% on average. The implied price elasticity of external costs with respect to this price increase is -0.31. The reduction is attributable to mode substitution toward public transport and active modes and to departure time shifting away from peak hours, but not to a reduction in total distance traveled.&lt;/p&gt;
&lt;p&gt;The information-only treatment produced a coefficient of -0.087, which is not statistically significant at conventional levels for the full sample. The differential effect of adding pricing to information is -0.127 (marginally significant, p &amp;lt; 0.1), with the pricing increment particularly important for reducing congestion costs. Sensitivity analysis shows that removing the control group and time fixed effects inflates the before-vs.-after elasticity to between -0.57 and -0.71, substantially larger than the preferred estimate of -0.31, underscoring the importance of the experimental design.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis reveals that men respond more strongly than women, German speakers more than French speakers, participants under 30 more than older participants, and those with above-median altruistic values respond significantly even to information alone. Correct knowledge of the definition of external costs (present in 45% of the sample) is a key driver of the pricing treatment effect. These scope conditions — mode availability, urban Swiss context, short 4-week treatment window, mandatory car use eligibility, and the specific external cost monetization framework — bound the generalizability of the elasticity estimate.&lt;/p&gt;
&lt;p&gt;Q: What is the main treatment effect of the Pigovian pricing scheme on external transport costs?
A: The pricing treatment reduced total external costs by CHF 0.215 per day, which is a 5.1% reduction relative to the control group (p &amp;lt; 0.01). About half of the reduction came from health costs, with congestion and climate costs following in magnitude. The implied elasticity of external costs with respect to the Pigovian price increase is -0.31, meaning a 10% increase in total transport costs from Pigovian pricing would reduce external costs by approximately 3.1% in the short run.&lt;/p&gt;
&lt;p&gt;Q: How was the Pigovian price increase calculated, and what was its magnitude relative to private costs?
A: The average private cost of transport for the control group during the treatment period was CHF 25.72 per day, and the average external cost was CHF 4.22 per day. The external cost thus represents 16.4% of total (private plus external) transport costs, and dividing the 5.1% reduction in external costs by this 16.4% price increase yields the elasticity of -0.31.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms drove the reduction in external costs?
A: The reduction resulted from a combination of mode substitution — a shift away from car use toward public transport and active modes — and departure time shifting away from peak hours. Critically, total distance traveled did not decline; the behavioral adjustment operated entirely through changes in how and when people traveled, not in how much.&lt;/p&gt;
&lt;p&gt;Q: What was the effect of the information-only treatment?
A: The information-only treatment produced a coefficient of -0.087 CHF per day, which was not statistically significant at conventional levels for the full sample. It was statistically significant only for subgroups, notably participants with above-median altruistic values. The differential effect of adding pricing to information (alpha_P minus alpha_I = -0.127) was marginally significant (p &amp;lt; 0.1) and was particularly concentrated in congestion cost reductions, suggesting that the monetary incentive is especially important for internalizing the congestion externality.&lt;/p&gt;
&lt;p&gt;Q: Why is the control group critical, and how does removing it affect the estimated elasticity?
A: The tracking data show a seasonal negative trend in external costs over the study period; without a control group, this trend would be incorrectly attributed to the treatment, inflating the estimated effect. When both day-of-study and calendar-day fixed effects are removed (approximating a before-vs.-after design without a control group), the estimated elasticity rises to between -0.57 and -0.71, roughly double the preferred estimate of -0.31. This highlights that most prior studies in the literature, which lack control groups, are likely to overestimate treatment effects.&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is observed in the treatment response?
A: Men respond more strongly than women to both treatments, with the gender gap particularly pronounced for congestion costs. German speakers respond more strongly than French speakers. Participants under age 30 show stronger responses than older participants. Those scoring above the median on an altruistic values index respond significantly not only to pricing but also to information alone. Participants who correctly defined external costs (45% of the sample) drive the pricing treatment effect; a causal forest analysis confirms knowledge of external costs, age below 30, and language region as key heterogeneity drivers.&lt;/p&gt;
&lt;p&gt;Q: How were external costs computed across modes, and what are the key monetization parameters?
A: For private road transport, GPS tracks were map-matched using Graphhopper and processed via MATSim modules; emission factors came from the HBEFA 3.3 database, and congestion was assessed via an average marginal cost approach incorporating spillback effects. Externalities were monetized at CHF 136.08/ton for CO2, CHF 515,497–1,358,461/ton for PM10 (rural vs. urban), CHF 7,109/ton for NOx (regional), and a value of travel time savings of CHF 25.77/hour. For other modes, per-km values from the Swiss Federal Roads Office were applied. Walking carries net external benefits (negative external costs), while cycling carries small net external costs because accident costs exceed physical activity benefits.&lt;/p&gt;
&lt;p&gt;Q: How was public transport priced in the experiment, and why was it simplified?
A: A second-best zonal peak-hour surcharge of CHF 0.10/km was applied to public transport stages between zone-pairs experiencing peak demand, with peak windows set at 7–9 am and 5–7 pm. Full first-best pricing of public transport crowding was deemed infeasible because crowding effects are highly heterogeneous spatially and temporally, often concentrated in very short windows on specific lines, making aggregate distribution unreasonable.&lt;/p&gt;
&lt;p&gt;Q: Was there evidence of gaming the mode detection system?
A: Because participants could manually correct the app&amp;rsquo;s algorithmic mode assignments — and the pricing group had an incentive to overclaim low-cost modes — the potential for strategic misreporting was examined. While the analysis could not rule out some gaming, the main results were shown to be robust to excluding potential gamers, suggesting that gaming did not materially distort the treatment effect estimates.&lt;/p&gt;
&lt;p&gt;Q: What does the study imply for transport pricing policy?
A: The elasticity of -0.31 provides a benchmark for policymakers: a full Pigovian pricing scheme that raises total transport costs by about 16% can be expected to reduce external costs by about 5% in the short run in an urban context. The finding that congestion costs respond more to pricing than to information alone suggests the monetary component is essential for this externality. Heterogeneous responses — particularly the weaker responses by women and French speakers — have distributional implications. The experiment is a proof of concept that first-best transport pricing can generate meaningful behavioral responses, but scaling it would require addressing privacy concerns from GPS tracking, technical infrastructure, and political economy challenges.&lt;/p&gt;
&lt;p&gt;Pigovian transport pricing: A pricing scheme that charges each user the marginal external costs of their transport choices — including health, climate, congestion, and noise costs — as they vary across time, space, and mode, intended to internalize the gap between private and social costs of travel.&lt;/p&gt;
&lt;p&gt;External costs of transport: Costs borne by society rather than the individual traveler, including congestion (delay imposed on others), climate damages (CO2 emissions), health costs (local air pollution, accidents), and noise; in this paper, computed in real time from tracked trips using official Swiss monetization values.&lt;/p&gt;
&lt;p&gt;Average treatment effect (ATE): The difference-in-differences estimate of the causal effect of the pricing or information treatment on outcomes, identified from the randomized assignment and controlling for person, calendar-day, and day-of-study fixed effects.&lt;/p&gt;
&lt;p&gt;Mode substitution: The behavioral response in which travelers shift from higher-external-cost modes (primarily car) to lower-external-cost modes (public transport, walking, cycling) in response to pricing, as distinct from reducing total travel distance.&lt;/p&gt;
&lt;p&gt;Departure time shifting: The behavioral response in which travelers adjust when they depart to avoid peak-hour congestion surcharges, contributing to reduced congestion externalities without reducing total distance traveled.&lt;/p&gt;
&lt;p&gt;Information-only treatment: An experimental arm receiving identical information about external costs as the pricing group but facing no financial charge, used to isolate the informational component of the pricing treatment from the monetary incentive component.&lt;/p&gt;
&lt;p&gt;Source text origin: pdf&lt;/p&gt;</description></item><item><title>Place-Based Redistribution</title><link>https://macropaperwarehouse.com/papers/place-based-redistribution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/place-based-redistribution/</guid><description>&lt;h2 id="place-based-redistribution-overview"&gt;Place-Based Redistribution: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Should national governments redistribute income to residents of poor areas through place-based transfers, or should redistribution rely solely on place-blind (income-only) taxes? The longstanding view in urban economics—&amp;ldquo;help poor people, not poor places&amp;rdquo;—holds that place-based aid is inefficient because it channels activity to less productive locations. This paper challenges that view by formalizing the conditions under which place-based redistribution improves on purely income-based transfers, using tools from optimal tax theory embedded in a spatial equilibrium model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a two-location model (&amp;ldquo;Distressed&amp;rdquo; and &amp;ldquo;Elsewhere&amp;rdquo;) with a unit mass of heterogeneous households who differ in skill level (θ) and idiosyncratic preference for living in Distressed (φ). Households choose where to live and how much to earn, facing competitive labor and housing markets in each location. Locations may differ in amenity levels, wage schedules (which may embody skill-specific comparative advantage), and housing costs. A utilitarian planner sets location-specific income tax schedules—observed earnings and location are the only signals of unobserved skill—maximizing a weighted average of household utilities and landlord profits subject to a budget constraint.&lt;/p&gt;
&lt;p&gt;The paper proceeds in three steps. First, it derives closed-form conditions for the optimality of a lump-sum place-based transfer under a fixed income tax. Second, it characterizes fully general optimal nonlinear, location-specific marginal tax rate (MTR) schedules (Proposition 2). Third, it calibrates the model numerically, anchoring to the U.S. Empowerment Zone (EZ) program.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three Sorting Mechanisms and Their Policy Implications&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper identifies three polar mechanisms that generate sorting of lower-skill households into Distressed:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;em&gt;Skill-taste correlation&lt;/em&gt;: higher-skill households have stronger tastes for Elsewhere, independent of wages or rents.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Comparative advantage&lt;/em&gt;: higher-skill workers are relatively more productive in Elsewhere.&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Income-based sorting&lt;/em&gt;: because Elsewhere is more expensive, lower-income households are priced into Distressed.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Under skill-taste correlation, place-based transfers to Distressed are unambiguously welfare-improving even when income taxes are already optimal, because high-skill households prefer Elsewhere for reasons that are orthogonal to income. Under comparative advantage, the direction of the optimal transfer depends on migration elasticities: low migration elasticities favor transfers to Distressed, while high migration elasticities can reverse the sign. Under pure income-based sorting (with homogeneous locational preferences), the conditions for superfluous commodity taxation (Atkinson-Stiglitz 1976) are satisfied, and optimal place-based transfers are zero—though idiosyncratic preference heterogeneity restores non-zero optimal transfers even in this case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Numerical simulations use Census data and ACS moments calibrated to EZ areas. With high migration responsiveness (κ = 0.5, approximating urban EZs) and skill-taste correlation as the sole sorting driver, the optimal average place-based transfer to Distressed is &lt;strong&gt;$4,805&lt;/strong&gt;, with about 40% ($1,943) arising from lower MTRs rather than a higher demogrant. With low migration responsiveness (κ = 4, approximating rural EZs), the optimal transfer more than doubles to &lt;strong&gt;$10,918&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;When comparative advantage alone drives sorting and migration is low (κ = 4), the optimal transfer to Distressed is &lt;strong&gt;$7,091&lt;/strong&gt;, with a $3,740 larger demogrant. With high migration and comparative advantage, the transfer reverses to &lt;strong&gt;−$2,763&lt;/strong&gt; (i.e., Elsewhere receives the subsidy). For intermediate migration under comparative advantage (e.g., κ ≈ 1), the optimal policy is nonlinear: the poorest Distressed residents receive a place-based transfer of &lt;strong&gt;$1,254&lt;/strong&gt;, while high-skill Distressed residents face a place-based tax of &lt;strong&gt;$12,398&lt;/strong&gt; at the 99th percentile.&lt;/p&gt;
&lt;p&gt;In the empirically calibrated &lt;strong&gt;urban EZ baseline&lt;/strong&gt; (migration elasticity 0.82, rent ratio 0.86, sorting driven by skill-taste correlation and income effects), the optimal average place-based transfer is &lt;strong&gt;$3,143&lt;/strong&gt;, roughly matching the magnitude of actual EZ wage tax credits (~$3,000 for full-time eligible workers). The demogrant advantage for Distressed is &lt;strong&gt;$1,462&lt;/strong&gt;, with just over half of the transfer arising from lower MTRs.&lt;/p&gt;
&lt;p&gt;In the &lt;strong&gt;rural EZ baseline&lt;/strong&gt; (migration elasticity 0.20, rent ratio 0.54, comparative advantage and income effects), the optimal average transfer rises to &lt;strong&gt;$4,329&lt;/strong&gt;, concentrated in lower MTRs rather than a larger demogrant. Halving the migration elasticity from the rural baseline raises the optimal transfer to &lt;strong&gt;$6,906&lt;/strong&gt;, while doubling it reduces the transfer to near zero (&lt;strong&gt;$573&lt;/strong&gt;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All results are derived under the assumption of &lt;em&gt;no market failures&lt;/em&gt;; the model deliberately excludes agglomeration spillovers or other Pigouvian motives, attributing the case for place-based redistribution purely to redistributive goals.&lt;/li&gt;
&lt;li&gt;The planner observes only earnings and location, not skill type directly.&lt;/li&gt;
&lt;li&gt;Household Pareto weights are set equal to one across types in the simulations, so redistribution is driven solely by diminishing marginal utility of consumption.&lt;/li&gt;
&lt;li&gt;The model abstracts from interactions with subnational governments, local public services, and endogenous amenities.&lt;/li&gt;
&lt;li&gt;Results on the desirability of transfers to Distressed hinge critically on the motive for sorting, not simply on the existence of spatial income inequality.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the equity-efficiency tradeoff formula for a lump-sum place-based transfer, and what does it reveal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Lemma 1 shows that the first-order welfare effect of a small per-capita transfer from Elsewhere to Distressed starting from a place-blind tax system is dSWF/dt = (λ̄₁ − λ̄₀) + Eθ{m(0)·[T(z₁*) − T(z₀*)]}. The equity gain (λ̄₁ − λ̄₀) is positive when Distressed households have higher average social marginal welfare weights, which holds when their skill distribution is first-order stochastically dominated by Elsewhere&amp;rsquo;s. The fiscal cost equals the earnings-tax-revenue loss from movers: households induced to migrate to Distressed who earn less there generate lower tax payments. This formula identifies the earnings response to migration as a sufficient statistic for the efficiency cost of place-based policy.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Q2: What characterizes the optimal lump-sum transfer t&lt;/em&gt; in Proposition 1?&lt;/em&gt;*&lt;/p&gt;
&lt;p&gt;Proposition 1 shows t* = [λ̄₁(t*) − λ̄₀(t*) + Eθ{m(t*)·[T(z₁*) − T(z₀*)]}] / (Eθ[m(t*)] / [L₀(t*)L₁(t*)]). The optimal transfer is larger when (i) the average social marginal welfare weight gap between Distressed and Elsewhere is greater, (ii) migration responses m(t*) are small, and (iii) the earnings difference between locations for marginal movers is small. This formula holds regardless of whether the income tax schedule T(·) is itself set optimally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Under skill-taste correlation, why are place-based transfers always welfare-improving even under an optimal income tax?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When sorting is driven by skill-taste correlation (high-skill households have stronger preferences for Elsewhere despite identical wages and rents), the equity gain λ̄₁ − λ̄₀ is positive because low-skill households concentrate in Distressed. A small positive transfer starting from t = 0 also incurs zero fiscal cost because movers between locations face identical wages and do not change their earnings. Thus, welfare unambiguously increases. The key insight is that skill-taste correlation violates the Atkinson-Stiglitz condition: high earners would still prefer Elsewhere even if forced to earn less, so location serves as a proxy for skill not captured by income taxes alone.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Under comparative advantage, why can the sign of the optimal transfer reverse with migration elasticity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When higher-skill workers are more productive in Elsewhere, movers to Distressed experience wage and earnings reductions, generating a fiscal externality. When migration elasticities are high (low κ), this fiscal cost is large and can dominate the equity gain, making transfers to Elsewhere optimal (simulated optimal transfer of −$2,763 at κ = 0.5). When migration elasticities are low (high κ), the fiscal cost is small and equity considerations dominate, yielding transfers to Distressed ($7,091 at κ = 4). At intermediate elasticities, the optimal policy is nonlinear, redistributing to poor Distressed residents while taxing rich Distressed residents more.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why are place-based transfers superfluous under pure income-based sorting with homogeneous locational preferences?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Example 6 (and its formal proof in Appendix B.3.5) demonstrates that when sorting arises solely from higher rents in Elsewhere and preferences over location are homogeneous (no idiosyncratic φ heterogeneity), the Atkinson-Stiglitz sufficient condition for commodity tax superfluousness is met: hypothetically forcing high earners to earn less would not change their preferred consumption bundle relative to low earners. Hence a place-blind income tax implements optimal redistribution without spatial supplements. As the variance of idiosyncratic location preferences κ shrinks toward zero, Figure 3 confirms that optimal place-based transfers tend toward zero across all three sorting motives.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What new terms appear in the optimal location-specific MTR formulas (Proposition 2) relative to a standalone-economy optimum?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The optimal MTR schedules in Proposition 2 contain two new terms beyond the standard Mirrlees (1971)/Saez (2001) formula. The term Δτ+(θ) captures the fiscal externality from migration: raising Elsewhere&amp;rsquo;s MTR at skill level θ and above induces movers to Distressed who change their tax revenue by T₁(z₁*(s)) − T₀(z₀*(s)). The term (λ_L − 1)Δr+(θ) captures the equilibrium rent effect: MTR changes shift households between locations, altering rents in both communities and redistributing between renters and landlords. When λ_L &amp;lt; 1 (landlords are weighted less than average households), the rent term creates additional motives for spatial redistribution depending on the ratio of rents to housing supply elasticities across locations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How do housing supply elasticities affect the optimal spatial transfer, and why does the sign differ between urban and rural settings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The rent redistribution term Δr+(θ) has sign determined by r₁/ϱ₁ − r₀/ϱ₀. For urban EZs, where Distressed has lower rents but also lower housing supply elasticity than Elsewhere (ϱ₁ = 0.24, ϱ₀ = 0.34 in the baseline), this ratio is positive, meaning transfers to Distressed shift households into relatively inelastic markets, raising rents there and generating landlord income. When λ_L &amp;lt; 1, this reduces the desirability of transfers to Distressed. For rural EZs, Distressed has higher housing supply elasticity (ϱ₁ = 0.60), so the ratio is negative: transfers shift households to more elastic markets where rents rise minimally. When λ_L &amp;lt; 1, this actually motivates more transfers to rural Distressed areas. In the 75%-landlord-weight sensitivity, optimal urban transfers fall by ~$1,000 while rural transfers rise by ~$1,000, illustrating this asymmetry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What does the urban EZ baseline calibration find about optimal transfers and how does it compare to actual EZ policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The urban baseline targets a migration elasticity of 0.82 (from Busso et al. 2013), a Distressed-to-Elsewhere rent ratio of 0.86, and 56% of Distressed residents earning under $50,000. The calibrated κ is 0.44. At the optimum, Distressed residents receive an average place-based transfer of $3,143, with $1,462 as a higher demogrant and the remainder from lower MTRs. By comparison, actual EZs provide a wage tax credit of approximately $3,000 per eligible full-time worker. The paper concludes that the magnitude—but not the capped, flat structure—of EZ transfers approximates the optimal level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the rural EZ calibration find, and how sensitive are results to migration assumptions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The rural baseline targets a migration elasticity of 0.20 (from Sprung-Keyser et al. 2022), a rent ratio of 0.54, and 60% of Distressed residents earning under $50,000, with sorting attributed to comparative advantage and income effects. The calibrated κ is 4.06. The optimal average transfer is $4,329, primarily arising from lower MTRs rather than a higher demogrant ($532). Doubling the migration elasticity reduces the optimal transfer to near zero ($573); halving it raises it to $6,906. The direction and magnitude of optimal transfers are therefore highly sensitive to the assumed level of migration responsiveness, highlighting the empirical importance of estimating migration elasticities—particularly heterogeneity in migration by income level and earnings changes for marginal movers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Do within-income transfers arising from differences in marital and parental status across communities effectively constitute place-based redistribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Online Appendix A investigates this by estimating the implicit place-based transfer induced by marital and parental status differences between EZ communities and the rest of the country. Using ACS tract-level data merged with Piketty-Saez-Zucman distributional national accounts (DINA), the authors find that marital status and parental status have offsetting effects: marital status raises taxes on single households (common in Distressed), while parental status increases transfers to households with children (also common in Distressed). Across all preferred CPS-adjusted estimates, net within-earnings transfers are below $1,000 in magnitude, and the two factors essentially cancel. The authors conclude that marital and parental status differences do not yield substantial de facto place-based redistribution within income levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What does the MTR decomposition (Table 3) reveal about why sorting motives generate different MTR patterns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The decomposition separates the optimal MTR into a within-community component (standard equity-efficiency tradeoff) and a between-community component (fiscal externality from migration). Under skill-taste correlation with high migration (κ = 0.5), both components contribute positively to the Distressed MTR (0.246 within + 0.234 between = 0.479), yielding lower MTRs in Distressed (0.479) than in Elsewhere (0.510). Under comparative advantage with high migration, the within-community component is negative (−0.111) because high MTRs at the optimum reduce the concentration of high-skill types in Distressed, depressing the standard revenue-raising benefit of MTRs. The large positive between-community component (0.655) reflects the large fiscal externality from movers and overcomes this, yielding higher Distressed MTRs (0.544 vs. 0.509 in Elsewhere). With low migration (κ = 4), between-community components shrink substantially, and MTRs in Distressed fall below Elsewhere in all sorting scenarios.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What does the crosswalk from urban to rural baseline reveal about which assumptions drive the change in optimal transfers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Table 5 traces the urban-to-rural transition step by step. Starting from the urban baseline ($3,143 average transfer), replacing the migration elasticity target with the rural value of 0.20 triples the optimal transfer to $9,870. Subsequently replacing skill-taste correlation with comparative advantage as the sorting mechanism reduces the transfer by roughly half ($6,402). Adjusting rent to match the rural ratio (0.54) reduces it further to $2,780, as lower Distressed rent reduces the marginal utility of consumption at the bottom and increases income-based sorting. Targeting the rural income share (60% below $50K) raises it back to $4,140, and incorporating rural housing supply elasticities yields the rural baseline result of $4,329. This decomposition reveals that lower migration responsiveness is the single largest driver of higher optimal transfers in rural settings.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Place-based redistribution&lt;/strong&gt;: Transfer schemes in which economic benefits or tax burdens are conditioned on the geographic location of residence, as distinct from place-blind income taxes that condition only on earned income. In this paper, modeled as location-specific tax schedules T_j(z) that may differ across communities j.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Skill-taste correlation&lt;/strong&gt;: A source of spatial sorting in which households with higher skill levels (θ) have systematically stronger preferences for the &amp;ldquo;Elsewhere&amp;rdquo; location, independently of wage or rent differences. Formally, the conditional distribution G_θ(φ) of locational tastes given skill is weakly increasing in θ. This correlation breaks the Atkinson-Stiglitz sufficient condition for commodity tax superfluousness and generates unambiguously positive optimal transfers to Distressed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Comparative advantage (spatial)&lt;/strong&gt;: A sorting mechanism in which higher-skill workers are disproportionately more productive in Elsewhere than in Distressed, captured by the wage elasticity with respect to skill being higher in Elsewhere (γ₀(θ) &amp;gt; γ₁(θ)). Households with skill above a threshold sort into Elsewhere even with homogeneous locational preferences. The existence of spatial comparative advantage means that migrants to Distressed earn less, creating a fiscal externality for place-based transfers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-based sorting&lt;/strong&gt;: Sorting of lower-income, lower-skill households into Distressed arising purely from the higher cost of living in Elsewhere, without any systematic skill-taste correlation or comparative advantage. Because high-skill households are less sensitive to rent differences, they sort into Elsewhere when rents there are higher. When this is the sole sorting mechanism and locational preferences are homogeneous, the Atkinson-Stiglitz commodity tax superfluousness conditions are satisfied and optimal place-based transfers are zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fiscal externality (migration)&lt;/strong&gt;: The change in income tax revenue caused by migration responses to place-based policy changes, not by changes in incentives for stayers. When movers from Elsewhere to Distressed earn less in their new location, they generate lower tax payments, imposing a first-order cost on the government budget. This externality is measured by Δτ+(θ) in the optimal MTR formulas and equals the earnings-tax-revenue loss from movers across all skill levels above θ. This term is a &amp;ldquo;sufficient statistic&amp;rdquo; for the efficiency cost of place-based transfers in the sense of Chetty (2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demogrant (∆₀)&lt;/strong&gt;: The difference in lump-sum transfers provided to zero-earners across the two locations (−T₀(0) − (−T₁(0)) = T₀(0) − T₁(0)). A positive ∆₀ means Distressed provides a larger transfer to non-earners. It represents the place-based redistribution that occurs at the bottom of the earnings distribution, independently of MTR differences. In the paper&amp;rsquo;s decomposition, total optimal place-based redistribution (∆_z) exceeds ∆₀ when Distressed also has lower MTRs, meaning redistribution grows with income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Income-constant average tax difference (∆_z)&lt;/strong&gt;: The paper&amp;rsquo;s preferred summary measure of the average place-based transfer, defined as an equally weighted average of two tax-difference indices: the tax difference evaluated at Elsewhere earnings levels and the tax difference evaluated at Distressed earnings levels. This measure isolates tax schedule differences from productivity differences across locations, avoiding conflation of tax policy and wage effects on measured income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Landlord welfare weight (λ_L)&lt;/strong&gt;: The social marginal welfare weight assigned to landlords relative to the multiplier on the government budget constraint. When λ_L &amp;lt; 1, the planner values a marginal dollar of public funds more than a marginal dollar to landlords, creating a motive to use place-based taxes to shift rent incidence. The rent redistribution effect on optimal MTRs operates through the term (λ_L − 1)Δr+(θ), which has opposite signs in urban (positive) and rural (negative) distressed areas because of their different housing supply elasticities.&lt;/p&gt;</description></item><item><title>Policy Biases in a Model with Labor‐Market Frictions</title><link>https://macropaperwarehouse.com/papers/policy-biases-in-a-model-with-labormarket-frictions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/policy-biases-in-a-model-with-labormarket-frictions/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Dennis and Kirsanova ask whether shocks to labor-market matching efficiency and worker bargaining power pose a significant problem for monetary policy, and whether the inability to commit (discretion versus commitment) generates important stabilization bias in a model with labor-market matching frictions. They also examine how several popular simple monetary policy rules perform in response to these and other shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops a fully nonlinear DSGE model featuring: (1) a goods market characterized by monopolistic competition and Rotemberg-style quadratic price-adjustment costs; and (2) a labor market characterized by a constant-returns-to-scale matching function (Mortensen-Pissarides) and Nash bargaining over wages and hours worked. Because the flex-price equilibrium is inefficient — owing to both monopolistic competition and the matching friction — a linear-quadratic approximation is not valid for the discretionary policy problem, and the authors solve the model using Smolyak sparse-grid methods with Chebyshev polynomial basis functions.&lt;/p&gt;
&lt;p&gt;The model is calibrated to quarterly U.S. data. Key parameter values include: discount factor β = 0.99 (annualized real interest rate ≈ 4 percent), elasticity of substitution across goods ε = 11 (steady-state markup of 10 percent), price-adjustment cost φ = 80, quarterly separation rate δ = 0.12, job-finding rate f = 0.65 (delivering an employment rate close to 0.94 and an unemployment rate near 5.95 percent in steady state), elasticity of matching function with respect to unemployment ξ = 0.72, and workers&amp;rsquo; mean bargaining power equal to ξ = 0.72 (satisfying the Hosios condition at steady state). Five AR(1) shocks are included: aggregate technology (persistence 0.95, standard deviation 0.008), matching efficiency (persistence 0.80, standard deviation 0.032), bargaining power (persistence 0.80, standard deviation 0.028), consumption preference (persistence 0.70, standard deviation 0.006), and elasticity of substitution (persistence 0.85, standard deviation 0.12).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The central finding is that optimal monetary policy — whether conducted under commitment (Ramsey) or discretion — is highly efficient at responding to labor-market shocks, producing impulse responses that closely replicate the flex-price equilibrium for real variables. Specifically, in response to matching efficiency shocks and bargaining power shocks, the commitment and discretionary equilibria both track the flex-price equilibrium closely for output, consumption, employment, tightness, and the real wage.&lt;/p&gt;
&lt;p&gt;Discretion generates a pronounced inflation bias of approximately 1.82 percent per annum — large but not implausible — but does not generate a meaningful stabilization bias for the class of shocks studied (technology, matching efficiency, bargaining power, and consumption preference). The one exception is the elasticity of substitution shock (analogous to a markup shock in linearized models): for this shock, the impulse responses under discretion diverge noticeably from those under commitment, revealing a discretionary stabilization bias — consistent with conventional New Keynesian results.&lt;/p&gt;
&lt;p&gt;Regarding simple rules, strict inflation targeting (SIT) performs closely in line with commitment and discretion for all shocks. The two Taylor-type rules — one responding to inflation and output growth, the other to inflation and the unemployment rate — generate substantially greater volatility in inflation and the nominal interest rate relative to optimal policy. The unemployment-gap Taylor rule is the worst performer among the three simple rules; nevertheless, all three simple rules produce household welfare outcomes close to those under optimal monetary policy. The suboptimality of the simple rules is most evident in nominal variables, particularly inflation and the nominal interest rate, and less evident in real variables — though labor-market inefficiencies under the Taylor-type rules do emerge in response to matching efficiency and bargaining power shocks, with hours worked and the real wage deviating noticeably from flex-price outcomes.&lt;/p&gt;
&lt;p&gt;The probability of encountering the zero lower bound is, for all policies considered, considerably less than 0.5 percent across one million simulated observations, suggesting that ZLB concerns are not material for the shocks under study.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;These results hold within the context of a model with a fixed labor force (no participation margin), balanced-budget fiscal authority, no capital accumulation, and Nash bargaining over both wages and hours. The Hosios condition is satisfied at steady state (though the authors report that relaxing it has little effect on results). The analysis abstracts from the zero lower bound constraint when solving the model.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the Hosios condition and what role does it play in this model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Hosios condition requires that workers&amp;rsquo; bargaining power equal the elasticity of matches with respect to unemployment in the matching function (ξ = 0.72). When the condition holds, bargaining is efficient in the sense that the decentralized search equilibrium replicates the social planner&amp;rsquo;s allocation. The authors impose it at steady state (mean bargaining power &amp;amp; = ξ = 0.72) so that the flex-price equilibrium is distorted only by monopolistic competition, not by inefficient search. The authors state they also analyzed versions where the Hosios condition does not hold and found it had little effect on results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How are matching efficiency shocks transmitted through the economy, and how does optimal policy respond?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An improvement in matching efficiency raises the rate at which vacancies are filled and the unemployed find jobs, increasing employment from existing vacancy and unemployment levels. Employment rises, unemployment falls, labor market tightness increases, and the real wage rises. Firms substitute toward more workers (extensive margin) and away from hours-per-worker (intensive margin), so hours worked per employee decline even as aggregate hours rise. Both commitment and discretion track the flex-price equilibrium closely for all these real variables. Some difference is visible in inflation: under discretion the real wage rises by more than under commitment, pushing real marginal costs and inflation higher in the short run.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does a bargaining power shock affect the economy under optimal monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An increase in worker bargaining power shifts the match surplus toward workers, raising real wages and hours worked per employee. Firms, receiving a smaller surplus share, post fewer vacancies and hire fewer workers, leading to a decline in employment, a fall in labor market tightness, and a rise in unemployment. The employment decline is large enough to lower household income, goods production, and aggregate consumption. Under both commitment and discretion, the real economy tracks the flex-price equilibrium closely. Notable differences between commitment and discretion appear in inflation: under discretion, the inflation response on impact is larger and more persistent than under commitment, and monetary policy tightens more aggressively (higher nominal rate) under discretion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the key difference between the commitment and discretionary equilibria, and why is stabilization bias mostly absent?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Commitment (Ramsey) policy differs from discretionary policy primarily in the level of inflation, not in the dynamics of the real economy. Discretion generates an inflation bias of approximately 1.82 percent per annum. However, the impulse responses for real variables (output, consumption, employment, tightness, real wage) under commitment and discretion are very similar to each other and to the flex-price equilibrium for four of the five shocks. This indicates that forward guidance — which commitment provides and discretion does not — is not an important factor in this model&amp;rsquo;s response to these shocks. The intuition is that the economy&amp;rsquo;s fluctuations in response to matching efficiency and bargaining power shocks are largely efficient, so the central bank needs only to avoid creating additional distortions, which both commitment and discretion achieve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What distinguishes the elasticity of substitution shock from the other shocks in terms of policy performance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The elasticity of substitution shock behaves similarly to a markup shock in linearized models: an increase in substitutability reduces firms&amp;rsquo; monopolistic power, lowers the price markup, raises output and consumption, increases hours worked, posted vacancies, employment, and the real wage. For this shock, the impulse responses under discretion diverge noticeably from those under commitment — the decline in inflation is larger and more persistent under discretion than under commitment, and the nominal interest rate response differs in sign across policies. This is the only shock in the model for which a meaningful discretionary stabilization bias is evident, consistent with conventional wisdom from linearized New Keynesian models that markup shocks generate stabilization bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do the three simple rules compare with optimal policy for labor-market shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Strict inflation targeting (SIT) behaves similarly to commitment and discretion and hence closely replicates the flex-price equilibrium for all five shocks. The two Taylor-type rules — one responding to inflation and output growth (parameterized with φ_π = 2.5, φ_y = 0.5/4) and one responding to inflation and the unemployment rate (φ_π = 2.5, φ_u = 1.5/4) — both generate substantially more volatility in inflation and the nominal interest rate relative to optimal policy. The unemployment-gap Taylor rule generally results in inflation moving more in response to shocks and in the economy returning more slowly to baseline, making it the worst-performing simple rule. However, all three simple rules produce welfare outcomes close to those under optimal policy; the suboptimality of the Taylor-type rules is most evident in nominal rather than real variables.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Does the zero lower bound (ZLB) pose a concern under any of the policies studied?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Based on simulating one million observations from each model, the unconditional probability of encountering the ZLB is very small — well below 0.5 percent — for all policies considered. The commitment policy has a ZLB probability of approximately 0.077 percent, reflecting its near-zero average inflation. Discretion&amp;rsquo;s positive inflation bias of 1.82 percent reduces the ZLB probability to approximately 0.001 percent. The Taylor-type rules — especially the unemployment-gap rule (ZLB probability approximately 0.296 percent) — have higher probabilities than discretion, though these remain very small. These results suggest that for the shocks analyzed, violations of the ZLB are extremely unlikely.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the steady-state and stochastic simulation mean outcomes, and how do they compare across regimes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The deterministic steady-state unemployment rate is approximately 5.95 percent, rising slightly to a mean of 6.04 percent in the stochastic flex-price economy. The stochastic means for output, consumption, employment, and the real wage are all slightly below their deterministic steady states across all regimes, because in the absence of capital households respond to increased volatility by substituting away from labor toward leisure (precautionary leisure) rather than precautionary saving. Mean outcomes for real variables under discretion (e.g., output mean ≈ 0.3730, unemployment mean ≈ 6.025 percent) and commitment (output mean ≈ 0.3729, unemployment mean ≈ 6.028 percent) are very similar to each other and to the flex-price means (output mean ≈ 0.3728, unemployment mean ≈ 6.038 percent). The key difference is in inflation: commitment delivers near-zero mean inflation (≈ 0.00043 percent annually) while discretion delivers ≈ 1.82 percent annually.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why is a nonlinear solution method used, and what does this allow the paper to capture that log-linearized approaches cannot?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The nonlinear solution is required because the flex-price equilibrium is not efficient (monopolistic competition and the matching friction both create distortions), so the discretionary policy problem cannot be formulated as a linear-quadratic problem. The nonlinear approach allows the paper to analyze both level biases (the steady-state inflation bias) and stabilization biases (the dynamic response to shocks) in a unified framework — something that log-linearization around the efficient steady state would preclude. Related papers by Furlanetto and Groshenny (2016) and Zhang (2017) focus on log-linearized models and the natural rate of unemployment; this paper focuses instead on optimal policy and policy biases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What role does the consumption preference shock play, and how does it differ from the other shocks?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The consumption preference shock is the only shock in the model that acts somewhat like a demand shock. A one standard deviation increase raises the utility obtained from consumption, leading households to increase consumption and hours worked (at a slightly lower real wage), which induces firms to post more vacancies and raise employment. Most of the labor market response comes through higher hours rather than higher employment. Both commitment and discretionary policy cope well with this shock — the real economy closely tracks the flex-price equilibrium — because the shock has relatively little impact on inflation (inflation declines slightly due to lower real marginal costs from the lower real wage). The nominal interest rate rises because the increase in the real interest rate (driven by households&amp;rsquo; desire to borrow) more than offsets the decline in inflation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Matching efficiency shock&lt;/strong&gt;: A stochastic shock to the parameter mt in the constant-returns-to-scale matching function Mt = mt * u_t^xi * v_t^(1-xi), which governs the overall rate at which unemployed workers and posted vacancies are matched. A decline in mt reduces the number of matches formed at any given levels of unemployment and vacancies, raising unemployment and reducing employment. The paper treats this as an empirically relevant shock motivated by evidence of a sustained decline in aggregate matching efficiency during the Great Recession.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Discretionary inflation bias&lt;/strong&gt;: The tendency for a central bank conducting policy without the ability to commit to produce systematically higher inflation than would occur under a commitment (Ramsey) regime. In this model, discretion generates an annualized inflation rate of approximately 1.82 percent, while commitment produces near-zero average inflation. This reflects the time-inconsistency problem (Kydland and Prescott, 1977; Barro and Gordon, 1983) arising from the interaction of monopolistic competition and price stickiness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stabilization bias&lt;/strong&gt;: A distortion that arises under discretionary policy, in which the central bank&amp;rsquo;s inability to commit leads it to respond to shocks in a manner that departs from optimal commitment responses, producing suboptimal dynamics for real variables in addition to the inflation bias. In this paper, stabilization bias is found to be largely absent for matching efficiency, bargaining power, technology, and consumption preference shocks, but is present for the elasticity of substitution shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hosios condition&lt;/strong&gt;: The condition, derived in Hosios (1990), that efficient decentralized search-and-matching equilibrium requires workers&amp;rsquo; bargaining power to equal the elasticity of matches with respect to the unemployment rate (ξ). In the paper&amp;rsquo;s notation: &amp;amp; = ξ. When the condition holds, the flex-price equilibrium replicates the social planner&amp;rsquo;s allocation in the labor market; deviations cause either excessive or insufficient vacancy posting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor market tightness (θ)&lt;/strong&gt;: Defined as the ratio of vacancies to unemployed searchers, θt = vt/ut. When tightness is high, the labor market is tight and firms have difficulty filling vacancies (low job-filling rate q(θ)) while workers find jobs easily (high job-finding rate f(θ)). Tightness is the key state variable linking vacancy posting decisions by firms to employment dynamics and wage bargaining outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bargaining power shock&lt;/strong&gt;: A stochastic shock to the worker&amp;rsquo;s share of the Nash bargaining surplus (&amp;amp;t), which follows an AR(1) process. The Hosios condition holds at steady state but is violated when the shock is realized. A positive shock shifts surplus from firms to workers, raising real wages, depressing vacancy posting, and reducing employment, while a negative shock has the reverse effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rotemberg price-adjustment cost&lt;/strong&gt;: A quadratic cost φ/2 * (π_t)^2 * y_t paid by firms when they change prices, creating price stickiness without the &amp;ldquo;menu cost&amp;rdquo; lumpiness of Calvo pricing. This creates a role for monetary policy and generates a nonlinear Phillips curve. The coefficient φ is set to 80, based on the estimate in Ireland (2001).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Flex-price equilibrium&lt;/strong&gt;: The benchmark equilibrium in which prices are fully flexible and bargaining is efficient (Hosios condition satisfied exactly). In this equilibrium there is no role for monetary policy over the price-adjustment margin, and the economy responds to shocks in a manner that is efficient conditional on the remaining frictions (monopolistic competition and the matching friction). The paper uses deviations of commitment and discretionary outcomes from this benchmark to measure the efficiency of optimal monetary policy.&lt;/p&gt;</description></item><item><title>Policy Diffusion and Polarization across U.S. States</title><link>https://macropaperwarehouse.com/papers/policy-diffusion-and-polarization-across-u.s.-states/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/policy-diffusion-and-polarization-across-u.s.-states/</guid><description>&lt;p&gt;DellaVigna and Kim study the innovation and diffusion of policies across U.S. states using a dataset of over 700 state laws spanning seven decades. The central question is what predicts whether a state adopts a policy — and how those predictors have changed over time. The paper draws on two primary data sources: the State Policy Innovation and Diffusion (SPID) Database (Boehmke et al., 2020), covering 676 policies, and a hand-collected sample of 57 policies from 91 NBER working papers (April 2012–September 2021) that feature state-level policy variation. The combined dataset covers 733 policies adopted from the 1950s onward across the contiguous 48 states.&lt;/p&gt;
&lt;p&gt;On policy innovation, the paper finds that state capacity plays only a small role: larger and richer states are only slightly more likely to introduce new policies, innovation originates from both Republican and Democratic states, and the patterns are largely idiosyncratic with respect to observable state characteristics. California is the most frequent innovator, but large states like Florida and Texas rank in the middle.&lt;/p&gt;
&lt;p&gt;For policy diffusion, the paper employs both a static Geary&amp;rsquo;s C clustering statistic (measuring whether the first 10 adopting states cluster geographically or politically relative to a random-diffusion benchmark) and a dynamic logit hazard model estimated separately by decade. The hazard model identifies three similarity channels — geographic, demographic, and political — and allows their coefficients to vary over time.&lt;/p&gt;
&lt;p&gt;The central finding is a structural break in diffusion patterns around 2000. From the 1950s to the 1990s, geographic proximity is the dominant predictor of policy adoption: the coefficient on geographic similarity is 0.34 in the 1970s and remains roughly constant at 0.33 in the most recent decade. Demographic similarity is consistently positive and stable (approximately 0.20 in the 1980s, 0.22 in the 2010s). Political similarity — measured by closeness in Republican vote-share from the most recent presidential election — is a modest predictor before 2000, with coefficients between 0.14 (1970s) and 0.17 (1990s). Since 2000, the political similarity coefficient triples: 0.46 in the 2000s and 0.52 in the 2010s, making it by far the strongest predictor. The overall pseudo R-squared rises from 0.13 in the 1970s to 0.19 in the 2010s.&lt;/p&gt;
&lt;p&gt;These patterns are more pronounced for policies studied by economists: in the NBER subsample, the political similarity coefficient reaches 0.66 (s.e.=0.09) in the most recent two decades, versus 0.42 (s.e.=0.04) in the SPID sample.&lt;/p&gt;
&lt;p&gt;The paper tests whether the increased role of political similarity reflects correlated voter preferences, learning, or competition versus party discipline. Against correlated-preferences explanations: adding cross-state migration flows as a similarity measure reduces geographic predictive power but leaves the political similarity coefficient entirely unchanged; and typical policy-outcome variables (poverty rate, opioid mortality, income) have not become more correlated among politically similar states over time. In favor of party discipline: similarity in unified state government has zero predictive power through the 1990s but a coefficient of 0.42 (s.e.=0.06) in the 2000s–2010s. An event study of switches to unified party control confirms this causally for 1991–2020: switching to unified government raises the probability of passing ideologically aligned laws by approximately 2 percentage points in the four years following the switch, with no pre-trends and no effect on neutral-leaning laws; the same event study for 1950–1990 yields no detectable effect.&lt;/p&gt;
&lt;p&gt;COVID policies (77 state laws since October 2019) show strong political similarity in adoption; historical vaccination mandate policies (28 laws since 1975) show no political similarity effect. The paper concludes that rising party polarization at the state level — detectable from the 2000s onward, lagging the Congressional trend by roughly four to five decades — is the primary driver of the shift in diffusion patterns. The authors additionally classify each of the 57 NBER-sample policies by type of diffusion as an input for difference-in-differences research design assessment.&lt;/p&gt;
&lt;p&gt;Q: What data do the authors use and what is its scope?
A: The main source is the SPID Database (Boehmke et al., 2020), covering 676 policies over seven decades. The authors supplement this with 57 policies hand-collected from 91 NBER working papers (2012–2021) that use state-level policy variation. The combined sample covers 733 policies adopted from the 1950s onward in the contiguous 48 states, with the SPID sample averaging 23 adopting states per policy and the NBER sample averaging 29.&lt;/p&gt;
&lt;p&gt;Q: Do states with more resources or larger populations systematically innovate more policies?
A: The evidence for a state-capacity hypothesis is weak. There is only suggestive evidence that higher per-capita income predicts being in the top-20% of innovators, and no clear difference in population between the top and bottom innovators. Innovations arise from both Republican and Democratic states. One consistent correlate is urban population share, but overall innovation is largely idiosyncratic with respect to observable characteristics.&lt;/p&gt;
&lt;p&gt;Q: What was the dominant predictor of policy diffusion before 2000?
A: Geographic proximity was the dominant predictor. The coefficient on geographic similarity in the hazard model is 0.34 in the 1970s and remains stable at approximately 0.33 in the 2010s. Demographic similarity contributes consistently at approximately 0.20. Political similarity before 2000 is modest, ranging from 0.14 in the 1970s to 0.17 in the 1990s — roughly one-third to one-half the magnitude of the geographic coefficient.&lt;/p&gt;
&lt;p&gt;Q: How dramatically does political similarity change after 2000, and is this finding robust?
A: The political similarity coefficient triples, rising from 0.17 in the 1990s to 0.46 in the 2000s and 0.52 in the 2010s, making it the largest single predictor in recent decades. This pattern is robust across linear probability models, alternative measures of political similarity, alternative thresholds for &amp;ldquo;closest&amp;rdquo; states (closest fifth, fourth, third, or half all yield comparable coefficients), and alternative ways of computing adoption counts.&lt;/p&gt;
&lt;p&gt;Q: Is the shift toward political diffusion stronger for policies economists study?
A: Yes. In the NBER subsample, the political similarity coefficient reaches 0.66 (s.e.=0.09) in the 2000s–2010s, compared to 0.42 (s.e.=0.04) in the SPID sample. Geographic similarity also has somewhat higher coefficients in the NBER sample throughout the period. This implies that the policies most studied for difference-in-differences evaluation are also those most subject to politically-driven diffusion.&lt;/p&gt;
&lt;p&gt;Q: What does the Medicaid case study illustrate about political polarization?
A: ACA Medicaid expansion spread almost exclusively along partisan lines, with Republican vote-share accurately predicting the year of adoption. Crucially, the states that delayed or declined adoption — higher Republican vote-share states — had a higher share of population that would benefit from the expansion and therefore face a worse policy-need match. By contrast, the original 1966 Medicaid rollout showed no relationship between state political leaning and timing of adoption, and neither did the 1960s–1970s food stamp program expansion.&lt;/p&gt;
&lt;p&gt;Q: How do the authors distinguish party discipline from correlated voter preferences as the mechanism?
A: Two tests point away from correlated preferences: (1) cross-state migration flows, when added as a similarity measure, absorb geographic predictive power but leave the political similarity coefficient entirely unaffected; (2) typical policy-outcome variables (opioid mortality, poverty rate, income, etc.) have not become more correlated among politically similar states over time, contradicting the hypothesis that local needs or environments have become politically correlated.&lt;/p&gt;
&lt;p&gt;Q: What is the direct evidence for party discipline as the operative mechanism?
A: The authors construct a measure of similarity based on unified party control (governor and both chambers of the same party). This variable has zero predictive power through the 1990s (point estimate near zero). In the 2000–2020s, the coefficient for unified-government similarity is 0.42 (s.e.=0.06), making it the strongest single predictor of adoption in those decades. States with divided governments show no predictive power of adoption by other divided-government states, further isolating the role of party control.&lt;/p&gt;
&lt;p&gt;Q: What does the event-study of switches to unified party control show?
A: Switches to unified party control in 1991–2020 produce a statistically significant increase of approximately 2 percentage points in the probability of adopting ideologically aligned laws within four years of the switch, relative to the year before. The effect emerges in year t+1 and is persistent, with no pre-trends, and the effect on neutral-leaning laws is zero, ruling out a simple reduced-gridlock story. The same event study for 1950–1990 detects no effect.&lt;/p&gt;
&lt;p&gt;Q: How do COVID state policies compare to historical vaccination policies in terms of political diffusion?
A: COVID policies (77 state laws, October 2019–August 2021) show significant political similarity in adoption, consistent with the recent-decade patterns. Vaccination mandate laws (28 policies since 1975) show no political similarity effect whatsoever, with demographic and modest geographic similarity being the relevant predictors. This contrast underscores that political polarization in policy adoption is a recent phenomenon that has spread even to policy areas without prior partisan patterning.&lt;/p&gt;
&lt;p&gt;Q: How does partisan polarization at the state level compare temporally to polarization in Congress?
A: Congressional polarization (measured by DW-NOMINATE) has been rising since the 1950s. State-level policy polarization, as documented here, does not emerge until the 2000s — a lag of roughly four to five decades. The paper notes it has risen rapidly and has already reached policy domains (such as COVID mandates) that showed no political patterning historically.&lt;/p&gt;
&lt;p&gt;Q: Does the diffusion pattern vary across policy types?
A: Yes. For economic policies, geography and demographics decline in importance over time with a smaller increase in political predictors. For non-economic (social) policies, geographic importance remains stable while political polarization is especially strong. Political polarization is strongest in Republican-leaning and Democratic-leaning states, and weaker among battleground states, consistent with a party-driven model where ideologically extreme states adopt from each other.&lt;/p&gt;
&lt;p&gt;Q: How do the authors classify individual NBER-sample policies by diffusion type?
A: Using Geary&amp;rsquo;s C statistics computed separately for geographic and political clustering for each of the 57 NBER policies, the authors identify three approximate clusters: (1) primarily politically-clustered (e.g., Medicaid expansion); (2) jointly geographically and politically clustered (e.g., ban on asking about past salary history); and (3) largely idiosyncratic, neither geographically nor politically clustered (e.g., anti-bullying laws). This classification has direct implications for assessing identification threats in difference-in-differences designs.&lt;/p&gt;
&lt;p&gt;Q: What does the overall predictability of policy adoption look like over time?
A: The pseudo R-squared from the logit hazard model rises from 0.13 in the 1970s to 0.19 in the 2010s. The increase in political similarity is large enough not only to surpass geographic similarity as a predictor but to make the overall process of state policy adoption more predictable over time.&lt;/p&gt;
&lt;p&gt;Policy diffusion: The process by which a policy adopted in one state subsequently spreads to other states; measured here along geographic, demographic, and political dimensions using a logit hazard model estimated by decade.&lt;/p&gt;
&lt;p&gt;Geary&amp;rsquo;s C statistic: A ratio of weighted to unweighted average pairwise squared differences in adoption status, adapted from spatial statistics (Geary, 1954). Values below 1 indicate clustering; values above 1 indicate anti-clustering. The paper reports 1−C so higher values mean more clustering among similar states.&lt;/p&gt;
&lt;p&gt;Policy innovation: First-year adoption of a law in any state; a state is an &amp;ldquo;innovator&amp;rdquo; if it adopts in the first year the policy appears anywhere. The paper distinguishes innovation (origination) from diffusion (spread).&lt;/p&gt;
&lt;p&gt;Logit hazard model: A discrete-time logit model estimated at the state-year-policy level for all states that have not yet adopted a given policy, with policy-decade fixed effects as a baseline hazard and three time-varying similarity measures (geographic, demographic, political) as key predictors.&lt;/p&gt;
&lt;p&gt;Political similarity: Closeness of two states&amp;rsquo; Republican vote-shares from the most recent presidential election; the closest third of states in this dimension are used to construct the diffusion measure. Shown to be independent of — and to have grown far more predictive than — geographic similarity since 2000.&lt;/p&gt;
&lt;p&gt;Unified party control: A state government in which the governor and both state legislative chambers belong to the same party. The paper shows this is the variable most predictive of politically-driven policy diffusion in the 2000s–2020s, with a coefficient of 0.42 where it was effectively zero before 2000.&lt;/p&gt;
&lt;p&gt;Party discipline / party polarization: The paper&amp;rsquo;s preferred explanation for post-2000 patterns: state politicians increasingly vote and adopt policies along party lines beyond what voter preferences alone would predict, with the effect detectable since the 2000s at the state level, lagging the Congressional polarization trend by roughly four decades.&lt;/p&gt;</description></item><item><title>Politics at Work</title><link>https://macropaperwarehouse.com/papers/politics-at-work/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/politics-at-work/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Do individual political views shape firm behavior and labor market outcomes in the private sector? Specifically, do business owners sort copartisan workers into their firms, and does employers&amp;rsquo; political discrimination drive this sorting?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies the complete Brazilian formal labor market over 2002–2019, assembling a novel longitudinal worker-firm-owner-party matched dataset from three administrative sources: (1) RAIS (Relação Anual de Informações Sociais), the universe of formal-sector workers (87 million unique workers, 7.6 million unique firms); (2) the Receita Federal do Brazil (RFB) and Cadastro Nacional de Empresas (CNE), containing business ownership structures for all registered firms; and (3) the Tribunal Superior Eleitoral (TSE) registry of all party members (19.3 million individuals) over 2002–2019. Matching these sources yields political affiliation for 11.4% of all private-sector owners and 7.8% of all private-sector workers in the sample. Party affiliation in Brazil requires an active registration step and is interpreted as a signal of strong and visible political views, distinguishing affiliated from unaffiliated individuals who likely hold milder views. The 35 parties in the sample are highly fragmented; the top 7 account for nearly 70% of all party members.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Political assortative matching.&lt;/em&gt; Using a likelihood ratio index (Eika et al., 2019; Chiappori et al., 2020), the paper finds that workers and owners belonging to the same party are on average about twice as likely to match in the labor market relative to random matching. Once within-municipality geographical sorting is accounted for, this figure falls to approximately 55% excess probability of copartisan matching, and increases over time: from 1.41 in 2002–2006 to 1.67 in 2016–2019. A dyadic regression approach — constructing all worker-firm dyads within industry-municipality labor markets and controlling for shared gender, race, age, and education — confirms the result: across all years, a politically affiliated worker is between 41% and 75% more likely to be employed by a copartisan owner than by an owner affiliated with a different party. Political assortative matching is driven both by higher hiring probabilities (range: 32%–59% more likely for copartisans, hiring margin only) and by longer tenure: copartisan workers stay in the firm roughly 5.5% longer than otherwise comparable workers of a different party, even within the same firm and hire-year (column 3 of Table 2). In every year and by every method, the degree of political assortative matching exceeds that of gender (15%–31% excess probability under dyadic approach) and race (approximately 3.4%), which are themselves both positive and significant.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Mechanisms: political discrimination.&lt;/em&gt; Three sets of evidence point to employer political discrimination as a relevant driver. First, in the administrative micro-data: assortative matching decreases strongly with firm size — it is more than twice as large in firms with up to 10 employees than in medium firms and more than six times as large as in firms with more than 50 employees — and is stronger for higher occupational layers and for jobs requiring above-median social skills or interpersonal relationships. Political assortative matching is, if anything, larger for parties not in power locally, inconsistent with a patronage mechanism. An event study of 5,262 owners who switched party finds a sharp increase of about 0.2 standard deviations in hires from the new party and a corresponding drop in hires from the old party at the time of the switch, with the share of workers from the new party rising by roughly 5 percentage points persistently. Second, an incentivized resume rating (IRR) field experiment (150 business owners; nondeceptive design) shows that owners rate copartisan resumes 0.213 points higher on a 1–7 Likert scale (a 7.4% increase relative to the mean rating for different-party resumes, statistically significant at p &amp;lt; 0.05), with no significant effect on perceived candidate acceptance probability. Third, a representative survey of 891 owners and 1,003 workers finds that belief-based and taste-based discrimination are ranked as the leading explanations by both groups; 47% of owners and 58% of workers agree with the belief-based discrimination statement. Additionally, 29% of surveyed owners (22% say &amp;ldquo;Yes&amp;rdquo; and 7% &amp;ldquo;In some cases&amp;rdquo;) explicitly reveal that political views affect their hiring decisions.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Real consequences.&lt;/em&gt; Conditional on employment, copartisan workers are promoted faster: they are 0.448 percentage points more likely to be promoted from white-collar to managerial positions (against a base rate of 2.58%) and 0.44 percentage points more likely to be promoted from blue-collar to white-collar positions (base rate 2.98%). Workers from a different party than the owner face a promotion penalty of 0.104–0.180 percentage points for white-collar-to-manager promotions. On wages, copartisan workers earn 3.9% more than unaffiliated coworkers within the same firm and year (firm-year FE specification); the effect is 2.8% when restricting to the same occupation within the firm. Workers from a different party earn 1.6% less. Decomposing by tier: managers (copartisan premium 1.6%), white-collar workers (3.4%), blue-collar workers (1.5%). Despite better outcomes, copartisan workers are 2.1 percentage points (2.3% relative to the mean) less likely to be educationally qualified for their occupation, conditional on firm-year and controlling for a full set of demographics. Finally, a higher share of copartisan workers in the prior year is associated with lower firm employment growth (estimated β = −0.071), corresponding to approximately a 1 percentage point gap in annual growth rate for a one-standard-deviation difference in copartisan share — substantial relative to an average annual growth rate of 10%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All findings pertain to the formal private sector in Brazil over 2002–2019. Political affiliation in the Brazilian system requires an active step and signals strong views; results apply to the approximately 7.8%–11.4% of workers and owners who are party-registered. The field experiment sample is limited to 150 business owners affiliated with major Brazilian parties who were actively seeking to hire. The firm growth result is explicitly characterized as suggestive, without a source of exogenous variation.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the likelihood ratio index and what does it show for political matching in Brazil?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The likelihood ratio index measures how many times more likely a match between a worker and owner of the same party is, relative to the expected frequency under random matching (conditional on the population shares of each party). Across 2002–2019, the unconditional index ranges from 1.56 to 1.85, implying workers and employers of the same party are on average about twice as likely to match as under random matching. After accounting for geographic sorting within municipalities, the index ranges from approximately 1.41 (2002–2006 average) to 1.67 (2016–2019 average), showing a clear increasing trend. The corresponding gender and race indexes average about 1.2 and 1.35, respectively, in the basic specification, both significantly lower than the party index in every year of the sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the dyadic regression estimates control for omitted characteristics, and what do they find?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The dyadic regression constructs all possible worker-firm pairs within each municipality-industry labor market in a given year. The dependent variable is an indicator for whether worker i is employed by firm f. The key coefficient of interest is the differential probability of employment for a copartisan pair relative to a different-party pair, controlling for indicators for shared gender, race, age bracket, and education level, as well as worker occupation fixed effects and experience. This controls for the concern that politically affiliated individuals share non-political traits that correlate with employment choices. After these controls, a politically affiliated worker is 41%–75% more likely (depending on year) to be employed by a copartisan owner than by a different-party owner. The effect stems primarily from copartisan workers being preferentially hired (not just from unaffiliated owners preferring any affiliated worker indiscriminately). The analogous dyadic estimate for shared gender is 15%–31% and for shared race is approximately 3.4%, both lower than the party estimate in all years.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How is political assortative matching decomposed into hiring versus retention margins?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To isolate the hiring margin, the authors estimate the dyadic regression restricting to newly hired workers (not present in the firm in year t-1). They find that the probability of being hired by a copartisan owner is 32%–59% higher than by a different-party owner across years. The retention (tenure) margin is estimated by regressing the share of subsequent years a worker remains at the firm on partisan alignment at the time of hire. In the most stringent specification (year-of-hire × firm fixed effects), copartisan hires stay 5.5 percentage points longer (as a share of post-hire years) than different-party hires from the same firm and hire-year cohort. Both margins are significant, and both exhibit stronger political sorting than equivalent estimates for gender or race.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the evidence against political patronage as the primary driver of political assortative matching?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If political patronage (parties pressuring owners to hire copartisans) were the main driver, we would expect political assortative matching to be stronger when the owner&amp;rsquo;s party is in power locally, as those parties have greater leverage over business owners. The authors estimate a modified dyadic regression distinguishing between cases where the owner&amp;rsquo;s party is in the ruling coalition of the municipal mayor or state governor versus not in power. The results show that political assortative matching is, if anything, larger for parties not in power. This is inconsistent with patronage being the dominant mechanism and consistent with the discrimination channel being driven by owner preferences rather than external political pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the event study of owner party changes show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The event study tracks 5,262 owners who switch party affiliation during 2002–2019, comparing their firms to control firms in the same market whose owners remain affiliated to the original party. At the time of the switch, there is a sharp increase of approximately 0.2 standard deviations in hires from the owner&amp;rsquo;s new party and a corresponding sharp decrease in hires from the old party. Hires from other parties and unaffiliated hires also decline modestly. The share of the workforce affiliated with the new party increases by roughly 5 percentage points and remains elevated in subsequent years. Because nonpolitical network ties (shared school, neighborhood, sports team) are unlikely to dissolve abruptly when an owner changes party, this design provides additional evidence that the change in hiring is driven by a direct change in the owner&amp;rsquo;s political preferences rather than by network overlap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What was the design of the incentivized resume rating experiment and why does it identify political discrimination?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The experiment was conducted with 150 Brazilian business owners recruited from the administrative data (who are already known to be affiliated with one of six major parties), targeting owners with active hiring interest through a leading job platform. Owners rated 20 synthetic resumes with fully randomized features (education, experience, training, skills, formatting). Sixteen resumes had no partisan cues; two contained cues signaling copartisanship with the rating owner; two signaled a party from the opposite side of the political spectrum. Incentives were provided by committing to send respondents real job-seeker profiles from the platform chosen by machine learning based on revealed preferences. Because all resume features other than the partisan cue were randomized, the experiment shuts down shared nonpolitical networks and patronage as explanations; the only channel is the employer&amp;rsquo;s direct preference for the candidate&amp;rsquo;s partisan affiliation. The response rate was 11% and the survey was conducted March–May 2022.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the quantitative magnitude of the field experiment result?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Owners rate copartisan resumes 0.213 points higher on the 1–7 Likert scale relative to resumes from the opposite side of the political spectrum (statistically significant at p &amp;lt; 0.05), representing a 7.4% increase relative to the mean rating of different-party resumes (2.950). When resume-level controls (gender, high-skill experience flag, years of experience, programming skills, training) are added, the estimate is 0.254. There is no statistically significant effect on owners&amp;rsquo; perceived likelihood that a candidate would accept a job offer (coefficient 0.150–0.158, not significant), suggesting that the observed difference in interest ratings reflects a genuine direct preference for copartisans, not an expectation that copartisans are more likely to accept.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What do the survey findings add about mechanisms and the prevalence of political discrimination?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The survey of 891 owners and 1,003 workers (response rate 26.84%) presents five candidate mechanisms and asks respondents to evaluate each. Both groups rank belief-based discrimination (owners believe copartisans would be more productive) as the most likely explanation: 47% of owners and 58% of workers partially or strongly agree. Taste-based discrimination is second (36% owners, 52% workers agree), followed by networks (39% owners, 49% workers). Patronage and workers&amp;rsquo; preferences attract little agreement from either group. Among owners ranked by single strongest agreement, 29.7% most strongly agree with belief-based discrimination and 22.0% with taste-based, while 29% of all surveyed owners explicitly stated that political views do affect their hiring decisions. These patterns are broadly similar regardless of the respondent&amp;rsquo;s own political affiliation status.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How large are the political promotion and wage premia, and how do they compare to gender and race effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For promotions, copartisan white-collar workers are 0.448 percentage points more likely to be promoted to manager (relative to unaffiliated co-workers hired in the same firm-year), against a base promotion rate of 2.58% — an effect of approximately 17% of the mean. For blue-collar-to-white-collar promotion, the copartisan premium is 0.44 percentage points against a base rate of 2.98%. For wages, copartisans earn 3.9% more than unaffiliated co-workers within the same firm and year; restricting to the same occupation within the firm, the premium is 2.8%. The political wage premium (3.9%) exceeds the gender wage premium (1.5%) and the race wage premium (1.0%) in the same specification. Workers from a different party than the owner earn 1.6% less than unaffiliated co-workers within the same firm-year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Are copartisan workers better qualified than those they displace, and what does this imply for firm performance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Copartisan workers are significantly less qualified in terms of education relative to their occupation: they are 2.1 percentage points less likely to be educationally qualified for their position than their unaffiliated co-workers within the same firm-year (2.3% relative to the mean qualification rate of 93.2%), with the largest effects for managers. Workers of a different party show only a small and economically negligible qualification gap. The fact that copartisans are paid more, promoted faster, and yet are less qualified is consistent with political discrimination substituting for competence in personnel decisions. The qualification shortfall is specifically attributed to copartisanship and not to shared gender, race, age, or education between owner and worker, as those coefficients are economically small.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the evidence on firm growth and what are the limitations of that evidence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Firms with a higher share of copartisan workers in the prior year grow less. The estimated coefficient β = −0.071, and a one-standard-deviation difference in the copartisan share is associated with approximately a 1 percentage point gap in annual employment growth, relative to a mean growth rate of 10%. The specification compares firms of the same size and with the same number of affiliated workers in the same year. The result is robust to adding municipality and municipality-industry fixed effects. The authors explicitly characterize this evidence as suggestive, noting the absence of an exogenous source of variation in political discrimination. The negative association is more consistent with taste-based discrimination (Becker, 1957) — in which politically homogeneous firms sacrifice productivity for the owners&amp;rsquo; amenity of employing copartisans — than with accurate belief-based discrimination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How is political assortative matching distributed across parties and does it depend on party ideology?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The likelihood ratio index shows large assortative matching across the entire political spectrum. For most years, relatively more ideologically extreme parties — on the left (PT, PDT) and on the right (PP, DEM) — display higher assortative matching than more centrist parties (PMDB, PSDB). This pattern is consistent with stronger partisan identity at the extremes leading to stronger preferences for copartisan workers, but the paper does not formally model the mechanism behind this heterogeneity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What is the role of workers&amp;rsquo; preferences as opposed to employers&amp;rsquo; discrimination, and how can wages distinguish them?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;If workers have a preference for working with copartisan owners (treating this as a job amenity), compensating differentials theory would predict a negative wage premium for copartisan workers — they would accept lower wages in exchange for working with like-minded owners. The data show the opposite: copartisan workers earn significantly more, not less, than their unaffiliated co-workers. This evidence is inconsistent with workers&amp;rsquo; preferences being the primary driver of political assortative matching, and is instead consistent with employers&amp;rsquo; discrimination. The survey evidence corroborates this: both owners and workers assign low priority to the &amp;ldquo;workers&amp;rsquo; preferences&amp;rdquo; mechanism.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Political assortative matching&lt;/strong&gt;: The phenomenon by which workers and business owners belonging to the same political party are matched in the labor market at rates significantly exceeding what would occur under random matching within the local labor market. Measured via the likelihood ratio index and dyadic regressions that control for shared demographic characteristics. In this paper, political assortative matching is larger in magnitude than assortative matching along gender or racial lines.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Likelihood ratio index (S)&lt;/strong&gt;: A measure of assortative matching defined as the weighted sum of the ratios of observed same-party co-occurrence probabilities to their expected probabilities under random matching. S &amp;gt; 1 indicates positive assortative matching. The paper uses both a basic version and a geography-adjusted version that computes the index within municipalities to control for geographic concentration of party membership.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dyadic regression&lt;/strong&gt;: A regression approach that constructs all possible worker-firm pairs within a defined labor market (municipality × 2-digit industry) to estimate the differential probability that a worker is employed by a copartisan firm relative to a different-party firm. The key advantage is the ability to control simultaneously for multiple shared demographic characteristics between worker and owner, accounting for the correlation of assortative criteria.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Incentivized resume rating (IRR) experiment&lt;/strong&gt;: A nondeceptive field experiment design (following Kessler et al., 2019) in which business owners rate synthetic resumes with fully randomized characteristics. Truthful rating is incentivized because respondents are told that their revealed preferences will be used to select real job-seeker profiles sent to them by a partner platform via machine learning. This design allows direct identification of employer preference for copartisan candidates while ruling out alternative channels such as shared nonpolitical networks or patronage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political wage premium&lt;/strong&gt;: The percentage wage difference earned by copartisan workers relative to unaffiliated co-workers within the same firm-year (and occupation), after controlling for a full set of socio-demographic characteristics. A positive political wage premium is the paper&amp;rsquo;s primary piece of evidence that workers&amp;rsquo; compensating differentials cannot explain political assortative matching, since amenity-based sorting would predict a negative premium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Political promotion premium&lt;/strong&gt;: The differential probability that a copartisan worker is promoted to a higher organizational layer (blue-collar to white-collar, or white-collar to manager) relative to an unaffiliated co-worker hired in the same firm and year, net of demographic controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Educational mismatch (Qualified)&lt;/strong&gt;: An indicator variable equal to one if a worker&amp;rsquo;s educational level meets or exceeds the educational level required by their specific occupation in the CBO (Classificação Brasileira de Ocupações) classification. Used to assess whether politically favored (copartisan) workers are less competent along this observable dimension.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Belief-based discrimination vs. taste-based discrimination&lt;/strong&gt;: Two distinct theoretical channels for employer political discrimination. Belief-based discrimination (Phelps, 1972; Arrow, 1973) occurs when employers perceive copartisans to be more productive — e.g., because shared political views reduce intra-firm conflict. Taste-based discrimination (Becker, 1971) occurs when employers have a direct utility-affecting preference for copartisan workers, independent of productivity beliefs. The paper treats these as observationally distinct from patronage and network overlap, and uses the negative correlation between political homogeneity and firm growth as suggestive evidence favoring the taste-based channel.&lt;/p&gt;</description></item><item><title>Praying for Rain</title><link>https://macropaperwarehouse.com/papers/praying-for-rain/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/praying-for-rain/</guid><description>&lt;p&gt;This paper studies rainmaking as an instrumental religious belief. The central research question is: why do people believe that prayer can bring rain, even though it does not work? The authors develop a model of cultural evolution in which a religious leader prays for rain at an arbitrary time, and people update their beliefs about whether the leader can cause rainfall based on whether rain follows. The key mechanism is the local rainfall hazard function — the probability of rain conditional on how many days have passed since the last rainfall. In environments where the hazard is increasing (rain becomes more likely the longer a drought continues), a leader who prays during a drought will tend to be followed by rain, creating the illusion of efficacy. In environments with a flat or declining hazard, prayer cannot be systematically followed by rain in a persuasive way. The model yields five predictions: rain ritual traditions will select for prayers correlated with rainfall; the level of average rainfall does not determine persuasiveness; constant-hazard environments cannot support persuasive prayer; increasing-hazard environments are more likely to adopt rainmaking; and higher net benefits of rainfall (e.g., settled agriculture) further increase the likelihood of ritual.&lt;/p&gt;
&lt;p&gt;The authors test these predictions with two empirical strategies. First, they use daily data from the Catholic church in Murcia, Spain, covering 1600 to 1836. Church records provide the daily timing of pro pluvia rogations (prayers for rain), while municipal council records — kept independently of the church — record notable rainfall events. Murcia&amp;rsquo;s rainfall hazard is estimated to be increasing after long dry spells: the hazard rate after a long drought is roughly double the hazard rate two months after the last rainfall. The main finding is that a prayer for rain in the last 30 days predicts a 0.144 percentage-point higher daily probability of notable rainfall (standard error 0.057 pp), relative to a baseline mean daily rainfall probability of 0.203 pp — a 71% increase in the predicted probability. Prayer also Granger-causes rainfall conditional on lags of recent rainfall, and the predictive power holds within a given calendar month, ruling out a purely seasonal coincidence.&lt;/p&gt;
&lt;p&gt;Second, the authors construct an original dataset covering rainmaking practice for 1,208 ethnic groups drawn from the Ethnographic Atlas (Murdock, 1967), coded from 370 anthropological sources. They match each ethnic group to its nearest weather station and estimate the rainfall hazard function each group faces in its ancestral location. Of the 1,208 groups, 33% face an increasing rainfall hazard, and 39% of all groups practice rain ritual. The main global finding is that ethnic groups facing an increasing rainfall hazard are 14 percentage points more likely to practice rainmaking (standard error 3.7 pp), relative to a base rate of 30% among groups facing a non-increasing hazard — a 47% increase. This result is robust to continent fixed effects, geographic and climatic controls (longitude, latitude, elevation, distance to coast, ruggedness, mean temperature, mean rainfall, coefficient of variation of rainfall, maximum dry spell length, and the Giuliano-Nunn 2021 climatic variability measure), alternative hazard estimation methods, and linguistic family fixed effects. Crucially, lower average rainfall, longer droughts, and greater climatic variability are not associated with more rain ritual conditional on hazard shape — it is specifically the shape of the hazard function, not aridity or variability per se, that drives adoption.&lt;/p&gt;
&lt;p&gt;A second global finding concerns demand: groups dependent on agriculture are 11 pp more likely to practice rainmaking; those dependent on intensive agriculture, 21 pp more likely; and those dependent on intensive irrigated agriculture, 32 pp more likely (on a base of 32%). The scope of the findings is the pre-modern or traditional period captured by the Atlas; the Murcia case covers 1600–1836. The authors conclude that some environments create an illusion of efficacy that sustains instrumental religious belief through cultural selection, without requiring that believers be irrational.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central theoretical claim about why rainmaking beliefs persist?
A: The paper argues that in environments where the rainfall hazard is increasing during a drought, a leader who begins praying during a dry spell will tend to be followed by rain, because the probability of rain rises as the drought lengthens. People who cannot observe the counterfactual hazard (what rainfall would have been without prayer) interpret this coincidence as evidence that prayer works. Cultural selection then favors leaders whose prayer timing is more persuasive, causing the belief to persist across generations even though prayer does not actually cause rain.&lt;/p&gt;
&lt;p&gt;Q: What is the rainfall hazard function, and why does its shape determine whether prayer can be persuasive?
A: The hazard function h(t) gives the instantaneous probability of rain at time t days after the last rainfall. If the hazard is flat, the probability of rain is the same regardless of whether prayer was offered or not, so there is no systematic correlation between prayer and rainfall to exploit. If the hazard is declining, prayer during a drought will be followed by lower-than-average rainfall probability, undermining the leader. Only if the hazard is increasing does prayer during a long dry spell systematically coincide with a higher probability of rain, creating a persuasive correlation.&lt;/p&gt;
&lt;p&gt;Q: What do Propositions 2 and 3 of the model establish?
A: Proposition 2 establishes that if the hazard rate is constant and a person&amp;rsquo;s prior belief that prayer works is below 0.5, then no prayer start time can persuade them to support the leader. Proposition 3 establishes the converse: if the hazard rate is increasing and the prior is below 0.5, there exists a meaningful belief for which a person will support the leader for any prayer start time. Together these propositions identify the increasing hazard as the necessary and sufficient structural condition for persuasive prayer.&lt;/p&gt;
&lt;p&gt;Q: What is the main quantitative finding from Murcia, and what identification strategy supports it?
A: A prayer for rain in the last 30 days predicts a 0.144 percentage-point higher daily probability of notable rainfall (standard error 0.057 pp) relative to a baseline mean of 0.203 pp, a 71% increase. The authors additionally demonstrate that prayer Granger-causes rainfall conditional on lags of recent rainfall, and that the effect holds within a given calendar month, ruling out the explanation that prayer simply tracks the rainy season. The prayer and rainfall records are kept by independent institutions (church and municipal council), reducing the risk of strategic recording.&lt;/p&gt;
&lt;p&gt;Q: How does the hazard rate in Murcia behave, and does it satisfy the model&amp;rsquo;s key condition?
A: The hazard of rainfall in Murcia is initially high just after rain, declines to a minimum roughly two months after the last rainfall, and then increases significantly thereafter, reaching or exceeding its initial level after a long drought. The fluctuations are large: the hazard after a long dry spell is roughly double the hazard two months after rainfall. This U-shaped pattern means the hazard is increasing during a prolonged drought, satisfying the model&amp;rsquo;s key condition for persuasive prayer.&lt;/p&gt;
&lt;p&gt;Q: How was the global rainmaking dataset constructed, and what is its coverage?
A: The authors used the Ethnographic Atlas (Murdock, 1967) as a template, covering 1,290 ethnic groups, and combed 370 anthropological sources — primarily group-specific ethnographic monographs — to code rainmaking practice for 1,208 groups. A group is coded as practicing rain ritual only if there is clear evidence of a practice specifically intended to bring rain through supernatural means. The authors treat their measure as a lower bound. They find that 39% of the 1,208 groups practice rainmaking, across every settled continent.&lt;/p&gt;
&lt;p&gt;Q: What is the main global regression result and how robust is it?
A: Ethnic groups facing an increasing rainfall hazard are 14 percentage points more likely to practice rain ritual (standard error 3.7 pp) relative to a base rate of 30%, a 47% proportional increase. This coefficient is positive and statistically significant across all specifications, including those adding continent fixed effects, a full battery of geographic and climatic controls (longitude, latitude, elevation, distance to coast, ruggedness, mean temperature, mean rainfall, coefficient of variation of rainfall, maximum dry spell length, and the Giuliano-Nunn 2021 climatic variability measure), alternative hazard estimation methods, linguistic family fixed effects, and restrictions to groups with high-quality rainfall data.&lt;/p&gt;
&lt;p&gt;Q: Does aridity or climatic variability explain rainmaking adoption?
A: No. Lower average rainfall, longer droughts, and greater climatic variability (measured using the Giuliano-Nunn 2021 index) are not associated with more rain ritual practice, conditional on the shape of the hazard function. This rules out the naive hypothesis that people pray for rain simply because they do not get enough, or because their rainfall is unreliable. It is specifically the shape of the hazard — whether it is increasing during a drought — that drives adoption, not the level or volatility of rainfall.&lt;/p&gt;
&lt;p&gt;Q: How does demand for rainfall, proxied by agricultural subsistence, affect rainmaking adoption?
A: Groups dependent on agriculture are 11 percentage points more likely to practice rainmaking relative to other subsistence modes. Groups dependent on intensive agriculture are 21 percentage points more likely, and groups dependent on intensive irrigated agriculture are 32 percentage points more likely, all on a base of 32%. This gradient is consistent with Proposition 5 and 6 of the model: settled, location-specific agricultural investment raises the net benefit of rainfall control, increasing support for rain ritual independently of the persuasion channel.&lt;/p&gt;
&lt;p&gt;Q: What does the model&amp;rsquo;s cultural evolution mechanism (Proposition 4) predict about how prayer timing changes over generations?
A: Proposition 4 states that rituals with high support are more likely to persist. In increasing-hazard environments, random variation in prayer timing means some leaders gain more support than others; those with more persuasive timing are more likely to persist. Each generation then adopts a policy at least as persuasive as the prior generation, so support rises over time and prayers gradually converge toward the timing that maximizes persuasiveness. This mechanism does not require deliberate optimization by any individual leader.&lt;/p&gt;
&lt;p&gt;Q: How does the paper&amp;rsquo;s finding relate to the long-standing anthropological debate between the traditional and revisionist schools on rainmaking?
A: The traditional school (following Frazer 1890) holds that belief is instrumental — people engage in rainmaking to make rain, and belief responds to empirical evidence. The revisionist school (Wittgenstein, Durkheim) argues that religious belief and rationality are fundamentally separate, and religious practice is performative rather than evidence-responsive. The paper&amp;rsquo;s finding that rainmaking is more prevalent precisely where it is more persuasive — i.e., where the environment makes prayer appear to work — supports the traditional, instrumental interpretation that belief responds to evidence of efficacy.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions for the paper&amp;rsquo;s conclusions?
A: The Murcia case study covers the period 1600–1836, ending when the abolition of tithes reduced the church&amp;rsquo;s funding and influence; it applies to a sophisticated Catholic institutional context. The global analysis covers traditional practices of pre-modern ethnic groups as recorded in the Ethnographic Atlas and anthropological literature; it does not speak to modern religious practice or to religions after substantial modernization. The persuasion mechanism requires that people cannot directly observe what rainfall would have been without prayer, a condition satisfied in pre-scientific contexts.&lt;/p&gt;
&lt;p&gt;Rainfall hazard function: In this paper&amp;rsquo;s usage, the function h(t) = f(t)/(1-F(t)) giving the instantaneous probability of rainfall at time t days since the last rainfall. Its shape — whether flat, declining, or increasing during a drought — determines whether prayer can be persuasive, not the overall level of rainfall.&lt;/p&gt;
&lt;p&gt;Increasing hazard: A hazard rate that rises as the length of a dry spell increases, so that rain becomes more likely the longer the drought has continued. The paper defines this specifically as the derivative of the hazard function evaluated at the 99th percentile of spell length. This is the necessary structural condition for prayer to seem efficacious.&lt;/p&gt;
&lt;p&gt;Instrumental religious belief: Belief directed at achieving a worldly outcome (here, rainfall), as opposed to purely expressive or social belief. The paper treats belief as instrumental if it responds to perceived evidence of efficacy and is adopted where it appears to work.&lt;/p&gt;
&lt;p&gt;Persuasion (in the model): The process by which a leader&amp;rsquo;s prayer timing causes people to update their belief that prayer works, by generating a correlation between prayer and subsequent rainfall that exceeds what people expect from the background hazard rate. Persuasion is possible only when the hazard is increasing.&lt;/p&gt;
&lt;p&gt;Pro pluvia rogations: The Catholic church&amp;rsquo;s formal prayers for rain, practiced in Murcia since at least the 14th century. In the paper&amp;rsquo;s data, these prayers follow a pattern of escalation — increasing in number and intensity — during prolonged droughts, consistent with the model&amp;rsquo;s prediction about prayer timing.&lt;/p&gt;
&lt;p&gt;Cultural evolution: The paper&amp;rsquo;s framework (drawing on Henrich 2015) in which religious leaders act as cultural entrepreneurs; leaders whose prayer timing happens to be more persuasive gain greater support and are more likely to survive across generations, so prayer traditions drift toward more persuasive timing without deliberate design.&lt;/p&gt;
&lt;p&gt;Rain ritual (global measure): A binary indicator coded as one for an ethnic group if the anthropological literature contains clear evidence of a practice specifically intended to bring rain through supernatural means, including dances, sacrifices, prayers, and petitioning of rain deities. Treated by the authors as a lower bound on actual prevalence.&lt;/p&gt;</description></item><item><title>Present Bias Amplifies the Household Balance-Sheet Channels of Macroeconomic Policy</title><link>https://macropaperwarehouse.com/papers/present-bias-amplifies-the-household-balance-sheet-channels-of-macroeconomic-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/present-bias-amplifies-the-household-balance-sheet-channels-of-macroeconomic-policy/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Maxted, Laibson, and Moll study fiscal and monetary policy in a partial-equilibrium heterogeneous-agent model in which homeowners have present-biased time preferences (Instantaneous Gratification preferences, the continuous-time limit of quasi-hyperbolic discounting) and naive beliefs, alongside a liquid savings account, an illiquid home, and access to credit card and mortgage debt. Because present bias substantially increases households&amp;rsquo; marginal propensity to consume — in the calibrated model the quarterly MPC rises from 4% under exponential discounting to 14% under present bias, and the quarterly marginal propensity for expenditure (MPX) rises from 13% to 30% — present bias powerfully increases the effect of fiscal stimulus. Present bias also amplifies the overall effect of expansionary monetary policy, but at the same time slows down the speed of monetary transmission: interest rate cuts incentivize households to conduct cash-out refinances, which become targeted liquidity injections to households near the liquidity constraint who have especially high MPCs, but present bias with naive beliefs also introduces a motive for households to procrastinate on refinancing their mortgage, which substantially slows the speed at which this channel operates. A noteworthy feature of the model is that present bias amplifies the direct effect of monetary policy on household consumption while simultaneously delivering larger MPCs — a combination that is in contrast to standard heterogeneous-agent models, where modeling choices that amplify MPCs typically deliver smaller consumption responses to interest rate changes. The calibrated present-biased economy also replicates several empirical regularities that are difficult to match with exponential discounting: high-cost credit card borrowing by homeowners, empirically plausible cash-out behavior and loan-to-value ratios, and refinancing inertia.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core modeling innovation and why is it needed?&lt;/strong&gt;
A: The paper introduces naive Instantaneous Gratification (IG) preferences — the continuous-time limit of quasi-hyperbolic (beta-delta) discounting — into a two-asset heterogeneous-agent model with a liquid savings account and illiquid home equity accessible via mortgage refinancing. The naivete assumption (households do not foresee their own future present bias) is essential because it generates procrastination: naive households perpetually intend to refinance &amp;ldquo;soon&amp;rdquo; but keep delaying. A model with exponential discounting that merely sets parameters to match empirical MPCs would not generate procrastination behavior, and would require implausible interest rate calibrations (very low credit card rates or very high illiquid asset returns) to simultaneously match low liquid wealth accumulation and high credit card borrowing. Present bias with interest rates taken from the data resolves both issues.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the key quantitative MPC results and why do they matter for fiscal policy?&lt;/strong&gt;
A: In the exponential discounting benchmark, the quarterly MPC is 4% and the quarterly MPX (which includes nondurables and durables) is 13%. Under the present-bias benchmark, the MPC rises to 14% and the MPX rises to 30%. The empirical literature estimates quarterly nondurable spending responses on the order of 15%–25%, and total expenditure responses typically two to three times larger, so the present-biased model is substantially more consistent with the data. Because fiscal stimulus (modeled as an unexpected one-time lump-sum payment, financed by a flow income tax) operates through household spending propensities, the higher MPCs and MPXs under present bias directly and powerfully increase the aggregate consumption response to fiscal policy relative to the exponential benchmark.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does present bias amplify the effect of monetary policy?&lt;/strong&gt;
A: Interest rate cuts incentivize households to conduct cash-out refinances — they borrow against accumulated home equity, converting illiquid home equity into liquid wealth. Because this liquidity is targeted to households who are near their borrowing constraint (and thus have especially high MPCs), the aggregate consumption response to a given rate cut is amplified. Crucially, present bias amplifies this channel beyond the exponential benchmark precisely because higher MPCs mean each dollar of liquidity injected generates more consumption. This stands in contrast to the standard result in the heterogeneous-agent literature (Auclert 2019; Olivi 2017; Kaplan, Moll, and Violante 2018) that MPC-amplifying modeling choices reduce the consumption response to interest rate changes because MPC enters the substitution effect with a negative sign in standard one-asset models. The two-asset structure with home equity and the cash-out refinance channel breaks this trade-off.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does present bias slow the speed of monetary transmission?&lt;/strong&gt;
A: Present bias with naive beliefs introduces a motive for households to procrastinate on refinancing their mortgage. Refinancing is an immediate-cost, delayed-reward task: it requires the borrower to spend weeks gathering documents, filling out paperwork, and negotiating with lenders, with benefits (lower mortgage payments or extracted home equity) accruing afterward. Naive present-biased households discount current effort costs very heavily relative to future benefits, so they delay, all the while (counterfactually) believing they will complete the task in the near future. This procrastination substantially slows down the speed at which the cash-out refinance channel of monetary policy operates: even though a rate cut eventually incentivizes households to refinance and extract equity, the timing of that response is stretched out relative to what exponential discounters would do.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the role of naive beliefs versus sophisticated (partially or fully aware) present bias?&lt;/strong&gt;
A: Naivete is necessary to generate procrastination from small effort costs. A fully sophisticated present-biased household (one who correctly anticipates its own future self-control problems) would not indefinitely defer a task it correctly anticipates will keep being deferred. The paper extends the analysis to partial and full sophistication in Online Appendix D.5. The key takeaway is that procrastination — and thus the speed-reduction effect on monetary transmission — is driven by at least partial naivete. The MPC-amplification and fiscal-policy amplification results are more robust across sophistication levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What empirical regularities does the present-biased calibration match that the exponential model cannot easily match?&lt;/strong&gt;
A: The present-biased economy replicates: (1) empirically plausible levels of high-cost credit card debt held simultaneously with home equity (a puzzle under exponential discounting); (2) cash-out behavior and loan-to-value ratios consistent with data; (3) a buildup of liquidity-constrained households consistent with empirical propensities to spend out of credit card limit increases (Gross and Souleles 2002; Agarwal et al. 2018); (4) consumption function discontinuities at the borrowing constraint consistent with Ganong and Noel (2019); (5) MPCs and MPXs that remain elevated for large shocks (Fagereng, Holm, and Natvik 2021); (6) the intertemporal MPC profile consistent with Auclert, Rognlie, and Straub (2018); (7) differential MPCs out of liquid versus illiquid transfers (Ganong and Noel 2020); and (8) refinancing inertia — the proclivity for households to delay refinancing when financially optimal (Keys, Pope, and Pope 2016; Johnson, Meier, and Toubia 2019; Andersen et al. 2020).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the model&amp;rsquo;s scope — what does it abstract from?&lt;/strong&gt;
A: The model is set in partial equilibrium, so general equilibrium effects (e.g., endogenous interest rate responses, aggregate demand externalities) are not captured; the authors describe their results as inputs for a fuller general equilibrium analysis. The model focuses on homeowners (two-thirds of U.S. housing units), abstracting from renters. House prices are fixed (consistent with their slow movement over short horizons), with an extension to house price shocks in Online Appendix D.2.1. The model does not allow for home equity lines of credit, second mortgages, or reverse mortgages, because these products are more commonly used when interest rates are rising, and the paper focuses on the stimulative effect of rate cuts. The interest rate in the model is a long rate (e.g., 10-year TIPS), with the implicit assumption that the Federal Reserve implements the necessary short-rate adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the present-biased model compare to the standard HANK picture on the monetary-MPC trade-off?&lt;/strong&gt;
A: In standard one-asset heterogeneous-agent models, a household&amp;rsquo;s MPC is a sufficient statistic that enters the substitution effect of interest rate changes with a negative sign — so modeling choices that raise MPCs reduce monetary policy effectiveness. The present-biased two-asset model breaks this result: because interest rate cuts trigger cash-out refinances that inject liquidity targeted to high-MPC households near the constraint, higher MPCs translate into larger, not smaller, aggregate consumption responses to monetary policy. Present bias therefore simultaneously amplifies fiscal policy (via higher MPCs) and amplifies the overall effect of monetary policy (via the targeted liquidity channel), while introducing the procrastination-driven speed reduction as the offsetting cost.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Present bias (Instantaneous Gratification preferences):&lt;/strong&gt; The paper uses &amp;ldquo;present bias&amp;rdquo; to refer to quasi-hyperbolic discounting. In the continuous-time limit (Instantaneous Gratification, or IG, preferences, following Harris and Laibson 2013), the current self discounts all future selves by factor β &amp;lt; 1, while exponential discounting of the future (rate ρ) applies from any future vantage point. This creates a discontinuity in the discount function at t = 0 whenever β &amp;lt; 1. Setting β = 1 recovers standard exponential discounting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Naive beliefs:&lt;/strong&gt; Households do not foresee their own future present bias. The current self believes all future selves will be exponential discounters (β = 1), even though this belief is incorrect. Naivete is what transforms present bias into procrastination: the household perpetually expects its future self to complete effortful tasks, but each future self faces the same bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cash-out refinance channel:&lt;/strong&gt; When market interest rates fall, households have an incentive to refinance their fixed-rate mortgage, locking in a lower interest rate. If the household has accumulated home equity (illiquid), it can simultaneously borrow against that equity — a cash-out refinance — converting illiquid home equity into liquid wealth. In the model, this acts as a targeted liquidity injection to households near their borrowing constraint (who have high MPCs), amplifying the aggregate consumption response to rate cuts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procrastination motive:&lt;/strong&gt; Present bias introduces a motive to procrastinate on immediate-cost, delayed-reward tasks such as mortgage refinancing. The effort and paperwork costs of refinancing are borne immediately, while the financial benefits accrue over time. A naive present-biased household heavily discounts the current effort cost relative to future benefits, leading it to defer refinancing repeatedly. This substantially slows the speed at which the cash-out refinance channel of monetary policy operates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal propensity to consume (MPC) vs. marginal propensity for expenditure (MPX):&lt;/strong&gt; The paper distinguishes the quarterly MPC (response of nondurable consumption to a one-unit cash transfer) from the quarterly MPX (which also includes durables). Under exponential discounting, MPC = 4% and MPX = 13%; under the present-bias benchmark, MPC = 14% and MPX = 30%. The higher MPXs are more consistent with empirical estimates (quarterly nondurable responses of 15%–25%; total spending responses two to three times larger).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Refinancing inertia:&lt;/strong&gt; The empirical regularity that households delay mortgage refinancing even when it is financially optimal to do so. The paper provides a theoretical foundation for this behavior through the procrastination motive generated by naive present bias combined with the small effort cost of refinancing.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online published version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;</description></item><item><title>Professional Motivations in the Public Sector: Evidence from Police Officers</title><link>https://macropaperwarehouse.com/papers/professional-motivations-in-the-public-sector-evidence-from-police-officers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/professional-motivations-in-the-public-sector-evidence-from-police-officers/</guid><description>&lt;p&gt;This paper studies how public sector workers balance professional motivations against private economic concerns, using arrest decisions by Dallas Police Department (DPD) officers as the empirical laboratory. The central institutional feature exploited is that arrests made near the end of an officer&amp;rsquo;s shift typically require the officer to stay and work overtime, generating private costs that must be weighed against the professional benefits of making an arrest (e.g., crime reduction or duty fulfillment). The paper further leverages variation from DPD&amp;rsquo;s &amp;ldquo;secondary employment&amp;rdquo; program: approximately 30% of officers held a registered second job at some point during 2019–2021, and on days when a second job is scheduled after the police shift, the opportunity cost of late-shift policing is higher.&lt;/p&gt;
&lt;p&gt;The data cover all DPD arrests from January 2015 to December 2021, linked to officer shift assignments, charge types, prosecutorial outcomes (whether the Dallas County Attorney chose to prosecute), and second-job schedules. The sample excludes traffic violations and arrests without shift information. The authors observe wide variation in prosecution rates by charge type: drug and gang offenses exceed 70%, property and violent crimes run 30–50%, and minor charges fall below 20%.&lt;/p&gt;
&lt;p&gt;Four main findings emerge. First, arrest rates fall sharply in the last 30–40 minutes of a shift, with the decline most pronounced for drug and gang charges (approximately 50% drop in arrest rate) and smallest for violent charges, consistent with officers having more discretion over the former. Second, arrests that do occur late in the shift are of higher quality: conditional on being made, they are approximately 1.5–2.5 percentage points more likely to result in prosecution than arrests made earlier, with the quality premium larger in more discretionary charge categories (drugs/gang &amp;gt; property &amp;gt; violent). Third, on days when an officer has a second job scheduled, arrest rates are lower by roughly 5–10% relative to baseline across the full shift, with effects concentrated in the second half; and the conditional probability of prosecution on those days is 1–2 percentage points higher than on non-second-job days. The second-job effect appears even earlier in the shift than the overtime effect alone, consistent with the second job magnifying the opportunity cost mechanism.&lt;/p&gt;
&lt;p&gt;Fourth, the authors estimate a dynamic structural model of the arrest decision. At each moment of the shift the officer chooses whether to arrest, trading off a professional benefit b_p against a private cost c(t, secondjob) that rises when overtime begins and rises further on second-job days. Structural estimates indicate the overtime cost is large enough to reduce the expected professional value of an arrest in the final 30 minutes of the shift by roughly 20–30%. The additional second-job cost reduces expected professional value by a further 10–20%. Counterfactual simulation implies that eliminating the overtime cost would increase overall arrests by approximately 5–8%, a magnitude the authors describe as economically significant. Welfare analysis shows that the desirability of high overtime costs depends on whether citizens weight quantity of arrests or quality: under quality-weighted preferences the current overtime-cost regime may be socially optimal because officers self-select toward arrests they perceive as likely to result in prosecution; under quantity preferences, reducing overtime costs would increase police activity.&lt;/p&gt;
&lt;p&gt;The identification strategy relies on within-officer variation in second-job scheduling, absorbing officer fixed effects (and officer-by-month fixed effects in robustness checks) and time fixed effects. The key identifying assumption is that second-job days are not systematically assigned to low-crime or low-patrol days. Supporting evidence includes balance tests showing second-job status is uncorrelated with local crime call patterns conditional on fixed effects, and the observation that officers who take second jobs do not exhibit a systematically different enforcement style (measured by arrest patterns across the shift) relative to officers who do not.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are from a single medium-sized urban police department (approximately 3,000 officers) in Dallas, Texas, a city described as diverse by race, income, and political affiliation. The department is 29% Black, 43% Hispanic, 27% White, and 15% female. Generalizability to other jurisdictions or institutional structures is not established by this study.&lt;/p&gt;
&lt;p&gt;Q: What is the main research question?
A: The paper asks how public sector workers balance professional motivations (e.g., crime reduction, duty fulfillment) against private economic concerns (e.g., overtime costs, opportunity costs from second jobs). It uses police arrest decisions as the empirical setting because the shift-end timing of arrests generates a clear, observable private cost that varies within officer across days.&lt;/p&gt;
&lt;p&gt;Q: What is the key institutional feature that generates identification?
A: Arrests made near the end of a shift typically require the arresting officer to stay past the shift and work overtime. This creates a personal cost — more time, delayed transition to off-duty activities — that makes late-shift arrests more costly without changing their professional value. The DPD secondary employment program adds a second source of variation: on days when an officer has a registered second job scheduled after the police shift, the opportunity cost of any arrest (and especially a late-shift arrest) is higher.&lt;/p&gt;
&lt;p&gt;Q: How large is the drop in arrest rates near shift end?
A: The baseline arrest rate declines by approximately 0.12 percentage points per six-minute time bucket in the last 30 minutes of the shift, or about 5% relative to the mean arrest rate of 2.3 percentage points. The drop is most dramatic for drug and gang charges, where the arrest rate falls by approximately 50%, and smallest for violent charges, where officers appear to arrest regardless of shift timing.&lt;/p&gt;
&lt;p&gt;Q: How does arrest quality change near shift end?
A: Arrests made in the last 30 minutes of a shift are approximately 1.5–2.5 percentage points more likely to result in prosecution than arrests made earlier in the shift, after controlling for charge type composition and officer fixed effects. The quality premium is larger in more discretionary charge categories (drugs/gang, then property, then violent), consistent with officers becoming more selective to avoid overtime costs on arrests unlikely to result in prosecution.&lt;/p&gt;
&lt;p&gt;Q: Does the shift-end drop reflect officer fatigue or overtime cost?
A: The paper argues both pieces of evidence point to overtime cost rather than fatigue alone. First, arrest rates increase sharply after the official shift end when the officer is already earning overtime pay — if fatigue were the mechanism, arrests would also decline post-shift. Second, on second-job days arrest rates fall earlier in the shift and by more, consistent with higher opportunity costs rather than accumulated fatigue.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of having a second job scheduled on arrest rates?
A: Having a second job scheduled reduces arrest rates by roughly 5–10% relative to the baseline across the full shift, with effects concentrated in the second half. The reduction is even larger in the final 30 minutes, consistent with the second job amplifying the overtime cost mechanism.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of second-job days on arrest quality?
A: Arrests made on second-job days are 1–2 percentage points more likely to result in prosecution compared to arrests on non-second-job days, after controlling for time of day, charge type composition, and officer fixed effects. This parallels the shift-end quality effect and is consistent with officers applying higher selectivity thresholds when opportunity costs are elevated.&lt;/p&gt;
&lt;p&gt;Q: How is the second-job variation used for identification?
A: The main specification compares the same officer&amp;rsquo;s behavior on shifts where a second job is scheduled versus shifts where it is not, absorbing officer fixed effects and time fixed effects. The identifying assumption is that second-job scheduling is uncorrelated with unobservable determinants of enforcement intensity conditional on fixed effects. The authors support this with balance tests showing second-job status is not predicted by lagged activity measures or contemporaneous crime call patterns.&lt;/p&gt;
&lt;p&gt;Q: What does the dynamic structural model add?
A: The structural model formalizes the arrest decision as a dynamic problem where the officer compares the professional benefit b_p of an arrest to the private cost c(t, secondjob), which rises discontinuously when overtime begins and rises further on second-job days. Estimating the model by matching moments (baseline arrest rates, shift-timing patterns, quality changes, second-job effects) yields preference parameters. The model enables counterfactual and welfare analysis that the reduced-form estimates alone cannot provide.&lt;/p&gt;
&lt;p&gt;Q: What are the structural estimates of overtime and second-job costs?
A: The overtime cost c_ot is estimated to be large enough that arresting someone in the final 30 minutes of the shift reduces the expected professional value of that arrest by roughly 20–30%. The additional second-job cost c_sj reduces expected professional value by a further 10–20%. Both estimates are described as statistically precise.&lt;/p&gt;
&lt;p&gt;Q: What does the counterfactual removal of overtime costs imply for arrests?
A: Eliminating the overtime cost is estimated to increase overall arrests by approximately 5–8%, which the authors characterize as economically significant. This implies that officers&amp;rsquo; private costs have a first-order impact on the quantity of law enforcement activity.&lt;/p&gt;
&lt;p&gt;Q: What does the welfare analysis conclude about overtime costs?
A: The welfare effect of eliminating overtime costs depends on citizen preferences. Under quality-weighted preferences — where citizens value the probability that an arrest results in prosecution — the current overtime-cost regime may be socially optimal because it induces officers to self-select toward arrests they perceive as likely to stick. Under quantity preferences — where citizens value the total number of arrests per period — reducing overtime costs would increase police activity and benefit citizens.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions of the study?
A: The study is conducted entirely within the Dallas Police Department, a single medium-sized urban department with approximately 3,000 officers. Dallas is described as a diverse city by race, income, and political affiliation, and the department itself is relatively diverse (29% Black, 43% Hispanic, 27% White, 15% female). The findings may not generalize to departments with different overtime rules, labor contracts, or institutional cultures.&lt;/p&gt;
&lt;p&gt;professional motivations: The non-pecuniary benefits officers derive from making arrests, such as crime reduction, duty fulfillment, or the legitimacy of their work; modeled as a professional benefit b_p that motivates arrest independent of financial compensation.&lt;/p&gt;
&lt;p&gt;private costs of arrest: The personal costs borne by officers when making an arrest, chiefly the overtime cost when an arrest extends the shift past its scheduled end, and the opportunity cost on days when a second job is scheduled. These costs are distinct from professional motivations and respond to economic incentives.&lt;/p&gt;
&lt;p&gt;arrest quality: The conditional probability that an arrest results in prosecution by the Dallas County Attorney&amp;rsquo;s office; used as a revealed-preference measure of the officer&amp;rsquo;s assessment of arrest strength. Higher arrest quality near shift end reflects greater selectivity under elevated private costs.&lt;/p&gt;
&lt;p&gt;secondary employment (second job): A formal DPD program allowing officers to register as certified police officers for private security work after their primary shift. Approximately 30% of DPD officers held a second job at some point during 2019–2021. The scheduled second job raises the opportunity cost of late-shift primary-shift arrests and provides a second source of variation in private costs.&lt;/p&gt;
&lt;p&gt;overtime cost: The cost incurred when an arrest requires an officer to remain past the end of the scheduled shift to complete paperwork and processing. Modeled as c_ot per period spent in overtime, this cost is the primary mechanism reducing late-shift arrest rates and increasing arrest selectivity.&lt;/p&gt;
&lt;p&gt;dynamic model of arrest decisions: A structural model in which officers decide each moment whether to arrest, balancing professional benefit against private cost as a function of shift timing and second-job status. Estimated by minimum distance on moments from the data; used to recover preference parameters and conduct counterfactual welfare analysis.&lt;/p&gt;</description></item><item><title>Professional survey forecasts and expectations in DSGE models</title><link>https://macropaperwarehouse.com/papers/professional-survey-forecasts-and-expectations-in-dsge-models/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/professional-survey-forecasts-and-expectations-in-dsge-models/</guid><description>&lt;p&gt;This paper asks whether Survey of Professional Forecasters (SPF) data can be efficiently integrated into medium-scale DSGE models, and whether models with imperfectly rational expectations based on Adaptive Learning (AL) outperform the standard Rational Expectations (RE) hypothesis when survey forecasts are used as observables. The authors work with quarterly US data spanning 1981q2–2019q2, using the Philadelphia Fed Real-Time Data Set (first and second releases) alongside SPF nowcasts for inflation, consumption, investment, and output growth. The SPF nowcast is defined as a prediction formed in the middle of period t+1 for period t+1 given information for period t, making it a suitable proxy for the model-based expectation E_t y_{t+1}.&lt;/p&gt;
&lt;p&gt;The core methodological contribution is a re-specification of structural shocks into persistent (AR) and transitory (i.i.d.) components. For the risk premium, investment-specific technology, government spending, and markup shocks, each shock is decomposed into two independent innovations, yielding 12 total structural innovations. A reduced-form VAR exercise motivates this: SPF nowcast innovations explain 19–33% of the 5-year forecast error variance of the macro variables and 44–71% of the variance of the nowcasts themselves. The 1-quarter RMSFE of the baseline RE model without SPF is 1.10 for inflation, 1.26 for consumption, 1.19 for investment, and 1.26 for GDP — all significantly exceeding the SPF RMSFEs of 0.21, 0.43, 1.49, and 0.35.&lt;/p&gt;
&lt;p&gt;Log marginal likelihood improves monotonically as shocks are progressively re-specified: baseline RE (–577.37), RE with two-component markups (RE_mu, –536.63), adding real shocks stepwise (–473.29, –410.84), and finally all shocks (RE_all, –385.07). RE_all matches or beats SPF 1-quarter forecast accuracy (RMSFE ratio to SPF of 1.00 for inflation and investment; beats SPF for consumption growth), and Diebold-Mariano tests show no significant difference from SPF up to 5 quarters ahead. The paper further shows that once this two-component structure is imposed, exogenous sentiment shocks become unnecessary: RE_all (–385.07) outperforms RES_all (–388.17), and the RE model with all real shocks re-specified but without sentiment decisively dominates.&lt;/p&gt;
&lt;p&gt;Three AL belief specifications are then estimated: MSVflex (full RE information set with an independently and rapidly updating constant, posterior autocorrelation 0.9937 — nearly a random walk), RBflex (restricted information set augmented with shock innovations, with meaningful time-variation of belief coefficients at rho_AL = 0.87), and HBflex (agents switch between MSV and RB based on past forecasting performance; average RB weight 0.34, weight sensitivity delta = 4.77). All AL models outperform RE_all: MSVflex (–381.38), HBflex (–355.09), RBflex (–351.59), with RB and HB yielding the largest gains particularly during and after the Great Financial Crisis.&lt;/p&gt;
&lt;p&gt;AL models address three specific RE limitations. First, trend breaks: the ALM constant tracks persistent deviations, with ALM constants for consumption and investment successfully picking up rising macroeconomic trends in earlier sub-periods, yielding superior long-term forecasts. Second, time-varying transmission: the RB model generates cyclical volatility that stays lower in normal times and rises during distress, reducing reliance on large persistent investment-technology shocks relative to RE. Third, predictability of forecast errors: the RE model&amp;rsquo;s investment forecast inherits the SPF underreaction (b-coefficient 0.72, p &amp;lt; 0.001), while RBflex and HBflex reduce this to 0.17 and 0.34 respectively, both statistically insignificant.&lt;/p&gt;
&lt;p&gt;On an extended sample including the Covid recession, the RBflex model underperforms because its restricted information set cannot handle abrupt complex dynamics; MSVflex and HBflex continue to perform well, with the MSV regime dominating in the HB model during Covid and post-Covid periods. Scope conditions: the dataset is US, 1981q2–2019q2 for baseline estimation; the predictability (underreaction) problem is confirmed only for investment SPF, not for inflation, consumption, or GDP growth in this sample.&lt;/p&gt;
&lt;p&gt;Q1: What is the SPF nowcast, and why do the authors treat it as a proxy for model-based expectations?
The SPF nowcast is defined as a prediction formed in the middle of quarter t+1 for the value of a variable in quarter t+1, conditional on information available through quarter t. Because agents are assumed to make decisions for period t and form expectations for t+1 based on information through t, this timing aligns precisely with the model-based conditional expectation E_t y_{t+1}. The authors use first-release data (r1) and the SPF nowcast (f0) both published in the course of t+1 as measurement variables, with the Kalman filter recovering implied structural shocks.&lt;/p&gt;
&lt;p&gt;Q2: How large is the informational content of SPF nowcasts in reduced-form analysis?
A 7-variable Cholesky VAR places each SPF series last, so the survey innovation is orthogonal to standard macro variables by construction. The 5-year forecast error variance decompositions show SPF nowcast shocks explain 19% of inflation variance, 33% of consumption variance, 33% of investment variance, and 29% of GDP variance (Table 1). The nowcasts themselves are explained 44–71% by their own innovations. SPF nowcasts also substantially outperform the baseline RE model: the RE model without SPF produces RMSFE ratios of 1.10 for inflation, 1.26 for consumption, 1.19 for investment, and 1.26 for GDP relative to SPF (all statistically significant by Diebold-Mariano test).&lt;/p&gt;
&lt;p&gt;Q3: What is the shock re-specification, and why is it necessary to exploit survey data?
The Smets-Wouters (2007) ARMA(1,1) shock structure conflates the transitory and persistent innovation into a single disturbance, making it impossible for the Kalman filter to separately attribute high-frequency and low-frequency movements. The re-specification splits each shock b_t into a persistent component b_t^ar (driven by epsilon^bar with persistence rho_b) and an i.i.d. transitory component b_t^iid (driven by epsilon^biid), yielding 12 total structural innovations. This allows survey nowcasts — which are forward-looking — to identify the persistent component separately from the transitory one. Without this, marginal likelihood improvements are far smaller (RE: –577 vs. RE_all: –385).&lt;/p&gt;
&lt;p&gt;Q4: Does re-specification of real shocks render exogenous sentiment shocks redundant?
Yes. Models with standard real shock processes but exogenous sentiment shocks (RES: –477.88; RES_mu: –488.96) do fit substantially better than models without sentiment (RE: –577.37; RE_mu: –536.63), confirming Milani&amp;rsquo;s (2017) result. However, once the two-component real shock structure is introduced, RE_all (–385.07) outperforms RES_all (–388.17) and the estimated sentiment shocks become small and explain little of the business cycle. The fundamental shock re-specification subsumes what sentiment shocks were previously capturing.&lt;/p&gt;
&lt;p&gt;Q5: How do AL models compare to RE in terms of model fit?
All three AL models outperform RE_all: MSVflex (–381.38, improvement of 3.69 log-likelihood units), HBflex (–355.09, improvement of 29.98 units), RBflex (–351.59, improvement of 33.48 units). The RB and HB specifications, which assume more severe deviation from RE with restricted information sets and time-varying transmission, achieve the largest gains. The MSV improvement accumulates gradually, concentrating in the late 1990s and 2000s, while RB shows sustained improvement in the 1980s and mid-1990s and performs exceptionally well during and after the GFC.&lt;/p&gt;
&lt;p&gt;Q6: How does the AL mechanism handle macroeconomic trend shifts?
Under RE with fixed coefficients, expectations anchor around a constant steady state, so persistent deviations from trend generate systematic forecast errors. Under AL, the ALM constant mu_t in the Actual Law of Motion evolves over the business cycle. In the MSVflex model, the autocorrelation parameter for the constant is estimated at 0.9937 (posterior mean), making it nearly a random walk that can track long-lasting trends. ALM constants for consumption and investment in the MSV setup successfully pick up rising macroeconomic trends in earlier sub-periods, translating into superior longer-term forecast performance relative to RE.&lt;/p&gt;
&lt;p&gt;Q7: How does the RB model generate time-varying volatility, and why does this matter for investment dynamics?
In RBflex, as beliefs are revised via the Kalman filter, the sensitivity of expectations and realized variables to shocks changes over the business cycle. The model generates cyclical volatility that remains lower in normal times and rises during distress — a realistic pattern absent from RE models. Consequently, RB does not need to rely as heavily on large persistent risk premium and investment-specific technology shocks: average volatility of these processes in the RB model does not increase in the last sub-period and remains generally lower across the whole sample, in contrast to RE&amp;rsquo;s behavior during the GFC. The RB model also shows a 3-times-smaller estimated measurement error in the investment SPF equation relative to the AL specification without restricted beliefs.&lt;/p&gt;
&lt;p&gt;Q8: What happens to predictability of model-based forecast errors under AL versus RE?
Using the Coibion-Gorodnichenko (2015) regression of forecast errors on forecast revisions, the RE model&amp;rsquo;s investment forecast shows a b-coefficient of 0.72 (p &amp;lt; 0.001), inheriting the underreaction documented in SPF investment data (b = 0.49, p = 0.006). AL models break this inheritance: RBflex ALM b-coefficient for investment is 0.17 (not statistically significant) and HBflex is 0.34 (not statistically significant). AL models achieve this because they relax the RE constraint of internal consistency between agents&amp;rsquo; and model forecasts, allowing the ALM to generate efficient forecasts even when agent PLMs display sluggish adjustment.&lt;/p&gt;
&lt;p&gt;Q9: How do the models perform during the Covid recession?
The RBflex model does not perform optimally on the extended sample including the Covid recession. The authors attribute this to the restricted information set in the RB PLM being insufficient to describe the abrupt, complex macroeconomic dynamics of the Covid crisis. The MSVflex and HBflex models continue to perform well. In the HBflex model, the MSV regime naturally dominates during the Covid and post-Covid periods, while the RB regime had been more prominent between recessions in the pre-Covid sample.&lt;/p&gt;
&lt;p&gt;Q10: What is the role of heterogeneous beliefs, and how do agents switch between PLMs?
In HBflex, expectations are a weighted average of MSV and RB predictions with weights evolving as a function of past belief forecast errors. The weight sensitivity parameter is estimated at delta = 4.77, indicating weights are relatively sensitive to fitness. The average estimated weight on the RB PLM is 0.34 (MSV receives 0.66 on average). The RB weight tends to increase and reach its highest values between recessions, consistent with the restricted model being more parsimonious and useful in stable periods, while the fuller MSV model dominates in high-volatility episodes such as the Covid recession.&lt;/p&gt;
&lt;p&gt;Q11: What are the out-of-sample forecasting results?
The out-of-sample evaluation covers 2008q1–2019q2. The RB model outperforms the RE model in predicting investment and interest rate dynamics, and for investment it also outperforms professional forecasters during this period. At longer horizons (up to 5 quarters ahead), RE model forecasts are generally not statistically significantly different from SPF predictions once SPF nowcasts are included as observables, suggesting that observing the SPF data is sufficient to capture the most informative content from surveys for longer-horizon predictions.&lt;/p&gt;
&lt;p&gt;Q12: What is the relationship to Milani (2017) and the prior literature on sentiment shocks?
Milani (2017) found that exogenous sentiment shocks orthogonal to fundamentals were needed to fit SPF forecasts alongside an AL model and explained a significant portion of US business cycle fluctuations. The current paper shows this result is not robust to re-specifying fundamental shocks into persistent and transitory components: once the two-component structure is introduced, sentiment shocks become small and economically unimportant (RES_all at –388.17 versus RE_all at –385.07). What Milani attributed to sentiment was largely capturing the inability of single-innovation shocks to separately account for high-frequency and low-frequency variance.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;SPF Nowcast as proxy for model expectations: The Survey of Professional Forecasters&amp;rsquo; nowcast is defined as a prediction formed in the middle of quarter t+1 for the value of a variable in that same quarter, conditional on information available through quarter t. This timing makes it directly comparable to the model-based conditional expectation E_t y_{t+1}, so the SPF nowcast can be added to the DSGE model&amp;rsquo;s observable set with a straightforward measurement equation linking it to model expectations plus i.i.d. measurement error.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Shock re-specification into persistent and transitory components: Each structural shock (risk premium, investment-specific technology, government spending, and markup shocks) is decomposed into an AR(1) persistent component driven by epsilon^bar and an i.i.d. transitory component driven by epsilon^biid, replacing the ARMA(1,1) specification in Smets-Wouters (2007) that conflates both into a single innovation. This decomposition is the key technical device enabling survey data to separately identify low-frequency and high-frequency sources of volatility.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Adaptive Learning (AL): An expectation-formation mechanism in which agents do not know true model parameters and instead estimate linear forecasting models (PLMs) that are updated each period via a Kalman filter algorithm. This produces a time-varying Actual Law of Motion — transmission parameters mu_t, T_t, R_t all evolve with beliefs — enabling endogenous trend drift and time-varying shock responses absent from RE models with fixed coefficients.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Minimum State Variable (MSV) beliefs with flexible constant: An AL specification in which agents use the same endogenous state variables and shocks as in the RE solution but with the constant term updated at an independent, more rapid rate. The constant&amp;rsquo;s autocorrelation is estimated at 0.9937, making it nearly a random walk capable of tracking persistent macroeconomic trend deviations from the deterministic steady state.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Restricted Beliefs (RB): An AL specification in which each agent&amp;rsquo;s PLM uses a reduced information set — autoregressive terms of the forward-looking variable augmented with selected shock innovations — rather than the full RE state space. This more severe departure from RE yields the largest marginal-likelihood gain over RE_all, generates realistic cyclical volatility amplification, and produces a 3-times-smaller measurement error for investment SPF, but underperforms during the Covid recession due to the restricted set&amp;rsquo;s inability to handle abrupt complex dynamics.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Heterogeneous Beliefs (HB): An AL specification in which agents may switch between MSV and RB PLMs as a weighted average, with weights evolving as a function of past belief forecast errors. The average weight on RB is 0.34 and the weight sensitivity delta is estimated at 4.77; the RB weight tends to be highest between recessions and lowest during high-volatility episodes such as the Covid recession when the fuller MSV information set dominates.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;FIRE predictability test (Coibion-Gorodnichenko regression): Under Full Information Rational Expectations, the regression of forecast errors on forecast revisions should yield a b-coefficient of zero. A positive and significant b indicates systematic underreaction to news. The paper confirms b = 0.49 (p = 0.006) for investment SPF — but not for inflation, consumption, or GDP — and shows the RE model inherits this inefficiency (b = 0.72, p &amp;lt; 0.001 for investment), while AL models reduce it to insignificance (RBflex: 0.17; HBflex: 0.34).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Quantifying Supply-Side Climate Policies</title><link>https://macropaperwarehouse.com/papers/quantifying-supply-side-climate-policies/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/quantifying-supply-side-climate-policies/</guid><description>&lt;p&gt;This paper asks three questions about supply-side climate policies in the oil market: how do oil companies respond to production-based taxes; what are the aggregate effects of such taxes on global CO2 emissions; and what are the distributional consequences across consumers, producers, and governments? The study addresses a gap in empirical evidence at a time when supply-side restrictions on fossil fuel production are gaining policy traction but the quantitative literature remains limited.&lt;/p&gt;
&lt;p&gt;The authors use proprietary company-level data from Rystad Energy&amp;rsquo;s UCube database covering 49,023 oil assets across 84 countries representing 98.1% of global oil production from 2000 to 2019. They identify 84 production tax reforms (54 increases, 30 decreases) with an average magnitude of roughly 5–6 percentage points. The empirical strategy is a difference-in-differences design that compares a company&amp;rsquo;s activity in a treated tax regime before and after a reform to the same company&amp;rsquo;s activity in other regimes over the same period, absorbing company-tax regime fixed effects, company-year fixed effects, and region-year fixed effects. This within-company cross-border comparison is used to test for, and rule out, activity-shifting spillovers. Two-stage least squares instruments the after-tax oil price with production taxes to isolate tax-driven price variation.&lt;/p&gt;
&lt;p&gt;The primary behavioral margin is exploration: a one-percentage-point increase in the production tax rate reduces exploration expenditure by 2.6% on average over the study period, growing to 4.1% beyond five years. The elasticity of exploration with respect to the after-tax oil price is 1.96. Reduced exploration translates into fewer discoveries; a one-percentage-point tax increase reduces discovered oil amounts by 4.3% on average and by 8.9% beyond five years. The authors find no statistically significant effect of taxes on production from existing conventional fields, consistent with high adjustment costs for already-producing wells. Unconventional production (shale, oil sands, tar sands) exhibits a statistically significant intensive-margin production response to taxes. Taxes also have no detectable effect on the extraction cost of newly discovered deposits, indicating that firms do not redirect search toward lower- or higher-cost deposits at the margin.&lt;/p&gt;
&lt;p&gt;Translating these firm-level responses into market outcomes, the authors build a dynamic field-level model spanning 2020–2100, combining field-by-field production profiles calibrated from Rystad data with demand elasticities of −0.2 and −0.5 drawn from the literature. The existing average production-weighted royalty of 21% already implies an indirect carbon price of approximately $32/tCO2 at a reference oil price of $65/barrel, an order of magnitude above the current global average demand-side carbon price of $3.1/tCO2.&lt;/p&gt;
&lt;p&gt;Under a permanent global climate royalty surcharge of 20 percentage points, annual emissions from oil fall by 5–7% in the first five years and by 9–20% in the medium term (by year 2100). The cumulative reduction over 2020–2100 is 85–161 GtCO2, or 1.0–2.0 GtCO2 per year on average. The oil price rises by $8–14/bbl initially and by $23–27/bbl by year 2100. Tax revenue to oil-producing governments increases by $590–870 billion per year; consumer surplus falls by roughly $500–730 billion per year; producer surplus falls by $270–310 billion per year. The policy breaks even in direct economic terms at a social cost of carbon of $72–84/tCO2.&lt;/p&gt;
&lt;p&gt;When the surcharge is adopted only by OECD countries (30% of current production, 49% of global exploration), short-term carbon leakage is 16–37%, rising to 58–82% by year 2100 as non-OECD producers increase exploration and development in response to the higher oil price. Net cumulative global emission reductions under the OECD-only scenario are 54–107 GtCO2 (47–73% of what the OECD reduction alone would achieve), roughly two-thirds of the global scenario outcome.&lt;/p&gt;
&lt;p&gt;Q: What is the primary behavioral margin through which oil companies respond to production taxes?
A: The primary margin is exploration expenditure. A one-percentage-point increase in the production tax rate reduces exploration by 2.6% on average across the study period, growing to 4.1% in the period six to twenty years after the reform. The after-tax oil price elasticity of exploration is 1.96, meaning a 1% increase in the after-tax price raises exploration by approximately 2%. The Poisson regression, which accounts for firms with zero exploration in a regime, yields consistent results, indicating the finding is not driven by firm entry or exit.&lt;/p&gt;
&lt;p&gt;Q: Do production taxes affect output from existing oil wells?
A: For conventional oil fields, the production response is statistically indistinguishable from zero across all specifications and time horizons, consistent with high adjustment costs making already-producing conventional wells insensitive to tax-driven price changes. Unconventional production (shale oil, oil sands, tar sands, extra heavy oil) is the exception, exhibiting a statistically significant intensive-margin production response to taxes. This asymmetry aligns with Bjørnland et al. (2021), who find that unconventional production is more price-sensitive than conventional production.&lt;/p&gt;
&lt;p&gt;Q: Do taxes affect the cost profile of newly discovered deposits?
A: No. The paper finds no statistically significant effect of production tax changes on the extraction cost of newly discovered fields, across all specifications and time horizons. This implies that, at the margin, firms do not redirect exploration toward lower-cost or higher-cost deposits in response to taxes; the volume and cost distribution of new discoveries are therefore treated as invariant to the tax regime in the quantitative model.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address potential activity-shifting spillovers across countries?
A: The paper directly tests for spillovers by including both the own-regime tax rate and the company&amp;rsquo;s exploration-weighted average tax rate abroad as regressors; the foreign average tax rate has no statistically significant effect on domestic exploration. The analysis is also repeated restricting to small companies operating in two or fewer countries, where spillovers would be most pronounced; the null result on spillovers holds. Dropping these small companies from the main sample leaves the primary estimates unchanged.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address the potential endogeneity of tax reforms?
A: The event study plots show no statistically significant pre-trends before reforms, supporting the parallel trends assumption. The paper also finds no significant correlation between tax reforms and observable oil-sector or macroeconomic variables in the pre-period. Subsamples minimizing lobbying concerns — private (non-national) oil companies, small companies, companies without pre-existing production in the country, and non-OPEC countries — all yield similar estimates, suggesting that large incumbents&amp;rsquo; influence over tax-setting does not drive the findings.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle the staggered difference-in-differences design?
A: To address potential bias from heterogeneous and dynamic treatment effects in a two-way fixed effects framework, the paper implements a stacked regression following Cengiz et al. (2019), constructing 18 cohort-specific datasets using never-treated countries as controls. The stacked specification yields significant effects on exploration and discoveries and null results on production and extraction costs, consistent with the main estimates. The stacked event study shows no pre-trends.&lt;/p&gt;
&lt;p&gt;Q: What is the implicit carbon price of existing production-based oil taxes?
A: At the production-weighted average royalty rate of 21% and a reference oil price of $65/bbl, the existing taxes correspond to an indirect carbon price of approximately $32/tCO2, calculated using a CO2 content of 0.43 tCO2/bbl. This figure is an order of magnitude larger than the current global average demand-side carbon price of $3.1/tCO2 (a production-weighted average including zeros for unpriced emissions). This calculation pertains only to downstream combustion emissions and excludes upstream production emissions.&lt;/p&gt;
&lt;p&gt;Q: What are the quantified effects of a global 20-percentage-point climate royalty surcharge on emissions?
A: In the first five years, the surcharge reduces annual oil-embedded emissions by 0.7–1.0 GtCO2, a 5–7% reduction. By year 2100, annual reductions reach 1.2–2.6 GtCO2, a 9–20% reduction relative to baseline. The cumulative reduction over 2020–2100 is 85–161 GtCO2 (1.0–2.0 GtCO2 per year on average), representing 17–32% of the remaining carbon budget for 1.5°C warming or 7–14% of the budget for 2°C warming. All ranges span demand elasticities of −0.2 to −0.5.&lt;/p&gt;
&lt;p&gt;Q: What happens to the global oil price under a global supply-side surcharge?
A: The immediate contraction of unconventional oil production raises the oil price by $8–14/bbl in the short term. As new exploration and field development are suppressed over time, the price effect grows, reaching $23–27/bbl by year 2100. This price increase is roughly equivalent to a global carbon price of $53–63/tCO2 levied on oil consumers in the medium term.&lt;/p&gt;
&lt;p&gt;Q: How does the paper analyze distributional incidence under the global surcharge?
A: A 20-percentage-point surcharge reduces average annual consumer surplus by $500–730 billion and producer surplus by $270–310 billion per year. Tax revenue to oil-producing governments increases by $590–870 billion per year. The net present value of the aggregate economic loss is $1,000–1,400 billion; the policy breaks even in direct welfare terms at a social cost of carbon of $72–84/tCO2. Oil-producing governments are the primary beneficiaries; both consumers and oil companies lose surplus.&lt;/p&gt;
&lt;p&gt;Q: What is the carbon leakage rate under an OECD-only supply-side coalition?
A: In the short term, leakage is 16–37%, as non-OECD unconventional producers ramp up output in response to the higher oil price. By 2050 the leakage rate rises to 41–70%. By year 2100 the coalition has reduced annual production by 9,000–9,400 million barrels while non-OECD countries have increased theirs by 5,200–7,800 million barrels, implying a terminal leakage rate of 58–82%. The net cumulative global emission reduction of 54–107 GtCO2 represents 47–73% of what the OECD reduction alone achieves, and roughly two-thirds of the global scenario.&lt;/p&gt;
&lt;p&gt;Q: Why are the authors&amp;rsquo; supply elasticity estimates somewhat larger than the prior literature?
A: The authors offer two reasons. First, their approach captures elasticity through changes in exploration activity rather than only production or field development, a broader and more forward-looking margin. Second, they use tax-driven variation in prices rather than market-price variation; the event studies show that tax reforms produce persistent changes in tax rates and after-tax prices throughout the sample, so firms are likely responding to changes perceived as durable, which would naturally elicit larger responses than responses to short-run price fluctuations.&lt;/p&gt;
&lt;p&gt;Q: What are the key limitations and scope conditions of the model?
A: The quantification omits upstream (well-to-refinery) emissions and natural gas, meaning the estimated climate effects are conservative. The demand curve is held constant over time, abstracting from long-run substitution toward clean energy. The model does not account for depletion of low-cost reserves beyond 80 years. The empirical elasticities are estimated from tax reforms that may have been perceived as temporary, meaning permanent-policy elasticities could be larger, which would imply both larger emission reductions under a global policy and higher leakage rates under a partial coalition.&lt;/p&gt;
&lt;p&gt;Q: How do distributional consequences differ between the OECD-only and global scenarios?
A: Under the OECD-only surcharge, OECD consumers and OECD producers both lose surplus, while non-OECD producers and governments everywhere gain — non-OECD governments solely through the oil price increase without bearing any tax burden. The sum of OECD producer surplus losses and non-OECD producer surplus gains is slightly negative overall. The aggregate annual global economic loss under the OECD scenario is $120–170 billion, slightly lower than the global scenario ($130–220 billion), because the oil price increase and quantity reduction are both smaller in the OECD case.&lt;/p&gt;
&lt;p&gt;Production-based tax (royalty): A tax levied on gross oil production or gross income from oil, not on profit. Unlike profit-based taxes, these are not deductible against costs and therefore create incentives to curtail exploration and production. In the paper&amp;rsquo;s framework they are equivalent to a supply-side climate instrument because they reduce the after-tax price received by producers.&lt;/p&gt;
&lt;p&gt;Climate royalty surcharge: An additional production-based tax, layered on top of existing taxes, proposed as an explicit supply-side climate policy instrument. Following Prest and Stock (2023), the paper defines this as an ad valorem levy on oil production that implicitly prices downstream CO2 emissions through its effect on the after-tax oil price.&lt;/p&gt;
&lt;p&gt;Carbon leakage: The offsetting increase in oil production by non-coalition countries in response to an oil price rise caused by a supply-restricting policy adopted by a subset of producers. Measured as the ratio of the production increase in non-coalition countries to the production reduction in coalition countries, expressed as a percentage.&lt;/p&gt;
&lt;p&gt;After-tax oil price elasticity of exploration: The percentage change in exploration expenditure per one-percent change in the after-tax oil price, estimated via 2SLS instrumenting the after-tax price with production taxes. The preferred estimate is 1.96, implying elastic exploration responses to tax-driven price changes.&lt;/p&gt;
&lt;p&gt;Extraction cost (breakeven price): The constant oil price at which the net present value of developing a field equals zero, computed using a real discount rate of 7.5%. It is the minimum price at which a field is commercially viable absent profit taxes. In the quantitative model, fields are developed if and only if extraction cost falls below the after-tax oil price.&lt;/p&gt;
&lt;p&gt;Indirect carbon price: The implicit CO2 price embedded in a production-based oil tax, calculated as the ad valorem royalty rate multiplied by the oil price and divided by the CO2 content of oil. The paper calculates that the existing average 21% royalty at $65/bbl corresponds to an indirect carbon price of approximately $32/tCO2, applicable only to downstream combustion emissions.&lt;/p&gt;
&lt;p&gt;Stacked regression (staggered DiD): A robustness approach to two-way fixed effects with staggered treatment timing, constructing cohort-specific datasets for each treatment year using only never-treated units as controls, thereby avoiding contamination from using already-treated units as comparisons for later-treated units.&lt;/p&gt;</description></item><item><title>Quota Mechanisms: Finite-Sample Optimality and Robustness</title><link>https://macropaperwarehouse.com/papers/quota-mechanisms-finite-sample-optimality-and-robustness/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/quota-mechanisms-finite-sample-optimality-and-robustness/</guid><description>&lt;p&gt;Ball and Kattwinkel study quota mechanisms — linking mechanisms that impose aggregate constraints on agents&amp;rsquo; reports across multiple decision problems — and provide the first theoretical analysis under realistic finite-sample conditions with uncertainty about the type distribution. The canonical examples are mandatory grading curves, prescription drug monitoring programs, storable votes procedures, and lifetime assistance caps (TANF). Prior literature (Jackson and Sonnenschein 2007; Matsushima et al. 2010) established only asymptotic results under the assumption that the designer knows the exact population distribution, leaving the practical rationale for quotas incomplete.&lt;/p&gt;
&lt;p&gt;The paper works in the Jackson–Sonnenschein (2007) decision framework: a principal and n agents face K independent copies of a primitive collective decision problem with independent private values and additively separable utilities. A quota mechanism requires each agent&amp;rsquo;s K reported type distributions to average to a fixed quota; in each problem copy the social choice function is applied to independently sampled types from the submitted distributions. The key methodological innovation is a reformulation of each agent&amp;rsquo;s best-response as an optimal transport problem, enabling tight bounds.&lt;/p&gt;
&lt;p&gt;The central result (Theorem 1) is a tight ex-post decision error guarantee: for any q-cyclically monotone social choice function, the (x,q)-quota mechanism has a Bayes–Nash equilibrium in which the average frequency of incorrect decisions across K problems is bounded by the sum over agents of (|Θ_i| − 1) times the total variation distance between agent i&amp;rsquo;s quota and the empirical distribution of agent i&amp;rsquo;s realized type vector. The constants (|Θ_i| − 1) are tight — they cannot be reduced even by arbitrary linking mechanisms without transfers. The core technical challenge is a &amp;ldquo;cascade of lies&amp;rdquo;: when an agent&amp;rsquo;s realized type frequencies depart from his quota, he may misreport in a way that propagates errors across types. The optimal transport reformulation shows this cascade is bounded because, under a cyclically monotone social choice function, an optimal coupling of the empirical and quota distributions can always be chosen whose support contains no nontrivial cycles, so every transport path has length at most |Θ_i| − 1.&lt;/p&gt;
&lt;p&gt;Taking expectations (Theorem 2), with quotas set equal to the prior π, the expected decision error is at most (1/√(2K)) times the sum over agents of (|Θ_i| − 1)^(3/2), which is of order 1/√K and tight to within a factor of approximately 1.25. Applied concretely: with three treatment types and K = 200 patients, the expected share receiving the wrong treatment is at most 10%.&lt;/p&gt;
&lt;p&gt;Theorem 3 establishes implementation equivalence: a social choice function is (a) one-shot implementable with transfers, (b) π-cyclically monotone, (c) asymptotically implemented by quota mechanisms, and (d) asymptotically implementable by any linking mechanism with transfers, all if and only if each other holds. No linking mechanism, even with transfers, can asymptotically implement social choice functions that quota mechanisms cannot. A quota–transfer duality is identified: the transfer T_i(θ_i&amp;rsquo;) in the one-shot problem corresponds to the Lagrange multiplier on the quota constraint for type θ_i&amp;rsquo;, with the two implementations requiring dual pieces of information about the environment.&lt;/p&gt;
&lt;p&gt;Theorem 4 bounds the error from misspecified quotas: if the true distribution is π but the quota is set to q, the mechanisms asymptotically implement some social choice function x_π whose expected distance from the target is bounded by Σ_i (|Θ_i| − 1)||q_i − π_i||. With many patients and a quota that underestimates the need for one of three treatments by 1 percentage point, at most 2% of patients receive the wrong treatment. The constants are again tight.&lt;/p&gt;
&lt;p&gt;Theorem 5 addresses robustness to agents&amp;rsquo; beliefs: in the Bergemann–Morris (2005) rich type-space framework, for any type space satisfying exchangeability and independence, the (x,π)-quota mechanism admits a belief-free equilibrium in which each agent&amp;rsquo;s strategy depends only on his own payoff type, and the expected average decision error vanishes as K → ∞. The mechanism is belief-robust because each agent knows his opponents must respect the quota, which pins down the marginal distribution of their reports regardless of their beliefs. Extensions treat interdependent values and dynamic settings with sequentially arriving information.&lt;/p&gt;
&lt;p&gt;Q1: What is the fundamental practical problem with quota mechanisms that the paper addresses?
The prior literature showed quota mechanisms work asymptotically when the designer knows the true type distribution and the number of linked decisions is large. In practice, both conditions fail: any finite sample produces an empirical type distribution that deviates from the quota due to sampling variation, and quotas are typically set using imperfect estimates of the population distribution. The paper is the first to quantify the decision errors arising from these two sources of discrepancy.&lt;/p&gt;
&lt;p&gt;Q2: What is the decision-error guarantee in Theorem 1 and why are the constants tight?
For a q-cyclically monotone social choice function x and any realization of agents&amp;rsquo; private information, the average fraction of incorrect decisions is bounded by the sum over agents i of (|Θ_i| − 1) times ||q_i − marg(θ_i)||. The constants |Θ_i| − 1 are exactly tight: if they were reduced even slightly, the bound would fail for some realization under some linking mechanism. Tightness is demonstrated via a lower bound (Remark 3) that, in the case of a single agent with two types, agrees exactly with the upper bound.&lt;/p&gt;
&lt;p&gt;Q3: What is the &amp;ldquo;cascade of lies&amp;rdquo; and how does optimal transport resolve it?
When an agent&amp;rsquo;s empirical type distribution differs from his quota, truthful reporting is infeasible; he must misreport some types, which can propagate further misreporting — a cascade. The key insight is that the agent&amp;rsquo;s best-response is equivalent to choosing a coupling (joint distribution) of his empirical distribution and his quota that maximizes a linear objective. Because the social choice function is cyclically monotone, Lemma 2 establishes that an optimal coupling exists whose support contains no nontrivial cycles; consequently transport paths visit each type at most once and have length at most |Θ_i| − 1, bounding the total probability moved at (|Θ_i| − 1) times the total variation distance.&lt;/p&gt;
&lt;p&gt;Q4: What does the expected error bound (Theorem 2) say quantitatively?
With the quota set equal to the prior π and K problem copies, the expected average fraction of incorrect decisions is at most (1/√(2K)) × Σ_i (|Θ_i| − 1)^(3/2). For a single agent with |Θ| = 3 types and K = 200 problems, the bound evaluates to (1/√400) × (2)^(3/2) ≈ 0.10, so at most 10% of patients receive the wrong treatment. The bound is of order 1/√K and cannot be improved by more than a factor of approximately 1.25.&lt;/p&gt;
&lt;p&gt;Q5: What is the implementation equivalence result (Theorem 3) and why is it significant?
Theorem 3 shows that four conditions are mutually equivalent for any social choice function x: being one-shot implementable with transfers (Rochet 1987), being π-cyclically monotone, being asymptotically implemented by (x,π)-quota mechanisms, and being asymptotically implementable by any linking mechanism including those with transfers. The significance is that no richer linking mechanism — even one with monetary transfers — can asymptotically implement anything that quota mechanisms cannot, justifying the focus on quota mechanisms.&lt;/p&gt;
&lt;p&gt;Q6: What is the quota–transfer duality identified in Section 5.2?
In the one-shot problem, the transfer T_i(θ_i&amp;rsquo;) for agent i reporting type θ_i&amp;rsquo; corresponds exactly to the Lagrange multiplier on the quota constraint for type θ_i&amp;rsquo;. The two implementations require dual pieces of information: quota implementation requires knowledge of the type distribution π_i (to set the quota) but not the utility function or cross-agent beliefs; transfer implementation requires knowledge of agent i&amp;rsquo;s utility function and interim beliefs but not the marginal distribution π_i. A concrete allocation example illustrates that transfers can implement the social choice function without knowing the type distribution, while quotas cannot.&lt;/p&gt;
&lt;p&gt;Q7: How does Theorem 4 bound the error from a misspecified quota?
If the quota q is set based on an incorrect estimate but the true distribution is π, the (x,q)-quota mechanisms asymptotically implement some social choice function x_π whose expected total variation distance from the target x is bounded by Σ_i (|Θ_i| − 1)||q_i − π_i||. The constants |Θ_i| − 1 are again tight. Applied to opioid prescription with |Θ| = 3 and a 1 percentage point underestimate (||q − π|| = 0.01) for one treatment, the long-run expected error is at most 2 × 0.01 = 0.02, so at most 2% of patients receive the wrong treatment.&lt;/p&gt;
&lt;p&gt;Q8: How is belief robustness (Theorem 5) formalized and what does it require?
The paper adopts the Bergemann–Morris (2005) rich type-space framework, in which each agent has a payoff type and a belief type. Theorem 5 requires the type space to satisfy exchangeability (joint distribution over payoff types is exchangeable across problem copies) and independence (payoff types are independent across agents). Under these conditions, the (x,π)-quota mechanism has a Bayes–Nash equilibrium in which each agent&amp;rsquo;s strategy depends only on his payoff type vector, not his belief type, and the expected average decision error converges to zero as K → ∞.&lt;/p&gt;
&lt;p&gt;Q9: Why is cyclical monotonicity the key structural condition, and what is its relationship to Rochet (1987)?
Cyclical monotonicity requires that no cycle of types would strictly gain, on average, if each type received the allocation intended for the next type in the cycle. Rochet (1987) proved that a social choice function is one-shot implementable with transfers if and only if it is cyclically monotone. Ball and Kattwinkel&amp;rsquo;s Theorem 3 adds that this same condition characterizes asymptotic implementability by quota mechanisms and by any linking mechanism with transfers, establishing a deep equivalence between the transfer-based and quota-based approaches.&lt;/p&gt;
&lt;p&gt;Q10: How does the new quota mechanism formulation differ from Jackson and Sonnenschein (2007) and what are the consequences?
Jackson and Sonnenschein require agents to report a K-vector of types with type frequencies matching the quota, which requires quotas whose components are integer multiples of 1/K and involves additional modifications for general quotas. Ball and Kattwinkel allow each agent to report a type distribution on each problem, with the average of the K distributions constrained to equal the quota. This enables direct application of optimal transport theory; every type gets weakly higher expected utility under the Theorem 1 equilibrium than under the JS equilibrium. Under JS&amp;rsquo;s definition, Theorem 1 still holds but with an additional error term of order 1/K.&lt;/p&gt;
&lt;p&gt;Q11: Does the optimality result in Theorem 1 extend to linking mechanisms with transfers?
Yes. Theorem 1 states that the constants |Θ_i| − 1 cannot be reduced even using arbitrary linking mechanisms — and the text specifies this holds even for mechanisms without transfers. Theorem 3 further establishes that the class of social choice functions asymptotically implementable does not expand when transfers are added, reinforcing the conclusion that quota mechanisms are not dominated by richer mechanisms in the asymptotic sense.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Quota mechanism: A linking mechanism in which each agent&amp;rsquo;s K reported type distributions must average to a fixed quota profile q; the social choice function is then applied to types independently sampled from each reported distribution. Generalizes mandatory grading curves, prescription quotas, and storable votes procedures.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Cyclical monotonicity (q-cyclical monotonicity): A condition on a social choice function x requiring that no cycle of types would strictly gain, on average, if each type in the cycle received the allocation intended for the next type. With multiple agents, taken in expectation over co-agents&amp;rsquo; types drawn from q. Equivalent by Rochet (1987) to one-shot implementability with transfers.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Ex-post decision error: The average, over K problem copies, of the total variation distance between the implemented decision lottery and the socially desired decision lottery, evaluated at a particular realization of private information — not in expectation over types.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Cascade of lies: The phenomenon in which an agent whose empirical type distribution departs from the quota finds it optimal to propagate misreporting across multiple types, amplifying the decision error beyond the minimum necessary to satisfy the quota constraint. Bounded in magnitude by the optimal transport analysis.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Optimal transport reformulation: Each agent&amp;rsquo;s best-response choice of report vector is recast as selecting a coupling (joint distribution) of his empirical type distribution marg(θ_i) and his quota q_i to maximize a linear objective. The acyclic structure of optimal couplings under cyclical monotonicity yields the tight error bound (|Θ_i| − 1)||q_i − marg(θ_i)||.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Implementation equivalence: The result (Theorem 3) that one-shot implementability with transfers, π-cyclical monotonicity, asymptotic implementation by quota mechanisms, and asymptotic implementability by any linking mechanism with transfers are mutually equivalent conditions on a social choice function.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Belief-free equilibrium: An equilibrium of a quota mechanism in the Bergemann–Morris type-space framework in which each agent&amp;rsquo;s strategy depends only on his payoff type, not his belief type. Exists under exchangeability and independence, because the quota pins down the marginal distribution of opponents&amp;rsquo; reports regardless of beliefs.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Distributional robustness: The property that when the quota q_i is set based on an incorrect estimate of the true distribution π_i, the long-run decision error is bounded by (|Θ_i| − 1)||q_i − π_i||, proportional to the estimation error.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Racial disparities in crime and wealth</title><link>https://macropaperwarehouse.com/papers/racial-disparities-in-crime-and-wealth/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/racial-disparities-in-crime-and-wealth/</guid><description>&lt;p&gt;This paper asks whether racial differences in labor income can simultaneously explain both the crime gap and the wealth gap between Black and White individuals in the United States. The authors build a large-scale overlapping generations (OLG) model in which property crime is endogenously determined — agents choose whether to steal alongside their consumption and savings decisions — while drug-related incarcerations are treated as exogenous, reflecting evidence that racial profiling distorts enforcement independently of offending behavior. The model is calibrated to match several well-documented racial disparities: Black individuals comprise 12.36% of the adult population but 33.8% of the incarcerated population; 42.7% of Black individuals fall in the bottom wealth quintile (below $3,400 in assets) versus 15.1% of White individuals; the median Black-White wealth gap is 89.5% (SCF 2019). Data sources include the Survey of Consumer Finances (SCF 2019), Uniform Crime Reports (UCR 1996–2011), NLSY79, PSID (1968–2021), and MORG (2000–2019).&lt;/p&gt;
&lt;p&gt;The model incorporates four dimensions of labor market disparity between Black and White agents: educational attainment, unemployment risk and duration, age-earnings profiles, and idiosyncratic income shock processes. It also incorporates race-skill-specific survival probabilities (life expectancy at birth: 73 years for Black, 78 years for White), scarring effects from incarceration on future labor income, a progressive income tax, means-tested transfers, and accidental bequests distributed within race groups.&lt;/p&gt;
&lt;p&gt;The benchmark model successfully replicates key data moments. Black individuals constitute 34.3% of the incarcerated population (data: 33.8%). The model-generated median wealth gap is 83.6% (data: 89.5%). The share of Black individuals in the bottom wealth quintile is 37.7% in the model versus 42.7% in the data. The model does not match the average wealth gap: the model-generated gap is 58.9% versus 84.4% in the SCF.&lt;/p&gt;
&lt;p&gt;The main counterfactual experiments yield three findings. First, equalizing labor market conditions — particularly age-earnings profiles — is the dominant driver of both racial wealth and crime disparities. When all labor market conditions are equalized, the Black crime rate falls by 66.25% (from 11.97% to 4.04%), the median wealth gap declines by 69.6% (from 83.58% to 25.4%), and the share of Black individuals in the bottom wealth quintile falls from 37.73% to 20.75%. Equalizing age-earnings profiles alone accounts for the largest single-factor effect: the median wealth gap declines from 83.58% to 44.16% and the Black crime rate from 11.97% to 7.59%. The resource cost of equalizing age-earnings profiles is estimated at 3.29% of GDP for the No-HS group and 22.2% of GDP for the HS group.&lt;/p&gt;
&lt;p&gt;Second, higher crime and incarceration rates among Black individuals do not significantly contribute to their lower wealth. When crime is entirely eradicated, the share of Black individuals in the bottom quintile barely moves (37.73% to 37.61%), and the median wealth gap falls only from 83.58% to 82.6%. The mechanism is that most crimes are committed by young, already-poor individuals who are not saving in any case; income loss during incarceration is not large enough to affect wealth accumulation meaningfully.&lt;/p&gt;
&lt;p&gt;Third, equalizing life expectancy generates a 25.39% reduction in the median wealth gap and a 12.4% decline in the share of Black individuals in the bottom wealth quintile, with negligible effect on crime rates.&lt;/p&gt;
&lt;p&gt;The paper also validates the model against Cesarini et al. (2023), who find a small, statistically insignificant effect of lottery wealth on criminal behavior in Sweden. The model replicates this finding: a $150,000 windfall reduces incarceration risk over seven years by 0.81 percentage points. The mechanism is that lottery winnings displace means-tested transfers, winnings gradually dissipate as low income persists, and individuals eventually return to poverty and resume criminal activity.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does the paper treat property crime and drug crime differently?
A: The paper asks whether racial labor income differences can simultaneously account for both crime and wealth disparities. Property crimes are modeled endogenously because offending behavior responds rationally to economic incentives. Drug crime incarcerations are exogenous to capture evidence that racial profiling in enforcement — rather than differential offending alone — drives racial disparities in drug arrests: Beck and Blumstein (2018) show differential offending explains only about 52% of the drug imprisonment gap, versus over 70% for overall imprisonment.&lt;/p&gt;
&lt;p&gt;Q: What are the benchmark model&amp;rsquo;s key calibration targets and how well does it fit the data?
A: The benchmark targets Black individuals as 34.3% of the incarcerated population (data: 33.8%), a median Black-White wealth gap of 83.6% (data: 89.5%), and 37.7% of Black individuals in the bottom wealth quintile (data: 42.7%). The model does not match the average wealth gap: the model-generated gap is 58.9% versus 84.4% in the SCF, which the authors acknowledge explicitly.&lt;/p&gt;
&lt;p&gt;Q: What is the quantitative effect of equalizing all labor market conditions?
A: Experiment 5 (equalize educational attainment, unemployment risk, and age-earnings profiles jointly) reduces the Black crime rate by 66.25% (from 11.97% to 4.04%), the median wealth gap by 69.6% (from 83.58% to 25.4%), and the share of Black individuals in the bottom quintile from 37.73% to 20.75%. Equalizing all factors including life expectancy drives the median wealth gap to 0%, with the bottom-quintile share for Black individuals at 19.31%.&lt;/p&gt;
&lt;p&gt;Q: Which single labor market factor matters most for the wealth gap and crime rate?
A: Equalizing age-earnings profiles (Experiment 3) is the single most important factor, reducing the median wealth gap from 83.58% to 44.16% and the Black crime rate from 11.97% to 7.59%. By contrast, equalizing educational attainment or unemployment risk each reduces the median wealth gap only to 76.71%, with smaller crime effects.&lt;/p&gt;
&lt;p&gt;Q: Does education-group heterogeneity matter for interpreting the age-earnings equalization effect?
A: Yes, substantially. Equalizing age-earnings profiles for the No-HS group reduces the Black crime rate by 21% with little effect on the median wealth gap. Equalizing profiles for the HS group reduces the median wealth gap by approximately 40% with a much smaller effect on crime rates. The earnings channel to crime operates primarily at the bottom of the education distribution, while the earnings channel to wealth accumulation operates more strongly in the high school group.&lt;/p&gt;
&lt;p&gt;Q: Why does crime have so little effect on the wealth distribution?
A: Criminals are predominantly young and already-poor individuals who are not accumulating savings. Because these individuals have minimal assets and rely heavily on means-tested transfers for consumption, the income loss during incarceration does not reduce their wealth meaningfully. When crime is completely eradicated, the share of Black individuals in the bottom quintile falls only from 37.73% to 37.61% and the median wealth gap declines from 83.58% to only 82.6%.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of eliminating drug-related incarcerations on the Black wealth distribution?
A: Experiment 4 (eliminating drug crime incarcerations) reduces the share of Black individuals in the bottom quintile only slightly, from 37.73% to 37.30%. Eliminating the scarring effect of all incarcerations likewise has negligible effects on the bottom-quintile share (37.53% versus 37.73% in the benchmark) and the zero-assets share (32.56% versus 33.24%). Neither the direct incarceration penalty nor its labor market scarring meaningfully affects wealth accumulation.&lt;/p&gt;
&lt;p&gt;Q: What happens to crime and wealth when the property crime clearance rate changes?
A: Doubling the clearance rate from 17.2% to 34.4% reduces the Black crime rate from 11.97% to 1.72% and the White rate from 3.05% to 0.52%, with minimal change in the wealth distribution (Blacks in bottom quintile: 37.83%). Halving the clearance rate to 8.6% more than doubles Black crime to 27.53% and White crime to 9.55%, and increases the share of Black individuals in the bottom quintile by about 11% to 42.01%. This asymmetry — crime reduction barely helps wealth but crime increase does hurt — is consistent with the poverty-trap mechanism.&lt;/p&gt;
&lt;p&gt;Q: How does the model validate against the Cesarini et al. (2023) Swedish lottery study?
A: Cesarini et al. find a small, statistically insignificant negative effect of a $150,000 lottery windfall on conviction rates. The model replicates this: simulating 34,709 individuals per skill-race group, a $150,000 windfall reduces incarceration risk over the following seven years by 0.81 percentage points. When the authors use model-generated property crime records rather than incarceration records as the dependent variable, they find a statistically significant effect more than twice as large, suggesting incarceration data systematically understates the crime-reducing effect of wealth shocks.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism by which lottery winnings have minimal persistent effects on crime?
A: Lottery winners in the model are disproportionately drawn from low-income, low-wealth individuals who also receive means-tested transfers. After winning, these individuals lose transfer eligibility, so winnings substitute for lost transfers rather than being invested. With income levels remaining low, winnings dissipate over time, individuals return to poverty, and resume criminal activity. Larger lottery prizes extend the crime-free interval but do not permanently alter behavior.&lt;/p&gt;
&lt;p&gt;Q: What is the role of life expectancy differences in racial wealth and crime gaps?
A: Equalizing survival probabilities generates a 25.39% reduction in the median wealth gap and a 12.4% reduction in the share of Black individuals in the bottom quintile, with virtually no change in crime rates. The channel operates through savings incentives: a shorter expected lifetime (73 years for Black versus 78 for White) reduces the return to wealth accumulation independently of income.&lt;/p&gt;
&lt;p&gt;Q: What are the fiscal resource requirements implied by the income equalization experiments?
A: Implementing equalized age-earnings profiles for the No-HS group would require resources equal to 3.29% of total GDP, while equalization for the HS group would require 22.2% of GDP. These figures reflect the scale of redistribution needed to close earnings profiles and serve as a benchmark for assessing policy feasibility.&lt;/p&gt;
&lt;p&gt;Q: How does incarceration scarring affect lifetime income in the benchmark, and how does this validate against external data?
A: A Black high school graduate who experiences at least one incarceration earns 16.8% less over his lifetime than one who is never incarcerated; for White high school graduates the gap is 28.7%. Gordon et al. (2023) report corresponding empirical estimates of 18.6% for Black and 32.7% for White high school graduates, closely validating the model&amp;rsquo;s scarring calibration.&lt;/p&gt;
&lt;p&gt;Endogenous property crime: A rational choice by working-age agents who weigh the expected gain from stealing (fraction γ = 6.4% of average labor income y) against the probability of apprehension (clearance rate πa = 17.2%), the loss of means-tested transfers, scarring of future labor income, and the minimum consumption floor in jail. Retired agents face no such choice.&lt;/p&gt;
&lt;p&gt;Exogenous drug incarceration: Incarceration for drug possession modeled as an exogenous shock with race-age-specific probabilities, not responsive to individual optimization, capturing the possibility that racial profiling in enforcement generates disparities in drug arrests independently of offending behavior.&lt;/p&gt;
&lt;p&gt;Scarring effect: Post-incarceration labor income penalty modeled as a higher probability of drawing a lower idiosyncratic income shock state upon labor market re-entry, calibrated so the model reproduces lifetime income gaps between ever-incarcerated and never-incarcerated individuals by race-skill group (18.6% for Black HS, 32.7% for White HS per Gordon et al. 2023).&lt;/p&gt;
&lt;p&gt;Age-earnings profile (ε^{i,ζ}_j): The deterministic, skill-race-age-specific component of labor income estimated from PSID data for each of six race-education groups. The gap between Black and White age-earnings profiles is identified as the dominant driver of both the racial wealth gap and racial crime disparities, accounting for the largest single-factor reduction in both outcomes across all counterfactual experiments.&lt;/p&gt;
&lt;p&gt;Means-tested transfer floor: A consumption support program that fills the gap between an agent&amp;rsquo;s post-tax income plus assets and a minimum threshold κ (5.8% of average net tax income and assets). This transfer is a critical mechanism linking wealth shocks to crime: lottery winnings and other wealth gains displace transfer eligibility, causing winnings to be consumed rather than saved, and eventually exhausted.&lt;/p&gt;
&lt;p&gt;Median wealth gap: The percentage difference between median White and median Black wealth — 89.5% in the 2019 SCF, 83.6% in the benchmark model — used as the primary scalar summary of racial wealth disparity, chosen because the model does not match the average wealth gap (model: 58.9%, data: 84.4%).&lt;/p&gt;
&lt;p&gt;Victimization probability (πv(Y)): A step-wise decreasing function of taxable income capturing spatial concentration of property crime in low-income neighborhoods; in equilibrium this equals the aggregate property crime rate χp, ensuring market clearing in the crime sector and implying that poorer agents face higher victimization risk.&lt;/p&gt;</description></item><item><title>Racial Disparities in Federal Sentencing: Evidence from Drug Mandatory Minimums</title><link>https://macropaperwarehouse.com/papers/racial-disparities-in-federal-sentencing-evidence-from-drug-mandatory-minimums/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/racial-disparities-in-federal-sentencing-evidence-from-drug-mandatory-minimums/</guid><description>&lt;p&gt;This paper studies racial disparities in federal criminal sentencing by analyzing abnormal bunching in the distribution of crack-cocaine amounts recorded at sentencing. The identifying variation comes from the Fair Sentencing Act (FSA) of 2010, which raised the 10-year mandatory minimum threshold for crack-cocaine from 50 grams to 280 grams. Because the new 280g threshold was set at a point with essentially zero pre-existing bunching, the author implements a difference-in-bunching design (following Kleven 2016) that compares the pre-2010 distribution of charged drug amounts — treated as the counterfactual — to the post-2010 distribution. The primary data are case-level records from the United States Sentencing Commission (USSC) covering all federal drug cases sentenced 1999–2015, restricted to crack-cocaine offenses (approximately 50,273 cases, of which 83.3% involve black defendants, 9.2% Hispanic, and 7.6% white).&lt;/p&gt;
&lt;p&gt;The main finding is that after 2010, the fraction of cases charged with amounts in the 280–290g range increases by 3.3 percentage points overall. This increase is disproportionately concentrated among minority defendants: black and Hispanic offenders are more than 2.5 times as likely as white offenders to be charged with 280–290g after the threshold shifts to that level. Approximately 80% of the excess mass at 280g is drawn from cases that had previously been charged in the 50–280g range, indicating that prosecutors are moving cases upward to cross the new threshold rather than negotiating downward from above it. For black and Hispanic offenders specifically, cases from the 50–280g range account for 88% of the increase at the new threshold.&lt;/p&gt;
&lt;p&gt;The author rules out differential drug involvement as an explanation. The pre-2010 distributions of charged amounts from 60–280g are nearly identical across racial groups; a Kolmogorov-Smirnov test fails to reject equality (p-value = 0.792). This implies the post-2010 racial disparity in bunching is a conditional disparity — arising not from differences in underlying drug involvement but from differential treatment of similarly situated defendants.&lt;/p&gt;
&lt;p&gt;The paper then traces the bunching to prosecutorial discretion specifically. Drug seizure records (NIBRS, DEA STRIDE), survey data on drug use and selling (NSDUH), and state-level conviction records from Florida all show no change in drug quantities or behaviors at the offender or law enforcement level coinciding with the FSA. Critically, there is no bunching at 280g in drug seizure data, pointing to decisions made after arrest. By contrast, case management files from the Executive Office of the US Attorney (EOUSA) show the fraction of cases recorded in the 280–290g range increases by 7.8 percentage points after 2010. Approximately 22–30% of prosecutors (depending on the detection method) are responsible for the rise in 280g cases. Bunching patterns persist across districts and mandatory minimum thresholds for the same prosecutors, indicating it reflects a prosecutor-level characteristic.&lt;/p&gt;
&lt;p&gt;The Supreme Court&amp;rsquo;s 5-4 decision in Alleyne v. United States (June 2013) raised the evidentiary standard for facts that trigger mandatory minimums and shifted that factual determination to juries. The share of EOUSA cases recorded in the 280–290g range fell from 9.1% (2011–2013) to 6.8% (2014–2016) after Alleyne, and a difference-in-discontinuities design confirms that bunching was partially reined in by this decision.&lt;/p&gt;
&lt;p&gt;On the question of discrimination, the racial disparity in bunching cannot be explained by observable defendant characteristics — education, sex, age, criminal history, seized drug amount, or other offense elements. Approximately 70% of the disparity persists after controlling for state-by-post fixed effects and 60% after district-by-post fixed effects. The disparity can be largely explained by a state-level measure of racial animus based on Google search data (Stephens-Davidowitz 2014): prosecutors operating in higher-animus states apply more disparate treatment, a pattern consistent with taste-based rather than statistical discrimination.&lt;/p&gt;
&lt;p&gt;Cases charged just above the 280g threshold receive longer sentences than those just below it in the post-2010 period, confirming that prosecutorial bunching has real consequences for sentence length.&lt;/p&gt;
&lt;p&gt;Q: What is the central empirical strategy of the paper?
A: The paper uses a difference-in-bunching design exploiting the Fair Sentencing Act of 2010, which shifted the 10-year mandatory minimum threshold for crack-cocaine from 50g to 280g. Because the 280g point had essentially zero bunching before 2010, the pre-2010 distribution of charged drug amounts serves as an empirical counterfactual for the post-2010 distribution absent the threshold change. The design allows the author to isolate bunching caused by the new threshold and to test whether that bunching is racially disparate.&lt;/p&gt;
&lt;p&gt;Q: What is the main quantitative finding on bunching?
A: After 2010, offenders sentenced for crack-cocaine are 3.3 percentage points more likely to be charged with amounts in the 280–290g range (Column 1, Table 2). Black and Hispanic offenders are more than 2.5 times as likely as white offenders to be charged with 280–290g after the threshold change (Column 2, Table 2). This racial gap is the central disparity the paper investigates.&lt;/p&gt;
&lt;p&gt;Q: Does the racial disparity in bunching reflect genuine differences in drug involvement?
A: No. The pre-2010 distributions of charged amounts from 60–280g are nearly identical across racial groups; a Kolmogorov-Smirnov test fails to reject equality with a p-value of 0.792. Because these pre-period distributions are taken as reflecting true drug involvement, their similarity by race implies the post-2010 disparity is a conditional racial disparity — arising from differential treatment of similarly situated defendants, not from differential drug involvement.&lt;/p&gt;
&lt;p&gt;Q: Where in the criminal justice process does the bunching originate?
A: The bunching originates in prosecutorial decisions, not at the arrest or law enforcement stage. Drug seizure records (NIBRS and DEA STRIDE) show no bunching at 280g, and survey data (NSDUH) show no post-FSA change in drug use or selling by minority defendants. Florida state-level records show no shift in the share of high drug-weight cases. By contrast, EOUSA case management files — which capture quantities recorded by prosecutors — show an increase of 7.8 percentage points in the fraction of cases in the 280–290g range after 2010.&lt;/p&gt;
&lt;p&gt;Q: What fraction of prosecutors engage in this bunching behavior?
A: Approximately 29.7% of prosecutors have a higher-than-normal percentage of cases at 280–290g after 2010 under a straightforward outlier criterion. Using the outlier detection procedure from Ridgeway and MacDonald (2009), approximately 22% are flagged as outliers. A Bayesian shrinkage method estimates approximately 30% (SE = 0.042) of prosecutors engage in this bunching. The behavior persists across districts and across multiple mandatory minimum thresholds for the same prosecutors, indicating it is a durable prosecutor-level characteristic.&lt;/p&gt;
&lt;p&gt;Q: What evidence links the bunching to upward manipulation rather than downward negotiation?
A: Approximately 80% of the excess mass at 280g is drawn from cases previously charged in the 50–280g range rather than from cases above 290g. For black and Hispanic offenders the share is 88%. This pattern indicates prosecutors are pushing amounts upward past the new threshold to secure longer sentences, not negotiating amounts downward from above the threshold — reversing the direction assumed in prior qualitative discussions.&lt;/p&gt;
&lt;p&gt;Q: What was the effect of Alleyne v. United States on bunching?
A: The Supreme Court&amp;rsquo;s 5-4 decision in Alleyne (June 2013) raised the evidentiary standard for facts triggering mandatory minimums and assigned those factual determinations to juries rather than judges. The share of EOUSA cases in the 280–290g range fell from 9.1% in 2011–2013 to 6.8% in 2014–2016. A difference-in-discontinuities design confirms that bunching expanded in the run-up to Alleyne and was partially curtailed afterward, providing additional evidence that the bunching reflects prosecutorial manipulation rather than genuine drug amounts.&lt;/p&gt;
&lt;p&gt;Q: Can observable defendant characteristics explain the racial disparity in bunching?
A: No. The racial disparity in bunching persists after controlling for education, sex, age, criminal history, seized drug amount, and other offense elements. Approximately 70% of the disparity remains after controlling for state-by-post fixed effects and 60% after controlling for district-by-post fixed effects. The disparity exists among observably similar defendants, ruling out the hypothesis that it is driven by correlated case characteristics.&lt;/p&gt;
&lt;p&gt;Q: What evidence distinguishes taste-based from statistical discrimination?
A: The racial disparity in bunching is largely explained by a state-level measure of racial animus constructed from Google search data (Stephens-Davidowitz 2014): prosecutors in higher-animus states apply more racially disparate treatment. Because statistical discrimination would predict disparate outcomes based on informative case characteristics rather than on the ambient racial attitudes of the jurisdiction, the correlation with racial animus is more consistent with taste-based discrimination than with statistical discrimination.&lt;/p&gt;
&lt;p&gt;Q: Does bunching at 280g have real consequences for sentence length?
A: Yes. Cases charged just above the 280g threshold receive longer sentences than those charged just below it in the post-2010 period, confirming that the mandatory minimum threshold is binding and that prosecutorial bunching translates into materially longer sentences for the affected defendants.&lt;/p&gt;
&lt;p&gt;Q: How does this paper contribute relative to Rehavi and Starr (2014)?
A: Rehavi and Starr (2014) linked arrest to sentencing records to show black offenders receive harsher sentences, driven by prosecutorial charging of mandatory minimums, but acknowledged that unobserved differences in criminal conduct within offense codes remained a concern. This paper addresses that concern by using the pre-2010 distribution of charged amounts as a counterfactual for drug involvement, documenting near-identical pre-period distributions by race, and tracing the post-FSA disparity through multiple data sources to isolate prosecutorial decisions specifically. The paper also quantifies the fraction of prosecutors involved and tests discrimination mechanisms.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship between this paper&amp;rsquo;s findings and the policy goals of the Fair Sentencing Act?
A: The FSA achieved its stated goal of narrowing racial gaps attributable to the crack-powder disparity in mandatory minimum thresholds, and in line with prior work the author confirms a net decline in sentences after 2010. However, the increase in bunching at 280g by prosecutors — disproportionately applied to black and Hispanic defendants — dampened the FSA&amp;rsquo;s effectiveness. The paper thus documents a strategic response by a subset of prosecutors that partially offset the reform&amp;rsquo;s intended benefits for minority defendants.&lt;/p&gt;
&lt;p&gt;Q: How robust are the main bunching estimates?
A: The 3.3 percentage point overall increase and the 2.5x racial disparity are robust to various sample restrictions, inclusion of state fixed effects, time trends, state-specific time trends, offender-level controls, Logit/Probit/Poisson models, wider bunching range definitions (e.g., 280–380g), inclusion of cases with weights coded as a range, and alternative standard error calculations. Including range-coded cases actually exacerbates the estimated degree of bunching and the racial disparity.&lt;/p&gt;
&lt;p&gt;Bunching (in this paper&amp;rsquo;s sense): An excess mass of cases charged with a drug amount at or just above the mandatory minimum threshold, defined operationally as a disproportionate concentration of cases in the 280–290g range relative to the counterfactual distribution. Bunching reflects discretionary upward adjustment of charged amounts by prosecutors to trigger longer mandatory minimum sentences rather than true drug seizure quantities.&lt;/p&gt;
&lt;p&gt;Difference-in-bunching design: An empirical strategy adapted from Kleven (2016) that compares the actual post-2010 distribution of charged drug amounts to the pre-2010 distribution as a counterfactual for what the post-2010 distribution would have looked like absent the FSA threshold change. The method exploits the fact that the 280g threshold was a point of essentially zero bunching before 2010.&lt;/p&gt;
&lt;p&gt;Conditional racial disparity in bunching: A racial gap in the probability of being charged at 280–290g that remains after conditioning on similar underlying drug involvement, operationalized by the near-identical pre-2010 distributions of charged amounts from 60–280g across racial groups. The conditional disparity isolates differential treatment from differential conduct.&lt;/p&gt;
&lt;p&gt;Prosecutorial discretion (in this context): The legal authority of federal prosecutors to determine the drug quantity attributed to a defendant for sentencing purposes, which is not strictly bound to the amount physically seized at arrest. Prosecutors can rely on informant testimony, conspiracy attribution, or approximations to establish amounts above what was seized, giving them effective control over whether the mandatory minimum threshold is crossed.&lt;/p&gt;
&lt;p&gt;Taste-based discrimination: Racially disparate prosecutorial behavior that cannot be explained by observable case characteristics or informative statistical inference about defendant conduct, and that correlates instead with ambient state-level racial animus. In this paper&amp;rsquo;s framing, taste-based discrimination is distinguished from statistical discrimination by its correlation with the Stephens-Davidowitz racial animus measure rather than with defendant or offense characteristics.&lt;/p&gt;
&lt;p&gt;Mandatory minimum threshold (in federal crack-cocaine sentencing): A drug quantity cutoff — set at 50g before 2010 and 280g after the FSA — above which federal law mandates a sentence of at least 10 years unless specific departure conditions are met. The threshold creates a sharp discontinuity in expected sentence length that gives prosecutors an incentive to place cases just above it.&lt;/p&gt;
&lt;p&gt;State-level racial animus measure: A proxy for the prevalence of racially prejudiced attitudes in a state, constructed by Stephens-Davidowitz (2014) from Google Trends search volume data (2004–2007) for a specific racial slur and its plural, normalized by total search volume. Used here as a predictor of the size of the racial disparity in prosecutorial bunching across states.&lt;/p&gt;</description></item><item><title>Racial Disparities in Housing Returns</title><link>https://macropaperwarehouse.com/papers/racial-disparities-in-housing-returns/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/racial-disparities-in-housing-returns/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper estimates the racial/ethnic gap in realized housing returns using administrative data on individual housing transactions, and investigates the mechanisms that generate those gaps. The central question is: why do Black and Hispanic homeowners accumulate less housing wealth than White homeowners, even as minority homeownership rates have risen substantially over the last century?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors merge three primary data sources. First, a nationwide panel of residential property records from ATTOM covering 146.8 million arm&amp;rsquo;s-length home purchases from 1990 to 2020, which records transaction prices, mortgage characteristics, and property-level identifiers. Second, Home Mortgage Disclosure Act (HMDA) records, which contain self-reported race and ethnicity for mortgage applicants. Third, supplementary administrative sources including McDash mortgage servicing records, Equifax credit bureau data, Fannie Mae/Freddie Mac/ABSNet modification records, and the Survey of Income and Program Participation (SIPP). After applying sample restrictions — including requiring an observed purchase price, a linked HMDA record, an arm&amp;rsquo;s-length repeat sale, a combined loan-to-value ratio of at most 102.5%, and an ownership spell of at least 12 months — the baseline analysis sample comprises 13.6 million ownership spells for Black, Hispanic, and White homeowners who purchased homes with a mortgage between 1990 and 2016 in 40 states. Ownership spells unsold by March 2020 have their value imputed using the FHFA county-level house price index, a procedure that is conservative in that it understates racial gaps.&lt;/p&gt;
&lt;p&gt;The authors construct two complementary return measures. The &lt;strong&gt;unlevered return&lt;/strong&gt; compares the annualized ratio of sale price to purchase price. The &lt;strong&gt;levered return&lt;/strong&gt; (internal rate of return) sets the net present value of all homeowner cash flows — down payment, monthly mortgage payments, implicit rent, maintenance, taxes, insurance, transaction costs, and limited liability in foreclosure — equal to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Among mortgaged home purchases, mean annual unlevered returns are 0.5% for Black homeowners, 0.6% for Hispanic homeowners, and 2.8% for White homeowners, implying Black-White and Hispanic-White gaps of approximately &lt;strong&gt;2.3 percentage points per year&lt;/strong&gt;. Mean annual levered returns are 1.6%, −3.0%, and 6.6% for Black, Hispanic, and White homeowners respectively, yielding gaps of &lt;strong&gt;5.0 and 9.6 percentage points&lt;/strong&gt;. After adjusting for the approximately one-fourth of purchases made in cash (for which no racial gap is found), preferred estimates of the unlevered gap are 1.9 (Black-White) and 1.4 (Hispanic-White) percentage points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distressed sales — foreclosures and short sales — statistically account for the entire gap in returns.&lt;/strong&gt; Within non-distressed sales, the Black-White gap in annual unlevered returns falls to less than 40 basis points, and the Hispanic-White gap reverses sign. Two distinct factors drive the role of distressed sales: (1) Black and Hispanic homeowners are approximately &lt;strong&gt;twice as likely&lt;/strong&gt; as White homeowners to experience a distressed sale, and (2) minority homeowners live in neighborhoods where distressed sale price discounts are larger — estimated at 39%–40% for Black and Hispanic homeowners versus 28% for White homeowners. A Blinder-Oaxaca decomposition indicates that equalizing distressed sale rates (holding the distressed sale penalty fixed) would eliminate &lt;strong&gt;84.6%&lt;/strong&gt; of the Black-White unlevered returns gap and &lt;strong&gt;133.6%&lt;/strong&gt; of the Hispanic-White gap, confirming that the frequency margin dominates the severity margin.&lt;/p&gt;
&lt;p&gt;A counterfactual wealth-accumulation exercise using PSID data shows that &lt;strong&gt;equalizing housing returns reduces the Black-White gap in housing wealth at retirement by 37%&lt;/strong&gt;. Equalizing first-time purchase rates reduces the gap by only 1%, illustrating that promoting homeownership without addressing the returns gap is largely ineffective. Equalizing both returns and purchase rates reduces the gap by 49%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Approximately one-third of the gap in unlevered returns can be explained by purchase year and county fixed effects, with much of this timing effect attributable to the Great Recession. Controlling additionally for income, family structure, gender, and leverage reduces the gap by a further ~0.3 percentage points, leaving a substantial residual. About half of the racial gap in mortgage default can be attributed to observable credit risk (family structure, income, leverage, credit score). The remainder is associated with &lt;strong&gt;unobservable liquidity shortfalls and income instability&lt;/strong&gt;: median liquid wealth among Black and Hispanic homeowners is $2,400 and $5,400 respectively, and minority homeowners are 2–4 percentage points more likely to transition to unemployment conditional on pre-unemployment income. Using quasi-experimental variation from adjustable-rate mortgage resets, the paper shows that in response to a 10% increase in monthly payments, White homeowners increase 90-day mortgage default by 3.0 percentage points after 12 months, while Black and Hispanic homeowners show increases of 4.5 and 7.1 percentage points respectively — excess sensitivity that is not captured by credit scores. The early-2000s credit supply expansion through private securitization and portfolio lending channels (as distinct from GSE/FHA) contributed to &lt;strong&gt;61.5%&lt;/strong&gt; of the 6.2-percentage-point increase in the Black-White distressed-sale gap between the 2002 and 2006 purchase cohorts, and &lt;strong&gt;52.0%&lt;/strong&gt; of the 12.2-percentage-point increase in the Hispanic-White gap. Evidence from the National Survey of Mortgage Originations suggests that Black homeowners hold overoptimistic expectations about future house price growth and income growth relative to their realized outcomes, which may explain why high-risk minority households do not self-select out of homeownership.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to mortgaged home purchases (approximately three-fourths of all purchases) by Black, Hispanic, and White homeowners in 40 states (non-disclosure states excluded), with primary coverage from 2000 to 2016. No racial gap in returns is found for cash purchases. The racial gap in non-distressed returns is small and not economically meaningful, so the findings specifically pertain to the realized-return distribution that includes the distressed-sale tail.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How large is the racial gap in housing returns, and how does it compare to previously documented racial disparities in housing costs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Among mortgaged purchases, Black and Hispanic homeowners each realize annual unlevered returns approximately 2.3 percentage points lower than White homeowners; levered return gaps are 5.0 percentage points (Black-White) and 9.6 percentage points (Hispanic-White). In dollar terms, this translates to a difference of roughly $5,920 per year for the average Black homeowner and $6,762 per year for the average Hispanic homeowner on a ten-year holding horizon. These gaps are an order of magnitude larger than previously documented racial disparities in housing costs, such as post-origination interest rate disparities of about 40 basis points (~$500 annually for a $200,000 home) or inflated property tax assessments amounting to $300–$390 per year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the role of distressed sales in explaining racial gaps in returns, and how do frequency versus severity contribute?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Distressed sales statistically account for nearly the entire racial gap in realized housing returns. Within non-distressed sales, the Black-White unlevered gap falls to less than 40 basis points and the Hispanic-White gap inverts. Two channels operate: (1) Black and Hispanic homeowners are approximately twice as likely as White homeowners to experience a distressed sale; and (2) within distressed sales, minority homeowners realize lower returns because they tend to live in neighborhoods with larger distressed-sale price discounts (estimated at 39–40% below imputed market value for Black and Hispanic homeowners, vs. 28% for White homeowners). A Blinder-Oaxaca decomposition indicates that equalizing distressed sale frequency (holding severity fixed) would close 84.6% of the Black-White gap and 133.6% of the Hispanic-White gap, so the frequency margin is quantitatively dominant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Are racial differences in house price appreciation responsible for the gap in non-distressed returns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. Among non-distressed sales, realized returns closely track county-level FHFA house price index growth for Black, Hispanic, and White homeowners alike, essentially one-for-one regardless of race. There is no economically meaningful racial gap in house price appreciation conditional on avoiding a distressed sale. This finding implies that the gap in average realized returns is not generated by differential neighborhood-level appreciation but rather by the incidence of distressed sales and the price penalties they entail.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How much of the racial gap in housing returns can be explained by observable homeowner characteristics such as income, family structure, and leverage?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Controlling for county and purchase year fixed effects reduces the raw Black-White and Hispanic-White unlevered returns gaps from 2.3 to 1.5 and 1.6 percentage points, respectively. Additionally controlling for income, family structure (gender and co-applicant status), and leverage reduces the gap by a further ~0.3 percentage points. Even among the ostensibly safest group — high-income couples with low leverage — the Black-White (Hispanic-White) gap in unlevered returns is 0.7 (0.5) percentage points. Among high-leverage, low-income, single-male homeowners the gap is 1.8 (1.7) percentage points. Gaps exist within every demographic subgroup, and neighborhoods (Census tract fixed effects) explain roughly half of the remaining gap for Black homeowners and one-third for Hispanic homeowners, but substantial residual gaps persist even within neighborhood.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What observable credit risk characteristics explain racial differences in mortgage default?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Raw racial gaps in 90-day mortgage delinquency are 2.6 percentage points (Black-White) and 1.8 percentage points (Hispanic-White). Controlling for purchase year and county reduces these to 2.2 and 1.6 percentage points respectively. Controlling for family structure, income, leverage, and credit score reduces the gaps to 0.98 and 0.94 percentage points — implying that observable characteristics explain approximately 55% and 41% of the Black-White and Hispanic-White default gaps respectively. Credit scores contribute the most explanatory power among these controls, while mortgage contract characteristics (a test of differential lender treatment) contribute negligibly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the evidence that liquidity and income instability — factors not observable to lenders — explain the residual racial gap in default?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Survey data from SIPP reveal that median liquid wealth (bank accounts, stocks, bonds) for Black and Hispanic homeowners is only $2,400 and $5,400 respectively, while minority homeowners are 2–4 percentage points more likely to transition to unemployment conditional on pre-unemployment income. In SIPP mortgage delinquency regressions, controlling for liquidity, job loss in the prior year, and income reduces the Black-White coefficient by about 30% and the Hispanic-White coefficient by about 41% (and 29% and 70% respectively when also controlling for income level, current loan-to-value, and family composition). In administrative data using ARM payment resets as liquidity shocks, a 10% increase in monthly payments raises 90-day default by 3.0 percentage points for White homeowners, 4.5 percentage points for Black homeowners, and 7.1 percentage points for Hispanic homeowners after 12 months. This excess sensitivity is not substantially reduced by controlling for credit scores, income, or leverage — indicating that the liquidity risk of minority homeowners is largely unobservable to lenders at origination.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Is there evidence that strategic default explains higher minority distress rates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No meaningful evidence supports strategic default as a driver of excess minority distress. Using quasi-experimental variation in ex-post leverage from diverging option ARM indices (following Gupta and Hansman 2022), the paper finds large causal impacts of leverage on default but no evidence that these impacts are larger for minority homeowners. Separate survey evidence from the NSMO shows a statistically insignificant Black-White difference of 0.05 percentage points (s.e. 0.65) in agreement that &amp;ldquo;it is okay to default if it is in the borrower&amp;rsquo;s financial interest&amp;rdquo; (relative to a White mean of 6.1%). The absence of larger leverage-driven default responses combined with the presence of larger payment-shock-driven responses points specifically to liquidity — not strategic behavior — as the relevant mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the evidence for information frictions contributing to excess minority homeownership risk?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Black homeowners in the NSMO report future house price expectations that are 0.07 standard deviations more optimistic than White homeowners, conditional on past price experiences, yet realized house price growth in the subsequent two years is actually 1.1 percentage points lower for Black homeowners. Although Black homeowners are 2.8 percentage points more likely to report past personal financial crises, their stated expectations about future financial crises are similar to those of White homeowners — despite 90-day default rates that are 2.5 percentage points higher in the first two years post-origination. Black homeowners also report income growth expectations 0.3 standard deviations higher than White homeowners, while SIPP and CPS data show minorities are more likely to experience income losses. These patterns of overoptimistic expectations relative to realized outcomes are consistent with information frictions causing high-risk minority households to suboptimally select into homeownership.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How much of the racial gap in distress can be attributed to the early-2000s credit supply expansion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper identifies the expansion as concentrated in portfolio loans and privately securitized mortgages, which are distinct from GSE/FHA mortgages that did not exhibit a comparable supply increase. Between the 2002 and 2006 purchase cohorts, the Black-White gap in distressed sales rose by 6.2 percentage points overall but only 2.4 percentage points among GSE/FHA loans. A decomposition using this contrast attributes 61.5% of the overall 6.2-percentage-point increase to the credit supply expansion. Analogously, 52.0% of the 12.2-percentage-point increase in the Hispanic-White gap between 2002 and 2006 is attributed to credit supply. Within-race decompositions find that credit supply accounts for 42%, 30%, and 35% of the increase in distress relative to 2002 for Black, Hispanic, and White homeowners respectively, for mortgages originated 2004–2006.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the implied contribution of the returns gap to the racial wealth gap?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Using a simple wealth accumulation model calibrated to PSID data on first-time homebuyer rates and home values (average first home for Black households: $142,587; for White households: $208,621), the paper finds an estimated Black-White gap in housing wealth at retirement of $169,389 versus an observed PSID gap of $182,771. Equalizing housing returns would reduce this gap by 37%. In contrast, equalizing first-time purchase rates alone reduces the gap by only about 1%, because low returns nullify the benefit of purchasing earlier. Equalizing both returns and purchase rates reduces the gap by 49%. Housing wealth in the primary home constitutes 43% of total net wealth for the average retirement-age Black household in PSID, implying the returns gap explains a quantitatively large share of the overall racial wealth gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What do the COVID-19 pandemic forbearance experience and mortgage modification evidence imply for policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Quasi-experimental estimates using servicer-level variation in modification propensity show that mortgage modifications cause economically large increases in housing returns for Black, Hispanic, and White homeowners alike, suggesting that since minority homeowners are more likely to become distressed, expanded modifications would disproportionately benefit them. The pandemic experience provides macroeconomic confirmation: after the onset of COVID-19 forbearance and foreclosure moratoria in March 2020, the Black-White gap in unlevered returns and distressed sales fell by approximately half, while the Hispanic-White gap (whose pre-pandemic distress convergence was already underway) remained comparatively stable. Administratively, Black homeowners who default are already 3–7 percentage points more likely than observationally similar White homeowners to receive a modification, even controlling for neighborhood and servicer, suggesting servicers partially internalize the larger distressed-sale discounts in minority neighborhoods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Are neighborhood-level factors — specifically distressed-sale price discounts from illiquid real estate markets — important for explaining racial heterogeneity in returns conditional on distress?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Yes. Using MLS data on median days-on-market as a measure of real estate market thickness, the paper shows that distressed sale discounts are substantially larger in less-liquid markets, with discounts experienced by Black homeowners approximately 13 percentage points lower in the least-thick markets relative to the thickest. Black and Hispanic homeowners are disproportionately likely to realize distressed sales in thin markets. Regular sale returns are not affected by market thickness. This establishes that neighborhood market illiquidity is a second-order channel through which neighborhood-level factors contribute to the racial gap — primarily by amplifying the severity of distressed sale penalties rather than by affecting ordinary house price appreciation.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Distressed sale&lt;/strong&gt;: In this paper&amp;rsquo;s usage, an ownership spell that ends in either a foreclosure (where a lender seizes and sells the property after payment default) or a short sale (where the lender allows the homeowner to sell for less than the outstanding mortgage balance without holding the homeowner liable for the deficiency). Distressed sales are the central mediating factor between race and housing returns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unlevered return&lt;/strong&gt;: The annualized ratio of sale price to purchase price, capturing property-level capital gains without reference to the financing structure. Computed as (P_sale / P_purchase)^(1/T) − 1. Does not capture leverage amplification or limited homeowner liability in foreclosure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Levered return (internal rate of return)&lt;/strong&gt;: The discount rate that sets the net present value of all homeowner cash flows to zero, including down payment at purchase; monthly payments (principal, interest, taxes, insurance, maintenance); implicit rent; and the net proceeds at sale (property sale price minus outstanding principal balance, subject to a floor of $0.01 capturing limited liability). This measure accounts for both the amplifying effect of leverage on gains and the homeowner&amp;rsquo;s limited liability in underwater foreclosures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distressed sale frequency versus severity&lt;/strong&gt;: The two distinct components through which distressed sales generate racial gaps. Frequency refers to the higher probability that a minority homeowner&amp;rsquo;s ownership spell terminates in a distressed sale. Severity refers to the larger price discount at distressed sale that minority homeowners experience, concentrated in neighborhoods with illiquid real estate markets. The paper&amp;rsquo;s decomposition finds frequency is the dominant margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Unobservable liquidity risk&lt;/strong&gt;: Default risk arising from insufficient liquid wealth (cash, bank deposits, liquid securities) and income instability that is not captured by credit scores or other characteristics observable to lenders at mortgage origination. The paper&amp;rsquo;s ARM-reset event study shows this risk generates excess minority default responses even conditional on credit score and income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Information friction (overoptimism)&lt;/strong&gt;: The tendency of minority homeowners, particularly Black homeowners, to hold expectations about future house prices, personal financial crises, and income growth that are more optimistic than their realized outcomes and than observationally similar White homeowners&amp;rsquo; expectations. The paper uses this to explain why high-risk minority households do not self-select out of homeownership despite the high cost of distressed sales.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Credit supply channel&lt;/strong&gt;: The mechanism by which the early-2000s expansion of private securitization and portfolio lending — channels that exhibited substantially greater growth among Black and Hispanic borrowers than among White borrowers — contributed to increased rates of minority distress during the Great Recession. Distinguished from GSE/FHA channels that did not exhibit comparable credit expansion and serve as the counterfactual.&lt;/p&gt;</description></item><item><title>Random Utility with Unobservable Alternatives</title><link>https://macropaperwarehouse.com/papers/random-utility-with-unobservable-alternatives/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/random-utility-with-unobservable-alternatives/</guid><description>&lt;p&gt;This paper addresses a foundational gap in the random utility model (RUM) literature: existing axiomatizations by Falmagne (1978) and McFadden and Richter (1990) assume that whenever a menu is observed, the choice frequencies of all alternatives in that menu are observable. In practice, the choice frequencies of some alternatives are routinely missing. The paper derives the full testable implications of the random utility model for such incomplete datasets, delivering a finite, nonredundant system of linear inequalities as a necessary and sufficient condition for RU-rationalizability.&lt;/p&gt;
&lt;p&gt;The empirical backdrop motivates the formal contribution directly. In transportation choice (bus, train, walk, drive), revenue data from transit operators can reveal the market shares of bus and train but not walking or driving without survey data. In school choice, governments observe enrollment across public schools but may lack data on private school selections. In market-share analysis, private firms may not disclose sales figures. In each case, researchers typically aggregate all unobservable alternatives into a single &amp;ldquo;outside option,&amp;rdquo; treating it as one composite choice. The paper calls this the outside option approach and establishes its formal limitations.&lt;/p&gt;
&lt;p&gt;The main theorem (Theorem 3.2) states that an incomplete dataset is RU-rationalizable if and only if two conditions hold jointly. The first is the classical nonnegativity of Block-Marschak (BM) polynomials, which appears in Falmagne&amp;rsquo;s original characterization and requires that certain inclusion-exclusion quantities over observed choice frequencies are nonneg. The second is a novel balance condition: for any &amp;ldquo;essential test collection&amp;rdquo; of choice sets, a specific net signed sum of BM polynomials across observable arcs crossing the boundary of that collection must be nonneg. This second condition captures the informational content that is lost when unobservable alternatives are collapsed. The characterization is nonredundant in the strong sense that removing any single inequality from either condition produces a strictly weaker system — every inequality is independently binding for some dataset.&lt;/p&gt;
&lt;p&gt;The limitation of the outside option approach is made precise by Proposition 3.5: the reduced dataset formed by the outside option approach is RU-rationalizable whenever the original incomplete dataset satisfies condition (i) and condition (ii) for singleton essential test collections only. Consequently, if the original data violates condition (ii) for non-singleton essential test collections — meaning it is not genuinely RU-rationalizable — the outside option approach will nonetheless return a verdict of rationalizability. False acceptance of the random utility model is therefore possible under the outside option approach.&lt;/p&gt;
&lt;p&gt;The proofs translate the rationalizability problem into a network flow problem on the hypercube lattice over subsets of alternatives, following Fiorini (2004). Each path from the empty set to the full alternative set corresponds to a linear order (ranking). The key methodological innovation is applying a feasibility theorem from network flow theory — specifically a generalization drawing on the max-flow min-cut theorem — to derive the necessary and sufficient conditions in the incomplete-data setting.&lt;/p&gt;
&lt;p&gt;The paper also provides an efficient algorithm for computing tight bounds on unobservable choice frequencies, formulated as a minimum-cost transshipment problem. Because the constraint matrix is totally unimodular (it is the incidence matrix of a network), the network simplex algorithm applies directly. Applied to a lottery-choice dataset from McCausland et al. (2020) — 141 participants each choosing from subsets of five lotteries, with choices made six times per choice set — the authors treat two of the five lotteries as unobservable and compare bound widths. Their method yields significantly tighter bounds than the outside option approach and, critically, correctly identifies that lottery 4 is more desirable than lottery 3 among the unobservable alternatives. The outside option approach yields identical trivial bounds for both lotteries and thus cannot distinguish their relative desirability at all.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question?
A: The paper asks: what are the testable implications of the random utility model when the choice frequencies of some alternatives are unobservable? The goal is a necessary and sufficient condition for RU-rationalizability under incomplete observation, along with a demonstration of what is lost when the standard outside option approach is used instead.&lt;/p&gt;
&lt;p&gt;Q: What is the random utility model and why is it the focus?
A: The random utility model posits a probability distribution over strict rankings of alternatives; each individual&amp;rsquo;s preferences correspond to one ranking. It is a cornerstone of discrete choice analysis in economics. Falmagne (1978) and McFadden-Richter (1990) characterized it under full observability of choice frequencies, making the extension to incomplete data a natural and practically important frontier.&lt;/p&gt;
&lt;p&gt;Q: What does &amp;ldquo;incomplete dataset&amp;rdquo; mean formally in this paper?
A: An incomplete dataset is a nonneg vector of choice frequencies satisfying: (i) for menus composed entirely of observable alternatives, frequencies sum to one; (ii) for menus that include at least one unobservable alternative, the sum of observable-alternative frequencies is at most one. The gap between the sum and one corresponds to the unobserved probability mass on unobservable alternatives.&lt;/p&gt;
&lt;p&gt;Q: What are Block-Marschak polynomials and why do they appear?
A: The Block-Marschak (BM) polynomial K(rho, D, x) is defined by inclusion-exclusion: it sums, with alternating signs, the choice frequency of alternative x over all supersets E of D. In Falmagne&amp;rsquo;s complete-data characterization, nonnegativity of all BM polynomials is necessary and sufficient for RU-rationalizability. In the incomplete-data setting, nonnegativity of BM polynomials remains necessary but is no longer sufficient.&lt;/p&gt;
&lt;p&gt;Q: What is the novel condition in Theorem 3.2 beyond BM nonnegativity?
A: Condition (ii) of Theorem 3.2 requires that for any &amp;ldquo;essential test collection&amp;rdquo; C of choice sets, the net observable outflow — the sum of BM polynomials on arcs leaving C minus the sum on observable arcs entering C — is nonneg. This balance condition captures the constraint that unobservable flow must be nonneg on every cut of the network corresponding to an essential test collection.&lt;/p&gt;
&lt;p&gt;Q: What makes the characterization nonredundant, and why does nonredundancy matter?
A: The characterization is nonredundant in the sense that for every individual inequality in conditions (i) and (ii), there exists an incomplete dataset that violates only that inequality and satisfies all others. This is established as part (b) of Theorem 3.2. Nonredundancy is essential for identifying precisely which inequalities the outside option approach discards: without it, some of the novel condition (ii) inequalities might be implied by others, and the argument that the outside option approach loses independent information would not hold.&lt;/p&gt;
&lt;p&gt;Q: What does the outside option approach actually discard?
A: Proposition 3.5 shows that the outside option approach retains only condition (i) (BM nonnegativity) and condition (ii) for singleton essential test collections. All condition (ii) inequalities corresponding to non-singleton essential test collections are discarded. Because the characterization is nonredundant, each discarded inequality is a genuinely independent constraint, meaning a dataset can violate any one of them while satisfying all others — including all conditions the outside option approach checks.&lt;/p&gt;
&lt;p&gt;Q: Can the outside option approach produce a false acceptance of the random utility model?
A: Yes. If the true incomplete dataset violates condition (ii) for some non-singleton essential test collection but satisfies all other conditions of Theorem 3.2 — including all conditions the outside option approach checks — then the original dataset is not RU-rationalizable, but the reduced dataset formed by collapsing unobservables into one outside option is RU-rationalizable. Researchers using the outside option approach would therefore erroneously conclude that the data-generating process follows a random utility model.&lt;/p&gt;
&lt;p&gt;Q: How is the problem translated into a network flow problem?
A: The authors build a directed network on the power set of alternatives, with arcs from D to D union {x} for each alternative x not in D, source at the empty set, and terminal at the full set X. Each source-to-terminal path corresponds to a unique linear order. A probability distribution over rankings corresponds to a flow, with flow conservation at interior nodes and total flow equal to one. The BM polynomial of an observable arc equals the required flow on that arc. Feasibility of this flow — guaranteed by a theorem generalizing max-flow min-cut — is equivalent to RU-rationalizability.&lt;/p&gt;
&lt;p&gt;Q: What is the algorithmic contribution for bounding unobservable choice frequencies?
A: The bounds problem is formulated as a minimum-cost transshipment problem on the same network. Because the constraint matrix is the incidence matrix of a network (totally unimodular), the network simplex algorithm applies and yields exact solutions efficiently. The algorithm produces tight upper and lower bounds for each unobservable choice frequency by optimizing the flow subject to all feasibility constraints from Theorem 3.2.&lt;/p&gt;
&lt;p&gt;Q: How does the paper demonstrate tighter bounds empirically?
A: The paper applies its method to a lottery stochastic choice dataset from McCausland et al. (2020), involving 141 participants choosing from subsets of five lotteries, with six repeated choices per choice set. The authors treat two of the five lotteries as unobservable. Their network-flow bounds are significantly tighter than the trivial bounds from the outside option approach. Specifically, their method correctly identifies that lottery 4 is more desirable than lottery 3 among the unobservable alternatives, a distinction the outside option approach cannot draw because it assigns identical trivial bounds to both lotteries.&lt;/p&gt;
&lt;p&gt;Q: What is the monotonicity-based lower bound for unobservable choice frequencies?
A: Under monotonicity (a weaker condition than full RU-rationalizability), the lower bound L(x*) for the choice frequency of unobservable alternative x* from menu D is the sum over observable alternatives a of the difference rho(D{x*}, a) minus rho(D, a), when D{x*} is in the domain. This lower bound is larger when removing x* from the menu substantially increases observable choice frequencies, indicating that x* was drawing demand away from observables and is therefore relatively desirable.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to McFadden-Richter (1990)?
A: McFadden and Richter (1990) allow for menus to be unobserved but require that when a menu is observed, all its alternative frequencies are observed — a distinct setup from the present paper. Their characterization also involves infinitely many inequalities and is redundant. The present paper&amp;rsquo;s characterization uses finitely many inequalities and is nonredundant, making it more tractable both theoretically and computationally.&lt;/p&gt;
&lt;p&gt;Q: What is the scope of the model regarding which alternatives are unobservable?
A: The paper focuses on the case where the set of unobservable alternatives X* is fixed and consistent across all menus: a given alternative is either always observable or always unobservable. The domain of choice sets D is assumed to be an upper set (if a menu is in D, all supersets are too). The paper does not handle cases where observability of an alternative varies by menu.&lt;/p&gt;
&lt;p&gt;Incomplete dataset: A nonneg vector of choice frequencies in which, for menus containing unobservable alternatives, the observable frequencies sum to at most one (not exactly one), with the residual mass attributable to unobservable alternatives.&lt;/p&gt;
&lt;p&gt;Block-Marschak (BM) polynomial: An inclusion-exclusion quantity K(rho, D, x) defined as the alternating-sign sum of rho(E, x) over all supersets E of D; its nonnegativity is the classical Falmagne condition for RU-rationalizability under complete observation.&lt;/p&gt;
&lt;p&gt;Essential test collection: A collection C of choice sets used to define the novel balance condition in Theorem 3.2; for each such C, the net observable outflow of BM polynomial values across the boundary of C must be nonneg for RU-rationalizability.&lt;/p&gt;
&lt;p&gt;Outside option approach: The empirical practice of aggregating all unobservable alternatives into a single composite &amp;ldquo;outside option,&amp;rdquo; so that all remaining choice frequencies sum to a value below one and the residual is assigned to that composite. This approach retains only a subset of the testable implications of the random utility model.&lt;/p&gt;
&lt;p&gt;Nonredundant characterization: A system of inequality conditions in which no single inequality is implied by the conjunction of all others; every inequality is independently binding for some dataset. This property is essential for identifying precisely which implications the outside option approach discards.&lt;/p&gt;
&lt;p&gt;Network flow representation: A directed network on the power set of alternatives (source: empty set, terminal: full set X) in which each source-to-terminal path encodes a linear order, flow conservation corresponds to probability conservation, and feasibility of a flow with prescribed values on observable arcs is equivalent to RU-rationalizability.&lt;/p&gt;
&lt;p&gt;Minimum-cost transshipment problem: The optimization problem used to compute tight bounds on unobservable choice frequencies; tractable via the network simplex algorithm because the constraint matrix is totally unimodular (the incidence matrix of a network).&lt;/p&gt;</description></item><item><title>Rationing by Race</title><link>https://macropaperwarehouse.com/papers/rationing-by-race/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/rationing-by-race/</guid><description>&lt;p&gt;Singh and Venkataramani ask whether resource scarcity causes discriminatory rationing of health care by patient race, with patient death as the starkest possible outcome of biased allocation decisions. They examine 107,221 inpatient admissions from 2015 to 2018 at two large urban academic teaching hospitals (each with over 500 beds) in a Southeastern U.S. city with a sizable Black population. Black patients accounted for 60% of admissions, were on average younger (52 vs. 59 years), more likely to be female (65% vs. 50%), and had similar comorbidity burdens and baseline in-hospital death rates (approximately 2% for both groups), but waited over two hours longer on average for an inpatient bed and were 27% less likely to be admitted to the ICU.&lt;/p&gt;
&lt;p&gt;The authors exploit quasi-exogenous hour-to-hour variation in hospital capacity strain — measured as the share of inpatient beds occupied at the hour of a patient&amp;rsquo;s arrival — which clinical and qualitative literature establishes is difficult to predict even day-to-day. Capacity strain is coded in hospital-specific deciles (beds filled ranged from 69–78% in decile 1 to 91–95% in decile 10). The core regression interacts patient race with strain decile, controlling for hospital-specific hour-of-day, day-of-week, month-of-year, and year fixed effects; physician-of-record fixed effects; and a rich vector of patient characteristics including Elixhauser comorbidity indices, insurance status, and vital signs. Identification rests on the assumption that strain at the hour of arrival is conditionally independent of unobserved patient characteristics correlated with race — an assumption validated through balance tests on demographics, comorbidities, vital signs, machine-learning-derived admission themes, and selective discharge patterns.&lt;/p&gt;
&lt;p&gt;The main finding is that in-hospital mortality rises for Black patients but not for White patients as hospitals approach capacity. At the tenth decile of strain, Black patients face a mortality rate 0.7 percentage points higher than White patients — a 47.6% relative increase over the 1.47% White mortality rate at the same decile. A pooled difference-in-differences estimate implies that approximately 15% of Black patient deaths at high strain (decile 10) would not have occurred had Black patients faced the same strain-mortality relationship as White patients (coefficient 0.0052, p = 0.025). This pattern is concentrated among patients with the greatest ex ante medical need as measured by above-median Elixhauser mortality index scores (a score with AUC of 0.92 for predicting in-hospital mortality) and, in qualitatively similar but less precisely estimated form, by abnormal vital signs at arrival.&lt;/p&gt;
&lt;p&gt;The authors identify wait time for an inpatient bed as the primary mechanism. At all levels of capacity strain, high-need Black patients wait longer than low-need White patients — a pattern the authors characterize as a striking inversion of any need-based allocation principle. Racial disparities in wait times widen further at the highest decile of strain, exactly mirroring the mortality pattern. As an additional, more suggestive mechanism, the authors analyze free-text clinical documentation (the Reason for Admission field) using descriptive text features (time to completion, character count, average word length), sentiment analysis (subjectivity and polarity scores via TextBlob), and adjective counts. Documentation for Black patients exhibits features consistent with lower provider effort at all strain levels — shorter notes, less time deferred to completion — and subjectivity of notes and adjective counts diverge further by race at the highest strain decile, with White patients receiving increasingly detailed and descriptive notes as strain rises.&lt;/p&gt;
&lt;p&gt;The findings are robust across sparse models (age, gender, hospital fixed effects only) through fully saturated specifications (DRG fixed effects, interactions of all controls with race and strain), and to replacing Elixhauser index composites with their 31 individual comorbidity components. The authors explicitly scope their findings to a pre-COVID-19 period (2015–2018), while noting that pandemic-era record capacity strain and racial disparities in health outcomes suggest de facto race-based rationing may have been far more severe during COVID-19.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why is the health care setting chosen?
A: The paper asks whether increasing resource scarcity causes discriminatory rationing on the basis of race in consequential, high-stakes real-world decisions. Health care is chosen because it is high-stakes (patient death is the outcome), has a long documented history of racial discrimination at both provider and system levels, and offers uniquely detailed time-stamped electronic health record data that enables identification from hour-to-hour variation in capacity strain — a finer temporal resolution than most prior work.&lt;/p&gt;
&lt;p&gt;Q: How is hospital capacity strain measured and what is the identifying variation?
A: Strain is measured as the total number of patients occupying inpatient beds at the specific hour of a patient&amp;rsquo;s arrival, converted into hospital-specific deciles. The first decile corresponds to 69–78% of beds filled and the tenth decile to 91–95%. The identifying variation is residual hour-to-hour fluctuation in this measure after removing hospital-specific hour-of-day, day-of-week, month-of-year, and year fixed effects, which absorbs all predictable capacity patterns. Clinical and qualitative evidence establishes that even day-to-day strain is difficult to anticipate, making hour-to-hour residual variation plausibly as-if random.&lt;/p&gt;
&lt;p&gt;Q: What are the main mortality findings, and how large are the racial disparities at peak strain?
A: At the tenth decile of capacity strain, Black patients face a mortality rate 0.7 percentage points higher than White patients, representing a 47.6% relative increase over the 1.47% White mortality rate at that decile. The pooled difference-in-differences estimate (comparing decile 10 to deciles 1–9) implies that approximately 15% of Black patient deaths at high strain would not have occurred if Black patients had the same strain-mortality relationship as White patients (coefficient 0.0052, p = 0.025). White patient mortality does not increase at high strain; if anything, small (imprecisely estimated) decreases appear at deciles 7–9.&lt;/p&gt;
&lt;p&gt;Q: Which patients drive the racial mortality disparity?
A: The disparity is concentrated among patients with above-median Elixhauser mortality index scores — the ex ante sickest patients. The Elixhauser Mortality Index has a predictive AUC of 0.92 for in-hospital mortality. At decile 10, high-need Black patients experience a sharp increase in mortality not seen for high-need White patients or for low-need Black patients. Qualitatively similar but less precisely estimated results appear when acute need is measured by abnormal vital signs at arrival, with the difference that the triple interaction (race × strain × high-need vitals) is not statistically significant, consistent with vital signs being noisier proxies for severity than the Elixhauser indices.&lt;/p&gt;
&lt;p&gt;Q: How do the authors validate the identifying assumption that strain is conditionally independent of patient composition by race?
A: They document five types of supporting evidence: (i) the distribution of Black and White patients across hours of arrival and across strain deciles is nearly identical; (ii) regressions of patient demographics, all five Elixhauser comorbidity measures, and five vital signs abnormalities on race × strain interactions show no significant differential selection by race at different strain levels; (iii) machine-learning (Latent Dirichlet Allocation) topic themes from free-text admission notes change similarly by strain for Black and White patients; (iv) there is no evidence of selective discharge to hospice care by race and strain, with point estimates running counter to the hypothesis; and (v) strain is computed at time of arrival to the hospital rather than time of admission to an inpatient bed, preserving exogeneity.&lt;/p&gt;
&lt;p&gt;Q: What is the primary identified mechanism for the mortality finding?
A: Wait time for an inpatient bed is the primary mechanism. Black patients experience greater increases in wait times as strain rises compared to White patients, with the clearest divergence at decile 10 — exactly mirroring the mortality pattern. More strikingly, at every decile of strain (including decile 1, when beds are most abundant), high-need Black patients wait longer for a bed than low-need White patients, implying that the disparity is not solely a product of logistical constraints but reflects ingrained factors in clinical protocols, likely including implicit or explicit provider bias.&lt;/p&gt;
&lt;p&gt;Q: What does the wait time evidence reveal about the role of medical need vs. race in allocation decisions?
A: At lower strain levels, low-need patients appropriately wait longer than high-need patients. However, at higher strain levels (deciles 8–10) this need-based gap almost entirely disappears, while the racial gap in wait times persists. The gap between high-need Black and low-need White patients is larger than the gap between high-need and low-need patients of the same race, meaning race is a stronger predictor of wait times than medical need. This pattern is consistent with the paper&amp;rsquo;s conceptual framework in which increasing strain reduces providers&amp;rsquo; ability to accurately assess medical need while increasing the weight assigned to racial identity.&lt;/p&gt;
&lt;p&gt;Q: How is provider effort measured and what are the findings?
A: Provider effort is inferred from features of free-text Reason for Admission documentation: time to completion, character count, average word length, TextBlob subjectivity and polarity scores, and adjective counts. Across all strain levels, Black patients&amp;rsquo; documentation exhibits features consistent with lower effort — shorter completion times (providers less likely to defer documentation for clinical tasks), shorter notes with fewer characters and shorter words. At the highest strain decile, subjectivity scores for Black patients&amp;rsquo; notes increase relative to White patients&amp;rsquo; (driven by both rising Black and falling White subjectivity), and White patients receive more adjectives as strain rises while Black patients&amp;rsquo; adjective counts do not increase. Polarity scores remain stable by race and strain.&lt;/p&gt;
&lt;p&gt;Q: What do the documentation patterns suggest about compensatory behavior by providers?
A: The authors speculate that providers may anticipate reduced care quality at high strain and compensate by becoming more conscientious with White patients — writing longer, more detailed, more descriptive notes as strain increases, and potentially exerting greater care effort correlated with these documentation improvements. This protective compensatory behavior appears substantially less pronounced or absent for Black patients, which the authors suggest may translate into the small imprecisely estimated decrease in White patient mortality at higher strain deciles. They explicitly characterize this interpretation as speculative and requiring further investigation.&lt;/p&gt;
&lt;p&gt;Q: How robust are the main mortality findings to specification choices?
A: The mortality findings hold across: (i) sparse models with only age, gender, and hospital/year fixed effects; (ii) linear probability and logistic models; (iii) models with DRG fixed effects to compare within-diagnosis; (iv) models interacting all control variables with patient race and strain; (v) models replacing the Elixhauser composite index with its 31 individual comorbidity components; and (vi) models additionally controlling for five individual abnormal vital sign indicators. Results are substantively unchanged across all these specifications.&lt;/p&gt;
&lt;p&gt;Q: What additional care intensity measures are examined and what do they show?
A: The authors also examine ICU admission, ICU length of stay, total inpatient length of stay, and inpatient charges. They find no strain-related racial disparities on these margins. However, they note that unconditionally (across all strain levels), Black patients receive fewer resources on average — they are 27% less likely to be admitted to the ICU. The authors treat these care intensity measures as harder to interpret because both over- and under-provision can harm patients, and thus view them as less informative for their research question.&lt;/p&gt;
&lt;p&gt;Q: What conceptual framework guides the empirical predictions?
A: The framework models providers as assessing perceived medical need N&lt;em&gt;ij(t) = Ni × exp(−γ × S(t)), where the parameter γ captures the diminishing ability to accurately assess true need as strain S(t) rises. Simultaneously, the racial weight R&lt;/em&gt;ij(t) = Ri × φ(S(t)) increases with strain through the parameter φ(S(t)). When γ = 0 and φ = 0, allocation is race-neutral and need-based. When both parameters are positive, increasing strain simultaneously degrades need assessment and amplifies reliance on racial identity in allocation decisions — the paper&amp;rsquo;s core prediction, which is confirmed empirically.&lt;/p&gt;
&lt;p&gt;Q: How do the findings relate to the COVID-19 pandemic?
A: The data predate COVID-19 (2015–2018). The authors argue that pandemic conditions — record hospital capacity strain (especially in hospitals serving Black patients), extreme provider burnout, and documented racial disparities in health access — suggest race-based rationing may have been considerably more severe during COVID-19. The paper also contextualizes its findings within the pandemic-era debate over whether explicit race-based triage protocols were ethical or legal, arguing that de facto rationing by race appears to occur in ordinary care settings under typical stressors irrespective of that normative debate.&lt;/p&gt;
&lt;p&gt;Q: What policy interventions do the authors suggest?
A: The authors propose: increasing provider awareness of implicit biases; developing new algorithms to improve triage decisions for high-mortality-risk patients who might otherwise be overlooked; correcting existing care algorithms with documented racial bias; building provider peer networks to reduce biased treatment decisions; supporting patient self-advocacy; improving capacity prediction systems (as spurred by COVID-19); and creating load-shifting protocols and inter-hospital transfer networks to prevent resources from being stretched beyond capacity during high-strain periods.&lt;/p&gt;
&lt;p&gt;Capacity strain: The state of a hospital when a high share of inpatient beds are occupied, measured here at the hour of patient arrival as hospital-specific deciles of bed occupancy (ranging from 69–78% full at decile 1 to 91–95% full at decile 10); the paper&amp;rsquo;s primary measure of resource scarcity.&lt;/p&gt;
&lt;p&gt;Rationing by race: The paper&amp;rsquo;s term for the phenomenon whereby, as resource scarcity deepens, allocation decisions increasingly reflect patient racial identity rather than medical need — a form of discriminatory rationing that the authors distinguish from explicit (de jure) race-based triage and document as de facto practice.&lt;/p&gt;
&lt;p&gt;Perceived need (N*): In the paper&amp;rsquo;s conceptual framework, the provider&amp;rsquo;s assessment of a patient&amp;rsquo;s medical need, which deviates from true need Ni by the factor exp(−γ × S(t)) as strain S(t) increases; captures the provider team&amp;rsquo;s diminishing ability or willingness to accurately assess true medical need under cognitive and resource constraints.&lt;/p&gt;
&lt;p&gt;Racial weight (R*): The weight assigned to a patient&amp;rsquo;s racial identity in allocation decisions, modeled as Ri × φ(S(t)), where the function φ is increasing in capacity strain; represents the potential for discrimination — from implicit bias, algorithmic bias, reduced patient advocacy, or provider-patient social distance — to intensify as strain rises.&lt;/p&gt;
&lt;p&gt;Wait time inversion: The condition, documented throughout the paper, where high-need Black patients wait longer for an inpatient bed than low-need White patients at every decile of capacity strain, including decile 1 when resources are most abundant — inverting the normative principle that greater medical need should yield faster access to care.&lt;/p&gt;
&lt;p&gt;Elixhauser Mortality Index: A widely validated composite score of patient comorbid conditions used to predict in-hospital mortality (AUC = 0.92); used in this paper as the primary measure of chronic medical need, with patients split at the median into relatively sick (above median) and relatively healthy (below median) groups.&lt;/p&gt;
&lt;p&gt;Provider effort (inferred): An unobserved construct inferred in this paper from features of free-text clinical documentation in the Reason for Admission field, including time to note completion, character count, average word length, TextBlob subjectivity and polarity scores, and adjective counts; features argued to reflect how much attention, detail, and care a provider invested in documenting — and by extension, in assessing — a patient&amp;rsquo;s condition.&lt;/p&gt;</description></item><item><title>Redistributive Policy Shocks and Monetary Policy with Heterogeneous Agents</title><link>https://macropaperwarehouse.com/papers/redistributive-policy-shocks-and-monetary-policy-with-heterogeneous-agents/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/redistributive-policy-shocks-and-monetary-policy-with-heterogeneous-agents/</guid><description>&lt;h2 id="layer-1--what-this-paper-finds-and-why-it-matters"&gt;Layer 1 — What this paper finds and why it matters&lt;/h2&gt;
&lt;p&gt;Governments in emerging market and developing economies (EMDEs) routinely intervene in agricultural markets — procuring grain and redistributing it to poor households — in response to food price shocks or expanded food security mandates (India&amp;rsquo;s 2013 National Food Security Act is the leading example). This paper asks how monetary policy should respond to such &amp;ldquo;redistributive policy shocks,&amp;rdquo; and what those shocks do to sectoral inflation and the consumption distribution between rich and poor households. The authors build a two-sector (agriculture with flexible prices; manufacturing with sticky prices), two-agent (Ricardian rich; rule-of-thumb poor) New Keynesian DSGE model, calibrated to India, that extends the TANK framework of Debortoli and Gali (2018) to two sectors and introduces explicit government procurement and redistribution. They show that a redistributive policy shock raises aggregate inflation and the output gap but also raises poor consumption and aggregate welfare, because the subsidy-in-kind effect on poor households more than offsets the decline in rich consumption and the inflationary pressure. They further show that consumer heterogeneity matters for whether monetary policy responses to various shocks raise or reduce aggregate welfare: in models with a flexible-price agricultural sector, contractionary monetary shocks produce larger deflation but smaller declines in real consumption relative to one-sector benchmarks, so the welfare cost of monetary contraction is lower than standard NK models imply.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on MPRA working paper (No. 101651, July 2020). The extracted PDF text was truncated before the calibration, impulse response, and welfare sections; quantitative parameter values and figure-level results are not available in the source text used here. AI-assisted, human review pending. See the linked original for authoritative claims.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="layer-2--in-depth"&gt;Layer 2 — In depth&lt;/h2&gt;
&lt;h3 id="q1-what-is-a-redistributive-policy-shock-and-how-does-the-model-capture-it"&gt;Q1. What is a &amp;ldquo;redistributive policy shock&amp;rdquo; and how does the model capture it?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A redistributive policy shock is a sudden increase in the fraction of government-procured agricultural output that is redistributed to poor households.&lt;/strong&gt; In the model, the government taxes rich (Ricardian) households via lump-sum levies each period, uses those proceeds to purchase agricultural output at the open market price, and then redistributes a fraction φ_t of the procured quantity to poor households as an in-kind subsidy. The remaining fraction goes into a buffer stock. The shock to redistribution is modeled as a positive innovation to φ_t (AR(1) process), distinct from a shock to the procurement quantity Y^P_{A,t} itself. Because the in-kind transfer reduces the effective price paid by the poor for agricultural goods — the poor face an effective price of (1 − λ_t)P_{A,t} — the redistributive shock operates as a proportional price subsidy on agriculture consumption for the poor, even though the quantity is what the government directly controls.&lt;/p&gt;
&lt;h3 id="q2-what-are-the-two-types-of-households-and-how-do-they-differ"&gt;Q2. What are the two types of households and how do they differ?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Rich households are Ricardian (forward-looking) and hold one-period risk-free bonds; poor households are rule-of-thumb consumers who do not save.&lt;/strong&gt; Both types consume goods from both the agricultural and manufacturing sectors according to Cobb-Douglas indices, but they differ in three ways. First, poor households have a higher budget share for agricultural goods (δ_P &amp;gt; δ_R), consistent with Engel&amp;rsquo;s Law. Second, the inverse of the intertemporal elasticity of substitution (IES) is higher for the poor (σ_P &amp;gt; σ_R), following Atkeson and Ogaki (1996) estimates for Indian household data; this means the poor are less willing to substitute consumption across time and respond differently to real wage changes. Third, rich households have both labor income and dividend income from monopolistically competitive manufacturing firms, while poor households have only labor income.&lt;/p&gt;
&lt;h3 id="q3-what-happens-to-inflation-and-consumption-when-a-positive-agricultural-productivity-shock-hits"&gt;Q3. What happens to inflation and consumption when a positive agricultural productivity shock hits?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A positive agricultural productivity shock leads to a decline in inflation, a rise in the output gap, and higher consumption for both rich and poor households.&lt;/strong&gt; Because the agriculture sector has flexible prices, a positive productivity improvement lowers agricultural prices immediately, reducing the terms of trade (the relative price of agriculture to manufacturing). Aggregate CPI inflation falls. The rise in agricultural output increases real income for both household types, raising consumption and aggregate welfare. These dynamics are compared to the Aoki (2001) representative-agent two-sector benchmark.&lt;/p&gt;
&lt;h3 id="q4-what-are-the-aggregate-and-distributional-effects-of-a-positive-redistributive-policy-shock"&gt;Q4. What are the aggregate and distributional effects of a positive redistributive policy shock?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;A procurement-and-redistribution shock raises aggregate inflation, the output gap, and poor consumption, while lowering rich consumption; aggregate welfare rises because the redistribution effect dominates.&lt;/strong&gt; The mechanism has two parts. First, the government procures additional agricultural output at the market price, financed by higher lump-sum taxes on the rich; this reduces rich consumption. Second, the redistributed grain lowers the effective price of the agricultural good for the poor, raising poor consumption through a &amp;ldquo;redistribution effect.&amp;rdquo; Because poor households spend a higher share of income on the agricultural good than rich households, and because the poor receive a fraction of their agricultural consumption for free, market demand for the agricultural good in the open market is less than it would be without redistribution. Consequently, the inflationary impact of the procurement shock is substantially lower in the two-agent model than in the Aoki representative-agent model (where there is no redistribution to dampen open-market demand).&lt;/p&gt;
&lt;h3 id="q5-how-does-consumer-heterogeneity-alter-the-transmission-of-a-contractionary-monetary-policy-shock"&gt;Q5. How does consumer heterogeneity alter the transmission of a contractionary monetary policy shock?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;In models with a flexible-price agricultural sector, a contractionary monetary shock produces a larger deflation but a smaller decline in consumption and smaller welfare losses than in single-sector or representative-agent benchmarks.&lt;/strong&gt; A rise in the nominal interest rate induces intertemporal substitution of consumption, reducing aggregate demand and the aggregate price level. This deflationary effect is amplified when a flexible-price sector is present alongside the sticky-price sector, because agricultural prices can fall immediately. However, the same flexible-price sector means that real interest rates rise by less (compared to an all-sticky-price economy), so the reduction in rich and poor consumption is also smaller. The paper compares this to three benchmarks: the simple one-sector one-agent NK model (Gali 2015, Chapter 3), the Debortoli-Gali (2018) one-sector two-agent model, and the Aoki (2001) two-sector one-agent model. The welfare losses from monetary contraction are lower in the two-sector models (the authors&amp;rsquo; framework and Aoki&amp;rsquo;s) than in the one-sector models.&lt;/p&gt;
&lt;h3 id="q6-how-does-the-model-differ-from-its-three-main-benchmark-frameworks"&gt;Q6. How does the model differ from its three main benchmark frameworks?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The model merges the two-sector production structure of Aoki (2001) with the TANK distributional structure of Debortoli and Gali (2018), and adds explicit government procurement and redistribution — none of the benchmarks have all three features.&lt;/strong&gt; Relative to Aoki: the paper adds poor/rich heterogeneity, different IES parameters, and the government redistribution mechanism. Relative to Debortoli-Gali: the paper adds an agricultural flexible-price sector and the redistribution shock, and assumes complete markets (Debortoli-Gali assumes incomplete markets; their model is treated as an approximation). Relative to Gali (2015, Chapter 3): the paper adds both a second sector and household heterogeneity. The three differences from the simple NK benchmark in the Dynamic IS and NKPC equations are: (i) the presence of a terms of trade channel, (ii) heterogeneous agents with different IES parameters and budget shares, and (iii) redistribution policy that shifts the effective price index of the poor.&lt;/p&gt;
&lt;h3 id="q7-what-role-do-terms-of-trade-play-in-the-models-transmission-mechanism"&gt;Q7. What role do terms of trade play in the model&amp;rsquo;s transmission mechanism?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The terms of trade between agriculture and manufacturing (T_t = P_{A,t}/P_{M,t}) is a central transmission variable that affects both aggregate consumption and inflation.&lt;/strong&gt; Aggregate CPI inflation can be decomposed as π_t = δ_R·π_{A,t} + (1 − δ_R)·π_{M,t} = δ_R·ΔT_t + π_{M,t}, so movements in the terms of trade feed directly into headline inflation. Total agricultural and manufacturing consumption both depend on T_t, rich consumption C_{R,t}, and poor consumption C_{P,t} through equations (22) and (23). A rise in the terms of trade (higher relative agricultural prices) makes the consumption basket of the poor more expensive because they spend a larger share of income on agricultural goods, inducing them to reduce agricultural purchases. This terms-of-trade channel is absent from one-sector benchmarks and is a key reason the paper&amp;rsquo;s framework generates different aggregate dynamics than Debortoli-Gali.&lt;/p&gt;
&lt;h3 id="q8-what-is-the-welfare-metric-used-and-what-is-the-papers-welfare-conclusion"&gt;Q8. What is the welfare metric used, and what is the paper&amp;rsquo;s welfare conclusion?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Welfare is defined to depend on aggregate consumption in the standard fashion, and the paper&amp;rsquo;s central welfare conclusion is that consumer heterogeneity matters for whether monetary policy responses to shocks raise or reduce aggregate welfare.&lt;/strong&gt; For a redistributive policy shock, aggregate welfare rises despite higher inflation, because the gain in poor consumption (driven by the subsidy) exceeds the loss in rich consumption and the distortionary cost of inflation. For a contractionary monetary shock, welfare losses are smaller in the two-sector framework than in single-sector frameworks, because the flexible-price agricultural sector moderates the real interest rate increase and limits the consumption decline. The paper does not report specific numerical welfare loss figures in the portion of text available in this source extract.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Redistributive policy shock&lt;/strong&gt; : in this paper&amp;rsquo;s usage, a positive shock to the fraction (φ_t) of government-procured agricultural output that is redistributed to poor households as an in-kind subsidy; distinct from a procurement level shock. Modeled as an AR(1) process on φ_t.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;TANK (Two-Agent New Keynesian) model&lt;/strong&gt; : a tractable heterogeneous-agent NK framework with exactly two household types — Ricardian (forward-looking, hold bonds) and rule-of-thumb (hand-to-mouth, do not save) — that Debortoli and Gali (2018) showed provides a good approximation to the aggregate dynamics of a full HANK model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rule-of-thumb (hand-to-mouth) consumers&lt;/strong&gt; : households that maximize static utility subject to a static budget constraint, consuming all current income each period. In this model, the poor are rule-of-thumb consumers with only labor income and no bond holdings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective price of agriculture for the poor&lt;/strong&gt; : P&amp;rsquo;&lt;em&gt;{A,t} = (1 − λ_t)P&lt;/em&gt;{A,t}, where λ_t is the fraction of poor agricultural consumption provided for free via the redistributive subsidy. The poor face a price index P&amp;rsquo;&lt;em&gt;t = {(1−λ_t)P&lt;/em&gt;{A,t}}^{δ_P} · P_{M,t}^{1−δ_P}, which differs from the rich price index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Terms of trade (TOT)&lt;/strong&gt; : T_t = P_{A,t}/P_{M,t}, the relative price of the agricultural good to the manufactured good. Changes in TOT affect the sectoral composition of consumption for both household types and transmit through the Dynamic IS and NKPC equations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intertemporal elasticity of substitution (IES)&lt;/strong&gt; : 1/σ_K for household type K. The paper assumes σ_P &amp;gt; σ_R (poor have lower IES than rich), following Atkeson and Ogaki (1996) estimates for Indian household data; this differential drives asymmetric labor supply responses to real wage changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procurement shock&lt;/strong&gt; : a shock to the quantity Y^P_{A,t} of agricultural output the government procures each period, modeled as a separate AR(1) process from the redistribution-fraction shock. Together, the procurement level and redistribution fraction determine the total subsidy received by poor households.&lt;/p&gt;</description></item><item><title>Regulatory Competition in the US Life Insurance Industry</title><link>https://macropaperwarehouse.com/papers/regulatory-competition-in-the-us-life-insurance-industry/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/regulatory-competition-in-the-us-life-insurance-industry/</guid><description>&lt;p&gt;This paper quantitatively assesses the consequences of jurisdictional competition in the US life insurance industry, an $8 trillion market. The central question is whether competition between state regulators over capital requirements for captive reinsurance subsidiaries — a form of regulatory competition — increases or decreases total surplus, and by how much.&lt;/p&gt;
&lt;p&gt;US life insurers are regulated at the state level. Since the early 2000s, states have competed to attract captive reinsurance subsidiaries (captives) by setting lower capital requirements on these entities. The externality structure is asymmetric: the captive state earns tax revenues on liabilities transferred to captives and sets their capital requirements, but bears default costs only for policyholders in its own state. Consumer states bear default costs for their own residents even when those policies have been transferred to an out-of-state captive. This mismatch between who sets capital requirements and who bears default costs creates the externality that drives the race-to-the-bottom dynamic studied in the paper.&lt;/p&gt;
&lt;p&gt;The empirical setting draws on a novel dataset covering 66 US life insurers from 2005 to 2020, with total liabilities of $1.9 trillion (approximately 25% of the sector). Data sources include NAIC filings via S&amp;amp;P, CompuLife pricing data, A.M. Best ratings, SEC filings, and state legislative records. The author assembles novel data on captives&amp;rsquo; capital levels from SEC filings, Iowa Insurance Department captive financial statements, and insurer reinsurance exhibits.&lt;/p&gt;
&lt;p&gt;Three motivating empirical findings ground the structural model. First, captives materially reduce insurers&amp;rsquo; capital: in 2019, risk-based capital ratios are 23% lower on average after accounting for captives, with the median insurer&amp;rsquo;s capital declining 24%, and this translates into an increase in 10-year default probability from 1.0% to 2.9%. Second, states&amp;rsquo; capital requirements are the primary determinant of where insurers locate captives: a 1 percentage point increase in a state&amp;rsquo;s captive capital rate is associated with a 1.6 percentage point decrease in the probability an insurer chooses that state (against a 1.1 percentage point unconditional probability), and this holds when insurers switch states over time as capital requirements change. Tax rates, geographic proximity, and amenities are not meaningfully correlated with captive location choice. Third, a difference-in-differences design exploiting Regulation XXX (effective January 1, 2000), which raised capital requirements differentially across product term lengths, shows that 30-year term products — which faced the largest capital requirement increases — experienced price increases averaging 10.3% relative to 10-year term products, with quantities declining monotonically for longer-term products, consistent with an inward supply shift.&lt;/p&gt;
&lt;p&gt;The paper develops a structural model of the insurance market with imperfectly competitive insurers, endogenous default following a Leland (1994) framework, discrete choice consumer demand (Berry, 1994), and state regulators who set captive capital rates to maximize a weighted objective over tax revenues, default costs, consumer surplus, and producer surplus. Regulators deviate from a utilitarian social planner in two ways: they are state-based (generating competition and default externalities) and face agency frictions (captured by welfare weights that differ from unity). The demand side implies an average price elasticity of 2.4. The regulator side reveals that state regulators are willing to trade $1 of default costs against $3.5 of tax revenues and $0.59 of consumer surplus — both diverging from the social planner&amp;rsquo;s equal weighting.&lt;/p&gt;
&lt;p&gt;The main counterfactual finding is that eliminating competition by federalizing insurance regulation would cause regulators to raise capital requirements by 19% (3 percentage points), reducing expected default costs by $2.4 billion while lowering consumer surplus by $880 million, for a net total surplus gain of $1.5 billion. Regulator utility would increase by $3.3 billion in equivalent tax revenues. Because regulators over-value consumer surplus relative to default costs, competition exacerbates rather than counteracts their agency frictions, making competition unambiguously welfare-reducing in the baseline. A social planner would set capital requirements even higher than a federal regulator. On distribution, large states such as California and New York gain most from federalization (they bear substantial default costs), while Vermont — the largest captive state by market share — loses because it would forfeit captive tax revenues. Unilateral bans are found to have limited equilibrium consequences: a New York ban on captive use by insurers selling in New York would achieve only 23% of the national default cost reduction that federalization achieves, and a ban on captives domiciled in Vermont would achieve only 10%, as insurers would redirect captives to other states.&lt;/p&gt;
&lt;p&gt;Q: What is a captive reinsurance subsidiary and why do states compete to attract them?
A: A captive is a wholly-owned subsidiary of a life insurance holding company that reinsures policies written by the operating company, moving liabilities off the operating company&amp;rsquo;s balance sheet. Captive states earn tax revenues on liabilities transferred to captives and can set their own capital requirements on those entities, which are lower than the uniform NAIC standards applied to operating companies. Because captives are taxed by the state where they are domiciled — not the consumer&amp;rsquo;s state — captive states can earn tax revenues on policies sold elsewhere, incentivizing competition through lower capital requirements to attract insurers.&lt;/p&gt;
&lt;p&gt;Q: What is the default externality at the core of this paper&amp;rsquo;s argument?
A: When an insurer defaults, the shortfall on policies sold to consumers in a given state is borne by that state&amp;rsquo;s guaranty fund and consumers, regardless of where the captive holding those liabilities is domiciled. So Vermont, as the captive state, sets the capital requirement on liabilities transferred from (for example) Massachusetts policyholders, but does not bear the default cost on those Massachusetts policies. This means Vermont internalizes only the default cost on its own consumers, leading it to set capital requirements lower than it would if it bore the full default cost — a classic externality.&lt;/p&gt;
&lt;p&gt;Q: How large is the effect of captives on insurers&amp;rsquo; capital levels?
A: Using novel data on captives&amp;rsquo; actual balance sheets, the author finds that in 2019, the size-weighted average risk-based capital ratio of sample insurers is 23% lower after consolidating captives into the operating company&amp;rsquo;s balance sheet. The median insurer&amp;rsquo;s capital ratio decreases by 24%. In terms of default risk, this adjustment corresponds to an increase in the 10-year default probability from 1.0% to 2.9% based on historical insurer default rates.&lt;/p&gt;
&lt;p&gt;Q: What is the state of competition among captive domiciles in the data?
A: Twenty-two states had passed laws allowing captives as of the sample period, with the set of competing states largely stabilizing after 2013. The market is moderately concentrated: the top five states (Vermont, Arizona, Delaware, Iowa, and South Carolina) account for 80% of all captive liabilities, and the Herfindahl-Hirschman Index is 0.20. Vermont has maintained its position as the largest captive state throughout the period.&lt;/p&gt;
&lt;p&gt;Q: What evidence shows that capital requirements — rather than taxes or other factors — drive captive location choice?
A: In a linear probability model of captive location with insurer-year fixed effects, a 1 percentage point increase in a state&amp;rsquo;s captive capital rate is associated with a 1.6 percentage point decrease in the probability that an insurer chooses that state (versus a 1.1 percentage point unconditional probability). Captive tax rates are not meaningfully correlated with location choice, consistent with federal tax laws prohibiting the use of reinsurance to reduce tax liabilities. A changes-on-changes specification confirms that insurers are more likely to shift their captives to states that lower their capital requirements over time.&lt;/p&gt;
&lt;p&gt;Q: How does the Regulation XXX natural experiment identify the supply-side effect of capital requirements on insurance prices?
A: Regulation XXX, effective January 1, 2000, increased reserve requirements for operating companies on a mechanical basis tied to policy term length, with longer-term products facing larger increases. Using a difference-in-differences design at the insurer-product-month level with insurer-product and month fixed effects, the paper finds that products with larger capital requirement increases experienced larger price increases immediately after the regulation took effect. Thirty-year term products experienced price increases averaging 10.3% relative to 10-year term products (the reference group) within three months. Quantities also declined monotonically for longer-term products, confirming an inward shift of the supply curve rather than a demand shift.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated regulator welfare weights, and what do they imply about agency frictions?
A: Normalizing the weight on tax revenues to 1, the paper recovers that regulators value $1 of default costs as worth $0.29 (implying $3.5 of tax revenues trades off against $1 of default costs) and value consumer surplus at $0.59 per dollar. Because the social planner sets all weights equal to 1, these estimates show regulators over-weight tax revenues and consumer surplus relative to default costs. The higher weight on consumer surplus is consistent with political backlash from consumers facing high insurance prices.&lt;/p&gt;
&lt;p&gt;Q: What is the total surplus effect of eliminating regulatory competition through federalization?
A: Federalizing insurance regulation — modeled as a single federal regulator setting a uniform capital rate while holding fixed regulatory frictions — would lead regulators to raise captive capital requirements by 19% (3 percentage points) to internalize the default externality. Expected default costs would fall by $2.4 billion. However, higher capital requirements would raise insurance prices and reduce consumer surplus by $880 million. The net effect is a total surplus increase of $1.5 billion. Regulator utility (in equivalent tax revenues) would increase by $3.3 billion.&lt;/p&gt;
&lt;p&gt;Q: Would eliminating both competition and regulatory frictions (i.e., a social planner) produce a different outcome than just federalizing?
A: In the baseline estimates, a social planner would set capital requirements even higher than a federal regulator, because regulators&amp;rsquo; agency frictions lead them to under-weight default costs relative to consumer surplus, pushing capital requirements below the socially optimal level even absent competition. Competition further exacerbates these frictions by providing an additional incentive to lower capital rates. Thus, in the baseline, competition unambiguously decreases total surplus. The paper also reports results under alternative assumptions, providing a &amp;ldquo;menu&amp;rdquo; for policymakers that maps different assumptions about regulators&amp;rsquo; frictions to quantitative welfare statements.&lt;/p&gt;
&lt;p&gt;Q: What distributional consequences across states explain why federalization has not been adopted?
A: Federalization would benefit large states such as California and New York most, because those states bear substantial default costs on large volumes of policies sold to their consumers. States with large captive market shares, primarily Vermont, would be made worse off because they would lose captive tax revenues. These predicted gains and losses align with actual state policy positions: New York has called for a national ban on captives, California forbids insurers from setting up captives there, and Vermont has been the most aggressive state in attracting captive domiciles.&lt;/p&gt;
&lt;p&gt;Q: How effective are unilateral state bans as an alternative to federal coordination?
A: The paper estimates that a unilateral ban by New York on insurers selling in New York from using captives would achieve only 23% of the national default cost reduction that full federalization would achieve. A unilateral ban on captives domiciled in Vermont — the largest captive state — would achieve only 10% of federalization&amp;rsquo;s default cost reduction, because insurers would simply relocate their captives to other states that still allow them. This finding underscores the importance of cross-state coordination for meaningful regulatory reform.&lt;/p&gt;
&lt;p&gt;Q: What does the model&amp;rsquo;s demand estimation imply about consumer sensitivity to insurance prices?
A: The discrete choice demand model estimated on state-level sales, prices, and product characteristics implies an average price elasticity of demand of 2.4 for life insurance products. This elasticity disciplines the quantitative impact of capital requirements on product markets through their effect on insurance prices.&lt;/p&gt;
&lt;p&gt;Q: How does the paper recover regulators&amp;rsquo; objective functions?
A: The author uses the revealed preferences of state regulators, exploiting regulators&amp;rsquo; utility maximization first-order conditions and performing numerical perturbations around those conditions to calibrate the welfare weights (lambdas) on each component of the regulators&amp;rsquo; utility function. This approach recovers regulators&amp;rsquo; tradeoff weights from their observed policy choices — specifically their captive capital rate decisions — without directly observing regulators&amp;rsquo; preferences.&lt;/p&gt;
&lt;p&gt;Captive reinsurance subsidiary: A wholly-owned subsidiary of a life insurance holding company that reinsures liabilities from the operating company. Unlike operating companies, captives are regulated by the state in which they are domiciled (the captive state) under that state&amp;rsquo;s own capital requirements, which are typically lower than the uniform NAIC standards. Captives allow insurers to reduce their overall capital requirements by allocating liabilities to the captive.&lt;/p&gt;
&lt;p&gt;Default externality: The mismatch between who sets capital requirements for captives (the captive state) and who bears default costs when an insurer fails (the consumer&amp;rsquo;s state and its guaranty fund). Because the captive state bears default costs only for its own residents — not for residents of states where the insurer also sells — it has an incentive to set lower capital requirements than it would if it internalized the full default cost, leading to an externality on other states.&lt;/p&gt;
&lt;p&gt;Risk-based capital ratio (adjusted for captives): The author&amp;rsquo;s measure of insurer capitalization after consolidating the captive&amp;rsquo;s balance sheet with the operating company&amp;rsquo;s. This adjusted ratio is lower than the statutory risk-based capital ratio that ignores captives, by 23-24% in the 2019 sample, and translates into meaningfully higher default probabilities (from 1.0% to 2.9% over 10 years).&lt;/p&gt;
&lt;p&gt;Regulatory agency frictions: Deviations of state regulators&amp;rsquo; objective functions from a utilitarian social planner&amp;rsquo;s, captured by welfare weights (lambdas) on each component of the regulator&amp;rsquo;s utility. In the paper&amp;rsquo;s estimates, regulators over-weight tax revenues ($3.5 of tax revenues per $1 of default costs) and consumer surplus ($0.59 per $1 of default costs) relative to the social planner&amp;rsquo;s equal weighting, consistent with political economy pressures from consumers and revenue incentives.&lt;/p&gt;
&lt;p&gt;Captive capital rate: The state-level capital requirement on captives, defined empirically as the sum of capital divided by the sum of liabilities of all captives in the state each year. Higher values represent more stringent requirements. The mean in the sample is 4% with a standard deviation of 3%, and captive capital rates are lower on average than operating company capital rates.&lt;/p&gt;
&lt;p&gt;Race to the bottom: The dynamic under which competition between state regulators leads each state to set lower capital requirements than it would absent competition, in order to attract captive tax revenues, resulting in a collectively worse equilibrium with higher default risks. The paper finds this outcome in the baseline: competition lowers capital requirements by 19% (3 percentage points) relative to a federal regulator.&lt;/p&gt;
&lt;p&gt;External financing frictions: The costs insurers face in raising equity capital, modeled as a per-dollar cost theta on required capital. These frictions create the supply-side channel through which capital requirements affect insurance prices: higher capital requirements raise insurers&amp;rsquo; marginal costs, leading to higher prices and lower quantities, as documented in the Regulation XXX natural experiment.&lt;/p&gt;</description></item><item><title>Religion, Education, and the State</title><link>https://macropaperwarehouse.com/papers/religion-education-and-the-state/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/religion-education-and-the-state/</guid><description>&lt;p&gt;This paper studies how Indonesia&amp;rsquo;s Islamic education sector responded to one of the largest state-driven mass schooling expansions in history — the SD INPRES program (Sekolah Dasar Presidential Instruction) launched in 1973 — and whether that program achieved its secular nation-building objectives. The research question is three-part: Did Islamic schools enter or exit markets where the state built more primary schools? How did religious school choice shift across cohorts? And did the program advance secular identity formation among exposed individuals?&lt;/p&gt;
&lt;p&gt;The empirical setting is Indonesia in the 1970s onward. Under SD INPRES, the government used windfall oil revenues to build more than 61,000 primary schools between 1973 and 1980, allocating construction across districts proportional to the non-enrolled primary-school-age population. Because Islamic schools were historically more prevalent in underserved areas, this rule produced a strong positive correlation between SD INPRES intensity and pre-existing Islamic school density — the same markets where the state expanded were precisely those with the greatest Islamic education presence.&lt;/p&gt;
&lt;p&gt;The authors use several novel data sources: administrative registries covering nearly 220,000 secular and 160,000 Islamic schools with establishment dates; six rounds of the National Socioeconomic Survey (Susenas) from 2012–18; the Indonesia Family Life Survey (IFLS, 1993–2014); a 2018–19 curriculum timetable registry (SIAP) covering nearly 20% of madrasa; and a 2016 political/religious attitudes survey. Identification relies on difference-in-differences (DID) exploiting cross-district variation in SD INPRES intensity, the synthetic DID approach of Arkhangelsky et al. (2021) for robustness to violations of parallel trends, and a staggered village-level event study using the Borusyak et al. (2024) estimator.&lt;/p&gt;
&lt;p&gt;The main findings are as follows. First, Islamic schools did not exit markets where the state expanded — they entered in greater numbers. A one standard deviation increase in SD INPRES construction led to 1.4 additional Islamic school entries per district above a mean of 1.9 per district in 1972. Entry was competitive at the primary level, where new madrasa (MI) entered at twice the baseline annual rate in the years immediately following INPRES construction, and strategic at the secondary level, where Islamic junior secondary schools (MTs) peaked 6–9 years after INPRES entry as graduates sought continued education. The Islamic sector financed this expansion through waqf (inalienable religious endowments), informal taxation (infaq, zakat), and revenues from a concurrent rice price spike; entry responses were stronger in villages with above-median waqf endowments and above-median potential rice yields.&lt;/p&gt;
&lt;p&gt;Second, rather than converging toward secular curricula, newly established Islamic schools in high-INPRES districts devoted more time to religious content. Each additional SD INPRES is associated with a 1.2 percentage point increase in the religious curriculum share among newly created madrasa, with increases of 1.3 and 2.4 percentage points at the primary and junior secondary levels respectively — the latter equaling 82% of the cross-school standard deviation. Some of this increase came at the expense of Pancasila/civic education and national language instruction.&lt;/p&gt;
&lt;p&gt;Third, while SD INPRES reduced Islamic primary school enrollment by roughly 7%, it increased overall Islamic school attendance: each additional SD INPRES increased the likelihood of attending any Islamic school by approximately 5%, as demand for secondary education outweighed substitution at the primary level. Female students exhibited stronger secondary-level demand effects, amplified in districts with a concurrent state ban on the Islamic veil in public schools.&lt;/p&gt;
&lt;p&gt;Fourth, SD INPRES did not advance its ideological objectives. In the 1977 and 1982 elections, Golkar&amp;rsquo;s vote share fell and the Islamic PPP&amp;rsquo;s rose by 0.5–1.0 percentage points per SD INPRES school in high-INPRES districts. Among exposed cohorts, SD INPRES did not increase Pancasila proficiency, national language use at home, or support for secular governance, but did increase Arabic literacy by approximately 3% per additional SD INPRES. Exposed cohorts also prayed more frequently, fasted more during Ramadan, gave more to charity, and expressed greater pilgrimage intentions. These religious patterns were transmitted to children of exposed cohorts, who were more likely to attend Islamic schools themselves.&lt;/p&gt;
&lt;p&gt;Q: What was the allocation rule for SD INPRES and why did it create confrontation with Islamic schools?
A: Presidential Instruction No. 10/1973 allocated school construction across districts proportional to the non-enrolled primary-school-age population in 1971. Because Islamic schools historically served underserved populations, this rule meant the state built more schools precisely where Islamic education was most prevalent. The paper shows graphically and in Table 1 that the number of SD INPRES schools built is strongly correlated with the pre-existing stock of Islamic schools, conditional on district population and enrollment.&lt;/p&gt;
&lt;p&gt;Q: How large was the Islamic sector&amp;rsquo;s entry response to SD INPRES at the district level?
A: In the standard DID specification (Table 2, panel a), a one standard deviation increase in SD INPRES schools led to 0.013 more Islamic schools per district-year per 1,000 children, equivalent to 1.4 additional Islamic school entries in the average district relative to a mean of 1.9 Islamic schools per district in 1972. The synthetic DID (panel b) delivers positive and slightly larger estimates, indicating the result is not an artifact of diverging pre-trends.&lt;/p&gt;
&lt;p&gt;Q: What was the timing of the Islamic sector entry response at the village level?
A: Using the Borusyak et al. (2024) estimator on a balanced panel from 1960 to 1999, the paper finds (Figure 4) that INPRES construction is followed by a jump in Islamic school entry. Primary madrasa (MI) entered at twice the baseline annual rate in the years immediately following INPRES construction and this elevated rate persisted for six years before reverting to baseline. Islamic junior secondary entry (MTs) peaked around years 6–9 after SD INPRES construction, consistent with newly graduated primary students seeking continued schooling.&lt;/p&gt;
&lt;p&gt;Q: How did the Islamic sector finance its expansion?
A: The sector relied on waqf endowments (inalienable religious land assets), informal faith-based contributions (infaq), and obligatory alms (zakat). Fortuitously, the initial year of SD INPRES coincided with a large spike in the global price of rice, Indonesia&amp;rsquo;s main agricultural commodity, boosting harvest revenues channeled through informal Islamic taxation. Table 3 shows that entry responses were significantly stronger in villages with above-median waqf endowments and above-median potential rice yields, and these heterogeneous effects did not arise in non-INPRES periods or for non-Islamic private schools. Survey data from 2007–13 further show higher rates of informal taxation in villages with Islamic schools built during this period.&lt;/p&gt;
&lt;p&gt;Q: Did Islamic schools converge toward secular curricula under competitive pressure from SD INPRES?
A: No. Table 4 shows that madrasa established in high-INPRES districts after 1972 devote more time to religious content, not less. Each additional SD INPRES is associated with a 1.2 percentage point increase in the share of classroom time devoted to religious subjects among newly created Islamic schools, with increases of 1.3 percentage points at the primary level and 2.4 percentage points at the junior secondary level — the latter equal to 82% of the cross-school standard deviation. Similar patterns hold for Arabic instruction, and the junior secondary increase comes partially at the expense of Pancasila/civic education and national language instruction.&lt;/p&gt;
&lt;p&gt;Q: Did curriculum differentiation responses vary with local religious ideology?
A: Yes. Appendix Table A.14 shows a stronger curriculum differentiation response in markets with greater historical support for conservative Islam, proxied by Islamic political party vote shares in the 1950s elections. The paper also constructs a school-name-based predicted ideology index using a ridge shrinkage estimator and finds (Appendix Table A.15) that madrasa entering high-INPRES districts after the program onset have a more religious ideology on this measure.&lt;/p&gt;
&lt;p&gt;Q: What happened to the formalization of the Islamic sector?
A: Figure 5 and Appendix Table A.6 show that formal madrasa entry increased as a share of all new school entry, while informal Islamic schools (pesantren, diniyah) declined as a share of all new schools and all new Islamic schools. This formalization mirrors the organizational structure of state schools (primary-to-secondary progression), facilitating switching between public and religious schools and providing option value to moderate but still religious families. Crucially, the newly entering formal madrasa introduced more religious curriculum than incumbent madrasa, so formalization did not reduce religious instruction.&lt;/p&gt;
&lt;p&gt;Q: What was the net effect of SD INPRES on Islamic school attendance?
A: Table 5 shows that SD INPRES reduced the likelihood of attending Islamic primary school by roughly 7% per additional SD INPRES school but increased Islamic secondary attendance, with the net effect being a roughly 5% increase in the likelihood of attending any Islamic school (column 4). This finding holds in both DID and synthetic DID. The IFLS validation (Appendix Table A.18) confirms decreased Islamic elementary attendance and increased Islamic junior secondary attendance, consistent with the Susenas results.&lt;/p&gt;
&lt;p&gt;Q: How does selection into secondary education affect the religious schooling results?
A: The authors address selection using parametric (Heckman 1976) and semiparametric (Newey 2009) selection-correction procedures, using exposure to a 1960s pilot compulsory schooling program as an exclusion restriction. Table 6, panels (c) and (d), show that selection-adjusted estimates are broadly consistent with unadjusted estimates, with similar signs and magnitudes. The selection-corrected estimates approximately identify a local average treatment effect among compliers: those induced to attend elementary school were less likely to attend Islamic elementary; those induced to continue to secondary were more likely to attend Islamic secondary.&lt;/p&gt;
&lt;p&gt;Q: How did gender shape the effects of SD INPRES on religious school choice?
A: Table 7 shows that SD INPRES had more limited impacts on total schooling for women than men (consistent with Duflo 2001) but that the secondary-level demand effect toward Islamic schools was stronger for women. Table 8 shows that within high-INPRES areas, the SD INPRES-induced increase in Islamic secondary education is three times larger for women in districts with greater exposure to the 1982 state ban on the Islamic veil in public schools, and this differential is specific to Islamic schooling rather than total schooling.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES strengthen or weaken the secular ruling regime&amp;rsquo;s political standing?
A: It weakened it. Table 10 shows that in the 1977 and 1982 elections, Golkar&amp;rsquo;s vote share decreased and the Islamic PPP&amp;rsquo;s vote share increased in high-INPRES districts, in the range of 0.5–1.0 percentage points per SD INPRES school. This represents a 1.5–3.0% change in PPP vote share and a 0.5–1.0% change in Golkar vote share per standard deviation in SD INPRES intensity. The PPP gained most in areas where SD INPRES had the greatest potential to draw students away from Islamic schools.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES produce a secular ideological shift among exposed cohorts?
A: No. Table 11 shows that SD INPRES did not increase self-reported Pancasila proficiency, national language use at home, national language literacy, or attitudes in favor of secular governance. By contrast, Arabic literacy increased by approximately 3% per additional SD INPRES among exposed cohorts, indicating that Islamic schooling exposure rather than secular schooling drove literacy gains in that language.&lt;/p&gt;
&lt;p&gt;Q: Did SD INPRES increase religiosity among exposed cohorts?
A: Yes. Table 12 shows that SD INPRES increased prayer frequency, fasting during Ramadan, charitable giving, and pilgrimage intentions among exposed cohorts. These effects on prayer and fasting are stronger among women, consistent with the stronger shift toward Islamic secondary schooling found in Table 7. These outcomes are consistent with greater exposure to Islamic education increasing religiosity rather than the secular curriculum reducing it.&lt;/p&gt;
&lt;p&gt;Q: Were the effects on religious identity and Arabic literacy transmitted to the next generation?
A: Yes. Table 13 shows that SD INPRES increased Arabic literacy among the children of exposed cohorts, and that children of exposed cohorts were more likely to attend Islamic schools themselves. These intergenerational results confirm that the preference for Islamic education instilled during the SD INPRES era persisted into the next generation rather than converging toward secular norms over time.&lt;/p&gt;
&lt;p&gt;Q: What are the key robustness checks for the school entry results?
A: Several checks support causal interpretation. The Roth and Rambachan (2022) procedure finds no systematic pre-trends in the standard DID. Historical Podes data from 1980, 1983, 1990, and 1993 confirm the post-1973 increase in Islamic school entry, addressing survival bias in the 2019 registry. Results are robust to allowing differential trends in waqf endowments, Muslim population share, Islamic party vote shares, historical Arab immigration, Islamist insurgency, and Transmigration resettlement. The heterogeneous entry responses by waqf and rice yield do not appear in non-INPRES periods or for non-Islamic schools.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for the political economy of education reform more broadly?
A: The paper argues that state capacity to homogenize culture through education is limited when strong non-state actors can mobilize their own resources and provide differentiated alternatives. Rather than crowding out religious schools, state expansion triggered competitive entry, curriculum differentiation, and formalization in the religious sector, producing an equilibrium where both sectors expanded simultaneously with distinct clienteles. The findings imply that the long-run cultural effects of education programs cannot be evaluated without accounting for equilibrium responses by competing non-state providers.&lt;/p&gt;
&lt;p&gt;SD INPRES (Sekolah Dasar Presidential Instruction): Indonesia&amp;rsquo;s 1973 mass public primary school construction program, financed by oil windfalls, which built more than 61,000 elementary schools between 1973 and 1980 by allocating schools to districts proportional to the non-enrolled child population; the program&amp;rsquo;s explicitly secular nation-building objectives brought it into direct confrontation with the Islamic education sector.&lt;/p&gt;
&lt;p&gt;Waqf: Inalienable Islamic religious endowments — of land, agricultural assets, or other property — that under Islamic law can only be used for religious or charitable purposes and cannot be seized or repurposed by the state; in this paper, the pre-existing waqf base in a village serves both as a long-run financing mechanism for Islamic school construction and as an index of Islamic sector organizational capacity.&lt;/p&gt;
&lt;p&gt;Madrasa: Formal day Islamic schools operating at the same primary-to-secondary grade levels as secular state schools, teaching standard academic subjects alongside a religious curriculum (including Islamic law, doctrine, ethics, Qur&amp;rsquo;an, Arabic, and history of the Prophets) that averages 26% of total instruction hours; distinct from the more informal pesantren (boarding schools) and madrasa diniyah (afternoon Qur&amp;rsquo;anic study schools).&lt;/p&gt;
&lt;p&gt;Curriculum differentiation: The strategy by which newly entering madrasa in high-INPRES districts increased the share of classroom time devoted to religious and Arabic instruction rather than converging toward the secular state curriculum; measured as classroom hours devoted to Islamic subjects, Arabic, Pancasila/civic education, and national language instruction from 2018–19 SIAP timetable data.&lt;/p&gt;
&lt;p&gt;Pancasila: The official secular nationalist ideology of the Indonesian state, consisting of five principles (monotheism, humanitarianism, national unity, democracy, and social justice) intended to transcend ethnic and religious divisions; SD INPRES sought to transmit Pancasila through civic education and national language instruction as part of its homogenizing nation-building agenda.&lt;/p&gt;
&lt;p&gt;Synthetic difference-in-differences (SDID): The Arkhangelsky et al. (2021) estimator used throughout the paper, which reweights and matches pre-INPRES trends in Islamic school construction across high- and low-INPRES exposure districts to deliver estimates more robust than standard DID to violations of parallel trends; applied with a binary treatment indicator (districts above the 51st percentile in INPRES intensity).&lt;/p&gt;
&lt;p&gt;Formalization: The documented shift in the composition of the Islamic sector after SD INPRES, whereby formal madrasa (organized along the same grade-level progression as state schools) increased as a share of all new Islamic school entry while informal pesantren and diniyah declined as a share; interpreted as a competitive response that expanded parental option value without sacrificing religious instruction intensity.&lt;/p&gt;</description></item><item><title>Rent Guarantee Insurance</title><link>https://macropaperwarehouse.com/papers/rent-guarantee-insurance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/rent-guarantee-insurance/</guid><description>&lt;p&gt;Abramson and Van Nieuwerburgh study Rent Guarantee Insurance (RGI), a product in which an insurer pays the landlord on behalf of a tenant who defaults on rent due to a negative income or health expenditure shock, in exchange for a monthly premium proportional to rent. The central question is whether RGI can be designed to be both welfare-improving and financially viable, given the frictions of moral hazard and adverse selection.&lt;/p&gt;
&lt;p&gt;The authors develop a dynamic overlapping-generations equilibrium model of the rental market that features endogenous rent default, security deposits, evictions, and homelessness. Households face idiosyncratic persistent and transitory income risk, idiosyncratic medical expenditure risk, and aggregate (cyclical) income risk. Rental contracts are non-contingent, households face borrowing constraints, and housing is indivisible with a minimum quality floor. Landlords set deposits to break even in expectation given observed tenant characteristics. An insurance agency can offer RGI and must also break even in the long run. The model is calibrated to the United States at monthly frequency. Income dynamics are estimated from CPS data (1994–2023) and incorporate transitions among employment, unemployment, out-of-labor-force, and retirement states along with transfer income (unemployment insurance, disability, food stamps) and a progressive tax system. Key moments targeted by Simulated Method of Moments include a delinquency rate of 12.15% (model: 12.69%), average security deposit of $984 (model: $992, from approximately 500,000 Craigslist listings across the 100 largest MSAs), homelessness rate of 1.43% (model: 1.42%), and home-ownership rate of 63.6% (model: 63.2%).&lt;/p&gt;
&lt;p&gt;The model&amp;rsquo;s pre-RGI analysis establishes that persistent income shocks — not transitory shocks or medical shocks — are the primary driver of rent defaults. Default risk remains elevated for 3–6 months following a persistent shock, implying that short-duration RGI coverage is insufficient to prevent eviction; coverage must span multiple months.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s main policy experiments introduce RGI under different access rules and provider types. Unrestricted RGI (available to all renters) generates large welfare gains through improved risk-sharing and lower security deposits — because insured tenants pose less default risk, landlords lower deposit requirements — but is not financially viable for either a public or private insurer due to moral hazard and adverse selection. Even a public insurer that internalizes the fiscal savings from reduced homelessness cannot break even under unrestricted access.&lt;/p&gt;
&lt;p&gt;Restricting access changes the viability calculus sharply. A publicly provided RGI targeted to households at the bottom of the wealth distribution can achieve financial viability: these households are precisely those most prone to homelessness, so the reduction in homelessness expenses — which the public insurer internalizes — offsets the insurance deficit. This restricted public RGI generates substantial welfare gains for the most vulnerable households.&lt;/p&gt;
&lt;p&gt;A privately provided RGI must instead target higher-wealth renters to break even, because these households have low default risk (limiting claim payouts) while remaining sufficiently risk averse to pay the premium. The intersection of financial viability and take-up is small, yielding a limited target audience. The private program has minimal impact on housing insecurity, and the most vulnerable households derive little benefit. This pattern matches observed private RGI markets, where providers restrict access to renters in good financial condition.&lt;/p&gt;
&lt;p&gt;An RGI mandate — requiring all renters to purchase coverage — mitigates adverse selection by improving the pool of insured tenants, dramatically increasing financial viability and allowing the insurer to reduce the premium substantially while still breaking even. Mandated RGI is highly effective at preventing housing insecurity and generates welfare gains concentrated among the most financially vulnerable households.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are calibrated to U.S. income, medical, and housing market parameters as of 2019. The insurer&amp;rsquo;s borrowing cost matters: the public insurer faces lower, counter-cyclical municipal bond spreads, whereas private insurers face higher, pro-cyclical corporate spreads, which constrains the generosity of private contracts in recessions.&lt;/p&gt;
&lt;p&gt;Q: What is Rent Guarantee Insurance and how does it work mechanically in the model?
A: RGI is a contract under which a tenant pays a flat monthly premium equal to a fraction kappa of rent. When the insured tenant defaults, the insurer pays the landlord directly and deducts one period from the tenant&amp;rsquo;s stock of &amp;ldquo;insurance credit.&amp;rdquo; The tenant remains housed. Once insurance credit is exhausted, the insurer no longer covers defaults. The insurer sets the premium and the maximum coverage duration to break even in the long run.&lt;/p&gt;
&lt;p&gt;Q: Why do most rent defaults arise from persistent rather than transitory shocks?
A: The model shows that the renter population is disproportionately exposed to persistent unemployment and labor-force-exit spells, and that negative persistent income shocks are harder to smooth through savings than transitory ones. Default risk remains elevated for 3–6 months after a persistent shock but dissipates quickly after a transitory shock. This implies that RGI coverage periods of only a few months would fail to prevent eviction for the majority of defaulting tenants.&lt;/p&gt;
&lt;p&gt;Q: How does RGI affect security deposits in equilibrium?
A: Because landlords observe the tenant&amp;rsquo;s insurance status at lease signing and deposits are set to make landlords break even in expectation, insured tenants pose lower default risk and thus face lower upfront deposit requirements. This deposit reduction is a key welfare channel of RGI, as large deposits tie up a disproportionate share of poor households&amp;rsquo; wealth and price the most vulnerable out of housing entirely.&lt;/p&gt;
&lt;p&gt;Q: Why is unrestricted RGI financially non-viable even for the public insurer?
A: Unrestricted access induces both adverse selection — riskier households self-select into coverage — and moral hazard — insured households alter their default and savings behavior. These effects cause the insurer to run a persistent deficit. Even a public insurer that internalizes the fiscal cost savings from reduced homelessness cannot recoup enough to break even, implying that an unrestricted program would require an ongoing subsidy.&lt;/p&gt;
&lt;p&gt;Q: How does publicly provided restricted RGI achieve financial viability?
A: By targeting households at the bottom of the wealth distribution — precisely those most prone to homelessness — the public RGI program produces large reductions in homelessness. Because the public insurer internalizes the fiscal expenses associated with shelters, health services, and policing that accompany homelessness, these savings are passed through to the insurer and are sufficient to offset the insurance deficit. No such mechanism is available to a private insurer.&lt;/p&gt;
&lt;p&gt;Q: Why must private RGI target higher-wealth renters, and what are the consequences?
A: Private insurers must break even using only premium revenue, without access to homelessness cost savings. Higher-wealth renters have lower default probabilities, which limits claim payouts, while remaining sufficiently risk averse to demand coverage and pay the premium. The viable target audience is small given these competing requirements. As a result, private RGI covers few households, has minimal effect on housing insecurity, and provides essentially no benefit to the most vulnerable renters. This pattern is consistent with observed private RGI markets.&lt;/p&gt;
&lt;p&gt;Q: What are the two differences between public and private insurers in the model?
A: First, the public insurer internalizes the fiscal costs of homelessness (shelters, health services, policing), raising its net benefit from offering coverage. Second, the public insurer borrows at municipal bond spreads — which are lower than corporate spreads and counter-cyclical — whereas the private insurer faces higher, pro-cyclical corporate spreads. Counter-cyclical borrowing costs allow the public insurer to extend more generous coverage precisely when aggregate conditions deteriorate and claims rise.&lt;/p&gt;
&lt;p&gt;Q: How does an RGI mandate improve financial viability?
A: Mandatory enrollment forces all renters, including low-risk ones, into the insurance pool, which counteracts adverse selection. The expanded and higher-quality pool dramatically reduces per-insured expected claim costs, allowing the insurer to lower the premium substantially while still breaking even. The low-premium mandated policy is then both affordable and effective at preventing housing insecurity, with welfare gains concentrated among the most financially vulnerable renters.&lt;/p&gt;
&lt;p&gt;Q: What novel data does the paper use for calibration of security deposits?
A: The authors construct a dataset of approximately 500,000 Craigslist rental listings scraped across the 100 largest U.S. metropolitan statistical areas between November 2022 and March 2024 to measure the cross-sectional distribution of security deposits. The average deposit in this dataset is $984, which the model matches closely at $992. The data also reveal that the deposit-to-rent ratio is decreasing in house quality, reflecting the higher default risk of low-income renters in lower-quality units.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s definition of homelessness and what rate does the model match?
A: Homelessness is defined broadly to include sheltered homeless, unsheltered homeless (0.6% of households), and doubled-up families (0.83% of households), for a total of 1.43% of U.S. households. The model matches this rate closely at 1.42%.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s key implication for the design of housing policy?
A: The central implication is that financial viability and impact on housing insecurity are in tension for private insurers, and cannot both be achieved simultaneously. Only a publicly provided program that internalizes homelessness fiscal costs and faces counter-cyclical borrowing spreads can target the most vulnerable renters, break even, and materially reduce housing insecurity. Private RGI, while viable for a narrow segment, cannot substitute for public provision as a tool against homelessness.&lt;/p&gt;
&lt;p&gt;Q: How does RGI relate conceptually to rental assistance programs?
A: The paper distinguishes RGI from rental assistance on a structural basis: insurance contracts require tenants to pay premiums, making them potentially self-financing for private providers, whereas rental assistance is a net transfer that can never be self-financing. This conceptual distinction motivates studying whether RGI can be designed to eliminate the need for ongoing fiscal transfers, though the analysis ultimately shows that a public subsidy or mandate is required to serve the most vulnerable renters.&lt;/p&gt;
&lt;p&gt;Rent Guarantee Insurance (RGI): A contract under which an insured tenant pays a monthly premium equal to a flat percentage of rent; when the tenant defaults, the insurer pays the landlord directly, preserving tenancy, for a limited number of periods governed by the tenant&amp;rsquo;s stock of insurance credit.&lt;/p&gt;
&lt;p&gt;Insurance Credit: An endowment of periods of RGI coverage that households receive upon entry into the model; each time the insurer pays on behalf of a defaulting tenant, one unit of credit is consumed, and no further coverage is available once credit is exhausted.&lt;/p&gt;
&lt;p&gt;Housing Insecurity: In the paper&amp;rsquo;s framework, the set of outcomes — rent delinquency, eviction, and homelessness — arising from the combination of non-contingent rental contracts, borrowing constraints, and idiosyncratic or aggregate income and medical shocks.&lt;/p&gt;
&lt;p&gt;Security Deposit: An upfront payment from tenant to landlord, set by the competitive landlord to break even in expectation given the tenant&amp;rsquo;s characteristics and insurance status; a key channel through which RGI affects welfare by reducing the upfront cost barrier to obtaining housing.&lt;/p&gt;
&lt;p&gt;Moral Hazard (in RGI context): The change in a tenant&amp;rsquo;s default, savings, and housing choices induced by the presence of insurance coverage, which increases expected claim costs for the insurer relative to a world where behavior is held fixed.&lt;/p&gt;
&lt;p&gt;Adverse Selection (in RGI context): The tendency of renters with higher default risk to self-select into RGI when access is unrestricted, worsening the insurer&amp;rsquo;s risk pool and driving up expected payouts relative to premiums.&lt;/p&gt;
&lt;p&gt;Homelessness Externality: The fiscal costs borne by government — for shelters, health services, and policing — that accompany homelessness; the public insurer internalizes these costs, creating a net benefit from RGI that private insurers cannot capture.&lt;/p&gt;
&lt;p&gt;Counter-cyclical Borrowing Spread: The feature of public (municipal bond) financing whereby borrowing costs fall during recessions, allowing the public insurer to expand coverage when claims are highest; contrasted with private insurers&amp;rsquo; pro-cyclical corporate bond spreads that tighten precisely when aggregate conditions worsen.&lt;/p&gt;</description></item><item><title>Republican Support and Economic Hardship: The Enduring Effects of the Opioid Epidemic</title><link>https://macropaperwarehouse.com/papers/republican-support-and-economic-hardship-the-enduring-effects-of-the-opioid-epidemic/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/republican-support-and-economic-hardship-the-enduring-effects-of-the-opioid-epidemic/</guid><description>&lt;p&gt;This paper establishes a causal connection between the opioid epidemic and the political realignment toward the Republican Party in the United States from the mid-2000s through 2022. The authors—Carolina Arteaga and Victoria Barone—exploit rich geographic variation in Purdue Pharma&amp;rsquo;s initial marketing strategy for OxyContin, drawn from unsealed litigation records, to construct a quasi-exogenous measure of community-level exposure to the epidemic.&lt;/p&gt;
&lt;p&gt;The identification strategy rests on a documented feature of OxyContin&amp;rsquo;s 1996 launch: Purdue initially targeted the established cancer pain market—physicians and patients already using MS Contin—as an entry point into the much larger noncancer pain market. Areas with higher cancer mortality in 1996 received disproportionate pharmaceutical marketing, leading to outsized opioid prescription growth that spilled over from cancer patients to the broader population through shared physicians. The authors use 1996 commuting-zone (CZ) cancer mortality rates as a proxy for this initial targeting, interacted with year fixed effects in an event-study specification with CZ and state-year fixed effects. The sample covers 625 CZs across the continental United States from 1982 to 2022.&lt;/p&gt;
&lt;p&gt;The empirical chain runs through three stages. First, the instrument strongly predicts opioid supply: by 2012, a one-standard-deviation higher 1996 cancer mortality rate led to an additional 0.97 opioid doses prescribed per capita, 65% above the baseline mean. Second, the resulting epidemic caused measurable mortality and economic hardship. A one-standard-deviation increase in 1996 cancer mortality caused drug-induced deaths in 2017 to be 46% above the pre-epidemic average; by 2012 the same increase caused prescription opioid deaths to be 61% higher. Excess mortality was concentrated among individuals under age 55, with no significant effects for those aged 55 and older. The epidemic also raised disability applications: SSDI applications rose by 12% and SSI applications by 7.6% by 2012, effects that persisted through 2020. SNAP enrollment in exposed CZs was 8% higher by 2022, equivalent to a 0.14 standard deviation increase.&lt;/p&gt;
&lt;p&gt;Third, and centrally, the communities that endured these health and economic shocks shifted persistently toward the Republican Party. By the 2022 House elections, a one-standard-deviation increase in 1996 cancer mortality increased the Republican two-party vote share by 4.5 percentage points. Effects of similar magnitude appear in presidential elections (4.6 percentage points) and gubernatorial elections (4.3 percentage points). The vote-share shift is consistent across gender, age, race, and education, with no detectable change in voter turnout. The shift translates into actual seat gains: beginning in 2012, exposed areas consistently elected more Republican House members, moving the chamber&amp;rsquo;s roll-call voting in a more conservative direction. The effect is not driven by anti-incumbent sentiment—results hold regardless of which party held the seat at the time.&lt;/p&gt;
&lt;p&gt;The paper identifies three reinforcing mechanisms. First, the Republican Party repositioned itself during this period as the advocate of &amp;ldquo;forgotten America&amp;rdquo; and working-class economic hardship, a message that resonated acutely in opioid-devastated communities. Second, conservative-leaning newspapers covered the epidemic at higher rates, and their coverage tracked local mortality; liberal-leaning outlets showed no such correlation. Fox News covered opioid stories at 1.5 times the rate of CNN and 1.7 times the rate of MSNBC, emphasizing crime, trafficking, and cartels at twice the frequency of liberal outlets. Third, exposed communities expressed stronger preferences for Republican-favored policy responses: higher police presence, greater sense of safety around law enforcement, and lower support for marijuana legalization on state ballot initiatives.&lt;/p&gt;
&lt;p&gt;Pre-trend tests show no relationship between 1996 cancer mortality and outcomes before OxyContin&amp;rsquo;s launch. Out-of-sample exercises using 1976 cancer mortality find no analogous pattern in the pre-epidemic period (1982–1994). Placebo instruments based on unrelated causes of death yield null results. The baseline findings are robust to controlling for the China import shock, NAFTA, the 1994 Republican Revolution, the 2001 and 2008–2009 recessions, declining unionization, robot adoption, Fox News introduction, deaths of despair, and Southern and rural political realignment.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks whether the opioid epidemic causally increased Republican vote share in communities most severely affected by the crisis. It documents a causal chain from pharmaceutical marketing through drug mortality and economic hardship to political realignment, contributing the first causal estimate of a major public health crisis&amp;rsquo;s effect on partisan voting.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy, and why is 1996 cancer mortality a valid instrument?
A: Purdue Pharma explicitly targeted physicians in the cancer pain market at OxyContin&amp;rsquo;s 1996 launch, then used those established relationships to expand into the noncancer pain market. CZs with higher cancer mortality in 1996 received disproportionate marketing, generating differential opioid prescription growth unrelated to pre-existing political or economic trends. Pre-trend tests confirm no differential patterns before 1996, out-of-sample tests using 1976 cancer mortality find no relationship with pre-epidemic outcomes, and placebos using unrelated causes of death yield null results.&lt;/p&gt;
&lt;p&gt;Q: How strong is the first stage linking 1996 cancer mortality to opioid prescriptions?
A: The relationship between 1996 cancer mortality and opioid prescriptions is positive and statistically significant from 1998 through 2020. By 2012—the year prescription rates peaked nationally at 81.3 per 100 persons—a one-standard-deviation higher cancer mortality rate led to an additional 0.97 morphine-equivalent doses prescribed per capita, 65% above the baseline mean. CZs in the highest cancer mortality quartile experienced a 1,800% increase in grams of oxycodone per capita between 1997 and 2010, compared to less than half that in the lowest quartile.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on drug-induced mortality?
A: Drug-induced mortality (a broad measure covering deaths from prescription opioids, heroin, and fentanyl) rose continuously in exposed CZs after 1996. By 2017, a one-standard-deviation increase in 1996 cancer mortality caused drug-induced deaths to be 46% above the pre-epidemic average. By 2012, the same increase caused prescription opioid deaths specifically to be 61% higher relative to the pre-epidemic average. Excess mortality was concentrated among individuals under age 55, with no statistically significant effects for those aged 55 and older.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on disability program take-up?
A: Applications for Social Security Disability Insurance (SSDI) rose by 12% and Supplemental Security Income (SSI) applications rose by 7.6% by 2012 for a one-standard-deviation increase in 1996 cancer mortality. These effects persisted: SSDI recipients grew by 15% and SSI recipients by 3.2% by 2020 in similarly exposed CZs. The increases in disability were concentrated among individuals under age 55, paralleling the mortality effects.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on SNAP enrollment?
A: Exposed CZs showed a continuous increase in SNAP enrollment over two decades following the epidemic&amp;rsquo;s onset. By 2020, a one-standard-deviation increase in 1996 cancer mortality corresponded to an 8% increase in the share of the population receiving SNAP benefits, equivalent to 0.14 standard deviations. By 2022, the corresponding figure remains 8%, indicating persistent economic strain in exposed communities.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the political effects in House elections?
A: A one-unit increase in the 1996 cancer mortality rate yielded a 7.9-percentage-point increase in the 2022 Republican House vote share relative to 1996. Scaled to one standard deviation (0.58 units), this corresponds to a 4.5-percentage-point increase in the Republican two-party vote share by the 2022 midterms. The vote-share shift became statistically significant beginning in 2006, but only translated into consistent seat-level Republican gains starting in 2012.&lt;/p&gt;
&lt;p&gt;Q: When did opioid exposure start winning Republicans additional House seats?
A: Although the Republican vote share in exposed areas began increasing around 2006, actual seat flips did not become consistent until 2012. The paper explains this lag by noting that initial vote-share gains were concentrated in communities with low baseline Republican support, where additional votes did not immediately cross the winning threshold. Starting in 2010, median-baseline-Republican CZs also began shifting, enabling additional seat changes.&lt;/p&gt;
&lt;p&gt;Q: How large are the presidential and gubernatorial election effects?
A: In presidential elections, a one-standard-deviation increase in 1996 cancer mortality raised the Republican vote share by 4.6 percentage points. In gubernatorial elections, the same increase raised the Republican vote share by 4.3 percentage points after approximately six election cycles (corresponding to 2017–2020). These effects are described as comparable in magnitude to the difference in Republican vote share between the top and bottom quartiles of NAFTA vulnerability.&lt;/p&gt;
&lt;p&gt;Q: Does the political shift reflect increased polarization toward extremist candidates?
A: No. The paper finds no increase in the probability of electing candidates at the extremes of the Nokken-Poole ideological scale in any given election year. The ideological shift in the House composition arises from changes in which party wins seats rather than from the election of more extreme Republicans. Campaign donations to Republican candidates did not increase; rather, donations to Democratic candidates declined (in 2016, a one-standard-deviation increase in cancer mortality widened the Republican-Democrat donation gap by 0.44 standard deviations). The shift is interpreted as a change in voting preferences in previously Democratic-leaning areas, not heightened polarization.&lt;/p&gt;
&lt;p&gt;Q: Is the shift driven by anti-incumbent sentiment?
A: The authors test this by splitting the sample by the incumbent&amp;rsquo;s party at the time of each election and by redefining the outcome as the incumbent&amp;rsquo;s vote share. Neither exercise produces evidence of a systematic anti-incumbent response. The changes in Republican vote share are not statistically distinguishable based on whether the incumbent was a Republican or Democrat. If anything, after 2016 there is a slight increase in the likelihood of incumbents retaining their seats.&lt;/p&gt;
&lt;p&gt;Q: Where geographically are the Republican gains largest?
A: Using state-level treatment effects estimated from an in-differences model interacting cancer mortality with state-year indicators, the paper finds a strong positive correlation between the magnitude of the epidemic&amp;rsquo;s effect on economic hardship (measured by SNAP participation) and the magnitude of the Republican vote-share increase. This correlation is strongest with a lag: SNAP effects measured in 2006 are most predictive of vote-share shifts in 2022, indicating that deterioration in community economic fabric preceded and predicted the political realignment.&lt;/p&gt;
&lt;p&gt;Q: How did conservative and liberal media differ in covering the opioid epidemic?
A: Republican-leaning local newspapers covered the opioid epidemic more extensively than Democratic-leaning papers throughout the epidemic period, and their coverage tracked local opioid mortality rates; Democratic-leaning coverage showed no such correlation with local incidence. Fox News covered opioid stories at 1.5 times the rate of CNN and 1.7 times the rate of MSNBC. In terms of content, Republican-leaning newspapers showed 23% higher frequency of economic hardship keywords, 19% higher frequency of illegal activity and crime keywords, and 22% higher frequency of rehabilitation and treatment keywords relative to Democratic-leaning papers. Fox News emphasized crime, drug trafficking, and cartels at double the frequency of more liberal outlets.&lt;/p&gt;
&lt;p&gt;Q: How did voter policy preferences align with Republican versus Democratic platforms?
A: Using 2020 CCES data, the authors find that higher 1996 cancer mortality predicts a greater expressed preference for increasing the number of police officers on the street and a greater reported sense of safety around law enforcement—both consistent with the Republican Party&amp;rsquo;s law enforcement approach. Conversely, exposure to the epidemic predicts lower support for marijuana legalization on state ballot initiatives across 18 states from 2012 to 2023, indicating opposition to a key Democratic harm-reduction policy.&lt;/p&gt;
&lt;p&gt;Q: What role did political actors themselves play in driving the realignment?
A: Relatively little. The opioid epidemic was largely absent from House floor speeches until 2015 and from campaign advertising until 2020. Neither party took a clear legislative lead on the issue during the first two decades of the crisis. The authors interpret the political realignment as driven primarily by the Republican Party&amp;rsquo;s broader repositioning as the champion of working-class economic hardship and by differential media framing, rather than by active legislative competition over opioid policy.&lt;/p&gt;
&lt;p&gt;Q: What major confounds are ruled out?
A: The authors control for exposure to the China import shock, NAFTA, the 1994 Republican Revolution, the 2001 and 2008–2009 recessions, declining unionization, robot adoption, Fox News entry, deaths of despair (which include but are not limited to opioid deaths), and the political realignment of the South, rural areas, evangelicals, and the population over 65. Results remain robust across all these specifications. Placebo instruments using unrelated causes of death yield null results.&lt;/p&gt;
&lt;p&gt;Q: Could the vote-share effects be mechanically driven by opioid-related deaths removing Democratic voters from the electorate?
A: The authors perform a back-of-the-envelope calculation and estimate that even if all opioid-related deaths would have been Democratic votes, the mechanical effect on the Republican vote share is at most 0.22 percentage points relative to the observed 2020 vote share—far smaller than the estimated 4.5-percentage-point shift by 2022. The result is also inconsistent with a turnout mechanism, as voter turnout shows no meaningful change with epidemic exposure.&lt;/p&gt;
&lt;p&gt;Opioid epidemic exposure instrument: The paper measures community-level exposure to the opioid epidemic using cancer mortality rates in 1996, the year OxyContin launched. This instrument is grounded in Purdue Pharma&amp;rsquo;s documented marketing strategy of targeting the cancer pain market first; areas with more cancer patients received disproportionate pharmaceutical marketing, generating differential opioid prescription growth that extended well beyond cancer patients to the broader noncancer population through shared physicians.&lt;/p&gt;
&lt;p&gt;Commuting zone (CZ): The paper&amp;rsquo;s unit of geographic analysis, defined to capture local economic markets. There are 720 CZs in the US, encompassing all metropolitan and nonmetropolitan areas. The authors use 625 CZs with more than 20,000 residents, which account for more than 99% of all opioid deaths and total population.&lt;/p&gt;
&lt;p&gt;Two-party Republican vote share: The ratio of votes for Republican candidates to the total votes for both Republican and Democratic candidates in a given election. The paper tracks this measure for House, presidential, and gubernatorial elections from 1976 or 1982 through 2020 or 2022, depending on data availability.&lt;/p&gt;
&lt;p&gt;Drug-induced mortality: The paper&amp;rsquo;s broadest mortality measure, covering deaths from poisoning and medical conditions caused by legal or illegal drugs, including prescription opioids, heroin, and synthetic opioids such as fentanyl. It is distinguished from the narrower measures of prescription opioid deaths and all opioid deaths.&lt;/p&gt;
&lt;p&gt;Issue ownership: The political science concept, used in the paper to describe how the Republican Party repositioned itself during the epidemic period as the voice of working-class economic hardship, &amp;ldquo;forgotten America,&amp;rdquo; and &amp;ldquo;America left behind.&amp;rdquo; The paper contrasts this with Democratic ownership of income inequality and argues that Republican ownership of the hardship narrative made the party&amp;rsquo;s message especially salient in heavily opioid-affected communities.&lt;/p&gt;
&lt;p&gt;Path dependency in pharmaceutical marketing: Purdue&amp;rsquo;s strategy of concentrating initial OxyContin promotion in cancer-market areas, then later focusing on top-prescribing physicians (the highest three deciles of the distribution), meant that areas receiving high initial cancer-market promotion continued to receive disproportionate promotion as the company expanded to the noncancer market. This created a persistent targeting advantage for high-cancer CZs throughout the epidemic&amp;rsquo;s first wave.&lt;/p&gt;
&lt;p&gt;Nokken-Poole ideological measure: A roll-call-based measure of elected House members&amp;rsquo; ideology along the liberal-conservative dimension. The paper uses this measure to show that the epidemic shifted the composition of the House toward more conservative members, not by electing more extreme candidates in any given election, but by changing which party won seats over time.&lt;/p&gt;</description></item><item><title>Revolutionary Transition: Inheritance Change and Fertility Decline</title><link>https://macropaperwarehouse.com/papers/revolutionary-transition-inheritance-change-and-fertility-decline/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/revolutionary-transition-inheritance-change-and-fertility-decline/</guid><description>&lt;p&gt;Gay, Gobbi, and Goñi test Le Play&amp;rsquo;s (1875) hypothesis that the French Revolution contributed to France&amp;rsquo;s early fertility decline by abolishing impartible inheritance. In 1793, a series of decrees culminating in the Loi de Nivôse (January 6, 1794) abolished testamentary rights and imposed equal partition of assets among all children — partible inheritance — across France, overriding the mosaic of local customs and written laws that had governed inheritance in the Ancien Régime.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central argument is that this reform reduced the economic incentive to have children through indivisibility constraints in agricultural land. Under impartible inheritance, land passed to a single heir undivided, keeping plots above the subsistence productivity threshold even at high fertility. Under partible inheritance, each additional child fragments the land further, potentially pushing plots below the minimum productive size, so households face a strong incentive to limit fertility. A Stone-Geary production function with a minimum land threshold L̄ formalizes this mechanism: when landholdings fall in the binding range (L̄ &amp;lt; L &amp;lt; L̃), fertility is strictly higher under impartible than under partible inheritance.&lt;/p&gt;
&lt;p&gt;The authors construct the first complete map of inheritance rules across France&amp;rsquo;s 435 judicial districts as of 1789, classifying each along two dimensions: partible versus impartible, and whether women were included or excluded. This atlas draws on Brette&amp;rsquo;s (1904) Atlas des Bailliages and the Nouveau Coutumier Général (Bourdot de Richebourg 1724), covering 141 distinct customs. Treatment is defined as municipalities under impartible inheritance before 1793 whose system was altered by the reforms; control municipalities were already under partible inheritance.&lt;/p&gt;
&lt;p&gt;The main identification strategy is a difference-in-differences (DD) design comparing women with varying lengths of remaining fertile years after 1793 — from 0 for women aged 40+ at the reform to 25 for women aged 15 or younger — across treated and untreated municipalities. This is augmented by a regression-discontinuity difference-in-differences (RD-DD) design exploiting sharp discontinuities at judicial district borders. Two independent datasets are used: the Enquête Louis Henry (34,812 women in 39 rural municipalities, family-reconstitution method) and Geni.com crowdsourced genealogies (11,649 women across 2,966 locations after the Blanc 2023 horizontal restriction).&lt;/p&gt;
&lt;p&gt;Each additional fertile year of exposure to the 1793 reforms reduced completed fertility by approximately 1 percent. Over the full 25-year fertile cycle, this corresponds to a reduction of roughly 0.7 children, or 24 percent relative to the pre-reform mean of 2.92 surviving children in treated areas. This magnitude equals the entire pre-reform fertility gap between impartible- and partible-inheritance areas (2.9 versus 2.2 children), meaning the reforms closed this gap entirely. DD and RD-DD estimates are similar and not statistically distinguishable from each other, and results replicate across both datasets. Results hold on both the extensive margin (childlessness) and intensive margin (fertility of mothers).&lt;/p&gt;
&lt;p&gt;The mechanism is most relevant where smallholder landownership is widespread. France — where 40–80 percent of households owned land at the eve of the Revolution — meets this condition. England and Prussia, with more concentrated landownership, would not be expected to show the same response because the indivisibility constraint would not bind even after partition.&lt;/p&gt;
&lt;p&gt;Q: What was France&amp;rsquo;s inheritance system before the Revolution, and how heterogeneous was it?
A: Before 1793, inheritance was governed by 141 distinct customary and written laws applied within 435 judicial districts. The country was broadly divided between the customary-law north (Pays de droit coutumier) and the Roman written-law south (Pays de droit écrit), with substantial local variation within regions. Systems ranged from strictly partible (equal division among all offspring) to impartible (primogeniture, ultimogeniture, or unigeniture). Systems also varied in whether women could inherit or received only a dowry. This geographic variation — rooted in the laws of Germanic peoples after the fall of Rome in 476 CE — is exogenous to late eighteenth-century economic conditions and provides the identifying variation for the paper.&lt;/p&gt;
&lt;p&gt;Q: What exactly did the 1793 reforms change, and were they enforced?
A: The Loi de Nivôse an II (January 6, 1794) abolished testamentary rights entirely and mandated equal partition of assets among all children, including women, throughout France. The reforms came unexpectedly — only 8 of 571 cahiers de doléances analyzed by Goy (1988) mentioned inheritance — and were motivated by the equality principle, legal unification, and the fear that revolutionary sympathizers would be disinherited (Lataste et al. 1901). Offspring quickly asserted their new rights, and by the late 1790s inheritance disputes were the most common cases before family tribunals (Desan 1997; Poumarède 2011).&lt;/p&gt;
&lt;p&gt;Q: What is the model&amp;rsquo;s core mechanism linking inheritance reform to fertility decline?
A: The model uses a Stone-Geary production function with a minimum land threshold L̄ below which output falls to zero. Under impartible inheritance, land passes undivided to a single heir, keeping the farm above L̄ regardless of family size. Under partible inheritance, each child receives an equal share, so adding children risks fragmenting plots below L̄ — a powerful incentive to limit family size. The fertility gap between impartible and partible households is at its maximum when landholdings fall in the intermediate range (L̄ &amp;lt; L &amp;lt; L̃) where the constraint is binding. As land size increases, the constraint becomes less binding but the positive fertility differential persists.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s main quantitative estimate of the reform&amp;rsquo;s effect on completed fertility?
A: Each additional fertile year of exposure to the 1793 reforms reduced completed fertility by approximately 1 percent. Over the full 25-year fertile cycle (ages 15–40), this implies a reduction of roughly 0.7 children, or 24 percent relative to the pre-reform mean of 2.92 surviving children in treated areas. This is nearly identical to the pre-existing fertility gap between impartible- and partible-inheritance areas (0.7 children: 2.9 versus 2.2 surviving children), implying the reforms effectively eliminated the fertility differential.&lt;/p&gt;
&lt;p&gt;Q: Are the DD and RD-DD estimates consistent with each other, and do both datasets agree?
A: Yes. The DD and RD-DD estimates are similar and not statistically different from each other. The RD-DD design compares women born close to judicial district borders where inheritance rules differed, before and after 1793, exploiting the sharp spatial discontinuity at those borders. Consistency across these two designs — which rely on different identifying assumptions — strengthens causal interpretation. Results are also consistent across the Enquête Louis Henry (family-reconstitution) and Geni.com (crowdsourced genealogies) datasets, which are produced by fundamentally different methodologies.&lt;/p&gt;
&lt;p&gt;Q: How do the authors verify the parallel trends assumption?
A: Figure 6 shows that for cohorts who completed their fertile cycle before 1793, fertility trended downward in parallel across partible- and impartible-inheritance areas: a constant gap of approximately 0.7 children was maintained from women born in the early 1700s (3 versus 2.3 children) through women born in the early 1750s (2.7 versus 2.0 children), the last cohorts to complete fertility before the reforms. The convergence — from 0.7 to 0 children — only begins among cohorts fertile after 1793. The authors also include flexible trend controls interacted with municipality-level religiosity, political support for the Revolution, proximity to administrative centers, and wheat prices, and confirm the main estimate is robust.&lt;/p&gt;
&lt;p&gt;Q: What role did the extension of inheritance rights to women play?
A: The extension of rights to women was a companion mechanism distinct from abolishing impartible inheritance. Beyond increasing the number of heirs (which directly reduces land per heir), the right to inherit improves a woman&amp;rsquo;s outside option and postpones entry into marriage, following de Moor and van Zanden (2010). The DD and RD-DD estimates suggest that including women in inheritance and abolishing impartible inheritance had similar effects on fertility. The paper treats these as separate but reinforcing channels.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address potential confounders — mortality, migration, and economic conditions?
A: On mortality: child mortality did not evolve differently after 1793 across areas with different inheritance rules (Appendix Table A3), and baseline adult mortality (age at death, probability of dying before completing the fertile cycle) was balanced across treated and control areas (Table 1). On migration: the authors explicitly rule out that results are driven by migration. On economic conditions: municipality-specific decade-average wheat prices (Ridolfi 2019) are included as controls for local Malthusian dynamics, and results are robust to their inclusion.&lt;/p&gt;
&lt;p&gt;Q: What do the balance tests show?
A: Panel A of Table 1 shows that before the reforms, areas with impartible versus partible inheritance were balanced on 9 of 11 individual-level characteristics — including husband and wife age at death, probability of dying before completing the fertile cycle, probability that parents-in-law were alive at marriage, literacy, data accuracy, and age at marriage. The only systematic pre-reform difference was fertility itself (0.7 children). Municipality-level climatic variables, soil suitability, and proxies for mortality uncertainty were also balanced. This is consistent with the origins of these systems in post-Roman Germanic law, which are unrelated to late eighteenth-century economic conditions.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks are reported?
A: The authors report: (1) permutation tests reshuffling treatment exposure across women and municipalities; (2) non-linear treatment effects across cohorts, showing the heterogeneity required to explain away the baseline estimate is implausibly large per de Chaisemartin and d&amp;rsquo;Haultfoeuille (2020); (3) exclusion of outlier municipalities; (4) a placebo test for cohorts who completed their fertile cycle before 1793; (5) robustness to alternative sample definitions, treatment definitions, outcome variables, and control groups; (6) Cummins (2020) first-name repetition technique to correct for under-reported child deaths in Henry; (7) terrain characteristics including climatic and soil suitability (Galor and Özak 2016) and ruggedness (Nunn and Puga 2012); (8) for RD-DD: alternative bandwidths, running variable specifications, kernel functions, samples, and border-segment fixed effects. All checks support the main finding.&lt;/p&gt;
&lt;p&gt;Q: Why did France experience a fertility decline from inheritance reform while other countries with similar reforms did not?
A: The model rationalizes this through landownership structure. The fertility-reducing mechanism operates through indivisibility constraints that bind only when landholdings are small and fragmented — as in France, where 40–80 percent of households owned their land and plots were small. Where landownership is concentrated (England, Prussia), land per heir remains above L̄ even after partible division, so the indivisibility constraint is non-binding and fertility is unaffected by the reform. This provides a structural reason why France&amp;rsquo;s particular agrarian structure made it uniquely susceptible to this mechanism.&lt;/p&gt;
&lt;p&gt;Q: What is the broader historical significance for understanding France&amp;rsquo;s early demographic transition?
A: France&amp;rsquo;s fertility decline began roughly 50 years before industrialization, making it anomalous relative to standard quantity-quality tradeoff theories linking fertility decline to technological progress and rising returns to human capital. The 1793 reforms provide a legal-institutional explanation for the sharp post-Revolution acceleration visible in Figure 1, which is difficult to attribute to slowly-evolving cultural factors or human capital considerations not yet operative. The estimates imply the reforms brought large impartible-inheritance areas to the low-fertility regime that already characterized partible-inheritance areas, thus sharply accelerating the national transition.&lt;/p&gt;
&lt;p&gt;Impartible inheritance: A system under which parents could designate a single heir (through primogeniture, ultimogeniture, or unigeniture) to receive the bulk of the family estate, preventing fragmentation of wealth; in pre-revolutionary France this was associated with extended family households and higher fertility (2.9 surviving children on average) relative to partible areas (2.2).&lt;/p&gt;
&lt;p&gt;Partible inheritance: A system under which family wealth was divided equally among all offspring upon death; in the paper&amp;rsquo;s model this creates an incentive to limit fertility to prevent land fragmentation below the subsistence productivity threshold L̄.&lt;/p&gt;
&lt;p&gt;Indivisibility constraint (land threshold L̄): In the Stone-Geary production function, a minimum land input below which agricultural output falls to zero; this is the mechanism through which partible inheritance generates fertility-limiting incentives, since dividing a small plot among many heirs risks crossing L̄ into zero production.&lt;/p&gt;
&lt;p&gt;Difference-in-differences (DD) exposure design: The paper&amp;rsquo;s main identification strategy, using remaining fertile years after 1793 as a continuous treatment-intensity variable (0 for cohorts past fertility at the reform date, up to 25 for cohorts entirely within their fertile years), compared between treated municipalities (impartible → partible) and control municipalities (already partible).&lt;/p&gt;
&lt;p&gt;Regression-discontinuity difference-in-differences (RD-DD): An augmented design exploiting the sharp geographic discontinuity at borders between judicial districts with different pre-reform inheritance rules, comparing outcomes on both sides before and after 1793, to address smooth unobserved confounders.&lt;/p&gt;
&lt;p&gt;Completed fertility (net): The number of children surviving to age six, preferred over total births because child mortality before 1800 was high (1–1.5 children per mother did not survive to age six per Houdaille 1984), making net fertility the more economically meaningful measure for inheritance and bequest decisions.&lt;/p&gt;
&lt;p&gt;Horizontal restriction: A sampling correction applied to crowdsourced genealogical data (Blanc 2023a) that retains an observation only if at least one of the four preceding generations has more than one recorded offspring, correcting for the over-representation of single-child families that arises because Geni users tend to record direct ancestors rather than collateral relatives.&lt;/p&gt;</description></item><item><title>Riding the Housing Wave: Home Equity Withdrawal and Consumer Debt Composition</title><link>https://macropaperwarehouse.com/papers/riding-the-housing-wave-home-equity-withdrawal-and-consumer-debt-composition/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/riding-the-housing-wave-home-equity-withdrawal-and-consumer-debt-composition/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates how rising house prices affect the composition of household debt portfolios in Sweden during 2010–2014. Specifically, the authors ask whether homeowners who experience housing wealth gains use home equity withdrawals to substitute relatively expensive unsecured consumer (non-mortgage) debt with cheaper collateralized mortgage debt — a form of debt re-optimization — and what individual and policy factors drive this behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study uses a monthly individual-level panel dataset sourced from Upplysningscentralen (UC), the Swedish credit bureau, covering approximately 4.8 million individuals (62 percent of the Swedish adult population) from July 2010 to July 2014. The UC data captures approximately 80 percent of total household credit volume and 97 percent of household mortgage loans. Parish-level house price indices come from Valueguard, and municipality-level education data come from Statistics Sweden. The empirical analysis draws on a random sample of approximately 150,000 individuals, of whom 81,667 (81 percent) are classified as homeowners — defined as individuals holding a mortgage throughout the entire sample period.&lt;/p&gt;
&lt;p&gt;The primary identification strategy uses renters as a control group for homeowners in a difference-in-differences (DiD) framework, exploiting the variation in local (parish-level) house price growth. Because Sweden&amp;rsquo;s rental market is heavily regulated and uses a queuing allocation system, the rent-versus-own decision is largely exogenous to individual wealth, making renters a credible counterfactual for homeowners. The authors also use two instrumental variables to address endogeneity of house price growth: (1) historical house price volatility at the municipal level from 1981–2005 (the &amp;ldquo;Palmer instrument&amp;rdquo;), and (2) a &amp;ldquo;building-friendly&amp;rdquo; instrument measured as the share of municipal planning appeals overruled by county authorities, derived from Sweden&amp;rsquo;s 2013 National Board of Housing survey. A difference-in-difference-in-differences (DDD) approach is employed to examine the role of DTI constraints and financial literacy. Home equity withdrawals are identified as increases in outstanding mortgage balances of at least SEK 20,000, after excluding cases where the equity was used to purchase a new property.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Total debt and mortgage growth&lt;/strong&gt;: A one percentage point increase in local house prices is associated with an increase of SEK 959.1 in total household debt for homeowners relative to renters, driven primarily by mortgage growth. This effect is robust to instrumental variable estimation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Debt re-optimization — unsecured loans&lt;/strong&gt;: Conditional on withdrawing home equity in month t, homeowners reduce their outstanding unsecured consumer loan balances by 53.5 percent in the following month (t+1). This is large relative to the U.S. benchmark of 16.7 percent reported in Bhutta and Keys (2016). The average reduction in unsecured loan balances across all equity withdrawers is SEK 9,624 per withdrawal event, while credit card debt declines by only SEK 73.3 — an economically negligible amount. For equity withdrawers who had pre-existing unsecured loan balances and actively repaid them, outstanding unsecured loans fell by SEK 55,040 — nearly six times the full-sample average. For this subsample, 17.7 percent of the total withdrawn home equity was applied to unsecured loan repayment (versus 2.98 percent for the full sample of equity withdrawers).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Credit card debt&lt;/strong&gt;: The effect of equity withdrawal on credit card balances is not statistically significant. This reflects the institutional feature that credit cards in Sweden are used primarily as payment instruments within a 30–45 day interest-free grace period, not as a credit facility. Swedish credit card outstanding balances average only 16 percent of a debtor&amp;rsquo;s monthly disposable income, compared to 201 percent in the U.S.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by homeowner type&lt;/strong&gt;: The debt re-optimization finding is specific to equity withdrawers. House traders increase non-mortgage debt alongside mortgage debt. Amortizers show neither effect at meaningful scale. The substitution between unsecured loans and mortgage debt is not observed for non-withdrawing homeowners.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;DTI and financial literacy&lt;/strong&gt;: The debt re-optimization effect is strongest for borrowers with above-median DTI ratios residing in municipalities with above-median education levels (used as a proxy for financial literacy). Borrowers in this high-DTI, high-literacy group paid down approximately SEK 10,000 more in unsecured loans after a home equity withdrawal than high-DTI borrowers in low-literacy areas. A larger fraction of their withdrawn equity was also directed toward unsecured loan repayment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Macroprudential policy&lt;/strong&gt;: The introduction of an 85 percent LTV cap in October 2010 is associated with an increase in non-mortgage debt, particularly unsecured consumer loans, by both existing equity withdrawers and new mortgage borrowers. For new mortgagors entering after the LTV cap, the ratio of unsecured loans to mortgage debt increased by 1.68 percentage points, consistent with borrowers using unsecured loans to fund the required 15 percent downpayment. The debt re-optimization behavior itself (i.e., paying back unsecured loans with withdrawn equity) was found to persist both before and after the LTV cap introduction, with no statistically significant difference between regimes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Interest rates&lt;/strong&gt;: Both the probability and the size of home equity withdrawal are negatively correlated with the mortgage rate and positively correlated with the spread between the unsecured loan rate and the mortgage rate. During the sample period, mortgage rates averaged between 2.5 and 3 percent, while unsecured loan rates were on average two to three times higher.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The results are specific to Sweden during a housing boom period (2010–2014), under interest-only floating-rate mortgages with full recourse, and in the context of a tightly regulated rental market that makes the renter vs. owner distinction largely exogenous. The re-optimizing behavior requires actively rising house prices to generate the equity needed for withdrawal; the authors note this strategy is fragile if house prices were to decline. Swedish households increased their total debt levels even while re-optimizing its composition, raising financial stability concerns.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What exactly is &amp;ldquo;home equity withdrawal&amp;rdquo; in the Swedish institutional context, and how does it differ from the U.S.?&lt;/strong&gt;
A: In Sweden, home equity withdrawal occurs exclusively by increasing the existing outstanding mortgage balance against an updated home valuation; there are no HELOCs, home equity loans, or cash-out refinancing products as in the U.S. Households must pass a credit check and comply with the 85 percent LTV limit (post-October 2010). Some banks require a minimum withdrawal of SEK 100,000. Fixed transaction costs include a bank administration fee (around SEK 700 for apartment owners) and a fixed fee to the building association (around SEK 750), making the process cheap but not costless.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors identify home equity withdrawal events in the data?&lt;/strong&gt;
A: An equity withdrawal event for individual i in month t is defined as a positive change in outstanding mortgage balance greater than SEK 20,000 (approximately the average monthly disposable income), conditional on no simultaneous change in residential address, property type, or acquisition of a second property. This threshold is applied to avoid measurement error from minor rounding or bank adjustments. After applying all exclusion criteria, the authors identify 46,499 equity withdrawal events over the sample period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the identification strategy for isolating the causal effect of house prices on debt portfolios?&lt;/strong&gt;
A: The primary identification uses renters as a control group in a DiD framework. Because Sweden&amp;rsquo;s heavily regulated rental market (with queuing systems and rents far below market rates) makes the rent-vs-own decision largely exogenous to individual wealth, renters experience the same local economic conditions as homeowners but cannot access the equity-based financing channel. The key identifying assumption is that unobserved local economic shocks — which may jointly drive house prices and credit demand — affect renters and homeowners similarly. Two IVs are used as robustness checks: historical municipal house price volatility (1981–2005) and a &amp;ldquo;building-friendly&amp;rdquo; regulation index.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the first-stage strength of the Palmer instrumental variable?&lt;/strong&gt;
A: The estimated coefficient on the historical house price volatility instrument in the first-stage IV regression is 0.00022 and is statistically significant at the 1 percent level. The first-stage F-statistic is 38.41, which exceeds conventional weak-instrument thresholds, confirming that historical volatility is a strong predictor of current house price growth across municipalities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why is credit card debt not reduced by equity withdrawals in Sweden, even though it carries higher interest rates than unsecured loans?&lt;/strong&gt;
A: Credit cards in Sweden function predominantly as payment instruments within a 30–45 day interest-free grace period rather than as actual credit facilities. Average outstanding credit card balances amount to only 16 percent of debtors&amp;rsquo; monthly disposable income (versus 201 percent in the U.S. during the same period), and balances are typically repaid in full at month-end. Because cardholders are not accruing significant interest on their balances, there is no financial incentive to extinguish credit card debt using withdrawn home equity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How is the 2.98 percent figure for equity used in debt repayment to be interpreted?&lt;/strong&gt;
A: Across all home equity withdrawers (including those who have no pre-existing unsecured loans), the average share of the total amount withdrawn that is applied to unsecured loan repayment in the following month is 2.98 percent. This low average reflects that the majority of homeowners do not hold outstanding unsecured consumer loans and therefore have no debt to repay. When the sample is restricted to equity withdrawers who both held outstanding unsecured loans before the withdrawal and actively repaid some portion in the following month, the repayment share rises to 17.7 percent of the withdrawn amount.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the DDD specification used to identify the roles of DTI and financial literacy, and what do the triple interaction terms reveal?&lt;/strong&gt;
A: The DDD specification interacts the equity withdrawal indicator with a high-DTI dummy (above-median DTI at the individual level in the current month) and a high-financial-literacy dummy (municipality&amp;rsquo;s share of post-secondary educated residents above the national median in that year). The triple interaction term (EquityWithdrawal × HighDTI × HighLit) is negatively significant at approximately −SEK 9,913 to −9,966 (in thousands, i.e., around −SEK 10,000) in the unsecured loan repayment regression. This implies that, conditional on withdrawing equity, borrowers with both high DTI and high financial literacy municipality background reduced their unsecured loans by roughly SEK 10,000 more than high-DTI borrowers in low-literacy areas.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the introduction of the 85 percent LTV cap in October 2010 affect non-mortgage debt?&lt;/strong&gt;
A: Comparing a three-month window before and after October 2010, the authors find that: (a) before the LTV cap, changes in household debt did not respond significantly to house price growth for any debt type; (b) after the LTV cap, all debt types — including unsecured consumer loans — increased significantly in areas with higher cumulative house price growth. The interaction term between house price growth and the post-LTV dummy is positively significant for non-mortgage debt, driven by unsecured loans. For new mortgage borrowers, the ratio of unsecured loans to mortgage debt increased by 1.68 percentage points after the LTV cap, consistent with constrained borrowers using blanco (unsecured) loans to fund the mandatory 15 percent downpayment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Does the LTV cap affect the debt re-optimization behavior (i.e., the use of withdrawn equity to repay unsecured loans)?&lt;/strong&gt;
A: The authors find that equity withdrawers reduce unsecured loans both before and after the LTV cap introduction. The interaction terms between the LTV dummy and equity withdrawal indicators (both dummy and size) are not statistically significant, indicating that the debt re-optimization behavior per se — the channel of using withdrawn equity to pay down non-mortgage debt — was not materially altered by the macroprudential tightening. The authors caution that the very short pre-cap period (only three months of data from July to September 2010) limits statistical power for this comparison.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the role of interest rate spreads in driving equity withdrawal decisions?&lt;/strong&gt;
A: Both the probability of withdrawing equity and the size of the withdrawal are negatively correlated with the prevailing mortgage rate and positively correlated with the spread between the unsecured loan rate and the mortgage rate. This implies that equity withdrawal is more common and larger in magnitude when mortgages are cheaper or when the relative cost premium on unsecured lending is higher — consistent with the debt re-optimization motive. Results for the interest rate analysis are reported in Appendix B.2.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How do the results differ across homeowner subgroups (equity withdrawers, house traders, amortizers)?&lt;/strong&gt;
A: Among equity withdrawers: mortgage increases and unsecured loan decreases are both statistically significant (debt re-optimization). Among house traders: mortgage increases significantly and non-mortgage debt also increases (no substitution — they borrow across all categories to finance property purchases). Among amortizers: changes in both mortgage and non-mortgage debt are smaller in magnitude and primarily reflect active principal repayment rather than refinancing activity. The substitution between unsecured and mortgage debt is thus exclusive to equity withdrawers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What is the overall change in Swedish house prices and aggregate debt during the sample period?&lt;/strong&gt;
A: The house price index rose by 20 percent between July 2010 and July 2014, with particularly strong appreciation after January 2012 following a mild dip in the second half of 2011. Over the same period, aggregate mortgage balances of homeowners increased by 16 percent. Aggregate non-mortgage debt also increased, though from a much smaller base. In the cross-sectional regression, a one percentage point increase in house prices is associated with an SEK 926.7 increase in total individual debt (4 percent of average house value of SEK 21,500 per percentage point).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What are the robustness checks and do they alter the conclusions?&lt;/strong&gt;
A: The following robustness checks are reported: (1) redefining equity withdrawers as those who withdrew exactly once (Tables A4–A6); (2) restricting equity withdrawers to those withdrawing SEK 20,000–100,000 to exclude potential house traders; (3) using alternative house price growth windows of 12, 24, and 48 months (Tables A7–A9); (4) using the &amp;ldquo;building-friendly&amp;rdquo; regulation IV (Tables A2–A3); (5) supplementary time-series panel regressions (Appendix B.1). All robustness checks yield qualitatively consistent results, with the substitution from unsecured loans to mortgages preserved across specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What are the financial stability implications the authors identify?&lt;/strong&gt;
A: Despite the debt re-optimization behavior, total indebtedness among Swedish equity withdrawers does not decline — they increase their mortgage balances more than they reduce unsecured loans. Swedish average household DTI is approximately double that of the U.S. (OECD, 2022). The authors note that if house prices were to fall, homeowners relying on equity withdrawal for debt restructuring would lose access to this financing channel and face the full cost of high-interest unsecured debt. Additionally, the circumvention of the LTV cap through unsecured loan substitution raises financial stability concerns because it concentrates households in more expensive, unprotected debt.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Home Equity Withdrawal (Sweden-specific)&lt;/strong&gt;: The act of increasing an existing outstanding mortgage balance against a revalued home, which is the only channel for equity extraction in Sweden. Unlike the U.S., there are no HELOCs, home equity loans, or cash-out refinancing products. Subject to the 85 percent LTV cap introduced in October 2010 and a minimum threshold (SEK 100,000 at some banks).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt Re-optimization&lt;/strong&gt;: The behavior by which homeowners substitute relatively expensive unsecured consumer debt with cheaper collateralized mortgage debt during a housing boom, using the proceeds of home equity withdrawal to repay unsecured loans. In the paper&amp;rsquo;s usage, this implies a deliberate, financially sophisticated portfolio adjustment — not merely passive debt accumulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Blanco Loans (Unsecured Consumer Loans)&lt;/strong&gt;: Unsecured personal loans in Sweden (referred to as &amp;ldquo;blanco&amp;rdquo; loans in Swedish). These carry interest rates historically two to three times higher than mortgage rates. In the Swedish context, they are used both as consumer finance and — especially after the 85 percent LTV cap — as a source of downpayment funds. They are the primary non-mortgage debt instrument that equity withdrawers pay down.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Loan-to-Value (LTV) Cap&lt;/strong&gt;: The macroprudential regulation introduced by the Swedish Financial Supervisory Authority in October 2010, limiting mortgage debt (including home equity withdrawals) to 85 percent of the property&amp;rsquo;s market value. This applied both to new mortgage originations and to existing mortgagors increasing their mortgage balance. In the paper, this is treated as an exogenous policy event against which behavioral responses are measured.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial Literacy Proxy (Municipal Education Level)&lt;/strong&gt;: Because individual-level financial literacy data are unavailable, the paper uses the share of a municipality&amp;rsquo;s residents with post-secondary education in a given year as a municipality-level proxy for financial literacy. Municipalities above the national median in this share are classified as high-literacy areas. The classification can change year to year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt-to-Income (DTI) Ratio&lt;/strong&gt;: The ratio of an individual&amp;rsquo;s total outstanding debt to annual disposable income, used in the paper as a measure of financial constraint. A borrower is classified as &amp;ldquo;high DTI&amp;rdquo; if their DTI exceeds the cross-sectional median for all borrowers in that month. High-DTI borrowers in the paper&amp;rsquo;s sample tend to be younger, have larger mortgages, and have more unsecured loan balances.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Interest-Only Floating-Rate Mortgage&lt;/strong&gt;: The predominant Swedish mortgage structure during the sample period. Most mortgages are effectively three-month floating-rate contracts with no amortization requirement (until June 2016), making Swedish borrowers more sensitive to short-term interest rate movements than borrowers in fixed-rate amortizing mortgage systems. This institutional feature means that increases in home equity during the sample period derived almost entirely from house price appreciation rather than principal repayment.&lt;/p&gt;</description></item><item><title>Risk Sharing Tests and Covariate Shocks: Drought, Floods, and Pests in Uganda</title><link>https://macropaperwarehouse.com/papers/risk-sharing-tests-and-covariate-shocks-drought-floods-and-pests-in-uganda/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/risk-sharing-tests-and-covariate-shocks-drought-floods-and-pests-in-uganda/</guid><description>&lt;p&gt;This paper identifies and corrects a fundamental flaw in the standard methodology for testing efficient risk-sharing when shocks are covariate (affecting common prices rather than only individual incomes). The standard Townsend (1994) approach infers marginal utilities of expenditure (MUEs) from total expenditures, which implicitly assumes homothetic preferences — specifically Constant Relative Risk Aversion (CRRA) — under which all goods have unitary income elasticities and a single scalar price index captures all price effects. Ligon demonstrates that this assumption causes the standard test to fail when applied to covariate shocks such as droughts, floods, and agricultural pests, because these shocks change relative prices in ways that cannot be captured by a single price index. The perverse consequence is that in Ugandan data, every covariate shock — drought, floods, pests, and adverse prices — appears to improve household welfare under the CRRA specification (significant positive coefficients of 0.046, 0.097, 0.095, and 0.103 respectively, all significant at p&amp;lt;0.01), a result the paper argues is mechanically induced by the mis-specification rather than reflecting reality.&lt;/p&gt;
&lt;p&gt;The paper makes two core theoretical contributions. First, it characterizes the complete class of preferences that permit MUE inference from expenditure data alone — specifically, requiring that item-level expenditures be &amp;ldquo;lambda-separable&amp;rdquo; (additively separable in the MUE and prices). Solving the resulting functional equations yields exactly two families of semiparametric demand systems: Constant Frisch Elasticity (CFE) demands (a generalization of CRRA) and Generalized Stone-Geary demands. Only CFE demands are tractable for panel estimation. Second, the paper shows that under CFE preferences, log expenditures on each good j follow the system: log x^j_it = a_j(p_t) + g_j(z_it) + beta_j * w_it + epsilon^j_it, where beta_j is the good-specific Frisch elasticity and w_it = -log lambda_it is the negative log MUE. This allows price effects to enter flexibly through good-time fixed effects rather than a single index, and MUEs to be recovered via factor analysis on the residual covariance matrix.&lt;/p&gt;
&lt;p&gt;The empirical work uses eight waves of the Ugandan National Panel Surveys (2005–2020), an unbalanced panel of 5,601 distinct households yielding 22,791 usable household-year observations across 41 consumption goods (primarily food items). Uganda is divided into four regional markets, producing 32 market-year cells and 1,312 market-year-good dummies. Estimated Frisch elasticities vary substantially across goods — passion fruit is roughly three times as income elastic as cassava — emphatically rejecting the hypothesis of equal elasticities required by CRRA.&lt;/p&gt;
&lt;p&gt;Using CFE-estimated MUEs, the risk-sharing test shows that none of the four covariate shocks has a significant effect on welfare (CFE coefficients: drought 0.010, floods 0.035, pests 0.041, adverse prices -0.043, all insignificant). The pattern holds across all time windows from 0–12 months: 42 of 52 covariate shock coefficients are significant and positive in the CRRA specification, versus only 4 of 52 in the CFE specification — barely above the 2.6 false positives expected under the null. These findings indicate that the welfare impacts of covariate shocks in Uganda operate primarily through the common price channel rather than through idiosyncratic income variation, meaning they are broadly shared within market-regions. Idiosyncratic income shocks, by contrast, show the expected pattern: they reduce welfare significantly in both specifications (CFE: 0.050***, CRRA: 0.071***), and health shocks are significant only in CFE (−0.059**).&lt;/p&gt;
&lt;p&gt;Q: Why does the standard CRRA risk-sharing test fail for covariate shocks?
A: Under CRRA preferences, MUEs depend on total expenditures only through a single scalar price index pi(p). When a covariate shock raises prices of inelastic goods (primarily food), total food expenditures increase even as actual consumption quantities fall. Because risk-sharing tests based on CRRA total expenditures cannot separate this price effect from a welfare improvement, the shock appears to raise welfare. The disturbance term in the CRRA TWFE regression depends on the very prices affected by covariate shocks, violating the exclusion restriction.&lt;/p&gt;
&lt;p&gt;Q: What is the lambda-separability condition, and why does it matter?
A: Lambda-separability requires that for each good j, some transformation phi_j of expenditures on that good can be written as the sum of a function of prices and a function of the MUE: phi_j(x_j(p,lambda)) = a_j(p) + b_j(lambda). This property is necessary for time fixed effects to absorb price variation and household fixed effects to absorb Pareto weights, which is the identification strategy behind all TWFE risk-sharing tests. Without it, no panel estimator using only expenditure data can consistently recover MUEs.&lt;/p&gt;
&lt;p&gt;Q: What are the two demand families that satisfy lambda-separability, and what distinguishes them?
A: Theorem 1 establishes that rationalizable lambda-separable demands must belong to either the Constant Frisch Elasticity (CFE) family or the Generalized Stone-Geary family. In CFE demands, log expenditures on each good equal the log of a price function minus beta_j times log lambda, where beta_j is a good-specific constant Frisch elasticity. The Stone-Geary family has a more complex nonlinear form that does not lend itself to linear estimation of log MUEs, making CFE the tractable choice. Both families nest CRRA as the special case where all beta_j are equal.&lt;/p&gt;
&lt;p&gt;Q: How are MUEs estimated from the CFE system in practice?
A: Estimation proceeds in two steps. First, log expenditures on each good are regressed on good-time-market effects and household demographic controls to obtain residuals. Second, the covariance matrix of these residuals has the factor structure Sigma = Var(w)&lt;em&gt;beta&lt;/em&gt;beta&amp;rsquo; + Psi, where beta is the vector of Frisch elasticities; the rank-one matrix beta*beta&amp;rsquo; is recovered from the sample covariance matrix via factor analysis, and household-level MUEs are then obtained by regression using the estimated beta as generated regressors.&lt;/p&gt;
&lt;p&gt;Q: What do the estimated Frisch elasticities reveal about preferences in Uganda?
A: The Frisch elasticities beta_j vary substantially across the 41 goods in the Ugandan sample. Starchy staples and salt are least elastic (lowest beta_j), while fresh milk, sweet bananas, coffee, oranges, and passion fruit exhibit high elasticities — passion fruit is roughly three times as income elastic as cassava. The hypothesis that all elasticities are equal (the CRRA restriction) is easily rejected, providing direct evidence against homothetic preferences in this population.&lt;/p&gt;
&lt;p&gt;Q: What direct evidence does the paper provide that droughts, floods, and pests are genuinely covariate and harmful?
A: About 39% of Ugandan households reported drought in the 2005–06 round. Among drought reporters, 92% said it affected their production, 80% said it affected their income, and 50% said it affected their consumption. Drought, pests, and adverse prices (but not floods) led to statistically significant increases in local farmgate prices. Among markets experiencing covariate shocks, 82%, 74%, 44%, and 53% of t-tests rejected equality of relative food prices for drought, floods, pests, and adverse prices respectively. Dietary diversity and intake of vitamin B-12 (from animal-source foods) declined significantly following covariate shocks.&lt;/p&gt;
&lt;p&gt;Q: How do households cope differently with covariate versus idiosyncratic shocks?
A: Households experiencing covariate shocks primarily relied on self-insurance: 51% of drought-affected households reduced consumption and 45% drew on savings, with increased labor supply also reported. In contrast, households experiencing idiosyncratic shocks most often relied on help from friends and family (52%). This behavioral difference is consistent with the finding that covariate shocks affect welfare mainly through common price channels that are not individually insurable through social networks, while idiosyncratic shocks are partially absorbed via informal transfers.&lt;/p&gt;
&lt;p&gt;Q: What do the CFE results imply about the nature of insurance against covariate shocks in Uganda?
A: The CFE regression finds that none of the four covariate shocks (drought, floods, pests, adverse prices) has a statistically significant effect on household MUEs when time-market fixed effects are included. This implies that the welfare impact of covariate shocks is transmitted primarily through common price changes that affect all households in a market-region symmetrically, rather than through idiosyncratic income variation. Effectively, covariate shocks are &amp;ldquo;shared&amp;rdquo; within market-regions — but through price deterioration affecting everyone, not through informal transfers.&lt;/p&gt;
&lt;p&gt;Q: How robust are the results across different shock time windows?
A: Figure 3 shows that for the CRRA specification, any prior covariate shock 3–12 months earlier has a significant positive effect on log consumption in every month, while for the CFE specification no shock window produces a significant effect on w. In the full tabulation across all shock types and windows (Tables 4 and 5), 42 of 52 covariate shock coefficients are significant and positive in CRRA versus only 4 of 52 in CFE — the latter barely exceeding the 2.6 false positives expected under the null hypothesis of full insurance.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of these findings for relief program design?
A: Because covariate shocks affect welfare mainly through common prices within market-regions, relief programs should target communities rather than individual households, since the burden is broadly shared and not concentrated. Policies that integrate markets across regions of Uganda or connect Ugandan markets to broader African or world markets would reduce the price impact of local covariate shocks. Targeted household transfers would be less effective than interventions that stabilize regional prices or supply.&lt;/p&gt;
&lt;p&gt;Q: What broader applicability do CFE MUEs have beyond risk-sharing tests?
A: Since MUE construction is independent of the risk-sharing hypothesis, CFE-estimated MUEs can be used to estimate and test any dynamic life-cycle model that puts structure on the evolution of MUEs over time, including consumption Euler equations, intertemporal marginal rates of substitution calculations, and household bargaining models. The CFE approach requires only the same expenditure data used in the standard CRRA approach and therefore serves as a more general drop-in replacement across all settings where CRRA MUEs are currently employed.&lt;/p&gt;
&lt;p&gt;Marginal Utility of Expenditure (MUE): The Lagrange multiplier lambda on the household budget constraint in the consumer&amp;rsquo;s optimization problem; the object whose proportionality across households (log lambda_it = log mu_t - log theta_i) characterizes efficient risk-sharing. It is a function of budget, prices, and household characteristics — not reducible to a scalar function of total expenditure except under special preference restrictions.&lt;/p&gt;
&lt;p&gt;Lambda-separability: A property of Frischian expenditures on good j such that some transformation phi_j(x_j) can be written as the sum of a function of prices and a function of the MUE alone — phi_j(x_j(p,lambda)) = a_j(p) + b_j(lambda). This is the necessary and sufficient condition for using time fixed effects to control for prices and household fixed effects to control for Pareto weights in a TWFE risk-sharing regression based solely on expenditure data.&lt;/p&gt;
&lt;p&gt;Constant Frisch Elasticity (CFE) expenditure system: The tractable member of the two demand families satisfying lambda-separability, characterized by log x^j_it = a_j(p_t) + g_j(z_it) + beta_j * w_it + epsilon^j_it, where beta_j is a good-specific constant elasticity of expenditures with respect to MUE. Nests CRRA as the special case of equal beta_j across all goods, but admits nonhomothetic preferences and fully flexible relative-price responses.&lt;/p&gt;
&lt;p&gt;Frischian demands: Demands expressed as functions of prices and the MUE lambda rather than prices and budget — f(p, lambda). Homogeneous of degree zero in (p, 1/lambda), equivalently written f(p*lambda). This representation is central to the lambda-separability characterization because it separates the role of the budget (via lambda) from the role of prices directly.&lt;/p&gt;
&lt;p&gt;Covariate shock: In this paper&amp;rsquo;s usage, a shock that affects prices common to all households in a market-region — not merely a shock affecting many households simultaneously. The key analytical distinction is that idiosyncratic shocks change individual budgets without changing prices, while covariate shocks change prices, which is what causes the standard CRRA test to fail.&lt;/p&gt;
&lt;p&gt;Nonhomothetic preferences: Preferences for which expenditure shares vary with income (budget), so no single scalar price index can fully represent the welfare impact of price changes. The paper confirms nonhomotheticity in the Ugandan data through widely varying Frisch elasticities, and argues this is the root cause of the CRRA test&amp;rsquo;s failure for covariate shocks — a problem that does not arise when shocks are idiosyncratic and leave prices unchanged.&lt;/p&gt;</description></item><item><title>Robot adoption and inflation dynamics</title><link>https://macropaperwarehouse.com/papers/robot-adoption-and-inflation-dynamics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/robot-adoption-and-inflation-dynamics/</guid><description>&lt;h2 id="robot-adoption-and-inflation-dynamics"&gt;Robot Adoption and Inflation Dynamics&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;Basso and Rachedi investigate how robot adoption influences inflation dynamics — specifically, whether the surge in automation during the 2000s and 2010s can explain the muted sensitivity of inflation to unemployment (the &amp;ldquo;flat Phillips curve&amp;rdquo;) observed in advanced economies prior to the Covid pandemic, and whether the same framework can account for the subsequent resurgence of steep inflation-unemployment co-movement.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The empirical analysis uses an annual panel covering 384 U.S. metropolitan statistical areas (MSAs) from 2008 to 2018. The dependent variables are non-tradable goods inflation (log-difference of services prices excluding rents and utilities, from BEA regional price parities) and wage inflation (log-difference of average compensation per job). Robot adoption at the MSA-year level is constructed following Acemoglu and Restrepo (2020a): industry-level robots per employee at the U.S. national level are weighted by industry employment shares in each MSA, yielding an MSA-year robot-per-employee ratio.&lt;/p&gt;
&lt;p&gt;The regression specification extends Hazell et al. (2022) by adding an interaction term between the lagged unemployment rate and the (demeaned) robot-per-employee ratio, along with MSA and year fixed effects. Year fixed effects absorb common inflation expectations and the endogenous response of monetary policy to aggregate demand shocks. To address endogeneity, unemployment is instrumented with a Bartik shift-share variable of tradable demand spillovers, and robot adoption is instrumented with average industry-level robot penetration in the five largest European economies — under the identifying assumption that robot demand shocks are weakly correlated across advanced countries.&lt;/p&gt;
&lt;p&gt;The theoretical framework is a New Keynesian model augmented with (i) directed search frictions in the labor market, and (ii) producer-level automation decisions in the spirit of Acemoglu and Restrepo (2020a). Producers pay a fixed entry cost, draw idiosyncratic efficiency for employing labor, and then choose between a robot technology (certain output at low efficiency) and a labor technology (uncertain hiring, higher potential efficiency). This generates an automation threshold: low-efficiency producers install robots, displacing low-wage jobs. A Taylor rule closes the model. Quantitative exercises compare two steady states calibrated to robot-per-employee ratios of 0.2% (low automation, targeting the U.S. in the early 2000s) and 0.6% (high automation, calibrated to one standard deviation of robot penetration variation across MSAs).&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Empirical.&lt;/strong&gt; In the baseline IV regression, a one standard deviation increase in robot adoption reduces the sensitivity of price inflation to unemployment by 17%, and the sensitivity of wage inflation to unemployment by 9%, relative to a MSA with the average robot penetration. The larger flattening effect on price inflation than on wage inflation implies that robot adoption also diminishes the pass-through from wages to prices. All three effects are statistically significant at the 5% level, and are robust to controls for demographic structure (age composition, gender/race/education participation rates, MPC heterogeneity), occupational structure (abstract, routine, manual, and offshorable occupations), and import competition exposure (Chinese and Mexican import shares).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model quantification.&lt;/strong&gt; Comparing the high-automation to the low-automation steady state, the model generates a 14% reduction in the slope of the price Phillips curve and a 13% reduction in the slope of the wage Phillips curve, conditional on the same-sized demand shocks in both economies. The price Phillips curve result accounts for 82% of the empirical estimate (17%). The model overstates the flattening of the wage Phillips curve (13% vs. 9% in the data), and therefore understates the reduction in the wage-to-price pass-through.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanisms.&lt;/strong&gt; Automation flattens the Phillips curve through two primary channels. First, the outside option of automating production reduces workers&amp;rsquo; bargaining power and dampens the elasticity of wages to unemployment (the &amp;ldquo;Wage Setting Effect&amp;rdquo;). Second, a higher share of robot firms reduces the aggregate labor share, muting the pass-through from wages into prices (the &amp;ldquo;Steady State Effect&amp;rdquo;). A third channel — firms cyclically substituting workers for machines in response to a shock (the &amp;ldquo;Cyclical Effect&amp;rdquo;) — operates during the transition but the Wage Setting Effect accounts for the bulk of the flattening.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-linearity and the post-Covid resurgence.&lt;/strong&gt; When robot-production is subject to convex adjustment costs, the threat of automation that underlies the Wage Setting Effect becomes inoperative during large expansionary shocks. When investment in machines surges, the marginal cost of producing robots rises sharply, raising the price of machines and pushing the automation threshold downward — more firms must use labor. Workers then negotiate higher wages, which pass into prices. Conditional on small demand shocks, the high-automation economy still exhibits a flatter Phillips curve than the low-automation economy. Conditional on large demand shocks (simulated as a 2 percentage point drop in unemployment), there is no difference in the inflation response between the low- and high-automation economies, so the Phillips curve reverts to steep.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the exact empirical specification and how does it map to a structural object?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The regression is: non-tradable goods inflation = β × lagged unemployment + γ × (lagged unemployment × demeaned robot adoption) + ζ × lagged robot adoption + χ × relative non-tradable price + MSA fixed effects + year fixed effects + error. In a multi-region model without automation, Hazell et al. (2022) show that the coefficient β identifies the aggregate slope of the Phillips curve because year fixed effects absorb both common inflation expectations and the endogenous monetary policy response to aggregate demand shocks. Adding the interaction term extends this logic: γ identifies how robot adoption causally shifts the slope of the local Phillips curve, which maps into changes in the aggregate slope.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the first-stage instruments and why are they valid?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Unemployment is instrumented with local tradable demand spillovers — a Bartik variable weighting national industry value-added growth (excluding each MSA&amp;rsquo;s own contribution) by each MSA&amp;rsquo;s average industry value-added shares, so national supply disturbances uncorrelated with MSA-level heterogeneity generate plausibly exogenous unemployment variation. Robot adoption is instrumented with the implied robot-per-employee ratio obtained by replacing U.S. industry robot installations with the average across the five largest European economies, weighted by U.S. industry employment shares; this isolates the supply-side efficiency improvements in robot technology that drove global adoption, conditional on robot demand shocks being weakly correlated across countries. The correlation between the two instruments in the sample is 0.2, ensuring they do not strongly co-move.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the point estimates and their magnitudes in the baseline IV regression?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For price inflation (Panel A, Column 4), the base sensitivity β = −0.5069 (SE 0.1381, significant at 1%), and the interaction coefficient γ = 0.0066 (SE 0.0030, significant at 5%). For wage inflation (Panel B, Column 4), β = −0.9580 (SE 0.2450, significant at 1%), and γ = 0.0049 (SE 0.0024, significant at 5%). A one standard deviation increase in robot adoption reduces price inflation sensitivity by 17% and wage inflation sensitivity by 9% relative to the average-automation MSA.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What does the difference in flattening magnitudes (17% for prices vs. 9% for wages) imply about the wage-price pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because automation reduces the price Phillips curve slope by proportionally more than the wage Phillips curve slope, each percentage-point change in wages translates into a smaller percentage-point change in prices in higher-automation areas. This indicates that robot adoption diminishes the influence of wage changes on price changes — i.e., it reduces the wage-to-price pass-through. In the model, this operates through the Steady State Effect: a larger share of production carried out by robot firms means that a given change in average wages applies to a smaller portion of total marginal costs, weakening the price response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How is the automation threshold determined in the theoretical model, and what economic forces govern it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A producer opts for the labor technology if and only if the expected value of a labor firm (= job-filling probability × (producer price × labor efficiency − posted wage) − entry cost) exceeds the value of a robot firm (= producer price × robot efficiency − machine price − entry cost). Since the value of a labor firm increases in labor efficiency, there is a unique cut-off efficiency level γ* at which a producer is indifferent. Producers with labor efficiency above γ* post vacancies; those below γ* install robots. The cut-off rises (more automation) when wages rise relative to machine prices, and falls (less automation) when machine prices rise due to costly robot production during large expansionary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the wage-posting equilibrium under directed search generate the Wage Setting Effect of automation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under directed search, each labor firm posts a wage to maximize its expected value, and workers sort into sub-markets offering higher wages but lower job-finding probabilities. The equilibrium posted wage for a firm with labor efficiency γj is Wγj,t = PP,t × γj × (1 − η), where η is the elasticity of matches to vacancies. The option to install a robot — available at any time — limits how much any individual firm needs to offer workers. When automation increases, the outside option becomes more attractive to more firms, which constrains wage offers industry-wide, reducing the elasticity of average wages to unemployment fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How is the slope of the price Phillips curve characterized analytically?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Log-linearizing the model around the steady state and substituting labor market and wholesaler equilibrium conditions into the pricing equation yields: inflation = −[(ε−1)/φ] × Ψ(γ*; Θ) × unemployment gap + β × expected future inflation, where Ψ(γ*; Θ) is a function of the automation cut-off γ*, the elasticity of substitution ε, the matching function elasticity η, the efficiency bounds γM and γH, and the distribution shape parameter α. In contrast to standard New Keynesian models where the slope depends only on markup and nominal rigidity parameters, this expression depends directly on the degree of automation through the steady-state threshold γ*.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Across different structural parameter configurations, does automation always flatten the Phillips curve?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Numerical analysis of the closed-form Phillips curve expression (Figure 1) shows that robot adoption unambiguously decreases the slope of the price Phillips curve across all combinations of the key structural parameters — the distribution shape parameter α, the matching elasticity η, the upper bound of labor efficiency γH, and the steady-state unemployment rate ū. The flattening effect is more pronounced when η is low, when α implies a larger fraction of low-efficiency producers, and when the steady-state unemployment rate is low.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the three mechanism channels (Cyclical, Wage Setting, Steady State) compare quantitatively?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper isolates channels by comparing alternative model specifications: (i) Baseline directed search with endogenous automation, (ii) Directed search with fixed automation (removing Cyclical and Wage Setting Effects, leaving only the Steady State Effect), (iii) Random search with τ = 0.5 (efficient bargaining, retaining both the Cyclical and Wage Setting Effects), (iv) Random search with τ = 0.01 (near-zero worker bargaining power, removing the Wage Setting Effect but retaining the Cyclical Effect). Figure 5 shows that the Steady State Effect alone accounts for only a small portion of the total inflation differential between low- and high-automation economies. The Wage Setting Effect — isolated by comparing τ = 0.01 and τ = 0.5 economies with endogenous automation — accounts for the bulk of the flattening. The Cyclical Effect (isolated by comparing fixed and endogenous automation with τ = 0.01) contributes an intermediate amount.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the quantitative exercise comparing low- and high-automation steady states?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The low-automation economy targets the U.S. robot-per-employee ratio of 0.2% in the early 2000s (Acemoglu and Restrepo, 2020a), calibrated with robot-specific technological change ζ = 2. The high-automation economy features a 200% higher robot-per-employee ratio of 0.6%, calibrated to replicate one standard deviation of cross-MSA dispersion in robot penetration in the data. Both economies are simulated with 10,000 realizations of preference shocks, and the slopes of the price and wage Phillips curves are estimated from simulated inflation and unemployment outcomes. The price Phillips curve flattens by 14% and the wage Phillips curve by 13% moving from low to high automation, conditional on the same-sized shock in both economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the model account for the Covid-era resurgence of high inflation despite high automation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper extends the machine manufacturer&amp;rsquo;s production function to include an asymmetric convex adjustment cost that activates when investment deviates more than 5% from its steady-state level (parameterized with δ = 0.0015 and ϱ = 100). Under a small expansionary shock (0.25 percentage point decrease in unemployment), inflation rises less in the high-automation economy, consistent with a flat Phillips curve. Under a large expansionary shock (2 percentage point decrease in unemployment), the surge in robot investment triggers sharply rising machine prices, eliminating the automation outside option for marginal producers and fully restoring workers&amp;rsquo; bargaining power — so the inflation response is identical in the low- and high-automation economies, consistent with a steep Phillips curve. The paper interprets this as a proof-of-concept consistent with post-Covid wage compression evidence for low-wage workers documented by Autor, Dube, and McGrew (2023).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What do the robustness checks establish regarding alternative explanations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The interaction of unemployment and robot adoption remains statistically significant at the 5% level across all the robustness checks (Appendix A). These include controlling for: (i) demographic heterogeneity — shares of young (below 30) and old (above 60) individuals, female/Black/Asian labor market participation, low-education attainment shares, overall participation, and MSA-level average marginal propensity to consume (MPC); (ii) occupational structure — shares of abstract, routine, manual, and offshorable occupations; and (iii) import competition — MSA exposure to Chinese and Mexican import competition. The coefficient on the robot-unemployment interaction term is stable across specifications, with the magnitude remaining close to that in the baseline (approximately 0.0140 across all demographic robustness columns in Table A.1).&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Automation threshold (γ&lt;/em&gt;):&lt;/em&gt;* The paper-specific level of idiosyncratic labor efficiency at which a producer is indifferent between installing a robot and posting a vacancy. Producers with labor efficiency below γ* choose the machine technology; those above choose the labor technology. The threshold is determined by the relative profitability of the two technologies, and it shifts endogenously with wages, machine prices, and job-filling probabilities. A higher γ* means more of the production sector is automated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Setting Effect of automation:&lt;/strong&gt; The channel through which the existence of the outside option to install robots reduces workers&amp;rsquo; bargaining power and dampens the elasticity of wages to unemployment fluctuations. Under directed search, firms&amp;rsquo; ability to substitute machines for labor at a lower cost constrains the wage offers they need to post, so that a given decline in unemployment generates a smaller increase in average wages in higher-automation economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Steady State Effect of automation:&lt;/strong&gt; The channel through which a larger steady-state fraction of robot firms reduces the aggregate labor share, so that even a given change in wages translates into a smaller change in aggregate marginal costs and prices. This channel operates even when automation cannot change upon a shock (fixed automation baseline).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cyclical Effect of automation:&lt;/strong&gt; The channel through which firms actively replace workers with machines in response to expansionary shocks that raise wages, generating an endogenous dampening of labor demand and putting downward pressure on the wage increase itself. This channel requires endogenous automation choices at business-cycle frequencies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robot-specific technological change (ζ):&lt;/strong&gt; In the paper&amp;rsquo;s model, the parameter governing the efficiency with which machine manufacturers transform final goods into robots. A higher ζ reduces the relative price of machines (PM/P = 1/ζ), making automation more attractive to lower-efficiency producers and raising the automation threshold γ*. In quantitative exercises, variation in ζ across steady states drives differences in the degree of automation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price Phillips curve slope (Ψ):&lt;/strong&gt; In the paper&amp;rsquo;s log-linearized model, the structural coefficient linking inflation to the unemployment gap. Unlike in standard New Keynesian models — where the slope depends only on the markup and nominal rigidity — Ψ is a function of the automation threshold γ*, the matching elasticity η, the efficiency distribution parameters (γM, γH, α), and the elasticity of substitution ε. Robot adoption shifts γ* and thereby changes Ψ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asymmetric investment adjustment cost:&lt;/strong&gt; An extension of the machine manufacturer&amp;rsquo;s production function that imposes convex costs when robot investment deviates above 5% from its steady-state level (parameterized by δ and ϱ). This specification makes it increasingly costly to rapidly scale up automation in response to large demand shocks, causing the machine price to spike and the automation outside option to cease being effective for marginal producers, thereby restoring workers&amp;rsquo; bargaining power and steepening the Phillips curve during large expansionary episodes.&lt;/p&gt;</description></item><item><title>Running Primary Deficits Forever in a Dynamically Efficient Economy: Feasibility and Optimality</title><link>https://macropaperwarehouse.com/papers/running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality/</guid><description>&lt;h2 id="running-primary-deficits-forever-in-a-dynamically-efficient-economy-feasibility-and-optimality"&gt;Running Primary Deficits Forever in a Dynamically Efficient Economy: Feasibility and Optimality&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The paper addresses two questions about government debt rollover. First, a positive question: what is the maximum ratio of government bonds to capital that can be sustained forever without any primary budget surpluses? Second, a normative question: among sustainable bond-capital ratios along a balanced growth path, which one maximizes the welfare (steady-state utility) of consumers? The analysis is motivated by Blanchard&amp;rsquo;s (2019) AEA presidential address and the fiscal responses to the COVID-19 pandemic.&lt;/p&gt;
&lt;h3 id="setting-and-mechanism"&gt;Setting and Mechanism&lt;/h3&gt;
&lt;p&gt;The baseline environment is a standard two-generation (young and old) overlapping-generations model. Young consumers earn labor income and save; old consumers live off portfolio returns. The production function is Cobb-Douglas, Yt = (GtN)^(1−α) K^α, where G = 1+g is the gross growth rate of labor-augmenting productivity. Uncertainty enters exclusively through a stochastic i.i.d. durability shock ε_t to the depreciation rate of capital (δ − ε_t), so the rate of return on capital r = αk^(α−1) − δ + ε is stochastic even though the capital stock per unit of effective labor k is deterministic along a balanced growth path. Consumers have Epstein-Zin-Weil utility with an intertemporal elasticity of substitution equal to one. Because IES = 1 and labor income is earned only when young, aggregate saving of young consumers is a constant fraction β of their wage income, making total assets (capital plus bonds) non-stochastic.&lt;/p&gt;
&lt;p&gt;This structure creates a key wedge: the expected rate of return on capital R can exceed the growth rate g (dynamic efficiency) while the riskfree interest rate rf — determined by the portfolio equilibrium between risky capital and riskless bonds — can remain below g. In deterministic economies these two rates coincide, so dynamic efficiency and the infeasibility of permanent debt rollover always go together. In this stochastic model they can be decoupled.&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Positive finding.&lt;/strong&gt; The maximum sustainable bond-capital ratio, Bmax, is attained precisely when rf = g (equivalently, when the adjusted gross riskfree rate Rf = 1). Starting from a bond-less economy with rf &amp;lt; g (which may itself be dynamically efficient), introducing government bonds crowds out capital, raises the marginal product of capital and the constellation of returns, and drives rf upward toward g. Once rf = g is reached, any further increase in bonds would require rf &amp;gt; g, making rollover infeasible without primary surpluses. The maximum sustainable ratio Bmax is characterized as the unique root of f(Bmax, 1) = 0, and it is finite.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Normative finding.&lt;/strong&gt; The welfare-maximizing sustainable bond-capital ratio equals Bmax. Proposition 6 establishes that u′(B) ≥ 0 for all B ∈ [0, Bmax] whenever Rf ≤ 1, with strict inequality unless Rf = 1. Proposition 7 therefore concludes that the welfare-maximizing B is the corner solution Bmax. Intuitively, increasing B reduces capital and wages but raises the rate of return on capital. When rf ≤ g, the welfare gain from a higher return on capital in old age dominates the welfare loss from a lower wage when young (via the factor-price frontier and the intertemporal optimality condition E{uo′(co)} ≥ uy′(cy)). When rf = g (at Bmax), a marginal increase in bonds also provides no additional welfare improvement if all seignorage is transferred to young consumers (ζ = 1), but still raises welfare if some seignorage is wasted (ζ &amp;lt; 1). In either case, Bmax is the optimum. Critically, at the optimum the economy is dynamically efficient — even though the government is running permanent primary deficits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dual role of bonds.&lt;/strong&gt; At the optimal bond-capital ratio, government bonds serve two purposes simultaneously: (1) they crowd out any dynamically inefficient overaccumulation of capital that might prevail without bonds, and (2) they supply riskfree assets to risk-averse consumers who would otherwise hold only risky capital, improving risk sharing.&lt;/p&gt;
&lt;h3 id="quantitative-illustration"&gt;Quantitative Illustration&lt;/h3&gt;
&lt;p&gt;The paper calibrates a 30-year-period OLG model with α = 0.33, β = 0.353 (annual discount rate 2%), annual productivity growth g = 1% (G = 1.35), and target mean return on unlevered equity m = 3% per year. Risk aversion γ ∈ {1, 3, 8, 10} and annualized standard deviation of capital returns s ∈ {0.02, …, 0.22}. Key results (ζ = 0): at γ = 10 and s = 0.22, Bmax = 0.478 and B∗ (the bond-capital ratio needed just to eliminate dynamic inefficiency) = 0.083, so there is a wide interval [0.083, 0.478] of dynamically efficient, permanently rollable bond-capital ratios. For a capital-output ratio of 2, the debt-GDP ratio corresponding to Bmax = 0.478 is approximately 0.956. Bmax is strictly increasing in both γ and s, and is invariant to ζ (the share of seignorage transferred rather than wasted).&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope Conditions&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Results hold along balanced growth paths with constant g and constant rf; the sustainability characterization is more complex if either rate is stochastic.&lt;/li&gt;
&lt;li&gt;The key sufficient condition for Rf to be increasing in B (Proposition 1) is that risk aversion γ &amp;lt; Λ, a model-dependent upper bound that is always positive. All subsequent propositions assume R′f(B) &amp;gt; 0, which is satisfied for a potentially larger set of γ.&lt;/li&gt;
&lt;li&gt;The paper focuses on welfare along the balanced growth path; it does not study transition dynamics or welfare during convergence from an initial state.&lt;/li&gt;
&lt;li&gt;The No Ponzi Game (NPG) condition is violated by design in the feasible-rollover region (rf ≤ g); the value of government bonds is positive even though the present value of all future primary surpluses is non-positive.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why can an economy be both dynamically efficient and able to roll over government bonds forever, when this is impossible in deterministic models?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a deterministic economy, the riskfree rate rf and the rate of return on capital r are equal, so the conditions rf &amp;lt; g (feasibility of rollover) and r &amp;lt; g (dynamic inefficiency) are identical. In a stochastic economy, aggregate uncertainty drives a wedge between rf and the expected return on capital. Risk-averse consumers require a premium to hold risky capital over riskless bonds, so rf &amp;lt; E{r}. It is therefore possible that E{ln R} &amp;gt; 0 (the Zilcha sufficient condition for dynamic efficiency holds) while Rf &amp;lt; 1, i.e., rf &amp;lt; g. This decoupling is the central theoretical contribution of the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the formal criterion the paper uses for dynamic efficiency, and how does it relate to the AMSZ criterion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Abel, Mankiw, Summers, and Zeckhauser (AMSZ, 1989) show that if the rate of return on capital exceeds g in all states (R &amp;gt; 1 always), the economy is dynamically efficient, and since rf &amp;lt; r, the economy has rf &amp;gt; g so rollover is infeasible; conversely if r &amp;lt; g always, the economy is dynamically inefficient. The AMSZ criteria are silent when R sometimes exceeds and sometimes falls short of one. Building on Zilcha (1991), the paper uses E{ln R} ≥ 0 as a sufficient condition for dynamic efficiency. In the five-region diagram (Figure 1), Region E satisfies E{ln R} &amp;gt; 0 (Zilcha-efficient) and Rf &amp;lt; 1 (rollover feasible simultaneously), which is the case of central interest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the model achieve a deterministic capital stock despite stochastic capital returns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The durability shock ε_t affects depreciation but is additively separable from the production function. Because (1) IES = 1 and (2) consumers earn income only when young, aggregate saving is the fixed fraction β of wage income, which depends only on capital k (itself non-stochastic). Total assets At+1 = Kt+1 + Bt+1 = St are thus non-stochastic. The stochastic shock to depreciation makes the rate of return on capital r = αkα−1 − δ + ε stochastic even though k is deterministic. Online Appendix B establishes that this model is isomorphic to a model with production function shocks, extending the scope of the results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the financial market equilibrium condition that pins down the riskfree rate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Young consumers optimally choose the portfolio share λ in riskfree bonds. The first-order condition for this portfolio problem along a balanced growth path is E{(λRf + (1−λ)R)^(−γ)(Rf − R)} = 0 (equation 20). In equilibrium, λ = B/(1+B) (the bond-capital ratio determines the portfolio share), so the equilibrium riskfree rate Rf satisfies the implicit equation f(B, Rf) = 0 (equation 21). Lemma 1 establishes that Rf = E{R^(1−γ)_a}/E{R^(−γ)_a}, a ratio-of-moments formula analogous to an Euler equation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why is the riskfree rate Rf an increasing function of the bond-capital ratio B, and what is the sufficient condition for this?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Lemma 2 shows ∂f/∂B &amp;gt; 0; intuitively, more bonds reduce capital, raise the marginal product of capital, and raise R, inducing consumers to demand more capital and less bonds, pushing Rf up to restore equilibrium. Lemma 3 provides a sufficient condition for ∂f/∂Rf &amp;lt; 0, namely γ &amp;lt; Λ (where Λ is a positive parameter-dependent bound). Under this condition, the implicit function theorem implies Rf′(B) &amp;gt; 0 (Proposition 1). The condition γ &amp;lt; Λ is sufficient but not necessary, so the results of all downstream propositions hold potentially for a wider parameter range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the maximum sustainable bond-capital ratio Bmax, and how is it characterized?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By definition, a bond-capital ratio B is sustainable if and only if Rf(B) ≤ 1. If Rf(0) ≥ 1, then Bmax = 0 (no positive amount of bonds is sustainable). If Rf(0) &amp;lt; 1, Bmax is the unique positive root of Rf(B) = 1, i.e., f(Bmax, 1) = 0 (Proposition 4). At Bmax, the riskfree rate exactly equals the growth rate: rf = g. The paper also shows Bmax ≤ (1−α)β/α − 1, an upper bound that depends only on production and preference parameters. Notably, Bmax is invariant to the parameter ζ (the share of seignorage transferred to young consumers rather than wasted), because at Bmax transfers are always zero regardless of ζ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why does the welfare-maximizing sustainable bond-capital ratio equal Bmax rather than some interior value?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 6 shows that u′(B) ≥ 0 for all B ∈ [0, Bmax] whenever Rf ≤ 1, with strict inequality unless Rf = 1 and (1−ζ)B = 0. Since utility is weakly increasing throughout the feasible set, the optimum is the corner solution Bmax (Proposition 7). The mechanism: increasing B reduces k, lowering wages (bad for utility when young) but raising the marginal product of capital and hence the rates of return on capital and bonds (good for utility when old). The factor-price frontier ensures that the wage reduction equals the income gain accruing to initial capital, and the intertemporal optimality condition uy′(cy) = Rf E{uo′(co)} implies that when Rf ≤ 1 (so E{uo′(co)} ≥ uy′(cy)/Rf ≥ uy′(cy)), the welfare gain in old age dominates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does Proposition 5 square with the optimality of Bmax? Does reducing expected consumption not reduce welfare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 5 shows that when ζ = 1, a marginal increase in B at Bmax reduces expected aggregate consumption (dE{c}/dB &amp;lt; 0). However, welfare is not simply expected aggregate consumption: it also depends on the distribution of consumption across states. At Bmax, even though expected consumption falls, the increased risk sharing from holding more riskfree bonds — which smooth consumption between the high-return and low-return states of capital depreciation — is large enough to leave welfare unchanged (u′(Bmax) = 0 when ζ = 1) or to increase it (u′(Bmax) &amp;gt; 0 when ζ &amp;lt; 1). This illustrates that in stochastic economies, the welfare criterion diverges from the aggregate consumption criterion that characterizes dynamic inefficiency in deterministic economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the paper&amp;rsquo;s welfare analysis relate to the No Ponzi Game (NPG) condition and the fiscal theory of the price level?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The standard NPG condition requires that the value of government debt equals the present value of future primary surpluses. In the paper&amp;rsquo;s feasible-rollover region (rf ≤ g), the NPG condition is violated by design: the present value of future primary surpluses is non-positive (all primary balances are deficits or zero), yet the market value of outstanding bonds is strictly positive. This is possible because, as Santos and Woodford (1997) show, when the present value of aggregate consumption is infinite, the NPG can fail. The market value of the capital stock remains finite (it is the value of profits on a depreciating capital stock approaching zero), but the bubble value of government bonds is positive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the quantitative calibration reveal about the range of dynamically efficient, permanently rollable bond-capital ratios?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With α = 0.33, β = 0.353, g = 1% per year, G = 1.35, target mean equity return m = 3% per year, and risk aversion γ = 10 with annualized return standard deviation s = 0.22, the paper finds Bmax = 0.478 and B∗ = 0.083 (ζ = 0, Table 1). The interval [B∗, Bmax] = [0.083, 0.478] is the range of bond-capital ratios for which the economy is both dynamically efficient and able to roll over bonds permanently. For an economy with a capital-output ratio of 2, these bond-capital ratios correspond to debt-GDP ratios of up to 0.956. Both Bmax and B∗ are increasing in risk aversion γ and in the standard deviation of capital returns s; Bmax is independent of γ in any given column of the table for the ζ = 0 case (since R is independent of γ there), but rises substantially with γ in the ζ = 1 case.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the role of the parameter ζ (the share of seignorage transferred vs. wasted)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The parameter ζ governs what the government does with seignorage revenue: transfer it to young consumers (ζ = 1) or waste it (ζ = 0), or some mix. Corollary 1 shows that Bmax is completely invariant to ζ, because at Bmax, rf = g so seignorage (g − rf)Bt = 0 in any case. The value ζ does affect u′(Bmax): if ζ &amp;lt; 1, u′(Bmax) &amp;gt; 0; if ζ = 1, u′(Bmax) = 0. Both configurations yield Bmax as the welfare-maximizing level. The parameter ζ matters for welfare levels and for B∗ (only in the ζ = 1 case, where transfers are positive and boost saving capacity), but not for the main positive or normative results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: In what sense is the model tractable, and what are its key limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Tractability comes from three design choices: (i) the durability shock is additively separable from the production function, so labor income and aggregate saving are non-stochastic; (ii) IES = 1 with Epstein-Zin-Weil preferences, making saving a constant fraction of income; (iii) along balanced growth paths, g and rf are constant, so sustainability reduces to comparing two constants. Limitations acknowledged by the authors: the paper analyzes only balanced growth paths and does not characterize transition dynamics; the framework does not directly address economies where g or rf are stochastic; and the two-period OLG structure is stylized. The authors pose as an open question whether the result that optimal borrowing equals maximal borrowing generalizes to settings with random g.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Bond-capital ratio (B):&lt;/strong&gt; The ratio of outstanding government bonds to the capital stock, Bt/Kt. This is the paper&amp;rsquo;s central state variable and policy instrument. A value B is &amp;ldquo;sustainable&amp;rdquo; if the government can roll over its debt forever at the riskfree interest rate without any primary budget surpluses. The paper distinguishes B from the more commonly reported debt-GDP ratio (which equals B times the capital-output ratio).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adjusted gross rate of return / riskfree rate (R, Rf):&lt;/strong&gt; R ≡ (1+r)/G and Rf ≡ (1+rf)/G, where r is the net return on capital, rf is the riskfree interest rate on bonds, and G = 1+g is the gross growth rate. Expressing returns in these &amp;ldquo;adjusted&amp;rdquo; gross units scales out balanced growth and simplifies the sustainability condition to Rf ≤ 1 (equivalently, rf ≤ g).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Dynamic efficiency (Zilcha criterion):&lt;/strong&gt; In the paper&amp;rsquo;s stochastic setting, the relevant criterion for dynamic efficiency is E{ln R} ≥ 0 (Zilcha 1991, as amended by Rangazas-Russell 2005 and Barbie-Kaul 2009), meaning the geometric mean of the adjusted gross return on capital is at least one. This differs from the deterministic condition r ≥ g. The paper&amp;rsquo;s Region E in Figure 1 is the key zone where E{ln R} &amp;gt; 0 (dynamically efficient) and Rf &amp;lt; 1 (rollover feasible) simultaneously.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bmax (maximum sustainable bond-capital ratio):&lt;/strong&gt; The largest value of B for which the bond-capital ratio is sustainable, defined as the unique root of Rf(B) = 1. At Bmax, the riskfree rate exactly equals the growth rate (rf = g). The paper proves Bmax is finite, invariant to ζ, and equals the welfare-maximizing sustainable bond-capital ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;B∗ (dynamic efficiency threshold):&lt;/strong&gt; The bond-capital ratio at which the economy crosses from Zilcha-inefficiency into Zilcha-efficiency, defined by E{ln R} = 0. For B ∈ [B∗, Bmax], the economy is dynamically efficient and debt rollover is feasible. B∗ &amp;lt; Bmax when risk aversion γ or return volatility s is large enough, defining a non-trivial interval of dynamically efficient, permanently rollable bond levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Durability shock (ε):&lt;/strong&gt; An i.i.d. random variable with mean zero that enters the capital depreciation rate as δ − ε_t. This shock makes the rate of return on capital r = αkα−1 − δ + ε stochastic while leaving the capital stock per unit of effective labor, aggregate wages, and aggregate saving non-stochastic. It is the only source of aggregate uncertainty in the model and is the mechanism that drives a wedge between rf and E{r}.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;No Ponzi Game (NPG) condition:&lt;/strong&gt; The condition that the present discounted value of government debt converges to zero (equivalently, debt equals the present value of future primary surpluses). Standard fiscal sustainability analyses assume this condition holds. The paper explicitly violates it: in the feasible-rollover region rf ≤ g, the present value of aggregate consumption is infinite and the NPG fails, yet government bond values are positive and debt rollover is sustainable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Seignorage (ζ):&lt;/strong&gt; The revenue the government obtains by issuing new bonds in excess of interest payments on existing bonds, equal to (g − rf)Bt when rf &amp;lt; g. The parameter ζ ∈ [0,1] governs the share transferred to young consumers (as lump-sum transfers τt) versus wasted (captured by the government but yielding no utility). A key finding is that Bmax is invariant to ζ, since seignorage is zero at rf = g regardless of ζ.&lt;/p&gt;</description></item><item><title>Rural Migrants and Urban Informality: Evidence From Brazil</title><link>https://macropaperwarehouse.com/papers/rural-migrants-and-urban-informality-evidence-from-brazil/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/rural-migrants-and-urban-informality-evidence-from-brazil/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does rural-urban migration increase or decrease urban informality, and through what mechanisms — and does the answer depend on the time horizon?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Data.&lt;/strong&gt; The paper studies internal migration in Brazil over 2000–2010. The empirical analysis combines: (i) two waves of the Decennial Population Census (2000 and 2010) covering working-age adults (ages 15–64) across 3,548 Minimum Comparable Areas (MCAs); (ii) the universe of formal firms and workers from the matched employer-employee administrative dataset RAIS (1997–2018); (iii) the ECINF informal firm survey (2003); and (iv) the annual National Household Survey (PNAD, 2001–2009) for year-on-year short-run analysis in 700 identifiable municipalities. Internal immigration to the average urban destination was large: 17.6 percent overall over the decade, 7 percent for state-to-state migration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Design.&lt;/strong&gt; The authors use a shift-share instrumental variable (IV) design. The shares are pre-existing migration networks (migrant flows by origin-destination pair, 1995–2000). The shifts are drought shocks constructed from the Standardized Precipitation-Evapotranspiration Index (SPEI) interacted with agricultural crop calendars and the value share of each crop in each origin municipality — accumulated over the 2000–2010 decade. A second independent instrument uses international commodity price shocks as push factors (following a China-analogous construction); the two instruments are nearly uncorrelated across origins (0.007) and only weakly correlated across destinations (-0.3), providing an independent validation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Long-Run Findings (decadal changes, 2000–2010).&lt;/strong&gt; A one-percentage-point increase in the immigration rate (equal to 18.5 percent of a standard deviation):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Increases the share of workers in formal wage employment by &lt;strong&gt;0.27 percentage points&lt;/strong&gt; (a 1.2 percent increase from the mean of 23 percent).&lt;/li&gt;
&lt;li&gt;Decreases the share in informal wage employment by &lt;strong&gt;0.29 percentage points&lt;/strong&gt; (a 2.9 percent decrease from the mean of 10 percent).&lt;/li&gt;
&lt;li&gt;Has no effect on overall wage employment, unemployment, or self-employment — the formalization effect is a reallocation from informal to formal jobs, not net job creation.&lt;/li&gt;
&lt;li&gt;Reduces formal sector wages by &lt;strong&gt;0.6 percent&lt;/strong&gt;, with no effect on informal wages.&lt;/li&gt;
&lt;li&gt;Increases the number of formal establishments by &lt;strong&gt;1.6 percent&lt;/strong&gt; and the number of formal jobs by &lt;strong&gt;2 percent&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Raises gross firm entry by &lt;strong&gt;2.8 percent&lt;/strong&gt; and gross firm exit by &lt;strong&gt;3 percent&lt;/strong&gt; (higher churn), with effects stable or slightly increasing through 2017–18.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;These firm-creation effects are not driven by migrants starting businesses: migrants are not more likely to be business owners in high-immigration municipalities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Short-Run Findings.&lt;/strong&gt; Using year-on-year specifications with the PNAD (2001–2009), the authors replicate the results in the prior literature: municipalities receiving more migrants experience a reduction in formal wage employment, with no change in informal employment or non-employment — so the share of informal jobs rises. These short-run informality-increasing effects coexist with the long-run formalization results, and are not a sample artifact (the long-run results are unchanged when restricted to the same 700 PNAD municipalities).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism — Downward Nominal Wage Rigidity (DNWR).&lt;/strong&gt; DNWR in the formal sector is the key mechanism reconciling short- and long-run effects. In Brazil, nominal wage cuts were illegal, and the national minimum wage rose regularly during the 2000s. Two municipality-level DNWR proxies are used: (i) the Kaitz index (national minimum wage / municipality median wage in 2000); (ii) the share of workers with negative year-on-year nominal wage changes (from RAIS, 1997–2000). In municipalities with higher DNWR: the positive formalization effects of immigration are smaller or fully muted; non-employment increases; and formal wages decline less. These cross-sectional patterns echo the Harris-Todaro-Fields prediction, and are consistent with DNWR being more binding in the short run (when nominal rigidities bind) than in the long run (when inflation and worker turnover allow real wage adjustment).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper develops and estimates a dynamic model of firm dynamics and informality, extending the canonical Hopenhayn framework with (i) two margins of informality — the extensive margin (whether a firm registers) and the intensive margin (whether a registered formal firm hires workers formally) — and (ii) heterogeneous long-run productivity parameters (nu) that generate firm-specific life-cycle growth profiles. Formal firms cannot revert to informality; informal firms can formalize by paying the cost differential between formal and informal entry costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactuals.&lt;/strong&gt; A simulated once-and-for-all 10 percent labor supply shock (approximately the 80th percentile of observed immigration shocks) produces: a 4.1 percent decline in the share of informal workers (IV: 7.5 percent); a 16.1 percent increase in formal firms (IV: 21.1 percent); and a 3.4 percent wage decline (IV: 5 percent). Of the increase in formal firms, &lt;strong&gt;40 percent&lt;/strong&gt; is accounted for by formalization of previously informal firms, highlighting the stepping-stone role of informality that a static or dual-economy model would miss. Average firm productivity declines by 1.4 percent due to worsening firm composition (the share of formal firms in the lowest productivity quartile rises by more than 4 percentage points). A counterfactual that nearly eliminates the extensive margin of informality (via steep enforcement costs) raises total output by 8.6 percent vs. 7 percent in the baseline shock, and increases average firm productivity by 2.1 percent vs. a decline of 1.4 percent — at the cost of displacing the least productive informal firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; Results pertain to internal (not international) migration; drought-induced migrants do not change the skill composition of the labor force at destination, justifying a homogeneous worker assumption. The formalization effects hold for migrants and non-migrants separately, and for high- and low-skilled workers separately. The model is calibrated to the average urban destination in Brazil, not a spatial general equilibrium.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the identification strategy, and what are the key threats to validity the authors address?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use a shift-share IV where shifts are drought shocks at origin municipalities (constructed from SPEI x crop calendar x crop revenue share, accumulated over 2000–2010) and shares are pre-2000 migration networks. Threats addressed: (i) pre-trends — no evidence of differential pre-trends in firm outcomes between 1997–98 and 1999–2000; (ii) demand channel — controlling for local drought shocks and distance-weighted neighboring shocks leaves results unchanged; (iii) capital reallocation — adding a bank-network-based shift-share control (following prior literature) does not change results; (iv) agricultural processing linkages — results hold after excluding agricultural firms and food/beverage/tobacco manufacturers; (v) migration persistence — controlling for baseline log population and 1995–2000 migration rates leaves results unchanged. The commodity-price-shock instrument provides an independent validation, yielding similar results despite near-zero cross-origin correlation with drought shocks and only -0.3 correlation across destinations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors reconcile the long-run formalization result with the short-run informality-increasing result, and what role does DNWR play?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;DNWR is the key mechanism. Nominal wage cuts are illegal in Brazil&amp;rsquo;s formal sector, and the minimum wage rose through the 2000s, making DNWR binding especially in the short run. In the year-on-year specification (PNAD, 2001–2009), immigration reduces formal wage employment with no change in informal employment, raising the informal share — consistent with prior literature. Over the decade, inflation and worker turnover permit real formal wage adjustment, enabling formal sector expansion. Cross-sectional heterogeneity confirms this: in municipalities with above-median Kaitz index or below-median share of negative wage changes, the formalization effect of immigration is smaller or zero, and non-employment rises — precisely the Harris-Todaro-Fields prediction for rigid-wage environments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the exact magnitude of the firm-level effects and how persistent are they?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A one-percentage-point increase in the immigration rate increases formal establishments by 1.6 percent, formal jobs by 2 percent, firm entry by 2.8 percent, and firm exit by 3 percent — all decadal effects (1999–2000 to 2011–12). Effects on firms, entry, exit, and jobs remain stable or slightly increasing through 2017–18 as estimated using RAIS panel data, with no evidence of pre-trends (effects near zero in 1997–98 to 1999–2000 period). The effect on firm-level average wages is negative (consistent with the worker-level wage effect) but not statistically significant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Are migrants themselves the source of new formal firm creation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The authors directly test and reject this channel. Migrants are not more likely to be business owners — either of small firms (fewer than 5 employees) or larger firms (6 or more employees) — in municipalities that receive more immigration. The increase in formal firm entry is driven by non-migrants responding to cheaper labor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the two margins of informality in the model, and why does the intensive margin matter for the migration-formality nexus?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The extensive margin is whether a firm registers formally (firm-level binary). The intensive margin is whether a formally registered firm hires workers without formal labor contracts (worker-level, within formal firms). The intensive margin is crucial because it links formal firms to migrants: newly arrived migrants may take informal jobs within formal firms, allowing formal firm creation to respond to the immigration shock even before the labor market fully formalizes. In the transition dynamics after an immigration shock with DNWR, new formal firms tend to be small and lower-productivity, and hire a substantial fraction of their workforce informally — so labor informality hovers near its initial level for several years even as firm informality declines quickly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What fraction of the increase in formal firms in the counterfactual comes from stepping-stone formalization versus new formal entry?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the baseline 10 percent labor supply counterfactual, approximately &lt;strong&gt;40 percent&lt;/strong&gt; of the increase in the number of formal firms comes from formalization of previously informal firms across their life cycles. The remaining 60 percent comes from new formal firm creation. A static framework would miss the stepping-stone channel entirely and substantially underestimate total formalization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the model&amp;rsquo;s calibration pin down the cost structure of informal vs. formal firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is calibrated using a two-step minimum distance procedure. First-step parameters include the persistence of formal firms&amp;rsquo; productivity process (estimated from RAIS: rho_f = 0.92), and statutory tax rates (payroll tax tau_w = 0.375; revenue VAT tau_y = 0.293). Second-step parameters (12 total, including entry costs, exogenous death rates, productivity dispersion, and cost-function curvatures for both margins of informality) are estimated by minimizing the distance between simulated and observed moments from RAIS (2003 cross-section for static moments; 2000–2011 panel for growth moments) and ECINF (informal firms with up to 5 employees, 2003). Key calibrated values: formal entry costs are more than twice informal entry costs and correspond to over 30 times the 2003 monthly national minimum wage; the informal sector exogenous death rate (delta_i = 0.148) is more than twice the formal rate; productivity variance and persistence are similar across sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What happens to firm productivity and output per worker in the long-run counterfactual?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Average firm productivity declines by 1.4 percent despite lower informality. The composition of formal firms worsens: the share of firms in the lowest productivity quartile rises by more than 4 percentage points, while the share in the top quartile falls by about 3 percentage points. Total output and tax revenues increase (7 and 8.6 percent, respectively), but both decline in per capita terms. The authors note these are likely lower bounds because the model assumes no technological differences between formal and informal sectors and no differential capital access.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What does the enforcement counterfactual reveal about the dual role of informality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;When the extensive margin of informality is nearly shut down (by making the informal cost function very steep), a 10 percent labor supply shock produces: output increase of 8.6 percent (vs. 7 percent with informality present); average firm productivity increase of 2.1 percent (vs. decline of 1.4 percent); much higher tax revenues due to greater formality. However, this comes at the cost of a sizable reduction in total firm count as the least productive informal firms are displaced. This illustrates the dual role: in the short run, the informal sector acts as an employment buffer and stepping-stone, which is more important when formal wage rigidity is stronger; but in the long run, it dampens aggregate economic benefits from immigration by sheltering low-productivity firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Do the results hold for both migrants and non-migrants, and across skill levels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. Appendix results show similar employment and wage effects for migrants and non-migrants separately, though formal wage declines are more pronounced for non-migrants. Results are also similar for high- and low-skilled workers — which the authors attribute to the fact that drought-induced migration does not change the skill composition of the workforce at destination (confirmed empirically). Price-shock-induced migrants differ: they are more likely to be young and male, and do change workforce composition, providing a different set of compliers that strengthens external validity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper relate to the &amp;ldquo;startup deficit&amp;rdquo; literature on demographic decline?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s findings are the mirror image of the US startup deficit literature, which argues that demographic slowdown reduced firm entry, labor reallocation, and employment growth. The magnitudes are comparable in scale: the US startup deficit corresponds to a 5-percentage-point decline in firm entry between 1980 and 2012, while the rural-urban migration shocks studied here produce first-order effects on firm entry of similar or larger magnitude (2.8 percent per percentage point of immigration rate), suggesting labor supply growth is a primary driver of formal firm dynamics in both directions.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Downward Nominal Wage Rigidity (DNWR).&lt;/strong&gt; In the paper&amp;rsquo;s usage, the binding constraint that formal sector wages cannot be cut in nominal terms — in Brazil, both legal prohibition of nominal wage cuts and a rising national minimum wage. DNWR is the paper&amp;rsquo;s central mechanism explaining why immigration increases informality in the short run (wages cannot adjust) but reduces it over the decade (inflation and turnover permit real adjustment). Measured empirically via the municipality-level Kaitz index (national minimum wage / local median wage) and via the share of workers with negative year-on-year nominal wage changes in RAIS.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive Margin of Informality.&lt;/strong&gt; Whether a firm is registered with the government (formal) or not (informal). In the model, informal firms can avoid taxes but face a size-increasing cost of informality and the option to formalize by paying the difference in entry costs. This margin captures the firm&amp;rsquo;s legal registration status.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive Margin of Informality.&lt;/strong&gt; Whether a formally registered firm hires individual workers with or without formal labor contracts (signed work booklet, carteira de trabalho). Formal firms face increasing costs for informal hiring but exploit this margin for lower-cost labor, especially when small or young. This margin is critical because it links formal firms to migration-induced informal labor supply and allows formal firms to absorb migrants before full wage adjustment occurs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stepping-Stone Role of Informality.&lt;/strong&gt; The paper&amp;rsquo;s term for the dynamic channel through which the informal sector facilitates transitions to formality for both firms and workers. Informal firms accumulate productivity experience and formalize when productivity crosses the formalization threshold; informal workers within formal firms transition to formal contracts as firms grow. In the counterfactuals, 40 percent of the increase in formal firms following a labor supply shock is attributable to this channel. The stepping-stone role is most valuable during the short-run period of formal wage rigidity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shift-Share Instrumental Variable.&lt;/strong&gt; The identification design combining pre-existing migration network shares (fraction of prior migrants to destination d from each origin o, computed 1995–2000) with exogenous push shocks at origin (drought shocks or commodity price shocks). The instrument predicts which destination municipalities receive more migrants based purely on exogenous origin-level shocks, purging the endogeneity from migrants self-selecting into prosperous cities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Minimum Comparable Area (MCA).&lt;/strong&gt; The paper&amp;rsquo;s geographic unit of analysis: a harmonized aggregation of Brazilian municipalities whose administrative borders changed during the study period, yielding 3,548 stable units covering all urban destinations studied. The authors call these &amp;ldquo;municipalities&amp;rdquo; for convenience.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Harris-Todaro-Fields Framework.&lt;/strong&gt; The theoretical benchmark against which the paper&amp;rsquo;s results are compared — the view (from Harris and Todaro 1970 and Fields) that rural-urban migration increases urban unemployment or informality because DNWR prevents the formal sector from absorbing migrants, who instead queue for formal jobs or enter the informal sector. The paper shows this prediction holds in the short run and in high-DNWR municipalities, but not in the long run where real wage adjustment occurs.&lt;/p&gt;</description></item><item><title>Sanctions and the Exchange Rate</title><link>https://macropaperwarehouse.com/papers/sanctions-and-the-exchange-rate/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/sanctions-and-the-exchange-rate/</guid><description>&lt;h2 id="layer-1--core-argument"&gt;Layer 1 — Core Argument&lt;/h2&gt;
&lt;p&gt;Itskhoki and Mukhin develop a tractable open-economy model with financial market segmentation — in which only the government sector (including state banks and exporting firms) can intermediate cross-border capital flows — to study how trade and financial sanctions affect the nominal exchange rate. Their first main result is a Lerner-symmetry equivalence: sanctions limiting a country&amp;rsquo;s exports or freezing its foreign assets depreciate the exchange rate, while sanctions limiting imports appreciate it, even though both types of policies have exactly the same effect on real allocations, including household welfare and government fiscal revenues. The mechanism is direct — export sanctions reduce the supply of foreign currency, requiring depreciation to restore market clearing, whereas import sanctions reduce the demand for foreign currency, requiring appreciation — and because real income effects are identical, the exchange rate movement is not informative about effectiveness: one cannot evaluate the effectiveness of sanctions based solely on the dynamics of the exchange rate. Beyond direct trade sanctions, increased precautionary savings in foreign currency also depreciate the exchange rate when they are not offset by the sale of official reserves or financial repression of foreign-currency savings. Applying the calibrated model to Russia&amp;rsquo;s post-invasion experience, the dynamics of the ruble exchange rate following Russia&amp;rsquo;s invasion of Ukraine in February 2022 are quantitatively consistent with the combined effects of these forces calibrated to the observed sanctions and government policies; the combined effect from 2.5 years of sanctions corresponds to a permanent decline in consumption of 0.9% in Russia, while the net effect is close to zero for the rest of the world, and the freeze of FX reserves together with import tariffs act as a positive transfer from Russia to the rest of the world while quantity restrictions on exports raise world energy prices and generate global welfare losses.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core theoretical result on trade sanctions and the exchange rate?&lt;/strong&gt;
A: Proposition 1 establishes that permanent sanctions on imports (raising import prices P*_t by τ) are equivalent in their effect on import consumption and welfare to a combination of permanent sanctions on exports (reducing export prices Q*_t by τ) and a partial seizure of foreign assets (reducing F*_0 by τ). Both sets of sanctions produce the same path of reduced import quantities and the same welfare loss. However, sanctions on exports cum foreign-asset seizure are associated with an additional depreciation of the exchange rate by τ percent relative to import sanctions. This equivalence is a manifestation of Lerner (1936) symmetry extended to a dynamic international macro environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the intuition for the opposite exchange rate movements under import versus export sanctions?&lt;/strong&gt;
A: Both kinds of sanctions shrink the country&amp;rsquo;s feasible import consumption set equivalently in real terms, but they operate through different channels. Export sanctions directly reduce the inflow of foreign currency (export revenues fall), so the exchange rate must depreciate to discourage import demand and bring it in line with the reduced budget. Import sanctions raise the price of foreign goods directly; without an offsetting movement, this would create excess demand for domestic non-tradables. To eliminate the excess demand and leave export revenues partially used, the exchange rate must appreciate. In both cases, the import demand schedule — CF_t = (E_t P*_t / P_t)^{-θ} γ Y_t — pins down the exchange rate that supports the same equilibrium import allocation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does fiscal equivalence also hold, even when the government relies primarily on exports for revenue?&lt;/strong&gt;
A: Yes. Proposition 1 and the surrounding analysis show that the equivalence result for export and import sanctions extends to the fiscal balance, even when the government relies exclusively on exports for fiscal revenues. The mechanism is a general equilibrium adjustment in the exchange rate: depreciation (under export sanctions) partially ameliorates the impact by increasing the local-currency purchasing power of export revenues, while appreciation (under import sanctions) has the opposite effect. The net fiscal-balance effect of both kinds of sanctions ends up being the same.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What role does the financial market segmentation assumption play?&lt;/strong&gt;
A: The paper assumes a form of financial market segmentation in which only the government sector (including state banks and exporting companies) can intermediate capital flows across the border, subject to international restrictions. This captures both the withdrawal of foreign investors from the Russian market and the segmentation of Russian households from the international financial market due to external sanctions and domestic capital controls. Under this structure, exports and FX reserves are the key sources of currency supply to the economy, and imports plus domestic foreign-currency savings are the key sources of currency demand; the equilibrium exchange rate is determined by the balance of these in the domestic market. Ricardian equivalence for foreign-currency savings does not hold when κ &amp;gt; 0 in the household utility function, so government reserve policy has real effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the role of precautionary savings demand for foreign currency?&lt;/strong&gt;
A: Households have foreign-currency bonds in their utility function reflecting a precautionary (hedging) demand for future purchases of foreign tradables, parameterized by a shock Ψ_t. When financial conditions collapse — the local stock market crashes, domestic deposits face inflation and bank-run risk, and access to foreign assets is constrained — Ψ_t rises above the real value of household FX savings, creating pressure to accumulate foreign-currency savings despite low expected returns. With inelastic inflow of foreign currency from exports (due to financial sanctions) and no feasible FX reserve sale, a large jump-depreciation is required to restore equilibrium by curbing the increased demand for foreign currency via lower expected returns and higher import prices. The effect is transitory: it dies out as households accumulate enough FX savings. The optimal government response is to sell FX reserves to accommodate household demand without an exchange rate devaluation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens when FX interventions are infeasible?&lt;/strong&gt;
A: When the central bank&amp;rsquo;s reserves are frozen by sanctions or otherwise unavailable, the government can use financial repression to offset the exchange rate effects of financial shocks. Specifically, by imposing fees on purchasing and withdrawing foreign currency — thereby reducing the household return on foreign-currency deposits R*_H below the international rate R*_t — the central bank can suppress foreign-currency demand. While financial repression is suboptimal in a representative-agent economy, it may be second-best in heterogeneous-agent economies or economies with balance-sheet effects. Importantly, the exchange rate remains allocative even under financial sanctions and financial repression; it is not rendered irrelevant by these policies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do the results change when Russia is modeled as a large economy in the commodity market?&lt;/strong&gt;
A: Section 3 extends the analysis to an economy that is large in the world commodity market, modeling Russia as a large commodity exporter, and spelling out specific policy instruments. The paper shows that import prices and export revenues still constitute a sufficient statistic for the macroeconomic effects on the economy under sanctions. However, the welfare implications for the rest of the world depend crucially on whether sanctions take the form of trade taxes or quantity restrictions. A price cap on exported commodities can replicate a tax on exports, achieving the desired wealth transfer to the coalition. In contrast, imposing quantity restrictions on a large commodity exporter reduces global supply and drives up world energy prices, hurting the sanctioned economy when it lowers export revenues, but also imposing substantial costs on senders.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper calibrate the model to Russia&amp;rsquo;s ruble dynamics, and how well does it fit?&lt;/strong&gt;
A: The paper employs two calibration strategies. The first reproduces the ex-ante calibration from the 2022 working paper version based on scant data available in the first months after the invasion, without targeting any exchange rate moments. This calibration provides a remarkable out-of-sample fit, predicting accurately the dynamics of the ruble in the following two years. The second is an ex-post calibration that infers structural shocks to perfectly match observed dynamics of Russian imports, exports, commodity prices, domestic output, official FX reserves, inflation, and the exchange rate. Both approaches agree on the decomposition of exchange rate dynamics and confirm the quantitative importance of the theoretical mechanisms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the calibrated decomposition say about the phases of ruble dynamics?&lt;/strong&gt;
A: The initial sharp depreciation in the first weeks after the invasion is mostly driven by increased precautionary demand for foreign currency. The frozen FX assets translate into modest losses of permanent income (only about 3% depreciation), but the asset freeze and sanctions on the Central Bank had a much larger indirect effect by limiting the capacity to accommodate the financial shock with FX interventions. One month out, trade shocks begin to dominate: import restrictions curb FX demand, while the spike in energy prices elevated Russian export revenues, increasing foreign-currency inflows. These forces combined neutralize capital outflows and the surge in financial FX demand, explaining the sharp appreciation of the ruble by summer 2022 (about 30% stronger than pre-war by June). Over time, import quantities recovered as parallel imports and new trade linkages were established, and export revenue inflows contracted as commodity prices declined, bringing the exchange rate back to and then about 20% weaker than pre-war levels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the welfare and fiscal consequences quantified by the calibrated model?&lt;/strong&gt;
A: The initial exchange rate depreciation boosted fiscal revenues by 12%, amplified further by greater export revenues starting in the second month. These effects were offset in the medium run by the exchange rate appreciation due to trade sanctions, with net real income turning negative starting from April 2022. International sanctions decrease long-run real government revenues by about 4%, mostly due to a reduction in export revenues. The combined effect from 2.5 years of sanctions corresponds to a permanent decline in consumption of 0.9% in Russia — vastly larger than conventional estimates of the cost of a business cycle — and close to zero on net for the rest of the world. Consistent with the theoretical results, the freeze of FX reserves and import tariffs act as a positive transfer from Russia to the rest of the world, while quantity restrictions on exports result in higher energy prices, lower consumption, and global welfare losses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why cannot the exchange rate be used to evaluate the effectiveness of sanctions in real time?&lt;/strong&gt;
A: Because import sanctions and export sanctions generate opposite exchange rate movements while having exactly the same effect on real allocations, welfare, and fiscal balance, there is no one-to-one mapping between the exchange rate and welfare under sanctions. A strong exchange rate (appreciation) after sanctions may reflect import restrictions — which are just as effective in reducing real income as export restrictions that would have caused depreciation. Conversely, a weak exchange rate need not imply sanctions are ineffective; it may simply reflect that sanctions took the form of export or asset-freeze measures. The ruble&amp;rsquo;s rapid appreciation through summer 2022 illustrates this: rather than indicating that sanctions failed, it was largely consistent with the combination of import restrictions and high commodity prices, while the underlying real income effect was substantially negative.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Lerner symmetry (macroeconomic version):&lt;/strong&gt; The principle, originating in Lerner (1936), that a uniform import tariff and a uniform export tax yield the same real economic outcomes — the same allocation and welfare — but are sustained by a differential movement in relative prices (appreciation versus depreciation). In the paper&amp;rsquo;s context, both import and export sanctions of equivalent magnitude reduce the real income of the sanctioned economy by the same amount and produce the same path of import consumption and welfare, even though they move the exchange rate in opposite directions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial market segmentation:&lt;/strong&gt; The model&amp;rsquo;s departure from standard international macro in which only the government sector (including state banks and exporting companies) can intermediate cross-border capital flows, subject to international restrictions. Households cannot freely access international financial markets. This makes exports and FX reserves the only sources of foreign-currency supply to the domestic economy, and imports plus domestic foreign-currency savings the only sources of demand, so the exchange rate is determined entirely by the domestic balance of these flows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Precautionary foreign-currency demand shock (Ψ_t):&lt;/strong&gt; A shock that raises the household bliss-point for real foreign-currency bond holdings above the current stock, capturing a collapse in the supply of alternative savings vehicles (domestic stocks, bank deposits, access to foreign assets). In the model it enters households&amp;rsquo; utility directly; an increase in Ψ_t above real FX savings creates depreciatory pressure on the exchange rate when not offset by FX reserve sales or financial repression.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Financial repression (in the model):&lt;/strong&gt; Government suppression of the household rate of return on foreign-currency deposits R*_H below the international rate R*_t, implemented via fees on purchasing and withdrawing foreign currency. It offsets the depreciatory effect of a precautionary savings shock without requiring FX reserve sales, at the cost of a distortion in the domestic financial market. The paper notes Russia introduced such fees in March–April 2022.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient statistic for macroeconomic effects:&lt;/strong&gt; When the sanctioned economy is large (as Russia is in global energy markets), import prices and export revenues still constitute a sufficient statistic for the macroeconomic effects of sanctions on the economy — i.e., the same pair of variables summarizes welfare, fiscal, and exchange rate outcomes regardless of the specific instrument used to impose sanctions, provided the terms of trade deterioration is the same.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price cap (as an export tax equivalent):&lt;/strong&gt; A price cap on a sanctioned country&amp;rsquo;s exported commodities can replicate the effect of a tax on exports from the coalition&amp;rsquo;s perspective, achieving the same real-income transfer from the sanctioned country to the rest of the world without reducing global supply (as quantity restrictions do). This distinguishes it from quantity restrictions on exports, which reduce global energy supply and impose welfare costs on the coalition.&lt;/p&gt;
&lt;hr&gt;
&lt;p&gt;&lt;em&gt;Summary based on LSE Research Online accepted version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;</description></item><item><title>School Choice and the Housing Market</title><link>https://macropaperwarehouse.com/papers/school-choice-and-the-housing-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/school-choice-and-the-housing-market/</guid><description>&lt;p&gt;Grigoryan (2021) develops a unified general-equilibrium framework that jointly models school assignment mechanisms and the housing market to evaluate the welfare and distributional consequences of replacing traditional neighborhood assignment (NA) with the Deferred Acceptance (DA) mechanism. The paper fills a gap in the matching theory literature, where preferences and priorities are typically treated as exogenous, by making residential choices endogenous: families first observe which school assignment mechanism the district announces, then optimally select a neighborhood given market-clearing prices and other families&amp;rsquo; choices, and finally children are assigned to schools through the announced mechanism.&lt;/p&gt;
&lt;p&gt;The model features a continuum of families, each with a type defined by valuations over all neighborhood–school pairs, a finite set of neighborhoods and schools (one school per neighborhood), and competitive equilibrium prices. Three mechanisms are compared: NA (each child attends the neighborhood school), DA without neighborhood priority (DA), and DA with neighborhood priority (DN), where neighborhood residents receive priority at their local school.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s first major result (Theorem 3) is that DN unambiguously generates weakly higher aggregate welfare than NA. The proof exploits the fact that DN preserves NA&amp;rsquo;s option — families can still guarantee admission to the neighborhood school by living there — while additionally allowing families to access seats at other schools that go unclaimed by neighborhood residents. Although price effects under DN can make some individual families worse off relative to NA, aggregate welfare (inclusive of house sellers) is always weakly higher under DN. In simulations with 1,000 students, 10 neighborhoods, and 10 schools, DN yields average aggregate welfare gains of 2.40% relative to NA across the 18 parameter configurations studied.&lt;/p&gt;
&lt;p&gt;The welfare comparison between DA (without neighborhood priority) and NA is ambiguous in the general model: simulations show DA producing gains as large as +5.65% and losses as large as −18.26% relative to NA, depending on the degree of preference alignment across families (parameter α) and the variance in school capacities (parameter γ). DN also dominates DA in aggregate welfare under two sufficient conditions — identical ordinal preference rankings over neighborhoods and schools (Assumption 1 or 2) — though counterexamples exist when these assumptions fail.&lt;/p&gt;
&lt;p&gt;The second major result (Theorem 5, Corollaries 1–2) concerns the welfare of lowest-income families, defined as those with budget (maximum willingness to pay for housing) equal to zero or sufficiently close to zero. Under two jointly sufficient conditions — (1) neighborhoods that are underdemanded (zero-priced) under NA remain underdemanded under DA/DN, and (2) the schools in those underdemanded neighborhoods are themselves underdemanded — both DA and DN generate weakly higher welfare for the lowest-income families than NA. These conditions hold whenever families share common ordinal preference rankings (Corollary 1) and in the uniform economy where each valuation profile is equally likely (Corollary 2). The conditions are shown to be approximately necessary in a robustness sense (Theorem 6): for any economy violating them, an arbitrarily close economy exists in which a positive measure of zero-income families prefer NA. In simulations, DN raises lowest-income welfare by an average of 26.51% and DA by an average of 38.25% relative to NA.&lt;/p&gt;
&lt;p&gt;The paper also proves existence of a competitive equilibrium for the continuum economy under DA and DN via the Schauder-Tychonoff fixed-point theorem (Theorem 2), exploiting the continuity of school assignment probabilities in families&amp;rsquo; neighborhood choices. In discrete economies, assignment externalities can preclude equilibrium existence, but approximate equilibria exist in sufficiently large discrete markets and all welfare comparisons carry over approximately. The existence proof technique applies to general assignment games with externalities including peer preferences and complementarities.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are derived for a model without direct peer externalities or endogenous school quality; a supplementary extension to local public financing finds that the aggregate welfare superiority of DA over NA may not survive when school spending is capitalized into housing prices, though the lowest-income welfare sufficiency conditions of Theorem 5 do extend to that environment.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question and why does the housing market matter for evaluating school choice?&lt;/p&gt;
&lt;p&gt;A: The paper asks how replacing neighborhood assignment with the Deferred Acceptance mechanism affects aggregate welfare and the welfare of the lowest-income families, accounting for the fact that families choose where to live in response to the school assignment mechanism. The housing market matters because under neighborhood assignment families can guarantee enrollment at a preferred school by purchasing a house in that school&amp;rsquo;s neighborhood; switching to DA changes these strategic incentives, alters equilibrium prices, and therefore changes who ends up in which neighborhood before any school assignment takes place. Ignoring residential choices would miss this feedback loop between assignment rules and housing demand.&lt;/p&gt;
&lt;p&gt;Q: What are the three mechanisms compared, and how do they differ?&lt;/p&gt;
&lt;p&gt;A: Neighborhood assignment (NA) assigns each child to the school in their neighborhood with certainty. DA without neighborhood priority allocates seats by student preference rankings and lottery numbers, with market-clearing cutoffs determined iteratively; no residential location confers a priority advantage. DN (DA with neighborhood priority) works like DA but grants neighborhood residents a priority of 1 at their local school and 0 at all other schools, effectively guaranteeing neighborhood families a seat at their local school while filling remaining seats by lottery among non-neighborhood applicants.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 3 establish, and what is the intuition for why DN dominates NA in aggregate welfare?&lt;/p&gt;
&lt;p&gt;A: Theorem 3 establishes that for any competitive equilibrium under DN and any competitive equilibrium under NA, aggregate welfare is weakly higher under DN. The intuition is that DN preserves all options available under NA — a family can always choose the neighborhood corresponding to its most-valued school and be guaranteed admission there — while additionally providing access to seats at other schools not claimed by their own neighborhood residents. The proof maps DN&amp;rsquo;s CE onto a Walrasian equilibrium of a continuum assignment game and invokes the welfare-maximization property of such equilibria from Gretsky, Ostroy, and Zame (1992).&lt;/p&gt;
&lt;p&gt;Q: Why is the welfare comparison between DA and NA ambiguous?&lt;/p&gt;
&lt;p&gt;A: Under NA, families with the highest cardinal valuations for a particular school can guarantee admission by purchasing a house in that neighborhood, and this targeted sorting can raise aggregate welfare when preferences over schools are strongly aligned. Under DA (without neighborhood priority), no location guarantees school admission, so families lose this signaling device; but DA allows families to live in preferred neighborhoods without sacrificing school quality, which raises welfare when preferences are heterogeneous. Neither effect dominates in general: in simulations, DA ranges from −18.26% to +5.65% relative to NA across the parameter space.&lt;/p&gt;
&lt;p&gt;Q: What role do neighborhood priorities play as a &amp;ldquo;signaling device,&amp;rdquo; and when does DN dominate DA?&lt;/p&gt;
&lt;p&gt;A: Neighborhood priorities allow families to credibly signal high valuations for a school by choosing to live in that school&amp;rsquo;s neighborhood, analogously to signaling devices in matching markets without money. When families have identical ordinal preference rankings over neighborhoods and schools (Assumptions 1 or 2), DN generates weakly higher aggregate welfare than DA because any DA assignment probability can be replicated under DN by mixing over neighborhoods, but the converse is not true. Counterexamples exist when preference rankings differ across families, so the DN-over-DA dominance is not universal.&lt;/p&gt;
&lt;p&gt;Q: What are the sufficient conditions for lowest-income families to prefer DA/DN to NA, and how tight are they?&lt;/p&gt;
&lt;p&gt;A: The two joint conditions are: (1) neighborhoods that have zero price (are underdemanded) under NA also have zero price under DA or DN after the mechanism switch; and (2) the schools located in those underdemanded neighborhoods are themselves underdemanded (have zero admission cutoffs) under DA/DN. Condition (1) reflects that the poorest neighborhoods are unlikely to become highly sought-after merely because the assignment mechanism changed. Condition (2) is consistent with the empirical finding of Owens and Candipan (2019) that in large US metropolitan areas the poorest neighborhoods typically have underperforming schools. Theorem 6 shows these conditions are approximately necessary: any economy violating them is arbitrarily close to one where a positive measure of zero-budget families prefer NA, so robustness requires them.&lt;/p&gt;
&lt;p&gt;Q: What do the simulations show about the magnitude of welfare effects for lowest-income families?&lt;/p&gt;
&lt;p&gt;A: In simulations with 10 lowest-income families (budgets of 0.05) among 1,000 total, DN raises lowest-income welfare by an average of 26.51% relative to NA and DA raises it by an average of 38.25% relative to NA, across the 18 parameter configurations. The gains are larger when preferences for neighborhoods and schools are less correlated (lower α) and when school capacities are more uniform (higher γ). DA consistently outperforms DN for lowest-income families in the simulations, even though DN dominates NA in aggregate welfare more reliably.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle equilibrium existence given the externalities created by residential choices?&lt;/p&gt;
&lt;p&gt;A: Because a family&amp;rsquo;s expected utility from a neighborhood depends on other families&amp;rsquo; neighborhood choices (through their effect on school assignment probabilities), standard existence results for assignment games do not directly apply. For the continuum economy, the author proves that school assignment probabilities under DA/DN are equicontinuous in families&amp;rsquo; neighborhood choices, which enables application of the Schauder-Tychonoff fixed-point theorem to guarantee the existence of a competitive equilibrium (Theorem 2). In finite discrete economies, assignment externalities can prevent equilibrium existence (illustrated by an example in Appendix B), but approximate equilibria exist for sufficiently large discrete markets, and all welfare comparisons hold approximately.&lt;/p&gt;
&lt;p&gt;Q: How does the paper&amp;rsquo;s model relate to and extend prior theoretical work on school choice and welfare?&lt;/p&gt;
&lt;p&gt;A: Prior theoretical work (e.g., Calsamiglia et al. 2015; Xu 2019; Avery and Pathak 2020) uses stylized models with single-parameter family types, identical ordinal school rankings, supermodular valuations, and no preferences over neighborhoods. This paper allows an unrestricted preference domain — families have arbitrary valuations over all neighborhood–school pairs — which generates novel findings: in the general model, lowest-income families do not necessarily benefit from DA (contrary to Calsamiglia et al. and Xu), aggregate welfare comparisons between DA and NA are ambiguous (whereas they are trivially resolved in the special cases of prior work), and neighborhood priorities can be welfare-improving even relative to DA without priorities.&lt;/p&gt;
&lt;p&gt;Q: Does the paper address the extension to endogenous school quality or local public financing?&lt;/p&gt;
&lt;p&gt;A: In Supplementary Appendix B, the model is extended to allow school spending to be financed by local property taxes, making school quality endogenous to neighborhood housing values. In that environment, the aggregate welfare superiority of DA/DN over NA may not hold: DA attracts non-neighborhood applicants to high-priced neighborhoods, and if those schools are a poor match for those applicants absent the spending, social welfare may fall — a result analogous to Barseghyan et al. (2013). However, the paper reports that the sufficiency conditions for lowest-income family welfare comparisons (Theorem 5) do extend to the local public financing environment, preserving the distributional results.&lt;/p&gt;
&lt;p&gt;Q: What does the paper say about alternative mechanisms such as Immediate Acceptance (Boston mechanism) and Top Trading Cycles?&lt;/p&gt;
&lt;p&gt;A: The Supplementary Appendix studies these alternatives. For Immediate Acceptance (IA), the paper shows that when there are neighborhood priorities, lowest-income families may prefer DA to IA, echoing the finding that IA is not strategyproof and may disproportionately hurt low-income families who are worse at gaming the system or have worse outside options (Pathak and Sonmez 2008; Calsamiglia et al. 2015). Top Trading Cycles and further extensions are also analyzed in the Supplementary Appendix, though detailed results are not developed in the main text.&lt;/p&gt;
&lt;p&gt;Neighborhood Assignment (NA): The baseline mechanism under which each family&amp;rsquo;s child is automatically enrolled in the school located in their chosen residential neighborhood, with no option to attend schools outside that neighborhood.&lt;/p&gt;
&lt;p&gt;Deferred Acceptance without Neighborhood Priority (DA): A strategyproof centralized assignment mechanism in which seats are allocated by families&amp;rsquo; stated preference rankings and lottery numbers via market-clearing admission cutoffs; residential location confers no priority advantage at any school.&lt;/p&gt;
&lt;p&gt;Deferred Acceptance with Neighborhood Priority (DN): A version of DA in which families residing in a neighborhood receive priority 1 at their neighborhood school and priority 0 at all other schools, guaranteeing neighborhood residents a seat at their local school before remaining seats are allocated by lottery to non-neighborhood applicants.&lt;/p&gt;
&lt;p&gt;Competitive Equilibrium (CE): A pair of neighborhood choices and a price vector such that (1) each family optimally selects the neighborhood maximizing expected utility net of price (subject to budget), (2) neighborhood capacities are not exceeded, and (3) neighborhoods with excess capacity are priced at zero.&lt;/p&gt;
&lt;p&gt;Underdemanded Neighborhood/School: A neighborhood whose equilibrium price is zero (excess housing supply) or a school whose admission cutoff is zero (excess capacity), meaning any applicant who lists it can gain admission.&lt;/p&gt;
&lt;p&gt;Assignment Externality: The indirect dependence of a family&amp;rsquo;s expected utility on other families&amp;rsquo; neighborhood choices, which operates through the effect of the population distribution across neighborhoods on the family&amp;rsquo;s school assignment probabilities under DA or DN. This externality can preclude competitive equilibrium existence in discrete economies.&lt;/p&gt;
&lt;p&gt;Aggregate Welfare: The utilitarian sum of all families&amp;rsquo; expected utilities from their neighborhood–school assignments, not netting out neighborhood prices (so it includes the welfare of house sellers as passive agents); the comparison criterion for Theorems 3 and 4.&lt;/p&gt;
&lt;p&gt;Signaling Device (neighborhood priority as): The interpretation that neighborhood priorities allow families to credibly reveal high valuations for a school by choosing to live in that school&amp;rsquo;s neighborhood, analogously to signaling instruments in matching markets without monetary transfers; the mechanism through which DN can improve welfare relative to DA.&lt;/p&gt;</description></item><item><title>Screening and Segmenting: A Consumer Surplus Perspective</title><link>https://macropaperwarehouse.com/papers/screening-and-segmenting-a-consumer-surplus-perspective/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/screening-and-segmenting-a-consumer-surplus-perspective/</guid><description>&lt;p&gt;Bergemann, Heumann, and Wang study consumer surplus when a monopolist simultaneously engages in second-degree price discrimination (screening consumers within each market segment through quality-differentiated menus) and third-degree price discrimination (offering different menus across segments). The central question is which market segmentation maximizes aggregate consumer surplus, and under what conditions any segmentation benefits consumers at all.&lt;/p&gt;
&lt;p&gt;The model features a monopolist selling vertically differentiated goods of quality q at strictly convex cost c(q) to a continuum of buyers with privately known values v drawn from an aggregate market m*. A segmentation is any decomposition of m* into submarkets, each receiving a profit-maximizing screening menu. The seller observes segment identity but not individual values. The problem of finding the consumer-optimal segmentation is, on its face, an optimization over distributions of distributions — an infinite-dimensional object.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central methodological contribution is a dramatic dimensional reduction. Theorem 1 establishes that the maximum consumer surplus achievable by any segmentation equals the maximum of the expected local information rent, u(v,h) = h·Q(v−h), over all inverse hazard rate functions h satisfying a majorization constraint h ≺ h* (where h* is the aggregate market&amp;rsquo;s inverse hazard rate). The local information rent captures both the extensive margin (h measures the mass of higher-value buyers per unit of value-v buyers who earn rent from v&amp;rsquo;s allocation) and the intensive margin (Q(v−h) is the quality allocated to value v, decreasing in h as distortion increases). The two margins trade off: raising h widens the base of rent-earning buyers but worsens allocative distortion, making u(v,h) hump-shaped in h with an interior maximizer h̄(v).&lt;/p&gt;
&lt;p&gt;The consumer-optimal segmentation has a striking structural property: every buyer of a given value v receives the same quality in every segment in which they appear, even though the monopolist could in principle offer different qualities across segments. Prices, however, differ across segments for identical buyers. This holds because the optimal segmentation is always a uniform segmentation — one in which the inverse hazard rate hm(v) is equalized across all segments containing value v.&lt;/p&gt;
&lt;p&gt;Under log-concavity of both aggregate demand (equivalently, a non-increasing aggregate inverse hazard rate h*(v), satisfied by uniform, normal, logistic, and exponential distributions) and the supply function Q(v) (equivalent to c&amp;rsquo;&amp;rsquo;&amp;rsquo;(q)q/c&amp;rsquo;&amp;rsquo;(q) ≥ −1, satisfied by all power cost functions), the optimal segmentation takes a transparent two-regime form (Proposition 3): for values below a threshold v̂ where h*(v̂) = h̄(v̂), the inverse hazard rate is reduced to h̄(v) by concentrating low-value buyers; for values above v̂, the aggregate market is left unchanged. The resulting segments are nested convex intervals [vm, v̄], all sharing the same upper bound v̄, with pricing differing across segments only by a quality-independent base price Tm that increases with vm (Theorem 2).&lt;/p&gt;
&lt;p&gt;Corollary 3 delivers the sharpest policy-relevant finding: under log-concave demand and supply, zero segmentation is optimal — any segmentation harms consumers — if and only if h*(v̲) ≤ h̄(v̲) at the lowest value v̲. For iso-elastic costs c(q) = q^γ/γ (γ &amp;gt; 1), this becomes η*(v̲) ≤ γ/(1−γ), where η*(v̲) is the aggregate demand elasticity at the bottom of the distribution. When demand is sufficiently elastic relative to supply, the monopolist&amp;rsquo;s screening already provides near-optimal consumer rents and no redistribution of buyers across segments can improve them. More elastic supply (lower γ) shrinks the set of markets where zero segmentation is optimal (Proposition 4, Zγ&amp;rsquo; ⊂ Zγ for γ&amp;rsquo; &amp;lt; γ); more inelastic supply (higher γ) expands it, and in the limit γ → ∞ zero segmentation is suboptimal only when the aggregate allocation itself is efficient.&lt;/p&gt;
&lt;p&gt;For iso-elastic costs, the optimal segmentation assigns each segment a Pareto distribution below v̂ with shape parameter α = γ/(γ−1), and the aggregate market above v̂ (Corollary 1). Each segment&amp;rsquo;s demand elasticity equals the constant γ/(1−γ) below v̂ and the aggregate elasticity above (Corollary 2): the supply elasticity 1/(γ−1) determines how elastic demand must be made within segments to counteract monopoly distortions. The paper also extends the framework to adverse selection (where seller cost rises with buyer type), with the full reduction to inverse hazard rate optimization preserved when the rate of increase in adverse selection satisfies τ&amp;rsquo;&amp;rsquo;(v)v/τ&amp;rsquo;(v) ∈ [0,1] (Proposition 5).&lt;/p&gt;
&lt;p&gt;Q: What is the local information rent and why is it central?
A: The local information rent is u(v,h) = h·Q(v−h), where h is the inverse hazard rate at value v and Q is the inverse marginal cost (supply) function (equation 9). The factor h captures the extensive margin — the mass of higher-value buyers per unit of value-v buyers who earn rent from v&amp;rsquo;s quality allocation — while Q(v−h) captures the intensive margin — the quality allocated to v via the virtual value v−h, which falls as h rises. Because u is hump-shaped in h, there is an interior rent-maximizing inverse hazard rate h̄(v) for each value. Lemma 2 establishes that in every regular market, total consumer surplus equals the integral of u(v,hm(v))dFm(v), so the entire segmentation problem reduces to choosing h.&lt;/p&gt;
&lt;p&gt;Q: What is the majorization constraint and what does it exactly characterize?
A: The majorization constraint h ≺ h* requires that for all v ∈ V, the integral from v̲ to v of [h*(t) − h(t)]dF*(t) ≥ 0 (equation 18). Proposition 1 shows that for any segmentation σ, the average inverse hazard rate hσ must satisfy hσ ≺ h*. A partial converse holds: given h ≺ h* under regularity conditions, a uniform segmentation implementing h exists. The constraint is strictly weaker than the pointwise bound h ≤ h* available in the binary case because it permits h to exceed h* at some values (dilution) provided it falls sufficiently below h* at higher values (concentration) to maintain the cumulative inequality.&lt;/p&gt;
&lt;p&gt;Q: What are concentration and dilution, and how do they interact?
A: Concentration gathers buyers of a given value into fewer segments, lowering their inverse hazard rate below h*(v). Dilution raises the inverse hazard rate of value v by placing v in segments where immediately higher values are missing — creating gaps in the support — thereby increasing the support increment Δm(v) and hence hm(v) (equation 12). Dilution at v requires that values just above v have already been concentrated elsewhere to create the gaps; concentration thus enables dilution, linking the two tools. With only binary values, only concentration is available; with a continuum, dilution can strictly expand achievable consumer surplus by permitting h to exceed h* at low values.&lt;/p&gt;
&lt;p&gt;Q: What does Theorem 1 establish and why is it a major simplification?
A: Theorem 1 states that the maximum consumer surplus over all segmentations of m* equals the maximum of ∫u(v,h(v))dF*(v) over all h satisfying the majorization constraint h ≺ h* (equation 25). The original problem maximizes over distributions on the infinite-dimensional space of probability measures on V; the reduced problem is a standard optimal control problem over a single real-valued function h: V → R+, amenable to Karush-Kuhn-Tucker methods and often yielding closed-form solutions. Furthermore, every optimal segmentation is a uniform segmentation implementing some h solving the reduced problem, so the reduction is exact. The optimal h always satisfies regularity (h&amp;rsquo;(v) ≤ 1), meaning v − h(v) is non-decreasing, which ensures segments in the optimal uniform segmentation are themselves regular.&lt;/p&gt;
&lt;p&gt;Q: What is the structural property of consumer-optimal segmentations regarding quality across segments?
A: In any consumer-optimal segmentation, every buyer of value v receives the same quality in every segment in which they appear (the uniform quality property following from Theorem 1). This holds because the optimal inverse hazard rate h(v) is equalized across segments (uniform segmentation), and quality in a regular market is qm(v) = Q(v − hm(v)), which depends on the market only through hm(v). Prices, however, differ across segments for identical buyers: the monopolist does not redesign its product line across segments but adjusts only quality-independent base prices. This is counterintuitive because nothing in the monopolist&amp;rsquo;s problem requires quality uniformity — it emerges purely from the consumer surplus maximization.&lt;/p&gt;
&lt;p&gt;Q: What conditions guarantee the simple two-regime convex segmentation structure?
A: Log-concavity of aggregate demand — equivalently, h*(v) non-increasing in v, satisfied by uniform, normal, logistic, and exponential families — and log-concavity of the supply function Q(v), equivalent to c&amp;rsquo;&amp;rsquo;&amp;rsquo;(q)q/c&amp;rsquo;&amp;rsquo;(q) ≥ −1, together guarantee the structure of Proposition 3 and Theorem 2. Under these conditions, h̄(v) is strictly increasing in v (log-concave supply) while h*(v) is decreasing (log-concave demand), so they cross exactly once at v̂. The optimal h equals h̄(v) below v̂ and h*(v) above. Only concentration (not dilution) is ever used because log-concave supply makes u concave in h and log-concave demand ensures monotone ordering of marginal local information rents across values, so the binding majorization constraint becomes the pointwise constraint at the bottom.&lt;/p&gt;
&lt;p&gt;Q: What is the structure of convex segmentations and their menus (Theorem 2)?
A: Under log-concave demand and supply, the consumer-optimal segmentation consists of segments m with absolutely continuous supports [vm, v̄] for varying lower bounds vm ≤ v̂, all sharing the same upper bound v̄ (Part 1 of Theorem 2). Pricing across these segments differs only by a quality-independent base price Tm that is increasing in vm — more concentrated segments (lower vm) face a lower base price and carry higher information rents — while the quality menu p(q) is uniform across segments (Part 2). Equivalently, the monopolist offers nested menus all sharing the same efficient upper bound quality Q(v̄), differing in how far down the menu is extended and in the price of the lowest offered quality.&lt;/p&gt;
&lt;p&gt;Q: What do Corollaries 1 and 2 say for iso-elastic cost functions?
A: With iso-elastic cost c(q) = q^γ/γ (γ &amp;gt; 1) and log-concave demand, the consumer-optimal segmentation assigns each segment a Pareto distribution with shape parameter α = γ/(γ−1) below the threshold v̂, and the aggregate distribution above v̂ (Corollary 1). This delivers a constant demand elasticity of γ/(1−γ) within each segment below v̂, matching the aggregate market&amp;rsquo;s elasticity above v̂ (Corollary 2). The Pareto shape — and thus the degree of demand manipulation — is determined entirely by the supply elasticity 1/(γ−1): more elastic supply (lower γ) mandates a higher shape parameter α and more elastic within-segment demand to counteract larger monopoly distortions.&lt;/p&gt;
&lt;p&gt;Q: When is zero segmentation optimal, and what is the precise elasticity condition?
A: Under log-concave demand and supply, zero segmentation is optimal if and only if h*(v̲) ≤ h̄(v̲) — the aggregate inverse hazard rate at the lowest value already lies at or below its rent-maximizing level (Corollary 3). Since h* is decreasing under log-concavity, this condition at v̲ implies it holds everywhere, so the designer cannot improve rents at any value. For iso-elastic cost, the condition becomes η*(v̲) ≤ γ/(1−γ): aggregate demand elasticity at the bottom must be at least as large in magnitude as one plus the supply elasticity. For a Pareto aggregate distribution with shape parameter α, zero segmentation is optimal when α ≥ γ/(γ−1).&lt;/p&gt;
&lt;p&gt;Q: How does supply elasticity govern the scope for beneficial segmentation (Proposition 4)?
A: Proposition 4 establishes that for iso-elastic cost, the set of markets Zγ where zero segmentation is optimal is strictly nested increasing in γ: for any γ&amp;rsquo; &amp;lt; γ, Zγ&amp;rsquo; ⊂ Zγ. More elastic supply (lower γ) amplifies monopoly distortions and enlarges the set of markets where segmentation benefits consumers; more inelastic supply (higher γ) makes quality provision rigid, reducing segmentation&amp;rsquo;s scope. In the limit γ → ∞ (approaching unit demand), zero segmentation is suboptimal only if the aggregate allocation is already efficient — but this limit also means very inelastic supply, so the potential benefits from segmentation have shrunk toward zero simultaneously.&lt;/p&gt;
&lt;p&gt;Q: How does this paper compare to and depart from Haghpanah and Siegel (2023)?
A: Haghpanah and Siegel (2023) showed that in generic markets with a finite number of goods, some segmentation always improves consumer surplus relative to the aggregate market. This paper shows that with a continuum of qualities, this universal improvement result fails: Corollary 3 identifies a large, non-degenerate class of markets satisfying Haghpanah and Siegel&amp;rsquo;s genericity conditions where zero segmentation is optimal for consumers. The discrepancy arises because the log-concave supply condition (equation 27) is violated in finite-good environments — Haghpanah and Siegel explicitly provide a counterexample showing their result fails with a continuum of goods. This paper characterizes exactly when the finite-good gains vanish as the quality space becomes continuous, providing the precise elasticity conditions.&lt;/p&gt;
&lt;p&gt;Q: What changes and what is preserved when extending to adverse selection?
A: In the adverse selection specification, buyer net value v is private and the seller&amp;rsquo;s cost per unit is τ(v) − v, increasing in v when τ&amp;rsquo;(v) &amp;gt; 1. The local information rent becomes w(v,h) = u(v, τ&amp;rsquo;(v)·h), where adverse selection enters by amplifying the effective inverse hazard rate by τ&amp;rsquo;(v) (equation 40). Proposition 5 confirms that the full reduction to majorization-constrained optimization over h goes through, and the optimal segmentation features more elastic within-segment demand when adverse selection is more severe. The reduction requires τ&amp;rsquo;&amp;rsquo;(v)v/τ&amp;rsquo;(v) ∈ [0,1] (equation 39), bounding the rate of increase of adverse selection severity; if this fails, the key inequality (35) driving the optimality of uniform segmentations may break down.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for regulation of price discrimination?
A: The results imply that blanket restrictions on market segmentation may harm consumers by preventing welfare-enhancing price discrimination in markets where demand is sufficiently inelastic relative to supply (the region outside the zero-segmentation condition). In markets satisfying η*(v̲) ≤ γ/(1−γ), allowing segmentation yields no consumer benefit, so restrictions are harmless to consumers. The key policy-relevant primitives are demand and supply elasticities, which are in principle measurable. The findings also imply that the welfare effects of data-driven personalized pricing depend critically on the interaction between consumer heterogeneity (demand shape) and cost structure (supply elasticity), rather than on the degree of segmentation per se.&lt;/p&gt;
&lt;p&gt;Local information rent: u(v,h) = h·Q(v−h), the total consumer surplus generated per unit mass of buyers at value v as a function of the inverse hazard rate h. The factor h is the extensive margin (mass of higher-value buyers per unit of value-v buyers who earn rent) and Q(v−h) is the intensive margin (quality allocated to v via the virtual value v−h). It is hump-shaped in h with interior maximizer h̄(v), and the segmentation problem reduces entirely to maximizing its expectation.&lt;/p&gt;
&lt;p&gt;Inverse hazard rate hm(v): in a continuous market, (1−Fm(v))/fm(v); generalized to accommodate atoms and support gaps (equation 12). It simultaneously determines the virtual value ϕm(v) = v − hm(v) (governing allocative distortion) and the scaled mass of higher-value buyers per unit of value-v buyers (governing the extensive margin of rents). The dual role requires both a continuum of qualities and endogenous segmentation.&lt;/p&gt;
&lt;p&gt;Majorization constraint h ≺ h*: for all v, the cumulative integral of [h*(t)−h(t)]dF*(t) from v̲ to v is non-negative (equation 18). It is the exact characterization of inverse hazard rate functions achievable by some segmentation of m*, strictly weaker than the pointwise bound h ≤ h* of the binary case because it permits h to exceed h* at some values (dilution) provided it falls sufficiently below h* at higher values (concentration).&lt;/p&gt;
&lt;p&gt;Uniform segmentation: a segmentation in which every buyer of value v faces the same inverse hazard rate hm(v) = hσ(v) in every segment containing v (equation 22). Theorem 1 establishes that every consumer-optimal segmentation is uniform; this class converts the double integral over segments and values into a single integral against F*, enabling the dimensional reduction of Theorem 1.&lt;/p&gt;
&lt;p&gt;Concentration and dilution: the two tools by which segmentation modifies inverse hazard rates. Concentration gathers buyers of a given value into fewer segments, lowering hm(v) below h*(v). Dilution raises hm(v) above h*(v) by placing value v in segments where immediately higher values are absent, creating support gaps. Dilution requires prior concentration of adjacent higher values, so the two tools are linked; under log-concave demand and supply, only concentration is used in the optimal segmentation.&lt;/p&gt;
&lt;p&gt;Convex segmentation: a segmentation whose constituent segments have nested convex interval supports [vm, v̄] all sharing the same upper bound v̄, with varying lower bounds vm. This is the consumer-optimal structure under log-concave demand and supply (Theorem 2). For iso-elastic cost, each segment below the threshold v̂ corresponds to a Pareto distribution with shape parameter α = γ/(γ−1) determined by cost convexity γ.&lt;/p&gt;
&lt;p&gt;Zero-segmentation condition: the condition under which no segmentation can improve consumer surplus over the aggregate market. Under log-concave demand and supply with iso-elastic cost c(q) = q^γ/γ, it is η*(v̲) ≤ γ/(1−γ): aggregate demand elasticity at the lowest value must be at least as large in magnitude as one plus the supply elasticity (Corollary 3). When this holds, any redistribution of buyers across segments strictly reduces consumer surplus.&lt;/p&gt;</description></item><item><title>Search Frictions and Product Design in the Municipal Bond Market</title><link>https://macropaperwarehouse.com/papers/search-frictions-and-product-design-in-the-municipal-bond-market/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/search-frictions-and-product-design-in-the-municipal-bond-market/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper investigates whether intermediaries in the U.S. municipal bond market strategically exploit product design to increase search frictions and, through that channel, capture rents. Specifically, it asks: do underwriters who negotiate bond design with local governments have an incentive to add nonstandard provisions that raise their own competitive advantage in subsequent secondary-market intermediation, even at the expense of issuing governments and their taxpayers?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study focuses on tax-exempt general obligation and revenue bonds issued via negotiated sales by local governments (counties, cities, school districts, and other special-purpose governments) from 2010 to 2013, tracking all secondary-market transactions through 2014. The final sample comprises 13,118 bond issues with a total face value of $266.9 billion. Bond attribute data come from Mergent; transaction data come from the Municipal Securities Rulemaking Board (MSRB). Issuer financial health, demographics, and economic conditions are drawn from the Census and American Community Survey; state revolving-door regulations are compiled from the National Conference of State Legislatures database. Structural estimation uses a subsample of 927 bonds concentrated in the five states that enacted revolving-door regulations during the study period and neighboring border counties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A core empirical challenge is that unobserved factors may jointly determine bond complexity and market outcomes. The authors exploit panel variation in state-level revolving-door regulations — laws that restrict former public officials from taking employment at firms regulated by their former agencies for a &amp;ldquo;cool-off&amp;rdquo; period — as an instrument for bond complexity. Between 2010 and 2013, three states (Arkansas 2011, Indiana 2010, Maine 2013) enacted new legislation covering state officials, and two states (New Mexico 2011, Virginia 2011) extended existing regulations to cover local officials. A difference-in-differences regression, with county and year-month fixed effects, shows that adopting revolving-door regulations covering local officials reduces bond complexity by 6% on average (coefficient −0.064, p &amp;lt; 0.01). Regulations targeting only state officials, who are not directly involved in bond negotiations, yield smaller and statistically fragile effects. Placebo checks on auctioned bonds, where underwriters cannot influence design, show no effect, and there is no evidence of pre-existing trends in complexity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Flexibility vs. liquidity trade-off&lt;/strong&gt;: A 1% increase in the bond complexity index lowers the number of negative credit-watch events (a proxy for default risk) by 0.002, a 3% decrease relative to the mean of 0.074, confirming that nonstandard provisions provide genuine financial flexibility. However, increasing the complexity index from its mean (1.46) to the 75th percentile (1.69) raises the intermediation spread — the cost for an investor to buy and immediately sell a bond — by 17 basis points (a 14% increase over the average of 120 basis points), confirming that complexity raises trading frictions. For context, the average intermediation spread of 120 basis points is large relative to the 30–60 basis point bid-ask spread of corporate bonds in 2010–2013.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Underwriter incentive to complicate&lt;/strong&gt;: Increasing complexity from the mean to the 75th percentile raises the underwriter&amp;rsquo;s market share in secondary-market intermediation by 1.4 percentage points, an 11% increase over the average underwriter share of 12.2%. The underwriter&amp;rsquo;s gross profits from intermediation also increase with complexity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Structural estimates — search costs&lt;/strong&gt;: For a median bond, average dealer search costs amount to 10% of monthly gross profits ($2,625 per month). The underwriter&amp;rsquo;s exclusive initial sales generate a client network that lowers its effective search costs by 21% relative to an average dealer, more than offsetting its initial geographical disadvantage (for 72% of bonds, the underwriter&amp;rsquo;s baseline search cost exceeds the median dealer&amp;rsquo;s). Nonstandard provisions increase both the initial search cost parameter (φ₀) and the network-effect parameter (φ₁): a 1% increase in the complexity index increases φ₀ by 3.79% and φ₁ by 1.66%, implying complex bonds raise search costs broadly but amplify the advantage of a large client network — a position the underwriter occupies via exclusive primary-market sales.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Investor demand&lt;/strong&gt;: Nonstandard provisions do not substantially change the average investor valuation but substantially increase the dispersion: the standard deviation of investor valuations is 0.003 for simple bonds and 0.013 for complex bonds, consistent with complex bonds being niche products that investors &amp;ldquo;either love or loathe.&amp;rdquo;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Government cost&lt;/strong&gt;: The marginal cost of paying debt obligations is convex in complexity, reaching a minimum at an interior level of provisions; the government&amp;rsquo;s marginal financial cost increases by 42% when a median bond is stripped of all nonstandard provisions, reflecting the value of payment flexibility.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Conflict of interest&lt;/strong&gt;: The estimated weight that government officials place on underwriter payoffs in the absence of revolving-door regulations (ψ₀) is 0.34, implying the underwriter&amp;rsquo;s value accounts for 6.7% of the government official&amp;rsquo;s payoff under the median unregulated issuer. With revolving-door regulations in place, ψ₁ is essentially zero.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Counterfactual Policies (on representative bond: face value $6.45 million, maturity 7.7 years)&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Standardization mandate&lt;/strong&gt; (ban on all nonstandard provisions): The coupon rate falls from 2.81% to 2.16% (−23%), average dealer search costs fall 47%, and investor surplus rises 13.3%. However, the marginal financial cost (c₀) rises by 41% (from 0.615 to 0.871), so the issuer&amp;rsquo;s total debt payment cost — principal plus interest, weighted by c₀ — rises by 35%, from $5.13 million to $6.96 million. The standardization policy harms issuers even while saving 7.8% of raw principal-and-interest payments ($8,349K to $7,997K), because the loss of flexibility more than offsets the liquidity gain.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Issuer-driven design&lt;/strong&gt; (issuer sets complexity to minimize its own debt payment cost, then negotiates the coupon): Complexity falls 19% to 1.14, the interest rate falls to 2.37%, total issuer cost falls 1.5%, investor surplus rises 6%, and the underwriter&amp;rsquo;s secondary-market payoff falls 19.9%.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Underwriter intermediation ban&lt;/strong&gt; (underwriter excluded from trading after six months): Complexity falls 5.7% to 1.33, the coupon falls to 2.59%, issuer cost falls 1.5%, but investor surplus falls 1.84% and even other dealers are worse off by 3.97%, because the underwriter&amp;rsquo;s information on primary-market buyers is lost, offsetting the liquidity gains from lower complexity.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the five nonstandard bond features tracked as proxies for complexity, and how are they combined into a single index?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following Harris and Piwowar (2006), the paper focuses on five features that are particularly difficult for investors to price: (i) multiple or serial bonds per issue (as opposed to a single bond), (ii) call provisions allowing early redemption, (iii) sinking fund provisions requiring periodic debt retirement, (iv) nonstandard interest payment frequencies (other than semiannual), and (v) variable or floating interest rates. The complexity index is constructed as the simple average of the latter four provisions across bonds within an issue, plus a dummy for whether the issue contains multiple bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why do revolving-door regulations that target local officials reduce complexity more than those targeting state officials?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;State officials are not directly involved in bond origination negotiations — they can only indirectly influence local governments through budget allocations. Local officials negotiate directly with underwriters and are thus the proximate counterparties whose incentives the regulations alter. Accordingly, revolving-door regulations covering local officials reduce complexity by 6% (coefficient −0.064, p &amp;lt; 0.01 with full controls), whereas regulations targeting only state officials produce a smaller effect (approximately 2%) that loses statistical significance once issuer financial health controls are added.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the paper validate that revolving-door regulations are a valid instrument for bond complexity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper provides three pieces of evidence. First, the regulations have no effect on the credit ratings of bonds issued prior to their enactment, on the annual amount of bond issuance, or on the maturity length and sale method conditional on issuance — confirming the regulations do not alter governments&amp;rsquo; risk management or underlying financing needs. Second, the regulations have no effect on complexity for competitively auctioned bonds, where underwriters cannot influence design — a direct placebo test. Third, a pre-trend analysis (Figure A1) finds no differential trend in complexity in states that subsequently adopted regulations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the mechanism by which underwriters benefit from adding nonstandard provisions, and why does this advantage not diminish over time?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Underwriters purchase and distribute the entire bond issue at origination, giving them an exclusive network of investors who initially purchased the bonds. In the secondary market, knowing who owns a bond allows the underwriter to locate buyers and sellers with lower search effort. For complex bonds, this advantage is amplified: nonstandard provisions make investor education and persuasion more costly, increasing the value of pre-existing client relationships. The network-effect parameter φ₁ — which governs how rapidly search costs fall as a dealer&amp;rsquo;s cumulative trades grow — itself rises with complexity (by 1.66% per 1% increase in the complexity index), so the underwriter&amp;rsquo;s head start in client network accumulation translates into a persistently larger cost advantage precisely for the most complex bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How large is the underwriter&amp;rsquo;s search cost advantage in equilibrium, and what drives it?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At the equilibrium meeting rate, the underwriter&amp;rsquo;s effective search cost of maintaining a given meeting rate is 21% lower than that of an average dealer. This advantage arises despite the underwriter having a higher initial search cost type (φ₀ of $3,609 vs. $3,216 for the average dealer at λ = 1), because for 72% of bonds the underwriter has less local trading experience than the median dealer. The advantage is entirely driven by the underwriter&amp;rsquo;s network: its exp(−φ₁ log(b)) cost discount factor averages 0.34, 32% lower than the average dealer&amp;rsquo;s 0.50. The underwriter meets investors 20% more frequently than the average dealer (0.23 vs. 0.19 per month), despite higher absolute search expenditures ($3,045 vs. $2,625 per month).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does bond complexity affect investor demand — mean or dispersion of valuations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Structural estimates show that increasing the complexity index by 1% increases the standard deviation of investor valuations (γ₂) by 4.60% but has no statistically significant effect on the mean valuation (coefficient −0.085, standard error 0.561). This pattern is consistent with complex bonds being niche products — they attract a subset of investors with specific preferences for the embedded features (e.g., certain tax or cash-flow attributes), while being unappealing to most investors. The standard deviation of valuations is 0.003 for a low-complexity bond (25th percentile) and 0.013 for a high-complexity bond (75th percentile).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the structural estimate of ψ₀ imply about the degree of collusion between government officials and underwriters?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The estimated collusion parameter without revolving-door regulations (ψ₀ = 0.34) implies that, for the median unregulated issuing government, the underwriter&amp;rsquo;s value from secondary-market trading accounts for 6.7% of the government official&amp;rsquo;s objective function. This is a substantial weight: it means officials act partly as agents for the underwriter rather than purely for taxpayers. With revolving-door regulations (ψ₁ ≈ 0), this collusive weight is essentially eliminated, explaining the empirical reduction in complexity found in Table 2.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the effects of a full standardization mandate on each class of market participant, and why does the issuer lose overall despite paying a lower coupon?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under standardization, the coupon falls 23% (from 2.81% to 2.16%) and the raw principal-plus-interest payment falls 7.8% (from $8,349K to $7,997K). However, the marginal financial cost c₀ rises 41% (from 0.615 to 0.871), reflecting the loss of payment flexibility previously provided by call provisions and other features; the total issuer cost — c₀A(1 + rT) — rises by 35% (from $5.13 million to $6.96 million). Investors gain 13.3% in surplus because they value liquidity and, on average, do not value nonstandard features. The underwriter loses 36.6% of its secondary-market value while other dealers gain 36.1%, as standardization erodes the underwriter&amp;rsquo;s network advantage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Why does the issuer-driven design scenario outperform standardization in terms of total issuer cost, even though complexity does not fall to zero?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under issuer-driven design, the government minimizes its total cost of debt payment c₀A(1 + rT), accounting for both the flexibility value of provisions and their effect on the negotiated coupon. The optimal complexity index is 1.14 — positive, but 19% below the current baseline of 1.41 — because some provisions genuinely lower c₀ by allowing flexible debt service. The cost of search frictions (and hence the liquidity premium embedded in the coupon) falls 32% and the negotiated coupon falls to 2.37%, sufficient to reduce total issuer cost by 1.5%. By contrast, full standardization imposes a complexity of zero, which overshoots: c₀ rises more than the coupon savings compensate, increasing total costs by 35%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the net welfare effects of the underwriter intermediation ban, and why is investor surplus negative despite lower complexity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The ban reduces complexity by 5.7%, lowering the coupon to 2.59% and reducing issuer costs by 1.5%. However, the underwriter&amp;rsquo;s client network — built during exclusive initial sales — is a productive resource that improves match quality in the secondary market; banning the underwriter from trading after six months wastes this information. Average dealer search costs rise 1.2% and the meeting rate falls 1.7%, net of the complexity reduction. Investors face bonds with lower coupons and higher effective search frictions, so their surplus falls 1.84%. Non-underwriter dealers also lose 3.97% because lower coupons reduce the rents extractable from intermediation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How is the structural model estimated, and what role do revolving-door regulations play in the estimation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Estimation proceeds in three steps. In Step 1, bond-specific trading market parameters (investor demand, dealer search costs, meeting rates, bargaining parameters) are recovered separately for each bond by minimizing squared differences between observed and simulated trading prices, quantities, and transaction timing. In Step 2, IV regressions using revolving-door regulations and their interactions with county/state attributes as instruments for endogenous complexity map Step 1 parameters to bond attributes, addressing the endogeneity of complexity in determining search costs and investor demand. In Step 3, GMM moment conditions derived from Nash bargaining first-order conditions for the equilibrium complexity and coupon rate identify government preference parameters (θ_c, ψ₀, ψ₁), using the orthogonality condition that unobserved financing cost shocks are mean-zero conditional on observed attributes, regulations, and bond supply from neighboring counties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Does the underwriting market show signs of concentration that might amplify the conflict-of-interest problem?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The mean state-level Herfindahl-Hirschman Index (HHI) for underwriting is 0.12, with the top three firms covering 45% of the market on average. For smaller deals (under $10 million), concentration is markedly higher: mean HHI of 0.24 and top three firms covering 64% of the market. Repeat relationships are common — 41% of bonds issued in 2011–2017 were underwritten by a firm that had underwritten a prior bond for the same issuer within five years — reflecting both informational advantages of local presence and potentially entrenched relationships that may increase government officials&amp;rsquo; susceptibility to underwriter influence.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Complexity index (nonstandard provisions)&lt;/strong&gt;: A bond-level measure computed as the simple average, across bonds within an issue, of four nonstandard features — call provisions, sinking fund provisions, nonstandard interest payment frequency, and variable/floating interest rates — plus a dummy for whether the issue contains multiple bonds. Used as the primary measure of bond complexity in all regressions and the structural model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Revolving-door regulation&lt;/strong&gt;: A state-level law restricting former public officials or employees from engaging in lobbying or taking employment at regulated firms for a specified &amp;ldquo;cool-off&amp;rdquo; period (typically one to two years) after leaving office. The paper uses the presence and scope of such regulations (whether they cover state officials, local officials, or both) as a source of exogenous variation in government officials&amp;rsquo; incentives to align with underwriter interests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intermediation spread&lt;/strong&gt;: The logarithm of the average dealer-to-investor sale price minus the logarithm of the average dealer-from-investor purchase price for a given bond. Used as the empirical measure of trading frictions; the sample average is 120 basis points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Network effect in search (φ₁)&lt;/strong&gt;: The parameter governing how a dealer&amp;rsquo;s cumulative prior trades with investors in a given bond reduce its cost of meeting new investors for that bond. A higher φ₁ means a larger client network translates into steeper cost savings. The paper estimates that φ₁ itself increases with bond complexity, so complex bonds amplify the advantage of dealers (especially the underwriter) who accumulate large client networks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Marginal cost of debt payment (c₀)&lt;/strong&gt;: A bond- and issuer-specific parameter capturing the effective cost to the government of repaying each dollar of principal and interest, net of the flexibility benefits provided by nonstandard provisions. Normalized to one for a bond with zero nonstandard provisions at average issuer characteristics; estimated to be convex in complexity with an interior minimum, implying some nonstandard provisions are beneficial from the government&amp;rsquo;s perspective.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Collusion weight (ψ)&lt;/strong&gt;: The weight a government official places on the underwriter&amp;rsquo;s secondary-market value from trading when negotiating bond design. Estimated at ψ₀ = 0.34 in the absence of revolving-door regulations (implying the underwriter&amp;rsquo;s interest accounts for 6.7% of the official&amp;rsquo;s objective) and at ψ₁ ≈ 0 when such regulations are present.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Underwriter dual role&lt;/strong&gt;: The institutional arrangement in which the same investment bank (i) negotiates and purchases the entire bond from the issuing government at origination, and (ii) subsequently acts as a dealer in the bond&amp;rsquo;s secondary market. This dual role creates an incentive to design complex bonds that strengthen the underwriter&amp;rsquo;s competitive advantage in secondary intermediation via network effects in search.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Issuer-driven design&lt;/strong&gt;: A counterfactual policy scenario in which the government sets the complexity level to minimize its total cost of debt payment — accounting for both the flexibility value of provisions and the anticipated effect on the negotiated coupon rate — before bargaining with the underwriter only over the coupon. This policy allows some nonstandard provisions (complexity index 1.14 vs. baseline 1.41) and reduces total issuer cost by 1.5% relative to the baseline.&lt;/p&gt;</description></item><item><title>Selection in Surveys: Using Randomized Incentives to Detect and Account for Nonresponse Bias</title><link>https://macropaperwarehouse.com/papers/selection-in-surveys-using-randomized-incentives-to-detect-and-account-for-nonresponse-bias/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/selection-in-surveys-using-randomized-incentives-to-detect-and-account-for-nonresponse-bias/</guid><description>&lt;p&gt;This paper addresses nonresponse bias in surveys — the distortion that arises when survey participants differ systematically from nonparticipants in ways that correlate with the survey&amp;rsquo;s outcomes of interest. The authors develop and apply methods to detect and correct for nonresponse bias using randomized financial incentives embedded in the survey design itself.&lt;/p&gt;
&lt;p&gt;The empirical application is the &amp;ldquo;Norge i Koronatid&amp;rdquo; (NiK) survey, conducted by Statistics Norway in April–May 2020 to study the immediate labor market consequences of Norway&amp;rsquo;s COVID-19 lockdown. The NiK survey has two features that make it unusually well-suited for studying nonresponse bias: (1) it is linked to full-population administrative data, providing a verifiable ground truth for the entire Norwegian adult population; and (2) survey invitees were randomly assigned to one of five financial incentive levels (0%, 1%, 5%, 7%, or 10% probability of receiving a 1,000 NOK prepaid card), generating exogenous variation in participation rates. The final sample of 10,000 randomly drawn adults achieved a 47.4% participation rate.&lt;/p&gt;
&lt;p&gt;The administrative data reveal large, statistically significant nonresponse bias across all six labor market outcomes examined. Participants in the high-incentive arm had on average roughly 930 USD (30%) higher monthly pre-lockdown earnings than the full population, and were 10.8 percentage points (19%) more likely to be employed. Standard corrections for selection on observable characteristics — including propensity-score reweighting on age, gender, immigration status, schooling, and municipality-level variables — fail to eliminate this bias. For the high-incentive arm, reweighting on individual characteristics more than doubles the nonresponse bias for earnings loss and employment loss measures relative to unweighted estimates, meaning that observable-based corrections can make things worse, not better.&lt;/p&gt;
&lt;p&gt;A key finding is that higher participation rates do not imply lower nonresponse bias. The high-incentive arm, with the highest response rate, exhibited larger nonresponse bias than the no-incentive arm. Marginal participants — those induced to respond by higher incentives — had much stronger pre-lockdown labor market attachment (average earnings of 6,806 USD/month vs. 3,666 USD/month for inframarginal participants) but suffered substantially greater lockdown impacts: 32.3% became furloughed or unemployed versus only 3.4% of inframarginal participants.&lt;/p&gt;
&lt;p&gt;Existing methods designed to handle selection on unobservables also perform poorly. Worst-case (Manski) bounds contain the truth but are very wide: employment before lockdown is bounded between 30% and 83% against a true value of 57%. Monotone response selection assumptions produce bounds that do not contain the population quantities for any of the six outcomes, because the marginal survey response function is empirically non-monotone. A Heckman parametric selection model produces point estimates inconsistent with the ground truth (e.g., estimating 51% pre-lockdown employment against the true 57%).&lt;/p&gt;
&lt;p&gt;Investigation of participation timing reveals that reminder emails attract a qualitatively different type of respondent than incentives do. This motivates the paper&amp;rsquo;s central methodological contribution: a two-dimensional participation model that distinguishes &amp;ldquo;active&amp;rdquo; nonparticipants (those who received the invitation and chose not to respond because the incentive was insufficient) from &amp;ldquo;passive&amp;rdquo; nonparticipants (those who never received or attended to the invitation but who may respond to reminders). These two groups have labor market outcomes that differ from participants in opposite directions, which is why single-dimensional monotone selection models fail. The two-dimensional model, exploiting both incentive randomization and the timing of responses, produces bounds that contain or are closer to the ground truth than all other methods examined — for example, bounding pre-lockdown employment at [48%, 63%] around the true value of 57%.&lt;/p&gt;
&lt;p&gt;The paper is scoped to a high-quality, randomly sampled, administrative-data-linked survey conducted during a period of acute economic disruption. The authors note the patterns observed may differ outside crisis periods, though the methods developed apply generally.&lt;/p&gt;
&lt;p&gt;Q: How prevalent is nonresponse bias discussion in economics research, and what methods do researchers currently use?
A: A systematic review of survey-based papers in top-five economics journals from January 2015 to August 2020 found that nearly half of studies omit any discussion of nonresponse bias despite often high nonresponse rates. Among studies using researcher-collected survey data, the average nonresponse rate is 50%; rates reach as high as 87%. When researchers do address nonresponse, 47% of own-survey papers compare sample means to a reference population and 16% apply reweighting on observables; virtually none use methods that address selection on unobservables.&lt;/p&gt;
&lt;p&gt;Q: How was the NiK survey designed to enable testing for nonresponse bias?
A: The 10,000-person random sample was assigned to five incentive groups with probabilities of receiving a 1,000 NOK credit card set at 0%, 1%, 5%, 7%, and 10%, yielding expected payoffs ranging from 1.1 USD to 11 USD. Because group assignment was random, the groups are probabilistically identical ex ante, so differences in average responses across groups — given an exclusion restriction that incentives do not directly affect answers — provide a direct test for nonresponse bias. Participation rates across the aggregated no/low/high incentive groups were 45.7%, approximately 47.6%, and approximately 51.7%, respectively; the joint test of equal participation across groups rejects with p-value &amp;lt; 0.01.&lt;/p&gt;
&lt;p&gt;Q: How large is nonresponse bias in the NiK survey as measured against the administrative ground truth?
A: Across all six administrative outcomes and all three incentive arms, joint tests of no nonresponse bias are rejected with p-values &amp;lt; 0.01. High-incentive arm participants had pre-lockdown monthly earnings roughly 930 USD (30%) above the population mean, and were 10.8 percentage points (19%) more likely to be employed. The high-incentive arm&amp;rsquo;s estimated post-lockdown employment rate of 58% overstates the true rate by 8 percentage points; a researcher comparing this to the true pre-lockdown rate of 57% would erroneously conclude employment was essentially unchanged, when in fact it dropped 7 percentage points.&lt;/p&gt;
&lt;p&gt;Q: Does correcting for observable characteristics remove nonresponse bias?
A: No. After reweighting by propensity scores constructed from age, gender, immigration status, schooling, and municipality or individual-level characteristics, joint tests of zero remaining nonresponse bias are rejected with p-values &amp;lt; 0.01 for each specification and incentive arm. In some cases, reweighting on individual characteristics more than doubles the nonresponse bias — for example, for earnings loss and employment loss measures in the high-incentive arm — meaning that standard observable-based corrections can amplify rather than reduce bias. Robustness checks using machine learning algorithms, class weights, imputation, and richer covariate sets including lagged outcomes yield the same conclusion.&lt;/p&gt;
&lt;p&gt;Q: Does nonresponse bias in survey responses (not just administrative outcomes) differ across incentive arms?
A: Yes. For survey-elicited outcomes, average responses differ significantly across incentive arms, with all joint equality tests rejected at p &amp;lt; 0.1. For example, 10.4% of high-incentive participants reported applying for UI benefits versus 7.5% in the no-incentive group. Estimated UI expenditure as a share of Norway&amp;rsquo;s 2020 social insurance budget varies from 13.2% (no-incentive arm) to 18.4% (high-incentive arm), illustrating the policy stakes.&lt;/p&gt;
&lt;p&gt;Q: Do higher response rates reduce nonresponse bias?
A: Not in this survey. The no-incentive arm, with the lowest participation rate (45.7%), exhibits smaller nonresponse bias than the high-incentive arm (51.7% participation). This finding contradicts standard guidance from the U.S. Office of Management and Budget and J-PAL research guidelines, which equate higher response rates with lower bias risk. The authors note that J-PAL has subsequently updated its guidance in response to this paper&amp;rsquo;s findings.&lt;/p&gt;
&lt;p&gt;Q: How do marginal participants (induced by higher incentives) differ from inframarginal participants?
A: Marginal participants — those who participate only under high incentives but not without them — had average pre-lockdown monthly earnings of 6,806 USD versus 3,666 USD for inframarginal participants (p-value 0.08), indicating much stronger pre-lockdown labor market attachment. Post-lockdown, both groups had similar earnings (approximately 3,600–3,800 USD/month). Consistent with this, 32.3% of marginal participants became furloughed or unemployed after the lockdown versus 3.4% of inframarginal participants. Notably, marginal and inframarginal participants do not differ significantly on observable background characteristics (age, gender, immigrant status, schooling; joint test p-value 0.70), confirming that selection is on unobservables.&lt;/p&gt;
&lt;p&gt;Q: Why do existing methods designed to handle selection on unobservables fail?
A: Worst-case (Manski) bounds contain the truth but are too wide to be informative — pre-lockdown employment is bounded at [30%, 83%] against a true value of 57%. Adding randomized incentives as instruments tightens bounds only modestly (8.5% width reduction for employment before lockdown). Monotone response selection assumptions fail because the empirically estimated marginal survey response function is non-monotone: for employment, the probability first decreases and then increases as a function of willingness-to-participate. The Heckman parametric selection model gives point estimates inconsistent with the ground truth for most outcomes (e.g., 51% estimated pre-lockdown employment vs. 57% true).&lt;/p&gt;
&lt;p&gt;Q: What motivates the two-dimensional participation model?
A: Analysis of participation timing shows that reminder emails attract a qualitatively different type of respondent than incentives alone. Reminders have a larger proportional effect on participation in the no-incentive group than in the high-incentive group, both in absolute and proportional terms. Early respondents (responding to initial contact) had lower pre-lockdown earnings and employment than late respondents (responding to reminders). This implies that the two types of unobservables — resistance to incentive and probability of receiving the invitation — are associated with outcomes that move in opposite directions, producing a non-monotone marginal survey response function that single-dimensional models cannot capture.&lt;/p&gt;
&lt;p&gt;Q: How does the two-dimensional model work and what are its results?
A: The model distinguishes active nonparticipants (saw the invitation, declined because the incentive was too low — more likely to be employed and higher earners) from passive nonparticipants (did not receive or attend to the invitation — more likely to have been adversely affected by the lockdown). By exploiting both the randomized incentive variation and the timing of responses (initial contact vs. reminder), the model partially identifies population mean outcomes under shape restrictions on the joint distribution of the two unobservables. For pre-lockdown employment, the model produces bounds of [48%, 63%] bracketing the true value of 57%, compared to worst-case bounds of [34%, 83%] and monotone selection bounds that do not contain the truth. Improvements are largest for pre-lockdown levels outcomes where the two types of nonparticipants differ most.&lt;/p&gt;
&lt;p&gt;Q: What are the practical recommendations for survey researchers?
A: Embedding randomized incentives in surveys at little or no additional cost enables an inexpensive test for nonresponse bias that does not require linked administrative data. When such a test detects bias, researchers should apply the two-dimensional model rather than relying on observable-based reweighting or conventional selection models. The question of who participates matters at least as much as how many participate; surveys should be designed to characterize and correct for selection, not merely to maximize response rates.&lt;/p&gt;
&lt;p&gt;Nonresponse bias: The difference between the mean response among survey participants and the true population mean, arising when the decision to participate is correlated with the outcome of interest. Distinct from sampling bias; it persists even with a randomly drawn sample.&lt;/p&gt;
&lt;p&gt;Selection on unobservables: Nonresponse bias that remains after conditioning on all observed characteristics. In the NiK survey, marginal and inframarginal participants are indistinguishable on observable demographics but differ dramatically in labor market outcomes, providing direct evidence that unobservables drive selection.&lt;/p&gt;
&lt;p&gt;Marginal vs. inframarginal participants: Under the Imbens-Angrist monotonicity condition, inframarginal participants would respond at any incentive level; marginal participants respond only at higher incentive levels. Their average responses are separately identified using an IV regression with the incentive as instrument.&lt;/p&gt;
&lt;p&gt;Marginal survey response (MSR): The function m(u) = E[Y*_i | U_i = u], giving the average outcome for individuals at the uth quantile of willingness to participate. The MSR is nonparametrically identified for u in [0, p(z_high)]; its empirically non-monotone shape in the NiK data explains why monotone selection assumptions produce bounds that miss the ground truth.&lt;/p&gt;
&lt;p&gt;Active vs. passive nonparticipants: Active nonparticipants received the survey invitation and declined because the incentive was insufficient; they tend to have higher labor market attachment. Passive nonparticipants never received or attended to the invitation but may respond to reminders; they tend to have been more adversely affected by the lockdown. This distinction motivates the two-dimensional model.&lt;/p&gt;
&lt;p&gt;Two-dimensional participation model: A model of survey participation with two unobservables — resistance to incentive (determining active nonresponse) and probability of receiving the invitation (determining passive nonresponse). By exploiting both incentive randomization and the timing of responses (initial contact vs. reminder), the model produces bounds or point estimates on population means that are narrower and closer to ground truth than single-dimensional alternatives.&lt;/p&gt;
&lt;p&gt;Exclusion restriction for incentives: The assumption that randomly assigned incentives affect participation rates but do not directly affect participants&amp;rsquo; answers to survey questions. This is required for incentives to serve as valid instruments for testing and correcting nonresponse bias; the authors test and find no evidence that it is violated.&lt;/p&gt;</description></item><item><title>Should Monetary Policy Care about Redistribution? Optimal Monetary and Fiscal Policy with Heterogeneous Agents</title><link>https://macropaperwarehouse.com/papers/should-monetary-policy-care-about-redistribution-optimal-monetary-and-fiscal-policy-with-heterogeneous-agents/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/should-monetary-policy-care-about-redistribution-optimal-monetary-and-fiscal-policy-with-heterogeneous-agents/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Should monetary policy deviate from price stability to address redistributive concerns in an economy with heterogeneous agents? The paper jointly solves for optimal monetary and fiscal policy under commitment in a Heterogeneous Agent New Keynesian (HANK) environment with incomplete insurance markets for idiosyncratic risk, nominal frictions (Rotemberg price adjustment costs), and aggregate technology shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Framework.&lt;/strong&gt; The model is a Bewley-style incomplete-markets economy populated by a continuum of agents who differ in their idiosyncratic labor productivity histories. Agents save in two assets — nominal public debt and real capital shares — and face nominal borrowing constraints. Intermediate firms operate under monopolistic competition and face quadratic price adjustment costs. The government has up to five fiscal instruments: linear taxes on real capital income, on nominal asset income, and on labor income; lump-sum transfers; and one-period public nominal debt. Monetary policy controls the path of the nominal interest rate, and thereby inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Three fiscal regimes are analyzed:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regime 1 — Full optimal fiscal policy.&lt;/strong&gt; When both capital taxes (on real and nominal asset returns) and a labor tax are freely optimizable and time-varying, the paper proves analytically (Proposition 1) that optimal monetary policy implements exact price stability at all periods. The intuition is that linear capital taxes replicate all direct redistributive channels of inflation (return effects and Fisher effects), while the labor tax replicates all indirect general-equilibrium channels (real wage effects). Hence fiscal tools are sufficient substitutes for any redistributive role of inflation, and the Rotemberg price-adjustment loss makes any deviation from zero inflation strictly costly. This equivalence result extends Correia et al. (2008) to environments with heterogeneous asset holdings, capital, and both real and nominal assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regime 2 — Exogenous fiscal rules (constant or modestly time-varying taxes).&lt;/strong&gt; Using a standard quarterly calibration for the US (capital tax 36%, labor tax 28%, transfers 8% of GDP; Frisch elasticity 0.5; price adjustment cost κ=100; TFP shock persistence 0.95, standard deviation 0.31% per quarter; wealth Gini 0.73), the paper solves for optimal inflation dynamics numerically via a &amp;ldquo;timeless perspective&amp;rdquo; — i.e., around the long-run equilibrium. Under Fiscal Rule 1 (constant marginal tax rates, debt-stabilizing transfer rule), the maximum change in the inflation rate following a one-standard-deviation negative TFP shock is &lt;strong&gt;0.01%&lt;/strong&gt;, and the annualized standard deviation of inflation is &lt;strong&gt;0.020%&lt;/strong&gt;. Under Fiscal Rule 2 (labor tax falls by 0.2 percentage points on impact from 28% to 27.8%, capital tax rises by 0.2 percentage points from 36% to 36.2%), inflation volatility is &lt;strong&gt;slightly lower&lt;/strong&gt; and aggregate consumption volatility is also reduced, confirming that even simple time-varying fiscal rules dominate optimal inflation as an insurance device. The aggregate welfare gain from implementing optimal inflation relative to constant inflation (Π=1) is &lt;strong&gt;0.002%&lt;/strong&gt; in consumption-equivalent terms, with the gain concentrated among low-productivity agents (up to 0.01%), while high-productivity agents who can self-insure experience a near-zero gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regime 3 — Constrained-optimal fiscal policy.&lt;/strong&gt; Holding the capital tax constant while optimizing over the labor tax (or vice versa), and calibrating Pareto weights via an inverse-optimal-taxation approach to match the observed US steady-state fiscal system, the paper finds that optimal inflation volatility remains small at a standard deviation of &lt;strong&gt;0.01%&lt;/strong&gt;, again confirming the dominance of fiscal over monetary instruments for redistribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robustness.&lt;/strong&gt; A simple two-agent economy calibrated closer to Bhandari et al. (2021b) — with a steeper Phillips curve (κ=20, slope ~6%), higher IES (1/σ=1/2), and highly unequal profit distribution (parameter ν=10 so high-productivity agents receive nearly all profits) — generates an inflation response on impact of &lt;strong&gt;0.17%&lt;/strong&gt;. Introducing a countercyclical fiscal rule (even a simple one) in this more volatile calibration reduces optimal inflation volatility by one order of magnitude, from &lt;strong&gt;0.68% to 0.07%&lt;/strong&gt;, and the on-impact response from &lt;strong&gt;0.15% to less than 0.01%&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodological contribution.&lt;/strong&gt; The analysis relies on two innovations: (i) a Lagrangian approach adapted from Marcet and Marimon (2019) that introduces the concept of &amp;ldquo;net social value of liquidity&amp;rdquo; for each agent, greatly simplifying first-order conditions; and (ii) a truncation method (LeGrand and Ragot 2022a,c) that represents incomplete-market heterogeneity by grouping agents by their last N periods of idiosyncratic history (truncation length N=5, giving 727 active histories), yielding a finite state space tractable for optimal policy computation. Results are validated against the Reiter (2009) histogram method.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; The equivalence result holds with commitment, a timeless perspective, and requires one distinct tax instrument per asset class (a separate tax on nominal and real returns). It holds under general period utility (not only separable forms). The result does not hold if the nominal asset tax is constrained to equal the real capital tax, in which case inflation would partially substitute for the missing instrument. The quantitative findings on small optimal inflation volatility are specific to the timeless perspective; a time-0 problem can generate larger deviations due to the ability to surprise agents with an initial inflation jump.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central equivalence result and under what exact conditions does it hold?&lt;/strong&gt;
When the government has access to time-varying linear taxes on real capital income, on nominal asset income, and on labor income — in addition to lump-sum transfers and public debt — optimal monetary policy implements exact price stability (gross inflation Πt = 1 at all dates). The conditions are: Ramsey commitment, both real and nominal asset taxes available as distinct instruments, and the Rotemberg price adjustment friction. The equivalence holds in the timeless perspective and the time-0 perspective, and does not require separability of the utility function.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the availability of capital and labor taxes render inflation redundant as a redistributive tool?&lt;/strong&gt;
Monetary policy operates through five channels identified in the HANK literature: three direct channels (substitution effect on returns, Fisher effect on nominal assets, wealth effect from unhedged interest-rate exposure) and two indirect channels (general-equilibrium labor income effects, heterogeneous exposure to income variation). The real capital tax — by affecting returns on all savings proportionally — can replicate any allocation achievable through the direct channels. The labor tax — by creating a wedge between the firm&amp;rsquo;s marginal cost of labor and household labor income — can replicate any allocation achievable through the indirect channels. With both instruments available, inflation&amp;rsquo;s only remaining effect is to destroy resources via Rotemberg adjustment costs, so the planner optimally sets Πt = 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the &amp;ldquo;net social value of liquidity&amp;rdquo; and how does it simplify the analysis?&lt;/strong&gt;
The net social value of liquidity for agent i at date t, ψ̂i,t = ψi,t − μt, equals the planner&amp;rsquo;s benefit from transferring one unit of consumption to agent i net of its fiscal cost. It combines the agent&amp;rsquo;s marginal utility of consumption with the planner&amp;rsquo;s internalization of effects on saving incentives (through real and nominal Euler equations) and on labor supply (through the labor Euler equation). Expressing the Ramsey first-order conditions in terms of ψ̂i,t reduces them to Euler-like smoothing conditions that closely parallel the individual agents&amp;rsquo; Euler equations, making both algebra and economic interpretation substantially more transparent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How large is the optimal inflation response in the baseline quantitative calibration, and how does it decompose?&lt;/strong&gt;
Under the baseline US calibration (κ=100, quarterly period, standard fiscal rules with constant marginal tax rates), the optimal inflation response to a one-standard-deviation negative TFP shock reaches a maximum of 0.01% (ten basis points on an annualized basis or less). The annualized standard deviation of inflation is 0.020%. Inflation rises on impact and then declines back to steady state. The correlation of optimal inflation with output is 0.20, indicating mild countercyclicality. The difference in aggregate consumption volatility between the optimal-inflation economy (Economy 1) and the constant-inflation economy (Economy 2) is small; the std of consumption is 1.33% vs. 1.34% of the mean.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What welfare gains does optimal inflation deliver, and how do they vary across the productivity distribution?&lt;/strong&gt;
The average welfare gain from implementing optimal inflation relative to constant inflation (Π=1) is 0.002% in consumption-equivalent terms. This aggregate figure conceals heterogeneity: low-productivity agents experience a welfare gain of up to 0.01% because they benefit disproportionately from the reduction in consumption volatility (inflation acts as a partial Fisher-effect transfer to debtors who are credit-constrained). High-productivity agents experience a near-zero gain because they can self-insure through portfolio choice. All productivity groups experience a positive but modest welfare gain.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the effect of introducing a simple time-varying fiscal rule (Fiscal Rule 2) on optimal inflation dynamics?&lt;/strong&gt;
Fiscal Rule 2 sets the labor tax to fall from 28% to 27.8% on impact after a negative TFP shock (a decline of 0.2 percentage points), while the capital tax rises from 36% to 36.2%. The public debt path is roughly unchanged relative to Fiscal Rule 1. Compared to the constant-tax baseline, Fiscal Rule 2 yields slightly lower inflation volatility (standard deviation 0.018% vs. 0.020%) and lower aggregate consumption volatility (std 1.31% vs. 1.33% of mean). These results confirm that even a small, simple exogenous fiscal rule dominates inflation as an insurance device against aggregate TFP shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Under what calibration does the optimal inflation response become quantitatively sizable, and how does a fiscal rule affect it in that case?&lt;/strong&gt;
A combination of a steep Phillips curve (κ=20 rather than 100, implying a slope of about 6% rather than 2%), a higher intertemporal elasticity of substitution (IES = 1/σ = 1/2 rather than 1), and highly unequal profit distribution (parameter ν=10, so high-productivity agents receive nearly all profits) generates an on-impact inflation response of approximately 0.15%–0.17% after a 1% negative TFP shock, and an inflation volatility of 0.68%. Introducing a countercyclical fiscal rule in this environment reduces inflation volatility by one order of magnitude to 0.07%, and the on-impact response from 0.15% to less than 0.01%, while also reducing aggregate consumption volatility.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the role of profit distribution in determining the sign and magnitude of the optimal inflation response?&lt;/strong&gt;
The distribution of firms&amp;rsquo; profits to households is a key driver of optimal inflation. When profits are distributed predominantly to high-productivity agents (ν=10), optimal inflation rises on impact after a negative TFP shock, because higher inflation benefits low-productivity credit-constrained agents through the Fisher effect and the real-wage channel. When profits are distributed equally across agents (ν=0), the optimal inflation response reverses sign and becomes negative on impact (−0.13% instead of +0.17%), because decreasing inflation raises firms&amp;rsquo; profits and, since those profits are equally shared, acts as a progressive transfer to credit-constrained low-income agents who consume a larger fraction at the margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the constrained-optimal fiscal policy scenario (Regime 3) affect inflation dynamics?&lt;/strong&gt;
In Regime 3, a Pareto-weight social welfare function is calibrated via an inverse-optimal-taxation approach so that the observed US fiscal steady state (36% capital tax, 28% labor tax, 8% transfers/GDP) is an interior optimal. The planner then jointly optimizes either the labor tax path (holding capital tax constant) or the capital tax path (holding labor tax constant) together with the inflation path. The resulting optimal inflation standard deviation is 0.01%, confirming that even partial fiscal flexibility is sufficient to drive inflation volatility close to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the timeless perspective differ from a time-0 problem in generating inflation deviations?&lt;/strong&gt;
In a time-0 problem the planner can exploit initial surprise: at date 0, unexpected inflation can redistribute real wealth through the Fisher effect on pre-existing nominal debt holdings, a mechanism immune to the time-consistency constraint. This creates a larger initial inflation front-loading. In the timeless perspective — the paper&amp;rsquo;s main framework — the economy is assumed to have been running under the optimal commitment rule for a long time, so no such surprise mechanism is available, and the planner&amp;rsquo;s only inflationary tool is the recurrent business-cycle insurance motive. As a result, inflation volatility in the timeless perspective is substantially smaller than in a time-0 problem.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the truncation method and how does the paper validate its accuracy?&lt;/strong&gt;
The truncation method (LeGrand and Ragot 2022a,c) groups agents by their last N periods of idiosyncratic productivity history, creating a finite state space. With N=5 and 5 idiosyncratic states, there are 5^5=3,125 possible histories, of which 727 have positive probability. A &amp;ldquo;refined&amp;rdquo; variant (LeGrand and Ragot 2022c) applies longer truncation lengths to more common histories while keeping total history count linear rather than exponential in Nmax. The paper sets Nmax=20 for the refined truncation as a robustness check and finds impulse responses and second-order moments nearly identical to the N=5 baseline. Results are also compared against the Reiter (2009) histogram method, showing close agreement in both impulse response functions and second-order moments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the paper relate to the equivalence results of Correia et al. (2008)?&lt;/strong&gt;
Correia et al. (2008) show that in a representative-agent economy without capital, a time-varying consumption tax can implement price stability regardless of nominal frictions. The current paper extends this to an environment with heterogeneous asset holdings (both real and nominal), capital accumulation, and an incomplete insurance market. The extension requires one distinct tax instrument per asset class (separate taxes on nominal and real returns), rather than a single consumption tax. The equivalence result would break down if the nominal asset tax were forced to equal the real capital tax, because inflation would then be needed to partially substitute for the missing degree of freedom.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What three mechanisms shape the optimal inflation first-order condition when fiscal policy is exogenous?&lt;/strong&gt;
When tax rates follow exogenous fiscal rules, the planner&amp;rsquo;s first-order condition for inflation balances three forces: (1) the Rotemberg resource-destruction cost of price adjustment (μt·κ·(Πt−1)), which penalizes any deviation from Πt=1; (2) the ability to manipulate the real wage through the New-Keynesian Phillips curve (a term involving the lead and lag of the Phillips-curve multiplier γt), which can transfer resources across households; and (3) the gain from reducing the real interest payment on existing nominal public debt through unexpected inflation (a term involving fund multipliers Γt and Υt, scaled by the outstanding debt Bt−1). The balance among these three forces determines the sign and magnitude of the optimal inflation response.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Net Social Value of Liquidity (ψ̂i,t).&lt;/strong&gt; The planner&amp;rsquo;s benefit from transferring one unit of consumption to agent i net of its fiscal cost (μt). Formally ψ̂i,t = ψi,t − μt, where ψi,t captures the agent&amp;rsquo;s marginal utility of consumption adjusted for the planner&amp;rsquo;s internalization of savings distortions through real and nominal Euler equations and the labor supply equation. This concept is introduced in the paper to simplify Ramsey first-order conditions in incomplete-market environments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Equivalence Result (Proposition 1).&lt;/strong&gt; The theoretical finding that, when the government has access to time-varying linear taxes on both nominal and real asset returns and on labor income, the planner can exactly reproduce the flexible-price allocation and optimal monetary policy is to implement zero net inflation at all dates. The equivalence holds because the fiscal instruments can replicate every redistributive channel of monetary policy at no resource cost, while any inflation deviation destroys output through price adjustment costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Timeless Perspective.&lt;/strong&gt; A solution concept for Ramsey optimal policy in which the economy is assumed to have been operating under the optimal commitment rule for a long time, so initial conditions no longer matter. As described in the paper (following Woodford, 1999, and McCallum and Nelson, 2000), this is &amp;ldquo;the closest notion to optimal policy making according to a rule&amp;rdquo; and eliminates the time-0 front-loading bias that arises when the planner can surprise agents with an initial inflation jump.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Truncation Method.&lt;/strong&gt; A method (LeGrand and Ragot 2022a,c) that approximates the infinite-dimensional heterogeneous-agent state space by grouping agents by their last N periods of idiosyncratic productivity history. Within each truncated history, agents are pooled with history-specific heterogeneity parameters (ξh) capturing wealth dispersion from histories prior to the aggregation window. The refined variant assigns different truncation lengths to different histories to keep the total number of histories linear in Nmax rather than exponential.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Direct vs. Indirect Channels of Monetary Policy.&lt;/strong&gt; Following Kaplan et al. (2018) and Auclert (2019), the paper distinguishes: (i) direct channels — the substitution effect on real returns, the Fisher effect on nominal asset values, and the wealth effect from unhedged interest-rate exposure — which operate through changes in asset returns; and (ii) indirect channels — heterogeneous labor income effects and heterogeneous income exposure — which operate through general-equilibrium effects on wages and employment. The paper&amp;rsquo;s equivalence result shows that capital taxes replicate the direct channels and the labor tax replicates the indirect channels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Fiscal Rule (Bohn-type, affine structure).&lt;/strong&gt; An exogenous rule specifying that marginal tax rates on capital and labor respond linearly to current and lagged TFP deviations from steady state, while transfers respond to TFP deviations and public debt deviations from target. The paper uses two such rules: Fiscal Rule 1 (constant marginal tax rates, debt-stabilizing transfer) and Fiscal Rule 2 (countercyclical labor tax and procyclical capital tax with the same debt path), to assess whether simple time-varying fiscal policies substitute for optimal inflation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Rotemberg Price Adjustment Cost.&lt;/strong&gt; A quadratic cost κ/2·(pj,t/pj,t−1 − 1)^2·Yt incurred by each intermediate firm when it changes its price, used as the nominal friction generating the New-Keynesian Phillips curve. In the paper&amp;rsquo;s model, any deviation of gross inflation Πt from 1 destroys real output, making this the welfare cost of using inflation as a policy instrument.&lt;/p&gt;</description></item><item><title>Silence to Solidarity: How Communication About a Minority Affects Discrimination</title><link>https://macropaperwarehouse.com/papers/silence-to-solidarity-how-communication-about-a-minority-affects-discrimination/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/silence-to-solidarity-how-communication-about-a-minority-affects-discrimination/</guid><description>&lt;p&gt;This paper examines how two types of communication about a minority group affect discriminatory behavior: (i) horizontal communication between majority-group members, and (ii) top-down communication from agents of authority such as the legal system. The setting is urban Chennai, India, where the paper measures discrimination against thirunangai — a community of transgender women who are India&amp;rsquo;s most visible LGBTQ+ group — in a field experiment with 3,397 participants.&lt;/p&gt;
&lt;p&gt;Discrimination is measured using incentivized hiring choices. Participants are offered a free grocery delivery and make 10 binary choices over which worker will carry out the delivery, with worker gender (cisgender male, cisgender female, or transgender) varying across options. The stakes are real: one choice is randomly selected and implemented 2–9 weeks later. Participants in the control condition are highly discriminatory: they are 19 percentage points (32%) less likely to hire a transgender worker than a non-transgender worker (p&amp;lt;0.001), and are willing to sacrifice grocery items worth 1.9 times their median daily per capita food expenditure to avoid a 15-minute interaction with a transgender worker.&lt;/p&gt;
&lt;p&gt;The first main treatment involves randomly assigning participants to a 3-person group discussion with two neighbors, in which they discuss and make collective hiring choices over the same options. The key outcome is participants&amp;rsquo; subsequent private, individual hiring choices. The discussion eliminates anti-transgender discrimination on average: participants in the discussion arm are 17 percentage points (42%) more likely to select a transgender worker in their private post-discussion choices relative to the control group (p&amp;lt;0.001), so that discrimination is no longer statistically distinguishable from zero (p=0.30). The discussion&amp;rsquo;s effect is partially persistent: approximately one month later, discussion participants are still 4 percentage points more likely to select transgender workers in hypothetical hiring choices (p=0.03), representing roughly 25% of the short-run effect.&lt;/p&gt;
&lt;p&gt;The second main treatment cross-randomizes a video shown before hiring choices. The legal rights video informs participants of a Supreme Court ruling affirming that transgender people hold the same fundamental constitutional rights as other citizens. This reduces discrimination by 10.3 percentage points (p&amp;lt;0.001). A rights messaging video — which argues that transgender people should have equal rights without invoking legal authority — reduces discrimination by a smaller 5.8 percentage points (p=0.001), and there is some evidence the legal-authority version is more effective (p of difference in [0.01, 0.12]). However, the legal rights video&amp;rsquo;s effect is only 59% as large as the discussion&amp;rsquo;s effect (p of difference in [0.002, 0.04]), and it does not persist at the one-month follow-up (p in [0.12, 0.51]).&lt;/p&gt;
&lt;p&gt;The paper rules out two candidate mechanisms for the discussion&amp;rsquo;s effects and supports a third. First, the discussion does not work primarily through correcting misperceived norms: while control-group participants do overestimate peer discrimination by 5 percentage points, the discussion reduces predicted discrimination by 24 percentage points — far more than a corrected misperception could explain (at most 21% of the effect under generous assumptions). Second, the discussion does not work through virtue signaling alone: a &amp;ldquo;No discussion (public)&amp;rdquo; arm in which participants make individually-visible choices shows no reduction in discrimination on average (p=0.83). Third, the paper provides affirmative evidence for a persuasion channel: participants in a &amp;ldquo;listener&amp;rdquo; arm, who silently observe a 2-person discussion without participating, discriminate 13 percentage points less than the control group (p&amp;lt;0.001), an effect that is highly persistent at the 2–9 week follow-up (11 percentage points, p&amp;lt;0.001). The persuasion mechanism is further supported by the finding that pro-trans participants are more vocal: each additional transgender worker chosen in post-discussion private choices is associated with a 32% higher probability of speaking first (p=0.03) and a 27% higher probability of dominating the discussion (p=0.02). Statements about transgender workers during discussions were 5.7 times more likely to be positive than negative. Listeners who heard moral argumentation about equality, rights, and giving opportunities subsequently discriminated less (p&amp;lt;0.001).&lt;/p&gt;
&lt;p&gt;Scope conditions: the study is conducted among urban Chennai residents (85% female), where transgender identity is visually recognizable and socially salient, awareness of the 2014 Supreme Court ruling is low (36% could not identify a single legal right transgender people hold), and a wedge exists between descriptive norms (high actual discrimination) and prescriptive norms (93% of the control group rate explicit discrimination as wrong). The model&amp;rsquo;s &amp;ldquo;sweet spot&amp;rdquo; logic implies these effects may not generalize to settings where discrimination is either near-universal (no privately pro-trans individuals to be vocal) or already minimal (no incentive to persuade).&lt;/p&gt;
&lt;p&gt;Q: How is anti-transgender discrimination measured in the experiment?
A: Participants make 10 incentive-compatible binary hiring choices over grocery delivery workers, with one choice randomly selected and implemented 2–9 weeks later. Discrimination is defined as the reduction in the probability of selecting the alternative worker when that worker is transgender versus non-transgender, conditional on other option characteristics such as items offered and reliability score. Participants are told they will have a 15-minute conversation with the selected worker, ensuring anticipated social contact. The design is framed as market research to obfuscate the study&amp;rsquo;s purpose; only 8% correctly guessed the true focus.&lt;/p&gt;
&lt;p&gt;Q: How large is baseline discrimination in the control group?
A: In the No discussion (private) control condition, participants are 19 percentage points (32%) less likely to hire a transgender worker than a non-transgender worker (p&amp;lt;0.001). In willingness-to-pay terms, participants sacrifice grocery items worth 1.9 times their median daily per capita food expenditure (Rs. 127 on a base of Rs. 67) to avoid selecting a transgender worker. Even when a transgender worker dominates on both items and reliability score, participants in the control group still select the non-transgender worker 47% of the time.&lt;/p&gt;
&lt;p&gt;Q: What is the main effect of the 3-person group discussion on subsequent discrimination?
A: Participants who engage in a group discussion with two neighbors are 17 percentage points more likely to select a transgender worker in their subsequent private individual choices (p&amp;lt;0.001). This eliminates average discrimination entirely: in the discussion arm, the probability of selecting a transgender worker is not statistically distinguishable from the probability of selecting a non-transgender worker (p=0.30). The willingness-to-pay to avoid a transgender worker falls from Rs. 127 to Rs. 13 (p of difference &amp;lt; 0.001), and is no longer significantly different from zero (p=0.265).&lt;/p&gt;
&lt;p&gt;Q: How persistent are the effects of the group discussion?
A: At the 2–9 week follow-up survey (mean 35 days), discussion participants are approximately 4 percentage points more likely to select transgender workers in hypothetical hiring choices (p=0.03). This represents approximately 25% of the short-run 17 percentage point effect, a decay rate comparable to the persistence of US political advertising effects in the political science literature (Hill et al., 2013, estimate 10–15% remaining after 30 days).&lt;/p&gt;
&lt;p&gt;Q: What is the effect of the legal rights video, and how does it compare to the discussion?
A: The legal rights video — informing participants of the Supreme Court ruling affirming transgender people&amp;rsquo;s fundamental constitutional rights — increases the probability of selecting a transgender worker by 10.3 percentage points (p&amp;lt;0.001). The rights messaging video, which argues that transgender people should have equal rights without invoking legal authority, increases it by 5.8 percentage points (p=0.001). The legal rights video&amp;rsquo;s effect is only 59% as large as the discussion&amp;rsquo;s 17 percentage point effect (p of difference in [0.002, 0.04]), and unlike the discussion, neither video&amp;rsquo;s effect is detectable at the one-month follow-up (p in [0.12, 0.51]).&lt;/p&gt;
&lt;p&gt;Q: Does the legal rights video work through a different channel than the rights messaging video?
A: There is evidence that the legal authority of the Supreme Court matters beyond the content of the rights message. The legal rights video is more effective than the rights messaging video at reducing discrimination (p of difference in [0.01, 0.12]), and the legal rights video (but not the rights messaging) affects participants&amp;rsquo; beliefs about the legal status of transgender people (as measured by a summary index). Both videos shift perceived descriptive norms — participants predict others will select transgender workers more, by 2–6 percentage points — but neither significantly affects attitudes as measured by a list experiment or disapproval questions.&lt;/p&gt;
&lt;p&gt;Q: Does the discussion work through correcting misperceived norms?
A: This channel can account for at most a small fraction of the effect. Control-group participants do overestimate peer discrimination by 5 percentage points in incentivized predictions (p&amp;lt;0.001, as measured by predicted probability of selecting a transgender worker). However, the discussion reduces predicted discrimination by 24 percentage points (p&amp;lt;0.001), far exceeding the initial misperception. Even under generous assumptions in which the misperception is precisely corrected, this mechanism could account for no more than 21% of the discussion&amp;rsquo;s treatment effect (95% CI: [8.9%, 32.5%]).&lt;/p&gt;
&lt;p&gt;Q: Does the discussion work through virtue signaling?
A: The evidence rules out virtue signaling as the primary channel. The &amp;ldquo;No discussion (public)&amp;rdquo; treatment arm makes participants&amp;rsquo; individual hiring choices visible to their group members, exogenously increasing social image concerns in the absence of a discussion. This has no detectable average effect on discrimination (p=0.83), indicating that social image concerns alone — without the persuasive content of an actual discussion — do not explain the reduction in discrimination generated by the group discussion.&lt;/p&gt;
&lt;p&gt;Q: What is the evidence for the persuasion mechanism?
A: The &amp;ldquo;listener&amp;rdquo; treatment arm provides direct evidence. In this arm, one participant silently observes a 2-person discussion without speaking, then makes private individual choices. Listeners discriminate 13 percentage points less than the control group (p&amp;lt;0.001), an effect statistically indistinguishable from full discussion participants. Since listeners changed their behavior based solely on what they heard and saw, this constitutes evidence of persuasion. The listener effect is highly persistent at the 2–9 week follow-up (11 percentage points, p&amp;lt;0.001) and holds on a robustness outcome designed to be completely private. The implied persuasion rate is 29%, described as high relative to values in the literature (DellaVigna &amp;amp; Gentzkow, 2010).&lt;/p&gt;
&lt;p&gt;Q: Why do pro-trans participants persuade others — what drives the discussion&amp;rsquo;s content?
A: Pro-trans participants are disproportionately vocal. Each additional transgender worker chosen in post-discussion private choices (a proxy for pro-trans private attitudes) is associated with a 32% higher probability of speaking first (p=0.03) and a 27% higher probability of dominating the discussion (p=0.02), but only when discussing a choice involving a transgender worker. The overall tone of discussions is strongly pro-trans: statements about transgender workers are 5.7 times more likely to be positive than negative. Participants who hear moral argumentation about equality, rights, and giving opportunities subsequently discriminate significantly less (p&amp;lt;0.001).&lt;/p&gt;
&lt;p&gt;Q: Does the discussion work by changing statistical (belief-based) discrimination?
A: Partially, baseline discrimination in the control group is partly statistical: despite transgender workers having the same average reliability scores as others, participants rate them as less likely to complete a delivery, and revealing the true reliability score makes participants 2.9 percentage points more likely to select a transgender worker (an effect unique to transgender workers). However, the discussion does not significantly affect beliefs about transgender workers&amp;rsquo; reliability, and there is no detected reduction in the belief-based component of discrimination in the discussion arm (though the test is underpowered).&lt;/p&gt;
&lt;p&gt;Q: Are the effects of the discussion and the legal rights video additive?
A: The two interventions appear to combine approximately linearly for the legal rights video: there are no detected interaction effects (p in [0.83, 0.96]). By contrast, there is weak evidence of a negative interaction between the rights messaging video and the discussion, suggesting these two may be substitutes — consistent with the rights messaging video&amp;rsquo;s content being similar to the pro-trans moral argumentation already present in discussions.&lt;/p&gt;
&lt;p&gt;Q: What alternative explanations are ruled out?
A: The paper tests and finds no support for: (i) photo characteristics such as perceived caste driving results; (ii) social image concerns affecting even post-discussion private choices (the &amp;ldquo;extra private&amp;rdquo; robustness outcome designed to be unobservable by neighbors yields similar results); (iii) increased contemplation or deliberation about choices; (iv) experimenter demand effects or social desirability bias (treatment effects do not differ for the 8% who guessed the study&amp;rsquo;s purpose); (v) increased salience of the transgender category; and (vi) cheap talk from low stakes (choices were incentive-compatible and implemented).&lt;/p&gt;
&lt;p&gt;Q: What is the study&amp;rsquo;s theoretical model for why pro-trans participants speak out?
A: The paper develops a model combining social signaling (people want to fit in with their group; Bénabou &amp;amp; Tirole, 2006) with direct persuasion (participants can change each other&amp;rsquo;s preferences through messages). Under the right conditions, only pro-trans participants send persuasive pro-trans messages. This occurs in a &amp;ldquo;sweet spot&amp;rdquo; range: when average discrimination is not so strong that no one is privately pro-trans, and not so weak that pro-trans participants lack an incentive to persuade (since they are already in the majority). The context in Chennai — high actual discrimination but strong social norms against it — satisfies this sweet spot condition.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications regarding horizontal versus top-down communication?
A: In this context, facilitating horizontal communication between neighbors is a more effective tool for reducing discrimination than top-down communication about legal rights: the discussion&amp;rsquo;s effect is 1.7 times larger than the legal rights video (17 p.p. vs. 10.3 p.p.) and partially persists at one month, whereas the legal rights video&amp;rsquo;s effect does not persist. However, the legal rights video does reduce discrimination relative to the rights messaging video, suggesting that communicating the legal authority of the Supreme Court carries independent weight beyond rights advocacy messaging. Both interventions are complementary when combined.&lt;/p&gt;
&lt;p&gt;Horizontal communication: Communication between members of the majority group about a minority, as distinct from contact between majority and minority groups or top-down communication from authority. In this paper, operationalized as a group discussion among three neighbors who make collective hiring choices.&lt;/p&gt;
&lt;p&gt;Top-down communication: Communication from agents of authority — here, the legal system — about a minority group&amp;rsquo;s rights. Measured via a video informing participants of a Supreme Court ruling affirming transgender people&amp;rsquo;s constitutional rights.&lt;/p&gt;
&lt;p&gt;Anti-transgender discrimination: In the paper&amp;rsquo;s own measurement, the reduction in the probability that a worker is chosen because they are transgender (relative to being non-transgender), conditional on other delivery option characteristics. Measured in incentivized, privately-elicited binary hiring choices.&lt;/p&gt;
&lt;p&gt;Expressive law hypothesis: The theory that changes in the law affect behavior by changing people&amp;rsquo;s perception of the prevailing social norm, not (only) through deterrence. The paper tests this by comparing a legal rights video (invoking Supreme Court authority) to a rights messaging video with identical content but no legal backing, finding the legal-authority version more effective.&lt;/p&gt;
&lt;p&gt;Persuasion channel: The mechanism by which discussion participants change each other&amp;rsquo;s preferences through persuasive messages, particularly moral arguments about equality and rights. Distinguished in the paper from virtue signaling (publicly visible pro-trans behavior) and norm correction (updating misperceived beliefs about peer behavior).&lt;/p&gt;
&lt;p&gt;Pluralistic ignorance: A setting in which people misperceive how common discriminatory attitudes are among their peers, potentially hiding genuine minority support for the discriminated group. The paper tests this as a candidate mechanism and finds it can account for at most 21% of the discussion effect.&lt;/p&gt;
&lt;p&gt;Sweet spot condition: The range of average group discrimination levels in which pro-trans participants have both the motivation and opportunity to speak out persuasively — discrimination is not so universal that no one is privately pro-trans, and not so minimal that the pro-trans participants feel no need to persuade others. The paper argues the Chennai context satisfies this condition.&lt;/p&gt;</description></item><item><title>Skill-Replacing Technology and Bottom-Half Inequality</title><link>https://macropaperwarehouse.com/papers/skill-replacing-technology-and-bottom-half-inequality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/skill-replacing-technology-and-bottom-half-inequality/</guid><description>&lt;p&gt;This paper proposes a model of skill-replacing routine-biased technological change (SR-RBTC) to explain patterns in U.S. bottom-half wage inequality that standard RBTC models cannot account for. The central departure from prior models (e.g., Acemoglu and Autor 2011; Cortes 2016) is that technology substitutes the usage of skill within routine occupations rather than replacing routine workers wholesale. Formally, SR-RBTC is characterized by epsilon &amp;lt; 0, where epsilon = d² log phi_R / (d theta_i d tau), meaning productivity gains in the routine occupation are disproportionately concentrated among lower-skilled workers, compressing skill-wage gradients within that occupation.&lt;/p&gt;
&lt;p&gt;The paper addresses three stylized facts that skill-neutral RBTC models leave unexplained. First, wage polarization concentrated around the median rather than the entire bottom half, even though routine workers are dispersed across the full bottom half of the wage distribution. SR-RBTC explains this because the largest wage drops accrue to the highest-skilled routine workers, who were empirically concentrated near the middle of the overall distribution. Second, the decline in middle wages stopped around 2000 even as routine employment continued falling. The model accounts for this through a two-phase mechanism: once the return to skill in routine occupations falls below that in manual occupations, the routine occupation attracts the lowest-skilled workers, shifting negative wage pressure to the bottom rather than the middle of the distribution. Third, average wages in routine occupations did not fall substantially despite large employment declines; in SR-RBTC, wage losses for higher-skilled routine workers are partially offset by gains for lower-skilled ones, leaving average routine wages relatively stable.&lt;/p&gt;
&lt;p&gt;The paper tests two new predictions using an Interactive Fixed-Effects Model (IFEM) estimated on Panel Study of Income Dynamics (PSID) data for 1980–2017. The IFEM regresses log wages on occupation-year fixed effects, experience controls, and worker fixed effects (capturing unobserved skill theta_i) interacted with occupational category and year, instrumenting the fixed effects with years of schooling to correct attenuation bias. Results confirm both predictions. The return to skill in routine occupations declined sharply from the late 1980s onward: log alpha_{R,t} fell by more than 0.7, corresponding to a greater-than-50 percent reduction between its 1987 peak and 2017, while manual and abstract occupations showed no comparable decline. Average skill in routine occupations also fell steadily, dropping from near the population mean in the early 1980s to approximately -0.2 by the end of the sample, such that by 2015 routine workers had lower average skill than manual workers.&lt;/p&gt;
&lt;p&gt;To quantify SR-RBTC&amp;rsquo;s contribution to overall wage polarization, the paper introduces a skewness decomposition. Because SR-RBTC violates the ignorability assumption underlying standard decomposition methods (e.g., DiNardo et al. 1996; Firpo et al. 2009), prior approaches could not capture the within-occupation inequality changes central to the mechanism. The skewness decomposition partitions the third central moment of log wages into a within-occupation component, a between-occupation component, and a covariance component (correlation between occupation mean wages and occupation wage inequality). Using Current Population Survey Outgoing Rotation Group (CPS-ORG) data focused on 1992–2002, the paper finds that 93 percent of the rise in skewness is related to occupational trends (the within component explains only 7 percent). Of that, 78 percent of the total increase in skewness is driven by the covariance component — rising inequality in higher-paying abstract occupations combined with falling inequality in lower-paying routine occupations — consistent exclusively with SR-RBTC rather than skill-neutral RBTC. The paper concludes that SR-RBTC can account for the large majority of U.S. bottom-half wage polarization trends from the late 1980s through the early 2000s.&lt;/p&gt;
&lt;p&gt;Q: What is the core distinction between SR-RBTC and standard (skill-neutral) RBTC?&lt;/p&gt;
&lt;p&gt;A: In standard RBTC, technology raises productivity uniformly for all routine workers regardless of skill (epsilon = 0), so wage effects are identical across the skill distribution within routine occupations. In SR-RBTC (epsilon &amp;lt; 0), technology and skill are substitutes, so higher-skilled routine workers experience proportionally smaller productivity gains — or relative wage declines — while lower-skilled routine workers may benefit. This means SR-RBTC compresses the within-routine wage distribution rather than shifting it uniformly downward.&lt;/p&gt;
&lt;p&gt;Q: How does SR-RBTC generate wage polarization concentrated at the median rather than across the full bottom half?&lt;/p&gt;
&lt;p&gt;A: Because the largest wage drops fall on the highest-skilled workers within the routine occupation, and those workers were empirically concentrated near the middle of the overall wage distribution, SR-RBTC disproportionately reduces wages around the median. Skill-neutral RBTC, by contrast, would reduce wages equally for all routine workers who are spread across the full bottom half, predicting wage declines throughout the bottom 50 percent rather than just near the 50th percentile.&lt;/p&gt;
&lt;p&gt;Q: Why does the model predict a non-monotonic relationship between technological progress and bottom-half inequality?&lt;/p&gt;
&lt;p&gt;A: In Phase 1, routine occupations employ middle-skilled workers; SR-RBTC reduces wages most for the highest-earning (highest-skilled) routine workers, compressing the bottom half of the distribution. In Phase 2, once the return to skill in routine occupations falls below that in manual occupations, the comparative advantage of middle-skilled workers shifts away from routine jobs, and routine occupations come to employ the lowest-skilled workers. Further SR-RBTC then concentrates negative wage pressure at the bottom of the distribution, potentially increasing bottom-half inequality. The transition between these phases corresponds empirically to the reversal around 2000.&lt;/p&gt;
&lt;p&gt;Q: What does the IFEM find about the return to skill in routine versus other occupations?&lt;/p&gt;
&lt;p&gt;A: Log alpha_{R,t} (the return to unobserved skill in routine occupations) fell by more than 0.7 log points between its 1987 peak and 2017, representing a greater-than-50 percent reduction. Manual occupations remained stable at approximately log alpha_{M,t} = -0.3. Abstract occupations saw a smaller and later decline, largely after 1994, consistent with evidence on a reversal in demand for cognitive skills (Beaudry et al. 2016) but far less pronounced than the routine occupation decline. The ranking of return to skill between routine and manual occupations reversed during the 1990s, matching the model&amp;rsquo;s Phase 2 threshold condition (Theorem 5).&lt;/p&gt;
&lt;p&gt;Q: What does the IFEM find about the skill composition of routine workers over time?&lt;/p&gt;
&lt;p&gt;A: Average estimated skill (theta_hat_i) in routine occupations declined from near zero (the population average) in the early 1980s to approximately -0.2 by the end of the sample. By 2015, average skill in routine occupations fell below that of manual workers, a reversal not seen for abstract or manual occupations over the same period. The decline in routine skill composition was primarily driven by fewer middle-skilled workers entering the labor force into routine jobs: the share of middle-skilled new entrants going into routine occupations fell from nearly 50 percent in the early 1980s to around 33 percent after 2010, at a rate of 0.53 percentage points per year.&lt;/p&gt;
&lt;p&gt;Q: What is the skewness decomposition and why is it needed?&lt;/p&gt;
&lt;p&gt;A: Skewness — the third standardized moment of the log wage distribution — measures asymmetry and captures wage polarization (rising top-half inequality alongside falling bottom-half inequality). It decomposes into three components: within-occupation (residual skewness not explained by occupational structure), between-occupation (skewness from differences in group means), and a covariance component (correlation between occupation-level mean wages and occupation-level wage inequality). Standard decomposition methods (Juhn et al. 1993; DiNardo et al. 1996; Firpo et al. 2009) rely on ignorability, which fails when the within-occupation wage distribution itself changes — as SR-RBTC predicts. The covariance component of skewness captures exactly these within-occupation structural changes without requiring ignorability.&lt;/p&gt;
&lt;p&gt;Q: What do the skewness decomposition results show about the driver of wage polarization?&lt;/p&gt;
&lt;p&gt;A: Decomposing the rise in skewness between 1992 and 2002 using 3-digit occupational coding, 93 percent of the total increase is attributable to occupational trends (only 7 percent is explained by the within-occupation component unrelated to occupational structure). Of the total skewness increase, 78 percent is accounted for by the covariance component — rising inequality in high-paying abstract occupations combined with declining inequality in low-paying routine occupations. This pattern is precisely what SR-RBTC predicts and cannot be generated by skill-neutral RBTC, which would predict the rise to come primarily from the between-occupation component (declining average routine wages).&lt;/p&gt;
&lt;p&gt;Q: Why did prior decomposition methods fail to detect the SR-RBTC mechanism?&lt;/p&gt;
&lt;p&gt;A: Prior methods (e.g., Autor et al. 2005; Firpo et al. 2013) operated under the ignorability assumption: the conditional distribution of wages given observables (e.g., occupation) is unchanged when the distribution of observables changes. This holds under skill-neutral RBTC (uniform wage effects within routine occupations) but fails under SR-RBTC, where the within-occupation wage structure itself changes. Consequently, prior methods only captured the (modest) decline in average routine wages — too small to explain observed polarization — and missed the inequality compression within routine occupations, which is the primary driver.&lt;/p&gt;
&lt;p&gt;Q: What are the two micro-foundations offered for SR-RBTC?&lt;/p&gt;
&lt;p&gt;A: The first (Appendix B.1) models technology as automating a subset of tasks within routine occupations, freeing workers to spend more time on remaining tasks. SR-RBTC arises when the automated task is more skill-intensive than the average task (e.g., arithmetic calculations for cashiers); automating a relatively skill-intensive task disproportionately helps lower-skill workers. The second (Appendix B.2) models technology as improving the quality or quantity of capital (computers, robots) that substitutes for skill; SR-RBTC arises when the elasticity of substitution between skill and technology exceeds a threshold, making skill and technology gross substitutes.&lt;/p&gt;
&lt;p&gt;Q: How does SR-RBTC explain the absence of large average wage declines in routine occupations despite large employment declines?&lt;/p&gt;
&lt;p&gt;A: Under SR-RBTC, wages fall for the highest-skilled workers in the routine occupation but may rise (or fall less) for lower-skilled routine workers, since the technology reduces the skill premium rather than depressing all wages uniformly. The compositional shift — higher-skilled workers exiting routine occupations — further mitigates measured average wage declines by replacing the departing high earners with lower-skilled entrants who earn closer to the (now-compressed) routine wage floor. As a result, quantity (employment) adjusts more than price (average wage), consistent with the observed data.&lt;/p&gt;
&lt;p&gt;Q: What is the quantitative magnitude of the skill-level change in routine occupations?&lt;/p&gt;
&lt;p&gt;A: Given that the return to skill in routine occupations in 2017 (alpha_{R,2017}) was approximately 0.3 (corresponding to -1.2 in log units), and average skill in routine occupations fell by approximately 0.2 units, the paper calculates that if routine workers in 2017 had maintained the same average skill level as in 1980, their wages would have been approximately 6 percent higher.&lt;/p&gt;
&lt;p&gt;Q: What alternative explanations does the paper evaluate, and how does it rule them out?&lt;/p&gt;
&lt;p&gt;A: The paper considers minimum wage increases (Piketty 2014) and declining unionization (Firpo et al. 2013) as potential contributors. The skewness decomposition implies these explanations are limited: since 93 percent of the skewness increase is driven by occupational trends and 78 percent by the covariance component (within-occupation inequality changes), mechanisms that operate through uniform group-level wage shifts — as minimum wage or union explanations would — can account for only a small fraction of the overall trend. The IFEM further rules out that the decline in within-routine inequality reflects worker composition becoming more homogeneous rather than a genuine decline in return to skill, as the sensitivity analysis shows alpha_jt changes are driven almost entirely by workers staying within each occupational category.&lt;/p&gt;
&lt;p&gt;Skill-Replacing RBTC (SR-RBTC): A variant of routine-biased technological change in which technology substitutes the usage of skill within routine occupations (epsilon &amp;lt; 0), reducing the return to skill and compressing within-occupation wage inequality, as distinct from skill-neutral RBTC (epsilon = 0) which shifts wages uniformly and skill-enhancing RBTC (epsilon &amp;gt; 0) which widens skill gaps.&lt;/p&gt;
&lt;p&gt;Interactive Fixed-Effects Model (IFEM): An extension of the standard fixed-effects panel wage regression in which worker fixed effects (capturing unobserved permanent skill theta_i) are interacted with both occupational category and year, allowing the estimated return to skill alpha_jt to vary across occupations and over time; worker fixed effects are instrumented with years of schooling to correct attenuation bias.&lt;/p&gt;
&lt;p&gt;Skewness Decomposition: A decomposition of the third central moment of the log wage distribution (skewness) into three components — within-occupation, between-occupation, and a covariance term (the covariance between occupation-level mean wages and occupation-level wage inequality) — that, unlike standard decomposition methods, does not require the ignorability assumption and can therefore capture changes in the within-occupation wage structure.&lt;/p&gt;
&lt;p&gt;Ignorability Assumption: The assumption, required by standard decomposition methods (e.g., DiNardo et al. 1996; Firpo et al. 2009), that the conditional distribution of wages given observables (here, occupations) does not change when the distribution of observables changes; violated under SR-RBTC because the within-occupation wage structure itself shifts as skill-replacing technology advances.&lt;/p&gt;
&lt;p&gt;Comparative Advantage (Occupational Sorting): The mechanism by which workers sort into occupations based on their skill level theta_i relative to occupation-specific return-to-skill schedules; SR-RBTC shifts occupational thresholds by compressing the routine occupation&amp;rsquo;s skill premium, causing higher-skilled workers to exit routine jobs and lower-skilled workers to enter.&lt;/p&gt;
&lt;p&gt;Two-Phase Dynamics: The non-monotonic relationship between technological progress and bottom-half inequality in the SR-RBTC model; Phase 1 (late 1980s–2000) sees middle wages decline as the highest-skilled (middle-of-distribution) routine workers experience the largest wage drops; Phase 2 (2000 onward) sees bottom wages fall as the routine occupation shifts to employing the lowest-skilled workers once the routine skill premium falls below the manual skill premium.&lt;/p&gt;</description></item><item><title>Slum Upgrading and Long-Run Urban Development: Evidence from Indonesia</title><link>https://macropaperwarehouse.com/papers/slum-upgrading-and-long-run-urban-development-evidence-from-indonesia/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/slum-upgrading-and-long-run-urban-development-evidence-from-indonesia/</guid><description>&lt;p&gt;This paper estimates the long-term causal effects of the Kampung Improvement Program (KIP), one of the world&amp;rsquo;s largest slum upgrading programs, on urban development in Jakarta, Indonesia. KIP ran from 1969 to 1984 across three staggered waves (Pelita I-III), covered 110 square kilometers (25% of Jakarta&amp;rsquo;s area), and served approximately 5 million residents at a total cost of roughly $500 million (2015 USD). The program provided basic physical upgrades — paved roads and footpaths, sanitation and drainage, and community buildings such as schools and health clinics — along with a verbal non-eviction guarantee for 15 years. Residents were not relocated.&lt;/p&gt;
&lt;p&gt;The central research question is whether preserving slums through upgrading entails long-run dynamic inefficiency: as Jakarta formalizes, do KIP areas lag behind non-KIP areas in ways that generate opportunity costs from land misallocation?&lt;/p&gt;
&lt;p&gt;The authors assemble high-resolution data on KIP policy boundaries, current assessed land values (nearly 20,000 sub-blocks), building heights from a novel photographic survey of 19,518 pixels stratified across Jakarta, and multiple novel measures of informality — a rank-based photographic index (0 to 4), an attributes-based index across fifteen binary characteristics, and administrative data on unregistered land-parcel titles. They also use digitized historical maps from 1937 and 1959 to identify pre-KIP kampung boundaries.&lt;/p&gt;
&lt;p&gt;Two empirical strategies address program selection bias (KIP planners prioritized the worst-condition kampungs first). The first restricts the sample to historical kampungs that existed before KIP and includes locality fixed effects, comparing treated kampungs against nearby untreated ones within the same neighborhood. The second is a boundary discontinuity design (BDD) comparing observations within 200 meters of KIP boundaries. Both strategies include eighteen predetermined controls for historical landmarks, infrastructure, and topography including flood proneness.&lt;/p&gt;
&lt;p&gt;Average effects (robust across both strategies): KIP areas today have land values approximately 14-17 log points (roughly 15%) lower than observably equivalent non-KIP areas, and are about 8-12 percentage points less likely to contain buildings taller than three floors — half the control-group mean of 0.24. KIP areas are more informal across all three informality metrics: the rank-based index is higher by 0.29 standard deviations, the attributes-based index by 0.05 SD units, and the share of unregistered parcels is 3 percentage points higher. Building heights corroborate the land-value finding: imputing the hedonic value of missing tall buildings in KIP accounts for approximately 90% of the aggregate land-value impact ($2.2 billion of $2.4 billion).&lt;/p&gt;
&lt;p&gt;Heterogeneity by real estate potential is a central finding. The authors construct a predicted land index for 2,058 hamlets in Jakarta using non-KIP land values. In the lowest quintile (Q5), KIP areas show a positive and statistically significant effect of +10 log points on land values, consistent with direct capitalization of the upgrades. This effect reverses in higher-potential areas: the estimate reaches -28 log points in Q2 and -30 log points in Q1, as non-KIP neighborhoods formalize while KIP areas lag.&lt;/p&gt;
&lt;p&gt;Surplus calculations integrating land values, building heights, horizontal built-up coverage (35% for KIP vs. 18% for non-KIP), and demand and supply elasticities reveal that 90% of total surplus losses are concentrated in the top two quintiles (Q1 and Q2), which comprise 47% of KIP&amp;rsquo;s coverage area. In Q1, KIP surplus is lower by $2,369 per square meter; in Q2, the gap is $1,044 per square meter. In the bottom two quintiles, KIP delivers greater surplus (up to +$347 per square meter in Q5), covering an estimated 3 million residents across 57 square kilometers.&lt;/p&gt;
&lt;p&gt;Mechanisms consistent with delayed formalization include significantly higher population density in KIP areas (+33 log points, or 39%) and greater land fragmentation (+9 parcels per pixel relative to a non-KIP mean of 19), both of which raise relocation and land assembly costs. The original KIP investments show no differential effect by type or intensity after four decades, consistent with their 15-year projected useful life. Endogenous sorting is ruled out as a confounder: if anything, educational attainment is slightly higher in KIP areas.&lt;/p&gt;
&lt;p&gt;Q: What is the Kampung Improvement Program (KIP) and what did it provide?
A: KIP was a slum upgrading program implemented in Jakarta, Indonesia from 1969 to 1984 across three five-year plan waves (Pelita I, II, III). It covered 110 square kilometers and 5 million residents at a total cost of approximately $500 million (2015 USD). The program provided three categories of basic physical improvements — vehicular and pedestrian road access, sanitation and drainage infrastructure, and community buildings (schools, health clinics) — along with a verbal non-eviction guarantee for 15 years. Crucially, upgrades were designed to be basic, with a planned useful life of only 15 years, to avoid attracting higher-income groups.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question and theoretical concern motivating the paper?
A: The paper asks whether slum upgrading programs, while immediately beneficial to residents, entail dynamic inefficiency by delaying formalization as cities develop. The concern is that preserving slums through upgrades and non-eviction guarantees can create opportunity costs from land misallocation when surrounding areas formalize and redevelop into higher-value formal structures. This is framed as a trade-off between the direct welfare benefits of upgrading (affordable in-situ housing for millions) and the long-run costs to urban land productivity.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address the selection bias problem — KIP targeted the worst-condition kampungs first?
A: Two complementary strategies are used. First, the historical kampung specification restricts the sample to areas that were kampungs before KIP (from 1937 and 1959 maps) and includes locality fixed effects, so treated and control units are compared within the same neighborhood and share the same real estate market by assumption. Second, a boundary discontinuity design (BDD) compares observations within 200 meters of KIP boundaries with boundary fixed effects and quadratic distance controls. A falsification test using sequential KIP waves confirms the approach: the raw data shows a monotonic pattern (Wave I worst: -0.40 log points, Wave II: -0.29, Wave III: -0.17) consistent with selection bias, but this pattern disappears in the historical kampung specification (Wave I: -0.13, Wave II: -0.11, Wave III: -0.14), supporting the identification assumption.&lt;/p&gt;
&lt;p&gt;Q: What are the average effects of KIP on land values and building heights?
A: In the historical kampung specification, KIP areas have land values 14 log points (approximately 15%) lower than non-KIP historical kampungs within the same locality. The BDD estimate is similar at -17 log points. For building heights, KIP areas are 12 percentage points less likely to contain a building taller than three floors in the historical kampung sample (8 percentage points in the BDD), relative to a non-KIP control mean of 0.24 — meaning KIP areas are roughly half as likely to have tall buildings. The average effect on floors is -1.6 floors, relative to a control mean of 5 floors.&lt;/p&gt;
&lt;p&gt;Q: How do the authors validate that land value estimates are not distorted by measurement error in informal areas?
A: The authors impute the hedonic value of missing tall buildings in KIP using a hedonic regression estimated solely on non-KIP historical kampungs. KIP areas have 145 fewer buildings with more than ten floors; combined with a 57% price premium for tall buildings (relative to a base price of 13.4 million Rupiahs per square meter), the implied land value loss from missing buildings above ten floors is approximately $1.3 billion, and from buildings between four and ten floors is $0.9 billion, for a total imputed effect of $2.2 billion. This accounts for approximately 90% of the aggregate land value impact from the historical kampung specification ($2.4 billion), assuaging concerns that lower measured land values in KIP reflect data quality differences rather than true price gaps.&lt;/p&gt;
&lt;p&gt;Q: How does the KIP effect vary across the distribution of real estate potential?
A: The authors construct a predicted land index for 2,058 Jakarta hamlets by regressing non-KIP log land values on hamlet fixed effects, then rank hamlets into quintiles. In Q5 (lowest predicted land values, least likely to formalize), KIP areas show a statistically significant positive effect of +10 log points on land values, consistent with direct capitalization of the upgrades. Moving to higher-potential areas, the effect attenuates and reverses: it is -28 log points in Q2 and -30 log points in Q1, where non-KIP areas have formalized. This cross-sectional pattern traces out the dynamic inefficiency predicted by theory.&lt;/p&gt;
&lt;p&gt;Q: What informality measures does the paper construct and what do they show?
A: The paper constructs three complementary informality metrics. First, a rank-based photographic index (0 = very formal, 4 = very informal) coded by two trained Jakarta-based research assistants from approximately 28,000 hand-coded photographs, with inter-rater correlation of 0.78. Second, an attributes-based index averaging fifteen binary characteristics across vehicular access, neighborhood appearance, and structural permanence, standardized to a z-score. Third, the area share of unregistered land parcels from the Indonesian National Land Agency&amp;rsquo;s 2020 digital land maps. KIP areas score higher on all three: the rank-based index is higher by 0.29 SD units, the attributes-based index by 0.05 SD units, and the unregistered parcel share is higher by 3 percentage points.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms explain why KIP areas remain informal and have lower land values?
A: The paper identifies three mutually reinforcing mechanisms. First, KIP areas have significantly higher population density (+33 log points or 39% in the historical kampung sample, equivalent to 51 more people per pixel), which raises relocation costs. Second, KIP areas have greater land fragmentation, with 9 more parcels per pixel relative to a non-KIP mean of 19, exacerbating holdout problems during land assembly; a back-of-the-envelope calculation attributes a 9% land value effect (60% of the total 15% effect) to this channel. Third, the verbal non-eviction guarantees and improved conditions likely strengthened residents&amp;rsquo; tenure perceptions and encouraged them to stay, leading to sub-division of parcels over time. The original KIP investments show no differential effect by type after four decades, consistent with their designed 15-year useful life, and KIP areas have similar access to public amenities today.&lt;/p&gt;
&lt;p&gt;Q: How does the paper calculate surplus and what are the results?
A: The surplus framework compares KIP (informal, tends to stay informal) against non-KIP counterfactuals (more likely formal) on three dimensions: non-KIP areas have (i) higher land values, (ii) taller structures, but (iii) lower horizontal built-up coverage than slums (18% vs. 35% for KIP). Consumer surplus uses a linear demand approximation with elasticity of 0.2 for non-KIP and 0.16 for KIP (backed out from differences in housing budget shares). Producer surplus integrates a Cobb-Douglas supply curve with elasticities of 1.4 (formal) and 1.3 (informal). In Q1, KIP property value is $1,873 per square meter vs. $3,098 for non-KIP, a difference of $1,225 in value terms and $2,369 in surplus terms. The surplus gap falls to $1,044 in Q2, and halves again in Q3, becoming positive (+$347 per square meter) in Q5. Ninety percent of total surplus losses are concentrated in Q1 and Q2, which cover 47% of KIP&amp;rsquo;s area.&lt;/p&gt;
&lt;p&gt;Q: What do the case studies of kampung clearances illustrate?
A: Three Jakarta kampungs cleared in 2015-2016 are examined. Kampung Bukit Duri (Q5, lowest real estate potential) shows a surplus difference of +$572 per square meter in favor of KIP — meaning clearance there is socially inefficient. Kali Pessangrahan (Q3) shows a surplus difference of -$307. Kalijodo (Q2) shows -$910 per square meter, suggesting sizable societal gains from formalization. However, even in Kalijodo, residents were relocated 24 km away to Marunda (a Q5 area), where consumer surplus is only 46% of Kalijodo&amp;rsquo;s — illustrating that societal gains from formalization do not automatically translate into Pareto improvements for evicted residents.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks address alternative explanations?
A: The paper runs several tests. A placebo BDD using 45 non-KIP historical kampung boundaries finds no significant discontinuity, ruling out the hypothesis that slums generically have persistently lower land values. Bandwidth robustness shows consistent BDD estimates from 150 to 500 meters. Tests for spatial spillovers find no spatial decay pattern in land values near KIP boundaries, consistent with the prevalence of gated communities in formal Jakarta minimizing neighborhood contamination. Endogenous sorting is examined using 2010 Census data on 10 million individuals: educational attainment is slightly higher in KIP, and in-migration is slightly lower (1-2 percentage points below mean) with migrants having slightly more years of schooling — both inconsistent with an explanation based on low-skill sorting into KIP. Direct congestion effects from population density are also ruled out by estimating spatial decay around 45 dense non-KIP informal hamlets, finding no decay large enough to explain the land-value effects.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for slum upgrading in other developing countries?
A: The paper&amp;rsquo;s framework suggests that slum upgrading&amp;rsquo;s cost-benefit balance depends critically on where the upgraded area sits in the real estate potential distribution. In low-potential areas (bottom quintiles of the land index), upgrading delivers net surplus even decades later and implicitly provides affordable housing at scale to millions of residents. In high-potential areas (top quintiles), the opportunity costs from delayed formalization can be large — up to $2,369 per square meter in surplus terms — and the paper suggests that stronger land market institutions to share surplus with informal residents could partially mitigate these costs. The paper also notes that formalization involves complex institutional and political challenges: relocating millions of kampung residents is logistically difficult, compensation is frequently inadequate or absent, and land assembly faces severe holdout problems.&lt;/p&gt;
&lt;p&gt;Dynamic inefficiency in cities: The phenomenon, in the context of this paper, whereby preserving informal slum settlements through upgrading delays their formalization, generating opportunity costs from land misallocation as surrounding formal areas develop. Distinguished from static inefficiency: KIP may raise resident welfare while simultaneously reducing aggregate land productivity.&lt;/p&gt;
&lt;p&gt;Slum upgrading: A policy providing basic public goods improvements (roads, sanitation, community buildings) and tenure security (typically verbal non-eviction guarantees) to existing slum residents in situ, without relocating them. Contrasted with formalization (redevelopment) and sites-and-services programs.&lt;/p&gt;
&lt;p&gt;Boundary discontinuity design (BDD): The paper&amp;rsquo;s second identification strategy, comparing outcomes for observations within 200 meters on either side of KIP program boundaries, with boundary fixed effects and quadratic distance controls, under the assumption that absent KIP, unobserved real estate potential varies smoothly at program boundaries.&lt;/p&gt;
&lt;p&gt;Predicted land index: A hamlet-level index constructed by regressing non-KIP log land values on hamlet fixed effects across 2,058 Jakarta hamlets, used to proxy real estate market potential and rank neighborhoods into quintiles from highest (Q1) to lowest (Q5) development stage.&lt;/p&gt;
&lt;p&gt;Informal surplus: The surplus generated within the informal housing sector, including built-up volume from high horizontal coverage (35% for KIP kampungs) and low-cost informal structures, which is destroyed upon formalization and must be weighed against the gains from taller, higher-value formal developments.&lt;/p&gt;
&lt;p&gt;Land fragmentation: The number of distinct land parcels per unit area (pixel), measured from Jakarta&amp;rsquo;s 2011 cadastral maps. Higher fragmentation exacerbates holdout problems in land assembly, raising the cost of redevelopment and contributing to delayed formalization.&lt;/p&gt;
&lt;p&gt;Source text origin: A classification in the paper&amp;rsquo;s summarization pipeline indicating whether the paper text derives from a full PDF or open-access HTML (permitting summarization) versus abstract-only text (which blocks summarization). All claims in this summary derive from the full paper text.&lt;/p&gt;</description></item><item><title>Soft landing and inflation scares</title><link>https://macropaperwarehouse.com/papers/soft-landing-and-inflation-scares/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/soft-landing-and-inflation-scares/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Why did the 2021–2023 US inflation surge end in a soft landing — disinflation without a major recession — while the Volcker disinflation of 1979–1987 required substantial output losses? And was the timing and strength of the Federal Reserve&amp;rsquo;s reaction to the inflation surge decisive in achieving this outcome?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology and Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper develops and estimates a micro-founded Heterogeneous-Expectation New Keynesian (HENK) model in which agents hold idiosyncratic, dispersed beliefs about the long-run (steady-state) level of inflation. The key departure from full-information rational expectations (FIRE) is that information about the long-run value of inflation is dispersed and sticky: agents update their beliefs through pairwise social learning (SL), adopting the forecasting model of the agent whose belief produced lower recent inflation forecast errors. This tournament process — inspired by genetic algorithms — generates a time-varying cross-sectional distribution of subjective inflation beliefs.&lt;/p&gt;
&lt;p&gt;The model admits a closed-form solution that retains the entire time-varying distribution of beliefs and can be estimated with standard full-information Bayesian methods using the inversion filter (Cuba-Borda et al. 2019). The FIRE benchmark is nested as the special case in which the average belief deviation from the target is zero at all times.&lt;/p&gt;
&lt;p&gt;Estimation uses four US macroeconomic observables (output gap, CPI inflation, one-quarter-ahead average SPF inflation expectation, and the proxy funds rate of Choi et al. 2022 that captures both conventional and unconventional monetary policy) over 1985Q1–2023Q4. A formal model comparison rejects the RE null hypothesis (p &amp;lt; 0.0001) in favor of the HENK specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings With Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Inflation scares are endogenous&lt;/strong&gt;: In the model, inflation scares arise whenever repeated above-target inflation outcomes validate and diffuse above-target beliefs through social interactions. Under the historical scenario, the share of agents holding long-run inflation beliefs between 1 and 3 percent (annualized) falls to 40 percent in mid-2022 before recovering above 90 percent by end-2023, indicating a partial but not complete unanchoring of expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Timing dominates strength&lt;/strong&gt;: Counterfactual simulations show that the timing — not the strength — of the Fed&amp;rsquo;s reaction to the inflation surge is the key determinant of inflation expectations management and subsequent macroeconomic outcomes. Varying the Taylor-rule inflation coefficient by +/-10 percent (moving from 1.64 to 2.00) produces negligible differences in inflation and output gap dynamics, with welfare ratios of 1.052 and 0.981 relative to benchmark respectively under the ad-hoc loss function. By contrast, varying the timing via the interest-rate smoothing parameter by +/-10 percent produces much larger divergences.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The Fed fell behind the curve&lt;/strong&gt;: Under a scenario in which the Fed had strictly followed its estimated Taylor rule (removing the negative monetary policy shocks observed from mid-2020 to mid-2022), inflation would have peaked approximately 3 percentage points lower on a yearly basis. Inflation expectations would have remained lower for almost a year longer, and the subsequent rise in expectations would have been more gradual and lower-peaking. Crucially, the output gap in this preemptive-tightening scenario would have been only briefly negative (in 2022Q2) and not deep enough to trigger a recession.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Further delays would have been highly costly&lt;/strong&gt;: A delay of the tightening by one, two, four, or eight quarters would have produced successively worse outcomes. A two-year delay generates runaway inflation and 100 percent loss of target credibility (complete unanchoring). A delay of approximately three quarters would have resulted in a sizable, self-reinforcing entrenchment of above-target inflation expectations. The welfare cost of an eight-quarter delay is 5.76 times the benchmark loss under the ad-hoc measure (1.167 under the microfounded measure).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Early rate cuts would have reignited inflation&lt;/strong&gt;: A counterfactual 100-basis-point cut as early as 2022Q3 would have pushed annual inflation approximately 2 percent above the historical scenario through end-2023, with inflation expectations rebounding by about 1 percent (annualized) immediately after the cut. Under no early-cut scenario would inflation or expectations have converged back to target by end-2023.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Expectation heterogeneity amplifies shocks&lt;/strong&gt;: Greater initial dispersion in beliefs amplifies and prolongs the impact of all shocks (demand, supply, monetary policy, expectation). After a one-standard-deviation cost-push shock, higher initial belief dispersion produces larger and more persistent deviations in inflation, output, and interest rates. The model-implied interquartile range of beliefs is correlated 0.538 with the SPF interquartile range and the cross-sectional standard deviation is correlated 0.483 (both p &amp;lt; 0.001).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Historical decomposition&lt;/strong&gt;: Over the 2010s, negative expectation shocks account for a substantial fraction of the persistent below-target inflation (&amp;ldquo;missing inflation&amp;rdquo;). From approximately mid-2022 onward, positive expectation shocks account for most of the variance of inflation in the model. The recent disinflation is attributed to a combination of: easing supply pressures, normalization of monetary policy, and re-anchoring of inflation expectations.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results are conditional on the estimated HENK model applied to US data, 1985Q1–2023Q4, using a stylized three-equation NK backbone (no labor market dynamics, no financial sector, no capital). The proxy funds rate is more volatile than the federal funds rate, which affects the welfare comparison for large preemptive tightening scenarios. Counterfactual scenarios are implemented through unexpected monetary policy shocks; anticipated shocks would only strengthen the inflationary effects of delays.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core mechanism by which an inflation scare can develop in the HENK model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When inflation repeatedly exceeds the target — whether due to shocks or delayed policy — agents whose beliefs are already above-target incur lower forecast errors than those anchored at the target. During pairwise social interactions (the tournament step of social learning), above-target beliefs spread through the population because they are selected as the &amp;ldquo;better&amp;rdquo; forecasting model. The resulting upward shift in the average belief feeds higher inflation through the New Keynesian Phillips Curve, which validates above-target beliefs further, creating a self-reinforcing loop. This mechanism differs from rational-expectations models, where beliefs mean-revert automatically.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the model retain a closed-form solution despite the nonlinearity of the social-learning process?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Two assumptions deliver the closed-form. First, beliefs are private and dispersed (Assumption 1): agents observe only the belief of their matched mate, not the population distribution. Second, a quasi-rational-expectations (quasi-RE) observer treats aggregate beliefs as a random walk in expectations (Assumption 2: a martingale). Under these conditions, the aggregate subjective inflation expectation equals the average subjective belief about steady-state inflation plus the rational-expectations forecast. This augmented minimum-state-variable (MSV) solution can be estimated with full-information methods (the inversion filter) via standard Dynare tooling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What data are used and how are observables mapped to model variables?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The estimation uses four quarterly US observables from 1985Q1–2023Q4: the output gap (real GDP from FRED, HP-filtered with a one-sided adjusted filter); the CPI inflation rate (CPIAUCSL, FRED); one-quarter-ahead average CPI inflation expectation from the Survey of Professional Forecasters (CPI3); and the proxy funds rate of Choi et al. (2022), which captures both QE and QT so that unconventional monetary policy is reflected in the instrument. Inflation and expectations are demeaned by the sample average to express them as deviations from steady state. The discount factor is calibrated at 0.99; all other parameters are estimated via Bayesian methods with Metropolis-Hastings (8 parallel chains x 100,000 iterations, acceptance rate ~30%).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the key estimated parameter values for the social-learning block?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The posterior mean of the decay parameter in the fitness evaluation (discounting of past forecast errors) is 0.775, implying a half-life of past forecast errors of approximately 3 quarters. The frequency of news shocks has a posterior mean of 0.436, meaning approximately 40 percent of agents receive an inflation news shock every quarter. The standard deviations of the aggregate and idiosyncratic news shocks are very small (posterior means of 0.0004 and 0.0006, respectively) but strictly positive. The 95 percent confidence intervals for both exclude zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the HENK model outperform the RE benchmark in fitting the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Formal model comparison rejects the RE null (p &amp;lt; 0.0001) with equal prior model weights (50/50). On second moments, only the HENK model replicates positive autocorrelation in inflation (0.428 vs. 0.162 for RE, against an empirical interval of [0.239; 0.579]), in inflation expectations (0.824 vs. 0.161, empirical interval [0.839; 0.927]), and in inflation forecast errors (0.122 vs. -0.145). Additionally, the HENK model reproduces the untargeted cross-sectional dispersion of beliefs over the business cycle, including the increase during the GFC and the COVID-19 era and the low dispersion during the Great Moderation — with correlations of 0.538 and 0.483 between model and SPF dispersion measures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What does the historical shock decomposition reveal about the recent inflation surge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The decomposition (Section 3.3) shows that in the initial phase of the COVID-19 shock (2020Q2-Q3), negative demand and monetary policy shocks drove inflation down. Adverse cost-push (supply) shocks dominate from early 2021 into 2022. Expectation shocks — the contribution of dispersed beliefs — are negative throughout the 2010s (explaining part of the &amp;ldquo;missing inflation&amp;rdquo;) and remain briefly negative at the pandemic&amp;rsquo;s onset before turning sharply positive and driving most of the variance of inflation in the final two years of the sample (2022-2023). The loose monetary policy stance (negative monetary policy shocks from mid-2020 to mid-2022, visible in the Taylor-rule residuals) also contributes substantially to the inflation dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the Taylor-rule counterfactual show, and why doesn&amp;rsquo;t preemptive tightening cause a recession in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Removing the monetary policy shocks after 2020Q4 so that the proxy rate follows the estimated Taylor rule would have reduced the inflation peak by approximately 0.75 percentage points per quarter (equivalent to about 3 percentage points annualized) and kept expectations lower-anchored for almost a year longer. The output gap under the Taylor-rule scenario is only briefly negative (2022Q2) and does not constitute a recession. This occurs because the preemptive tightening exploits the sluggishness of subjective expectations stemming from information frictions: by raising rates earlier when beliefs are still anchored (or only weakly above target), the CB prevents the social-learning mechanism from diffusing above-target beliefs, which in turn softens the stabilization trade-off between inflation and output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the U-shaped welfare relationship between preemptive tightening size and welfare?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Both the ad-hoc and microfounded welfare measures show a U-shaped relationship as the size of the front-loaded tightening in 2021Q1 increases from 100 bps to 400 bps to 800 bps. At 100 bps, the welfare ratio is 0.336 (ad-hoc, improvement over benchmark at 1.0); at 400 bps it improves further to 0.304; but at 800 bps (front-loading the entire subsequent tightening cycle) the ratio rises to 0.555, reflecting that the output costs of a very large early rate increase become prohibitive amid the series of supply shocks that hit in 2022. The maximum welfare gain in the microfounded criterion occurs at a slightly larger early increase than in the ad-hoc criterion, attributed to the absence of a financial sector and use of the more volatile proxy funds rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Does increasing the hawkishness of the Taylor rule compensate for falling behind the curve?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. Varying the inflation reaction coefficient by +/-10 percent (to 2.00 for &amp;ldquo;hawk&amp;rdquo; and 1.64 for &amp;ldquo;dove&amp;rdquo;) from the posterior mean of approximately 1.82 produces negligible differences in inflation and output gaps. The hawkish scenario achieves marginally earlier rate increases but does not reduce the inflation gap relative to the historical benchmark. Welfare ratios are 0.960 (hawkish, slight improvement) and 1.057 (dovish, slight deterioration) under the ad-hoc measure, and 0.981 and 1.052 under the microfounded measure. The joint simulations varying both smoothing (timing) and hawkishness (strength) confirm that timing is the dominant factor: the two &amp;ldquo;earlier reaction&amp;rdquo; scenarios are clustered together and well-separated from the two &amp;ldquo;later reaction&amp;rdquo; scenarios, regardless of the inflation coefficient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the model handle the role of initial belief dispersion in monetary policy transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Impulse response function exercises varying the initial standard deviation of beliefs (as a share of the maximum model-generated standard deviation under the filtered shocks) show that greater initial dispersion uniformly amplifies and prolongs the macroeconomic response to all shock types (demand, cost-push, monetary policy, expectation). The mechanism is: greater dispersion means the population contains more &amp;ldquo;extreme&amp;rdquo; (far-from-target) beliefs; a shock that temporarily moves inflation off target temporarily validates extreme beliefs (lower forecast errors), causing them to spread in social interactions and shift the average belief further from target. This raises nominal rates (through the Taylor rule), deepens output losses, and prolongs the return to steady state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What are the implications of early interest rate cuts in the counterfactual scenarios?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: A 100-basis-point cut in any quarter from 2022Q3 through 2023Q2 would have reignited inflation expectations. The 2022Q3 scenario is most severe: expectations rebound approximately 1 percentage point higher (annualized) immediately post-cut, and annual inflation remains on average 2 percent above the historical path through end-2023. Across all early-cut scenarios, neither inflation nor inflation expectations would have returned to target by end-2023; instead, inflation would have been landing approximately 2 percent above the 2 percent target. The welfare ratios for early cuts range from 1.200 (cut in 2022Q3) down to 1.079 (cut in 2023Q2) under the ad-hoc measure — all welfare-worsening.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Inflation scare (Goodfriend 1993, as used in this paper)&lt;/strong&gt;: A situation in which the public&amp;rsquo;s long-run inflation expectations become unanchored from the central bank&amp;rsquo;s target, making beliefs about above-target steady-state inflation self-fulfilling via the New Keynesian Phillips Curve. In the HENK model, a scare arises endogenously when above-target inflation outcomes repeatedly validate above-target beliefs, causing them to spread through social interactions. Measured in the paper by the share of idiosyncratic beliefs falling between 1 and 3 percent (annualized); lower share = more severe scare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social learning (SL)&lt;/strong&gt;: The belief-updating mechanism in which agents are paired at random each period and compare their inflation forecasting models; the agent whose model produced lower recent forecast errors (measured by the discounted sum of squared forecast errors with half-life approximately 3 quarters) is adopted by both members of the pair. This evolutionary tournament process — analogous to a genetic algorithm — generates a nonlinear, history-dependent distribution of beliefs that can drift persistently away from the target.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Steady-state learning&lt;/strong&gt;: The restriction that agents&amp;rsquo; heterogeneous beliefs concern only the low-frequency (intercept) component of inflation — i.e., their subjective perception of the steady-state inflation rate — while the rest of their inflation forecast (the effects of transitory shocks and lagged variables) coincides with rational expectations. This assumption, combined with internal rationality, permits a closed-form MSV solution of the HENK model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal rationality&lt;/strong&gt;: The assumption that each agent uses a perceived law of motion that is consistent with the true MSV solution of the HENK economy (including the effect of heterogeneous beliefs on dynamics), even if their intercept differs from the rational-expectations value. Agents internalize how the aggregate deviation of expectations from RE affects inflation, but they disagree about the long-run level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quasi-rational-expectations (quasi-RE) observer&lt;/strong&gt;: An observer (or central bank) who, lacking information about how individual private beliefs are formed and aggregated, treats aggregate beliefs as a martingale — i.e., the expected future aggregate belief equals its current value. This assumption closes the model and permits estimation with full-information (inversion filter) methods, while preserving consistency between subjective beliefs and the law of motion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Belief dispersion / expectation heterogeneity&lt;/strong&gt;: The time-varying cross-sectional standard deviation (or interquartile range) of idiosyncratic beliefs in the population. In the model this is an endogenous, history-dependent outcome of the SL process. Greater dispersion amplifies the response of all macroeconomic variables to any shock by providing more &amp;ldquo;extreme&amp;rdquo; beliefs that can gain traction in pairwise tournaments when inflation temporarily deviates from target. Measured empirically by the interquartile range and standard deviation of individual SPF forecasts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Proxy funds rate (Choi et al. 2022)&lt;/strong&gt;: A summary measure of the US monetary policy stance that incorporates both conventional interest rate policy and the effects of unconventional policies (quantitative easing and tightening), used in the paper in place of the federal funds rate to capture the full stance of monetary policy in the estimation and historical decomposition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Inversion filter (Cuba-Borda et al. 2019)&lt;/strong&gt;: A computationally efficient estimation algorithm that, rather than the Kalman or particle filter, inverts the observation equation analytically to recover the sequence of structural shocks for a given parameter vector. It enables full-information Bayesian estimation of the nonlinear HENK model by separating the linear part of the solution from the nonlinear social-learning residual.&lt;/p&gt;</description></item><item><title>Spatial Implications of Telecommuting</title><link>https://macropaperwarehouse.com/papers/spatial-implications-of-telecommuting/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/spatial-implications-of-telecommuting/</guid><description>&lt;p&gt;Delventhal and Parkhomenko build a quantitative spatial model of the United States to study how the rise of telecommuting reshapes the distribution of residents, jobs, and housing costs across and within cities. The model divides the continental U.S. into 4,502 locations (defined as intersections of Census PUMAs and counties) and allows each worker to choose any residence-job pair. Workers differ by education (college vs. non-college) and occupation type (telecommutable vs. non-telecommutable). Telecommutable workers can split labor time between on-site and remote work; their remote-work intensity responds endogenously to relative remote productivity, a work-from-home aversion parameter, home floorspace costs, and commute time.&lt;/p&gt;
&lt;p&gt;The model is calibrated to pre-2020 U.S. data (2012–2016 ACS, 2018 SIPP, 2017 NHTS). Key calibrated facts include: 33.6% of workers have telecommutable jobs (40.6% of non-college, 72.7% of college workers); remote work is nearly as productive as on-site work (relative productivity 0.99–1.00); elasticities of substitution between work modes range from 3.48 to 5.05; and work-from-home aversion parameters range from 2.48 to 3.35, indicating large non-pecuniary barriers especially for non-college workers in non-tradable sectors.&lt;/p&gt;
&lt;p&gt;The counterfactual simulates a permanent increase in remote work driven by an 8–10% rise in remote productivity and a fall in work-from-home aversion, guided by Barrero, Bloom, and Davis (2021) survey evidence. Results show net reallocation of jobs and residences equivalent to nearly 5% of the population.&lt;/p&gt;
&lt;p&gt;Main spatial findings exhibit a non-monotonic pattern. Telecommutable residents move away from dense, high-cost locations toward sparser areas with lower housing costs and better amenities. Non-telecommutable residents partially counteract this by centralizing — moving toward denser areas as housing costs fall near job centers. Non-tradable jobs follow telecommuters outward. Tradable jobs move in both directions: some firms relocate to low-density areas with newly accessible remote worker pools; others expand in the largest, most productive city centers as office space costs fall and the catchment area of workers widens.&lt;/p&gt;
&lt;p&gt;In aggregate: the average worker lives 47% farther (in commuting time) from their workplace but spends 25% less time commuting, because average remote-work frequency rises by 1.1 days per week. The share of workers living in one commuting zone and working in another increases from 24.6% to 34%. Average income falls marginally by 1%, masking large gains for telecommutable workers and losses for non-telecommutable workers. Average floorspace prices fall by 2%; non-tradable prices rise by 2.6%. Overall welfare increases by an average of 12.7%, driven by gains for telecommutable workers, while non-telecommutable workers experience net losses.&lt;/p&gt;
&lt;p&gt;The model predicts a partial reversal of the &amp;ldquo;Great Divergence&amp;rdquo;: skill sorting falls both within and across commuting zones, residential income inequality across CZs falls, and house price dispersion falls both within and across cities. These predictions are directionally consistent with 2019–2023 data.&lt;/p&gt;
&lt;p&gt;Scope conditions: results are for a permanent shock to the full-time U.S. workforce as modeled in 2012–2016; the model does not predict the end of big cities but rather a reallocation at the margin. The model shows that the introduction of telecommuting narrows the parameter range guaranteeing a unique spatial equilibrium, because remote-capable firms can draw from a broader worker catchment area, amplifying agglomeration forces.&lt;/p&gt;
&lt;p&gt;Q: What are the four stylized facts about pre-2020 telecommuting that discipline the model?
A: Fact 1: telecommutability is higher for college workers and those in tradable industries — 68.8% of college-tradable workers can work from home versus 18.9% of non-college non-tradable workers. Fact 2: among telecommutable workers, uptake is also higher for college-tradable workers (38% actually work from home at least one day per week) than for non-college non-tradable workers (21%). Fact 3: the distribution of remote-work frequency is bimodal — most workers are either fully on-site or fully remote, with the bimodality less pronounced for college-tradable workers where hybrid (1–4 days/week) accounts for over 11% of paid workdays. Fact 4: there is a positive relationship between work-from-home frequency and distance from the job site, consistent with telework reducing effective commuting costs.&lt;/p&gt;
&lt;p&gt;Q: How is the counterfactual shock calibrated and what drives it?
A: The counterfactual raises remote-work productivity by 8–10% across all worker types and simultaneously reduces work-from-home aversion, guided by Barrero, Bloom, and Davis (2021) survey evidence that 25–30% of paid workdays will be remote post-pandemic, compared to about 8% in 2018. The authors consider both a technology shock (productivity increase) and a preference shock (aversion decrease) as mechanisms, consistent with their view that multiple hypotheses about the COVID-19 telework shock are plausible and non-exclusive.&lt;/p&gt;
&lt;p&gt;Q: How do residents reallocate in response to the rise in telecommuting?
A: Net reallocation of residents equivalent to nearly 5% of the population occurs. Telecommutable residents decentralize — moving to less dense areas with lower housing costs and better amenities — because the cost of choosing a residence far from work falls. Non-telecommutable residents partially centralize, moving toward denser locations in larger metro areas, because housing costs fall in locations with short commutes, making them more affordable.&lt;/p&gt;
&lt;p&gt;Q: How do jobs reallocate?
A: Non-tradable jobs follow the decentralization of residents (their source of demand) monotonically to less dense locations. Tradable jobs move in both directions: some firms relocate to low-density areas that can now access a larger pool of remote workers at lower real estate costs; others expand operations in the highest-productivity city centers, benefiting from both an expanded catchment of remote workers and a decline in the high cost of office space.&lt;/p&gt;
&lt;p&gt;Q: What are the aggregate commuting implications?
A: The average worker lives 47% farther in commuting time from their workplace in the counterfactual, yet spends 25% less time commuting, because average remote-work frequency increases by 1.1 days per week. The share of workers living in one commuting zone and working in another rises from 24.6% to 34%, which the authors note may call into question current administrative definitions of commuting zones and have major impacts on travel patterns.&lt;/p&gt;
&lt;p&gt;Q: What are the welfare and income effects?
A: Overall welfare increases by an average of 12.7%, but this masks very unequal distribution: telecommutable workers experience large gains while non-telecommutable workers suffer losses. Average worker income falls marginally by 1%, reflecting sizable gains for remote-capable workers offset by losses for those who cannot telecommute. Average floorspace prices fall by 2%, while non-tradable goods prices rise by 2.6%.&lt;/p&gt;
&lt;p&gt;Q: What does the model predict for the &amp;ldquo;Great Divergence&amp;rdquo;?
A: The model predicts a significant re-convergence across multiple dimensions: skill sorting falls both within and across commuting zones, residential wage inequality across CZs falls, and house price dispersion falls both within and across cities. The authors find that commuting zones with higher college shares in 2019 experienced slower growth in college shares 2019–2023, and that there is a negative correlation between average wages by CZ in 2019 and wage growth 2019–2023 — both consistent with model predictions.&lt;/p&gt;
&lt;p&gt;Q: How does the model validate against post-2019 data?
A: The authors show that their counterfactual results are positively correlated with observed changes in population, jobs, and housing rents since 2019. Within-city price variance has already converged in 2019–2023 data, consistent with model predictions. CZ-level patterns of skill concentration and wage growth also move in the direction the model predicts.&lt;/p&gt;
&lt;p&gt;Q: Is the COVID-19 shock better described as a technology shock or a preference shock?
A: The authors test both. To replicate observed changes in remote-work frequency using only a productivity shock requires a 55–99% jump in remote productivity, which yields implausibly large wage gains for remote-capable workers of 47–82%. The preference-based scenario yields results more consistent with observed data, supporting the view that a preference shock — changes in norms, attitudes, and institutional policies — is the primary driver.&lt;/p&gt;
&lt;p&gt;Q: What happens to real estate prices when supply and amenities are held fixed?
A: When real estate supply, productivity, and amenities are all held fixed, residential prices jump by 16% and commercial prices fall by 16%. The authors note this mimics the bifurcated shift in real estate values observed during the pandemic years, suggesting that supply responses and amenity adjustments are important for dampening the price effects in the full model.&lt;/p&gt;
&lt;p&gt;Q: How does the model handle the uniqueness of spatial equilibrium, and how does telecommuting affect it?
A: In a standard quantitative spatial model, agglomeration forces are dampened by the finite pool of workers willing to commute daily to a productive location. When telecommuting is introduced, productive locations can draw workers from a much broader catchment area, amplifying agglomeration forces and narrowing the range of parameter values for which a unique equilibrium is guaranteed. The authors establish conditions under which uniqueness is preserved.&lt;/p&gt;
&lt;p&gt;Q: What are the model&amp;rsquo;s three main advantages over more stylized spatial models of remote work?
A: First, by including 4,502 locations, the model can predict how far telecommuters will move from their jobs — a key variable for real estate markets and commuting patterns. Second, it can represent changes in the distribution of workers across different work-from-home frequencies, which is crucial as hybrid work has emerged as the dominant post-pandemic arrangement. Third, it predicts how the location of jobs (not just residents) changes, which has important implications for city centers.&lt;/p&gt;
&lt;p&gt;Q: What is the overall welfare conclusion regarding non-telecommutable workers and income inequality?
A: Non-telecommutable workers suffer welfare losses from the rise of remote work, even as overall average welfare rises by 12.7%. The overall income inequality — as opposed to spatial wage dispersion — does not fall. The authors note this means the spatial re-convergence does not translate into a broader reduction in income inequality, which they flag as an important limitation for policy.&lt;/p&gt;
&lt;p&gt;Telecommutability: the ability of a worker&amp;rsquo;s occupation to be performed from home, measured using Dingel and Neiman (2020) occupational classifications; varies by education and industry, with 68.8% of college-tradable workers telecommutable versus 18.9% of non-college non-tradable workers.&lt;/p&gt;
&lt;p&gt;Work-from-home aversion (ς): a preference parameter representing tastes, norms, and institutional policies that create non-pecuniary barriers to remote work; calibrated to range from 2.48 to 3.35 across worker types, higher for non-college workers in non-tradable sectors.&lt;/p&gt;
&lt;p&gt;Hybrid work: an arrangement in which a telecommutable worker splits paid workdays between on-site and remote work (1–4 days per week from home); the model&amp;rsquo;s bimodal distribution of work-from-home frequency replicates the empirical observation that most workers are either fully on-site or fully remote, with hybrid most prevalent among college-tradable workers.&lt;/p&gt;
&lt;p&gt;Catchment area: the pool of workers from which a firm can practically hire, which widens under telecommuting because workers no longer need to commute daily; this widening amplifies agglomeration forces and narrows the parameter range guaranteeing a unique spatial equilibrium.&lt;/p&gt;
&lt;p&gt;Great Divergence: the multi-decade trend (documented in Moretti 2012 and related work) of spatially concentrating talent, income, and housing costs in a small number of large, high-skill cities; the paper predicts a partial reversal — &amp;ldquo;Great Re-Convergence&amp;rdquo; — driven by the rise of telecommuting.&lt;/p&gt;
&lt;p&gt;Productive externalities (agglomeration): local productivity in the model depends on employment density; remote workers participate in these externalities only partially (parameter ψ ∈ [0,1]), so the shift to remote work can reduce agglomeration benefits in city centers.&lt;/p&gt;
&lt;p&gt;Source text origin: the paper&amp;rsquo;s own classification of the text on which a summary is based (full PDF, open-access HTML, or abstract-only); the paper&amp;rsquo;s CLAUDE.md rules mandate that abstract-only summaries are blocked.&lt;/p&gt;</description></item><item><title>Spread too thin: The impact of lean inventories</title><link>https://macropaperwarehouse.com/papers/spread-too-thin-the-impact-of-lean-inventories/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/spread-too-thin-the-impact-of-lean-inventories/</guid><description>&lt;p&gt;This paper investigates the macroeconomic consequences of widespread just-in-time (JIT) inventory management, documenting a fundamental trade-off: JIT raises firm profitability and reduces micro-level volatility in normal times, but renders the economy significantly more vulnerable to large unanticipated shocks.&lt;/p&gt;
&lt;p&gt;The empirical analysis draws on a novel dataset of approximately 200 publicly listed U.S. manufacturing firms for which the author identifies JIT adoption years using narrative records from SEC filings and historical news archives. Firm-level balance sheet data come from Compustat Fundamentals Annual (1980–2018), merged with county-level weather event data from NOAA. Four empirical facts are documented. First, JIT adoption is associated with a 13% decrease in inventory-to-sales ratios and a 9% increase in sales. Second, JIT adopters experience a roughly 7% decline in sales and employment growth volatility. Third, JIT adopters are approximately 25–30% more cyclical than non-adopters: a 1% increase in GDP growth predicts an additional 0.47 percentage point increase in JIT firm sales growth above the non-adopter baseline of roughly 1.6%. Fourth, a weather disaster predicts an additional 3% decline in JIT firm sales and employment relative to non-JIT firms.&lt;/p&gt;
&lt;p&gt;To explain and quantify these facts, the author builds and structurally estimates a dynamic general equilibrium model with a distribution of heterogeneous final goods firms that differ in idiosyncratic productivity, inventory holdings, and JIT adoption status. Materials must be drawn from inventory stocks; new orders are subject to stochastic fixed order costs. JIT producers draw from a first-order stochastically dominated order cost distribution relative to non-JIT firms. Adopting JIT requires an upfront sunk cost and a smaller continuation cost thereafter. The model is estimated via simulated method of moments (SMM) targeting 11 moments (adoption frequency, inventory-to-sales ratios, covariances, and spike frequencies for both firm types), with nine parameters to be estimated.&lt;/p&gt;
&lt;p&gt;In the estimated model steady state, JIT adoption delivers a 9–10% increase in output, a 40% decline in the aggregate inventory-to-sales ratio (close to the observed 35% decline in nonfarm inventories-to-final-sales from 1980 to 2018), a 1.3% increase in firm value, a 1.3% increase in measured TFP, and a welfare gain of 1.43% in consumption equivalent terms. These gains arise because lower order costs allow firms to better align material input use with realized productivity, smoothing inventory cycles.&lt;/p&gt;
&lt;p&gt;The vulnerability side is quantified through an unanticipated supply disruption calibrated to match the 3.4% drop in real U.S. GDP between 2019 and 2020. In response, the JIT economy experiences an approximately 0.40 percentage point excess output contraction relative to the no-JIT counterfactual, amounting to roughly 13–15% more output lost. The mechanisms are stockouts — firms that fully exhaust their inventories and cannot produce — and hoarding behavior, whereby firms that retain some inventory draw stocks down more slowly to preserve buffers, reducing material input use. Both channels reduce production relative to the counterfactual. The excess output loss is estimated at approximately $100 billion, comparable to state and local government allocations under the CARES Act.&lt;/p&gt;
&lt;p&gt;JIT nevertheless remains welfare-improving even under this shock. For a social planner to prefer a no-JIT world, the negative productivity shock to the intermediate goods sector would need to be nearly 14% — an order of magnitude larger than the calibrated COVID-19 shock. The trade-off is robust across alternative order cost distributions, parameterizations, partial anticipation scenarios, and stockout cost specifications.&lt;/p&gt;
&lt;p&gt;Q: What is the central trade-off identified by the paper?
A: JIT adoption reduces fixed order costs, enabling firms to place smaller and more frequent orders, which raises sales, reduces micro-level volatility, and increases firm value and welfare in normal times. However, because JIT firms hold fewer inventories, an unexpected aggregate shock increases the likelihood of stockouts and hoarding behavior, producing a deeper aggregate output contraction relative to an economy without JIT. Firms do not internalize the prospect of large shocks when making their private adoption decisions, generating the externality at the heart of the trade-off.&lt;/p&gt;
&lt;p&gt;Q: How does the paper measure JIT adoption, and how large is the sample?
A: The author constructs an adoption dummy for approximately 200 publicly listed manufacturing firms by exhaustively reviewing SEC filings and historical news archives for keywords including &amp;ldquo;JIT,&amp;rdquo; &amp;ldquo;just-in-time,&amp;rdquo; &amp;ldquo;lean manufacturing,&amp;rdquo; &amp;ldquo;pull system,&amp;rdquo; and &amp;ldquo;zero inventory.&amp;rdquo; Each document is individually analyzed to confirm the adoption year and to ensure it refers to the firm itself rather than its suppliers. More than half of observed adopters in the sample adopt prior to 1990, and nearly all adopt before 2000. The final Compustat-linked sample covers about 5,017 unique manufacturing firms from 1980 to 2018.&lt;/p&gt;
&lt;p&gt;Q: What are the firm-level efficiency gains from JIT adoption?
A: JIT adoption is associated with a 13% decrease in inventory-to-sales ratios and a 9% increase in sales; the corresponding standard deviation changes are –16% and +4%, respectively. Adopters also experience a roughly 7% decline in both sales and employment growth volatility, and a 5% increase in sales per worker relative to non-JIT firms. JIT firms additionally show a roughly 20% standard deviation reduction in squared forecast errors, indicating improved predictability of profitability.&lt;/p&gt;
&lt;p&gt;Q: How much more cyclical are JIT firms relative to non-JIT firms?
A: A 1% increase in GDP growth is associated with approximately a 1.6% increase in sales growth for non-adopters; JIT adopters experience an additional 0.47 percentage point increase above this baseline, making them roughly 25–30% more cyclical. This elevated cyclicality is estimated from variation external to the firm and reflects the heightened sensitivity of lean producers to aggregate demand fluctuations.&lt;/p&gt;
&lt;p&gt;Q: How are JIT firms affected by local weather disasters?
A: On average, a weather disaster predicts an additional 3% decline in JIT firm sales and employment relative to non-JIT firms. Using upstream supply chain linkages from Compustat Segment files, a unit increase in the average number of disasters hitting a firm&amp;rsquo;s suppliers predicts a 7–8% decline in firm sales and employment, with a similar excess decline for JIT firms. These results parallel the strategy in Barrot and Sauvagnat (2016).&lt;/p&gt;
&lt;p&gt;Q: What is the model structure, and how does the JIT adoption decision work?
A: The model features a representative household, a representative intermediate goods firm producing materials with capital and labor, and a continuum of heterogeneous final goods firms that differ in idiosyncratic productivity (AR(1) in logs), inventory holdings, and JIT adoption status. Each period has three stages: adoption decision, order decision (conditional on stochastic fixed order cost draw), and production decision. JIT producers draw order costs from a distribution first-order stochastically dominated by the non-JIT distribution, meaning JIT firms face systematically lower expected order costs. Adoption requires an upfront sunk cost c_s; maintaining JIT requires a smaller continuation cost c_f (estimated at slightly more than one-third of c_s), generating hysteresis: conditional on being an adopter, the probability of remaining one is estimated at 94%.&lt;/p&gt;
&lt;p&gt;Q: What moments are targeted in the SMM estimation, and how well does the model fit?
A: Eleven moments are targeted to identify nine parameters: the empirical adoption frequency, plus five moments each for JIT and non-JIT firms (mean inventory-to-sales ratio, the covariance matrix of inventory-to-sales ratios and log sales delivering three moments, and the frequency of positive inventory-to-sales ratio spikes exceeding 0.20). The model successfully fits targeted moments; non-targeted regression coefficients reproduce a quantitatively similar reduction in inventory-to-sales ratios after adoption, a comparable increase in sales among adopters, and reductions in firm volatility of 4–5% versus 6–7% in the data.&lt;/p&gt;
&lt;p&gt;Q: What are the estimated key structural parameters?
A: The upper support of the order cost distribution among non-adopters is estimated to be an order of magnitude larger than that of adopters, implying JIT firms place orders about 45% smaller than non-JIT firms. The estimated carrying cost is about 20% of inventory value. The estimated share of non-adopters in the model&amp;rsquo;s steady state implies a mass of JIT establishments of approximately 0.40. The technology parameters for the idiosyncratic productivity process are consistent with prior estimates in the structural firm dynamics literature.&lt;/p&gt;
&lt;p&gt;Q: What are the steady-state aggregate gains from JIT adoption in the model?
A: Relative to a counterfactual economy with no JIT option, the estimated model delivers a 9–10% increase in output, a 40% decline in the aggregate inventory-to-sales ratio (close to the observed 35% decline from 1980 to 2018), a 1.3% increase in firm value, a 1.3% increase in measured TFP, and a welfare gain of 1.43% in consumption equivalent terms. The TFP gain arises because lower order costs reallocate resources toward high marginal product producers at the aggregate level.&lt;/p&gt;
&lt;p&gt;Q: How is the unanticipated disaster calibrated, and what are its effects in the JIT versus no-JIT economies?
A: The disaster is an unanticipated negative shock to aggregate productivity in the intermediate goods sector, calibrated to match the 3.4% drop in real U.S. GDP between 2019 and 2020. In response, the JIT economy experiences approximately a 0.40 percentage point excess output contraction relative to the no-JIT counterfactual, amounting to roughly 13–15% more output lost. This excess loss equals approximately $100 billion, comparable to CARES Act allocations to state and local governments.&lt;/p&gt;
&lt;p&gt;Q: What are the two mechanisms through which JIT amplifies the disaster shock?
A: The first mechanism is stockouts: because JIT firms hold fewer inventories, an unexpected spike in order costs makes them more likely to fully exhaust their existing stocks, leaving them with no material inputs and forcing them to forgo production entirely. The second mechanism is hoarding: firms that do not fully stock out face a higher shadow value of inventories and cut back on material input use to draw inventories down more slowly, reducing output even without a full stockout. Both mechanisms reduce material input utilization in the JIT economy, causing a sharper drop in sales relative to the counterfactual.&lt;/p&gt;
&lt;p&gt;Q: Is JIT still welfare-improving when the COVID-19 shock is accounted for?
A: Yes. A social planner comparing welfare across steady states would not prefer to eliminate JIT even accounting for the deeper crisis it generates. For the planner to prefer a no-JIT world, the negative productivity shock to the intermediate goods sector would need to be nearly 14% — an order of magnitude larger than the calibrated 3.4% shock. This implies that the welfare gains from JIT in normal times substantially outweigh the welfare costs of the deeper recession under a COVID-19-scale shock.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to the Great Moderation literature?
A: JIT adoption is credited in prior work (McConnell and Perez-Quiros, 2000; Blanchard and Simon, 2001; Kahn et al., 2002) as contributing to the roughly 35% reduction in the aggregate inventory-to-sales ratio between 1980 and 2018 and to the broader decline in macroeconomic volatility. The estimated model is consistent with this: JIT adoption reduces firm-level volatility and, in the steady state, implies a reduction in aggregate inventory-to-sales ratios close to the observed magnitude. However, the paper documents that the same forces that smooth normal-times fluctuations amplify unanticipated large shocks.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks does the paper conduct?
A: The paper considers alternate parameterizations (all robustly show the micro-macro trade-off), larger disaster sizes calibrated to UK and France 2020 contractions (JIT economy contracts ~10% vs. ~8.7%, a ~15% larger contraction), partial anticipation (a sizable excess output drop persists because the left tail of firm outcomes is truncated at zero profits), stockout costs (trade-off remains with ~1.2% firm value gain and ~10% excess contraction), and an alternative right-skewed beta order cost distribution (firm value gain rises to 1.8%, trade-off remains). An alternative CUSUM-based measure of JIT adoption identifying approximately 560 firms produces qualitatively similar empirical results.&lt;/p&gt;
&lt;p&gt;Q: What is the subsample estimation finding on adoption costs over time?
A: Comparing 1980–1989 and 1990–2018 subsamples, the upfront sunk cost of JIT adoption estimated from the 1980s sample is about 26% higher than in the later subsample, implying it has become easier to initiate JIT production over time. Steady-state output rises by about 3.4% in the 1990–2018 period relative to 1980–1989, and the excess output contraction under the disaster shock is about 15% relative to the 1980s counterfactual, close to the baseline estimate.&lt;/p&gt;
&lt;p&gt;Just-in-Time (JIT) Production: A lean inventory management philosophy that minimizes the time between orders by committing to smaller and more frequent orders from suppliers, reducing costs of managing large material purchases and storing idle stocks; in the model, JIT is operationalized as drawing order costs from a distribution first-order stochastically dominated by the non-JIT distribution.&lt;/p&gt;
&lt;p&gt;Stockout: The condition in which a final goods firm enters a period with no inventories (s = 0) and chooses not to place an order, leaving it without any material inputs and forcing it to forgo production entirely for that period.&lt;/p&gt;
&lt;p&gt;Hoarding (in the disaster context): The behavior of firms that, facing a higher shadow value of inventories during an unexpected shock, cut back on material input use in order to draw down existing inventory stocks more slowly, preserving buffers at the cost of reduced current production.&lt;/p&gt;
&lt;p&gt;Fixed Order Cost: A stochastic, labor-denominated cost that a firm must pay each period in which it places a materials order; JIT adopters face a systematically lower distribution of these costs, enabling more frequent ordering at smaller quantities.&lt;/p&gt;
&lt;p&gt;Adoption Sunk Cost: The one-time upfront cost c_s a non-adopter must pay to initiate JIT status, which exceeds the continuation cost c_f paid by existing JIT firms to maintain their status; the gap between these costs generates hysteresis in the adoption decision.&lt;/p&gt;
&lt;p&gt;Simulated Method of Moments (SMM): The structural estimation procedure used to identify model parameters by minimizing the weighted distance between model-simulated moments and their empirical counterparts; here applied with 11 targeted moments to identify 9 parameters in an overidentified system.&lt;/p&gt;
&lt;p&gt;Micro-Macro Trade-off: The paper&amp;rsquo;s central finding that individual firms rationally adopt JIT for private profitability gains (1.3% increase in firm value, 1.43% welfare gain), while the aggregate economy becomes more fragile to unanticipated shocks (roughly 13–15% deeper output contraction) because firms do not internalize the systemic vulnerability created by economy-wide lean inventories.&lt;/p&gt;</description></item><item><title>Staffing agencies and in-house bargaining</title><link>https://macropaperwarehouse.com/papers/staffing-agencies-and-in-house-bargaining/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/staffing-agencies-and-in-house-bargaining/</guid><description>&lt;p&gt;This paper asks whether a labor market with search-and-matching frictions and firms producing under decreasing returns to labor is better characterized by in-house hiring with intra-firm wage bargaining (Stole-Zwiebel) or by an alternative arrangement in which intermediaries — &amp;ldquo;staffing agencies&amp;rdquo; — search for and employ workers and then rent them to producing firms on a frictionless, perfectly competitive market. The paper&amp;rsquo;s second and central question is what happens when firms can choose their optimal combination of the two arrangements simultaneously.&lt;/p&gt;
&lt;p&gt;The model is static. There are Z homogeneous firms with production function F(n) satisfying F&amp;rsquo;&amp;rsquo;(n) &amp;lt; 0, N homogeneous workers, and a standard concave constant-returns-to-scale matching function M = m(V, N). Firms can post vacancies, workers search, and Nash bargaining with worker bargaining weight β determines wages. The analysis is conducted with fully general production and matching functions throughout, deviating to specific functional forms (Cobb-Douglas matching, power production function F(n) = An^α) only when needed to illustrate a particular efficiency result. All main results hold for both directed and random search.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Comparing the two polar arrangements (Theorem 3.1).&lt;/strong&gt; When all hiring is in-house (Stole-Zwiebel), equilibrium firm size n^SZ, aggregate employment Zn^SZ, labor market tightness θ^SZ, and the equilibrium wage w^SZ are all strictly higher than their counterparts under full staffing-agency employment (n^SA, Zn^SA, θ^SA, w^SA). The mechanism is that under in-house hiring with decreasing returns, a worker&amp;rsquo;s threat to leave raises the marginal product — and hence the wage — of remaining workers, giving workers additional bargaining leverage. Firms respond by over-employing in-house hires to dilute each worker&amp;rsquo;s marginal product and thus moderate wages. This over-employment raises vacancy posting and tightness, which in general equilibrium bids up wages despite each firm&amp;rsquo;s individual wage-moderation motive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency (Theorem 3.2).&lt;/strong&gt; Under the standard Hosios condition — worker bargaining weight β equals the elasticity η(θ) of the matching function with respect to vacancies — the staffing-agency equilibrium achieves the social planner&amp;rsquo;s optimum (θ^SA = θ*), while the in-house equilibrium posts strictly too many vacancies (θ^SZ &amp;gt; θ^SA = θ*). The in-house arrangement can be optimal for some β &amp;gt; η when workers&amp;rsquo; bargaining power is sufficiently high (Theorem 3.3, proved for Cobb-Douglas matching and power production function), because the over-employment incentive then counteracts the externality from underprovision of vacancies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The main result: staffing agencies dominate (Theorem 3.4).&lt;/strong&gt; When firms choose their profit-maximizing combination of in-house hires n^SZ and rented staffers n^SA, the unique equilibrium has n^SZ = 0 and n^SA &amp;gt; 0 — firms use only staffers. The key is Lemma 3.1: renting one additional staffer reduces the wage paid to in-house workers by more than does hiring one additional in-house worker (formally, ∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ). This asymmetry arises because staffers cannot leave during intra-firm bargaining breakdowns — they remain regardless — so each additional staffer tightens the firm&amp;rsquo;s fallback position more effectively than an additional in-house hire. With continuous labor, any positive mass of in-house workers leaves residual scope for further wage moderation through staffers, so the firm always finds it profitable to convert the last in-house hire to a staffer. The corner solution n^SZ = 0 is thus the unique equilibrium. With discrete labor, a firm would be indifferent between exactly one and zero in-house workers.&lt;/p&gt;
&lt;p&gt;The paper also notes that this staffing-agency arrangement is formally equivalent to the &amp;ldquo;labor packer&amp;rdquo; or intermediate-good setup widely used in applied macroeconomics (e.g., Gertler, Sala, and Trigari 2008) to avoid Stole-Zwiebel complications, providing a micro-foundation for that modeling convention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does in-house hiring with decreasing returns to labor generate higher wages and employment than the staffing-agency arrangement?&lt;/strong&gt;
A: Under decreasing returns, if a worker&amp;rsquo;s wage negotiation breaks down and the worker leaves, the marginal product of the remaining n−1 workers rises. This gives each in-house worker additional bargaining leverage beyond the standard β parameter. To counteract this, firms over-employ in-house hires to keep the marginal product low. In general equilibrium this raises tightness θ^SZ &amp;gt; θ^SA, which in turn raises wages w^SZ &amp;gt; w^SA even though each individual firm&amp;rsquo;s motive was wage moderation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the formal basis for the in-house wage equation, and what does it depend on?&lt;/strong&gt;
A: Following Stole and Zwiebel&amp;rsquo;s stability condition, the continuous-labor wage for a firm with n workers is w(n) = (1−β)b + n^(−1/β) ∫₀ⁿ z^((1−β)/β) F&amp;rsquo;(z) dz. The wage depends on the entire distribution of marginal products over [0, n], not merely on the marginal product at n. In the special case of a power production function, the integral yields an explicit power function in n.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: When is the staffing-agency equilibrium socially efficient?&lt;/strong&gt;
A: Under the standard Hosios condition β = η(θ*), the staffing-agency equilibrium attains exactly the planner&amp;rsquo;s tightness (θ^SA = θ*), because bargaining in staffing agencies is standard — the worker&amp;rsquo;s outside option does not affect other workers&amp;rsquo; wages and so the usual efficiency characterization applies. The in-house equilibrium then strictly over-posts vacancies (θ^SZ &amp;gt; θ*).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Can the in-house equilibrium ever be socially optimal?&lt;/strong&gt;
A: Yes, but only under specific parameter conditions. Theorem 3.3 shows that with Cobb-Douglas matching (η constant) and a power production function F(n) = An^α, there exists a threshold β̂ ∈ (η, 1) at which θ^SZ = θ*. The intuition is that strong worker bargaining power creates a vacancy-underprovision problem; the over-employment incentive under in-house hiring then partially corrects it. The functional form restriction is made for expositional convenience; the core logic is general.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is Lemma 3.1 and why is it the key to the main result?&lt;/strong&gt;
A: Lemma 3.1 states that, given any positive number of in-house hires n^SZ &amp;gt; 0, renting one additional staffer reduces the wage paid to in-house workers by more than does hiring one additional in-house worker: ∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ. This is proved by showing the relevant integral in the difference (∂w^SZ/∂n^SA − ∂w^SZ/∂n^SZ) is negative for n^SZ &amp;gt; 0 given F&amp;rsquo;&amp;rsquo; &amp;lt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does renting an additional staffer moderate in-house wages more than hiring an in-house worker?&lt;/strong&gt;
A: An in-house worker who is hired can, in principle, leave during a bargaining breakdown, triggering renegotiation all the way down to zero in-house workers and driving the firm&amp;rsquo;s fallback to zero profit. A rented staffer cannot leave; at minimum, all rented staffers remain in production regardless of in-house bargaining outcomes. Each additional staffer thus raises the firm&amp;rsquo;s floor payoff in bargaining by more than an additional in-house hire does, generating stronger wage moderation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why does Theorem 3.4 produce a corner solution rather than an interior mix?&lt;/strong&gt;
A: Because labor is treated as a continuous input, any strictly positive mass n^SZ &amp;gt; 0 of in-house workers leaves the marginal in-house worker with positive bargaining leverage through the threat-to-leave mechanism. The firm can always improve its bargaining position by converting that marginal in-house worker to a staffer. This margin is present no matter how small n^SZ is, so the only equilibrium is n^SZ = 0. In discrete labor the firm would be indifferent between exactly one and zero in-house hires.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What happens to labor market tightness when both arrangements coexist?&lt;/strong&gt;
A: In the mixed equilibrium the tightnesses for in-house and staffer jobs must be equal in equilibrium (θ^SZ = θ^SA). If one tightness were higher, workers would prefer that job type (higher wage and higher probability of finding it), but firms would reduce vacancy posting there (costlier to fill), automatically equalizing tightness. This equilibration occurs even though in equilibrium vacancy posting for in-house jobs goes to zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Do the results require directed search or hold under random search as well?&lt;/strong&gt;
A: The results hold under both directed and random search. Appendix 3.C establishes that with random search and a single pooled matching function M = m(V^SZ + V^SA, N), the unique equilibrium also features n^SZ = 0. The directed-search assumption is made without loss of generality.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does this paper imply for the applied macroeconomics literature&amp;rsquo;s &amp;ldquo;labor packer&amp;rdquo; modeling convention?&lt;/strong&gt;
A: The paper provides a formal micro-foundation for the labor-packer or intermediate-good approach used in New Keynesian DSGE models (e.g., Gertler, Sala, and Trigari 2008) to sidestep Stole-Zwiebel bargaining. In that literature, a &amp;ldquo;wholesale firm&amp;rdquo; or &amp;ldquo;packer&amp;rdquo; searches for workers and sells their services to final-goods firms under perfect competition — formally identical to the staffing-agency arrangement in this paper. Theorem 3.4 shows this arrangement is the unique equilibrium outcome of rational firm choice, so the shortcut is not merely convenient but theoretically grounded.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the empirical literature say about wage differentials between in-house and agency workers?&lt;/strong&gt;
A: Drenik et al. (2023), using Argentine administrative data linking temp agencies to user firms, estimate a significant wage premium for in-house hires relative to temp workers. This is consistent with the paper&amp;rsquo;s theoretical prediction that w^SZ &amp;gt; w^SA in the polar-case comparison (Theorem 3.1), though the paper itself presents no empirical estimation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the implications of the staffing-agency arrangement for measured labor shares?&lt;/strong&gt;
A: The paper notes that costs for staffers typically appear in firm accounts as intermediate input costs rather than labor costs. A shift from in-house hires to staffers therefore reduces measured labor costs and, because it also reduces value added (by more than the labor-cost reduction), lowers the measured labor share at the firm even when actual labor input and output are unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What future extensions do the authors identify as priorities?&lt;/strong&gt;
A: The authors flag three main extensions: (i) heterogeneous workers and firms, which could generate predictions about which firms use each hiring mode; (ii) worker effort/loyalty differences between in-house and agency workers that could make in-house hiring attractive ex post; and (iii) a frictional rental market for staffers or heterogeneous tasks within the firm, where insufficient staffer supply in certain sub-markets could restore a role for in-house hiring.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Staffing agency (in this paper&amp;rsquo;s sense):&lt;/strong&gt; An intermediary that posts vacancies on the frictional labor market, employs workers through standard Nash bargaining, and rents those workers one-for-one to producing firms on a frictionless, perfectly competitive market. The staffing agency is separated from the firm&amp;rsquo;s production decisions; its search activity has constant returns to scale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;In-house hiring with Stole-Zwiebel bargaining:&lt;/strong&gt; A market arrangement in which the producing firm itself posts vacancies, employs workers, and conducts intra-firm Nash bargaining. Under decreasing returns to labor, the bargaining outcome for worker i depends on the firm&amp;rsquo;s payoff if that worker left, which in turn depends on wages paid to the remaining n−1 workers — generating a system of interdependent bargaining problems captured by the differential equation w(n) = (1−β)b + n^(−1/β) ∫₀ⁿ z^((1−β)/β) F&amp;rsquo;(z) dz.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage-moderation incentive (over-employment):&lt;/strong&gt; Under in-house hiring, a firm has an incentive to hire more workers than a social planner would recommend, because additional workers reduce each worker&amp;rsquo;s marginal product and hence the wage the firm must pay. This incentive is present because decreasing returns mean a departing worker raises the marginal product of remaining workers, giving each in-house worker leverage over the firm.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Differential wage-moderation effect (Lemma 3.1):&lt;/strong&gt; The finding that, given any positive mass of in-house hires, renting one additional staffer reduces in-house wages by more than hiring one additional in-house worker (∂w^SZ/∂n^SA &amp;lt; ∂w^SZ/∂n^SZ). The asymmetry arises because staffers cannot leave during intra-firm bargaining breakdowns, so they provide a more effective floor to the firm&amp;rsquo;s fallback payoff.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hosios condition (as applied here):&lt;/strong&gt; The standard efficiency condition β = η(θ), where β is the worker&amp;rsquo;s Nash bargaining weight and η(θ) is the elasticity of the job-offer arrival rate with respect to tightness. When this condition holds, the staffing-agency equilibrium is socially optimal (θ^SA = θ*) and the in-house equilibrium is inefficient (θ^SZ &amp;gt; θ*).&lt;/p&gt;</description></item><item><title>State Capacity as an Organizational Problem</title><link>https://macropaperwarehouse.com/papers/state-capacity-as-an-organizational-problem/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/state-capacity-as-an-organizational-problem/</guid><description>&lt;p&gt;Mastrorocco and Teso study how the internal organization of a state evolves during national development, framing state capacity as an organizational — specifically a principal-agent — problem. Using a new micro-database covering the U.S. federal bureaucracy from 1817 to 1905, they ask: once rulers have incentives to build a state apparatus, how do they organize it to perform its functions across a vast territory, and what drives transitions between organizational forms?&lt;/p&gt;
&lt;p&gt;The dataset is constructed from every issue of the Official Register of the United States published between 1817 and 1905 (44 biennial volumes, 15,801 pages digitized). It records full name, state of birth, state of appointment, occupation, salary, department, office, and location for 304,410 unique federal employees across 810,942 employee-year observations. The authors reconstruct the bureaucracy&amp;rsquo;s four-layer hierarchy (department → office/bureau → division → local office), link employees over time to track careers, categorize all 11,930 occupation codes into five tiers, and geo-code 9,651 places of employment to 1890 county boundaries.&lt;/p&gt;
&lt;p&gt;The paper first documents three sets of descriptive facts. On growth: the federal workforce expanded very slowly before the 1860s and then rapidly, with geographic expansion accounting for none of state growth before 1859 but roughly 29% after. On location: state presence responded positively to local manufacturing activity (a one standard deviation increase in manufacturing employment share raises presence probability by 1.3 percentage points), but distance from Washington DC significantly attenuated this relationship in 1817–1859 and not in 1861–1905. On organization: before the 1860s, employee turnover was high and spiked sharply at presidential transitions (reaching 72% of employees departing in 1861), supervisors&amp;rsquo; departures strongly predicted subordinates&amp;rsquo; departures (a one-for-one supervisor exit raised subordinate turnover probability by 37% pre-1841), and managerial delegation outside DC was stagnant or declining. After the 1860s, turnover trended down (35% at the 1897 transition), the supervisor-subordinate career link weakened materially, and field managers tripled relative to the 1850s.&lt;/p&gt;
&lt;p&gt;The authors argue that high monitoring costs in the early century made trust-based, personalistic organization the second-best solution to principal-agent problems. The limited supply of sufficiently trusted individuals constrained geographic expansion, delegation, and total size. As railroad and telegraph networks lowered communication and transportation costs, monitoring capacity increased, enabling a transition to a Weberian bureaucracy no longer constrained by trust supply.&lt;/p&gt;
&lt;p&gt;The causal identification strategy uses the staggered expansion of the railroad network. For each county and decade (1820–1900), the authors compute the minimum-travel-time route from the county centroid to DC using Donaldson and Hornbeck (2016) data on railroads, steamboat waterways, coastal routes, and land routes. The specification includes county fixed effects, state-by-decade fixed effects, and controls for local railroad presence in the county and for the county&amp;rsquo;s market access, so the identifying variation comes from distant changes in the network that altered travel time to DC without directly affecting the county&amp;rsquo;s local economy or trade access.&lt;/p&gt;
&lt;p&gt;Results: a one standard deviation decrease in travel time to DC raises the probability of federal state presence by approximately 3 percentage points (about 8% of the mean), raises log employment similarly, raises the probability of observing a local managerial layer by approximately 3 percentage points (about 8% of the mean), and reduces employee turnover by approximately 2 percentage points (about 4% of the mean turnover rate). Placebo tests confirm that travel time to other major economic centers does not predict state presence. Telegraph network data (1845–1852, Wang 2020) yield consistent results. An additional test using the post-Civil War decline in Southern-born employee shares shows that better railroad connection to DC narrowed the North-South employment gap, consistent with monitoring substituting for trust-based selection.&lt;/p&gt;
&lt;p&gt;Scope conditions: the paper covers the civilian executive branch of the federal government, excluding the Postal Office, navy yards, and the engineer department; results are robust to restricting to states already in the union at the start of the sample, ruling out frontier-specific dynamics.&lt;/p&gt;
&lt;p&gt;Q: What is the central theoretical claim of the paper?
A: The paper argues that state capacity is fundamentally an organizational problem shaped by principal-agent constraints. When communication and transportation costs are high, the government cannot effectively monitor distant agents, so the second-best solution is to staff the bureaucracy with trusted individuals connected through personal networks. This personalistic form limits size and delegation because the supply of sufficiently trusted individuals is inherently scarce. Technological reductions in monitoring costs allow a transition to a Weberian bureaucracy based on procedural oversight rather than trust, removing the supply constraint on organizational growth.&lt;/p&gt;
&lt;p&gt;Q: What data source does the study rely on, and what time period does it cover?
A: The study draws on the Official Register of the United States, a biennial government publication listing all federal employees, digitized for every issue from 1817 to 1905. The resulting dataset includes 304,410 unique employees and 810,942 employee-year observations, with each record carrying name, state of birth, state of appointment, occupation, salary, department, office, location, and — through hierarchical reconstruction — position in a four-layer chain of command.&lt;/p&gt;
&lt;p&gt;Q: How did the size of the U.S. federal bureaucracy evolve over the nineteenth century?
A: Growth was slow before the 1860s. The first Register for 1817 listed 1,056 employees across 33 pages; the 1905 volume listed over 120,000 employees across 1,254 pages. Geographic expansion contributed zero to state growth before 1859 — the share of counties with any federal employee hovered around 15% from 1817 to 1859 — but contributed approximately 29% of growth after 1859, when county presence rose to 24% by 1871, 38% by 1881, and 61% by 1905.&lt;/p&gt;
&lt;p&gt;Q: What were the three sources of state growth, and how did their relative importance change?
A: The authors decompose growth into: (1) functions (new offices/bureaus), (2) geographic expansion (new counties), and (3) intensity (more employees per county-office pair). Before 1859, growth was entirely driven by functions (~40%) and intensity (~60%), with zero contribution from geographic expansion. After 1859, geographic expansion accounted for ~29%, intensity for ~32%, and functions for ~39% of growth.&lt;/p&gt;
&lt;p&gt;Q: How did employee turnover behave across the century, and what pattern emerges at presidential transitions?
A: Turnover trended upward through the late 1850s and then declined. During presidential transitions, the rate rose from 52–53% in 1841 and 1845 to 60–63% in 1849 and 1853 and peaked at 72% in 1861; it then fell to 55% in 1869, 44–48% in 1885/1889/1893, and 35% in 1897. Turnover was consistently lower in DC than in the field: controlling for year-bureau-position fixed effects, being employed in DC was associated with a 40% reduction in turnover probability.&lt;/p&gt;
&lt;p&gt;Q: How tight was the link between supervisors&amp;rsquo; and subordinates&amp;rsquo; careers, and how did it change?
A: Before 1841, moving from none to all supervisors leaving an organizational unit increased subordinate turnover probability by 37 percentage points. The effect was similar between 1841 and 1859, then dropped substantially to 22 percentage points in the following twenty-year period, and remained roughly constant after 1881. This pattern is consistent with the early bureaucracy relying on chains of personal trust that broke when a supervisor departed.&lt;/p&gt;
&lt;p&gt;Q: What evidence describes the evolution of delegation outside DC?
A: The number of field managers did not grow between 1817 and 1859 — it actually declined in the 1820s and was flat through the mid-1850s — and then tripled by 1905 relative to the 1850s level. The probability that workers in a local office had an additional managerial layer between them and DC was unchanged between pre-1841 and 1841–1859, increased by 5 percentage points between 1861 and 1881, and by 6 percentage points post-1881.&lt;/p&gt;
&lt;p&gt;Q: How does the paper measure monitoring capacity for the causal analysis?
A: The primary measure is travel time in hours from each county centroid to Washington DC, computed decade by decade (1820–1900) as the minimum-cost route across the available railroad network, steamboat waterways, coastal routes, and land routes, using data from Donaldson and Hornbeck (2016). A second, complementary measure is the number of telegraph connections between a county and DC using data from Wang (2020) for 1845–1852.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy for the railroad analysis, and why are controls for local railroads and market access important?
A: The specification includes county fixed effects, state-by-decade fixed effects, an indicator for whether the county itself has railroad (LocalRailroad), and the county&amp;rsquo;s market access. County fixed effects mean beta is identified within-county from changes over time. Controlling for local railroad removes the direct correlation between local construction and local economic growth. Controlling for market access removes the effect of distant rail expansion on trade flows that raised agricultural land values and manufacturing activity. The remaining variation in travel time to DC — coming from distant network changes that altered the DC-county connection without affecting local conditions or broader trade access — is the identifying source.&lt;/p&gt;
&lt;p&gt;Q: What are the main quantitative effects of reduced travel time to DC?
A: A one standard deviation decrease in travel time to DC is associated with: (1) approximately 3 percentage point increase in the probability of federal state presence (~8% of the mean); (2) a similar magnitude increase in log employment conditional on presence; (3) approximately 3 percentage point higher probability of an additional managerial layer (~8% of the mean); and (4) approximately 2 percentage point reduction in employee turnover (~4% of the mean turnover rate).&lt;/p&gt;
&lt;p&gt;Q: How do placebo tests support the monitoring interpretation?
A: The authors show that, conditional on the same controls, travel times from a county to a set of other major economic centers are not associated with larger federal state presence. Since these other cities had no role as monitoring headquarters, the absence of an effect for them and the presence of an effect specifically for DC is consistent with the channel operating through the government&amp;rsquo;s ability to supervise agents from the capital, rather than through generic economic connectivity.&lt;/p&gt;
&lt;p&gt;Q: What does the telegraph evidence add, and what is its limitation?
A: Telegraph data (1845–1852, Wang 2020) show that counties with more telegraph connections to DC have larger state presence, more managerial delegation, and lower turnover, consistent with the monitoring mechanism. The limitation is that the authors have limited ability to address the endogeneity of telegraph network timing — the telegraph analysis is treated as corroborating evidence rather than the primary causal identification.&lt;/p&gt;
&lt;p&gt;Q: How do the Southern-born employee results illuminate the trust mechanism?
A: After the Civil War, the share of Southern-born federal bureaucrats fell sharply, consistent with reduced trust toward individuals from former Confederate states. However, counties that became better connected to DC via railroad expansion experienced a relative increase in the share of Southern-born employees. This shows that when monitoring costs fell, the government was willing to hire individuals from groups with lower baseline trust — monitoring substituted for trust as the mechanism ensuring agent performance.&lt;/p&gt;
&lt;p&gt;Q: Does federal state presence crowd out state and local government?
A: No. The presence of federal bureaucrats is positively correlated with the presence of state and local government employees at the county level, suggesting complementarity rather than substitution across levels of government.&lt;/p&gt;
&lt;p&gt;Q: What alternative mechanisms do the authors consider and how do they address them?
A: Three alternatives are discussed. First, demand shocks (Civil War debt repayment, industrialization) could explain the post-1860s expansion; the empirical specifications control for year fixed effects to absorb aggregate time-varying incentives, and the identification relies on differential cross-county variation in DC connectivity. Second, patronage as an electoral tool is consistent with spoils-driven turnover spikes but cannot explain why better-connected counties show lower turnover before civil service reform. Third, cognitive models of the firm (lower communication costs complement managerial problem-solving even without agency problems) could also predict the positive delegation result; the authors note they cannot empirically distinguish the monitoring and cognitive channels, and both may contribute.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for developing countries today?
A: The authors suggest that their findings from nineteenth-century U.S. history may apply to understanding why modern Weberian bureaucracies remain elusive in many developing countries. Where communication infrastructure is limited and monitoring costs remain high, personalistic organizational forms based on trust networks may persist as constrained optima — not failures of will or design, but rational responses to structural conditions. Infrastructure investment that lowers monitoring costs could be a precondition for bureaucratic modernization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Personalistic state organization&lt;/strong&gt;: The paper&amp;rsquo;s term for the organizational form that prevails when monitoring costs are high. It is characterized by staffing decisions based on personal character, moral reputation, and relationships of trust between principals and agents — and between supervisors and subordinates — rather than on formal procedural monitoring of performance. Frequent turnover at leadership transitions and constrained delegation are defining features, because the supply of trusted individuals is limited.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Weberian bureaucracy&lt;/strong&gt;: In the paper&amp;rsquo;s usage (following Weber 1978), a modern state organization defined by a fixed hierarchy of officials monitored through procedural rules rather than personal trust, lower turnover, and effective delegation of managerial power to geographically dispersed units. The paper treats this as the organizational form enabled by low monitoring costs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monitoring capacity&lt;/strong&gt;: The principal&amp;rsquo;s (politicians in DC and their cabinets) ability to observe and evaluate the behavior of agents (federal employees) throughout the territory. In the paper&amp;rsquo;s operationalization, monitoring capacity is proxied inversely by travel time and communication cost between DC and the county: lower travel time and more telegraph connections mean higher monitoring capacity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographic expansion component&lt;/strong&gt;: One of three decomposed sources of state growth. Defined as the increase in state size attributable to the state becoming present in more county locations. This component contributed zero to federal growth before 1859 and approximately 29% of growth after 1859.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Employee turnover&lt;/strong&gt;: In the paper&amp;rsquo;s measurement, the share of employees who leave the federal bureaucracy in a given year. The paper distinguishes politically-driven spikes at presidential transitions — reaching 72% of employees in 1861 — from the secular trend, which rose through the late 1850s and then declined, reaching 35% by the 1897 transition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Delegation of managerial power&lt;/strong&gt;: The probability that a local county office has an additional managerial layer between its workers and DC, rather than reporting directly to the bureau-level supervisor in Washington. The paper uses this as its measure of whether decision authority has been decentralized to the field.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trust substitution&lt;/strong&gt;: The paper&amp;rsquo;s mechanism linking monitoring capacity to organizational form. In the absence of effective monitoring, principals substitute trust for oversight — selecting agents whose personal loyalty, moral character, or political alignment gives the principal confidence they will not shirk or defect. As monitoring costs fall, trust becomes less necessary as a screening device, and the trust-constrained supply limit on organizational growth is relaxed.&lt;/p&gt;</description></item><item><title>Structural Change, Land Use and Urban Expansion</title><link>https://macropaperwarehouse.com/papers/structural-change-land-use-and-urban-expansion/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/structural-change-land-use-and-urban-expansion/</guid><description>&lt;p&gt;This paper asks how cities grow in the process of structural transformation — specifically, whether urban expansion occurs at the intensive margin (higher density within a fixed area) or the extensive margin (larger area). The authors document and explain a persistent decline in urban density in France since 1870, and develop a spatial general equilibrium model in which endogenous land use — land allocated either to agriculture or housing — is the key mechanism linking structural change to urban sprawl.&lt;/p&gt;
&lt;p&gt;The central empirical fact is striking: between 1870 and 2015, the area of the 100 largest French cities increased by a factor of roughly 30, while their population grew by only a factor of about 4, implying that average urban density fell by a factor of roughly 8. This density decline was fastest over 1950–1975, coinciding with the acceleration of structural change (France&amp;rsquo;s rural exodus). Since the mid-nineteenth century, approximately 15% of French land has been reallocated away from agricultural use — more than the total artificially-used land in France today (about 9%).&lt;/p&gt;
&lt;p&gt;The theoretical mechanism operates through the opportunity cost of urban expansion. Agricultural land at the urban fringe must earn its marginal product in the rural sector; this agricultural rent pins down the cost of converting land to urban use. When agricultural productivity is low, farmland is expensive relative to income (the &amp;ldquo;food problem&amp;rdquo;), households devote large shares of resources to food, and cities remain small in area and very dense. As agricultural productivity rises — the engine of structural change — workers leave rural areas, farmland values fall relative to income, and cities can expand cheaply at their fringes. Simultaneously, richer households spend more on housing. Both forces cause urban area to grow faster than urban population, generating a sustained decline in average density.&lt;/p&gt;
&lt;p&gt;The model also predicts a &amp;ldquo;hockey-stick&amp;rdquo; path for housing prices: during structural change, the extensive margin expansion of cities limits the rise in urban land rents despite growing housing demand. Once the reallocation of workers and land out of agriculture slows, urban land values must adjust upward rapidly, producing the pattern documented by Knoll et al. (2017) — relatively flat housing prices until roughly the 1950s, then steep increases.&lt;/p&gt;
&lt;p&gt;The model is a multi-city, multi-sector spatial equilibrium framework with non-homothetic CES preferences (including a subsistence requirement for the agricultural good), endogenous city fringes determined by land market clearing between agricultural and residential uses, and a monocentric commuting structure with endogenous commuting speed (workers adopt faster modes as wages rise). The model is calibrated to French historical data spanning 1840–2015, with 20 regions whose sectoral productivities are estimated to match regional urban populations and local farmland prices.&lt;/p&gt;
&lt;p&gt;Quantitatively, the calibrated model accounts for approximately 70% of the increase in urban area since 1870, most of the decline in average urban density (the factor-of-8 fall), about half of the rise in real housing prices, and most of the reallocation of land values from agricultural to urban. Cross-sectional evidence confirms a core prediction: cities surrounded by more expensive farmland are denser, with an IV-estimated elasticity of urban density with respect to farmland prices of approximately 0.3 (a 10% increase in farmland prices raises urban density by about 3%), consistent with the model&amp;rsquo;s counterpart. Scope conditions include the focus on France as a single country case, reliance on a monocentric urban structure, and the abstraction from within-urban-sector reallocation (manufacturing to services).&lt;/p&gt;
&lt;p&gt;Q: What is the central stylized fact motivating the paper?
A: Between 1870 and 2015, the area of the 100 largest French cities increased by a factor of roughly 30, while their total population grew by a factor of about 4, so average urban density fell by a factor of roughly 8. This density decline was most rapid over 1950–1975, coinciding with France&amp;rsquo;s peak rural exodus, and has barely fallen since — tracking the slowdown of structural change. This pattern is not unique to France; Angel et al. (2010) document persistent urban density decline on a global scale.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s key theoretical mechanism linking structural change to urban sprawl?
A: The rental price of agricultural land at the urban fringe is the opportunity cost of expanding the city into surrounding farmland. When agricultural productivity is low, farmland is expensive relative to income, keeping cities small and dense. As productivity rises and workers migrate to cities, the value of agricultural land falls relative to income, reducing the cost of urban expansion at the fringe. Richer households also devote a larger share of spending to housing, reinforcing the demand for space. These two channels together cause city area to grow faster than city population, generating a sustained decline in average density — even without any improvement in commuting technology.&lt;/p&gt;
&lt;p&gt;Q: How does the paper distinguish between the structural change channel and the commuting cost channel?
A: The model contains both channels: structural change (falling agricultural land values at the fringe) and falling effective commuting costs (rising wages lead workers to adopt faster commuting modes, a wage elasticity of commuting speed calibrated from survey data). Counterfactuals show that without structural change (rural productivity growth set to 4% of baseline), the model cannot replicate the observed density decline. Without faster commutes (setting the income elasticity of commuting speed to unity), the model predicts only about 30% of the baseline density decline. Both channels are necessary; their combined effect exceeds the sum of parts because structural change raises wages, which in turn amplifies the commuting speed mechanism.&lt;/p&gt;
&lt;p&gt;Q: How do the two channels differ in their spatial imprint within cities?
A: Structural change adds new low-density settlements at the urban fringe, so suburban density falls more than average density — the center is relatively less affected. Faster commuting modes, by contrast, induce suburbanization: workers relocate from the center outward, so central density falls more than average density. For Paris, historical data show that central density fell less than average urban density, which is consistent with both mechanisms operating simultaneously — the commuting channel pushing central density down more, but the structural change channel adding fringe expansion that affects suburban density more.&lt;/p&gt;
&lt;p&gt;Q: What is the empirical evidence on the cross-sectional farmland price prediction?
A: Using data on local farmland transaction prices from the French Ministry of Agriculture at the &amp;ldquo;Petite Region Agricole&amp;rdquo; level (over 700 areas), the authors show that cities surrounded by more expensive farmland are denser. A binned scatter plot across 200 French cities shows that moving from the first to last decile of farmland prices raises density by about one third — an effect comparable in magnitude to an increase in population from roughly 25,000 (3rd decile) to 150,000 (9th decile). To address endogeneity (productive cities may inflate nearby farmland prices), the authors instrument farmland prices with soil quality characteristics; the IV elasticity of urban density with respect to farmland prices is approximately 0.3, consistent with the model&amp;rsquo;s predicted counterpart.&lt;/p&gt;
&lt;p&gt;Q: What does the model predict about the time path of housing prices?
A: The model predicts a &amp;ldquo;hockey-stick&amp;rdquo; pattern: housing prices remain relatively flat for decades while structural change is ongoing, because cities expand cheaply at the extensive margin, absorbing growing housing demand without large rent increases. Once the reallocation of workers and land out of agriculture slows, the extensive margin ceases to buffer demand, and urban land values must rise sharply. The calibrated model accounts for about half of the observed rise in real housing prices since the mid-nineteenth century; it matches the qualitative hockey-stick pattern documented by Knoll et al. (2017) and Piketty and Zucman (2014) for France and advanced economies more broadly.&lt;/p&gt;
&lt;p&gt;Q: What happens to the relative values of agricultural versus urban land over the period?
A: Agricultural land values relative to income fall dramatically: the average value of a French agricultural field per unit of land, as a share of per capita income, was divided by a factor of 15 between 1850 and 2015. Meanwhile, urban land values rise. In 1820, agricultural land accounted for more than 70% of total housing and land wealth in France; by 2010 this share had fallen to about 3%. This reallocation of land values from rural to urban is a central prediction the model accounts for, driven by structural change reducing the scarcity premium on farmland.&lt;/p&gt;
&lt;p&gt;Q: How is the model parameterized and calibrated?
A: Preferences are non-homothetic CES with housing preference parameter gamma = 0.22, subsistence consumption for the rural good calibrated to match the 1840 agricultural employment share (about 60%), and substitution elasticity between urban and rural goods sigma = 0.8. The labor share in agriculture is alpha = 0.6. Commuting cost parameters (elasticities to wages and distance) are estimated from the French Labor Force Survey (Enquete Emploi). Region-specific sectoral productivity parameters for 20 regions (40 parameters total) are estimated to match the cross-section of urban populations and local farmland values in the base year 1870. The model is then simulated forward to 2015.&lt;/p&gt;
&lt;p&gt;Q: What share of French land has been reallocated away from agriculture, and how does this relate to urban expansion?
A: About two-thirds of French land was used for agriculture in 1840; by 2015 this fell to 52%, implying roughly 15 percentage points of French territory reallocated away from agricultural use. This 15% exceeds the total land currently under artificial use in France (about 9%). Over the more precisely measured period 1982–2015, artificialized soil increased by about 2 million hectares (3.7% of French territory), representing roughly 70% of the land converted away from agriculture over the same period. Two-thirds of land surrounding French cities is agricultural, confirming that urban expansion occurs at the expense of farmland.&lt;/p&gt;
&lt;p&gt;Q: What are the limitations and directions for future research acknowledged by the authors?
A: The model relies on a monocentric urban structure where all workers commute to a single city center, which is an approximation — commuting distance increases with residential distance to the center but less than one-for-one, suggesting workers sort into nearby jobs. The model also abstracts from within-urban-sector reallocation (the manufacturing-to-services transition), which the authors conjecture matters for the cross-section of cities in recent times. Finally, the model cannot fully replicate the steep recent rise in housing prices, which the authors attribute partly to land-use regulations constraining extensive margin growth — a policy counterfactual the general equilibrium structure is well-suited to analyze.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to the Ricardo/Nichols view that land values should rise with economic development?
A: The traditional Ricardian view predicts that a fixed factor like land must rise in value with economic development — counterfactual given the historical data showing farmland values falling sharply relative to income. The authors reconcile this with the data by emphasizing that structural change and agricultural productivity growth reduce the scarcity of farmland even as total income grows, so farmland values fall. Urban land values do rise, but the structural change channel initially dampens this increase by facilitating extensive-margin city growth. The paper thus reconciles the Ricardian fixed-factor view with the commuting technology view (Miles and Sefton, 2020) within a unified spatial structural change framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous land use&lt;/strong&gt;: In this paper&amp;rsquo;s framework, land in each region is allocated either to agricultural production or to residential use, with the margin between the two determined in equilibrium by the equality of the rental price of land at the urban fringe and the marginal product of land in the rural sector. This makes the urban-rural land boundary an endogenous object that responds to structural change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Urban fringe (phi_k)&lt;/strong&gt;: The furthest residential location of an urban worker in city k, determined endogenously as the commuting distance at which the opportunity cost of further expansion (the agricultural land rent) equals the willingness of urban workers to pay for land. All workers beyond this fringe produce rural goods without commuting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural change (in the paper&amp;rsquo;s sense)&lt;/strong&gt;: The reallocation of workers and land away from agriculture driven jointly by non-homothetic preferences with a subsistence consumption requirement for the agricultural good (demand side) and rising sectoral productivity (supply side). Structural change is the primary driver of falling farmland values and urban sprawl in the model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Non-homothetic CES preferences&lt;/strong&gt;: Household preferences over rural and urban goods that are not homogeneous of degree one in income, specified as a CES aggregate with a subsistence floor for the rural (agricultural) good. At low income levels, households devote large budget shares to food; as income rises, spending shifts toward urban goods and housing. This demand-side non-homotheticity is the channel through which rising income generates structural change.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Food problem (Schultz, 1953)&lt;/strong&gt;: The condition in which low agricultural productivity forces households to devote a large fraction of resources to meeting subsistence food needs, leaving little for housing expenditure. In the paper&amp;rsquo;s model, the food problem makes cities initially small and very dense; as agricultural productivity rises and the food problem relaxes, cities can expand in area.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Commuting cost function tau(l_k)&lt;/strong&gt;: Spatial frictions proportional to the worker&amp;rsquo;s distance from the city center and the urban wage, of the functional form tau(l_k) = a * w_{u,k}^{xi_w} * l_k^{xi_l}, where xi_w in (0,1) captures the endogenous adoption of faster commuting modes as wages rise. Concavity in both arguments is micro-founded by an optimizing commuting mode choice model, ensuring that the share of resources devoted to commuting falls as incomes rise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hockey-stick housing price path&lt;/strong&gt;: The model&amp;rsquo;s prediction that real housing prices remain relatively flat over the period of active structural change — because city expansion at the extensive margin absorbs rising housing demand without large rent increases — before rising steeply once structural change slows and the extensive margin is exhausted. This prediction matches the empirical pattern documented by Knoll et al. (2017) for France and other advanced economies.&lt;/p&gt;</description></item><item><title>Supply, Demand, Institutions, and Firms: A Theory of Labor Market Sorting and the Wage Distribution</title><link>https://macropaperwarehouse.com/papers/supply-demand-institutions-and-firms-a-theory-of-labor-market-sorting-and-the-wage-distribution/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/supply-demand-institutions-and-firms-a-theory-of-labor-market-sorting-and-the-wage-distribution/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How do workforce composition (labor supply), labor demand, and minimum wage policy jointly determine the wage distribution in imperfectly competitive labor markets, and what were the quantitative contributions of each force to the dramatic decline in Brazilian wage inequality between 1998 and 2012?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation.&lt;/strong&gt; Brazil&amp;rsquo;s formal-sector wage inequality fell sharply over this period. Three candidate shocks are well-documented: (1) a large increase in educational attainment — the share of adults completing at least secondary school rose by 20 percentage points (a 68 percent increase) between 1998 and 2012; (2) labor demand shocks, primarily the commodities boom of the 2000s; and (3) a 93.7 percent (66.1 log point) real increase in the federal minimum wage. Existing frameworks analyze these shocks separately — competitive supply/demand models on one side and imperfectly competitive minimum wage models on the other — and therefore cannot detect interactions or jointly explain all observed patterns, including the novel finding that assortative matching between high-wage workers and high-wage establishments rose in 104 out of 151 microregions, a fact inconsistent with the predictions of leading minimum wage models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper uses the RAIS (Relação Anual de Informações Sociais), a confidential linked employer-employee dataset covering the Brazilian formal sector, together with Brazilian Census data for 1991, 2000, and 2010. Statistics are computed for 151 microregions (analogous to US commuting zones) with at least 15,000 workers in RAIS in both base years and at least 1,000 formal workers per educational group. The final sample covers 73 percent of the adult population. Firm wage premiums and assortative matching are measured via AKM two-way fixed effects regressions using the bias-corrected KSS (Kline, Saggio, Sølvsten 2018) estimator, run separately for each microregion and period on three-year panels.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical framework.&lt;/strong&gt; The paper develops a unified general-equilibrium model featuring: (i) a task-based production function with distance-dependent complementarity between worker types; (ii) monopsony power arising from idiosyncratic worker preferences for firms, generating constant firm-level labor supply elasticity β (calibrated at 4, implying markdowns of 20 percent); (iii) heterogeneous firms differentiated by their production &amp;ldquo;blueprints&amp;rdquo; (the complexity of tasks they require), with blueprint shape parameterized as a Gamma distribution; and (iv) free firm entry, endogenous participation, and goods market general equilibrium with CES consumer preferences (elasticity σ). A key result is that firms with different blueprints exhibit different within-firm substitution patterns: worker types that are substitutes at low-skill, low-wage firms may be complements at high-skill, high-wage firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation.&lt;/strong&gt; A parsimonious parameterization is estimated by simultaneous-equation nonlinear least squares, targeting 26 endogenous outcomes per region (13 per period) including between- and within-group wage inequality, variance of establishment effects, covariance of worker and establishment effects, formal employment rates by education, and minimum wage bindingness. The model requires solving for equilibrium more than 15,000 times per optimization step (151 regions × 2 periods × 53 Jacobian columns). The elasticity of substitution between goods is estimated at σ = 8.36 (significantly above 1), and the aggregate labor supply parameter λ implies formal-sector elasticities of approximately 0.6–0.7 for college workers and around 1.1 for less-than-secondary workers. The model fits the data well, with R² above 0.5 for most targeted moments and perfect fit for the six moments used in the inversion procedure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Demand shocks and the minimum wage are the primary drivers of falling inequality.&lt;/strong&gt; In counterfactual simulations, the minimum wage alone (a 66.1 log point increase) reduces the variance of log wages by 0.13. Demand shocks reduce it by a further 0.18. Supply shocks (rising education) increase the variance by 0.04, leaving their net inequality-reducing contribution negligible.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Supply shocks increase assortative matching despite compressing within-firm skill premiums.&lt;/strong&gt; Within-firm task reassignment would reduce the variance of log wages by 0.221 and the correlation between worker and establishment effects by 0.165, holding production levels and firm entry fixed. However, scale, entry, and price adjustments — driven by the large estimated σ = 8.36 &amp;gt; β + 1 = 5 — reallocate skilled labor toward high-wage, skill-intensive firms, counteracting within-firm compression and raising assortative matching by 0.189. These two channels largely offset each other.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Concurrent supply and demand changes attenuate minimum wage impacts by roughly half.&lt;/strong&gt; When the minimum wage is the only shock, it would have reduced the variance of log wages by 0.13; in the presence of supply and demand changes, its incremental contribution is approximately 0.07. Minimum wage effects on sorting (which would reduce assortative matching when acting alone) disappear when accompanied by supply and demand transformations.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Minimum wage effects are concentrated in the bottom two productivity deciles.&lt;/strong&gt; Wage effects for workers in productivity deciles three through ten from the minimum wage are approximately 1 percent or less once all channels are considered. Strong wage gains are concentrated at the bottom, primarily through the monopsony channel. The wage-posting channel (within-firm returns to skill) reduces wages for low- and middle-skill workers and raises them at the top two deciles due to the reallocation of low-skilled workers toward high-wage firms, which reduces those workers&amp;rsquo; marginal products there.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cross-firm differences in substitution patterns generate non-standard minimum wage spillovers.&lt;/strong&gt; Conditional on the task demands of the firm employing them, a pair of worker types may be substitutes in low-skill firms and complements in high-skill firms. This firm-heterogeneity channel causes minimum wage impacts to be non-monotone across the productivity distribution, contrasting with the smooth inequality-reducing effects predicted by both competitive task-based models and frictional minimum wage models.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the novel empirical fact that motivates the unified framework?&lt;/strong&gt;
A: Using KSS bias-corrected AKM decompositions performed separately for each of 151 microregions, the paper documents that assortative matching — measured as the correlation between worker and establishment fixed effects — rises in 104 out of 151 regions between 1998 and 2012. The covariance term accounts for less than 7 percent of the average decline in the variance of log wages. This finding is inconsistent with the leading imperfectly competitive minimum wage model (Engbom and Moser 2022), in which minimum wages reduce assortative matching. It is also inconsistent with purely competitive supply/demand models, which have no role for firm wage premiums or sorting. The divergence from prior national-level studies (which do not find rising sorting) is explained by the fact that national-level sorting conflates geographical sorting with supply-demand dynamics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the key mechanism through which the task-based production function generates cross-firm differences in substitution patterns?&lt;/strong&gt;
A: In the task-based production function, each firm assigns workers to tasks assortatively — lower types handle lower-complexity tasks, higher types handle higher-complexity tasks, with cutoff thresholds determined by the firm&amp;rsquo;s blueprint. When a firm has a blueprint concentrated in complex tasks (a high-skill, high-wage firm), adjacent worker types are more differentiated in the tasks they perform, making them complements. When a firm has a blueprint concentrated in simple tasks (a low-skill, low-wage firm), adjacent worker types are assigned to a narrow, similar range of tasks and are therefore closer substitutes. The elasticity of complementarity between any pair of worker types is thus endogenous, depending on which tasks the firm uses and, in the monopsony case, on the firm&amp;rsquo;s skill intensity — a prediction validated empirically using nonroutine cognitive task content data for Brazilian occupations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: Under what conditions can a positive supply shock (rising educational attainment) widen the aggregate skill wage premium rather than compress it?&lt;/strong&gt;
A: The paper&amp;rsquo;s Proposition 4 and Corollary 2 show that a supply shock that increases the relative supply of skilled workers can widen the aggregate skill wage premium when the elasticity of substitution between goods (σ) exceeds the firm-level elasticity of labor supply plus one (β + 1). Intuitively, when σ is large, the reduction in prices for skill-intensive goods generated by the supply shock shifts consumption toward those goods, causing net entry of skill-intensive firms. If the gains in firm wage premiums earned by skilled workers reallocated to those firms outweigh the compression in within-firm productivity differentials, the aggregate skill premium can rise. This mechanism does not require non-convexities from endogenous innovation; it operates through imperfect competition and firm entry alone. In the estimated Brazilian model, σ = 8.36 substantially exceeds β + 1 = 5, so this condition holds, explaining why rising education increases rather than compresses assortative matching in the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the model generate positive employment effects from minimum wages, and how do these interact with reallocation?&lt;/strong&gt;
A: In the monopsonistic baseline without a minimum wage, firms post wages below workers&amp;rsquo; marginal revenue products, causing some workers to choose non-employment. A minimum wage increase raises posted wages at constrained firms, shifting some workers from non-employment (or home production) to formal employment, generating positive employment effects at the margin where the minimum wage binds. Simultaneously, minimum wages price out the least productive workers at low-wage firms (disemployment), while workers in the intermediate productivity range reallocate from low- to high-wage firms, because high-wage firms have higher revenue productivity and can profitably hire workers that low-wage firms can no longer afford. The net employment elasticity for the lowest productivity decile with respect to the log minimum wage is −0.61 (Table 7), while the mean wage for that decile rises substantially through the monopsony channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the three channels through which the minimum wage affects wages and employment in the model, and what does each channel contribute?&lt;/strong&gt;
A: The paper decomposes minimum wage effects into three channels. Channel 1 (monopsony): mechanical wage increases, positive employment effects at firms where the minimum wage binds, disemployment of very low-productivity workers, and reallocation from low- to high-wage firms, holding posted wage schedules, prices, and entry fixed. This channel accounts for nearly all of the strong wage effects at the bottom two productivity deciles. Channel 2 (wage posting): firms reoptimize earnings schedules following changes in worker composition and marginal products induced by Channel 1, holding prices and entry fixed. This channel reduces wages for low- and middle-skill workers (productivity deciles 1–7) by approximately 0.01–0.02 log points and increases wages for top deciles (decile 9: +0.04, decile 10: +0.11), because reallocation of low-skill labor to high-wage firms lowers those workers&amp;rsquo; marginal products there. Channel 3 (general equilibrium): firm entry and price responses. The fall in low-wage-firm profits causes entry of high-wage, skill-intensive firms, while the price of low-skill goods falls. General equilibrium effects generate modest positive wage effects for most workers but negative effects for very low-productivity workers due to reduced aggregate demand for low-skill labor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why do the minimum wage&amp;rsquo;s inequality-reducing effects diminish when accompanied by concurrent supply and demand changes?&lt;/strong&gt;
A: The paper documents that, under concurrent supply and demand transformations, the minimum wage&amp;rsquo;s reduction of the variance of log wages is approximately 0.07, roughly half the 0.13 reduction it would achieve acting alone. The attenuation occurs through interactions: supply and demand shocks raise the average productivity level of the labor market and shift workers toward high-wage, skill-intensive firms. In this altered equilibrium, the minimum wage binds less tightly (or hits a different part of the distribution), and the reallocation effects of the minimum wage that would normally reduce assortative matching are offset by the sorting-increasing effects of supply and demand changes. The estimated model shows that interactions between the minimum wage and supply/demand changes (columns 6, 7, 8 of Table 5) are economically meaningful, something undetectable without a unified framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the model&amp;rsquo;s prediction regarding minimum wage spillovers differ from Engbom and Moser (2022), and what explains the difference?&lt;/strong&gt;
A: Engbom and Moser (2022) find that the Brazilian minimum wage hike had significant wage effects extending far up the worker productivity distribution, while this paper&amp;rsquo;s model finds negligible effects (approximately 1 percent) beyond the bottom two productivity deciles. Two structural differences explain this divergence. First, Engbom and Moser (2022) assume perfect substitutability between worker types within firms, so a minimum wage increase at low-wage firms mechanically raises posted wages at all other firms to maintain relative competitiveness. In this paper&amp;rsquo;s framework, wage-posting responses at high-wage firms can be negative for low-skill workers because the inflow of reallocated low-skill workers reduces their marginal products — a channel absent under perfect substitution. Second, Engbom and Moser (2022) use a national model, allowing displaced low-skill workers to reallocate to top-productivity firms anywhere in the country, dampening disemployment; this paper&amp;rsquo;s local labor markets approach restricts reallocation to within-region boundaries, consistent with low rates of interregional migration documented for Brazil by Dix-Carneiro and Kovak (2017).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How are firm wage premiums generated in the model, and why do differences in physical productivity between firms not generate wage differentials?&lt;/strong&gt;
A: Proposition 3 establishes that wage dispersion for similar workers across firms requires either (i) differences in blueprint shapes (firm heterogeneity in skill intensity) or (ii) differences in entry costs. Differences in physical productivity (z_g) or consumer taste parameters alone are insufficient, because with equal entry costs, differences in productivity lead to additional firm entry until the marginal revenue product of labor is equalized across firm types. Wage premiums proportional to entry costs arise because optimal firm creation requires larger-scale operation for higher-entry-cost firms, and hiring more workers forces those firms to post higher wages. Additionally, skill-intensive firms (firms with blueprints tilted toward complex tasks) pay relative wage premiums for the worker types they use most intensively, and if skill intensity and entry costs co-vary, all workers at high-skill firms may receive a wage premium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the estimation procedure handle unobserved regional heterogeneity in labor demand?&lt;/strong&gt;
A: Demand shocks are not directly observed; they are inferred as a residual from changes in targeted outcomes after accounting for observed supply (education shares from Census) and minimum wage changes. Five region-time-specific demand parameters — TFP (z), blueprint complexities (θ₁, θ₂), relative entry costs (F₂/F₁), and relative consumer preferences (γ₂/γ₁) — are modeled as linear functions of 1998 regional covariates (educational shares, agricultural share, manufacturing share, and initial minimum wage bindingness) with time-specific coefficients. This formulation allows unobserved demand shifters to correlate with initial educational levels, preventing incorrect attribution of demand-supply correlations to causal supply effects. Region-specific parameters (TFP in each period, education-group-specific formal employment shifters) are inverted exactly from six targeted moments within each region, eliminating incidental parameter bias.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What micro-level empirical validations does the paper conduct for the task-based model&amp;rsquo;s mechanisms?&lt;/strong&gt;
A: The paper tests four micro-level predictions using nonroutine cognitive task content data for Brazilian occupations. First, skill-intensive firms have greater demand for complex tasks (consistent with Figure 1 of the model). Second, within firms, more skilled workers are assigned to more complex tasks (Lemma 1). Third, workers who move to more skill-intensive firms are assigned more complex tasks (Lemma 2, consistent with the monopsony model&amp;rsquo;s mismatch prediction). Fourth, wage gaps between high- and low-skill firms are larger for skilled workers (Proposition 3). The paper reports finding strong support for all four predictions in the data, lending credibility to the theoretical structure and quantitative results.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Task-based production function (paper&amp;rsquo;s definition):&lt;/strong&gt; A production function in which a firm produces output by assigning workers of different types to tasks indexed by complexity. The assignment is assortatively optimal: lower-type workers handle lower-complexity tasks, with unique threshold complexities separating adjacent worker types. The critical property is distance-dependent complementarity — any pair of worker types that are &amp;ldquo;close&amp;rdquo; in skill rank are substitutes, while pairs distant in skill rank are complements. This differs from CES production functions where the elasticity of complementarity is the same for all pairs; in the task-based version, substitutability depends on endogenous assignment and thus on the firm&amp;rsquo;s blueprint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Blueprint (paper&amp;rsquo;s definition):&lt;/strong&gt; A function b_g(x) that specifies the density of tasks of each complexity level x required to produce one unit of good g. It is the fundamental source of firm heterogeneity in the model: firms producing goods with blueprints tilted toward complex tasks are more skill-intensive, hire workers of higher average type, and pay higher wages. The paper parameterizes blueprints as Gamma distributions with shape parameter θ_g indexing average task complexity; firms with higher θ_g are more skill-intensive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm wage premium (paper&amp;rsquo;s definition):&lt;/strong&gt; The component of wages at a given establishment that accrues equally to all workers at that firm regardless of their type, measured as the establishment fixed effect ψ_j in AKM two-way fixed effects regressions. In this model, firm wage premiums arise from heterogeneity in blueprints (skill intensity) and entry costs, not from differences in TFP or consumer tastes. Under monopsony, firms with higher entry costs must operate at larger scale and post higher wages; blueprint heterogeneity generates differential wage premiums by skill type.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sorting / assortative matching (paper&amp;rsquo;s definition):&lt;/strong&gt; The correlation between the worker fixed effect (ν_i,r capturing worker skill) and the establishment fixed effect (ψ_j capturing firm wage premium) in the AKM decomposition, measured as Cov(ν_i,r, ψ_{J(i,r,τ)} | r). In this paper&amp;rsquo;s framework, sorting arises because firms with blueprints demanding complex tasks (high-wage firms) have a comparative advantage in employing high-skill workers; labor market sorting can therefore change over time due to supply, demand, or minimum wage shocks, even without changes in search frictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Monopsony power / markdown (paper&amp;rsquo;s definition):&lt;/strong&gt; Arising from idiosyncratic worker preferences for firms (modeled as a nested logit), firms face upward-sloping labor supply curves with constant firm-level elasticity β. Optimal posted wages equal a constant markdown β/(β+1) of the marginal revenue product of labor, set to β = 4 (implying a 20 percent markdown). The macro elasticity of formal sector labor supply is governed by a separate parameter λ, estimated from the data, yielding aggregate formal-sector supply elasticities of approximately 0.6–0.7 for college workers and around 1.1 for less-educated workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage posting responses (paper&amp;rsquo;s definition):&lt;/strong&gt; The second channel of minimum wage effects, in which firms reoptimize their entire earnings schedule following the wage-composition changes induced by the minimum wage&amp;rsquo;s mechanical and reallocation effects (Channel 1), while keeping goods prices and firm entry fixed. Because task-based production functions are concave, changes in factor proportions (due to reallocation of low-skill workers to high-wage firms) alter marginal products of all worker types within those firms, causing firms to adjust all posted wages — not just those directly constrained by the minimum wage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distance-dependent complementarity (paper&amp;rsquo;s definition):&lt;/strong&gt; The property, proven as a Corollary to Proposition 1, that for a fixed worker type h, the partial elasticity of complementarity between h and any other type h&amp;rsquo; is strictly increasing in h&amp;rsquo; for h&amp;rsquo; ≥ h (more distant high types are stronger complements) and strictly decreasing in h&amp;rsquo; for h&amp;rsquo; ≤ h (more distant low types are weaker substitutes / stronger complements). This pattern results from the division of labor: adding a very different worker type allows specialization gains that do not arise when adding similar-type workers competing for the same tasks.&lt;/p&gt;</description></item><item><title>Take the Goods and Run: Contracting Frictions and Market Power in Supply Chains</title><link>https://macropaperwarehouse.com/papers/take-the-goods-and-run-contracting-frictions-and-market-power-in-supply-chains/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/take-the-goods-and-run-contracting-frictions-and-market-power-in-supply-chains/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper studies the efficiency of self-enforced relational agreements in manufacturing supply chains when sellers have market power and contracts cannot be externally enforced. The setting is Ecuador, an upper-middle-income country with slow commercial courts (debt enforcement takes around two years even after a 2016 reform) and highly concentrated manufacturing markets (average Herfindahl-Hirschman Index of 0.6 for 6-digit economic codes, well above the 0.25 threshold used by the US Department of Justice to identify highly concentrated markets).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How efficiently do long-term trade relationships operate, period by period, when the seller can price discriminate and the buyer can opportunistically default on trade-credit debt? Does seller market power worsen or mitigate enforcement-driven inefficiencies?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper uses three Ecuadorian government administrative databases: (1) an electronic invoicing (EI) system covering all sales of 49 large manufacturing firms in textiles, pharmaceuticals, and cement products for 2016–2017, providing product-level unit prices, quantities, and payment method for each buyer-seller pair (median seller has 600 buyers); (2) the universe of firm-to-firm VAT transactions from 2008–2015, used to measure relationship age (censored at 9 years); and (3) annual financial statements providing variable costs to proxy marginal cost.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The author develops a dynamic contracting model that embeds non-linear pricing with heterogeneous buyers (following Jullien 2000 and Attanasio-Pastorino 2020) into an infinitely repeated game with limited enforcement (following Martimort et al. 2017). The seller holds all bargaining power, commits to a long-term menu of prices and quantities, and finances every transaction through trade-credit. The buyer has a privately observed, fully persistent type (willingness to pay) and can opportunistically default after delivery — &amp;ldquo;take the goods and run&amp;rdquo; — at the cost of losing the future relationship. The seller uses the value of the ongoing relationship as the enforcement instrument. The paper solves the seller&amp;rsquo;s profit-maximization problem using a recursive Lagrangian approach, yielding a modified virtual-surplus condition that governs optimal quantity allocations as a function of current and past limited-enforcement Lagrange multipliers (LE multipliers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Six motivating empirical facts&lt;/strong&gt; documented in the data: (1) New buyers are ~35% of pairs but account for only ~10% of total trade; relationships lasting nine or more years are less than 10% of pairs but generate over 30% of trade. (2) Trade-credit is used in approximately 65% of transactions in the first year and 70–75% in older relationships. (3) Quantities increase as relationships age. (4) A 10% increase in quantity purchased is associated on average with a 2% decrease in unit price (quantity discounts). (5) Conditional on quantity, older buyers pay up to 3% less; these price discounts appear only in trade-credit transactions, not in pay-in-advance transactions. (6) Approximately 40% of new relationships survive one additional year, 60% of relationships aged 1–3 years survive, and more than 75% of relationships aged four or more years survive.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key structural finding.&lt;/strong&gt; Almost all new relationships have binding enforcement constraints. The estimated LE multiplier equals 1 (unconstrained) only for the top 1% of pairs at tenure 0. As relationships age, the constraint relaxes and quantities are backloaded — consistent with the seller making promises of higher future trade to incentivize current debt repayment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Efficiency results.&lt;/strong&gt; New relationships operate at approximately 30% of the frictionless (first-best) surplus level. Efficiency rises to 60% at tenure 2, 75% at tenure 4, and over 80% at tenure 5. Aggregating across buyers weighted by efficient quantities: only 5% of sellers trade efficiently with new buyers, rising to 70% by tenure 2 and 84% in the long term. By sector, 68% of textiles, 88% of pharmaceutical, and 95% of cement-product sellers reach efficient aggregate output by tenure 5. Sellers capture approximately 80% of generated surplus; the median buyer captures around 25%, and the smallest buyers may capture less than 10%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Counterfactuals reveal a second-best interaction.&lt;/strong&gt; Fixing enforcement alone (Counterfactual a: non-linear pricing with perfect enforcement) raises surplus for 75% of buyers in the early tenures but reduces surplus for essentially all buyers in later tenures, because the threat of buyer default was the force compelling the seller to promise growing quantities over time. Fixing market power alone (Counterfactual b: uniform pricing with limited enforcement) collapses surplus to 0–40% of the baseline because the seller can no longer tailor dynamic incentives to each buyer&amp;rsquo;s enforcement constraint, causing a large share of buyers to be excluded from trade. Addressing both frictions simultaneously (Counterfactual c: uniform pricing with perfect enforcement) raises surplus for most buyers in early tenures but remains welfare-reducing for high types in later tenures; the aggregate effect depends critically on weighting: positive (~40% gain) when weighted by number of buyers, negative (surplus falls to ~58% of baseline) when weighted by quantities.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the central theoretical mechanism by which limited enforcement leads to backloading of quantities in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The buyer can default after delivery because payment is post-delivery (trade-credit). To prevent this, the seller must ensure the buyer&amp;rsquo;s discounted future net returns exceed the current payment obligation. This creates a forward-looking enforcement constraint: the seller must credibly promise sufficiently large future quantities at lower prices. As a result, current quantities are distorted downward (the seller delays granting full trade volumes), but quantities increase over time as past promises become binding promise-keeping constraints. The optimal contract is therefore non-stationary: total surplus generated and the buyer&amp;rsquo;s net return both increase over time even without efficiency gains in production.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does seller market power interact with enforcement frictions — does it worsen or improve efficiency relative to a perfect-enforcement benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s key finding is that market power and enforcement constraints act as partially offsetting frictions. Seller market power creates downward quantity distortions (the seller restricts supply to extract rents). Limited enforcement, however, compels the seller to promise growing quantities to prevent buyer default, which counteracts the market-power distortion. Thus, in older relationships, the enforcement constraint effectively disciplines the seller&amp;rsquo;s rent-extraction incentives, producing trade levels that approach the frictionless first-best. This is an instance of the theory of second-best: each friction partially offsets the other, so removing only one friction can reduce total welfare.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the six motivating empirical facts and why do they rule out standard alternative explanations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The six facts are: (1) heavy concentration of trade in long-established relationships; (2) widespread trade-credit even in new relationships; (3) quantities increase with relationship age; (4) quantity discounts within any age cohort; (5) older buyers pay lower prices conditional on quantity; (6) survival rates increase with quantity and relationship age. Alternative models — efficiency gains, learning, demand assurance, and supply-side enforcement issues — cannot jointly account for all six patterns under realistic assumptions. Critically, Fact 5 holds only in trade-credit transactions and not in pay-in-advance transactions, which supports limited enforcement (not learning or demand assurance) as the underlying mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How is the model identified from cross-sectional data on prices and quantities for a single seller?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Identification exploits two sources of variation. First, because the seller offers non-linear price menus that induce type revelation, cross-sectional variation in prices and quantities across buyers reveals their underlying private types. Second, for the highest-type buyer at tenure 0, the cumulative LE multiplier equals 1 by construction, so the gap between the observed marginal price and marginal cost directly reveals the current enforcement multiplier for that type; cross-sectional variation across high-type buyers then identifies the elasticity parameter β. Once β is pinned down, the multipliers for all types and tenures are recovered as unique solutions to ordinary differential equations, and buyer types are recovered semi-parametrically. The approach requires only cross-sectional data from one seller per year — no panel of individual buyers is needed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the estimated magnitudes of the marginal product of capital wedge, and how do they compare to related studies?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper finds a wedge between the buyer&amp;rsquo;s marginal product of capital (MPK) and the transaction price of 40% for the median new relationship and 34% for the median tenure-5 relationship. These wedges are smaller than the 80% gaps estimated for Indian firms by Banerjee and Duflo (2014), and larger than the average 6% gap calculated by Blouin and Macchiavello (2019) in the international coffee market. They are also much smaller than the 300–500% gaps estimated for Mexican micro-enterprises by McKenzie and Woodruff (2008), which is consistent with the buyers in this sample being substantially larger (median yearly sales of USD 200,000).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What does Counterfactual (a) — perfect enforcement with non-linear pricing — reveal about the intertemporal trade-off?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Counterfactual (a) shows massive short-run gains for low and middle types: surplus at tenure 0 increases to 1,508% and 628% of baseline for the bottom 10th and median buyer percentile groups respectively. However, for higher types (top 25%), perfect enforcement is immediately welfare-reducing because these buyers are already trading near efficiently and the seller loses the incentive to grow quantities over time once default is not a threat. By tenure 3 and beyond, perfect enforcement reduces surplus for essentially all buyers. The aggregate effect is negative because high-type buyers, who trade larger volumes, bear larger losses in later tenures when those tenures are weighted by quantity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why does uniform pricing with limited enforcement (Counterfactual b) perform so poorly?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under uniform pricing, the seller cannot tailor the dynamic contract to each buyer&amp;rsquo;s individual enforcement constraint. Without individualized price-quantity menus, many buyers cannot credibly commit to repaying their debts — because the seller cannot offer a sufficiently personalized future stream of benefits — and are thus excluded from trade entirely. For instance, at tenure 0, 95.8% of the bottom-decile buyers and 64% of median buyers are excluded. The aggregate surplus under this regime reaches only 3–68% of baseline across different tenures and percentile groups. This implies that the seller&amp;rsquo;s price discrimination ability, while generating informational rents, also serves a second purpose: it allows each buyer&amp;rsquo;s specific enforcement constraint to be satisfied, enabling trade that would otherwise be infeasible.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What do the sector-level results suggest about the generalizability of the main findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;All six motivating empirical facts are consistent across the three industries studied (textiles, pharmaceuticals, and cement products). The efficiency patterns also appear in all three sectors, though with heterogeneous speeds of convergence. Pharmaceutical and cement-product sellers converge faster (88% and 95% efficient at tenure 5) than textiles sellers (68% efficient at tenure 5). The finding that relationships approach efficiency in the medium and long term holds in every industry analyzed, suggesting that the underlying mechanisms — limited enforcement and seller market power — are broadly operative rather than sector-specific.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the paper establish that the standard non-linear pricing model without enforcement constraints does not explain the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper tests whether the LE multiplier at tenure 0 (G0) is statistically distinguishable from the null hypothesis of a standard non-linear pricing model (which would imply G0 = 1 for all buyers). Based on t-statistics from the estimated distribution of G0 across seller-year markets, the null of a standard model is rejected for 86% of the markets (seller-years) in the sample. Additionally, the dynamic price discounts conditional on quantity — which are the key signature of backloading — appear only in trade-credit transactions and not in pay-in-advance ones, ruling out alternative explanations such as learning about buyer quality or demand assurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What are the model&amp;rsquo;s main limitations and how do they affect the counterfactual conclusions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The author flags three principal limitations. First, buyer types are assumed fully persistent due to data constraints (only two years of invoice-level data); a Markov type structure would require longer buyer-level panels. Second, the identification strategy relies on the seller&amp;rsquo;s first-order optimality conditions and cannot recover counterfactual dynamic quantities — the counterfactuals are therefore static comparisons of per-period surplus rather than full dynamic simulations. Third, if buyers have unobserved outside options, the counterfactual efficiency results may be biased, though the direction of the bias is uncertain and depends on the distribution of types and the curvature of the return function.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Limited enforcement constraint (LE-B).&lt;/strong&gt; The paper&amp;rsquo;s central friction: because payment is post-delivery, the buyer can default and keep the goods. In the model, the contract is &amp;ldquo;default-free&amp;rdquo; only if the buyer&amp;rsquo;s post-delivery payment is weakly less than the discounted value of all future truthful net returns. The constraint is binding when this condition is tight — the buyer is on the margin of defaulting. When binding, it forces the seller to reduce current tariffs and quantities (to lower the attractiveness of default) while promising higher future quantities (to raise the continuation value).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Limited enforcement Lagrange multiplier (LE multiplier), Gt(α).&lt;/strong&gt; The shadow price on the buyer&amp;rsquo;s enforcement constraint at tenure t for a buyer at quantile α. It takes values in [0,1], equals 1 only when the enforcement constraint is slack (unconstrained buyer), and equals zero for the lowest type at all tenures. In the paper&amp;rsquo;s framework, the entire trajectory of Gt(α) across tenures encodes the history of past enforcement promises and is the key object identified and estimated to recover the dynamic distortions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Backloading.&lt;/strong&gt; The equilibrium property whereby the total surplus generated by the relationship and the buyer&amp;rsquo;s net return both increase over time. The seller achieves this by initially restricting quantities and promising growing future allocations as an enforcement device. Formally, quantities increase over time if and only if enforcement constraints are relaxed (gt+1(q) ≤ gt(q)).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Modified virtual surplus.&lt;/strong&gt; The object that replaces ordinary virtual surplus (which appears in standard non-linear pricing models) in the seller&amp;rsquo;s first-order condition. It augments standard virtual surplus by adding shadow costs for current binding enforcement constraints and subtracting corrections for past enforcement promises. Optimal quantity allocations are determined by an inverse-markup rule applied to this modified virtual surplus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relational agreement / self-enforced relational contract.&lt;/strong&gt; An informal long-term agreement sustained purely through the repeated interaction between the parties, without access to third-party (court) enforcement. In this paper&amp;rsquo;s setting, the seller disciplines the buyer&amp;rsquo;s opportunism exclusively through the threat of relationship termination; no legal recourse is available or used in equilibrium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quantity discounts (non-linear pricing / wholesale quantity discounts).&lt;/strong&gt; Price schedules under which the unit price decreases with the quantity purchased, offered by a seller with market power. In the paper&amp;rsquo;s empirical setting, a 10% increase in quantity is associated with a 2% decrease in unit price, and these discounts appear at every relationship age. The model generates them as the incentive-compatibility requirement that ensures higher-type buyers truthfully reveal their demand.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Trade-credit.&lt;/strong&gt; Seller financing of the transaction, in which goods are delivered before payment is received. In the Ecuadorian data, approximately 65% of first-year purchases and 70–75% of purchases in mature relationships are conducted via trade-credit. Because the seller bears the full cost of buyer default, trade-credit is the financial arrangement that gives rise to the limited enforcement constraint studied in the paper.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-best interaction of frictions.&lt;/strong&gt; The paper&amp;rsquo;s counterfactual finding that removing a single friction (either enforcement or market power) can reduce total welfare when both frictions are present simultaneously. This occurs because the two frictions partially offset each other: enforcement constraints discipline the seller&amp;rsquo;s monopoly distortions, and market power allows the seller to price-discriminate in ways that enable enforcement in the first place. Addressing both frictions simultaneously can improve welfare, consistent with the Lipsey-Lancaster theory of second-best.&lt;/p&gt;</description></item><item><title>Talent Hoarding in Organizations</title><link>https://macropaperwarehouse.com/papers/talent-hoarding-in-organizations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/talent-hoarding-in-organizations/</guid><description>&lt;p&gt;This paper provides the first empirical evidence of talent hoarding in organizations — the practice whereby managers deliberately suppress workers&amp;rsquo; internal mobility to retain productive team members, thereby serving their own performance-based compensation interests at the expense of firm-wide talent allocation. The research question is whether managers with misaligned incentives hoard talent, how this can be measured, and what consequences it have for worker career outcomes and organizational efficiency.&lt;/p&gt;
&lt;p&gt;The study uses personnel records from a large German manufacturing firm with over 200,000 employees worldwide, focused on more than 30,000 white-collar and management employees in Germany, covering over 300,000 employee-by-quarter observations from 2015 to 2018. This is supplemented by a manager survey (62% response rate, over 3,000 responses) and an employee survey (50% response rate, over 15,000 responses), plus the universe of internal job application and hiring data covering over 16,000 job openings and over 200,000 applicants.&lt;/p&gt;
&lt;p&gt;The conceptual framework formalizes talent hoarding as a moral hazard problem: managers observe worker productivity and are compensated based on team performance, but are tasked with identifying and developing talent for promotion. When a high-productivity worker leaves, team productivity falls. The framework predicts that hoarding intensity increases with worker productivity, team vulnerability to departures (smaller teams), and manager-level hoarding incentives (performance-related pay, low talent visibility).&lt;/p&gt;
&lt;p&gt;The key administrative measure of hoarding is the systematic gap between managers&amp;rsquo; private performance ratings (not shared outside the team) and public potential ratings (widely circulated within the firm). Managers who suppress potential ratings relative to what would be predicted given worker performance are interpreted as strategically reducing worker visibility. Managers with a 1 percentage point higher share of performance-related pay are 0.19 percentage points more likely to hoard talent; a one-person increase in team size reduces hoarding probability by 1.3 percentage points; and managers in low-visibility functional areas are 4.0 percentage points more likely to hoard. Survey-based hoarding measures yield directionally identical patterns.&lt;/p&gt;
&lt;p&gt;To identify causal effects on workers, the paper exploits quasi-random manager rotations. When a manager learns they will move to a different team — typically two to three quarters before the actual transition — their hoarding incentive ceases. This creates a temporary window of reduced hoarding. During this window, worker application rates increase by 2.3 percentage points, representing a 78% increase over the baseline application rate of 2.9%. An event study confirms flat pre-trends prior to the announcement period, supporting the identifying assumption.&lt;/p&gt;
&lt;p&gt;Using manager rotations as an instrument for worker applications, marginal applicants — those induced to apply only by the manager rotation — face a 49.1% likelihood of receiving a new position, compared to an average hiring likelihood of 27.6%. This positive selection implies that many deterred applicants would have been successful and that talent hoarding meaningfully degrades the quality of the internal applicant pool. Gender analysis reveals that women are 22% more likely to rely on manager career guidance and 26% more likely to prioritize preserving a good manager relationship. Marginal female applicants are more positively selected on education, past performance, and hiring probability for higher-level positions. The counterfactual reduction in the gender pay gap from eliminating talent hoarding is estimated at 86%.&lt;/p&gt;
&lt;p&gt;Scope conditions: the firm is a large European manufacturer with long average tenures (13 years), an application-based internal labor market, and centralized online job portal. Results apply most directly to white-collar and management employees in Germany. External validity is supported by comparisons to German workforce surveys and by the fact that 83% of top publicly listed German companies and half of 665 global organizations in industry surveys report talent hoarding as a significant organizational friction.&lt;/p&gt;
&lt;p&gt;Q: How is talent hoarding formally defined in this paper?
A: Talent hoarding is defined as actions taken by managers that lower the likelihood that a worker applies for and receives a promotion or any internal transfer outside the team. In the formal framework, a manager chooses hoarding intensity β ≥ 0, where β &amp;gt; 0 reduces the equilibrium probability that a worker gets promoted. The definition encompasses all forms of managerial action that reduce worker departure probability, including suppressing visibility, restricting access to trainings, explicit discouragement, and threats.&lt;/p&gt;
&lt;p&gt;Q: Why do managers have an incentive to hoard talent?
A: Managers are compensated based on team performance, so losing a high-productivity worker (whose replacement is a random draw from an outside distribution with expected productivity ᾱ) reduces team performance and thus manager compensation. The framework shows that when a worker&amp;rsquo;s productivity αi exceeds the expected productivity of an outside hire ᾱ, the manager optimally sets β* &amp;gt; 0. The cost of hoarding (parameterized as φm) is convex and varies across managers, capturing altruism, reputation risk, or detection probability.&lt;/p&gt;
&lt;p&gt;Q: What share of managers in the survey self-report talent hoarding?
A: 75% of managers reported that they sometimes find themselves in situations where they need to dissuade a team member from exploring opportunities in another department due to immediate team needs or performance goals. Additionally, 45% cite the risk of losing talent as a reason not to invest in employee career development, and 66% cite the need to prioritize short-term performance targets over long-term employee development.&lt;/p&gt;
&lt;p&gt;Q: How are misaligned incentives documented in the manager survey?
A: 55% of managers agree or strongly agree that talent development entails a conflict of interest because more developed workers are more likely to leave the team. While 96% believe their direct intervention has a large impact on workers&amp;rsquo; career development, only 36% perceive that impact to be valued by the firm as much as team performance impact. Similarly, 87% say talent development is a high-impact area for the firm, but only 40% believe a track record in talent development matters for their own compensation and promotion.&lt;/p&gt;
&lt;p&gt;Q: How is the administrative measure of talent hoarding constructed?
A: The measure is the residual from an OLS regression of a worker&amp;rsquo;s potential rating (a public signal of promotion readiness, widely circulated within the firm) on their performance rating (a private signal of current task performance, not shared outside the team) and worker characteristics including age, education, gender, and tenure. The manager-level measure is the average of these residuals across all workers and quarters under that manager. Managers in the top tercile (mean deviation above 0.1036) are classified as hoarding-prone.&lt;/p&gt;
&lt;p&gt;Q: Does the hoarding measure respond to the incentive proxies as predicted by the framework?
A: Yes. A 1 percentage point higher share of performance-related compensation is associated with a 0.19 percentage point increase in the probability of being classified as hoarding-prone (p = 0.000), corresponding to a 13 percentage point difference between the 90th and 10th percentiles of the financial incentive distribution. A one-person increase in team size reduces hoarding probability by 1.3 percentage points (p = 0.000), again a 13 percentage point difference across percentiles. Managers in low-visibility functional areas are 4.0 percentage points more likely to hoard (p = 0.002) relative to high-visibility areas.&lt;/p&gt;
&lt;p&gt;Q: Is the training-based hoarding measure consistent with the potential-rating measure?
A: Yes. A complementary measure based on managers restricting worker access to high-visibility in-person trainings yields nearly identical patterns: a 1 percentage point increase in performance-related pay increases hoarding probability by 0.20 percentage points (p = 0.000); a one-person increase in team size reduces it by 1.4 percentage points (p = 0.000); low-visibility areas increase hoarding by 2.98 percentage points (p = 0.021). The direction and economic magnitudes are highly similar across both administrative measures and the survey-based measures.&lt;/p&gt;
&lt;p&gt;Q: How are manager rotations used to identify causal effects on workers?
A: When a manager learns they will move to a different position — typically two to three quarters before the rotation — their incentive to hoard workers on their current team ceases. This creates a quasi-random window of reduced talent hoarding for workers on that team. An event study with worker and quarter fixed effects shows flat pre-trends in application rates beyond three quarters before the rotation, consistent with the identifying assumption that managers do not yet know about their rotation in that earlier window. Balance tests confirm workers exposed to rotations are observationally similar on demographics and past performance to non-exposed workers.&lt;/p&gt;
&lt;p&gt;Q: How large is the effect of manager rotations on worker applications?
A: Manager rotations increase worker application rates by 2.3 percentage points in the quarter of rotation, representing a 78% increase over the baseline application rate of 2.9%. The effect is transitory: application rates return to baseline within one quarter after the new manager settles in. The effect is not driven by managers taking subordinates with them (97% of applications are to positions outside both the current team and the manager&amp;rsquo;s new team).&lt;/p&gt;
&lt;p&gt;Q: Does the rotation effect vary with predicted hoarding intensity as the framework requires?
A: Yes. The rotation effect is larger for workers with higher productivity, those whose replacement would be costlier (consistent with the prediction that workers harder to replace face more hoarding), and those working under managers with lower utility costs of hoarding. The paper tests these cross-sectional predictions using continuous interactions between the rotation indicator and standardized proxies for hoarding intensity, and all patterns are consistent with the talent hoarding mechanism rather than alternative explanations.&lt;/p&gt;
&lt;p&gt;Q: How successful would the deterred applicants have been?
A: Marginal applicants — those induced to apply by the manager rotation who would not otherwise have applied, identified via IV assumptions — face a hiring probability of 49.1%, compared to the average hiring likelihood of 27.6% across all applicants. This large positive selection implies that a substantial share of deterred applicants would have been successful, and that talent hoarding meaningfully degrades the quality and quantity of the firm&amp;rsquo;s internal applicant pool and the firm&amp;rsquo;s ability to promote high-productivity workers.&lt;/p&gt;
&lt;p&gt;Q: Does talent hoarding have differential effects by gender?
A: Yes. Women are 22% more likely to place high value on preserving a good relationship with their manager and 26% more likely to rely on manager career guidance when making career decisions. Consistent with this, marginal female applicants are more positively selected on educational qualifications, past performance, and hiring probability for higher-level positions than marginal male applicants. When comparing potential earnings outcomes, both men and women would earn more in the absence of talent hoarding, but the larger earnings gains for women imply a counterfactual reduction in the gender pay gap of 86%.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports external validity of the findings?
A: The firm&amp;rsquo;s employee demographics closely match those of large manufacturing firms in the German BiBB workforce survey across gender, age, citizenship, and marital status. The firm&amp;rsquo;s internal labor market design is standard for large German firms, where 83% of top publicly listed companies cite talent hoarding as a key organizational friction. Industry surveys also report that half of 665 global organizations report managers hoarding talent by discouraging worker mobility, and talent hoarding occurs through many of the same behaviors documented in this study.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rule out confounding mechanisms for the rotation effect?
A: The paper tests and rules out several alternatives: worker-manager specific match effects (the effect does not depend on characteristics of the incoming or outgoing manager); finite project timelines driving a rush to apply; and workers being recruited by managers to their new teams (97% of applications are outside the current team and not to the manager&amp;rsquo;s new team). Balance tests show workers exposed to rotations are observationally similar to non-exposed workers, and event studies confirm absence of pre-trends in team-level outcomes including absenteeism.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The findings suggest firms forgo productivity gains when hoarded workers are not allocated to positions where they would be most productive. Potential organizational responses include monitoring or rewarding managers for promoting talent, reducing performance-related pay tied to team composition, or structuring career development activities in ways that cannot easily be suppressed by individual managers. The paper notes that firms generally do not compensate managers for promoting workers, partly due to practical difficulties of such contracts, and that the misalignment between what managers believe benefits the firm and what is recognized in their own compensation is particularly pronounced for talent development relative to all other managerial responsibilities.&lt;/p&gt;
&lt;p&gt;Talent hoarding: Actions taken by managers that lower the likelihood that a worker applies for and receives a promotion or internal transfer outside the team, driven by managers&amp;rsquo; incentive to retain productive workers to protect team performance and manager compensation. Distinct from mere neglect — it is strategic and deliberate.&lt;/p&gt;
&lt;p&gt;Potential rating: A public signal of a worker&amp;rsquo;s future potential for higher-level positions, assigned by the direct supervisor and widely circulated within the firm (e.g., via HR lists of high-potential workers); distinguished from performance ratings by its visibility outside the worker&amp;rsquo;s current team, making it a lever for strategic manipulation by hoarding managers.&lt;/p&gt;
&lt;p&gt;Performance rating: A private, task-specific signal of a worker&amp;rsquo;s past performance in their current position, not shared with other units in the firm; used as the baseline against which potential ratings are compared in the paper&amp;rsquo;s administrative hoarding measure.&lt;/p&gt;
&lt;p&gt;Visibility suppression (hoarding measure): The manager-level average residual from a regression of workers&amp;rsquo; potential ratings on their performance ratings and worker characteristics; a positive average residual indicates the manager systematically assigns lower potential ratings than predicted, suppressing worker visibility outside the team in a manner consistent with strategic talent hoarding.&lt;/p&gt;
&lt;p&gt;Manager rotation: An event in which a manager leaves their current team for a different internal position within the firm, temporarily eliminating their hoarding incentive for current team workers and creating the paper&amp;rsquo;s quasi-experimental source of variation in hoarding exposure.&lt;/p&gt;
&lt;p&gt;Marginal applicant: In the IV framework, a worker who applies for an internal position only because their manager is rotating and would not have applied otherwise; estimated via complier analysis (Abadie 2003) and used to characterize the counterfactual quality and hiring probability of workers deterred by talent hoarding.&lt;/p&gt;
&lt;p&gt;Utility cost of hoarding (φm): A manager-level parameter capturing the convex private cost to a manager of engaging in talent hoarding; may reflect altruism, detection risk, or reputational consequences; managers with lower φm hoard more intensively, and variation in φm is proxied empirically by performance-related pay, team size, and functional-area talent visibility.&lt;/p&gt;</description></item><item><title>Taxes Depress Corporate Borrowing: Evidence from Private Firms</title><link>https://macropaperwarehouse.com/papers/taxes-depress-corporate-borrowing-evidence-from-private-firms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/taxes-depress-corporate-borrowing-evidence-from-private-firms/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Does corporate income taxation raise or lower corporate leverage? The canonical Modigliani-Miller (1963) view holds that the interest tax deduction makes debt more attractive, predicting a positive taxes-to-leverage relationship. Most prior empirical work using large public firms confirms this prediction. This paper re-examines the question using data on small private U.S. firms and finds the opposite: higher corporate taxes &lt;em&gt;depress&lt;/em&gt; leverage, at least for small, financially constrained private firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Identification&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The primary dataset is the Federal Reserve&amp;rsquo;s Y-14Q supervisory collection (2011–2017), which covers the loan portfolios of the 33 largest U.S. banks and includes firm-level income statements and balance sheets for privately held, bank-dependent borrowers. The sample is restricted to domestic private C-corporations with prior-year assets above $100 million (to screen for pass-through entities), yielding 39,363 non-singleton firm-year observations. The median firm has $288 million in book assets and total debt-to-assets of approximately 38%. A supplementary dataset from the Shared National Credit (SNC) Program (1993–2018, 50,203 firm-year observations) provides a longer time series on syndicated loan commitments. Public firm comparisons use CRSP-Compustat (91,314 observations, 1989–2017).&lt;/p&gt;
&lt;p&gt;The empirical strategy is a difference-in-differences event study using variation in state corporate income tax rates. A novel contribution is the manual collection of both &lt;em&gt;enactment&lt;/em&gt; dates (when legislation was signed into law) and &lt;em&gt;effective&lt;/em&gt; dates for each state tax change since 1975. Identification follows the narrative approach of Romer and Romer (2010) and Giroud and Rauh (2019) to exclude tax changes endogenous to local economic conditions. The specification includes firm and industry-by-year fixed effects, and the analysis uses heterogeneity-robust estimators (Borusyak et al. 2024; de Chaisemartin and D&amp;rsquo;Haultfoeuille 2020) to address staggered treatment timing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For small private firms (below-median total assets, i.e., below $288 million), long-term debt-to-assets rises by approximately 4% in the year of tax cut &lt;em&gt;enactment&lt;/em&gt; and remains elevated—at approximately 2%—four or more years later, indicating a permanent increase in leverage. This anticipation effect arises because firms respond to the law&amp;rsquo;s passage, not its effective date; results using effective dates are noisy and largely insignificant. The average tax cut during the sample period was 1.2 percentage points, representing approximately a 6% reduction in firms&amp;rsquo; tax bills (given an average private-firm tax rate of 21%), and the implied leverage change of about 6% at year four is correspondingly large, consistent with a low-interest-rate environment in which small changes in marginal q translate into large investment and borrowing responses.&lt;/p&gt;
&lt;p&gt;For large private firms (above-median assets), leverage shows no significant response to tax cuts in any event year. For public firms, evidence of any effect is scant, with at most transient significance and pre-trend issues that complicate interpretation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper argues two tax-sensitive costs of debt offset the standard interest tax shield. First, a higher tax rate reduces after-tax profits, raising default probabilities and credit spreads endogenously; a tax cut thus lowers credit spreads and incentivizes more borrowing. Second, because external equity finance is either unavailable or very costly for small private firms, debt and capital are complements in financing investment: a tax cut raises the marginal product of capital, inducing firms to invest and borrow more. For small firms with low capital adjustment costs, this capital-debt complementarity dominates the direct loss of interest tax shield value. For large firms with high capital adjustment costs (estimated at nine times the small-firm value), investment responds sluggishly to tax changes, the complementarity effect is muted, and the traditional tax shield effect becomes relatively more important—producing the standard, slightly positive taxes-to-leverage relationship.&lt;/p&gt;
&lt;p&gt;Bank-assessed default probabilities fall by 20–30 basis points (roughly a 10% decline from an average of approximately 2%) in the year of enactment or one year later for small borrowers, directly supporting the model&amp;rsquo;s credit spread mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare Counterfactual&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Removing the interest tax deduction from the estimated model (while retaining profit taxation and restricted equity access) causes leverage to fall from 0.36 to −0.26. Firms substitute into cash holdings, shrinking the capital stock. In equilibrium, hours worked rise, the real wage falls, and consumer welfare drops by approximately 1.8%. The interest deduction thus raises welfare in a second-best sense by offsetting other frictions that impede optimal capital accumulation.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why do prior studies find a positive taxes-to-leverage relationship, and how does this paper differ?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Prior studies—including Titman and Wessels (1988), Heider and Ljungqvist (2015), and Faccio and Xu (2015)—predominantly use large public firms, for which the interest tax shield is the quantitatively dominant consideration. The present paper focuses on small private firms that face greater financial frictions (restricted equity access, higher default risk), in which two additional tax-sensitive costs of debt become quantitatively important. A further methodological difference from Heider and Ljungqvist (2015) is the use of firm fixed effects rather than first differences, which the authors argue is appropriate in a staggered DiD design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why use enactment dates rather than effective dates as the event?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Tax legislation is often signed into law one to two years before taking effect; in the sample of 125 tax packages since 1975, 33 became effective the following year and 13 became effective two or more years later. Firms that anticipate future tax changes will adjust leverage immediately upon enactment, not at the effective date. Results confirm this: event studies using enactment dates yield precise positive estimates for small firms (ranging from ~4% at year 0 to ~2% at year 4+), while results using effective dates are noisy and mostly insignificant. The paper therefore treats the enactment date as the economically relevant event and collects these dates as a novel contribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the economic magnitude of the leverage response for small private firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Small firms&amp;rsquo; long-term debt-to-assets rises by almost 4% in the enactment year and remains elevated at approximately 2% four or more years after enactment, consistent with a permanent adjustment. The average tax cut during the period was 1.2 percentage points, representing roughly a 6% reduction in the average tax bill (given an average effective rate of 21% for private firms, per Zwick et al. 2016). The estimated coefficient of 0.021 in year four also implies approximately a 6% change in leverage, a large response that the paper attributes to the low interest rate environment amplifying the marginal q effect of even modest tax changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Do large private firms respond differently to tax cuts, and why?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Large private firms (above the median of $288 million in total assets) show no statistically significant leverage response to tax cuts in any event year, and this null is not attributable to wider confidence intervals. The model estimation explains this via capital adjustment costs: the adjustment cost parameter for large firms is estimated to be nine times larger than for small firms. With high adjustment costs, investment responds sluggishly to a tax cut, so the complementarity channel (more investment requires more debt) is suppressed. The traditional tax shield effect then becomes relatively more important, producing a slightly positive (or zero net) taxes-to-leverage relationship consistent with the large-firm data moment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the model generate a negative relationship between taxes and leverage when the interest tax deduction is present?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two mechanisms offset the tax shield. First, higher taxes reduce after-tax profits, pushing firms closer to the default threshold; this is capitalized into equilibrium credit spreads, raising the cost of debt. Specifically, for small firms, the model shows that once leverage exceeds approximately 0.47 of assets, the after-tax risky interest rate rises monotonically with the tax rate (rather than falling via the deduction effect). Second, capital and debt are complements in financing investment: because a tax cut raises the marginal product of capital, and because external equity is unavailable, firms substitute into capital by using more leverage. For small firms with low capital adjustment costs, both mechanisms outweigh the loss of interest tax shield value when taxes fall.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How are the model parameters estimated, and what are the key parameter values?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model is estimated by simulated method of moments on the Y-14 small-firm sample, minimizing the distance between nine data moments and their model-simulated counterparts. The nine moments include the means and standard deviations of debt, investment, and operating income (all as ratios of assets), the serial correlations of investment and operating income, and the coefficient from a two-way fixed-effects regression of leverage on a tax-change dummy. The deadweight loss in default (ξ) is estimated at 0.6 for small firms and 0.32 for large firms, consistent with elevated financial frictions for small firms and in line with average recovery rates in Kermani and Ma (2023). Fixed operating costs (f) are approximately 0.15 for both samples, amounting to just under half of steady-state operating profits. The serial correlation of the tax process is estimated at 0.662, with innovation standard deviation of 0.022.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the model&amp;rsquo;s welfare counterfactual, and what does it imply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper compares two economies both with profit taxation: one with the interest tax deduction and one without. Removing the deduction in the small-firm model causes leverage to fall from 0.36 to −0.26, as firms hold net cash rather than net debt. The capital stock shrinks, output falls, hours worked rise, and both the real wage and consumption decline. Consumer welfare drops by approximately 1.8%. Capital misallocation (measured following Hsieh and Klenow 2009) worsens from 0.89 to 0.88. The result has a second-best character: the interest deduction incentivizes debt-financed investment that partially offsets the distortion from restricted equity access.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What does the evidence on default probabilities add to the empirical case?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Y-14 collection contains bank-assessed default probability estimates. In an event study covering Q1 2012–Q4 2018, the authors find that firms&amp;rsquo; assessed default probabilities decline significantly by 20–30 basis points in the year of enactment or one year later for small borrowers (those with total loan commitments of $10–$100 million), representing approximately a 10% decline from the sample average default rate of around 2%. This decline peaks two years after enactment and persists for three years. No comparable decline is observed for larger loan size buckets. Separately, in SNC data, the probability of a non-pass (i.e., below-investment-grade supervisory) rating falls by 1.7–2.2 percentage points following tax cut enactments, persisting roughly three years. Together, these findings directly validate the model mechanism by which tax cuts lower default risk and credit spreads.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Are the results robust to alternative econometric methods that address heterogeneous treatment effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. The paper applies the Borusyak et al. (2024) imputation estimator, which imputes fixed effects from untreated observations onto treated observations to remove negative weighting bias; for small firms and event years 0–3, it finds significant positive estimates comparable to the baseline. The de Chaisemartin and D&amp;rsquo;Haultfoeuille (2020, 2021) estimator, based solely on first-time switchers to treatment, yields an effect of 0.036 on leverage for small firms in the enactment year and no effect for large firms, consistent with the baseline. Results using the narrative approach (excluding Connecticut 2011 and 2015, New York 2014, and Rhode Island 2014 as potentially endogenous) produce slightly larger leverage estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Are tax hike effects symmetric to tax cut effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Evidence on hikes is weaker because tax hikes are rare in the sample. In Y-14 data, hikes are associated with leverage declines for small firms in event year 4 and for large firms in event years 1, 2, and 4, but without sufficient pre-hike observations to identify pre-trends, these results are less credible than the cut results. In SNC data (which spans a longer period, 1992–2018), tax hikes are associated with large and significant reductions in total syndicated borrowing commitments of 6–7%, while cuts produce smaller and marginally significant increases. This asymmetry is consistent with the lower adjustment costs of reducing debt relative to increasing it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What does the analysis of alternative model specifications reveal about the generality of the mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three model extensions are considered. In a collateral-constrained model (no endogenous default), the cost of debt is lost financial flexibility (the future shadow cost of the borrowing constraint), which remains tax-sensitive. In a model with costly equity issuance (linear cost λ = 0.11 following Hennessy and Whited 2007), equity issuance is rare, so the model behaves nearly identically to the baseline. In a solvency-based default model (default when firm value turns negative rather than when liquidity is insufficient), the negative taxes-to-leverage result is preserved. A news-shock extension (Jaimovich-Rebelo 2009) incorporating the anticipation of future tax changes also produces lower leverage in response to higher anticipated taxes, consistent with the empirical anticipation effects, though with smaller magnitudes because the news shock variance is smaller than the total tax-change variance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Why do contingent-claims models (Fischer-Leland-Goldstein class) always predict a positive taxes-to-leverage relationship?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In these models, shareholders have deep pockets, so negative cash flows can always be covered; this implies default is rare and the effect of taxes on the default put value is small relative to the direct interest tax deduction. Additionally, these models contain no capital stock, so there is no substitution mechanism between capital and a storage technology (i.e., cash/negative debt). Without endogenous investment, the only channel linking taxes to leverage is the tax shield, which necessarily implies a positive taxes-to-leverage relationship. This is why, as the paper notes, the result was &amp;ldquo;already hiding&amp;rdquo; in the Hennessy-Whited class of dynamic investment models but not visible in the contingent-claims literature.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Interest Tax Deduction (Tax Shield)&lt;/strong&gt;
The paper uses this in the standard corporate finance sense: the after-tax cost of debt is reduced because interest payments are deductible against corporate income. In the model, debt proceeds are discounted at the after-tax interest rate, and the deduction is taken at the time of debt issuance. The paper&amp;rsquo;s contribution is to show this benefit can be outweighed by two tax-sensitive costs of debt, reversing the sign of the taxes-to-leverage relationship for small, constrained firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tax-Sensitive Cost of Debt&lt;/strong&gt;
The paper defines two distinct tax-sensitive costs that offset the tax shield. First, taxes reduce after-tax profits, shifting the default threshold and raising equilibrium credit spreads; this is capitalized into the risky lending rate endogenously from the lender&amp;rsquo;s zero-profit condition. Second, taxes reduce the marginal product of capital, making debt-financed investment less attractive; because debt and capital are complements in a model without external equity, a higher tax rate lowers optimal capital and, with it, optimal debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital Adjustment Costs (ψ)&lt;/strong&gt;
Quadratic costs of changing the capital stock, parameterized as ψ(k&amp;rsquo; − (1−δ)k)² / (2k). The paper identifies this parameter as the key determinant of whether leverage responds positively or negatively to taxes: for small firms, ψ is estimated to be near zero (insignificantly different from zero), enabling free substitution between capital and the storage technology (negative debt), so the complementarity channel dominates. For large firms, ψ is estimated to be nine times larger, suppressing this substitution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Default Threshold&lt;/strong&gt;
In the model, default is triggered when the firm&amp;rsquo;s current after-tax profits plus recoverable capital are insufficient to repay debt: (1−τ)(y − wn − f) + (1−ξ)(1−δ)k &amp;lt; p. This threshold depends directly on the tax rate τ, so higher taxes move the threshold in the direction of default, raising credit spreads. The paper provides empirical support for this mechanism via the event study of bank-assessed default probabilities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enactment Date vs. Effective Date&lt;/strong&gt;
The paper distinguishes between the date tax legislation is signed into law (enactment date) and the date it becomes operative (effective date), which can differ by one to two years. The paper collects novel data on enactment dates from state legislative records. The empirical finding that firms respond to enactment rather than effective dates constitutes evidence of anticipation effects: firms adjust leverage upon observing future expected tax changes, not when the changes actually take hold.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Second-Best Welfare Effect of the Tax Deduction&lt;/strong&gt;
The paper uses this term to characterize the welfare result from the counterfactual: in an economy already distorted by profit taxation and restricted equity access, the interest deduction raises consumer welfare by incentivizing debt-financed capital accumulation. Removing the deduction causes firms to substitute into cash, shrinking the capital stock and lowering wages and consumption. This is a second-best result because the deduction is welfare-improving only because it partially offsets the distortions created by other frictions; in a frictionless world, no such second-best rationale would apply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Y-14Q Supervisory Data&lt;/strong&gt;
The Federal Reserve&amp;rsquo;s supervisory collection from the 33 largest U.S. banks, covering loan portfolios and associated borrower financial statements for firms with commercial and industrial loans exceeding $1 million in commitment. The paper uses this dataset because it covers private, bank-dependent firms—a population not previously studied in the tax-leverage literature—and contains firm-level balance sheets, credit ratings, and default probability estimates.&lt;/p&gt;</description></item><item><title>Taylor Rule Deviations Across Horizons: A Practical Tool for Monetary Policy</title><link>https://macropaperwarehouse.com/papers/taylor-rule-deviations-across-horizons-a-practical-tool-for-monetary-policy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/taylor-rule-deviations-across-horizons-a-practical-tool-for-monetary-policy/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper addresses a fundamental limitation of the standard Taylor rule as a monetary policy stance gauge: the rule is defined solely for the overnight federal funds rate (FFR) and cannot assess stance across the maturity spectrum of the yield curve. This limitation becomes acute when the FFR hits its effective lower bound (ELB) and the Federal Reserve resorts to unconventional monetary policy (UMP) instruments—quantitative easing and forward guidance—that are explicitly intended to influence longer maturities. The authors ask: can the Taylor rule idea be extended across the yield curve horizon to produce a maturity-specific monetary policy stance measure that remains informative even during ELB episodes?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology and Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper proposes the &amp;ldquo;Taylor rule yield curve,&amp;rdquo; which extends the original Taylor rule to points in time in the future horizon (maturities of 1 through 10 years). The Taylor rule expected rate at maturity h is defined as the average of h annual one-period-ahead Taylor-rule-implied short-term rates, each computed from professional forecasters&amp;rsquo; expectations of inflation and the output gap h years ahead. The market counterpart is the Overnight Index Swap (OIS) rate for the corresponding maturity. The &amp;ldquo;Taylor rule deviation&amp;rdquo; (TRD) at maturity h is then the difference between the Taylor rule expected rate and the market OIS rate at that maturity—interpretable as the average expected monetary policy stance from the current period through h years ahead.&lt;/p&gt;
&lt;p&gt;Data sources: inflation and GDP growth forecasts from Consensus Economics (1–5 years ahead, and 6–10 year average); output gap forecasts constructed using Congressional Budget Office potential output estimates; natural rate of interest estimates from Holston, Laubach, and Williams (2017) available from the Federal Reserve Bank of New York; FFR, core CPI inflation, and GDP growth from FRED; OIS rates from Bloomberg (available from 2002/Q1). Two Taylor rule coefficient sets are examined: the &amp;ldquo;original&amp;rdquo; rule (α = 0.5, β = 0.5) and the &amp;ldquo;balanced&amp;rdquo; rule (α = 0.5, β = 1.0), with the balanced rule as baseline. An inertia parameter of ρ = 0.85 (quarterly) is assumed, implying annual persistence of approximately 0.52. The sample period runs from 2000/Q1 to 2018/Q4 for the Taylor rule yield curve itself, and from 2002/Q1 to 2017/Q4 for OIS-based TRD analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;First, the estimated Taylor rule expected rate curves show that after the onset of the Global Financial Crisis (GFC), the balanced-rule Taylor rate dropped completely below zero for all maturities up to 10 years. During 2008/Q4, the Taylor rule expected rate curve lay approximately 2–3 percentage points below the market rate curve across maturities, reflecting excessively tight market expectations relative to what the Taylor rule framework implied. By 2011/Q4, the market OIS curve fell below the Taylor rule expected rate curve for maturities beyond 4 years—indicating that explicit and forceful forward guidance (the August 2011 FOMC statement committing to &amp;ldquo;exceptionally low levels for the federal funds rate at least through mid-2013&amp;rdquo;) had driven market rates below the Taylor-implied accommodative path at the long end.&lt;/p&gt;
&lt;p&gt;Second, VAR analysis for the sample period 2002–2017 shows that TRDs at both 2-year and 10-year maturities generate statistically significant impulse responses: positive TRD shocks—indicating a tighter-than-Taylor monetary policy stance—cause both the output gap and inflation to decrease. Importantly, this result holds during the ELB period when the FFR gap and shadow policy rate gap do not yield theoretically consistent impulse responses; in the 2002–2017 subsample, both the FFR gap and the shadow rate gap produce perverse (positive) responses of output and inflation to a tightening shock, presumably because the ELB binds and UMP operates outside the overnight rate. The OIS rates per se (without the Taylor rule expected rate subtracted) show mostly muted and statistically insignificant impulse responses in the same VAR framework. Granger causality tests (62 observations) confirm that TRDs Granger-cause OIS rates for both 2-year (F-statistic = 4.579, p = 0.014) and 10-year (F-statistic = 7.734, p = 0.001) maturities, while the reverse direction is not rejected in either case, highlighting TRDs&amp;rsquo; informational superiority over raw OIS rates.&lt;/p&gt;
&lt;p&gt;Third, TRDs for 2-, 5-, and 10-year maturities are positively correlated with the VIX in the same quarter (R² values of 0.34, 0.37, and 0.35 respectively), whereas the FFR gap is negatively correlated with the VIX (R² = 0.22). This positive TRD–VIX relationship holds during both ELB (2008/Q1–2015/Q3) and non-ELB subperiods, suggesting TRDs serve as a proxy for risk appetite in financial markets—with a loose-relative-to-Taylor monetary stance associated with lower risk aversion.&lt;/p&gt;
&lt;p&gt;Fourth, a stylized New Keynesian model with anticipated future shocks to the Taylor rule (interpreted as &amp;ldquo;news shocks&amp;rdquo;) provides theoretical support. When agents learn of a future expansionary Taylor rule shock, they revise upward their expectations of future output and inflation, which—through consumption smoothing (Euler equation) and forward-looking pricing (New Keynesian Phillips curve)—produce contemporaneous expansionary effects. An extended model with habit formation, backward-looking price-setters, and interest rate smoothing generates hump-shaped and persistent IRs consistent with the empirical patterns. Simulations on model-generated data confirm that the TRD measure, but not the future interest rate or contemporaneous rate deviation, recovers statistically significant and correctly signed impulse responses in the VAR.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The methodology requires data on professional forecasters&amp;rsquo; expectations of output and inflation at multi-year horizons, limiting applicability to countries for which such forecast data exist. Term premium components of OIS rates are excluded from the analysis, which the authors note may make estimates of forward guidance impact conservative. The analysis is confined to the United States for the period 2000/Q1–2018/Q4.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the precise mathematical definition of the Taylor rule deviation (TRD) at horizon h, and how does it differ from the conventional FFR gap?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The TRD at maturity h is defined as the difference between the market OIS rate at h-year maturity and the Taylor rule expected rate at that maturity. The Taylor rule expected rate is the average (across years k = 1 to h) of the Taylor-rule-implied short-term interest rates expected k years ahead, where each expected rate uses professional forecasters&amp;rsquo; projections of inflation and the output gap at that horizon, together with the current natural rate of interest (assumed unchanged). The conventional FFR gap is the deviation of the overnight FFR from the contemporaneous Taylor rule rate—a scalar at a single point in time. The TRD generalizes this to any maturity: it equals the average expected monetary policy stance (accommodative or tight relative to Taylor) from the current period through h years ahead, capturing the cumulated sum of anticipated and unanticipated disturbances to the Taylor rule.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the FFR gap fail as a monetary policy stance indicator during the ELB period, and why does the shadow rate gap not resolve this failure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: When the FFR hits the ELB, it is pinned near zero regardless of how accommodative the Federal Reserve&amp;rsquo;s actual policy intentions are; any further intended easing through forward guidance or quantitative easing is not reflected in the overnight rate&amp;rsquo;s level or its deviation from the Taylor rule. The authors show (Figure 8a, 2002–2017 subsample) that in a three-variable VAR with output gap, inflation, and FFR gap, a positive FFR gap shock generates increases in both output and inflation—the opposite of theoretically expected contractionary effects—because the ELB constrains the FFR while UMP operates through longer maturities. The shadow policy rate (Wu and Xia, 2016) drops below zero during the UMP period and conceptually summarizes the entire yield curve&amp;rsquo;s accommodation in a single synthetic overnight rate. However, Figure 8b shows that replacing the FFR with the shadow rate leaves the perverse VAR impulse responses qualitatively unchanged in the 2002–2017 subsample, because a single short-term summary rate cannot isolate the maturity-specific information that the TRD captures.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What does the VAR analysis reveal about TRDs&amp;rsquo; ability to capture monetary policy effects at the ELB, and does the maturity of TRD matter?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: For the 2002–2017 sample period (Figure 9a), VAR impulse responses with the TRD replacing the FFR gap show that a positive TRD shock causes statistically significant decreases in both the output gap and inflation—the theoretically expected contractionary response. This result holds for both 2-year and 10-year TRDs. The fact that the 10-year TRD also produces this correct result indicates that TRDs at long maturities can capture the stance reflected in forward guidance, which explicitly targets expectations about the future course of monetary policy well beyond overnight. The output gap response is quantitatively larger in magnitude than the inflation response across both maturities (figure axis ranges suggest output gap peaks at roughly ±1.0% versus inflation at ±0.2%), consistent with the theoretical model&amp;rsquo;s prediction that the output gap is more responsive to contemporaneous effects while inflation responds to both current and expected future conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the role of the output gap component versus the inflation component in driving TRD changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Figures 6 and 7 decompose period-by-period first differences of TRDs into their output gap and inflation contributions for both 2-year and 10-year maturities. The output gap component is the main determinant of changes in TRDs across both maturities, reflecting the substantially volatile outlook on economic growth—especially around the GFC. The inflation component has a considerably smaller contribution, and this difference is even more pronounced for 10-year maturities than for 2-year maturities, reflecting the fact that professional forecasters&amp;rsquo; inflation expectations change much less at longer horizons than near-term GDP growth expectations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the Granger causality analysis reveal about the informational content of TRDs relative to OIS rates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Table 1 reports Granger causality tests using 62 observations. For 2-year maturities, the null that TRD 2Y does not Granger-cause OIS 2Y is rejected at the 5% level (F = 4.579, p = 0.014), while the null that OIS 2Y does not Granger-cause TRD 2Y is not rejected (F = 0.999, p = 0.375). For 10-year maturities, the null that TRD 10Y does not Granger-cause OIS 10Y is rejected at the 1% level (F = 7.734, p = 0.001), while the reverse null is not rejected (F = 0.843, p = 0.436). This unidirectional causality—TRDs leading OIS rates but not vice versa—implies that TRDs contain information about future OIS rate movements not already embedded in current OIS rates, making TRDs informationally superior to raw OIS rates for assessing monetary policy stance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do TRDs relate to VIX, and does this relationship depend on whether the economy is at the ELB?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Figures 10 and 11 document that TRDs for 2-, 5-, and 10-year maturities are positively correlated with the VIX in the same quarter (R² values of approximately 0.34, 0.37, and 0.35 for 2Y, 5Y, and 10Y TRDs respectively), meaning that a tighter-than-Taylor monetary policy stance (positive TRD) is associated with higher market risk aversion. By contrast, the FFR gap shows a negative correlation with the VIX (R² = 0.22), the opposite sign. The same positive TRD–VIX correlation is observed when current TRDs are plotted against VIX four quarters later, though the R² values are smaller (ranging from approximately 0.04 to 0.05). Critically, Figure 11 shows that dividing the 2002/Q1–2017/Q4 sample into ELB (2008/Q1–2015/Q3) and non-ELB periods, the positive correlation between the 5-year TRD and VIX holds during both subperiods (R² = 0.37 for ELB current quarter, R² = 0.41 for ELB four quarters ahead), demonstrating that TRDs&amp;rsquo; relationship with risk appetite is not an artifact of the ELB environment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the theoretical New Keynesian model contribute, and what is the mechanism by which anticipated future Taylor rule shocks affect current macroeconomic variables?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper embeds anticipated future shocks to the Taylor rule (news shocks) in a stylized New Keynesian model with Euler equation, New Keynesian Phillips curve, and Taylor rule. When a one-period-ahead expansionary monetary policy shock (εh,t for h=1) is announced at time t, agents expect expansionary effects in period t+1 (higher output gap and inflation). Through consumption smoothing in the Euler equation, expected higher output in t+1 raises current consumption and thus current output. Through forward-looking pricing in the NKPC, expected higher future inflation raises current inflation. Analytically, the coefficients on the one-period-ahead shock (c_{1,y} and c_{1,π}) satisfy the same signs as the contemporaneous shock coefficients (c_{0,y} and c_{0,π}), confirming the contemporaneous impact. The model shows that for the inflation rate, the future shock has larger impact than the contemporaneous shock (|c_{1,π}| &amp;gt; |c_{0,π}|) because inflation responds to both current and future output gap in the NKPC; for the output gap, the future shock has smaller impact (|c_{1,y}| &amp;lt; |c_{0,y}|) because higher expected inflation raises the nominal interest rate via the Taylor rule&amp;rsquo;s endogenous feedback, partially offsetting the expansionary effect on current output.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do simulations on model-generated data validate the VAR methodology for identifying TRD effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Figure 17 uses simulated data from the model with inertia (200 periods, corresponding to 50 years) to compare three interest rate measures in a three-variable VAR (output gap, inflation, interest rate measure): (i) the average future interest rate (I), (ii) the contemporaneous interest rate deviation (ε_{0,t}), and (iii) the H-period TRD with H = 8. When the future interest rate I is used, the identified monetary policy shock produces impulse responses with the opposite sign relative to the structural model, because the VAR captures reverse causality between the interest rate and the state of the economy. When the contemporaneous rate deviation ε_{0,t} is used, responses have the intended sign but are not statistically significant, because future anticipated shocks are not materialized in the current period&amp;rsquo;s rate. When the TRD is used, the identified shock generates statistically significant responses with the correct sign, validating TRD as the appropriate measure for capturing the effects of anticipated future monetary policy shocks in an empirical VAR framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the Taylor rule yield curve behave at specific historical episodes, and what do these patterns reveal about monetary policy stance?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: During 2008/Q4, the Taylor rule expected rate curve (balanced rule) lay approximately 2–3 percentage points below the market OIS curve across all maturities, reflecting that markets expected a much faster policy normalization than the Taylor rule implied given the economic collapse—indicating excessively tight market expectations. By 2011/Q4, after successive rounds of forward guidance, the market OIS curve fell below the Taylor rule expected rate curve for maturities beyond 4 years, with the balanced-rule Taylor expected rates remaining negative for maturities up to 3 years. By 2013/Q4, mid- and long-term market expected rates were roughly aligned with Taylor rule expected rates. In 2015/Q4, when the Fed hiked for the first time post-GFC (while the Taylor rule short-term rate was still negative), the market curve almost perfectly matched the Taylor rule expected curve for maturities beyond one year. In 2017/Q4, the Taylor rule expected rate curve exceeded the market curve by approximately 0.5–1 percentage points, suggesting continued expansionary stance even after policy rate normalization began.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How robust are the results to the choice between the original and balanced Taylor rule specifications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Robustness checks (Figures 12–14) compare results under the original rule (α = 0.5, β = 0.5) versus the baseline balanced rule (α = 0.5, β = 1.0). The original rule generates smaller fluctuations in Taylor rule expected rates, reflecting its lower coefficient on the more volatile output gap. However, the overall trajectories do not change significantly. The main qualitative difference emerges in 2011/Q4 and 2013/Q4: the balanced rule implies Taylor expected rates are negative for 1–3 year maturities (indicating the ELB was still binding even relative to medium-term Taylor-implied paths), while the original rule produces all-positive Taylor expected rates for these periods. For 2008/Q4, 2009/Q4, 2015/Q4, and 2017/Q4, both specifications yield similar pictures, and the central conclusions about TRDs&amp;rsquo; macroeconomic relevance and relationship with risk appetite are robust to the specification choice.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Taylor Rule Yield Curve&lt;/strong&gt;: The paper&amp;rsquo;s proposed extension of the standard Taylor rule from the overnight federal funds rate to all points in the future yield curve horizon (1 through 10 years). For maturity h, it is the average of h annual Taylor-rule-implied expected short-term rates, each calculated using professional forecasters&amp;rsquo; h-years-ahead projections of inflation and the output gap plus the current estimate of the natural rate. Not a market instrument but a model-derived benchmark yield curve representing the &amp;ldquo;neutral&amp;rdquo; rate at each horizon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taylor Rule Deviation (TRD)&lt;/strong&gt;: The gap between the market OIS rate at maturity h and the corresponding Taylor rule expected rate—that is, the deviation of market expectations from what the Taylor rule framework implies should prevail at that horizon. A positive TRD indicates market rates are above the Taylor-implied rate (tighter-than-neutral stance); a negative TRD indicates easier-than-neutral stance. The TRD at maturity h equals the average of expected monetary policy stance residuals from the current period through h years ahead.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective Lower Bound (ELB)&lt;/strong&gt;: The floor to which a central bank can reduce the nominal policy rate before further cuts become infeasible or counterproductive. In the paper&amp;rsquo;s empirical context, the FFR ELB episode for the United States runs from 2008/Q1 to 2015/Q3. During this period, the standard FFR gap and shadow rate gap measures fail to produce theoretically consistent VAR impulse responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Taylor Rule Expected Rate&lt;/strong&gt;: The paper&amp;rsquo;s specific construct: the average of Taylor-rule-implied future short-term interest rates at each year of maturity, computed from professional forecasters&amp;rsquo; consensus projections of inflation and output gap at multi-year horizons. Distinct from any market rate; serves as the &amp;ldquo;neutral&amp;rdquo; benchmark at each maturity against which OIS rates are compared.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Balanced vs. Original Taylor Rule&lt;/strong&gt;: Two coefficient specifications used in the paper. The &amp;ldquo;original&amp;rdquo; rule (Taylor, 1993) sets the inflation gap coefficient α = 0.5 and the output gap coefficient β = 0.5. The &amp;ldquo;balanced&amp;rdquo; rule (Taylor, 1999) sets α = 0.5 and β = 1.0, placing greater weight on output stabilization; the paper uses the balanced rule as its baseline on the grounds that it better reflects the Federal Reserve&amp;rsquo;s dual mandate in recent years.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Anticipated Future Taylor Rule Shocks (News Shocks)&lt;/strong&gt;: Shocks to the Taylor rule that are known to agents at time t but materialize in a future period t+h. Following Laséen and Svensson (2011) and Del Negro et al. (2012), the paper embeds these in a New Keynesian model to show that anticipated future expansionary policy has contemporaneous expansionary effects through consumption smoothing and forward-looking pricing—the theoretical mechanism underpinning why TRDs at longer maturities affect current macroeconomic outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-Taking Channel via TRD&lt;/strong&gt;: The paper&amp;rsquo;s finding that TRDs for 2-, 5-, and 10-year maturities are positively correlated with VIX (R² ≈ 0.34–0.37 in the same quarter), holding in both ELB and non-ELB periods. A positive TRD (tighter-than-Taylor stance) corresponds to higher market risk aversion as measured by VIX, enabling TRDs to serve as a maturity-specific measure of risk appetite in financial markets—in contrast to the FFR gap, which shows the opposite (negative) correlation with VIX.&lt;/p&gt;</description></item><item><title>Technology Transfer and Early Industrial Development: Evidence from the Sino-Soviet Alliance</title><link>https://macropaperwarehouse.com/papers/technology-transfer-and-early-industrial-development-evidence-from-the-sino-soviet-alliance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/technology-transfer-and-early-industrial-development-evidence-from-the-sino-soviet-alliance/</guid><description>&lt;p&gt;This paper estimates the causal effect of technology and knowledge transfers on early industrial development using the Sino-Soviet Alliance of the 1950s as a natural experiment. Between 1950 and 1957, the Soviet Union supported the &amp;ldquo;156 Projects&amp;rdquo; — 139 approved civil projects for constructing technologically advanced, large-scale, capital-intensive industrial facilities in China. The intended program comprised two components: a &amp;ldquo;basic&amp;rdquo; transfer of Soviet state-of-the-art machinery and equipment (including blueprints, site surveys, and plant construction assistance), and an &amp;ldquo;advanced&amp;rdquo; know-how transfer involving Soviet experts residing in Chinese plants for roughly three years to train engineers and production supervisors in organizational, technological, and planning methods. Total investment amounted to approximately $80 billion in 2020 figures (45.7% of Chinese GDP in 1949).&lt;/p&gt;
&lt;p&gt;Identification exploits idiosyncratic delays in project completion caused by Soviet production capacity constraints, insufficient experts, translator shortages, and miscommunication — factors documented in historical records as unrelated to project-specific characteristics. When the Sino-Soviet Split in 1960 abruptly ended the program, all 139 plants had been built but differed in what transfers they had received: 46 received both machinery and know-how (advanced), 46 received only machinery (basic), and 47 received neither (comparison). The paper verifies, via ANOVA tests, multinomial logit models, balancing regressions on 26 plant characteristics, pre-trend tests, and Oster (2019) selection-on-unobservables bounds, that the three groups were statistically equivalent prior to receiving the Soviet transfers.&lt;/p&gt;
&lt;p&gt;The primary data source is plant-level annual reports from the Steel Association covering 94 steel firms (1,410 plants) from 1949 to 2000, matched to 304 steel plants across the 156 Projects. Supplementary sources include the declassified 1985 Second Industrial Survey (7,592 largest Chinese firms) and the China Industrial Enterprises database (1998–2013, over 1 million firms).&lt;/p&gt;
&lt;p&gt;Three main results emerge. First, receiving only the basic (machinery) transfer had positive but short-lived effects: output of basic plants peaked at 14.7 percent above comparison plants six years after receiving Soviet machinery, then declined monotonically and became statistically insignificant after 20 years — consistent with the estimated 15–20 year life cycle of Soviet capital. Second, the advanced transfer had large and persistent effects: advanced plants&amp;rsquo; output rose 8.4 percent relative to basic plants within two years, 19.7 percent within 20 years, and 49.5 percent cumulatively after 40 years. TFPQ of advanced plants reached 47.9 percent above basic plants after 40 years. These magnitudes held across industries in 1985 and 1998–2013 data, where value added of advanced firms was 41.4–52.0 percent higher and TFPR 39.5–49.3 percent higher than basic firms. Third, the program generated horizontal spillovers (12.9 percent higher output, 12.4 percent higher productivity for steel plants in counties hosting advanced plants) and vertical spillovers (16.4 percent productivity gain for supply-chain firms in counties of advanced nonsteel plants), with spillover effects conditional on post-1990s market liberalization to materialize in private firms.&lt;/p&gt;
&lt;p&gt;The mechanism driving persistence is the accumulation of organizational and human capital during the advanced transfer, which enabled advanced plants — uniquely — to develop new production processes endogenously, home-fabricate continuous casting furnaces to replace obsolete Soviet open-hearth equipment, and produce export-quality steel. Advanced plants employed more engineers and high-skilled technicians, established professional schools, and their counties had 10.4 percent higher STEM university degree rates and 16.8 percent more technical schools.&lt;/p&gt;
&lt;p&gt;Scope conditions: results apply to large-scale, capital-intensive state-planned industrial facilities in a country at an early stage of industrialization, under conditions of near-complete trade isolation (1960–1978) that prevented basic plants from compensating via imported foreign capital. The estimated aggregate contribution of the program is that, without both transfer types, Chinese real GDP per capita growth between 1953 and 1978 would have been 7 to 19 percent lower.&lt;/p&gt;
&lt;p&gt;Q: What distinguishes the &amp;ldquo;basic&amp;rdquo; from the &amp;ldquo;advanced&amp;rdquo; Soviet transfer?
A: The basic transfer involved duplication of whole Soviet plants through provision of state-of-the-art Soviet machinery, equipment, blueprints, geological surveys, and construction assistance. The advanced transfer added visits of Soviet experts — expected to stay approximately three years — to teach Chinese technicians how to operate the machinery and to provide within-firm training in engineering (math, physics, chemistry, organizational and planning methods) and supervisory management based on &amp;ldquo;scientific management&amp;rdquo; principles including quality-control methods.&lt;/p&gt;
&lt;p&gt;Q: What caused plants to receive different levels of transfer, and why is this variation credible for identification?
A: Delays arose from Soviet production capacity constraints (by 1955, one-third of annual Soviet steel-rolling output was destined for China), insufficient experts, translator shortages, and bilateral miscommunication — all documented in historical records as unrelated to project characteristics. When the 1960 Split ended the program, plants&amp;rsquo; treatment status was determined by where they happened to be in the delivery queue. ANOVA tests find no significant differences in approval year, investment, workforce, equipment value, project length, or capacity across the three groups, and a multinomial logit on province and industry fixed effects shows no group had higher ex-ante probability of receiving either transfer type.&lt;/p&gt;
&lt;p&gt;Q: What were the output effects of the basic transfer, and why did they fade?
A: Output of basic plants was not significantly above comparison plants for the first two years, peaked at 14.7 percent higher six years after receiving Soviet machinery, then declined monotonically and became statistically insignificant after 20 years. This timing corresponds to the estimated 15-year life cycle of Soviet capital goods. TFPQ of basic plants followed the same pattern, peaking at 14.5 percent above comparison plants. Without the know-how component, basic plants could not develop new processes or home-fabricate replacement capital, so productivity advantages disappeared as Soviet equipment became obsolete.&lt;/p&gt;
&lt;p&gt;Q: What were the output and productivity effects of the advanced transfer?
A: Advanced plants&amp;rsquo; output rose 8.4 percent relative to basic plants within two years of the Soviet transfer and 19.7 percent within 20 years, reaching a cumulative effect of 49.5 percent after 40 years. TFPQ of advanced plants increased from 8.3 percent above basic plants two years after the transfer to 47.9 percent after 40 years. These effects were driven by output growth rather than differential input use — the number of workers, coke, and iron were statistically indistinguishable across the three plant types — ruling out government input reallocation as an explanation.&lt;/p&gt;
&lt;p&gt;Q: Did the advanced transfer affect steel quality?
A: Advanced plants produced substantially more crude steel (higher quality, lower carbon content) and less pig iron than basic and comparison plants, and this quality advantage persisted well beyond the 20-year life cycle of Soviet capital. Basic plants also shifted toward crude steel initially but the quality advantage dissipated once Soviet machinery became obsolete, whereas advanced plants maintained the shift through adoption of the basic oxygen process and later continuous casting furnaces.&lt;/p&gt;
&lt;p&gt;Q: What is the main mechanism through which the advanced transfer generated persistent effects?
A: The advanced transfer equipped engineers and supervisors with organizational, technological, and planning knowledge, enabling advanced plants to develop and adopt the basic oxygen steelmaking process independently during China&amp;rsquo;s 1960–1978 period of trade isolation. Advanced plants had a 15.2 percent higher probability of using the basic oxygen process five years after the transfer and a 65.1 percent higher probability twenty years after, relative to basic plants. They also home-fabricated continuous casting furnaces, making them 26.7 to 78.4 percent more likely to use such furnaces 10 to 20 years after the transfer; basic plants showed no differential advantage over comparison plants on this measure.&lt;/p&gt;
&lt;p&gt;Q: What role did trade openness play in the divergence between basic and advanced plants?
A: Once China opened to international trade from 1978, advanced plants relied dramatically less on imported foreign capital than basic plants — likely because they had developed domestic production capabilities. At the same time, advanced plants exported 45.5 percent more steel and produced 51.1 percent more steel above international quality standards than basic plants. Basic plants showed no differential imports of foreign capital or differential exports relative to comparison plants, suggesting that once both types could access foreign machinery, basic plants lost any remaining productivity edge.&lt;/p&gt;
&lt;p&gt;Q: What were the human capital effects of the advanced transfer?
A: Over time, advanced plants opened training schools for high-skilled technicians and offered within-firm training programs for engineers. As a result, advanced plants employed more engineers and high-skilled technicians and fewer low-skilled workers than basic plants, while the human capital composition did not differentially change between basic and comparison plants. At the county level, universities hosting advanced plants were 10.4 percent more likely to offer STEM degrees, had 16.8 percent more technical schools, 14.3 percent more STEM college graduates, and 17.6 percent more high-skilled workers than counties hosting basic plants.&lt;/p&gt;
&lt;p&gt;Q: Did the government differentially favor basic or advanced plants after the Split?
A: The paper finds no evidence of special government favor. Government transfers and loans were not differentially allocated to basic or advanced plants in either the short or long run. Distance from railroads and roads did not change differentially across plant types. Measures of political connection and politician quality at the prefecture level showed no significant differences across the three groups in the 40 years after the Soviet transfer. County-level total investment and investments in related and unrelated industries were also statistically indistinguishable.&lt;/p&gt;
&lt;p&gt;Q: What were the intra-firm spillover effects?
A: Steel plants in the same firm as advanced plants increased their steel production by 24.9 percent and were 22.1 percent more productive relative to plants in the same firm as basic plants, after the Soviet transfer. Plants in the same firm as basic plants showed no differential performance relative to plants in the same firm as comparison plants. The within-firm spillovers appear driven by the transmission of new technologies and production methods through formal within-firm training programs, as supported by historical records.&lt;/p&gt;
&lt;p&gt;Q: What were the horizontal spillover effects across firms?
A: Steel plants in the same counties as advanced plants produced 12.9 percent higher output and were 12.4 percent more productive than those in counties hosting basic plants, after the transfer. They were more likely to adopt basic oxygen converters and continuous casting furnaces, and from 1978 they exported significantly more and produced more steel above international quality standards, mirroring the patterns of the advanced plants themselves.&lt;/p&gt;
&lt;p&gt;Q: What were the vertical spillover effects?
A: Steel plants in counties hosting nonsteel basic plants produced 14.2 percent more steel than those in counties hosting nonsteel comparison plants, suggesting some output spillover from basic machinery. However, only plants in counties of advanced nonsteel plants experienced a productivity increase — estimated at 16.4 percent — relative to plants in counties of basic nonsteel plants. These supply-chain firms were also the only ones to show increased adoption of basic oxygen and continuous casting furnace technology and differential engagement in trade.&lt;/p&gt;
&lt;p&gt;Q: How did market liberalization reforms interact with the spillover effects?
A: Starting in the late 1990s, privatized firms economically related to advanced plants outperformed their counterparts in terms of value added, TFPR, and exports, while state-owned firms in the same counties no longer showed a competitive advantage. New private firms locating in counties that had hosted advanced plants received an additional performance gain. At the county level, counties hosting advanced plants had on average 16.6 percent more private firms and 25.2 percent more privately-produced industrial output than counties hosting basic plants. The mechanism appears to be the stock of industry-specific human capital concentrated in those counties, which private firms could draw on once allowed to compete for workers.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated aggregate contribution of the Soviet transfer to Chinese growth?
A: Province-level regressions show that each additional basic project increased province-level output by 1.1 percent per year on average, and each additional advanced project by 6.2 percent per year. A back-of-the-envelope calculation implies that without both transfer types, Chinese real GDP per capita growth between 1953 and 1978 would have been 7 to 19 percent lower.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rule out selection on unobservable characteristics?
A: Using the Oster (2019) methodology, the paper finds that for the treatment effects to become statistically insignificant, selection on unobserved variables would need to be 8 to 19 times larger than selection on observed variables — a range the authors characterize as implausible given the strong balancing on observables and the historical documentation of delay causes.&lt;/p&gt;
&lt;p&gt;Q: How does this paper differ from Heblich et al. (2020), which also studies Sino-Soviet technology transfer?
A: Heblich et al. (2020) study long-run negative spillovers of the 156 Projects on counties that hosted them relative to counties that were geographically suitable but ultimately not selected, focusing on an outside-the-program comparison. This paper instead exploits within-program variation — differences across the three plant types — using plant-level data to assess short-, medium-, and long-run direct effects and spillover effects of different transfer intensities.&lt;/p&gt;
&lt;p&gt;Basic Transfer: The provision of Soviet state-of-the-art machinery, equipment, blueprints, geological surveys, and plant construction assistance — duplicating a whole Soviet plant — without accompanying human capital or organizational training.&lt;/p&gt;
&lt;p&gt;Advanced Transfer: The full Soviet technology and know-how package: basic machinery provision plus multi-year visits of Soviet experts who taught Chinese engineers and production supervisors organizational, technological, and planning methods based on &amp;ldquo;scientific management&amp;rdquo; principles.&lt;/p&gt;
&lt;p&gt;Comparison Plants: Plants approved under the 156 Projects that received neither Soviet machinery nor technical assistance due to delays compounded by the Split, and which continued operating with traditional domestic technology.&lt;/p&gt;
&lt;p&gt;156 Projects: An array of 139 approved, technologically advanced, large-scale, capital-intensive industrial facilities whose construction the Soviet Union agreed to support between 1950 and 1957 as part of the Sino-Soviet Alliance, representing 45.7% of Chinese GDP in 1949.&lt;/p&gt;
&lt;p&gt;Tacit Knowledge: Industry- and firm-specific knowledge embodied in workers and organizations — including operational methods, quality-control procedures, and process innovation capabilities — that cannot be transferred through capital goods alone and requires extensive on-the-job training from foreign experts.&lt;/p&gt;
&lt;p&gt;Basic Oxygen Process: A steelmaking process innovation that became predominant in the 1960s by blowing oxygen through molten pig iron to reduce carbon content; adopted by advanced plants through endogenous process development, while basic plants showed no differential adoption relative to comparison plants.&lt;/p&gt;
&lt;p&gt;Source Text Origin: The paper&amp;rsquo;s classification scheme for the grounding of evidence — in this case, full working paper text obtained from NBER WP 29455, enabling comprehensive summary of quantitative results, mechanisms, and robustness tests.&lt;/p&gt;</description></item><item><title>Tell Me Something I Don't Already Know: Learning in Low- and High-Inflation Settings</title><link>https://macropaperwarehouse.com/papers/tell-me-something-i-dont-already-know-learning-in-low-and-high-inflation-settings/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/tell-me-something-i-dont-already-know-learning-in-low-and-high-inflation-settings/</guid><description>&lt;p&gt;This paper uses randomized control trials (RCTs) applied over time in multiple countries to study whether the economic environment — specifically the level of inflation — affects how agents learn from new information. The main finding is that as inflation rose in advanced economies, both households and firms became more attentive to and informed about publicly available news about inflation, causing them to respond less to exogenously provided information about inflation and monetary policy in the RCT treatments. When agents are already well-informed about the current inflation environment (because high inflation makes it salient), additional information provision moves their beliefs less — the marginal value of information is decreasing in prior attentiveness. Complementary evidence from Uruguay (persistently high inflation) and New Zealand (persistently low inflation) confirms the cross-sectional pattern: agents in high-inflation environments have stronger priors and respond less to information treatments. The results imply that central bank communication interventions are more effective during low-inflation periods when agents are less informed.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-the-rct-design-identify-the-effect-of-the-inflation-environment"&gt;Q1. How does the RCT design identify the effect of the inflation environment?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The paper exploits the within-country time variation in inflation — running comparable RCT information treatments before, during, and after the 2021-2023 surge in multiple countries — to identify whether the same information treatment has different effects depending on the prevailing inflation level.&lt;/strong&gt; By holding the content of the information treatment constant and varying the macroeconomic environment, the paper isolates how environment-driven changes in agent attentiveness mediate the treatment effect.&lt;/p&gt;
&lt;h3 id="q2-why-do-agents-respond-less-to-information-when-inflation-is-high"&gt;Q2. Why do agents respond less to information when inflation is high?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When inflation is high and salient, agents actively monitor publicly available inflation news, reducing their prior uncertainty; the Bayesian posterior shift from a given information signal is smaller when the prior is more concentrated.&lt;/strong&gt; This mechanism — decreasing marginal value of information with prior precision — means that information provision campaigns are subject to diminishing returns as the macroeconomic environment itself provides more signal.&lt;/p&gt;
&lt;h3 id="q3-what-does-this-imply-for-central-bank-communication-strategy"&gt;Q3. What does this imply for central bank communication strategy?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Central bank forward guidance and information campaigns are most effective when agents have diffuse priors — i.e., during periods of low, stable inflation when inflation is not salient to households and firms.&lt;/strong&gt; During high-inflation episodes, agents become more sophisticated but also more resistant to updating on any individual information signal, making communication a weaker tool precisely when the inflation challenge is most acute.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;prior attentiveness&lt;/strong&gt; : the degree to which agents actively monitor publicly available information about inflation; the paper shows this rises with inflation level, reducing the marginal value of additional information provided by RCT treatments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;decreasing marginal value of information&lt;/strong&gt; : the property that additional information moves beliefs less when agents already have precise priors; the mechanism explaining why RCT treatment effects are smaller in high-inflation environments.&lt;/p&gt;</description></item><item><title>Testing Mechanisms</title><link>https://macropaperwarehouse.com/papers/testing-mechanisms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/testing-mechanisms/</guid><description>&lt;p&gt;Kwon and Roth develop econometric tests for the &amp;ldquo;sharp null of full mediation&amp;rdquo;: the hypothesis that a treatment D affects an outcome Y only through a specified mechanism (or set of mechanisms) M, with no direct pathway. Rather than attempting the more demanding task of identifying average direct and indirect effects — which typically requires strong assumptions about how M is assigned — the paper asks whether full mediation is consistent with the data at all, and if not, how large the alternative mechanisms are.&lt;/p&gt;
&lt;p&gt;The key theoretical observation is that under the sharp null of full mediation, together with independence of D and monotonicity of M in D, the treatment D satisfies the conditions for a valid instrumental variable for the local average treatment effect (LATE) of M on Y. This equivalence means that existing tools for testing IV validity with binary endogenous treatment can be applied off-the-shelf when both D and M are binary. The paper then extends this framework to the general case where M is a p-dimensional vector with finite support, and where the researcher can impose arbitrary restrictions on the distribution of compliance types θ_{lk} = P(M(0)=m_l, M(1)=m_k) — including monotonicity, relaxations allowing a bounded share of defiers, elementwise monotonicity for multidimensional M, or no restrictions.&lt;/p&gt;
&lt;p&gt;The testable implications of the sharp null require that there exists type shares θ̃ in the identified set Θ_I such that sup_A Δ_k(A) ≤ Σ_{l≠k} θ̃_{lk} for all k, where Δ_k(A) is the treatment-control difference in the probability of the compound outcome {Y∈A, M=m_k}. The intuition is that any positive treatment effect on this compound outcome can only be driven by compliers, not by always-takers who under the sharp null have both fixed M and fixed Y. Because Θ_I is characterized by linear constraints when R is, verifying the testable implications reduces to a linear program. The paper proves these implications are sharp: if satisfied, there exists a joint distribution of potential outcomes consistent with the data and the sharp null. The paper also derives sharp lower bounds on ν_k = P(Y(1,m_k) ≠ Y(0,m_k) | M(1)=M(0)=m_k), the fraction of k-always-takers whose outcome is affected despite having the same mediator value under both arms.&lt;/p&gt;
&lt;p&gt;For inference, the testable implications are reformulated as moment inequalities and the Cox-Shi (2022) test is recommended based on Monte Carlo simulations calibrated to the empirical applications, which find close-to-nominal size across nearly all designs (null rejection probability no larger than 9% for a 5% test), with the exception of settings with only 40 clusters where CS is over-sized at 0.15 but recovers with 80 clusters.&lt;/p&gt;
&lt;p&gt;The methodology is illustrated in two RCT applications. In Bursztyn, González, and Yanagizawa-Drott (2020), where an information treatment about other men&amp;rsquo;s beliefs is randomized in Saudi Arabia and the outcome is wives&amp;rsquo; job applications, the sharp null that effects operate only through job-search service sign-up is rejected (p=0.02, CS test); the lower bound on the fraction of never-takers affected despite no change in sign-up is at least 11%, compared to an overall ATE of 0.12, with the lower bound remaining positive for defier shares up to 7%. In Baranov et al. (2020), where cognitive behavioral therapy for new mothers is randomized and the outcome is financial empowerment at seven-year follow-up, the sharp null is rejected for grandmother presence alone (p=0.02, lower bound ≥19% of never-takers affected) and for relationship quality alone (p=0.03, lower bound ≥10% of always-takers affected); however, when both mechanisms are considered jointly, the sharp null cannot be rejected at conventional levels (p=0.65), indicating the data are statistically consistent with the combination of these two mechanisms fully explaining the treatment effect.&lt;/p&gt;
&lt;p&gt;Scope conditions: the main results assume D is randomly assigned (extended in Section 5 to IV, conditional unconfoundedness, and distributional difference-in-differences settings) and M has finite support. An R package, TestMechs, accompanies the paper.&lt;/p&gt;
&lt;p&gt;Q1: What is the sharp null of full mediation and how does it differ from standard mediation analysis objectives?
The sharp null posits that Y(d,m) depends only on m and not on d — that is, Y(0,m) = Y(1,m) almost surely for all m — meaning the treatment affects the outcome exclusively through its effect on M. Standard mediation analysis seeks to decompose the average treatment effect into average direct and indirect components, which requires identifying the causal effect of M on Y and thus typically imposes sequential unconfoundedness or an instrument for M. The sharp null test asks only whether any direct effect exists for any individual, which is answerable without identifying the causal effect of M on Y and therefore under substantially weaker assumptions.&lt;/p&gt;
&lt;p&gt;Q2: What is the core identification insight connecting mediation testing to IV validity testing?
Under the sharp null of full mediation, combined with independence of D and monotonicity of M(d) in d, the treatment D satisfies exactly the LATE assumptions as an instrument for the effect of M on Y. Consequently, testable implications of the LATE assumptions — developed in Kitagawa (2015), Huber and Mellace (2015), and Mourifié and Wan (2017) — translate directly into testable implications of the sharp null when both D and M are binary. This equivalence allows researchers to apply off-the-shelf IV validity tests for mechanism testing with no additional methodological development in the binary-binary case.&lt;/p&gt;
&lt;p&gt;Q3: What are the sharp testable implications of the sharp null in the general multi-valued, multi-dimensional M case?
The sharp testable implications require that there exists a vector of type shares θ̃ in the identified set Θ_I (consistent with observed marginal distributions of M|D and the researcher&amp;rsquo;s restrictions R) such that sup_A Δ_k(A) ≤ Σ_{l≠k} θ̃_{lk} for all k, where Δ_k(A) = P(Y∈A, M=m_k|D=1) − P(Y∈A, M=m_k|D=0). The intuition is that any positive treatment effect on the compound outcome 1{Y∈A, M=m_k} can only be driven by compliers transitioning into state k; always-takers have fixed M=m_k and under the sharp null also have fixed Y, so they contribute zero. The testable implications are proved to be sharp: if they hold, there exists a joint distribution of potential outcomes consistent with the data and the sharp null.&lt;/p&gt;
&lt;p&gt;Q4: How does the paper quantify the magnitude of violation when the sharp null is rejected?
The paper derives sharp lower bounds on ν_k = P(Y(1,m_k) ≠ Y(0,m_k) | M(1)=M(0)=m_k), the fraction of k-always-takers whose outcome is affected by the treatment despite having the same mediator value under both arms. The lower bound is θ_{kk}·ν_k ≥ (sup_A Δ_k(A) − Σ_{l≠k} θ_{lk})₊, which is sharp in the sense that there exists a distribution of potential outcomes achieving equality. Appendix B.1 additionally derives bounds on ADE_k = E[Y(1,m_k)−Y(0,m_k)|M(1)=M(0)=m_k], the average direct effect for k-always-takers.&lt;/p&gt;
&lt;p&gt;Q5: How is inference conducted and which test is recommended?
Because the test statistic involves the solution to a linear program whose constraints depend on the data, and sup_A Δ_k(A) can be non-differentiable in the data-generating process — making standard bootstrap methods invalid — the paper reformulates the testable implications as moment inequalities of the form H₀: ∃ω s.t. C₁ω − C₂p ≥ 0, where C₁ and C₂ are known matrices and p collects observable conditional probabilities. Methods from the moment inequality literature (Andrews, Roth, and Pakes, 2023; Cox and Shi, 2022; Fang, Santos, Shaikh, and Torgovitsky, 2023) are then directly applicable. Cox and Shi (2022) is recommended as a default based on Monte Carlo evidence.&lt;/p&gt;
&lt;p&gt;Q6: What do the Monte Carlo simulations reveal about size and power?
Across nearly all simulation designs calibrated to the two empirical applications, the ARP, CS, and K tests achieve close-to-nominal size, with null rejection probabilities no larger than 9% for a nominal 5% test. The notable exception is settings with only 40 independent clusters, where CS is over-sized with a null rejection probability of 0.15; doubling to 80 clusters restores approximate size control. For power, CS performs similarly to or better than ARP across all designs, with the advantage being substantial in some cases, particularly with multi-valued M. The FSST test can be substantially over-sized in settings with small or moderate numbers of clusters.&lt;/p&gt;
&lt;p&gt;Q7: What does the Bursztyn et al. (2020) application find?
The treatment is random assignment of information about other men&amp;rsquo;s beliefs about women working outside the home in Saudi Arabia; the mediator is job-search service sign-up (binary); the outcome is whether the wife applies for jobs three to five months later. The sharp null is rejected with p=0.02 (CS test), establishing that the information treatment affects long-run labor market outcomes through pathways other than mechanical service sign-up. The lower bound on the fraction of never-takers affected despite no change in sign-up is at least 11%; the estimated average direct effect for these never-takers ranges from 0.11 to 0.18, compared to an overall ATE of 0.12. The lower bound remains positive for defier shares up to 7% of the population (0.33 defiers per complier), providing robustness to violations of monotonicity.&lt;/p&gt;
&lt;p&gt;Q8: What does the Baranov et al. (2020) application find?
The treatment is cognitive behavioral therapy for pregnant women and new mothers (randomized RCT); the outcome is an index of financial empowerment at seven-year follow-up. For the binary mechanism of grandmother presence in the household, the sharp null is rejected (CS p=0.02) with a lower bound of at least 19% of never-takers affected. For relationship quality with husband (1-5 scale, under monotonicity that CBT improves the relationship), the sharp null is rejected (CS p=0.03) with a pooled lower bound of at least 10% of always-takers affected. When both mechanisms are considered jointly as a vector M, the sharp null cannot be rejected (CS p=0.65) and the lower bound on the fraction of always-takers affected is 7%, indicating the data are statistically consistent with the combination of these two mechanisms fully explaining the CBT effect on financial empowerment.&lt;/p&gt;
&lt;p&gt;Q9: How does the framework accommodate relaxations of monotonicity?
The paper allows the researcher to specify arbitrary closed non-empty subsets R of the simplex as restrictions on type shares θ. Monotonicity in the binary case corresponds to R = {θ∈Δ: θ_{10}=0}, ruling out defiers. A relaxation allows up to d̄ fraction of the population to be defiers (θ_{10} ≤ d̄). In the Bursztyn et al. (2020) application, the estimated lower bound on ν_k remains positive for d̄ up to 0.07. One can also completely remove monotonicity by setting R = Δ, though this yields less informative bounds. For multidimensional M, elementwise monotonicity imposes that each dimension of M(d) is increasing in d.&lt;/p&gt;
&lt;p&gt;Q10: How does the paper extend to non-experimental settings?
Section 5 shows that results extend whenever the distributions of (Y^tot(d), M(d)) are identified through strategies other than direct randomization of D. Under a standard IV setup with binary instrument Z for D, the LATE of D on Y and D on M are identified for instrument-compliers, and the same testable implications apply within this subpopulation. Under conditional unconfoundedness D ⊥ (Y(·,·), M(·)) | X with overlap, distributions are identified via propensity-score reweighting. Under distributional difference-in-differences (Athey and Imbens, 2006; Callaway and Li, 2019; Roth and Sant&amp;rsquo;Anna, 2023), counterfactual distributions of Y and M for treated units are identified, enabling the same testing approach.&lt;/p&gt;
&lt;p&gt;Q11: What is the paper&amp;rsquo;s relationship to the principal stratification literature?
The k-always-takers — those with M(1)=M(0)=m_k — correspond directly to principal strata (Frangakis and Rubin, 2002). The bounds on ADE_k derived in Appendix B.1 match those of Lee (2009), Flores and Flores-Lagunes (2010), and Zhang and Rubin (2003) in the special case of binary M under monotonicity, and extend them to non-binary M and relaxations of monotonicity. The primary focus of the present paper is the sharp (Fisherian) null that ν_k = 0 for all k — that is, no always-taker is affected — which is strictly stronger than the weak null of zero average direct effect studied in the principal stratification literature.&lt;/p&gt;
&lt;p&gt;Q12: What are the limitations and directions for future work identified by the authors?
The analysis is restricted to discrete M; while M can be discretized under assumptions described in Remark 3, testing the sharp null directly for continuous M remains an open question for future work. The framework does not impose restrictions on the magnitude of M&amp;rsquo;s effect on Y or on the degree of endogeneity of M, and incorporating such restrictions could yield sharper testable implications. Extension to non-binary treatments D is also identified as a direction for future research.&lt;/p&gt;
&lt;p&gt;Sharp null of full mediation: The hypothesis that Y(0,m) = Y(1,m) almost surely for all m in the support of M — i.e., the treatment D affects the outcome Y exclusively through its effect on M, with no direct effect on any individual&amp;rsquo;s outcome. This is a Fisherian sharp null, strictly stronger than a zero average direct effect.&lt;/p&gt;
&lt;p&gt;k-always-takers: Individuals for whom M(1)=M(0)=m_k — those whose mediator value equals m_k regardless of treatment assignment. Under the sharp null, these individuals&amp;rsquo; outcomes must be unaffected by the treatment. They constitute the principal stratum with fixed mediator value m_k and generalize the always-taker and never-taker concepts from the binary LATE framework.&lt;/p&gt;
&lt;p&gt;ν_k (fraction of always-takers affected): ν_k = P(Y(1,m_k) ≠ Y(0,m_k) | M(1)=M(0)=m_k), the fraction of k-always-takers whose outcome is affected by the treatment despite having the same mediator value under both arms. Under the sharp null ν_k = 0 for all k; a large ν_k indicates strong alternative mechanisms operating outside of M for always-takers with mediator value m_k.&lt;/p&gt;
&lt;p&gt;Type shares θ_{lk}: The fractions of the population of each compliance type, θ_{lk} = P(M(0)=m_l, M(1)=m_k). These generalize the LATE compliance categories (always-takers, never-takers, compliers, defiers) to the multi-valued mediator setting. The vector θ may be only partially identified when M is non-binary, with the identified set Θ_I characterized by linear constraints matching observed marginal distributions of M|D.&lt;/p&gt;
&lt;p&gt;Δ_k(A): The treatment-control difference in the probability of the compound outcome {Y∈A, M=m_k}: Δ_k(A) = P(Y∈A, M=m_k|D=1) − P(Y∈A, M=m_k|D=0). The supremum of Δ_k(A) over all sets A is the key estimable quantity that appears in both the testable implications and the lower bounds on ν_k.&lt;/p&gt;
&lt;p&gt;Identified set Θ_I: The set of type-share vectors θ̃ consistent with the observed marginal distributions of M|D=0 and M|D=1, and with the researcher&amp;rsquo;s restrictions on compliance types R. When R is characterized by linear constraints (as in all main examples), Θ_I is a polytope and optimization over it — required for implementing the testable implications — is a linear program.&lt;/p&gt;
&lt;p&gt;TestMechs R package: The accompanying software implementation of the inference methods and lower bound estimators developed in the paper, designed to facilitate empirical application of the tests.&lt;/p&gt;</description></item><item><title>The Architecture of Social Networks and the Diffusion of Innovations</title><link>https://macropaperwarehouse.com/papers/the-architecture-of-social-networks-and-the-diffusion-of-innovations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-architecture-of-social-networks-and-the-diffusion-of-innovations/</guid><description>&lt;p&gt;This paper examines how the architecture of social networks shapes the success or failure of technology diffusion when adoption decisions exhibit strategic complementarities. The research question is: which structural feature of a network determines whether a new technology spreads or fails, and in which direction does that feature work?&lt;/p&gt;
&lt;p&gt;The paper builds on the canonical threshold diffusion model of Morris (2000) and Granovetter (1978), in which an agent adopts a new technology if the share of his neighbors who have adopted exceeds a threshold Q in [0,1]. The key innovation is the addition of a second structural object — a set of decision-making units C — that captures the empirically common phenomenon that subsets of agents (friends, family, neighbors, colleagues) can coordinate and make joint adoption decisions. The model is purely theoretical; the paper derives characterizations and comparison theorems rather than estimating parameters from data.&lt;/p&gt;
&lt;p&gt;The central structural concept introduced is insularity: the extent to which agents concentrate their connections to a narrow set of other agents, rather than distributing connections broadly. A formal partial order over networks is defined: network {w̃} is less insular than network {w} if there is no local increase in insularity in {w̃} relative to {w}, where a local increase in insularity occurs when one agent&amp;rsquo;s proportionate connections to a narrow set S are strictly higher and another agent&amp;rsquo;s proportionate connections to a superset R are strictly lower (with the first agent&amp;rsquo;s share of S weakly exceeding the second agent&amp;rsquo;s share of R). Moving from a network toward a convex combination with the complete network strictly reduces insularity under this definition (Lemma 3).&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s main characterization result (Proposition 1) establishes that the set of non-adopters of technology Q is precisely SQ — the maximal (1−Q)-subgroup-cohesive set — defined as the largest set in which every decision-making unit C contained in SQ has at least one agent with at least fraction (1−Q) of his connections inside SQ. This extends Morris&amp;rsquo;s (2000) cohesion characterization to the joint-decision setting.&lt;/p&gt;
&lt;p&gt;The main theorem (Theorem 1) establishes that for any two societies sharing the same decision-making structure C but differing in network insularity, there exists a cutoff threshold mu in [0,1] such that: (i) for technologies with Q &amp;lt; mu, adoption is weakly higher in the less insular network; and (ii) for technologies with Q &amp;gt;= mu, adoption is weakly lower in the less insular network. The direction reversal at mu reflects two competing mechanisms. Insular connections hinder singleton diffusion: an agent over-connected to a narrow set will not adopt individually until others in that set adopt, blocking entry of the technology from outside. But insular connections facilitate joint adoption: the same over-connectedness makes it profitable for the group to adopt together if they can coordinate, because each member already has a high share of neighbors within the group. High-threshold technologies depend crucially on joint adoption cascades and so benefit from insularity; low-threshold technologies spread person-to-person and are impeded by insularity when agents cannot coordinate.&lt;/p&gt;
&lt;p&gt;Proposition 2 establishes a complementary monotonicity result: expanding the set of decision-making units (C subset of C&amp;rsquo;) weakly increases adoption for any technology and any network, because joint decision-making resolves local coordination failures.&lt;/p&gt;
&lt;p&gt;The main result is extended to heterogeneous thresholds (Section 7). Proposition 3 shows that Theorem 1 continues to hold when agent-specific idiosyncratic components theta_i are bounded within an interval [−gamma/2, gamma/2] for some gamma &amp;gt; 0. Proposition 4 characterizes the necessary conditions for the main result to break: the specification fails only if there exist two agents i and j with theta_i &amp;gt; theta_j + Q2 − Q1, meaning the idiosyncratic gap between them exceeds the difference between the two technology thresholds being compared.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central research question?
A: The paper asks how the architecture of a social network — specifically the structure of agents&amp;rsquo; connections — determines whether a new technology spreads widely or fails to diffuse. It focuses on technologies with strategic complementarities, where an agent&amp;rsquo;s benefit from adopting depends on neighbors adopting and those neighbors&amp;rsquo; benefit depends on their neighbors, creating potential for both snowballing and coordination failure.&lt;/p&gt;
&lt;p&gt;Q: What is the key modeling innovation relative to the standard threshold model?
A: The paper adds a set of decision-making units C, a collection of subsets of agents each of which can make a joint adoption decision. In the standard Morris (2000) model, only individual agents decide; here, groups such as friends, family, or neighbors can collectively agree to adopt, resolving their local coordination problem. The set C is subject only to closure under subsets and inclusion of all singletons, making the framework highly flexible.&lt;/p&gt;
&lt;p&gt;Q: How does the diffusion process work formally?
A: At each period t &amp;gt;= 1, agent i adopts if either: (1) more than fraction Q of his neighbors adopted in period t−1 (singleton adoption), or (2) i belongs to a decision-making unit C not yet adopted, and for every j in C the fraction of j&amp;rsquo;s neighbors in A_{t−1} union C exceeds Q (joint adoption). Actions are irreversible, and Appendix C proves this irreversibility assumption is without loss of generality for the final adoption set under myopic best-response dynamics.&lt;/p&gt;
&lt;p&gt;Q: What is the characterization of non-adopters (Proposition 1)?
A: The set of agents who do not adopt technology Q equals SQ, the unique maximal (1−Q)-subgroup-cohesive set — the largest set S such that every decision-making unit C contained in S has at least one member i with Pi(S minus C) &amp;gt;= (1−Q), meaning at least fraction (1−Q) of i&amp;rsquo;s connections remain inside S outside of C. This extends Morris (2000)&amp;rsquo;s p-cohesion concept: when C contains only singletons, (1−Q)-subgroup cohesion collapses to (1−Q)-cohesion in Morris&amp;rsquo;s sense.&lt;/p&gt;
&lt;p&gt;Q: What does the simple eight-agent example illustrate?
A: With two four-clique subgraphs (agents 1-4 and 5-8), Network A has agents 1, 3, 5, 7 each holding 3/4 of their connections within their four-agent group; Network B reduces those within-group shares to 5/8 by weakening two within-group links from weight 1 to weight 1/2 and adding cross-group links of weight 1/2. For Q = 3/10: in Network B all eight agents adopt (group {1,2,3,4} adopts jointly at t=1, then agents 5 and 7 adopt as singletons at t=2, agents 6 and 8 at t=3), while in Network A only {1,2,3,4} adopt (agents 5-8 each have only 1/4 of neighbors adopted, below Q = 3/10). For Q = 7/10: in Network A group {1,2,3,4} adopts jointly (each has 3/4 &amp;gt; 7/10 of neighbors adopting), while in Network B there is zero adoption (agent 3 has only 5/8 &amp;lt; 7/10 of neighbors in the joint group). This is the concrete illustration of the threshold-dependent reversal in Theorem 1.&lt;/p&gt;
&lt;p&gt;Q: What is insularity and how is it formally defined?
A: Insularity is the extent to which agents concentrate their connections to a narrow set of others. A local increase in insularity in {w} relative to {w̃} occurs when, for some agents i and j and sets S subset of R: (1) Pi(S) is strictly higher in {w} and Pj(R) is strictly lower in {w}, and (2) Pi(S) &amp;gt;= Pj(R) in {w}. Network {w̃} is less insular than {w} if no local increase in insularity exists in {w̃} relative to {w}. Lemma 3 establishes that the lambda-convex combination of any non-complete network with the complete network is strictly less insular.&lt;/p&gt;
&lt;p&gt;Q: What is the main theorem (Theorem 1) and its precise statement?
A: For two societies sharing the same decision-making structure C but differing in network insularity — with {w̃} strictly less insular than {w} — there exists a cutoff mu in [0,1] such that: for Q &amp;lt; mu, adoption is weakly higher in the less insular network; and for Q &amp;gt;= mu, adoption is weakly lower in the less insular network. The cutoff mu depends on the specific networks and decision-making structure. The result is a clean reversal: less insular is better for low-threshold technologies and worse for high-threshold technologies.&lt;/p&gt;
&lt;p&gt;Q: What are the two competing mechanisms driving Theorem 1?
A: First, insular connections hinder individual diffusion: an agent with a high share of connections concentrated inside a set will not adopt as a singleton until others in that set adopt, blocking entry of the technology from outside via individual contagion. Second, insular connections facilitate joint adoption: precisely because an agent has a high share of connections to a narrow group, jointly adopting with that group is profitable — each member has enough neighbors already within the group to exceed the threshold when the group adopts together. For high-threshold technologies, joint adoption is the only viable mechanism, so the second effect dominates; for low-threshold technologies, singleton diffusion suffices and the first effect dominates.&lt;/p&gt;
&lt;p&gt;Q: How does joint decision-making affect adoption (Proposition 2)?
A: Expanding the set of decision-making units from C to any C&amp;rsquo; containing C weakly increases adoption of technology Q for any network and any Q. The proof shows that the non-adopter set SQ under C&amp;rsquo; is also (1−Q)-subgroup cohesive under C, making it a subset of non-adopters under C. The economic logic is that any group able to make a joint decision can solve its local coordination problem: agents who individually would not adopt because too few neighbors have adopted may collectively adopt if each would benefit from group adoption.&lt;/p&gt;
&lt;p&gt;Q: How robust is Theorem 1 to heterogeneous thresholds?
A: Proposition 3 shows that Theorem 1 extends with the same cutoff structure when each agent i has an idiosyncratic threshold component theta_i in [−gamma/2, gamma/2] for sufficiently small gamma &amp;gt; 0. Proposition 4 establishes the necessary condition for the result to break with unbounded heterogeneity: there must exist agents i and j with theta_i &amp;gt; theta_j + Q2 − Q1, meaning the idiosyncratic gap must strictly exceed the technology threshold gap being compared. The underlying intuition of Theorem 1 persists even when the precise specification fails.&lt;/p&gt;
&lt;p&gt;Q: What are the policy and managerial implications?
A: A firm with a low-threshold technology should target less insular societies to maximize uptake, while a firm with a high-threshold technology should target more insular societies; the paper cites Facebook&amp;rsquo;s initial launch within closed university networks as consistent with the high-threshold logic. Policymakers and firms can increase adoption by encouraging joint decision-making — sanitation campaigns that organize neighborhood workshops, family mobile-plan discounts, or online coordination platforms all work through this channel. Conversely, governments trying to suppress collective action such as protest can prohibit in-person gatherings or online communication to prevent joint decision-making. The paper notes results abstract from seeding, leaving optimal seeding under joint decision-making as a future research direction.&lt;/p&gt;
&lt;p&gt;Insularity: The extent to which agents concentrate their connections to a narrow set of other agents rather than distributing connections broadly; formally defined via a partial order based on local increases in agents&amp;rsquo; proportionate connections to nested sets S subset of R.&lt;/p&gt;
&lt;p&gt;Decision-making unit: A set C of agents who can make a joint decision to adopt together; the collection C of all decision-making units is closed under subsets and contains all singletons, capturing informal group coordination among friends, family, or neighbors.&lt;/p&gt;
&lt;p&gt;p-Subgroup cohesion: A set S is p-subgroup cohesive if every decision-making unit C contained in S (of any size, including singletons) is p-connected in S — meaning at least one agent in C has at least fraction p of his connections to S minus C; the paper&amp;rsquo;s generalization of Morris (2000)&amp;rsquo;s p-cohesion to settings with joint decision-making.&lt;/p&gt;
&lt;p&gt;Threshold of adoption (Q): A parameter Q in [0,1] summarizing a technology&amp;rsquo;s strategic complementarities, such that an agent is better off adopting if and only if more than fraction Q of his neighbors adopt; low Q means the technology is valuable even with few adopters, high Q means it requires near-universal neighborhood adoption.&lt;/p&gt;
&lt;p&gt;Local increase in insularity: A pairwise comparison between two networks: {w} exhibits a local increase in insularity relative to {w̃} when one agent&amp;rsquo;s proportionate connections to narrow set S are strictly higher and another agent&amp;rsquo;s proportionate connections to superset R are strictly lower in {w}, with the first agent&amp;rsquo;s share of S weakly exceeding the second agent&amp;rsquo;s share of R in {w}.&lt;/p&gt;
&lt;p&gt;SQ (maximal non-adopter set): The unique maximal (1−Q)-subgroup-cohesive set in a society, constituting exactly the agents who do not adopt technology Q in the final outcome; it is the union of all (1−Q)-subgroup-cohesive sets and is itself (1−Q)-subgroup-cohesive (Lemma 2, Proposition 1).&lt;/p&gt;</description></item><item><title>The Confederate Diaspora</title><link>https://macropaperwarehouse.com/papers/the-confederate-diaspora/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-confederate-diaspora/</guid><description>&lt;p&gt;This paper investigates how white migration out of the postbellum South diffused Confederate culture and entrenched racial norms across the United States during a critical juncture of westward expansion and post-Civil War reconciliation. The central question is whether the &amp;ldquo;Confederate diaspora&amp;rdquo; — Southern white migrants who left the former Confederacy from 1870 to 1900 — causally shaped the geography of Confederate memorialization, white supremacist organizations, racial violence, and long-run racial inequity outside the South.&lt;/p&gt;
&lt;p&gt;Using complete-count U.S. Census records from 1870–1900 and linked Census records from the Census Linking Project, the authors track nearly one million white migrants from former Confederate states, including more than 61,000 former enslavers and 127,000 of their household kin, who settled outside the South by 1900. By 1900, migrants from the former Confederacy comprised on average 2.2% of the population in destination counties. Four outcomes measuring Confederate culture at the county level are constructed: Confederate memorialization (monuments, place names, schools), United Daughters of the Confederacy (UDC) chapters, Ku Klux Klan (KKK) chapters, and lynchings of Black people.&lt;/p&gt;
&lt;p&gt;The primary identification strategy is a shift-share instrumental variable (SSIV) that combines the cross-sectional distribution of Southern white migrants across non-Southern counties in 1870 (shares) with predicted migration flows out of each Southern state between 1870 and 1900 (shifts). The predicted shifts are constructed from origin-county economic and ideological push factors estimated via LASSO, insulating the IV from endogenous location sorting. Conditional on the 1870 Southern white population share, the SSIV identifies the distinct causal influence of the postbellum Confederate diaspora.&lt;/p&gt;
&lt;p&gt;Main findings are large relative to the diaspora&amp;rsquo;s modest population share. Moving from zero to the mean Confederate diaspora share implies an 8 percentage point (p.p.) increase in the likelihood of KKK activity relative to a mean prevalence of 35% in non-Southern counties. Effects on post-1900 lynching events are even larger proportionally: a 4 p.p. increase in likelihood relative to a mean of only 5%. IV estimates for Confederate memorialization show that a 1 p.p. increase in the Southern white share in 1900 raised the likelihood of memorialization by 3.4 p.p. (after controlling for the 1870 share), relative to a baseline prevalence of 25% outside the South. Effects on UDC chapters are similarly large given the organization&amp;rsquo;s limited non-Southern footprint (present in only 10% of counties). IV estimates consistently exceed OLS estimates, consistent with economic sorting biasing OLS downward.&lt;/p&gt;
&lt;p&gt;Beyond Confederate symbolism, the diaspora also contributed to a novel form of racial exclusion: the &amp;ldquo;sundown town.&amp;rdquo; A 1 p.p. increase in the Confederate diaspora share in 1900 led to a 2.4 p.p. increase in the likelihood of Black depopulation (defined as towns with at least 25 Black residents in 1870 having zero Black residents after 1900).&lt;/p&gt;
&lt;p&gt;Former slaveholders, though only about 6% of Confederate migrants, played an outsized role. They disproportionately sorted into frontier counties and into positions of public authority — more than twice as likely to work as lawyers or judges and nearly three times as likely to work in public administration as the average non-slaveholding Southern white migrant. Their cultural influence was especially pronounced in frontier communities where institutions were weak and norms malleable. In Denver, first-generation Southern white migrants were 11% more likely to join the KKK than men with no Southern heritage, with a similar differential observed for second-generation migrants.&lt;/p&gt;
&lt;p&gt;The diaspora&amp;rsquo;s effects persist into the 21st century: counties with larger Confederate diasporas in 1900 exhibit larger racial wage gaps, greater residential segregation, higher rates of Black incarceration, higher rates of police-induced Black mortality, and more conservative racial attitudes among whites, as measured in modern survey data. These long-run findings are identified using the same county-level SSIV strategy. Scope conditions: effects are larger in frontier counties (weaker institutions, more malleable norms), in counties with fewer Union Army enlistees, and in newly incorporated areas with fewer than 2 residents per square mile in 1860.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why does it matter?
A: The paper asks whether postbellum Southern white migration causally diffused Confederate culture — memorialization, organized white supremacy, and racial violence — beyond the South, and whether this early cultural transplantation has persistent effects on racial inequity today. It matters because Confederate monuments and persistent Black disadvantage in labor, housing, and policing are often attributed to the legacies of slavery within the South; this paper shows the mechanism by which those norms spread nationally through internal migration at a critical juncture of westward expansion and post-war reconciliation.&lt;/p&gt;
&lt;p&gt;Q: How large was the Confederate diaspora, and who comprised it?
A: Estimates from linked Census records suggest that nearly one million whites left the former Confederacy for the rest of the U.S. in the three decades after the war, including more than 61,000 former enslavers and 127,000 of their household kin. By 1900, migrants from the former Confederacy averaged 2.2% of the population in non-Southern destination counties. The diaspora hailed primarily from the upper South — Virginia, Tennessee, and North Carolina — and later from Texas, Arkansas, and Oklahoma.&lt;/p&gt;
&lt;p&gt;Q: How do the authors construct the shift-share instrumental variable, and what identifying assumption does it require?
A: The SSIV multiplies each Southern origin state&amp;rsquo;s 1870 settlement shares across non-Southern counties (the shares) by predicted total Southern white outflows from 1870 to 1900 (the shifts), where the predicted shifts are constructed by summing LASSO-selected origin-county push factors — economic conditions, cotton and tobacco potential, Civil War battle locations, Black population share — rather than actual flows. The exclusion restriction requires that these predicted push-factor-driven outflows affect destination county outcomes only through the Confederate diaspora they deliver, not through direct economic linkages with origin counties. Conditioning on the 1870 Southern white share absorbs time-invariant destination heterogeneity correlated with antebellum settlement.&lt;/p&gt;
&lt;p&gt;Q: What are the IV estimates for Confederate memorialization and UDC chapters?
A: A 1 p.p. increase in the Southern white share in 1900 raised the likelihood of Confederate memorialization by 3.4 p.p. after controlling for the 1870 share (relative to a baseline prevalence of 25% outside the South). For UDC chapters, which were present in only 10% of non-Southern counties, IV estimates show similar or larger proportional effect sizes. IV estimates are consistently more than twice the size of OLS estimates, consistent with downward bias from economic sorting of Southern whites toward productive, culturally-diverse destinations.&lt;/p&gt;
&lt;p&gt;Q: What are the IV estimates for KKK activity and Black lynchings, and how are they interpreted?
A: A 1 p.p. increase in the Southern white share in 1900 raised the likelihood of KKK chapter presence by 3.5 p.p. (controlling for 1870 shares), relative to a mean KKK prevalence of 37% in non-Southern counties, implying that moving from zero to the mean diaspora share is associated with an 8 p.p. increase in the probability of KKK activity. For Black lynchings, the corresponding IV estimate is 1.5 p.p. (column 5), with the effect rising when earlier migration is controlled, against a mean prevalence of only 5% — implying moving from zero to the mean raises lynching likelihood by 4 p.p. Critically, the authors find no diaspora effect on white lynchings, which distinguishes racially-targeted violence from a generalized Southern culture of violence.&lt;/p&gt;
&lt;p&gt;Q: What is a &amp;ldquo;sundown town&amp;rdquo; and what does the paper find about the diaspora&amp;rsquo;s role in producing them?
A: Sundown towns, described in historical research by Loewen (2005), are all-white towns where Black residents and other minorities were excluded from residing after sunset, spreading throughout the non-South from 1890 to 1960 and representing a novel form of racial exclusion distinct from de jure Jim Crow institutions. The authors find that a 1 p.p. increase in the size of the Confederate diaspora in 1900 led to a 2.4 p.p. increase in the likelihood of Black depopulation — defined as towns with at least 25 Black residents in 1870 having zero Black residents after 1900 — changing the geography of Black settlement throughout the 20th century.&lt;/p&gt;
&lt;p&gt;Q: What role did former slaveholders specifically play, and how are their effects separately identified?
A: Former slaveholders comprised just over 6% of the Confederate migrant sample but played an outsized role: they were about 50% more likely than the average Southern white migrant to work in any public-facing authority occupation, more than twice as likely to work as lawyers or judges, and nearly three times as likely to work in public administration. Their effects are identified using an analogous SSIV that, conditional on the instrumented overall diaspora, draws on distinct identifying variation in slaveholder-specific push factors. Former slaveholders gravitated toward Western, lower-density, cotton-suitable counties with higher Breckinridge vote shares and fewer Union Army soldiers, consistent with seeking to reconstruct antebellum hierarchies in malleable frontier spaces.&lt;/p&gt;
&lt;p&gt;Q: Why were effects stronger in frontier counties?
A: The paper finds that diaspora impacts on Confederate culture diffusion were significantly larger in counties along the frontier, where state institutions were weak and cultural norms not yet deeply ingrained. Restricting the sample to counties with fewer than 2 residents per square mile in the 1860 Census yields somewhat larger estimates than baseline, and the differential sorting of Southern whites (especially former slaveholders) into these nascent communities suggests that institutional malleability amplified the cultural entrepreneurs&amp;rsquo; influence. Fewer Union Army enlistees in destination counties also amplified effects, as those families might otherwise have opposed resurgent Confederate ideology.&lt;/p&gt;
&lt;p&gt;Q: How did the diaspora transmit its norms to subsequent generations and non-Southern neighbors?
A: In the Denver metropolitan area, using newly digitized KKK membership records, first-generation Southern migrants were 11% more likely to join the KKK than men with no Southern heritage, and a similar differential holds for second-generation migrants (born in the diaspora), with patterns holding within Census enumeration blocks. White men without Southern heritage living next door to first- or second-generation Southern whites were significantly more likely to join the KKK, consistent with horizontal cultural spillovers. For naming patterns, non-Southern white parents who moved to counties with a larger Confederate diaspora gave their later-born children names more evocative of Confederate heroes than those given to earlier-born children — providing direct evidence of cultural spillovers beyond the diaspora.&lt;/p&gt;
&lt;p&gt;Q: What long-run effects of the diaspora are documented through the 21st century?
A: Using the county-level SSIV strategy, the paper finds that a larger Confederate diaspora in 1900 is associated with larger racial wage gaps, greater residential segregation, higher rates of Black incarceration, and higher rates of police-induced Black mortality through the 21st century. These disparities are mirrored in more conservative racial attitudes among whites in these counties as measured in modern survey data. These persistent effects suggest that, despite racially progressive national policy reform since the 1960s, locally institutionalized mechanisms reinforced by a culture of racial animus continue to generate inequity.&lt;/p&gt;
&lt;p&gt;Q: How robust are the main estimates to alternative specifications?
A: The authors show robustness across: (i) alternative spatial standard errors using Conley (1999) distance-based clustering and Adao et al. (2019) shift-share inference corrections; (ii) Belloni et al. (2014) double LASSO control selection; (iii) replacing predicted shifts with actual shifts; (iv) a random-shifts placebo where fewer than 5% of coefficients are significant; (v) dropping individual origin or destination states one-by-one (all estimates remain significant with 97% positive Rotemberg weights); (vi) excluding border states with antebellum slavery (Delaware, Kentucky, Maryland, Missouri, West Virginia), which actually increases estimates; and (vii) restricting to newly incorporated counties with near-zero 1860 populations, which yields somewhat larger effects.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s contribution to the culture-institutions literature?
A: The paper uses granular data on migration, occupational choices, and local governance to shed light on the historical process by which Confederate &amp;ldquo;cultural entrepreneurs&amp;rdquo; captured early institutions across America, illustrating how culture and institutions reinforce each other during critical junctures of nation-building. The findings suggest that laws to reduce racial discrimination may have limited impact where a culture of racial animus is ingrained in local institutions — an institutionalized persistence mechanism that helps explain the gap between formal legal reforms and observed racial outcomes. The paper also identifies a prestige-biased cultural transmission channel, consistent with Henrich and Gil-White (2001), wherein non-elite masses emulate former slaveowners in positions of power.&lt;/p&gt;
&lt;p&gt;Confederate diaspora: The approximately one million white migrants, including more than 61,000 former enslavers and 127,000 of their household kin, who left former Confederate states for the rest of the U.S. in the three decades after the Civil War, comprising on average 2.2% of destination county populations by 1900 and retaining strong cultural attachments to the Confederacy.&lt;/p&gt;
&lt;p&gt;Confederate culture: A cluster of symbolic and material expressions that coalesced in the postbellum South, encompassing Lost Cause narratives (glorifying Confederate figures and reframing secession as a defense of states&amp;rsquo; rights rather than slavery), public memorialization (monuments, place names, school names), United Daughters of the Confederacy chapters, Ku Klux Klan activity, and lynchings of Black people — together functioning as technologies to transmit white supremacist norms and maintain racial hierarchies.&lt;/p&gt;
&lt;p&gt;Lost Cause: A revisionist narrative emerging after the Civil War that sought to redeem the image of the South by offering noble rationalizations for secession — emphasizing Northern aggression and states&amp;rsquo; rights while downplaying slavery — and portraying enslaved people as content and slaveowners as generously paternalistic; central to the ideology propagated by the UDC and to Confederate memorialization.&lt;/p&gt;
&lt;p&gt;Shift-share instrumental variable (SSIV): An identification strategy that combines the 1870 distribution of Southern white migrants across non-Southern counties (shares, reflecting historical migration networks) with predicted total Southern white outflows from 1870 to 1900 constructed from origin-county push factors via LASSO (shifts), to isolate exogenous county-level variation in Confederate diaspora exposure that is insulated from endogenous location sorting.&lt;/p&gt;
&lt;p&gt;Sundown town: An all-white municipality where Black residents and other minorities were excluded from residing after sunset, spreading throughout the non-South from 1890 to 1960, operationalized in this paper as towns with at least 25 Black residents in 1870 having zero Black residents after 1900 (Black depopulation), representing a novel form of racial exclusion distinct from de jure Jim Crow institutions associated with the Confederacy.&lt;/p&gt;
&lt;p&gt;Prestige-biased cultural transmission: An evolutionary transmission mechanism, formalized in Henrich and Gil-White (2001), in which non-elite populations emulate culturally salient leaders; invoked in this paper to explain how former slaveholders in positions of authority could diffuse Confederate norms to non-Southern whites who had no direct connection to the Confederacy.&lt;/p&gt;
&lt;p&gt;Cultural entrepreneur: A migrant (especially a former slaveholder) who, by sorting into positions of public-facing authority — judges, lawyers, law enforcement, clergy, public administrators — at early stages of community formation when institutions are most malleable, actively embeds cultural norms into nascent local institutions, amplifying influence beyond their small population share.&lt;/p&gt;</description></item><item><title>The crowding-in effects of local government debt in China</title><link>https://macropaperwarehouse.com/papers/the-crowding-in-effects-of-local-government-debt-in-china/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-crowding-in-effects-of-local-government-debt-in-china/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how changes in the &lt;em&gt;composition&lt;/em&gt; (not the size) of Chinese local government debt influence bank risk-taking, credit allocation between privately owned enterprises (POEs) and state-owned enterprises (SOEs), and local total factor productivity. The focus is a 2015 debt-to-bond swap program in which local governments were required to convert outstanding implicit debt — primarily bank loans to local government financing vehicles (LGFVs) and LGFV-issued corporate bonds — into explicitly guaranteed local government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Institutional Context&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Following China&amp;rsquo;s 2008–09 fiscal stimulus, local government debt outstanding rose from 5.8% of GDP in 2006 to 22% by 2013 and reached RMB 15.4 trillion (24% of GDP) by end-2014. The debt was largely held through LGFVs, which are nominally corporate firms but with implicit government backing. Under China&amp;rsquo;s amended budget law effective early 2015, all outstanding debt had to be converted to provincial government bonds through a three-year swap program. Before the swap, government bonds accounted for only 8% of outstanding local government debt; the remaining 92% (approximately RMB 14.17 trillion) needed to be swapped. Commercial banks hold on average 88% of newly issued local government bonds; the government bond share of commercial bank assets rose from 1.7% in 2014 to 14% in 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under Basel III capital adequacy ratio (CAR) regulations, Chinese commercial banks — specifically the Big Five systemically important banks using the internal-ratings-based (IRB) approach — assign risk weights above 80% on average to corporate loans, but only 20% (the regulatory approach) to local government bonds. Converting LGFV debt to government bonds therefore reduces banks&amp;rsquo; risk-weighted assets, loosening the binding CAR constraint. The paper formalizes this through a partial-equilibrium model of bank portfolio choice: a lower risk weight on government-bond assets (modeled as a fall in ξ_g) loosens an effective capital constraint, inducing banks to shift toward riskier (POE) lending and reducing the POE-SOE loan rate spread. The model predicts this effect is larger in provinces with higher initial outstanding government debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The empirical analysis uses: (1) confidential loan-level data from one of the Big Five Chinese commercial banks covering approximately 400,000 unique firm-loan pairs from 2008:Q1 to 2017:Q4 (regression sample 2013:Q1–2017:Q4); (2) province-level outstanding debt data at end-2014 for 25 provinces, constructed from prefectural-level data collected by Qu et al. (2023); and (3) firm-level balance sheet data from China&amp;rsquo;s Annual Survey of Industrial Firms (ASIF), covering above-scale manufacturing firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using a triple-difference (DDD) identification — interacting POE status, a post-2015 dummy, and provincial initial government debt — the paper finds:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;At the average level of provincial government debt, the debt swap program reduced the POE credit spread (loan rate deviation from benchmark rate, relative to SOEs) by approximately &lt;strong&gt;3.18 percentage points&lt;/strong&gt; (coefficient α = −3.182, significant at p &amp;lt; 0.01).&lt;/li&gt;
&lt;li&gt;For provinces with initial outstanding debt &lt;strong&gt;one standard deviation above the mean&lt;/strong&gt; (approximately 0.402 log units above mean), the swap reduced the POE credit spread by an additional &lt;strong&gt;1.15 percentage points&lt;/strong&gt; (= 0.402 × 2.849; coefficient β = −2.849, significant at p &amp;lt; 0.01), accounting for 10.1% of the standard deviation of loan rates in the sample.&lt;/li&gt;
&lt;li&gt;In terms of the raw loan rate gap between SOEs and POEs (averaging 42 basis points in the sample), the program narrowed this spread by approximately 6 basis points in high-debt provinces (one standard deviation above mean), accounting for about 1/7 of the average gap.&lt;/li&gt;
&lt;li&gt;On the extensive margin, in provinces with outstanding debt one standard deviation above the mean, the swap raised the &lt;strong&gt;probability of bank lending to POE firms&lt;/strong&gt; by approximately &lt;strong&gt;1.2 percentage points&lt;/strong&gt; (= 0.402 × 0.0292).&lt;/li&gt;
&lt;li&gt;2SLS estimates instrumenting swapped debt by initial outstanding debt interacted with the post-2015 dummy confirm: one standard deviation increase in swapped debt leads to an &lt;strong&gt;11.21% decline&lt;/strong&gt; in the POE loan rate deviation from benchmark relative to SOEs (= 3.723 × 3.013%), accounting for 0.98 standard deviations of the loan rate variable.&lt;/li&gt;
&lt;li&gt;For provincial total factor productivity (TFP), provinces with 1% higher outstanding government debt before the swap experienced a &lt;strong&gt;2.2% larger increase in TFP&lt;/strong&gt; after 2015. The debt swap amount itself (instrumented) has a positive and significant effect on provincial TFP.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions and Parallel-Trends Validation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pre-trend tests show that neither the average POE-SOE rate spread (α_τ) nor its interaction with provincial government debt (β_τ) is significantly different from zero in 2014 relative to the base year 2013. Both turn significantly negative only from 2015 onward, validating the parallel-trends assumption. Results are robust to: excluding LGFV firms, excluding large firms (top 10% by assets), restricting to central SOEs as controls (dropping local SOEs), controlling for local debt capacity, GDP growth, FDI/GDP, aged population, total loans, and bank branch fixed effects. A placebo test using the 2016 deleveraging policy shows no significant effect on bank risk-taking, distinguishing the debt-swap mechanism from contemporaneous policy changes.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the key theoretical channel through which the debt-to-bond swap affects bank lending to POEs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The channel is the risk-weighting mechanism under Basel III capital adequacy ratio (CAR) regulations. Under the IRB approach used by Big Five banks, corporate loans carry average risk weights above 80%, while local government bonds carry a fixed regulatory weight of 20%. Converting LGFV corporate loans and bonds to local government bonds on the bank&amp;rsquo;s balance sheet reduces total risk-weighted assets, loosening the binding CAR constraint. The bank responds by adopting a riskier investment policy — lowering the cutoff ω̂ in the model — which increases lending to POE firms and reduces the POE-SOE credit spread.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why is the effect of the swap predicted to be larger in provinces with higher initial outstanding government debt?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 2 of the model shows that the sensitivity of the POE loan rate spread to the debt swap policy (∂²ΔR_loan / ∂ξ_g ∂g) is positive, meaning it increases with the amount of government debt g. Provinces with more outstanding debt at end-2014 have more LGFV loans to swap into lower-risk-weight bonds, implying a larger reduction in risk-weighted assets for banks operating in those provinces and hence a larger relaxation of the CAR constraint. Empirically, the correlation between province-level outstanding debt and the amount of swapped debt from 2015–2017 is 0.85 (p-value &amp;lt; 0.0001), confirming the mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the empirical specification identify the effect of the debt swap rather than pre-existing trends?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use a triple-difference (DDD) design: the outcome (loan rate deviation from benchmark) is regressed on the interaction POE × Post × GovDebt, where GovDebt is the demeaned log of province-level outstanding debt at end-2014. Pre-trend analysis (Equation 16) estimates year-specific coefficients α_τ and β_τ using 2013 as the reference year. For 2014, both coefficients are statistically indistinguishable from zero. From 2015 onward, both turn significantly negative at the 95% confidence level, consistent with the debt-swap policy triggering the change and inconsistent with pre-existing differential trends by province debt level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do the authors establish that the risk-taking channel rather than a demand-side story drives the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two complementary exercises address demand versus supply. First, the authors add firm × year-quarter fixed effects, which absorb all firm-level time-varying factors (including loan demand). After removing demand effects, the triple-difference coefficient on GovDebt × POE × Post becomes more negative (−23.66, significant at 5%) than the baseline (−2.849), suggesting demand-side movements are not the source of the finding. Second, adding bank-branch × year-quarter fixed effects to remove supply-side heterogeneity makes the triple-difference term insignificant while leaving the POE × Post coefficient at −2.196 (significant at 5%), implying the result is primarily supply-driven and province-specific supply factors captured by the triple interaction absorb into the branch-level controls.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What heterogeneous effects across firm types provide additional evidence for the risk-taking interpretation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Three dimensions of heterogeneity all point toward bank risk-taking. (a) Size: the credit-easing effect (coefficient on GovDebt × POE × Post) is larger in magnitude for small POEs (by firm assets or by loan size) than for large POEs, consistent with small firms being riskier borrowers. (b) Credit rating: the effect is larger for low-rating POEs (below AA-) than for high-rating POEs, consistent with banks taking on more risk in response to a loosened CAR constraint. (c) Firm-bank distance: the effect is larger for firms located farther from the lending bank branch, where information asymmetry is more severe, consistent with increased bank risk-taking toward harder-to-monitor borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do the authors confirm that the debt swap program is the operative channel rather than the overall regulation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Using the Bertrand-Mullainathan (2001) 2SLS approach, the authors treat the amount of swapped debt (ln(1 + Swap_jy)) as the channel variable, instrumented by GovDebt_j × Post_y (and its interaction with POE_i for the intensive-margin regression). The first-stage results are strong (F-statistics of 158–268), confirming that provinces with more initial outstanding debt swap more debt after 2015. The second-stage results show: (a) on the intensive margin, a one-standard-deviation increase in swapped debt leads to an 11.21% decline in the POE loan rate deviation from benchmark relative to SOEs; (b) on the extensive margin, provinces with more swapped debt show significantly higher probability of POE lending. Both second-stage estimates are significant, confirming the debt swap program as the transmission channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What is the effect of the debt swap on provincial total factor productivity, and through what channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Provinces with 1% higher outstanding government debt before the swap experienced a 2.2% larger increase in average provincial TFP after 2015 (column 2 of Table 13, coefficient = 0.0220, significant at p &amp;lt; 0.01), with the parallel-trend analysis showing no significant pre-2015 differential effect (the 2014 coefficient is 0.00346, insignificant). 2SLS estimates using swapped debt as the channel variable confirm a positive, significant effect of swapped debt on provincial TFP, with a coefficient of 0.0253 (p &amp;lt; 0.01) in the second stage. The mechanism is credit reallocation from less-productive SOEs to more-productive POEs, consistent with POEs having higher average productivity as documented in Hsieh and Klenow (2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the authors rule out that the deleveraging policy (implemented in December 2015) drives the results?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A placebo test replaces the Post_y dummy (equal to 1 from 2015 onward) with DeLevy (equal to 1 from 2016 onward, coinciding with the deleveraging policy). Neither the coefficient on GovDebt × POE × DeLevy nor on POE × DeLevy is statistically significant in the placebo regressions (Table 11). This distinguishes the mechanism from the deleveraging policy and confirms that the debt swap program — not deleveraging — is the source of the credit reallocation to POEs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the authors confirm results are not driven by the debt capacity channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The local government debt reform also regulated debt capacity (the ratio of outstanding debt to a centrally assigned debt limit) for each local government. The authors control for the province-level debt capacity measure (DebtCap_j, the average ratio of local government debt to the debt limit in 2016–2017) alongside the baseline interaction terms. Table 9 shows the baseline results remain valid and significant after including debt capacity controls: the coefficient on GovDebt × POE × Post is −2.210 (p &amp;lt; 0.05) and the POE probability of lending result (coefficient on GovDebt × Post = 0.0277, p &amp;lt; 0.01) both hold, ruling out the debt capacity channel as the driver.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the model predict about the general relationship between capital adequacy requirements and bank risk-taking?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 1 establishes that tightening the capital adequacy ratio requirement (increasing ψ) leads to a safer investment policy (ω̂ increases, meaning the bank sets a higher cutoff before taking risky projects) and a lower leverage ratio. This is the benchmark: the debt swap effectively softens the constraint by reducing risk-weighted assets, analogous to lowering the effective ψ̃, which induces the opposite effect — riskier investment policy (lower ω̂) and lower POE credit spreads. The IRB approach&amp;rsquo;s property that risk weights are higher and increasing in project riskiness (ξ&amp;rsquo;(ω) &amp;lt; 0 and ξ&amp;rsquo;&amp;rsquo;(ω) ≤ 0) is essential for these comparative statics to hold.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Debt-to-Bond Swap Program (2015):&lt;/strong&gt; China&amp;rsquo;s central government program requiring local governments to convert all outstanding non-government-bond debt (primarily bank loans to LGFVs and LGFV-issued corporate bonds) into explicitly guaranteed provincial government bonds over three years starting in 2015. The program covered RMB 15.4 trillion in outstanding debt, of which 92% needed to be converted; by end-2018, approximately 90% of non-government-bond debt had been swapped.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Risk-Weighting Channel:&lt;/strong&gt; The mechanism by which the change in debt composition affects bank lending. Under Basel III&amp;rsquo;s internal-ratings-based (IRB) approach, Chinese Big Five banks assign risk weights above 80% on average to corporate loans but only 20% (the regulatory approach) to local government bonds. Swapping LGFV debt for government bonds reduces the bank&amp;rsquo;s total risk-weighted assets without changing the size of assets, loosening the binding capital adequacy ratio constraint and enabling increased lending to riskier (POE) borrowers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;POE Credit Spread:&lt;/strong&gt; Defined in the paper as the difference between the loan rate for privately owned enterprises (POEs) and that for state-owned enterprises (SOEs), measured as the percentage deviation of each loan&amp;rsquo;s interest rate from the benchmark rate set by the central bank. SOEs are treated as effectively riskless borrowers due to implicit government guarantees; POEs are the riskier counterparts. The paper tracks the POE credit spread as the primary outcome variable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Local Government Financing Vehicles (LGFVs):&lt;/strong&gt; Nominally corporate firms established by Chinese local governments to raise funds for public investment — primarily through bank loans and LGFV-issued corporate bonds (&amp;ldquo;municipal corporate bonds&amp;rdquo;). LGFVs are implicitly backed by local governments but not explicitly guaranteed, so the bank loans and bonds they issue carry higher Basel III risk weights (treated as corporate exposures) than formal government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital Adequacy Ratio (CAR) Constraint:&lt;/strong&gt; The Basel III requirement that a bank&amp;rsquo;s equity capital exceed a minimum fraction ψ of its risk-weighted assets. For systemically important Big Five banks in China, implemented via the IRB approach for corporate loans and the regulatory approach for government bonds since 2012. In the theoretical model, the CAR constraint is binding and determines the bank&amp;rsquo;s effective leverage; relaxing it (by reducing risk-weighted assets) permits the bank to shift toward riskier lending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Internal Ratings-Based (IRB) Approach:&lt;/strong&gt; The Basel III methodology used by the Big Five Chinese banks to calculate risk-weighted assets for corporate loan portfolios. Under this approach, the risk weight is an increasing function of credit risk (higher-risk loans receive higher weights), so the average weight on corporate loans exceeds 80%, and even high-quality loans carry weights above 50%. This contrasts with the fixed 20% regulatory weight assigned to local government bonds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Crowding-In Effect:&lt;/strong&gt; In this paper&amp;rsquo;s usage, the mechanism by which restructuring local government debt composition — specifically, replacing corporate-form LGFV debt with low-risk-weight government bonds — frees up bank capacity to extend credit to private firms (POEs) that would otherwise face higher credit spreads or loan denial. This is framed as the opposite of the standard crowding-out effect (where more government debt squeezes private credit), arising because it is the &lt;em&gt;composition&lt;/em&gt; rather than the &lt;em&gt;size&lt;/em&gt; of government debt that changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Total Factor Productivity (TFP) Reallocation Effect:&lt;/strong&gt; The paper measures provincial average TFP (using the Brandt et al. 2013 methodology) and documents that provinces with more government debt outstanding before the swap experienced larger TFP gains after 2015, attributing this to credit reallocation from less-productive SOEs to more-productive POEs. The effect is interpreted as a reduction in credit misallocation rather than within-firm productivity improvement.&lt;/p&gt;</description></item><item><title>The Dynamics of Internal Migration: A New Fact and its Implications</title><link>https://macropaperwarehouse.com/papers/the-dynamics-of-internal-migration-a-new-fact-and-its-implications/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-dynamics-of-internal-migration-a-new-fact-and-its-implications/</guid><description>&lt;p&gt;Howard and Shao document a new empirical regularity in U.S. internal migration: the t-year interstate migration rate — defined as the share of people living in a different state than they did t years ago — is approximately proportional to the square root of t. The fact is established using the Gies Consumer and Small Business Credit Panel (GCCP), a 15-year panel (2004–2018) covering approximately 1 percent of all Americans with a credit report, and is corroborated in the Panel Survey of Income Dynamics (PSID, 1969–1997), where the square root pattern holds out to a 25-year horizon. The fact is not an artifact of averaging across origins, destinations, cohorts, or age groups: most of the distribution across these cuts is concentrated close to the square root line. It holds for both people under 45 and over 45, and is robust to the choice of time period and inter-state distance.&lt;/p&gt;
&lt;p&gt;The standard moving cost model — in which location choice is a Markov process with i.i.d. extreme-value utility shocks and large bilateral moving costs — is shown (Proposition 1) to imply that the t-year migration rate is approximately proportional to t, not sqrt(t), as moving costs tend to infinity. Simulations confirm the linear pattern persists in calibrated versions of the moving cost model even when adding state variables for prior location, home state, or age.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s main theoretical contribution is the SPACE model (Spatially and Persistently Autocorrelated Epsilons). Rather than imposing moving costs, the SPACE model assumes that person-location match-specific utility is (i) persistent over time, governed by an autocorrelation parameter rho, and (ii) spatially correlated across locations via a generalized extreme-value (cross-nested logit) structure. The model has no moving costs by default. Proposition 3 proves that as rho approaches 1, the ratio of t-year migration to 1-year migration is bounded below by sqrt(t) and above by sqrt(pi/3) * sqrt(t) — a tight bound, since sqrt(pi/3) is approximately 1.023. The calibrated rho-tilde is 0.892, implying a period-to-period autocorrelation of 1 − (1 − rho-tilde)^2 = 0.988.&lt;/p&gt;
&lt;p&gt;The SPACE model replicates bilateral one-year migration flows, matches the decreasing hazard rate of migration conditional on duration of stay, reproduces the distribution of lifetime move counts (including the large fraction who never move and the few percent who move four or more times in 14 years), and outperforms the moving cost model at out-of-sample individual location forecasting: by 2018, the moving cost model&amp;rsquo;s mean Kullback-Leibler divergence reaches approximately 0.12 log-points per observation above the maximum-possible benchmark, versus only 0.014 log-points for the SPACE model.&lt;/p&gt;
&lt;p&gt;Key divergences from the moving cost model arise in four areas. First, moving costs need not be large: the SPACE model rationalizes observed low migration without any moving costs, in contrast to Kennan and Walker&amp;rsquo;s (2011) estimate of average moving costs of $312,146 (2010 dollars), more than six times median household income; when moving costs are added to the SPACE model, they are roughly two orders of magnitude smaller. Second, long-run population elasticities differ sharply: in the SPACE model they remain proportional to bilateral gross migration rates, while in the moving cost model they converge to a static logit proportional to population shares — and population shares and gross migration rates have little empirical correlation, so the long-run elasticities of the two models are essentially uncorrelated across state pairs. Third, adjustment dynamics differ: in the SPACE model a permanent utility shock to Louisiana produces immediate, full population adjustment; in the moving cost model adjustment takes roughly 200 years, with Mississippi overshooting its new steady-state and New York adjusting implausibly slowly. Fourth, welfare inferences are almost reversed: the correlation between log utility changes implied by the two models using U.S. population data is −0.497, with the SPACE model attributing relative utility gains to the South and West and the moving cost model attributing gains to New York and New England.&lt;/p&gt;
&lt;p&gt;Q: What is the square root fact, and which datasets confirm it?
A: The t-year interstate migration rate scales approximately as sqrt(t). It is documented in the GCCP (2004–2018, ~1% of Americans with credit reports) and verified in the PSID (1969–1997), where the pattern holds out to a 25-year horizon. It is not driven by averaging across subgroups: the distribution of the fact across origin-destination pairs, age groups, cohorts, and starting years is concentrated close to the square root line.&lt;/p&gt;
&lt;p&gt;Q: Why does the standard moving cost model fail to match the square root fact?
A: In the moving cost model, location choice is a Markov process with i.i.d. extreme-value shocks. Proposition 1 proves that as the common component of moving costs tends to infinity, the t-year migration rate is proportional to t (linear). Because the model requires large moving costs to rationalize low migration rates, the linear prediction is unavoidable. Simulations of calibrated versions — including variants with home bias, prior-location state variables, or age — confirm the relationship remains approximately linear.&lt;/p&gt;
&lt;p&gt;Q: What is the SPACE model, and why does it generate a square root?
A: The SPACE model replaces moving costs with persistent and spatially correlated person-location match-specific utility. Utility shocks are drawn from a generalized extreme-value (cross-nested logit) distribution that allows spatial correlation, and they are autocorrelated over time with persistence parameter rho. Proposition 3 shows that as rho → 1, the ratio of t-year to 1-year migration is bounded in [sqrt(t), sqrt(pi/3)*sqrt(t)], a tight interval since sqrt(pi/3) ≈ 1.023. The intuition is that when rho is close to 1, the idiosyncratic utility process resembles a random walk, whose standard deviation grows as sqrt(t), causing migration thresholds to be crossed at a sqrt(t) rate.&lt;/p&gt;
&lt;p&gt;Q: What is the calibrated persistence parameter, and what does it imply?
A: The calibrated rho-tilde is 0.892, close enough to 1 to generate the square root fact in simulations. The implied period-to-period autocorrelation of match-specific utility is 1 − (1 − 0.892)^2 = 0.988. This calibration is achieved by solving for the largest eigenvalue of an I×I matrix of conditional migration rates.&lt;/p&gt;
&lt;p&gt;Q: How do the two models compare on individual-level forecasting accuracy?
A: Performance is evaluated using mean Kullback-Leibler divergence from the maximum-achievable log likelihood. Both models perform similarly in 2005, but by 2018 the moving cost model&amp;rsquo;s KL divergence reaches approximately 0.12 log-points per observation, while the SPACE model&amp;rsquo;s reaches only 0.014 log-points — roughly an order of magnitude better — leaving little room for improvement.&lt;/p&gt;
&lt;p&gt;Q: How large are implied moving costs under each model?
A: Kennan and Walker (2011) estimate average moving costs of $312,146 in 2010 dollars, exceeding six times the median household income. The baseline SPACE model requires zero moving costs to match observed migration levels. When an augmented SPACE model with both persistence and moving costs is calibrated to match the one-year and ten-year migration rates, the estimated moving costs are approximately two orders of magnitude smaller than those from a moving-cost-only model.&lt;/p&gt;
&lt;p&gt;Q: How do short-run population elasticities compare across models?
A: In both models, the short-run cross-elasticity of population in state i with respect to utility in state j is approximately proportional to the gross migration rate between them. Corollary 1 formalizes this for the SPACE model: dp_i/du_j = −(1/(1−rho)) * m_{i→j} for i ≠ j. This means that in the short run, both models deliver similar predictions for how populations respond to local shocks.&lt;/p&gt;
&lt;p&gt;Q: How do long-run population elasticities differ?
A: In the SPACE model, long-run elasticities remain proportional to bilateral gross migration rates — the same relationship as in the short run. In the moving cost model, Proposition 4 shows that the long-run elasticity converges to the static logit: d(log p_i)/d(v_j) = −2*p_j for i ≠ j, depending only on population shares. Since population shares and gross migration rates are empirically uncorrelated, the long-run elasticities of the two models are essentially uncorrelated across state pairs.&lt;/p&gt;
&lt;p&gt;Q: What do the models predict about the speed of regional adjustment?
A: In the SPACE model, a permanent utility shock to Louisiana causes full, immediate population adjustment in the first period with no further dynamics. In the moving cost model, the same shock generates adjustment lasting roughly 200 years. Mississippi overshoots its long-run steady state in the moving cost model due to high bilateral migration with Louisiana, while New York adjusts especially slowly due to low bilateral migration — a pattern the authors describe as potentially counterintuitive.&lt;/p&gt;
&lt;p&gt;Q: How do the models handle events involving rapid population change, such as Hurricane Katrina?
A: The SPACE model accommodates fast adjustments by assuming rapid utility changes, consistent with the observed sharp decline in Louisiana&amp;rsquo;s population share followed by a small rebound. The moving cost model requires implausible utility assumptions to match these dynamics: it implies that Louisiana utility two years after Katrina was higher than before the hurricane.&lt;/p&gt;
&lt;p&gt;Q: What do the two models infer about which U.S. states have gained or lost relative utility over time?
A: Using exact-hat algebra applied to observed U.S. population changes, the SPACE model infers that the South and West have the largest relative utility gains, while New England and the Rust Belt have the largest relative declines. The moving cost model produces nearly the opposite inference: New York and New England show relative utility gains, while the South and West show declines. The correlation between the log utility changes implied by the two models is −0.497.&lt;/p&gt;
&lt;p&gt;Q: Why do the authors argue that spatially and temporally correlated utility is realistic, not merely a mathematical convenience?
A: Surveys (Jia et al., 2023) show that people primarily cite family and employment considerations as reasons for interstate moves — both are persistent and geographically concentrated. Proximity to family is spatially correlated: if state i is close to one&amp;rsquo;s family, nearby states are also relatively close. Job opportunities in specific industries or skills are geographically clustered. Natural amenities and regional cultures are spatially correlated as well. The authors argue it is harder to defend the i.i.d. assumption of the moving cost model than the SPACE model&amp;rsquo;s correlated structure.&lt;/p&gt;
&lt;p&gt;Q: What is the distinction between moving costs and persistent match-specific utility?
A: A moving cost is a one-time irreversible cost paid upon leaving a location. Persistent match-specific utility implies that the utility change from moving is ongoing, partially reversible upon return, and decays with time away from the original location. The authors argue that many factors labeled &amp;ldquo;moving costs&amp;rdquo; in the literature — such as distance from friends or amenities — are more accurately characterized as persistent and partially reversible utility losses, a distinction previous models could not draw.&lt;/p&gt;
&lt;p&gt;Q: Does the SPACE model replicate the gravity equation for bilateral migration?
A: Yes. Proposition 2 shows that migration from i to j in the SPACE model is given by m_{i→j} = (1 − rho) * p_i * p_j * (1 + tau_ij), where tau_ij captures spatial correlation. This resembles a gravity equation: more spatially correlated location pairs have higher bilateral migration, and higher persistence (higher rho) implies lower overall migration levels.&lt;/p&gt;
&lt;p&gt;Q: Can the SPACE model be embedded in broader quantitative spatial models?
A: Yes. The SPACE model admits closed-form solutions for state populations and bilateral migration flows, is compatible with exact-hat algebra for dynamic counterfactuals, and supports computationally feasible individual-level simulations. Appendix E embeds the SPACE model in a housing model with durable local housing production and shows that slow population adjustment can emerge from housing durability rather than slow migration per se, providing an alternative explanation for regional divergence persistence.&lt;/p&gt;
&lt;p&gt;SPACE model: A model of internal migration featuring Spatially and Persistently Autocorrelated Epsilons — person-location match-specific utility that is both autocorrelated over time (with persistence parameter rho) and spatially correlated across locations via a generalized extreme-value (cross-nested logit) distribution. The model contains no moving costs by default.&lt;/p&gt;
&lt;p&gt;Square root fact: The empirical regularity that the t-year interstate migration rate (share of people living in a different state than t years ago) is approximately proportional to sqrt(t). Documented in GCCP data (2004–2018) and PSID (1969–1997) up to a 25-year horizon.&lt;/p&gt;
&lt;p&gt;Moving cost model: The standard dynamic discrete-choice model of migration in which an agent living in state i chooses location j to maximize u_j − delta_ij + epsilon_j + beta*E[V&amp;rsquo;], where delta_ij is a bilateral one-time irreversible moving cost and epsilon_j is i.i.d. extreme-value. Low migration rates are rationalized by large moving costs (e.g., $312,146 average in Kennan and Walker 2011).&lt;/p&gt;
&lt;p&gt;Persistence parameter (rho): In the SPACE model, rho governs the autocorrelation of match-specific utility over time. The calibrated value is rho-tilde = 0.892, implying period-to-period autocorrelation of 0.988. As rho → 1, the model generates a square root relationship between the t-year migration rate and t.&lt;/p&gt;
&lt;p&gt;Population cross-elasticity: The elasticity of population in state i with respect to utility in state j. In both models it is proportional to gross bilateral migration in the short run. In the long run, the SPACE model retains this proportionality to migration rates, while the moving cost model converges to a static logit proportional to population shares.&lt;/p&gt;
&lt;p&gt;Exact-hat algebra: A solution method for computing counterfactual equilibria in terms of ratios of new to old values (hats), without requiring knowledge of levels. The SPACE model admits simple exact-hat formulas for population changes; the moving cost model&amp;rsquo;s exact-hat algebra additionally requires tracking past population changes.&lt;/p&gt;
&lt;p&gt;Kullback-Leibler divergence (in this context): The mean divergence between a model&amp;rsquo;s predicted distribution over future locations and the empirical distribution, used as a measure of forecasting accuracy. By 2018, the SPACE model achieves KL divergence of 0.014 log-points per observation versus approximately 0.12 for the moving cost model.&lt;/p&gt;</description></item><item><title>The Dynamics of Verification when Searching for Quality</title><link>https://macropaperwarehouse.com/papers/the-dynamics-of-verification-when-searching-for-quality/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-dynamics-of-verification-when-searching-for-quality/</guid><description>&lt;p&gt;This paper develops a dynamic principal-agent model in which a principal seeks to select exactly one project from a stream of possibilities emerging over time, while a biased agent (who wants any project selected, regardless of quality) reports project quality each period. The principal cannot observe quality directly but can pay a cost c to verify it. Monetary transfers are unavailable. The central question is how verification and selection rules should optimally evolve over time as new options arrive.&lt;/p&gt;
&lt;p&gt;The model is set in discrete time with an infinite horizon (extended to finite horizons in Section 6.1). Each period, a project of quality h with probability q = λΔ or quality l with probability 1 − q arrives i.i.d. The principal selects at most once; the agent receives utility 1 from any selection and 0 otherwise; the principal&amp;rsquo;s payoff equals project quality net of verification costs. Both parties share discount factor δ = e^{−ρΔ}.&lt;/p&gt;
&lt;p&gt;When verification costs are low (c ≤ h) and the horizon is effectively infinite, the optimal mechanism exhibits decreasing skepticism: verification of high-quality reports occurs with a probability that is strictly declining over time, hitting zero at an endogenous deadline T* = ⌈(1−q)(δr − l) / (qc(1−δ))⌉. At that deadline, the principal selects any project irrespective of quality. Before the deadline, the agent reports truthfully — proposing only high-quality projects — and is incentivized by the threat of verification catching a lie, which triggers permanent exclusion. As the deadline approaches, the agent&amp;rsquo;s continuation value rises (guaranteed allocation arrives sooner), so the loss from a detected lie grows, and less verification is needed to deter misreporting. The deadline length is weakly increasing in h and r and decreasing in l and c; as c → 0, T* → ∞ and the principal&amp;rsquo;s payoff converges to the first-best of qh/(1−δ(1−q)).&lt;/p&gt;
&lt;p&gt;When verification costs are high (h &amp;lt; c &amp;lt; c̄, where c̄ is an explicitly computed threshold), deterministic selection is suboptimal. The optimal mechanism has two sequential phases: a randomization phase (periods 1 through T_R = ⌊log(h/c)/log(1−q)⌋ + 1) in which the principal randomizes between selecting and never selecting after a high-quality report without any verification, and a subsequent verification phase matching the low-cost structure. Verification is strictly backloaded: the principal never uses both tools simultaneously in the same period, and randomization always precedes verification. The intuition is that verification acts as a reward to the agent (guaranteeing allocation when h is realized), so delaying it allows earlier periods to exploit the prospect of future verification to relax incentive constraints across more periods, accumulating gains that justify the high verification cost.&lt;/p&gt;
&lt;p&gt;When the horizon is short (T ≤ T̄ := ⌊−(1−q)l/(qc)⌋) and l &amp;lt; 0 (static bias), increasing skepticism emerges: verification probability rises toward 1 in the final period. This occurs because a shrinking horizon reduces the agent&amp;rsquo;s continuation value, weakening the punishment for a detected lie, so more verification is required to maintain incentive compatibility. The paper also establishes that under renegotiation-proofness (Ray 1994), the optimal mechanism takes the same qualitative form as the full-commitment case but with permanent exclusion replaced by a mechanism restart. The leading application is board oversight of CEO-proposed acquisitions, motivated by the Smith v. Van Gorkom Delaware Supreme Court ruling; Graham et al. (2020) is cited as broad empirical support for decreasing oversight of CEOs over time.&lt;/p&gt;
&lt;p&gt;Q: What is the core agency conflict in the model?
A: The agent receives utility 1 from any selection regardless of quality, while the principal&amp;rsquo;s payoff equals quality minus verification costs. The agent always prefers immediate selection, while the principal prefers waiting for high quality, formalized by the condition qh + (1−q)l &amp;lt; qh/(1−δ(1−q)). This is &amp;ldquo;dynamic bias.&amp;rdquo; &amp;ldquo;Static bias&amp;rdquo; additionally arises when l &amp;lt; 0, meaning the principal prefers not allocating to allocating a low-quality project; this second source of conflict is more common in static settings.&lt;/p&gt;
&lt;p&gt;Q: What is the endogenous deadline T* and what determines its length?
A: T* = ⌈(1−q)(δr − l)/(qc(1−δ))⌉. It is weakly increasing in h and r (higher upside makes waiting worthwhile), weakly decreasing in l (a less costly low type shortens the horizon), and decreasing in c (cheaper verification makes longer search feasible). The term δr − l reflects the value of an additional quality draw relative to selecting low quality. As c → 0, T* → ∞ and the principal&amp;rsquo;s payoff converges to the first-best.&lt;/p&gt;
&lt;p&gt;Q: Why does the verification probability decline over time under decreasing skepticism?
A: As the deadline T* approaches, the agent&amp;rsquo;s continuation value from truthful play rises because guaranteed allocation is nearer. The loss from having a lie detected — permanent exclusion — therefore grows in absolute expected terms. Since more severe punishment requires less verification to deter misreporting, the minimum verification probability that satisfies the low type&amp;rsquo;s incentive compatibility constraint falls strictly over time, reaching zero exactly at T*.&lt;/p&gt;
&lt;p&gt;Q: When is randomization of the selection rule optimal, and when is verification strictly better?
A: Randomization is optimal if and only if c &amp;gt; h — when verification would guarantee a negative ex-post payoff for the principal. When c ≤ h, replacing randomization probability (1 − p̂_h) with verification probability x_h = 1 − δu_{t+1} maintains incentive compatibility while yielding a net gain to the principal proportional to h − c &amp;gt; 0 per period. The condition c &amp;gt; h is both necessary and sufficient for the randomization-augmented mechanism to dominate.&lt;/p&gt;
&lt;p&gt;Q: Why is verification backloaded when c &amp;gt; h?
A: Verification guarantees allocation whenever h is realized, which is a valuable reward for the agent. Deploying this reward later allows earlier randomization-phase periods to exploit the prospect of future verification to relax incentive constraints across multiple periods, accumulating gains. Moving verification earlier yields the same static cost but foregoes these accumulated gains; thus backloading verification is optimal. The principal never simultaneously randomizes and verifies in the same period.&lt;/p&gt;
&lt;p&gt;Q: What are the two phases in Theorem 2 and how long does each last?
A: The randomization phase runs from period 1 through T_R = ⌊log(h/c)/log(1−q)⌋ + 1; during this phase the principal randomizes allocation after a high-quality report (with the outside-option probability declining toward 0) but never verifies. The verification phase runs from T_R + 1 through a deadline at T* or T* + 1, with verification probability declining over time exactly as in Theorem 1. The total deadline is T* = T_R + ⌊(h − c − (l − δr)/(1−δ))(1−q)/(qc)⌋.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions does increasing skepticism emerge?
A: Increasing skepticism arises when the horizon is finite and short — specifically when T ≤ T̄ = ⌊−(1−q)l/(qc)⌋, which requires l &amp;lt; 0 (static bias present). In this regime, verification probability rises to 1 in the final period T. Before T, the agent&amp;rsquo;s continuation value shrinks as fewer drawing opportunities remain, weakening the punishment for detected lies, so verification must increase to maintain incentive compatibility. Decreasing skepticism necessarily emerges only given a horizon long enough to overcome static bias.&lt;/p&gt;
&lt;p&gt;Q: How does the renegotiation-proofness extension modify the optimal mechanism?
A: Under renegotiation-proofness following Ray (1994), the mechanism cannot indefinitely withhold allocation following a detected lie, because both parties would prefer to restart rather than receive zero forever. The optimal renegotiation-proof mechanism takes the same qualitative form as Theorems 1 and 2, but permanent exclusion is replaced by a restart to the first period whenever a lie is verified during the verification phase or allocation is withheld during the randomization phase after a high-quality report. Deadlines, verification dynamics, and the phase structure are otherwise unchanged.&lt;/p&gt;
&lt;p&gt;Q: What is the three-region form of the value function?
A: Lemma 4 identifies thresholds u_low &amp;lt; u_high such that: for promised utility u ∈ [0, u_low], x_h(u) = 0 (no verification; only randomization); for u ∈ [u_low, u_high], dV/du = h − c (verification is interior, slope equals net benefit of verification); and for u &amp;gt; u_high, x_h(u) + y(u) = 1 (verification is at maximum). The slope h − c is constant on the middle region because increasing verification by ε raises promised utility by qε and the objective by q(h−c)ε, yielding a constant marginal rate.&lt;/p&gt;
&lt;p&gt;Q: What revelation-principle simplifications reduce the problem?
A: Lemmas 1–3 establish: (i) only high-type reports are ever verified (x_l = 0), since verification of the low type cannot improve principal payoffs; (ii) following verified truthfulness, allocation occurs with probability 1 (p*_{hh} = 1); (iii) the high type&amp;rsquo;s incentive constraint never binds in the optimal solution; and (iv) only the low type&amp;rsquo;s incentive compatibility constraint binds. These reduce the optimization to four free variables — x_h, p̂_h, p̂_l, û_l — subject to two binding constraints.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate to Kovac et al. (2013)?
A: The model builds most directly on Kovac et al. (2013)&amp;rsquo;s principal-agent stopping problem, which lacks costly verification. The key addition is the verification technology; the paper shows that when c ≤ h, verification eliminates the need for randomized selection rules that arise in Kovac et al. (2013). Kovac et al.&amp;rsquo;s randomization logic resurfaces in the randomization phase when c &amp;gt; h, and the analysis applies and extends Kovac et al.&amp;rsquo;s innovations.&lt;/p&gt;
&lt;p&gt;Q: What empirical and institutional evidence motivates the model?
A: The Smith v. Van Gorkom Delaware Supreme Court ruling (1985) established that boards must make meaningful efforts to become informed — exercising verification — as part of their duty of care in acquisition approvals; the TransUnion board was found negligent after approving an acquisition following a twenty-minute presentation with no written materials. Graham et al. (2020) provides broad empirical support for decreasing board oversight of CEOs over time, consistent with the paper&amp;rsquo;s decreasing skepticism prediction. Gompers et al. (2020) on VC analysts&amp;rsquo; project evaluation processes also illustrates the general applicability.&lt;/p&gt;
&lt;p&gt;Decreasing skepticism: The property of the optimal mechanism whereby the principal verifies high-quality reports with a probability that strictly declines over time, reaching zero at the endogenous deadline. Reflects diminishing concern about misrepresentation as the agent&amp;rsquo;s continuation value — and thus the cost of a detected lie — rises as the deadline approaches.&lt;/p&gt;
&lt;p&gt;Endogenous deadline (T*): The period at which the principal allocates any project irrespective of quality, ending the mechanism. Determined by T* = ⌈(1−q)(δr − l)/(qc(1−δ))⌉, balancing the value of waiting for additional quality draws against verification costs; weakly increasing in h and r, decreasing in l and c.&lt;/p&gt;
&lt;p&gt;Static bias vs. dynamic bias: Dynamic bias denotes the conflict that the principal prefers waiting for high quality while the agent prefers immediate selection. Static bias is the additional conflict (arising when l &amp;lt; 0) that the principal prefers withholding allocation to selecting a low-quality project, mirroring the agent-prefers-higher-action conflict in standard static models. Decreasing skepticism necessarily obtains absent static bias; static bias may flip dynamics to increasing skepticism if the horizon is short.&lt;/p&gt;
&lt;p&gt;Backloaded verification: The property that when c &amp;gt; h, verification is deployed only after a complete randomization phase, never simultaneously with randomization. Arises because verification acts as a reward to the agent by guaranteeing allocation when high quality is realized, and delaying this reward allows its incentive-relaxation benefits to compound across more randomization-phase periods.&lt;/p&gt;
&lt;p&gt;Randomization phase: The initial phase (periods 1 to T_R) in the high-cost regime, in which the principal randomizes the allocation decision after a high-quality report (outside option selected with declining probability) without using the verification technology. The randomization probability is set to keep the low type indifferent between truthful reporting and misreporting.&lt;/p&gt;
&lt;p&gt;Increasing skepticism: The opposite verification dynamic from decreasing skepticism, arising when the horizon is short (T ≤ T̄) and l &amp;lt; 0 (static bias). Verification probability rises over time toward 1 in the final period, because the agent&amp;rsquo;s continuation value shrinks as drawing opportunities dwindle, weakening the deterrent effect of detection and requiring more frequent verification to maintain incentive compatibility.&lt;/p&gt;
&lt;p&gt;Incentive compatibility via verification: The mechanism through which the principal deters low-type misreporting: by verifying a reported high-quality project with probability x_h, and punishing detected lies with permanent exclusion (or restart under renegotiation-proofness). This strictly dominates selection randomization when c ≤ h because the net per-period gain equals h − c &amp;gt; 0 while maintaining the same incentive compatibility condition for the low type.&lt;/p&gt;</description></item><item><title>The Earnings and Labor Supply of U.S. Physicians</title><link>https://macropaperwarehouse.com/papers/the-earnings-and-labor-supply-of-u.s.-physicians/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-earnings-and-labor-supply-of-u.s.-physicians/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What do U.S. physicians earn, how is that earnings variation structured across geography and specialty, and how much does government healthcare payment policy shape those earnings and — through them — physicians&amp;rsquo; labor supply and long-run talent allocation?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The paper builds a novel administrative panel by merging the universe of U.S. federal individual income tax returns (2005–2017) with: the National Plan and Provider Enumeration System (NPPES) physician registry; Medicare billing records with procedure-level Relative Value Unit (RVU) rates (2012–2017); restricted-use American Community Survey responses; Social Security Administration demographic records; and medical school ranking and graduation data. The main sample covers 11.6 million physician-year observations for 965,000 unique physicians aged 20–70.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Earnings Facts.&lt;/strong&gt; In 2017, average physician total individual income was $350,000 (median $265,000); the distribution is right-skewed — the top 1% of age-40–55 physicians averages $4.0 million. Physicians in aggregate earned $297 billion in pre-tax dollars, equaling 8.6% of total U.S. healthcare spending. The age-earnings profile is steep: earnings are approximately $60,000 during residency, rise to roughly $185,000 by the early thirties, and peak near $425,000 at age 50. Business income — systematically underreported in survey data (ACS estimates are approximately $140,000 lower than tax data during peak career years, almost entirely due to non-reporting of business income) — accounts for nearly one-quarter of earnings at age 50. Earnings differ sharply across specialties: primary care physicians average $201,200 (ages 40–55), about half the sample mean, while surgeons earn roughly twice as much.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographic Pattern.&lt;/strong&gt; Contrary to the pattern for lawyers and workers broadly, physician earnings are not highest on the coasts. A movers-based event study (physicians who changed commuting zones once during 2005–2017) finds that roughly 70% of the cross-location income difference is driven by place rather than worker composition. A two-way fixed-effects variance decomposition reveals pronounced negative physician-location sorting: high-earning physicians tend to locate in lower-income commuting zones, while lower-earning physicians locate in higher-income areas — the opposite of the pattern for lawyers. Medicare&amp;rsquo;s relatively weak adjustment of reimbursement rates for local costs (the empirical elasticity of the Geographic Adjustment Factor to median household income is 0.09, versus 0.33 for a broader local price index) can, by the authors&amp;rsquo; estimates, account for approximately one-third of this unusual geographic earnings pattern.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Government Influence — Medicare Price Changes.&lt;/strong&gt; Using procedure-specific RVU changes as a simulated instrument for each physician&amp;rsquo;s Medicare price exposure, the authors find that a 10% increase in the Medicare price instrument leads to a 2.4% increase in professional earnings of physicians aged 40–55. The behavioral supply response is substantial: physicians bill 4.4% more RVUs (supply elasticity of 0.4 after netting out the mechanical component), of which 3.9% reflects more unique procedures and the rest a shift toward higher-paid procedures. Nearly all of the procedure-level supply increase (3.4 out of 3.8 percentage points) comes from treating additional patients rather than more frequent treatment of existing patients. Converting to pass-through: physicians retain $62 of each $100 in additional Medicare spending directly, or approximately $25 of each $100 of any insurance spending once Medicare&amp;rsquo;s documented spillover into private insurance rates is accounted for. For physicians aged 56–70, a 10% increase in earnings driven by reimbursement changes reduces retirement probability by 0.5 percentage points in that year.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Government Influence — ACA Insurance Expansion.&lt;/strong&gt; Using county-level variation in pre-ACA uninsurance rates (as of 2013) as a source of differential exposure to the ACA&amp;rsquo;s Medicaid expansions and Marketplace subsidies (in 24 states expanding Medicaid in 2014 or early 2015), the authors estimate that a 10 percentage point higher baseline uninsurance rate led to 3.9% higher physician earnings four years post-expansion. Scaling by the first stage (a 10 p.p. higher uninsurance rate translating to 4.96 p.p. higher insurance coverage post-expansion), the implied elasticity of physician earnings to the insurance rate is 0.41. The ACA expansion also reduced retirement probability — a 10 p.p. higher insurance coverage rate leads to a 1 p.p. decline in retirement probability — consistent with a medium-run retirement-to-income elasticity of approximately −1.1. In aggregate, 6% of the $110 billion in annual ACA insurance expansion spending accrued to physicians personally, slightly below their 8.6% baseline share of healthcare spending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Talent Allocation.&lt;/strong&gt; Specialty choice is sticky and entry-restricted. The authors estimate a discrete-choice model of specialty choice using graduates of top-5 medical schools — physicians with effectively unconstrained specialty access — and an aggregate model using USMLE Step 1 score buckets as ability proxies. At the top of the ability distribution, higher specialty earnings strongly attract physicians: increasing primary care physicians&amp;rsquo; hourly income from $98 to $168 per hour (the level of medicine subspecialists) would raise the share of top-5 medical school graduates choosing primary care by approximately 20 percentage points (nearly doubling their representation in primary care). Moving down the USMLE score distribution, the earnings coefficient falls monotonically and turns negative for the lowest score groups — consistent with the model&amp;rsquo;s prediction that entry restrictions cause higher-paying specialties to displace lower-ability applicants as earnings rise, rather than simply attracting more entrants. A more modest counterfactual — raising internal medicine earnings to dermatology levels — raises the average USMLE score in internal medicine by 10 points (from 230.2 to 239.6).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; The earnings estimates are for the period 2005–2017. Pass-through estimates use a short-run price instrument; long-run pass-through may differ depending on private market spillovers and entry. The ACA analysis is restricted to 24 early-expanding states. The specialty-choice model is estimated on medical graduates entering the residency match; the extensive margin of entering medicine itself is not modeled. Health outcome effects of changing physician ability distributions are not estimated.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the level and composition of physician earnings in the tax data, and how do they compare to survey-based estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In 2017, average physician total individual income was $350,000 and median was $265,000; the top 1% of age-40–55 physicians earned $4.0 million on average, more than twice the average of the top 5%. Business income constitutes nearly one-quarter of earnings at age 50 and is concentrated among top earners: 80% of physicians in the top 1% have business income exceeding $25,000, versus 35% overall. ACS survey data for the same physicians underestimate earnings by approximately $140,000 (roughly one-third of the administrative mean) during peak career years, driven entirely by non-reporting of business income on the extensive margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What share of total U.S. healthcare spending do physician earnings represent, and what does this imply for policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Physicians in aggregate earned $297 billion pre-tax in 2017, equaling 8.6% of total U.S. healthcare spending (approximately $913 of the average American&amp;rsquo;s $10,611 annual healthcare expenditure). After applying a 30% income tax rate, after-tax physician earnings equal approximately 6% of total healthcare spending, or roughly 1% of GDP. The authors note this provides an upper bound on the magnitude of savings available from policies aimed at reducing physician incomes as a strategy for lowering overall healthcare spending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the age-earnings profile of physicians evolve, and what drives growth during peak years?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Physician earnings average approximately $60,000 during residency, rise to roughly $185,000 by the early thirties, and peak near $425,000 at age 50, before declining gradually to approximately $270,000 in the late 60s. Growth during peak earning years (ages 40–55) is driven almost entirely by business income: average wages are approximately flat at $285,000 across this age range, while business income and the probability of filing Schedule C rise steadily.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How large and unusual is the geographic pattern of physician earnings, and what is the causal role of location?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Physician earnings are highest in lower-income states (not on the coasts), unlike lawyers and the broader workforce. A movers event study finds that approximately 70% of the cross-commuting-zone income difference is attributable to location rather than worker characteristics; within specialty the estimate rises to approximately 85%. A two-way fixed-effects variance decomposition (with limited-mobility-bias corrections following Andrews et al. 2008 and Kline et al. 2020) reveals pronounced negative physician-location sorting, with the corrected covariance between individual and location effects being 0.6–0.8 times the variance of location effects in magnitude but opposite in sign — a pattern that reverses to positive sorting when the same methods are applied to lawyers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What instrument is used to identify the causal effect of Medicare price changes on physician earnings, and why is it valid?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors construct a physician-year &amp;ldquo;Medicare price instrument&amp;rdquo; by fixing each physician&amp;rsquo;s service mix at its 2012–2017 average and then multiplying those fixed quantities by annually-updated RVU rates, summing over services. Because the fixed quantity weights exclude behavioral responses, and because national RVU changes from CMS periodic reviews affect physicians differentially according to their pre-determined service mix, variation across physicians and over time is plausibly exogenous to individual physicians&amp;rsquo; income shocks. Year-by-specialty fixed effects absorb common specialty-level price trends.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the magnitudes of the earnings and labor supply responses to Medicare price changes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A 10% increase in the Medicare price instrument raises earnings of 40–55 year-old physicians by 2.4% (reduced-form), with a 2SLS elasticity of income to billed RVUs of 0.17. The total-RVU billing coefficient of 1.437 implies a supply elasticity of 0.437 (subtracting 1 for the mechanical component). At the procedure level, a 10% price increase for a specific code leads to 3.8% more billings for that code, of which 3.4 percentage points reflects treating additional patients. For physicians aged 56–70, a 10% earnings increase reduces that year&amp;rsquo;s retirement probability by 0.5 percentage points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the ACA insurance expansion affect physician earnings and retirement, and what is the implied pass-through?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Counties with a 10 percentage point higher pre-ACA uninsurance rate saw 3.9% higher physician earnings by 2017 (four years post-expansion). Scaled by the first stage (4.96 p.p. higher coverage), the elasticity of physician earnings to insurance coverage is 0.41. A 10 p.p. higher insurance coverage rate leads to a 1 p.p. lower retirement probability post-expansion (medium-run elasticity of retirement to income of approximately −1.1). In aggregate, 6% of $110 billion in annual ACA expansion spending — roughly $7.1 billion, or about $8,400 per physician — accrued to physicians.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How does the earnings-specialty choice relationship vary across the physician ability distribution?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the individual-level discrete-choice model estimated on top-5 medical school graduates (likely unconstrained in specialty choice), the coefficient on hourly earnings is 0.014. In the aggregate score-group model, the implied earnings coefficient is 0.016 for USMLE scores above 260 and declines monotonically to −0.008 for scores at or below 190. This negative coefficient for low scorers is consistent with the theoretical prediction that higher earnings attract high-ability physicians, leaving fewer slots for lower-ability applicants due to binding entry restrictions — not a reversal of preferences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What are the quantitative implications for specialty choice if primary care incomes were raised to subspecialty levels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Raising primary care hourly income from $98 to $168 (the level of medicine subspecialists) would increase the share of top-5 medical school graduates choosing primary care by approximately 20 percentage points (about 48% would enter primary care, versus the current share), nearly doubling their representation. Nearly half of these reallocations would come from procedural specialties. An analogous exercise raising internal medicine earnings to dermatology levels shifts the average USMLE score in internal medicine from 230.2 to 239.6 — a 10-point increase — as higher-scoring applicants displace lower-scoring ones within a fixed slot constraint.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What is the pass-through from Medicare reimbursements to physician earnings, and how does it compare to rent-sharing elsewhere?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Direct estimates imply physicians retain $62 of each $100 in additional Medicare spending. Accounting for Medicare&amp;rsquo;s documented spillover into private insurance rates (following Clemens and Gottlieb 2017), the pass-through drops to $25 per $100 of total insurance spending. The authors note this is substantially higher than the modest rent-sharing found for average workers in response to firm-level shocks (Card et al. 2018), but comparable to rent-sharing with high-skilled workers benefiting from patent rents (Kline et al. 2019).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Can Medicare&amp;rsquo;s geographic pricing policy explain the unusual geographic earnings pattern for physicians?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The elasticity of Medicare&amp;rsquo;s Geographic Adjustment Factor (GAF) to commuting zone median household income is 0.09, compared to 0.33 for a broader local price index. Using the authors&amp;rsquo; short-run estimate that a 10% increase in Medicare prices raises earnings by 2.4%, a counterfactual simulation shows that if the GAF-to-income elasticity rose to 0.33 (aligning Medicare rates with the general cost-of-living gradient), the geographic physician earnings pattern would more closely resemble that of lawyers. The authors estimate that the gap in Medicare&amp;rsquo;s local cost adjustment explains approximately one-third of the unusual physician earnings geography, conditional on the short-run pass-through estimate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the theoretical model of specialty choice and entry restrictions guide the empirical predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The model features a unit mass of physicians with heterogeneous ability (Pareto-distributed) and idiosyncratic specialty preferences (exponentially distributed). Physicians choose whether to specialize in period 1; government sets reimbursement rates in period 2; physicians choose labor supply in period 3. With a fixed number of residency slots, higher specialty earnings raise the ability cutoff for entry (rationing by ability). This generates a key nonmonotonic empirical prediction: higher-ability physicians respond positively to earnings increases (choosing a specialty more frequently), while lower-ability physicians respond negatively (displaced by the shift upward in the ability cutoff). The model also implies that demand shocks are not moderated by contemporaneous entry, so incumbents capture the full rent — motivating the estimated pass-through.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Medicare Price Instrument (Simulated RVU Instrument).&lt;/strong&gt; A physician-year measure of Medicare payment exposure constructed by holding each physician&amp;rsquo;s service mix fixed at its 2012–2017 average and multiplying those fixed quantities by time-varying national RVU rates, then summing across services. This purges the instrument of behavioral responses, creating exogenous cross-physician variation in price exposure arising from the interaction of fixed service mix with national RVU policy changes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Relative Value Unit (RVU).&lt;/strong&gt; The unit by which Medicare defines and reimburses each physician service in the Physician Fee Schedule. RVUs are intended to reflect the time, effort, and resources required to provide each service, but are subject to periodic review by CMS&amp;rsquo;s RVU Update Committee (RUC) and influenced by political factors. Changes in RVUs translate directly into changes in Medicare reimbursement rates for affected services.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pass-Through (Reimbursement to Earnings).&lt;/strong&gt; The share of an additional dollar of Medicare (or insurance) spending that accrues to physicians personally as earnings, after accounting for practice costs, intermediaries, and behavioral responses. The paper estimates $62 per $100 of direct Medicare spending or $25 per $100 of total insurance spending (the latter accounting for Medicare&amp;rsquo;s spillover into private rates).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Negative Physician-Location Sorting.&lt;/strong&gt; The empirical finding — robust to limited-mobility-bias corrections — that higher-ability (higher-earning) physicians disproportionately locate in lower-income commuting zones, while lower-earning physicians concentrate in higher-income areas. This is the opposite of the pattern for lawyers and for worker-firm matching in the broader labor literature. The paper attributes part of this pattern to Medicare&amp;rsquo;s incomplete geographic adjustment of reimbursement rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ability Cutoff (am) in Residency Matching.&lt;/strong&gt; In the paper&amp;rsquo;s theoretical model, the minimum ability level required to gain entry into a restricted-entry specialty. Because the number of residency slots is fixed, the cutoff rises when a specialty&amp;rsquo;s relative earnings increase (attracting more high-ability applicants), displacing lower-ability physicians who would otherwise have entered. This makes the earnings-specialty relationship nonmonotonic across the ability distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Business Income (Pass-Through Entity Income).&lt;/strong&gt; Income from physician-owned practices organized as sole proprietorships, S-corporations, or partnerships, reported on Schedule C or through pass-through entities rather than on Form W-2. In the tax data, business income accounts for nearly one-quarter of physician earnings at career peak and is the main source of earnings for top physicians, but is systematically underreported in survey data (ACS), leading to a roughly one-third underestimate of total earnings during peak years.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Geographic Adjustment Factor (GAF).&lt;/strong&gt; A Medicare policy parameter that multiplies the national RVU rate to adjust physician reimbursements for local input costs (specifically physicians&amp;rsquo; work, practice expenses, and malpractice). The paper documents that the GAF&amp;rsquo;s elasticity to local median household income is 0.09 — far below the 0.33 elasticity of the general local price index — constituting an effective subsidy to rural and lower-income markets relative to higher-income areas.&lt;/p&gt;</description></item><item><title>The Economics of Equilibrium with Indivisible Goods</title><link>https://macropaperwarehouse.com/papers/the-economics-of-equilibrium-with-indivisible-goods/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-economics-of-equilibrium-with-indivisible-goods/</guid><description>&lt;p&gt;This paper develops an economic theory of competitive equilibrium with indivisible goods that accommodates both complementarities and substitutabilities. The central research question is: what conditions on demand are sufficient, and essentially necessary, for the existence of competitive equilibrium prices when goods are indivisible?&lt;/p&gt;
&lt;p&gt;The classical answer — gross substitutes (Kelso and Crawford, 1982) — entirely rules out complementarities. Complementarities matter in practice, yet prior work showed that equilibrium does not generally exist when all goods are complements (Bikhchandani and Mamer, 1997), while certain patterns of complementarities are compatible with equilibrium (Greenberg and Weber, 1986; Danilov, Koshevoy, and Lang, 2013). The economic content of which patterns permit equilibrium has remained opaque, previously accessible only through combinatorial or tropical geometry.&lt;/p&gt;
&lt;p&gt;Jagadeesan and Teytelboym&amp;rsquo;s key conceptual move is to analyze complementarity and substitutability between bundles of goods, rather than between individual goods. They introduce a bundle consistency condition: each pair of relevant bundles — defined via the compensated price effects of agents — must be either consistently substitutable or consistently complementary across all agents. A bundle is relevant if it arises as a price effect (revealing either direct complementarity or hidden complementarity between a good and an opportunity to sell another good) or consists of a single good. Bundle consistency is formulated as: for each bundling composed only of relevant bundles, each pair of bundles within it must be consistent.&lt;/p&gt;
&lt;p&gt;The paper establishes three core results. First (Theorem 1), for economies in which each agent demands at most one unit of each good, bundle consistency is sufficient for competitive equilibrium existence. Second (Theorem 2), bundle consistency is essentially necessary: if competitive equilibria exist for all economies in which agents have valuations in an invariant domain, then those valuations are bundle-consistent. &amp;ldquo;Invariant&amp;rdquo; requires closure under addition of nonneg linear functions and inclusion of the zero valuation — a condition satisfied by all major prior domains including gross substitutes, consecutive games, substitutes-and-complements, and all classes of discrete convexity. Third, for the multiunit demand setting (Theorems 3 and 4), unit consistency is additionally required: units of the same good must be substitutes for each other. This rules out increasing returns to scale at the unit level, analogous to the absence of increasing returns in standard divisible-good theory.&lt;/p&gt;
&lt;p&gt;The sufficiency proof works by showing that unit- and bundle-consistent preferences lie within a class of discrete convexity (Danilov, Koshevoy, and Murota, 2001), with bundle consistency shown equivalent (Proposition 3) to total unimodularity of the matrix of all agents&amp;rsquo; price effects in {-1, 0, 1}^I. Equilibrium existence then follows from existing results for discrete convex economies.&lt;/p&gt;
&lt;p&gt;A testable characterization is provided: preferences are bundle-consistent if and only if the set of all agents&amp;rsquo; price effects in {-1, 0, 1}^I is totally unimodular (Proposition 3, under unit consistency). This gives a finite, computable test.&lt;/p&gt;
&lt;p&gt;The scope conditions are explicit: the full theorem applies to agents with continuous utility functions strictly increasing in money; income effects are permitted. The necessity results apply to invariant domains. The multiunit extension requires the additional unit consistency condition. The paper does not impose quasilinearity for the main theorems, though geometric appendices restrict to the quasilinear case for the connection to tropical geometry.&lt;/p&gt;
&lt;p&gt;The results unify all previously known sufficient conditions for equilibrium existence with indivisible goods — substitutes, consecutive games, substitutes-and-complements, and the geometric domains — as special cases of bundle consistency. Crucially, Example 3 (four goods, six agent types with additive and pairwise-complement valuations) demonstrates a case where equilibrium exists under bundle consistency even though no bundling makes all agents view bundles as substitutes, so the result cannot be derived from Kelso-Crawford by rebundling.&lt;/p&gt;
&lt;p&gt;Q: What is the fundamental obstruction to equilibrium existence with indivisible goods, according to this paper?&lt;/p&gt;
&lt;p&gt;A: The only essential obstruction is an inconsistency between substitutability and complementarity across a pair of relevant bundles — that is, one agent seeing two bundles as substitutes while another sees them as complements. With only two goods (or only two units), consistency between goods themselves suffices. With more goods, apparent consistency at the good level can mask bundle-level inconsistency, as shown in Example 1 (three goods, each pair complements, yet no equilibrium exists). Bundle consistency — requiring pairwise consistency for all relevant bundlings — captures the full obstruction.&lt;/p&gt;
&lt;p&gt;Q: What makes a bundle &amp;ldquo;relevant&amp;rdquo; for the purpose of bundle consistency?&lt;/p&gt;
&lt;p&gt;A: A bundle b in {-1, 0, 1}^I is relevant if it either arises as a compensated price effect for some agent (revealing which goods move together following a price decrease, including negative entries that reveal hidden complementarities between a good and the opportunity to sell another) or consists of a single good e_i. Bundles with negative components (sale opportunities) are included because sale opportunities can themselves be complementary to goods — the &amp;ldquo;hidden complementarity&amp;rdquo; concept from Ostrovsky (2008) and Hatfield et al. (2013, 2019).&lt;/p&gt;
&lt;p&gt;Q: Why does the three-cycle-of-complements example (Example 1) fail to have an equilibrium, and how does bundle consistency detect this?&lt;/p&gt;
&lt;p&gt;A: Three agents hold V^1 = 3 min{x_a, x_b}, V^2 = 3 min{x_b, x_c}, V^3 = 3 min{x_a, x_c}, with one unit of each good available. Every pair of goods is complementary for some agent, so no inconsistency appears at the goods level. However, under the bundling B = {(1,0,0), (1,1,0), (0,0,1)} (apples-and-bananas bundled, coconuts separate), a fall in the coconut price induces agent 2 to buy the apple-banana bundle and sell apple, making apple and coconut substitutes for agent 2 while they remain complements for agent 3 — a bundle inconsistency. Bundle consistency detects this whereas good-level consistency does not.&lt;/p&gt;
&lt;p&gt;Q: What distinguishes the consecutive-games pattern (Example 2) from the three-cycle pattern (Example 1), and why does equilibrium exist in the former?&lt;/p&gt;
&lt;p&gt;A: In Example 2, agent 3&amp;rsquo;s valuation is replaced by V^3 = 3 min{x_a, x_b, x_c}: coconuts are complementary to apples only in conjunction with bananas, not directly. Under the same bundling B, a fall in the coconut price again makes apple and coconut substitutes for agents 2 and 3, but now this substitutability is consistent — neither agent sees apple and coconut as direct complements independently of bananas. Bundle consistency holds, and Greenberg and Weber (1986) confirm equilibrium existence for all endowments. The difference between the two examples hinges entirely on whether coconuts are directly complementary to apples or only complementary to apples in combination with bananas.&lt;/p&gt;
&lt;p&gt;Q: How does bundle consistency relate to the prior geometric approaches (discrete convexity, tropical geometry)?&lt;/p&gt;
&lt;p&gt;A: Proposition 4 establishes that a family of utility functions belongs to a single class of discrete convexity (Danilov, Koshevoy, and Murota, 2001) if and only if the family is unit- and bundle-consistent. Proposition 3 establishes that (under unit consistency) preferences are bundle-consistent if and only if the set of all agents&amp;rsquo; price effects in {-1, 0, 1}^I is totally unimodular — the same mathematical condition underlying Baldwin and Klemperer&amp;rsquo;s (2019) totally unimodular demand types. The paper thus provides economic interpretations for the entire class of geometric domains, not just substitutes or specific named cases.&lt;/p&gt;
&lt;p&gt;Q: What does unit consistency require, and why is it needed in the multiunit setting?&lt;/p&gt;
&lt;p&gt;A: Unit consistency requires that for any good i and any two serial-number indices m &amp;lt; m&amp;rsquo;, the m-th and m&amp;rsquo;-th units of good i are substitutes for each other (Definition 6). This rules out increasing returns to scale in units of the same good: with one indivisible good, increasing returns arise if and only if units of that good are complements. Since units of the same good are mechanically substitutes in the divisible-good limit, complementarity between units creates an inconsistency between substitutability and complementarity at the unit level. Unit consistency is automatically satisfied when each agent demands at most one unit of each good.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;essentially necessary&amp;rdquo; sense of the necessity results (Theorems 2 and 4)?&lt;/p&gt;
&lt;p&gt;A: The results require that the domain be &amp;ldquo;invariant&amp;rdquo; — closed under addition of nonneg linear price functions and containing the zero valuation. This is satisfied by all major prior domains: substitutes, consecutive games, substitutes-and-complements, sign-consistent tree valuations, all classes of discrete convexity, and all totally unimodular demand types. For any such domain in which competitive equilibria are guaranteed to exist for all economies, the domain&amp;rsquo;s valuations must be bundle-consistent (Theorem 2) or unit- and bundle-consistent (Theorem 4). This is stronger than previous necessity results because it covers any invariant domain, not just specific named ones.&lt;/p&gt;
&lt;p&gt;Q: How can bundle consistency be tested computationally?&lt;/p&gt;
&lt;p&gt;A: Under unit consistency, Proposition 3 gives a finite test: collect all agents&amp;rsquo; compensated price effects that lie in {-1, 0, 1}^I and form a matrix with these vectors as columns. Preferences are bundle-consistent if and only if this matrix is totally unimodular. Total unimodularity of an integer matrix can be verified in polynomial time using standard results from combinatorial optimization (Schrijver, 1998). Example 3 demonstrates this explicitly: for four goods and six agent types (additive plus four pairwise-complement pairs plus one all-complement agent), the 4x9 price-effect matrix is verified to be totally unimodular, confirming bundle consistency and equilibrium existence.&lt;/p&gt;
&lt;p&gt;Q: Does bundle consistency imply that some rebundling of goods makes all agents treat bundles as substitutes?&lt;/p&gt;
&lt;p&gt;A: No — this is a key finding. Example 3 shows a case where bundle consistency holds and equilibrium exists, yet Danilov, Koshevoy, and Lang (2013) confirm that no bundling exists for which all agents view the bundles as substitutes. Thus, the paper&amp;rsquo;s equilibrium existence result is strictly stronger than what could be obtained by applying Kelso and Crawford (1982) after rebundling goods. Bundle consistency is a weaker condition than the existence of a substitute-making rebundling.&lt;/p&gt;
&lt;p&gt;Q: What are the implications of the results for auction design?&lt;/p&gt;
&lt;p&gt;A: The paper suggests that bidding languages for sealed-bid multi-item auctions can be extended beyond the quasilinear-substitutes case (where Milgrom&amp;rsquo;s (2009) assignment messages apply) by using the economic concepts of bundling and consumer theory. Since bundle consistency characterizes when market-clearing prices exist even with complementarities and income effects, auction formats that guarantee equilibrium existence could in principle be designed for the full bundle-consistent domain, accommodating richer preference structures including complementarities and income effects.&lt;/p&gt;
&lt;p&gt;Q: How do &amp;ldquo;hidden complementarities&amp;rdquo; enter the analysis and why must bundles with negative components be considered?&lt;/p&gt;
&lt;p&gt;A: When a good&amp;rsquo;s price falls and demand for another good decreases, this reveals a hidden complementarity between the first good and the opportunity to sell the second. Ostrovsky (2008) and Hatfield et al. (2013, 2019) identified this structure in trading networks. Ignoring these hidden complementarities would miss obstructions to equilibrium existence: Online Appendix E provides an example where the full set of obstructions is only revealed by including bundles with negative components (sale opportunities) among the relevant bundles. This is why relevant bundles are defined to include price effects with negative entries, and bundles in a bundling are allowed to have negative components.&lt;/p&gt;
&lt;p&gt;Bundle consistency: The condition that for each bundling composed solely of relevant bundles, each pair of bundles within it is either consistently substitutable or consistently complementary across all agents — meaning no two agents disagree on whether the bundles are substitutes or complements. This is the paper&amp;rsquo;s central sufficient and essentially necessary condition for equilibrium existence.&lt;/p&gt;
&lt;p&gt;Relevant bundle: A bundle b in {-1, 0, 1}^I that is either a compensated price effect for some agent (a vector describing how demand changes following a price decrease, including negative entries for goods whose demand falls) or the unit vector e_i for a single good i. Only relevant bundles determine the obstructions to equilibrium existence.&lt;/p&gt;
&lt;p&gt;Compensated price effect: A nonzero vector delta_x for which there exist a utility level u, a price vector p, and a lower price p&amp;rsquo;_i at which demand shifts from x to x + delta_x, with unique demand at both prices. Price effects identify which pairs of goods are strict complements (same-sign entries) and which involve hidden complementarities (opposite-sign entries).&lt;/p&gt;
&lt;p&gt;Hidden complementarity: A complementarity between a good and the opportunity to sell another good, revealed when a price effect has a negative entry — meaning demand for some good decreases following the price decrease of another. The concept unifies settings with substitutes and with complements by treating sale opportunities as analogous to goods.&lt;/p&gt;
&lt;p&gt;Unit consistency: The condition that for any good i and any two units m &amp;lt; m&amp;rsquo; of that good, the m-th and m&amp;rsquo;-th units are substitutes. This rules out increasing returns to scale at the unit level and is needed for equilibrium existence in the multiunit demand setting; it is automatically satisfied in the single-unit case.&lt;/p&gt;
&lt;p&gt;Total unimodularity (of price effects): The property, for the matrix formed by stacking all agents&amp;rsquo; price effects in {-1, 0, 1}^I as columns, that every square submatrix has determinant in {-1, 0, 1}. Proposition 3 establishes this is equivalent to bundle consistency under unit consistency, providing a computable test and linking the economic conditions to the geometric literature.&lt;/p&gt;
&lt;p&gt;Invariant domain: A domain V of valuations closed under addition of nonneg linear price functions (V(x) + p*x remains in V for all p &amp;gt;= 0) and containing the zero valuation. Invariance is the scope condition under which the necessity theorems apply; it is satisfied by all major prior equilibrium existence domains.&lt;/p&gt;</description></item><item><title>The Effect of Education Policy on Crime: An Intergenerational Perspective</title><link>https://macropaperwarehouse.com/papers/the-effect-of-education-policy-on-crime-an-intergenerational-perspective/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effect-of-education-policy-on-crime-an-intergenerational-perspective/</guid><description>&lt;p&gt;This paper studies the intergenerational effects of education policy on crime, asking whether a compulsory schooling reform that reduced crime among those directly exposed also reduced crime among their children. The authors exploit the staggered municipal rollout of Sweden&amp;rsquo;s comprehensive school reform, implemented gradually between 1949 and 1962 across more than 1,000 municipalities, which increased compulsory schooling by one to two years, abolished tracking into academic and vocational streams after 6th grade, and introduced a uniform national curriculum. The parent generation consists of all individuals born in Sweden between 1945 and 1955 (approximately 447,000 men and 450,000 women), and their children form the child generation (426,721 sons observed from age 15 to 29). Crime is measured by administrative conviction records from the Swedish National Council for Crime Prevention covering 1973–2010.&lt;/p&gt;
&lt;p&gt;The empirical strategy is difference-in-differences, comparing changes in conviction rates across cohorts in municipalities that implemented the reform at different times, with treatment assigned based on the parent&amp;rsquo;s birth municipality to avoid endogenous sorting bias. Standard errors are clustered at the municipality level. Parallel trends validity is supported by three tests: results are unchanged when municipality-specific linear trends are included, placebo tests using incorrect reform dates yield effects indistinguishable from zero, and residuals from crime regressions show no correlation with municipality-specific trends.&lt;/p&gt;
&lt;p&gt;The main finding is a significant 0.79 percentage point (pp) decline in conviction rates among sons of fathers exposed to the reform (p-value &amp;lt; 0.002), representing a 3.4 percent reduction relative to baseline. The decline spans multiple crime types: violent crime fell by 0.27 pp, traffic-related crime by 0.45 pp, fraud by 0.22 pp, and other offenses by 0.41 pp — percentage reductions of three to six percent across categories. Multiple convictions fell by 0.43 pp (5.8 percent). These second-generation effects are driven entirely by paternal exposure: the impact of maternal reform exposure is an order of magnitude smaller and statistically insignificant, and the difference between paternal and maternal effects is itself significant (p-value 0.048 for any conviction, 0.009 for multiple convictions). Effects on daughters in the child generation are much smaller, with only the residual &amp;ldquo;other crime&amp;rdquo; category showing a significant 0.129 pp (15.5 percent) decline.&lt;/p&gt;
&lt;p&gt;The asymmetry between paternal and maternal transmission is explained by the first-generation effects of the reform. For men, the reform increased schooling by 0.32 years, earnings by approximately 1 percent, the probability of white-collar employment by 1.2 percent, cognitive skills by 0.14 standard deviations, noncognitive skills by 0.17 standard deviations, spousal earnings by 1,022 SEK per year, and overall household income by approximately 1 percent. For women, the reform increased education by 0.21 years but did not raise earnings, household income, or white-collar employment, and did not reduce their already low crime rates. Only 13 percent of women in the 1945–55 cohorts were at or below the compulsory schooling threshold, versus 20 percent of men, substantially limiting the reform&amp;rsquo;s bite for women.&lt;/p&gt;
&lt;p&gt;A mediation analysis decomposes the intergenerational transmission through three channels: fathers&amp;rsquo; education accounts for 64.8 percent of the indirect effect, the decline in paternal crime accounts for 18.5 percent, and the increase in household disposable income accounts for 16.7 percent. The direct effect (unexplained by these mediators) accounts for 48 percent of the total effect. The paper also documents that children of treated fathers attended schools with lower peer crime rates and lived in neighborhoods with lower youth crime rates, supporting a neighborhood and peer effects channel alongside human capital and role-model channels.&lt;/p&gt;
&lt;p&gt;Scope conditions: the study covers male children observed to age 29 in Sweden; results apply to a context of near-universal administrative records, a specific postwar schooling reform, and cohorts born 1945–1955 in a Nordic welfare state.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the intergenerational crime reduction caused by the reform?&lt;/p&gt;
&lt;p&gt;A: Sons of fathers exposed to the reform experienced a 0.79 pp decline in conviction rates (p-value &amp;lt; 0.002), corresponding to a 3.4 percent reduction relative to the baseline conviction rate of approximately 24 percent for the child generation by age 29. Multiple convictions fell by 0.43 pp, a 5.8 percent reduction. These magnitudes are similar in percentage terms to the direct crime reduction the reform caused among fathers themselves.&lt;/p&gt;
&lt;p&gt;Q: Does the reform&amp;rsquo;s intergenerational effect on crime differ by the sex of the treated parent?&lt;/p&gt;
&lt;p&gt;A: Yes. The intergenerational effect is driven entirely by paternal exposure to the reform: the effect of maternal exposure is an order of magnitude smaller and insignificant at any conventional significance level. The difference between paternal and maternal effects is statistically significant, with p-values of 0.048 for any conviction and 0.009 for multiple convictions. The paper attributes this asymmetry to the much weaker first-generation effects of the reform on women&amp;rsquo;s earnings, household income, crime rates, and neighborhood sorting.&lt;/p&gt;
&lt;p&gt;Q: Which crime types declined significantly among sons of treated fathers?&lt;/p&gt;
&lt;p&gt;A: Significant declines were found in violent crime (−0.27 pp, Romano-Wolf p-value 0.09), traffic-related crime (−0.45 pp, RW p-value 0.057), fraud (−0.22 pp, RW p-value 0.09), and other offenses (−0.41 pp, RW p-value 0.047), each representing a three-to-six percent reduction relative to the mean incidence of that crime type. Property crime and drug-related crime did not show significant declines.&lt;/p&gt;
&lt;p&gt;Q: What were the direct effects of the reform on the parent generation&amp;rsquo;s human capital?&lt;/p&gt;
&lt;p&gt;A: For men, the reform increased schooling by 0.32 years, earnings by approximately 1 percent, the probability of white-collar employment by 1.2 percent, cognitive skills by 0.14 standard deviations, and noncognitive skills by 0.17 standard deviations, all measured at military enlistment. Spousal earnings increased by 1,022 SEK per year and overall household income rose by approximately 1 percent. For women, education increased by 0.21 years and marriage market matches improved, but earnings, household income, and white-collar employment probability did not increase significantly.&lt;/p&gt;
&lt;p&gt;Q: Why did the reform have stronger first-generation effects on men than on women?&lt;/p&gt;
&lt;p&gt;A: The average share of individuals at or below the compulsory schooling threshold — the margin at which the reform was binding — was 20 percent for men but only 13 percent for women in the 1945–55 cohorts. Because fewer women were constrained by the old compulsory schooling limit, the reform increased their education by less and produced smaller downstream effects on earnings and labor market outcomes.&lt;/p&gt;
&lt;p&gt;Q: What are the three channels through which the reform reduces child crime, and what is the relative contribution of each?&lt;/p&gt;
&lt;p&gt;A: The paper identifies three channels: (1) the human capital channel, whereby increased parental education raises household income and child human capital; (2) the role model channel, whereby reduced paternal crime participation directly reduces son&amp;rsquo;s crime; and (3) the neighborhood and peer effects channel, whereby higher income enables sorting into lower-crime neighborhoods and better schools. The mediation analysis attributes 64.8 percent of the indirect effect to fathers&amp;rsquo; increased education, 18.5 percent to the decline in paternal crime, and 16.7 percent to the increase in household disposable income. The direct effect unexplained by these three mediators accounts for 48 percent of the total effect.&lt;/p&gt;
&lt;p&gt;Q: What is the role model effect, and how strong is it in the parent generation?&lt;/p&gt;
&lt;p&gt;A: The role model channel operates through the strong intergenerational persistence in crime participation: sons are 2.06 times more likely to participate in crime if their fathers have been convicted (Hjalmarsson and Lindquist, 2012). The reform reduced the incidence of any conviction among treated men by 1.5 pp and repeat convictions by 1.5 pp — the latter representing an approximately 8 percent decline from a lower base. For women, the reform produced no reduction in crime, providing no analogous role model improvement through the maternal channel.&lt;/p&gt;
&lt;p&gt;Q: How does neighborhood and school peer quality change for children of treated fathers versus treated mothers?&lt;/p&gt;
&lt;p&gt;A: Sons of fathers exposed to the reform moved to neighborhoods with lower youth crime rates (−0.087 pp) and attended schools with lower peer crime rates (−0.077 pp). In contrast, sons of mothers exposed to the reform experienced higher neighborhood crime rates (p-value 0.06) and higher school peer crime rates (p-value 0.01), the opposite direction. This asymmetry helps explain why only paternal treatment generates significant second-generation crime reductions.&lt;/p&gt;
&lt;p&gt;Q: What happens to other outcomes for children of treated fathers beyond crime?&lt;/p&gt;
&lt;p&gt;A: Sons experienced a 1.2 percentile increase in school GPA (RW p-value 0.05), a 2.3 pp increase in employment (RW p-value 0.04), a matching 2.3 pp decline in unemployment benefit receipt, a reduction in hospitalization of 2.4 days (17 percent, RW p-value 0.02), and a decline in prescribed drugs of 31 doses (2.8 percent, RW p-value 0.09). The decline in prescribed drugs for sons is driven by nervous system drugs and painkillers, pointing to improved mental health. Daughters of treated fathers show a significant reduction in welfare dependency but no other significant improvements.&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate the parallel trends assumption?&lt;/p&gt;
&lt;p&gt;A: Three tests are reported. First, including municipality-specific linear trends leaves the main coefficient unchanged (p-value 0.85 for the trend terms themselves). Second, placebo contrasts using incorrect reform implementation dates produce effects indistinguishable from zero for all tested dates. Third, graphical inspection of regression residuals shows no correlation with municipality-specific trends. Together these provide strong support for the identifying assumption.&lt;/p&gt;
&lt;p&gt;Q: Are the results sensitive to using a linear probability model instead of a nonlinear model?&lt;/p&gt;
&lt;p&gt;A: A Monte Carlo experiment was conducted replicating observed crime rates across municipalities and imposing the estimated average treatment effect. Assuming the true data-generating process is a probit model, the linear probability model biases the estimated average effect upward by only 5 percent — a difference that is statistically indistinguishable from zero in the actual data — validating the OLS approach.&lt;/p&gt;
&lt;p&gt;Q: What is the broader policy implication of the findings?&lt;/p&gt;
&lt;p&gt;A: The results show that well-designed education policies can reduce crime not only among the directly treated generation but also among their children, amplifying the social benefits of reform across generations. The authors interpret this as consistent with the theoretical framework of Becker and Tomes (1979) on intergenerational transmission of human capital, and suggest that education policy evaluations that focus only on the treated generation substantially understate total social returns.&lt;/p&gt;
&lt;p&gt;Intergenerational transmission of education reform effects: the phenomenon whereby an education policy that raises parental human capital produces improvements in children&amp;rsquo;s outcomes — including crime — through multiple channels including resource increases, parental role modeling, and neighborhood sorting, beyond any direct policy exposure of the child generation.&lt;/p&gt;
&lt;p&gt;Comprehensive school reform (Sweden, 1949–1962): a nationally mandated restructuring of compulsory schooling that extended required attendance by one to two years, abolished selection into academic and vocational tracks after 6th grade, and introduced a uniform national curriculum, rolled out staggered across 1,055 Swedish municipalities.&lt;/p&gt;
&lt;p&gt;Human capital channel: the mechanism by which increased parental education raises earnings and household income, enabling greater investments in children&amp;rsquo;s development and exploiting complementarity between parental and child human capital in the skill production function, thereby raising children&amp;rsquo;s opportunity cost of crime.&lt;/p&gt;
&lt;p&gt;Role model channel: the mechanism by which reduced parental crime participation directly reduces children&amp;rsquo;s crime, operating through the transmission of norms and information across generations; identified empirically by the strong intergenerational correlation in convictions (sons with convicted fathers are 2.06 times more likely to be convicted themselves).&lt;/p&gt;
&lt;p&gt;Neighborhood and peer effects channel: the mechanism by which increased parental income from the reform enables sorting into residential neighborhoods and schools with lower youth crime rates, exposing children to peers less involved in illegal activities and thereby reducing their own crime participation.&lt;/p&gt;
&lt;p&gt;Mediation analysis: a decomposition method following Heckman, Pinto, and Savelyev (2013) that quantifies the share of a total treatment effect accounted for by specific intermediate variables (here: fathers&amp;rsquo; education, fathers&amp;rsquo; crime participation, and household disposable income) versus the direct unexplained effect.&lt;/p&gt;
&lt;p&gt;Conviction rate: the proportion of individuals in a given generation and observation window who received at least one criminal conviction in Swedish administrative records; used as the primary outcome measure because it captures offenses that led to a court appearance, excluding minor infractions resolved by direct fine.&lt;/p&gt;</description></item><item><title>The Effect of High-Tech Clusters on the Productivity of Top Inventors: Comment</title><link>https://macropaperwarehouse.com/papers/the-effect-of-high-tech-clusters-on-the-productivity-of-top-inventors-comment/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effect-of-high-tech-clusters-on-the-productivity-of-top-inventors-comment/</guid><description>&lt;p&gt;This paper is a comment on Moretti (2021b), which studied agglomeration effects for innovation by testing whether the size of technology clusters causes patenting. The original paper (M21) used US patent data from 1971 to 2007 (Zucker and Darby, 2014) and reported a baseline elasticity of patenting with respect to cluster size of 0.0676, along with event study and instrumental variables (IV) evidence supporting a causal interpretation.&lt;/p&gt;
&lt;p&gt;Wiebe identifies two major methodological problems that undermine M21&amp;rsquo;s causal claims.&lt;/p&gt;
&lt;p&gt;Problem 1 — Misspecified event study. M21&amp;rsquo;s event study (Figure 6) was designed to test for selection bias from &amp;ldquo;rising star&amp;rdquo; inventors sorting into large clusters. The event is inventors moving across cities exactly once. However, M21&amp;rsquo;s specification interacts pre-move average cluster size with pre-move event-time indicators and post-move average cluster size with post-move event-time indicators separately — it does not exploit the change in cluster size generated by the move itself. Following the standard &amp;ldquo;mover&amp;rdquo; design literature (Finkelstein et al., 2016; Molitor, 2018; Cantoni and Pons, 2022), the correct specification uses the change in average cluster size as the treatment variable, interacted with event-time indicators. Wiebe implements this corrected event study and finds no statistically significant pre-trend and no statistically significant treatment effect post-move. Notably, the baseline elasticity estimated on the mover sample using all observed variation is large and significant at 0.3145 (SE 0.0953), but no effect is detected when variation is restricted to that generated by moving. The null result could also partly reflect attenuation bias from misclassified moves, since the dataset does not distinguish inventors who share the same name.&lt;/p&gt;
&lt;p&gt;Problem 2 — Coding error in IV. M21&amp;rsquo;s Table 5 instruments cluster size using variation in the number of inventors in other cities employed by firms also active in the focal inventor&amp;rsquo;s city, with the instrument calculated via first-differencing. Due to a coding error, M21 sorts data by firm, field, and year but not by city before first-differencing, so the differencing is taken across cities rather than within cities. Because firm-field-year is not a unique sorting key, Stata&amp;rsquo;s sort command pseudo-randomly orders observations with tied values, making the results unreproducible across runs. When Wiebe corrects the code to sort by city and compute first-differences within city, the 2SLS estimates become unstable and nonsignificant, with the first-stage F-statistic falling to approximately 7. This means M21 provides no valid IV evidence against confounding from city-field-year shocks such as local subsidies.&lt;/p&gt;
&lt;p&gt;Beyond these two major problems, the Appendix documents seven additional issues. The positive effect of cluster size on patent quality (M21 Table 6) disappears and reverses when the log transformation is corrected from log(y + 0.00001) to log(y + 1) or Poisson regression — the corrected estimate is negative and significant, implying that cluster size reduces citations per patent along the intensive margin and the overall quality effect is negative. Heterogeneous elasticity estimates (M21 Table 8) contain a coding error; corrected estimates show substantial heterogeneity. The distributed lag model (M21 Figure 5) uses an incorrectly defined lag structure in an unbalanced panel; corrected estimates yield nonsignificant contemporaneous effects. Cluster quality estimates (M21 Table A.8) use a cluster size definition differing from the text, and corrected elasticities are approximately half as large. M21&amp;rsquo;s claimed extensive margin effect in Table A.7 is logically unsupported since no zeros are observed. The team size robustness check is conceptually flawed because it controls twice for per-coauthor adjustment. A gap-interpolation coding error in Table A.6 biases estimates downward. Broader computational reproducibility failures arise from many-to-many merges with non-unique sort orders. Wiebe explicitly notes that the null IV and event study results are not evidence against agglomeration effects per se.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline finding in M21 that Wiebe contests?
A: M21 reports a baseline elasticity of patenting with respect to cluster size of 0.0676, estimated from linear regressions with extensive fixed effects including inventor fixed effects. M21 presents an event study and IV strategy as additional evidence supporting a causal interpretation of this elasticity.&lt;/p&gt;
&lt;p&gt;Q: What is wrong with M21&amp;rsquo;s event study specification?
A: M21&amp;rsquo;s event study interacts pre-move average cluster size with pre-move event-time indicators and post-move average cluster size with post-move event-time indicators, but never uses the change in cluster size associated with moving. The standard mover design (Finkelstein et al., 2016; Molitor, 2018) uses the change in average environment as a constant treatment variable interacted with all event-time indicators. Because M21&amp;rsquo;s specification does not exploit moving-induced variation, it would be identified even if moving induced no change in cluster size.&lt;/p&gt;
&lt;p&gt;Q: What does Wiebe&amp;rsquo;s corrected event study find?
A: Wiebe&amp;rsquo;s corrected mover event study shows no statistically significant pre-trend (consistent with no systematic sorting of rising-star inventors into large clusters) and no statistically significant post-move treatment effect. In contrast, the baseline fixed-effects elasticity on the mover sample using all observed variation is 0.3145 (SE 0.0953) — large and significant — indicating the null result is specific to the moving-generated variation.&lt;/p&gt;
&lt;p&gt;Q: What alternative explanation does Wiebe offer for the null event study result?
A: The null result could be partly explained by attenuation bias from misclassified moves. M21&amp;rsquo;s code creates inventor identifiers based on names, but the COMETS dataset does not distinguish inventors who share the same name, so an apparent cross-city move may simply be two different inventors with the same name living in different cities.&lt;/p&gt;
&lt;p&gt;Q: What is the coding error in M21&amp;rsquo;s IV strategy?
A: M21 constructs the instrument by first-differencing a variable measuring inventors in other cities working for firms also active in the focal city. The code sorts by firm, field, and year before differencing, but omits city from the sort key, so first-differencing is computed across cities rather than within cities, generating an instrument that does not match the definition in the text.&lt;/p&gt;
&lt;p&gt;Q: Why does the coding error also cause non-reproducibility?
A: Firm-field-year is not a unique sorting key because multiple cities can share the same firm-field-year values. Stata&amp;rsquo;s sort command pseudo-randomly orders observations with tied values, so each run produces a different city ordering within tied groups and therefore a different instrument and different estimates.&lt;/p&gt;
&lt;p&gt;Q: What do the corrected IV results show?
A: After correcting the sort order to include city and computing first-differences within city, the 2SLS estimates are unstable and nonsignificant. The first-stage F-statistic falls to approximately 7, indicating a weak instrument. This does not constitute evidence against agglomeration effects, but means M21&amp;rsquo;s IV strategy provides no valid evidence against confounding from city-field-level shocks such as local subsidies.&lt;/p&gt;
&lt;p&gt;Q: What happens to the patent quality results when the log transformation is corrected?
A: M21 uses log(citations + 0.00001), which assigns very large weight to the extensive margin. When Wiebe uses log(citations + 1) or Poisson regression instead, the estimated effect of cluster size on patent quality is negative and statistically significant, reversing M21&amp;rsquo;s finding. The corrected result implies that while cluster size may raise the probability of producing any cited patent, it reduces citations per patent for inventors who do produce cited patents, and the overall effect is negative.&lt;/p&gt;
&lt;p&gt;Q: What are the corrected aggregate agglomeration loss estimates?
A: Using the corrected constant elasticity, the estimated output reduction from equalizing cluster sizes is -9.15% (slightly smaller than M21). Using corrected heterogeneous elasticities based on within-field-year size quartiles, the output loss is -23.75% (about twice as large). Using elasticities based on global size quartiles, the loss is -35.11%.&lt;/p&gt;
&lt;p&gt;Q: What is wrong with M21&amp;rsquo;s distributed lag model (Figure 5)?
A: M21&amp;rsquo;s code defines lags and leads using sequential observations in the panel rather than calendar years. Because the inventor-year panel is unbalanced, a coded &amp;ldquo;one-year lag&amp;rdquo; can refer to any number of years prior. When Wiebe restricts to inventors with 11 consecutive years and correctly defines year-based lags, confidence intervals widen substantially and the contemporaneous effect estimate becomes nonsignificant.&lt;/p&gt;
&lt;p&gt;Q: What is the conceptual flaw in M21&amp;rsquo;s team-size robustness check?
A: M21&amp;rsquo;s Table A.8 controls for the number of coauthors on a patent, but the dependent variable is already measured as patents per coauthor. Controlling for team size after already dividing by team size effectively controls for the same variable twice.&lt;/p&gt;
&lt;p&gt;Q: What are the broader computational reproducibility problems in M21?
A: The cleaning code uses many-to-many merges with non-unique sort orders, generating slightly different datasets on each run. For example, when merging inventors with patent assignees, patent identifiers are not unique because multiple firms can be assigned to a single patent. Removing name suffixes also causes distinct inventors (e.g., Paul H. Hamisch Jr. and Sr.) to be assigned the same identifier. Additionally, using reghdfe with the keepsingletons option retains singleton groups explicitly warned against by the package due to biased standard errors.&lt;/p&gt;
&lt;p&gt;Agglomeration elasticity: The elasticity of an inventor&amp;rsquo;s patent output with respect to the size of the technology cluster (city-field-year cell) in which they work; reported as 0.0676 in M21&amp;rsquo;s baseline and 0.3145 on the mover sample with all observed variation.&lt;/p&gt;
&lt;p&gt;Mover event study design: An event study specification in which the treatment variable is the change in an individual&amp;rsquo;s average environment (here, cluster size) before and after a geographic move, interacted with event-time indicators — the standard design used in Finkelstein et al. (2016) and Molitor (2018), which M21&amp;rsquo;s specification does not follow.&lt;/p&gt;
&lt;p&gt;Cluster size: The number of inventors (or cluster density) active in the same city-field-year cell as the focal inventor, used as the key independent variable in M21&amp;rsquo;s regressions.&lt;/p&gt;
&lt;p&gt;First-stage F-statistic: A measure of instrument strength in 2SLS IV estimation; the corrected instrument yields F ≈ 7 (indicating weakness), whereas M21&amp;rsquo;s incorrectly constructed instrument produced a stronger first stage by exploiting spurious cross-city variation.&lt;/p&gt;
&lt;p&gt;Extensive vs. intensive margin (patent quality): The extensive margin captures whether an inventor produces any cited patent; the intensive margin captures citations per patent conditional on having any. M21&amp;rsquo;s log(y + 0.00001) transformation overweights the extensive margin, and the corrected intensive-margin effect of cluster size on quality is negative and significant.&lt;/p&gt;
&lt;p&gt;Computational reproducibility: The property that running code on the same data produces identical results across runs. M21&amp;rsquo;s code fails this standard due to non-unique sort orders in merges and first-differencing steps, causing the IV instrument to differ across runs.&lt;/p&gt;
&lt;p&gt;Rising star sorting: The hypothesized selection mechanism whereby inventors with increasing patent trajectories are preferentially hired into large clusters, which would bias OLS agglomeration elasticity estimates upward; M21&amp;rsquo;s event study was designed to test for this but is incorrectly specified and does not use moving-induced variation.&lt;/p&gt;</description></item><item><title>The Effect of Omitted Variables on the Sign of Regression Coefficients</title><link>https://macropaperwarehouse.com/papers/the-effect-of-omitted-variables-on-the-sign-of-regression-coefficients/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effect-of-omitted-variables-on-the-sign-of-regression-coefficients/</guid><description>&lt;p&gt;Masten and Poirier demonstrate a previously unrecognized asymmetry in the coefficient stability literature: depending on how omitted variable bias is measured, it can be substantially easier for omitted variables to flip a regression coefficient&amp;rsquo;s sign than to drive it to zero. The paper focuses specifically on Oster (2019b), a widely used robustness framework with approximately 5,500 Google Scholar citations as of December 2025, and shows that Oster&amp;rsquo;s sensitivity parameter δ — commonly interpreted as the ratio of selection on unobservables to selection on observables — exhibits a structural problem when used to assess sign robustness.&lt;/p&gt;
&lt;p&gt;The core theoretical result (Theorem 2) is that, in Oster&amp;rsquo;s sensitivity analysis, the sign change breakdown point is bounded above by 1 for any value of R²_long. Since researchers typically treat |δ| = 1 as the cutoff for a robust result, this implies that no empirical result is robust to sign changes under Oster&amp;rsquo;s framework, even when the explain away breakdown point is far larger than 1. The mechanism is a vertical asymptote in the identified set for βlong that occurs precisely at δ = 1, arising from near multicollinearity between the treatment X and the covariates. At this asymptote, the bias-adjusted estimand becomes discontinuous: βlong can jump from a positive to a negative value as δ crosses 1, even when δ is changed by a negligible amount.&lt;/p&gt;
&lt;p&gt;The paper illustrates this with the bias-adjusted estimand formula. Under Oster&amp;rsquo;s Proposition 1 (which requires δ = 1 plus an auxiliary proportionality assumption), the point estimate for the social capital application is 0.532. But if δ = 1 without the auxiliary assumption, the identified set becomes {−0.0855, 1.8947}. For δ = 0.99, the identified set includes {−18.66, −0.0868, 1.736}. The baseline OLS estimate is 0.17, and the explain away breakdown point (correct) is −32.0, while the sign change breakdown point is only 0.586 — well below the conventional robustness threshold of 1.&lt;/p&gt;
&lt;p&gt;The authors propose a modified robustness measure that adds Assumption A5: an explicit bound M on the magnitude of omitted variable bias (|βlong − βmed| ≤ M). Under this restriction, the sign change breakdown point can exceed 1, making robust sign conclusions possible. The choice of M requires substantive justification by the researcher.&lt;/p&gt;
&lt;p&gt;Two meta-analyses covering 58 empirical papers document the practical extent of the problem. For papers published in top-five journals from 2019–2021 that cite Oster (2019), the median explain away breakdown point is 2.65, while the median sign change breakdown point (with M = 10|β̂med|) is 1.15 and without the M restriction is 0.96. At the 90th percentile, the explain away point is 13.22, while the sign change point (M = 10|β̂med|) is only 1.66. Across both meta-analytic samples, more than 50% of regressions require that the sign of βlong must be assumed a priori in order to interpret the explain away breakdown point as evidence of sign robustness.&lt;/p&gt;
&lt;p&gt;Scope conditions: The results apply specifically to Oster&amp;rsquo;s linear regression coefficient stability framework under the assumption of exogenous controls (cov(W1, W2) = 0, Assumption A4). The authors note this exogeneity assumption is strong in many applications. The paper does not claim the results extend to other sensitivity analysis frameworks (e.g., Cinelli and Hazlett 2020). The methods are implemented in the companion Stata module regsensitivity.&lt;/p&gt;
&lt;p&gt;Q: What is the central finding of the paper?&lt;/p&gt;
&lt;p&gt;A: The sign change breakdown point for Oster&amp;rsquo;s δ is bounded above by 1 (Theorem 2), regardless of how large the explain away breakdown point is. Since |δ| = 1 is the conventional robustness threshold, this implies that, under Oster&amp;rsquo;s framework, no result is ever robust to a sign change. The explain away breakdown point can simultaneously be very large — e.g., −32.0 in the social capital application — while the sign change breakdown point is only 0.586.&lt;/p&gt;
&lt;p&gt;Q: What are the two kinds of breakdown points the paper distinguishes?&lt;/p&gt;
&lt;p&gt;A: The explain away breakdown point answers: what is the smallest |δ| required for the data to be consistent with a zero causal effect? The sign change breakdown point answers: what is the smallest |δ| required for the data to be consistent with a causal effect of opposite sign? These two quantities are often equal but are not generally equivalent, and the sign change breakdown point can be strictly smaller than the explain away breakdown point.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism behind the sign change breakdown point being bounded above by 1?&lt;/p&gt;
&lt;p&gt;A: The identified set for βlong has a vertical asymptote precisely at δ = 1, arising because the sensitivity analysis allows treatment X and the covariates (W1, W2) to approach near multicollinearity. Near this asymptote, omitted variable bias can be arbitrarily large while δ remains close to 1. This discontinuity allows the bias-adjusted estimand to jump across zero — changing sign — even as δ is changed by an infinitesimal amount near 1.&lt;/p&gt;
&lt;p&gt;Q: How sensitive is Oster&amp;rsquo;s bias-adjusted point estimator near δ = 1?&lt;/p&gt;
&lt;p&gt;A: Extremely sensitive. In the social capital application, Oster&amp;rsquo;s Proposition 1 formula (which assumes δ = 1 with the auxiliary proportionality condition) yields an estimate of 0.532. But without the auxiliary assumption, at δ = 1 the identified set is {−0.0855, 1.8947}; at δ = 0.99 it includes {−18.66, −0.0868, 1.736}; at δ = 1.01 it includes {−0.0843, 2.133, 15.64}. These are not minor perturbations — the estimand is discontinuous in δ at the value that Oster&amp;rsquo;s formula evaluates it.&lt;/p&gt;
&lt;p&gt;Q: What modification do the authors propose to recover sign robustness?&lt;/p&gt;
&lt;p&gt;A: They propose adding Assumption A5, which bounds the magnitude of omitted variable bias: |βlong − βmed| ≤ M for a researcher-specified M ≥ 0. Under this restriction, the identified set BI(δ, R²_long, M) is intersected with [βmed − M, βmed + M], and it becomes possible for the sign change breakdown point to exceed 1. The practical difficulty is that M must be chosen with substantive justification, and the authors show via meta-analysis that the conventional choice M = |βmed| (equivalent to assuming the sign of βlong is already known) applies to more than 50% of regressions in their sample.&lt;/p&gt;
&lt;p&gt;Q: What do the meta-analyses show about the gap between explain away and sign change breakdown points in practice?&lt;/p&gt;
&lt;p&gt;A: For 34 primary regressions from top-five journal papers (2019–2021) with R²_long = 1, the median explain away breakdown point is 2.65 while the median sign change breakdown point (M = 10|β̂med|) is 1.15 and without the M restriction is 0.96. At the 90th percentile, the explain away point is 13.22 versus a sign change point (M = 10|β̂med|) of only 1.66. The second meta-analysis (141 regressions from 55 papers, 2008–2013) produces qualitatively similar results.&lt;/p&gt;
&lt;p&gt;Q: Why does the paper flag the implicit sign assumption embedded in many applications of Oster&amp;rsquo;s method?&lt;/p&gt;
&lt;p&gt;A: Using the explain away breakdown point as evidence of sign robustness implicitly requires that M = |βmed|, which is equivalent to constraining βlong ∈ [0, 2βmed] — that is, assuming the sign of βlong is the same as the sign of βmed. The paper shows (Table 4) that across both meta-analytic samples, more than 50% of regressions make this implicit sign assumption in order to interpret the explain away breakdown point as informative about sign robustness.&lt;/p&gt;
&lt;p&gt;Q: What is δ, and what are its interpretive limitations?&lt;/p&gt;
&lt;p&gt;A: δ is the ratio of (cov(X, γ′2,long W2)/var(γ′2,long W2)) to (cov(X, γ′1,long W1)/var(γ′1,long W1)), measuring the relative magnitude of selection on unobservables versus observables. As Cinelli and Hazlett (2020) show, it is a double ratio: the ratio of the treatment-unobservable association to the treatment-observable association, divided by the ratio of their outcome effects. This double-ratio structure leads to counter-intuitive behavior: a single omitted variable that is only modestly related to treatment can produce δ values far from 1 if the observable control is also only weakly related to treatment, even if the omitted variable is not strongly confounding in an absolute sense.&lt;/p&gt;
&lt;p&gt;Q: What assumption is required for the entire sensitivity analysis framework, and how restrictive is it?&lt;/p&gt;
&lt;p&gt;A: Assumption A4 requires that all observed covariates W1 are uncorrelated with all unobserved covariates W2 (exogenous controls). The authors note this is a strong assumption in many empirical settings. A companion paper (Diegert, Masten, and Poirier 2025a) addresses the case where controls are endogenous.&lt;/p&gt;
&lt;p&gt;Q: What do the authors recommend as best practice?&lt;/p&gt;
&lt;p&gt;A: They recommend two practices: (1) plotting the full estimated identified set for the coefficient of interest across a range of assumptions about omitted variables, rather than relying on a single bias-adjusted point estimate; and (2) reporting sign change breakdown points as robustness summary statistics in addition to (or instead of) explain away breakdown points. Both are implemented in the companion Stata module regsensitivity.&lt;/p&gt;
&lt;p&gt;Explain Away Breakdown Point: The smallest value of the sensitivity parameter |δ| required for the data to be consistent with a zero causal effect (βlong = 0). This is the quantity computed by Oster&amp;rsquo;s Proposition 2 and commonly reported as &amp;ldquo;Oster&amp;rsquo;s delta.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Sign Change Breakdown Point: The smallest value of |δ| required for the data to be consistent with a causal effect of opposite sign from the baseline estimate. The paper proves this is bounded above by 1 in Oster&amp;rsquo;s framework, regardless of the magnitude of the explain away breakdown point.&lt;/p&gt;
&lt;p&gt;Oster&amp;rsquo;s δ: The ratio of the regression of treatment X on the omitted variable index (γ′2,long W2) to the regression of X on the observed covariate index (γ′1,long W1), measuring relative selection on unobservables versus observables. Interpreted as a double ratio: (treatment-unobservable association / treatment-observable association) ÷ (outcome effect of unobservable index / outcome effect of observable index).&lt;/p&gt;
&lt;p&gt;Identified Set BI(δ, R²_long): The set of values of βlong consistent with the observed data and a given value of δ and R²_long. Characterized as roots of a cubic polynomial. Has a vertical asymptote at δ = 1, meaning the set can include arbitrarily large or small values of βlong as δ approaches 1.&lt;/p&gt;
&lt;p&gt;Bias Magnitude Restriction (Assumption A5): A bound M ≥ 0 on the magnitude of omitted variable bias: |βlong − βmed| ≤ M. Adding this assumption intersects the identified set with [βmed − M, βmed + M], allowing the sign change breakdown point to potentially exceed 1 and making sign robustness conclusions possible.&lt;/p&gt;
&lt;p&gt;Coefficient Stability Analysis: A class of empirical methods that assess omitted variable bias by comparing regression coefficients across specifications that include different sets of covariates. The intuition is that if adding observed controls substantially raises R² but barely moves the coefficient, further omitted variable bias is likely small. Formalized by Altonji, Elder, and Taber (2005) and extended by Oster (2019b).&lt;/p&gt;
&lt;p&gt;Near Multicollinearity (in this context): The situation in which treatment X and the combined covariate vector (W1, W2) are nearly collinear. In Oster&amp;rsquo;s framework, this arises precisely at δ = 1 and produces the vertical asymptote in the identified set, making the bias-adjusted estimand discontinuous and potentially unbounded near this value.&lt;/p&gt;</description></item><item><title>The Effect of Provider Diversity on Racial Health Disparities: Evidence from the Military</title><link>https://macropaperwarehouse.com/papers/the-effect-of-provider-diversity-on-racial-health-disparities-evidence-from-the-military/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effect-of-provider-diversity-on-racial-health-disparities-evidence-from-the-military/</guid><description>&lt;p&gt;This paper asks whether racial concordance between patients and medical providers — specifically, whether Black patients are treated by Black physicians — improves use of preventive care and reduces mortality among patients with chronic, manageable diseases. The authors argue that trust and communication deficits along racial lines cause Black patients to underuse low-cost, life-saving preventive care, and that increasing the share of Black providers addresses this deficit.&lt;/p&gt;
&lt;p&gt;The authors use data from the Military Health System (MHS) Data Repository covering fiscal years 2003–2013, encompassing roughly 9.6 million beneficiaries. A distinctive feature of the MHS is that active-duty providers are themselves MHS beneficiaries, so their race is observed in the same eligibility files used for patients — overcoming the typical absence of provider-race data in claims databases. The study focuses on four chronic, deadly but manageable conditions: diabetes, hypertension, hypercholesterolemia, and clinical atherosclerotic cardiovascular disease. Preventive care is measured by medication fill-days for condition-appropriate generic drugs, HEDIS-recommended Comprehensive Diabetes Care compliance, and (for a subset) blood pressure control. Mortality is tracked across the full sample period.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits quasi-random variation in provider racial composition induced by across-base moves. The MHS setting generates abundant moves driven by DoD personnel management needs — not by patient health or preferences. Using a movers-only differences specification (analogous to Finkelstein et al. 2016), the authors compare differential changes in outcomes for Black versus non-Black patients who move to bases with larger versus smaller increases in the share of Black providers. This design includes fixed effects for both sending and receiving bases, controlling flexibly for regional quality differences. The estimand is an intent-to-treat effect among patients living within 10 miles of a base (who use on-base care 66% of the time).&lt;/p&gt;
&lt;p&gt;The findings are consistent across all four disease samples. For diabetes, a move-induced one-standard-deviation increase in the share of Black diabetes providers is associated with a roughly 6 additional metformin fill-days per year (approximately 16% relative to the mean) and a 3 percentage-point increase (roughly 8% relative to the mean) in Comprehensive Diabetes Care compliance for Black relative to non-Black patients. Mortality falls by 0.4 percentage points — a 33% relative decline — for Black relative to non-Black diabetes patients following such a move.&lt;/p&gt;
&lt;p&gt;Pooling across all four chronic-disease samples, a one-standard-deviation move-induced increase in the Black provider share is associated with approximately 3 additional fill-days of relevant preventive medication and a roughly 0.2 percentage-point reduction in mortality — approximately 15% relative to the mean mortality rate — for Black relative to non-Black patients.&lt;/p&gt;
&lt;p&gt;A decomposition analysis combining the paper&amp;rsquo;s estimates with medical-literature parameters on the mortality effects of preventive medications finds that between 55% and 69% of the concordance mortality effect across the four disease samples can be attributed to improved medication adherence alone, with the remainder attributed to other aspects of the provider-patient relationship (e.g., lifestyle effects, other preventive care).&lt;/p&gt;
&lt;p&gt;Scope conditions: results are local to MHS movers, who are on average slightly younger and healthier than non-movers, potentially understating concordance benefits for the full population. The MHS covers over 3% of all Black U.S. residents, but beneficiaries may differ from the general population. The paper measures Black patient / Black provider concordance specifically; it does not establish a symmetric concordance effect for non-Black patients. The concordance effect estimated is relative — it captures how much Black patients benefit more than non-Black patients from moving to a higher Black-provider-share base. A system-wide spillover mechanism (non-Black providers improving care for Black patients when working alongside more Black providers) cannot be ruled out and would also be consistent with the core concordance motivation.&lt;/p&gt;
&lt;p&gt;Q: What is the central research question and why is the MHS an advantageous setting?
A: The paper asks whether racial concordance between providers and patients causes Black patients to use more preventive care and achieve better health outcomes, focusing on the trust and communication channel. The MHS is advantageous because active-duty providers are themselves MHS beneficiaries, making their race observable — a feature absent in most claims databases. Across-base moves are driven by DoD staffing needs rather than patient health or preferences, providing quasi-random variation in provider racial composition. The system offers complete claims data covering both on- and off-base care, allowing full mortality tracking.&lt;/p&gt;
&lt;p&gt;Q: How does the empirical strategy address selection concerns that plague prior concordance studies?
A: Prior studies face selection problems from Black patients choosing different doctors than white patients and from residential segregation concentrating Black patients and Black physicians in regions with distinct care quality. The movers-based differences specification directly addresses both problems: it uses only patients who move across bases, comparing how the same individual&amp;rsquo;s outcomes change relative to non-Black patients experiencing the same move, as a function of the move-induced change in the Black provider share. Inclusion of fixed effects for both sending and receiving bases accounts flexibly for regional quality differences. Balance tests on observable patient characteristics show no differential sorting of Black versus non-Black patients toward high-Black-provider-share bases.&lt;/p&gt;
&lt;p&gt;Q: What specific preventive care and outcome measures are used for each disease?
A: For diabetes, the primary measures are annual metformin fill-days and Comprehensive Diabetes Care (CDC) compliance — defined as receiving HbA1c testing, a retinal eye exam, and medical attention for nephropathy in the focal year — plus blood pressure control (available only from 2009 onward for on-base patients). For hypertension, the measures are annual fill-days of WHO-recommended antihypertensives (thiazides, ACEs/ARBs, or long-acting dihydropyridine CCBs) and blood pressure control. For hypercholesterolemia, the measure is fill-days of antilipemic agents, bile acid sequestrants, and statins. For atherosclerotic cardiovascular disease, the HEDIS statin therapy receipt indicator is used. Mortality is tracked across all four samples.&lt;/p&gt;
&lt;p&gt;Q: What are the main quantitative results for the diabetes sample?
A: A move-induced one-standard-deviation increase in the share of Black diabetes providers is associated with approximately 6 additional metformin fill-days annually for Black relative to non-Black patients (roughly 16% relative to the mean). Compliance with Comprehensive Diabetes Care increases by 3 percentage points for Black relative to non-Black patients (roughly 8% relative to the mean). Mortality falls by 0.4 percentage points for Black relative to non-Black patients — a 33% relative decline — in connection with the same one-standard-deviation increase in Black provider share.&lt;/p&gt;
&lt;p&gt;Q: What are the pooled results across all four chronic-disease samples?
A: Pooling across diabetes, hypertension, hypercholesterolemia, and atherosclerotic cardiovascular disease, a one-standard-deviation move-induced increase in the Black provider share is associated with approximately 3 additional preventive medication fill-days per year for Black relative to non-Black patients. The pooled mortality effect is a 0.2 percentage-point reduction — roughly 15% relative to the mean mortality rate — for Black relative to non-Black patients.&lt;/p&gt;
&lt;p&gt;Q: How much of the concordance mortality effect operates through medication adherence?
A: The decomposition combines the paper&amp;rsquo;s estimated concordance effects on medication fill-days with medical-literature estimates of the mortality impact of each additional fill-day. For the diabetes sample, increased metformin adherence (4.2 additional fill-days) explains approximately 58.8% of the 0.4 percentage-point concordance mortality effect, with the residual 41.2% attributed to other channels such as lifestyle changes or other preventive care. Across all four disease samples, the medication fill-day channel explains between 55% and 69% of the respective concordance mortality effects.&lt;/p&gt;
&lt;p&gt;Q: What specification checks do the authors conduct to validate causal identification?
A: The authors conduct five main checks. First, balance regressions show that move-induced changes in Black provider share are not differentially related to baseline patient characteristics for Black versus non-Black patients. Second, regressions of the probability of moving on initial Black provider share and its interaction with patient race yield a near-zero concordance coefficient (0.008, SE 0.023), indicating no differential sorting. Third, regressions of post-move on-base care share on the concordance interaction term yield a near-zero coefficient (0.002, SE 0.003), indicating no differential race-specific selection into on-base care. Fourth, a distance falsification test shows that concordance coefficients are near zero and statistically insignificant for patients living more than 10 miles from the base. Fifth, event-study dynamics show no pre-move divergence in preventive care adherence between Black and non-Black patients, with a positive divergence emerging only after the move to a higher Black-provider-share base.&lt;/p&gt;
&lt;p&gt;Q: How does the paper separate a concordance effect from a pure Black-physician-quality effect?
A: The paper estimates a &amp;ldquo;first stage&amp;rdquo; specification on the subsample receiving on-base care (where provider race is observed), regressing the change in the probability of visiting a Black provider on the move-induced change in Black provider density. The results show an approximately one-to-one relationship between higher Black provider availability and increased visits to Black providers for all patients, with only a modest differential by patient race. This confirms that non-Black patients also see more Black providers when Black provider density rises, allowing the interaction specification to isolate concordance from a pure physician-quality effect.&lt;/p&gt;
&lt;p&gt;Q: How do the authors assess the potential role of spillover effects?
A: The authors acknowledge they cannot rule out that some of the estimated concordance effect arises through system-wide spillovers — for instance, non-Black providers on bases with more Black colleagues may improve their care for Black patients through peer learning or information transmission. They note that even if such a spillover mechanism operates, it is still consistent with the paper&amp;rsquo;s core concordance motivation, because provider-knowledge deficiencies about treating Black patients are among the theorized channels of racial discordance.&lt;/p&gt;
&lt;p&gt;Q: What do the results imply for the overall racial mortality gap?
A: Among MHS beneficiaries aged 20–65, Black beneficiaries are roughly 38% more likely to have diabetes and die over the sample period than non-Black beneficiaries; this gap appears driven primarily by higher diabetes prevalence rather than a within-diabetes mortality gap. Applying the diabetes concordance mortality estimate (a 0.4 percentage-point reduction), the authors calculate that a one-standard-deviation increase in the Black provider share would reduce the overall diabetes mortality gap from 38% to approximately 21% — a substantial narrowing driven by the concordance effect operating through conditional-on-prevalence outcomes.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The results imply that investments in increasing physician workforce diversity could meaningfully reduce racial mortality disparities in the United States, particularly for chronic diseases manageable through preventive medication. The paper notes the results are relevant to affirmative action policies in medical school admissions, specifically the pending Supreme Court cases Students for Fair Admissions v. University of North Carolina and Students for Fair Admissions v. Harvard at the time of writing. The MHS population covered in the study includes over 3% of all Black U.S. residents, so the policy stakes extend substantially beyond the military context.&lt;/p&gt;
&lt;p&gt;Q: What are the limitations of the study regarding generalizability?
A: Movers in the chronic-disease samples are on average about four years younger and 0.2 percentage points less likely to die than non-movers, suggesting the local average treatment effect for movers may understate concordance benefits for the full population. The MHS population may be healthier overall than the general population, though conditioning on chronic-disease patients mitigates this concern. The paper covers only Black-patient/Black-provider concordance; concordance effects for other racial and ethnic groups are not estimated. The estimate of the concordance coefficient technically captures how much the Black patient / Black provider concordance effect exceeds the non-Black patient / non-Black provider concordance effect, meaning the absolute magnitude of Black concordance benefits is understated if non-Black concordance effects are also positive.&lt;/p&gt;
&lt;p&gt;Racial concordance: In this paper&amp;rsquo;s usage, the match between the race of a patient and their treating physician — specifically Black patient / Black provider pairing — theorized to improve care through trust, communication, and reduced provider knowledge deficiencies about Black patients.&lt;/p&gt;
&lt;p&gt;Provider Black share: The fraction of outpatient office visits for a given chronic condition at a given military base that are attended by Black active-duty providers, used as the base-level treatment variable; varies across bases from zero to approximately 20 percentage points in the pooled sample.&lt;/p&gt;
&lt;p&gt;Movers-based differences specification: An identification strategy that restricts to patients who relocate across military bases exactly once during the sample period and estimates the differential change in outcomes for Black versus non-Black patients as a function of the move-induced change in the base&amp;rsquo;s Black provider share, including fixed effects for both the sending and receiving base.&lt;/p&gt;
&lt;p&gt;Intent-to-treat (ITT) effect: The concordance estimate as applied to all patients living within 10 miles of a base — regardless of whether they actually received on-base care — to avoid selection bias from differential race-specific decisions to seek care on versus off base.&lt;/p&gt;
&lt;p&gt;Comprehensive Diabetes Care (CDC): A HEDIS composite measure requiring receipt of all three of the following in the focal year: HbA1c testing, a retinal eye exam, and medical attention for nephropathy (via microalbumin exam, ACE/ARB therapy, or nephropathy treatment).&lt;/p&gt;
&lt;p&gt;Medication fill-days: Annual days of supply dispensed for condition-appropriate generic medications (metformin for diabetes; thiazides/ACEs/ARBs/CCBs for hypertension; antilipemic agents, bile acid sequestrants, and statins for hypercholesterolemia; statins for atherosclerotic cardiovascular disease), used as the primary preventive care adherence measure.&lt;/p&gt;
&lt;p&gt;Decomposition of concordance mortality effect: A calculation that uses the paper&amp;rsquo;s estimated concordance effect on medication fill-days, combined with medical-literature estimates of the mortality impact per fill-day, to determine what share of the total concordance mortality effect passes through medication adherence versus other channels (lifestyle, other preventive care).&lt;/p&gt;</description></item><item><title>The Effects of Gender Integration on Men</title><link>https://macropaperwarehouse.com/papers/the-effects-of-gender-integration-on-men/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effects-of-gender-integration-on-men/</guid><description>&lt;p&gt;Greenberg, Wasserman, and Weber (2024/2026) ask whether men negatively respond—in terms of job performance, behavior, and workplace perceptions—when women first enter an exclusively male occupation. They exploit the staggered 2017-onward integration of women into U.S. Army infantry and armor combat companies following the 2016 rescission of the Ground Combat Exclusion Policy. The setting offers unusually clean causal identification: integration timing within Brigade Combat Teams was neither systematic nor data-driven, the Army&amp;rsquo;s rigid pay scales meant integration posed no displacement or wage threat to incumbent men, and roughly 391 companies are observed over 2012–2020. The empirical strategy is a staggered difference-in-differences design with company fixed effects, BCT-by-year-of-arrival fixed effects, and month-of-year fixed effects, applied to an individual-level sample of newly arrived male soldiers. Outcomes come from monthly administrative personnel records (retention, misconduct separations, demotions, criminal investigations, drug tests, medical profiles, physical fitness scores) and the Defense Organizational Climate Survey (DEOCS), a congressionally mandated annual survey with response rates above 50% covering organizational effectiveness, equal opportunity, and sexual assault prevention and response. The main finding is that integrating women into previously all-male combat companies does not negatively affect men&amp;rsquo;s performance or behavioral outcomes. Estimates are precise enough to rule out small detrimental effects: two years post-integration, the authors can rule out a 3% increase in attrition, a 5% increase in demotions, and a 4% increase in criminal investigations relative to their respective means. One behavioral outcome shows a statistically significant improvement: integration reduces separations for misconduct by 1.3 percentage points (16% of the mean). Drug test positivity also declines. The sole potential negative administrative finding is a 1.8-point decline in physical fitness scores (0.7% of the mean, roughly 5% of a standard deviation), but this does not affect pass rates and becomes statistically insignificant when scores are imputed using observable covariates. An aggregate Performance and Behavior Index rules out reductions of 0.8% of a standard deviation; the No Adverse Outcomes measure rules out a 1.2 percentage point increase (3% of the mean). Despite these null-to-positive performance effects, survey data reveal that integration causes a 5% of a standard deviation decline in men&amp;rsquo;s overall perceptions of workplace quality. This perception decline is concentrated in companies that received a female officer shortly after integration. Among companies integrated only with female enlisted soldiers (no female officer), men&amp;rsquo;s workplace attitudes actually improve by 14.7% of a standard deviation. Two mechanisms are examined: increased male awareness of pre-existing workplace problems (supported by higher reported observations of bullying, hazing, and unwanted comments, especially among male officers in female-officer-integrated companies), and negative reactions to women in positions of authority (supported by broader declines in organizational effectiveness perceptions not confined to equal-opportunity items). Crucially, the perception decline does not translate into retaliatory behavior or performance deterioration; companies integrated with a female officer show some performance gains, and female enlisted soldiers in those companies report fewer workplace problems. Scope conditions: findings apply to a high-stakes, traditionally male-dominated, hierarchical occupational setting during 2017–2020, a period when U.S. deployment missions were primarily advise-and-assist rather than direct combat. Integration increased female representation by approximately 4.7 percentage points on average.&lt;/p&gt;
&lt;p&gt;Q: What was the policy change studied and why does it offer causal leverage?
A: In December 2015, Secretary of Defense Ashton Carter announced that all U.S. military occupations, including infantry and armor combat roles, would open to women starting in 2016. Women did not begin arriving at operational companies until 2017 due to training timelines. Within BCTs, the selection of which companies to integrate was neither systematic nor data-driven, and baseline characteristics of integrated and non-integrated companies are similar after conditioning on BCT and company-type fixed effects, supporting a parallel trends assumption.&lt;/p&gt;
&lt;p&gt;Q: What are the main administrative performance findings?
A: Integration has a positive but statistically insignificant effect on retention, and reduces misconduct separations by 1.3 percentage points (significant at the 5% level), representing a 16% reduction relative to the mean. Demotions, criminal investigations (including sex-related and domestic violence), and medical profiles show no significant negative effects, with precision sufficient to rule out 5% increases in demotions and 4% increases in criminal investigations. Physical fitness scores decline by 1.8 points (0.7% of mean, approximately 5% of a standard deviation), but pass rates are unaffected and the estimate becomes insignificant when scores are imputed with observable covariates.&lt;/p&gt;
&lt;p&gt;Q: What does the aggregate performance index show?
A: The Performance and Behavior Index—an equally weighted z-score average of retention, misconduct separations, demotions, criminal investigations, medical profiles, promotions to Sergeant, and physical fitness outcomes—shows a positive but insignificant effect of integration, ruling out reductions of 0.8% of a standard deviation. The No Adverse Outcomes measure rules out a 1.2 percentage point increase (3% of the mean incidence of adverse outcomes).&lt;/p&gt;
&lt;p&gt;Q: How do men&amp;rsquo;s workplace perceptions change after integration?
A: The overall workplace quality index constructed from all DEOCS Likert-scale items declines by 5% of a standard deviation following integration, spanning perceptions of organizational effectiveness, workplace inclusivity, and sexual assault prevention and response. This average effect masks critical heterogeneity by the rank composition of integrating women.&lt;/p&gt;
&lt;p&gt;Q: What is the key heterogeneity in survey responses?
A: The decline in men&amp;rsquo;s perceptions is entirely driven by companies that received a female officer shortly after integration. In companies integrated only with female enlisted soldiers (17% of integrating companies did not receive a female officer within a month), men&amp;rsquo;s perceptions improve by 14.7% of a standard deviation. Male officers show a larger negative shift than male enlisted soldiers in officer-integrated companies, and this difference is statistically significant.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms explain the negative perception response to female officers?
A: Two mechanisms are investigated. First, increased awareness: male soldiers—especially male officers—report observing more bullying, hazing, and unwanted comments after a female officer is integrated but not after integration with only female enlisted, and the decline in perceptions of sexual assault prevention and response is significantly larger among male officers than enlisted men, consistent with shared leadership roles amplifying awareness of workplace problems. Second, negative reactions to female authority: declines in perceptions are more pronounced on organizational effectiveness questions than on equal-opportunity items and extend to issues unrelated to women, suggesting broader dissatisfaction with female leadership alongside heightened awareness.&lt;/p&gt;
&lt;p&gt;Q: Is the decline in perceptions related to actual differences in female officer qualifications or preferential treatment?
A: No. Female and male officers have similar baseline characteristics including educational background and experience. Companies integrated with female officers perform at least as well as non-integrated companies or those integrated only with enlisted women on administrative metrics. There is no evidence that male officers waited longer for leadership assignments relative to female colleagues, ruling out perceived preferential treatment as a driver.&lt;/p&gt;
&lt;p&gt;Q: Do men&amp;rsquo;s negative perceptions of female officers translate into retaliatory behavior toward women?
A: No. Administrative misconduct metrics show some improvements in male behavior when a female officer is present. Female enlisted soldiers in female-officer-integrated companies report fewer workplace problems on the climate survey than female enlisted soldiers in companies integrated without a female officer, indicating that the presence of a female officer generates benefits for female enlisted soldiers rather than backlash against them.&lt;/p&gt;
&lt;p&gt;Q: Does heterogeneity by integration intensity or women&amp;rsquo;s rank affect administrative outcomes for men?
A: Integration intensity (number of women initially integrated) and rank composition (female officers vs. only female enlisted) do not produce negative administrative outcomes in any subgroup. The aggregate Performance and Behavior Index shows a positive effect when a female officer is included. Effects also do not vary with male soldiers&amp;rsquo; rank (enlisted vs. officer) or their tenure in the company.&lt;/p&gt;
&lt;p&gt;Q: What happens in units that deploy to combat zones?
A: Approximately one in five integrated companies deployed to a combat zone within two years of integration. Integration does not negatively affect retention, behavior, or performance of men in deploying units. Declines in workplace perceptions are larger for deploying units and are most pronounced when integration occurs shortly after return from deployment, consistent with deployment strengthening in-group identity among male soldiers rather than women performing poorly during combat-zone service.&lt;/p&gt;
&lt;p&gt;Q: What do the findings imply for theories of identity economics and the pollution theory of discrimination?
A: The null-to-positive behavioral and performance responses to women&amp;rsquo;s entry contradict the predictions of Akerlof and Kranton&amp;rsquo;s (2000) identity economics model and Goldin&amp;rsquo;s (2014) pollution theory of discrimination, which predict retaliatory or otherwise unproductive behaviors when women enter a male-dominated occupation. The paper shows that, to the extent identity concerns shape male responses, these are confined to subjective perceptions and do not manifest in diminished performance, retention, or conduct.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for employers considering gender integration?
A: The paper provides evidence against the argument that men will become less productive when women enter previously male-only occupations, a justification sometimes offered for excluding women from such jobs. The finding that performance and behavior are unaffected—and misconduct actually declines—allows policymakers and employers to weigh these results against concerns about operational or productivity costs of integration. The perception gap between men&amp;rsquo;s attitudes and actual outcomes points to a need for targeted leadership and organizational interventions, particularly around the introduction of female leaders.&lt;/p&gt;
&lt;p&gt;Ground Combat Exclusion Policy (GCEP): The U.S. military policy, rescinded in 2013 and fully eliminated by Secretary of Defense Carter in 2016, that precluded women from serving in infantry and armor positions; the policy whose removal is the source of the integration shock studied. | Staggered difference-in-differences: The empirical strategy exploiting the sequential, non-systematic integration of women into combat companies across years 2017–2023, using never-yet-treated companies as a comparison group with company fixed effects and BCT-by-year-of-arrival fixed effects. | Performance and Behavior Index: An equally weighted average of z-scored administrative outcomes (retention, no misconduct separations, no demotions, no criminal investigations, no medical profiles, promotion to Sergeant, physical fitness pass/fail and score), constructed for enlisted soldiers, oriented so higher values indicate better outcomes. | Leaders First policy: An Army requirement that a female officer be assigned to a combat company before or alongside female junior enlisted soldiers to ensure female leadership presence at integration; adherence was not universal, with 17% of integrating companies not following it within one month. | Defense Organizational Climate Survey (DEOCS): A congressionally mandated, annually administered, anonymous survey of military unit members covering organizational effectiveness, equal opportunity, and sexual assault prevention and response; the source of workplace perception outcomes. | Pollution theory of discrimination: Goldin&amp;rsquo;s (2014) theory that men may seek to exclude women from occupations because women&amp;rsquo;s presence is perceived to diminish the occupation&amp;rsquo;s prestige or status, potentially leading to retaliatory or unproductive behaviors among incumbent male workers. | Perception-performance wedge: The paper&amp;rsquo;s central finding that men&amp;rsquo;s subjective workplace quality perceptions decline with integration—especially when a female officer is present—even as objective administrative performance and behavior metrics show null to positive effects, a divergence between attitudes and measurable outcomes.&lt;/p&gt;</description></item><item><title>The Effects of Mandatory Profit-Sharing on Workers and Firms</title><link>https://macropaperwarehouse.com/papers/the-effects-of-mandatory-profit-sharing-on-workers-and-firms/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effects-of-mandatory-profit-sharing-on-workers-and-firms/</guid><description>&lt;p&gt;This paper studies the causal effects of mandatory profit-sharing on workers and firms using a quasi-experimental design arising from a 1990 French reform that lowered the eligibility threshold for mandatory profit-sharing from 100 to 50 employees. The institutional setting is the French RSP (Réserve Spéciale de Participation), a profit-sharing scheme in place since 1967 that requires firms above the threshold to distribute a fraction of their excess profits — defined as net income above 5% of book equity — to employees according to a formula scaled by the firm&amp;rsquo;s labor share. For the median firm, this amounts to roughly 10.5% of pre-tax income transferred to workers.&lt;/p&gt;
&lt;p&gt;The authors employ two primary empirical strategies. First, a bunching analysis exploits the pre-reform distribution of firm employment around the 100-employee threshold as a revealed-preference test of whether firms perceive profit-sharing as a net cost. Second, a difference-in-differences design compares treated firms (55–85 employees in 1989–1990, who become newly subject to the regulation after 1991) against two control groups: small firms (35–45 employees, likely never subject) and large firms (120–300 employees, already subject). Data come from the universe of French corporate tax files (FICAS) and a linked employer-employee panel (DADS) covering approximately 4% of private-sector workers, spanning 1985–1997.&lt;/p&gt;
&lt;p&gt;The bunching analysis documents a 22.3% excess density in the 95–99 employee bin before the reform, which disappears after 1991. Three tests — comparing wage bills per employee across the threshold, cross-checking with DADS employment records, and examining profitability patterns — collectively support the conclusion that bunching reflects genuine employment reductions rather than under-reporting. The implied employment loss is approximately 1.67% of total employment among affected firms.&lt;/p&gt;
&lt;p&gt;The difference-in-differences results yield the following firm-level findings: (a) the total compensation share (wages plus profit-sharing divided by value added) rises by 1.8 percentage points for firms with positive excess profits; (b) 77% of this increase comes at the expense of firm owners — the profit share falls by 1.37 percentage points; (c) the remainder is borne by the government through a reduction in the corporate income tax share; (d) the wage share (base wages only) is unaffected, indicating that owners do not reduce wages to offset the cost of profit-sharing; (e) investment and total factor productivity show no statistically significant change — effects on productivity are bounded below ±1% for several TFP measures; and (f) the capital-labor ratio shows a small, mostly insignificant negative effect, consistent with a model-implied increase in the cost of capital of only 0.43 percentage points.&lt;/p&gt;
&lt;p&gt;Worker-level analysis using the linked employer-employee data confirms that average total compensation rises by approximately 3.5% for workers in treated firms, with no decline in base wages. Critically, this average conceals distributional heterogeneity across the skill spectrum. For low- and medium-skill workers (blue-collar workers, clerks, supervisors, skilled technicians), total compensation rises while base wages are unchanged — consistent with wage rigidity binding for these groups. For high-skill workers (managers, engineers, executives), base wages fall by enough to leave total compensation unchanged, consistent with more flexible wages at the upper end of the skill distribution. This pattern implies that mandatory profit-sharing is a progressive policy within firms, redistributing excess profits predominantly to lower-skill workers.&lt;/p&gt;
&lt;p&gt;The paper concludes that France&amp;rsquo;s mandatory profit-sharing scheme, as implemented, functions as a non-distortive redistributive tool: it transfers excess profits from shareholders to lower-skill workers without generating measurable productivity losses or large investment distortions. The fiscal cost is non-trivial: each dollar transferred to workers costs approximately 20 cents in foregone corporate income tax. The scheme also has an inherent inequality in its redistribution since it exclusively benefits workers in profitable firms, and firms&amp;rsquo; excess profits are highly persistent.&lt;/p&gt;
&lt;p&gt;Q: What is the French RSP and how does the formula work?
A: The RSP (Réserve Spéciale de Participation) is a mandatory profit-sharing fund established by executive order in 1967. The formula is RSP = 0.5 × (wage bill / value added) × max(net income − 5% × book equity, 0). The 5% deduction represents lawmakers&amp;rsquo; view of fair compensation to shareholders; any excess is split between shareholders and workers, with the split scaled by the firm&amp;rsquo;s labor share. For the median firm in the sample — ROE of 12%, labor share of 0.52, corporate tax rate of 37% — the formula yields roughly 9.5% of pre-tax income, and in post-1991 data the realized average is 10.5% of pre-tax income for firms with positive excess profits.&lt;/p&gt;
&lt;p&gt;Q: Why can&amp;rsquo;t a standard regression discontinuity be used at the 100-employee threshold?
A: Because firms strategically control their position relative to the threshold — the bunching analysis itself demonstrates this. When firms sort non-randomly around the cutoff, the local randomization assumption underlying RD is violated. The authors instead use a difference-in-differences design exploiting the time variation introduced by the 1990 reform.&lt;/p&gt;
&lt;p&gt;Q: How large is the pre-reform bunching and what does it imply?
A: The distribution of employment shows 22.3% excess density in the 95–99 employee bin relative to the post-reform counterfactual distribution. Interpreting this as real employment reduction (supported by three empirical tests), the implied employment loss is approximately 1.67% of total employment among firms in the 85–120 employee range. Dynamic bunching analysis shows this is persistent rather than temporary — the 100-employee threshold significantly constrained three-year employment growth for firms in the 85–99 range in the pre-reform period.&lt;/p&gt;
&lt;p&gt;Q: How do the authors establish that bunching is real rather than under-reporting of employment?
A: Three tests are conducted. First, wage bills per employee show no discontinuity around the 100-employee threshold in either period, ruling out systematic under-reporting of headcount while truthfully reporting wages. Second, employment from DADS payroll records — harder to manipulate — shows only a statistically insignificant gap of roughly 0.5 employees relative to tax-file employment just below the threshold, far too small to shift firms across the 100-employee bin. Third, profitability and value added per employee are significantly higher just below the threshold, consistent with more profitable firms having stronger incentives to bunch through genuine employment reductions.&lt;/p&gt;
&lt;p&gt;Q: What is the main identification strategy for the firm-level analysis?
A: A difference-in-differences design where treated firms have 55–85 employees in both 1989 and 1990 (newly subject to the mandate after 1991), compared to small control firms with 35–45 employees (likely never subject) and large control firms with 120–300 employees (likely always subject). Specifications include firm fixed effects and county-by-year and industry-by-year fixed effects. Parallel pre-trends are confirmed graphically and in event-study regressions. The design is intent-to-treat: by 1997, 26.7% of treated firms had shrunk below 50 employees and did not actually pay profit-sharing. LATE estimates are obtained via 2SLS.&lt;/p&gt;
&lt;p&gt;Q: What are the main firm-level findings on compensation and profit shares?
A: For treated firms with positive excess profits, the total compensation share rises by 1.8 percentage points. The wage share (base wages only, excluding profit-sharing) is precisely estimated at zero — owners do not reduce wages. The profit share falls by 1.37 percentage points, accounting for 77% of the increase in total compensation. The remaining approximately 23% is borne by the tax authority through a reduction in the corporate income tax share, since profit-sharing reduces the corporate income tax base. These findings are robust to balanced vs. unbalanced samples and to alternative control group definitions.&lt;/p&gt;
&lt;p&gt;Q: Does mandatory profit-sharing raise or lower firm productivity?
A: Across five different TFP estimators (Olley-Pakes, Olley-Pakes with Ackerberg-Caves-Frazer correction, Wooldridge, Levinsohn-Petrin, and Ackerberg-Caves-Frazer), the effect of mandatory profit-sharing on productivity is a precisely estimated zero. For several measures, effects larger than ±1% in magnitude can be rejected. Softer measures of effort — sick leave rates and the probability of working extra hours — also show no significant change. This null finding contrasts with the literature on voluntary profit-sharing adoption, which typically finds 3–5% productivity gains, likely reflecting selection bias in that literature.&lt;/p&gt;
&lt;p&gt;Q: Does mandatory profit-sharing distort investment?
A: The effect on investment is small and mostly statistically insignificant. The theoretical model shows why: the profit-sharing formula is based on excess profits (net income minus 5% of book equity), not total profits. When the firm&amp;rsquo;s actual cost of equity approximately equals the regulatory 5% benchmark, the distortion to the cost of capital is zero. The calibrated distortion to the user cost of capital is only 0.43 percentage points — approximately 1.9% of the standard user cost — implying an investment ratio reduction of about 0.84 percentage points using estimated elasticities from Chodorow-Reich et al. (2024). Empirically, capital-labor ratios show a small, largely insignificant negative effect.&lt;/p&gt;
&lt;p&gt;Q: How does profit-sharing incidence differ across the skill distribution?
A: The worker-level DADS analysis reveals that the average 3.5% increase in total compensation masks sharp heterogeneity. For low- and medium-skill workers (blue-collar workers, clerks, supervisors, skilled technicians), total compensation rises while base wages are unchanged. For high-skill workers (managers, engineers, executives), base wages decline sufficiently to leave their total compensation unchanged. The authors interpret this pattern as consistent with wage rigidity being more binding for lower-skill workers — due to the federal minimum wage and collective agreements — than for managers whose pay is more flexibly set.&lt;/p&gt;
&lt;p&gt;Q: Why does profit-sharing not affect base wages for low-skill workers?
A: Two candidate explanations are considered. The risk channel — that profit-sharing is risky and thus less valuable to risk-averse workers, who demand wage compensation — is rejected empirically because profit-sharing only marginally increases the variability of workers&amp;rsquo; total earnings. The wage rigidity channel is supported: France&amp;rsquo;s binding federal minimum wage and widespread collective agreements constrain downward adjustment in base wages for lower-skill workers, so firms cannot pass through profit-sharing costs as lower wages for this group.&lt;/p&gt;
&lt;p&gt;Q: What is the fiscal cost of the profit-sharing scheme?
A: Each dollar transferred to workers through mandatory profit-sharing costs approximately 20 cents in reduced corporate income tax receipts, since profit-sharing payments are deductible from taxable income. The paper notes this is a partial fiscal evaluation; a full assessment would also require analyzing personal income tax implications, which are left for future work.&lt;/p&gt;
&lt;p&gt;Q: How does this scheme compare to a corporate income tax as a redistributive tool?
A: Both instruments reduce firm profits and can benefit workers, but differ in three key respects. First, the tax base differs: profit-sharing targets excess profits above 5% of book equity whereas the corporate income tax applies to all corporate earnings, generating different distortions to investment. Second, profit-sharing goes directly to workers in the same firm, whereas corporate tax revenues are redistributed through general government spending — making the incidence more direct and more closely monitored by workers. Third, workers have stronger incentives to monitor firm compliance with profit-sharing (each euro of diverted excess profit reduces workers&amp;rsquo; collective income by roughly 10–15 cents) than with corporate taxes.&lt;/p&gt;
&lt;p&gt;Q: How does this paper compare to findings on mandatory profit-sharing in Peru?
A: Tolentino (2022) studies a mandatory profit-sharing scheme in Peru exploiting a 20-employee eligibility threshold and finds larger distortions — reductions in both investment and productivity. The authors attribute this difference to two features: the Peruvian scheme applies to the entirety of post-tax profits rather than excess profits above an equity deduction, creating a broader and more distortionary base; and there is pre-existing bunching at the Peruvian threshold even before the scheme was introduced, suggesting confounding pre-existing regulations.&lt;/p&gt;
&lt;p&gt;Q: What are the scope conditions on the external validity of the findings?
A: The findings apply specifically to mandatory profit-sharing under the French RSP formula — which exempts a 5% equity return from the profit-sharing base, limiting distortions — during 1985–1997, for firms in the 55–300 employee range. The null productivity effect may not generalize to voluntary schemes, where selection on anticipated gains likely produces positive correlations. The redistributive finding (benefiting lower-skill workers) is specific to a context with binding minimum wages and collective agreements that constrain wage adjustment for that group. The fiscal cost calculation also excludes personal income tax effects.&lt;/p&gt;
&lt;p&gt;Excess profits: Defined in the paper as net income minus 5% of book equity — the amount above what lawmakers considered fair compensation to shareholders. Only excess profits (not total profits) are subject to the mandatory profit-sharing formula.&lt;/p&gt;
&lt;p&gt;RSP formula (Réserve Spéciale de Participation): The statutory formula RSP = 0.5 × (wage bill / value added) × max(net income − 5% × book equity, 0), scaled by the firm&amp;rsquo;s labor share to reflect labor&amp;rsquo;s contribution to production. Unchanged since 1967.&lt;/p&gt;
&lt;p&gt;Total compensation share: The ratio of (wage bill plus profit-sharing) to value added — the paper&amp;rsquo;s primary measure of workers&amp;rsquo; overall claim on firm output, as distinct from the wage share (wage bill alone divided by value added).&lt;/p&gt;
&lt;p&gt;Wage incidence parameter (λ): The fraction of profit-sharing that firms pass through to workers as lower base wages. λ = 1 means full incidence (workers&amp;rsquo; total compensation unchanged); λ = 0 means no incidence (workers fully benefit). The paper&amp;rsquo;s empirical findings are consistent with λ ≈ 0 for low-skill workers and λ ≈ 1 for high-skill workers.&lt;/p&gt;
&lt;p&gt;Bunching: The empirical phenomenon whereby firms cluster employment just below the 100-employee regulatory threshold to avoid mandatory profit-sharing. The paper uses the pre- vs. post-reform shift in the employment distribution as a revealed-preference test of whether firms perceive the scheme as a net cost.&lt;/p&gt;
&lt;p&gt;Intent-to-treat (ITT) design: The empirical design comparing firms that were in the newly eligible size range (55–85 employees) just before the 1990 reform against firms that were either always or never eligible, regardless of whether treated firms actually ended up paying profit-sharing post-reform. LATE estimates are obtained via 2SLS to recover effects on actual compliers.&lt;/p&gt;
&lt;p&gt;Distortion to user cost of capital: The additional cost of capital induced by profit-sharing, equal to ϕ × γ(1−λ) / [1 − γ(1−τ)] × (re − ρ), where ρ = 5% is the regulatory equity benchmark. When the firm&amp;rsquo;s actual cost of equity equals the 5% benchmark, this distortion is zero — a feature that distinguishes the French scheme from a standard corporate income tax.&lt;/p&gt;</description></item><item><title>The Effects of Medical Debt Relief: Evidence from Two Randomized Experiments</title><link>https://macropaperwarehouse.com/papers/the-effects-of-medical-debt-relief-evidence-from-two-randomized-experiments/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-effects-of-medical-debt-relief-evidence-from-two-randomized-experiments/</guid><description>&lt;h2 id="layer-1-overview"&gt;Layer 1: Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks whether relieving downstream medical debt — debt that has been sold to third-party debt collectors — causes improvements in financial outcomes, mental and physical health, and healthcare utilization for recipients. The question is motivated by a large correlational literature documenting strong associations between medical debt and adverse outcomes, and by the rapid expansion of government and private debt relief programs that, as of mid-2024, had committed or planned over $14.6 billion in relief.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Design&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors partnered with RIP Medical Debt (a non-profit that purchases and forgives medical debt for government and private donors) to conduct two randomized controlled trials between March 2018 and October 2020. In total the experiments relieved medical debt with a face value of $169 million for 83,401 people.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Hospital debt experiment&lt;/strong&gt;: RIP purchased a random subset of debt from a large for-profit hospital system at the juncture when the hospital would normally sell accounts to a debt collector (approximately one year after the medical service). The purchase price was 5.5 cents per dollar of face value. The treatment group consisted of 14,377 people who received $19 million in face-value relief (average of $1,321 per person). The 61,496-person control group had their debt pursued by the collector under normal protocol.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Collector debt experiment&lt;/strong&gt;: RIP purchased a random subset of older debt already under collection on the secondary market for several years, at a price of less than one cent per dollar. The treatment group consisted of 69,024 people who received $150 million in face-value relief (average of $2,167 per person). The 68,014-person control group retained their debt.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Credit reporting sub-experiment&lt;/strong&gt;: Partway into the collector debt experiment, the debt collector ceased reporting medical debt to the credit bureaus, reflecting an industry-wide trend. The authors isolate 2,761 accounts (6.8% of wave 1) that were reported prior to treatment assignment to estimate the effects of debt relief when accounts would have been counterfactually reported, compared to the subsequent no-reporting environment.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Outcomes are tracked using quarterly depersonalized credit bureau data from TransUnion (spanning at least four quarters before to four quarters after treatment), collections account data on future bill accrual, and a multimodal survey of 2,888 hospital debt experiment respondents measuring mental and physical health, healthcare utilization, and financial wellness. The primary credit-bureau outcome is the number of accounts past due; the primary survey outcome is the share with at least moderate depression (PHQ-8).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Credit market outcomes (main experiments)&lt;/strong&gt;: In both the hospital and collector debt experiments — where there is no counterfactual credit bureau reporting — debt relief has no average effect on financial distress, credit access, or credit utilization. The effect on the number of accounts past due is -0.01 (statistically insignificant; 95% CI excludes effects smaller than -0.04, relative to a control mean of 1.20). Effects on credit card balances (95% CI: -$42 to $47 relative to a mean of $1,481) and auto loan balances (95% CI: -$235 to $148 relative to a mean of $8,020) are similarly precise nulls. These null effects hold for the hospital debt sample (younger debt, 1.3 years old on average) and the collector debt sample (older debt, 7.0 years old on average), and across all preregistered subgroups.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Credit reporting sub-experiment&lt;/strong&gt;: When control group accounts are counterfactually reported, debt relief immediately raises credit scores by an economically small average of 3.4 points (p-value 0.021), with a larger 13.8-point increase (p-value 0.008) for persons with no other debt in collections. Credit limits grow gradually, reaching $340 (15.3% of the post-reporting control mean of $2,231; p-value 0.010) after the no-reporting period begins, with larger effects for those with no other debt in collections. Once control group reporting ceases, both the credit score and credit limit effects converge to zero for those with other debts in collections. No effects on borrowing or financial distress measures are detected in this sub-experiment.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Collections account outcomes (bill repayment)&lt;/strong&gt;: Debt relief causes a statistically significant 1.1 percentage-point increase in the probability of having another unpaid bill sent to collections (6.6% of the control mean of 16.2%; p-value &amp;lt; 0.05) and a $15 increase in the dollar amount of future medical debt sent to collections (7.2% of the control mean of $208). The increase is almost entirely attributable to pre-relief medical services, indicating reduced repayment of existing bills rather than greater healthcare utilization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Survey outcomes&lt;/strong&gt;: There are no detectable average effects on depression (primary outcome), anxiety, stress, subjective well-being, or general health. Debt relief raises the share with at least moderate depression by a statistically insignificant 3.2 percentage points (p-value 0.097; control mean 45.0%); a 95% CI rules out a reduction of more than 0.6 percentage points, well below the 7.0 percentage-point improvement predicted by the median expert respondent. There are similarly null effects on healthcare utilization and financial wellness as measured in the survey.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study focuses specifically on downstream medical debt in collections — debt that has already been through the hospital billing cycle and sold to third-party collectors. Results do not necessarily apply to upstream debt relief (e.g., financial assistance programs applied closer to the time of the medical event), nor to populations with different baseline financial profiles. The credit reporting results are most relevant to the prior regime of widespread reporting; under the current environment in which most medical debt has been removed from credit reports, the credit-access channel is largely foreclosed.&lt;/p&gt;
&lt;h2 id="layer-2-qa"&gt;Layer 2: Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why did the authors focus specifically on downstream medical debt in collections, and how does this define the scope of their study?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors focus on downstream medical debt because this is the target of essentially all large-scale government and private relief programs working with RIP Medical Debt, and because it is the category of debt that is most comprehensively observable. Downstream medical debt is defined as bills that have been or are about to be sold by the healthcare provider to a third-party debt collector. This focus excludes upstream unpaid bills still held by the hospital, bills being paid over time, and medical expenses charged to credit cards. The distinction matters because prior literature on hospital financial assistance programs finds substantial benefits from upstream interventions that relieve debt closer to the precipitating medical event; the authors&amp;rsquo; null results are explicitly scoped to the downstream, post-collection stage.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why did the purchase price of medical debt (5.5 cents per dollar for hospital debt, less than 1 cent per dollar for collector debt) suggest caution about expected financial impacts ex ante?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors argue that in a competitive market, the purchase price of medical debt reflects the sum of expected recovery rates and collection costs. A price of 5.5 cents per dollar implies that actual recovery (what collectors expect to collect from patients) is very low. Even if all of the expected recovery is passed through to the patient as a financial benefit, the direct liquidity gain from debt forgiveness is a small fraction of the debt&amp;rsquo;s face value. For the collector debt experiment, where the purchase price is less than 1 cent per dollar, the expected direct financial benefit to recipients is even smaller. The authors note that survey respondents expected to pay 54% of their outstanding medical debt and thought it fair to pay 37%, suggesting that perceived (rather than actual) payment obligations may be what connects medical debt to financial behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How was random assignment implemented in the hospital debt experiment, and what design features ensure the validity of the experiment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Within each of 18 waves between August 2018 and October 2020, RIP received a portfolio of unpaid bills from the hospital system. Persons were grouped at the individual level and stratified by the amount of debt, state of residence, insurance status, and a collections score predicting repayment likelihood. Within strata, persons were randomly assigned to treatment or control, with approximately 20% treated per wave (varying with donor funding). The hospital was unaware of the intervention, eliminating scope for selection of particularly uncollectible accounts. Treatment notification occurred via two letters sent approximately three and six weeks post-purchase. Balance tests confirm successful randomization: all p-values on baseline characteristics are above 0.05, and F-tests fail to reject joint balance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What was the credit reporting sub-experiment and how was it identified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The debt collector in the collector debt experiment historically reported medical debt to the credit bureaus but largely ceased doing so before the first intervention wave (March 2018), reflecting broader industry concerns about CFPB enforcement and data integrity risk. However, a subset of accounts — 2,761 accounts (6.8% of wave 1, with virtually identical match rates across treatment and control) — were still being reported until 2019 Q1 (three quarters after wave 1 and one quarter after wave 2). This created a natural sub-experiment: for this subset, treatment group accounts were removed from credit reports immediately upon debt relief, while control group accounts continued to be reported for three more quarters before also being removed. The authors identify reported accounts by matching dollar amounts in collections account data to credit bureau tradeline data in the four quarters prior to intervention, and use this variation to estimate effects separately for the &amp;ldquo;reporting&amp;rdquo; and &amp;ldquo;no-reporting&amp;rdquo; periods.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What are the exact estimated effects on credit scores and credit limits in the credit reporting sub-experiment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;During the three quarters when control group accounts are still reported to credit bureaus, debt relief raises credit scores by an average of 3.4 points (p-value 0.021) for the full reporting subsample. The effect is concentrated among those with no other debt in collections: 13.8 points (p-value 0.008) versus 1.2 points (p-value 0.440) for those with other debt in collections. Credit limits increase gradually, reaching $340 (15.3% of the post-reporting control mean of $2,231; p-value 0.010) by the four quarters after control group reporting ceases. Among persons with no other debt in collections, this credit limit effect grows to $922 (23% of the control mean; p-value 0.070). Once control group reporting stops, both the credit score effect and the credit limit growth converge to zero for persons with other debts in collections. The event study coefficients show the credit limit effect growing approximately linearly over five quarters post-intervention before leveling out.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper rule out the possibility that medical debt relief increases healthcare utilization, thereby causing more future medical bills?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The collections account analysis separates future debt accrual into debt associated with pre-relief medical services (which can only result from reduced repayment of existing bills) and post-relief medical services (which could reflect either increased utilization or changed repayment of new bills). Panel B of Table VI shows that virtually all of the increased debt sent to collections — a $15 increase and 1.1 percentage-point increase in the probability of any future collection — is attributable to pre-relief services. Panel C shows statistically insignificant increases in future debt from post-relief services. The authors therefore attribute the effect to reduced payment of existing bills and conclude they &amp;ldquo;cannot rule in or rule out effects on healthcare utilization&amp;rdquo; for the post-relief services channel, but the dominant mechanism is behavioral change in repayment of already-incurred debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are the three mechanisms proposed to explain the reduction in repayment of existing medical bills, and which mechanism is rejected?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors offer three candidate mechanisms for the 6.6% relative increase in the probability of future bill collections: (i) an expectations mechanism, in which beneficiaries reduce payments because they anticipate future debt relief from similar charitable programs; (ii) a targeting mechanism, drawing on Dobkin et al. (2018), in which patients tolerate a certain level of indebtedness — relieving some debt creates &amp;ldquo;room&amp;rdquo; in their debt budget, so they reduce payment of remaining bills to return to that target level; and (iii) a confusion mechanism, in which recipients mistakenly believe the relief applied to non-forgiven bills (the notification letter explicitly stated &amp;ldquo;the forgiveness is for this outstanding bill only&amp;rdquo; but patients may not have internalized this). The income effect or &amp;ldquo;flypaper&amp;rdquo; mechanism — the idea that financial relief of existing debt frees up mental-account resources for paying medical bills, thereby increasing repayment — is explicitly rejected by the data, as the effect goes in the direction of less repayment, not more.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What did the expert survey predict, and how did those predictions compare to the experimental estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;An expert survey conducted between April and May 2022 — after the interventions were completed but before results were released — asked academics, non-profit staff, hospital revenue-cycle practitioners, and policymakers to predict the impact of the hospital debt experiment. The median expert predicted a 7.0 percentage-point reduction in depression (8.0 points when weighted by confidence), a 10.2 percentage-point reduction in borrowing (13.7 points when confidence-weighted), and meaningful improvements in healthcare access. In total, 75.6% of respondents predicted medical debt relief is at least a moderately valuable use of charity resources, and 51.1% thought it very or extremely valuable. The authors estimate a statistically insignificant 3.2 percentage-point increase in depression (not a decrease), and a 95% confidence interval that rules out a reduction in depression of more than 0.6 percentage points — far below the 7.0 percentage-point expert prediction.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What survey methodology was used, and what response rate was achieved?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The survey, administered by NORC at the University of Chicago, targeted a random subset of 14,922 hospital debt experiment participants who entered the study after September 2019 (waves 6-18) and owed at least $500. The protocol spanned 13 weeks and included five postal mailings (including a $2 upfront incentive and a $5 incentive with the paper survey), twice-weekly email reminders, certified mail delivery of the full survey instrument, and telephone interviews by a US-based call center. Respondents received a $50 completion incentive. The protocol achieved a 19.4% response rate, with 68% responding via web, 10% via telephone, and 23% via mail. The survey was titled &amp;ldquo;Health and Financial Wellness Study&amp;rdquo; and made no reference to RIP Medical Debt to avoid priming respondents. Respondents were surveyed on average 13 months after treatment assignment (interquartile range 10 to 17 months).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What heterogeneity in survey outcomes was detected, and how do the authors interpret the anomalous depression finding for high-debt recipients?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Across all four preregistered heterogeneity dimensions (medical debt amount, age of debt, age of person, amount of other debt in collections), null effects on survey outcomes were found in 15 of 16 subgroups. The exception is persons in the fourth quartile of medical debt eligible for relief, for whom debt relief caused a statistically significant 12.4 percentage-point increase in depression (p-value 0.002) relative to a control mean of 45.9%, with similar patterns for anxiety, stress, subjective well-being, and general health. The authors consider this may be a statistical fluke given the null results across all other 15 groups. They also note potential parallels with findings from unconditional cash transfer experiments, where the receipt of transfers raised the salience of financial deprivation without addressing its underlying causes. A charity-stigma mechanism (recipients did not request the assistance) is also considered. The authors caution against giving this result undue weight in the overall assessment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper position downstream debt relief relative to upstream interventions, and what does prior evidence suggest about upstream alternatives?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors highlight that their null results do not extend to upstream medical debt relief. Adams et al. (2022), studying a hospital financial assistance program at Kaiser Permanente that bundled debt relief with reductions in cost-sharing close to the time of the medical event, found substantial increases in high-value healthcare utilization. The Oregon Health Insurance Experiment (Baicker et al. 2013) found that Medicaid reduced depression by 9 percentage points among low-income uninsured adults. The authors suggest several reasons why downstream relief may fail: the intervention occurs too late after the precipitating event (approximately 15 months after the medical service in the hospital debt experiment, and about 7 years in the collector debt experiment), patients may have habituated to the stress of debt collections, the relief amount may be too small relative to overall financial distress, and the direct financial benefit is inherently limited by the low market price of collections-stage debt.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How do the authors address concerns about differential survey response and external validity?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Treated persons were a statistically insignificant 1.3 percentage points more likely to respond to the survey (p-value 0.056). The authors address this in two ways. First, they estimate specifications that (i) add rich observable controls and (ii) use speed of survey response as a proxy for unobserved response propensity; neither exercise changes the estimates meaningfully. Second, to probe external validity, they test for heterogeneous effects by predicted response propensity (from a logistic regression of a response indicator on baseline characteristics) and by speed of response; neither yields evidence of differential effects for non-respondents. They also compare credit bureau treatment effects for the full hospital debt sample, the survey outreach sample, and the survey respondent sample and find similar estimates across all three groups.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Downstream medical debt&lt;/strong&gt;: Medical bills that have already been sent to third-party debt collectors by the healthcare provider after the initial billing cycle, as distinguished from upstream unpaid bills still held by the hospital at or near the time of the medical event. The paper studies debt at this late stage specifically because it is the target of most large-scale relief programs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Credit reporting sub-experiment&lt;/strong&gt;: An embedded quasi-experiment within the collector debt RCT, exploiting the fact that a subset of accounts (6.8% of wave 1) were still being reported to credit bureaus at the time of intervention while the debt collector had already ceased reporting for the remaining accounts. This allows separate estimation of debt relief effects with and without counterfactual credit bureau reporting, using the period until 2019 Q1 (when the collector stopped reporting entirely) as the &amp;ldquo;reporting&amp;rdquo; window.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Downstream bill repayment effect&lt;/strong&gt;: The paper&amp;rsquo;s finding that debt relief increases the probability of a subsequent unpaid medical bill being sent to collections. The paper attributes this primarily to reduced repayment of existing pre-relief medical bills rather than to increased healthcare utilization, consistent with an expectations, targeting, or confusion mechanism — and inconsistent with an income or flypaper effect that would increase repayment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Targeting a level of indebtedness&lt;/strong&gt;: A behavioral model (drawn from Dobkin et al. [2018]) in which patients implicitly target a certain level of indebtedness. Under this model, relieving some debt creates headroom in the patient&amp;rsquo;s implicit debt budget, leading to reduced repayment of remaining bills to restore the targeted level of total indebtedness.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Expert survey (pre-results)&lt;/strong&gt;: A structured elicitation of predicted treatment effects conducted between April and May 2022 — after the interventions were completed but before results were released — from academics, non-profit practitioners, hospital revenue-cycle managers, and policymakers. Used as a benchmark to quantify how far the causal estimates fall below prevailing beliefs, and to document that the null results were ex ante surprising to informed observers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;PHQ-8 (Patient Health Questionnaire-8)&lt;/strong&gt;: An eight-item validated clinical screen for depression, used as the paper&amp;rsquo;s primary preregistered survey outcome. An indicator for &amp;ldquo;at least moderate depression&amp;rdquo; on the PHQ-8 is the main mental health measure against which the debt relief treatment effect is estimated.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multimodal survey&lt;/strong&gt;: A survey protocol combining five postal mailings, twice-weekly email reminders, certified mail delivery of a paper survey instrument, and US-based call center telephone interviews, designed to maximize response rates in a hard-to-reach low-income population with medical debt in collections.&lt;/p&gt;</description></item><item><title>The Environmental Bias of Corporate Income Taxation</title><link>https://macropaperwarehouse.com/papers/the-environmental-bias-of-corporate-income-taxation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-environmental-bias-of-corporate-income-taxation/</guid><description>&lt;p&gt;This paper documents and quantifies an &amp;ldquo;environmental bias&amp;rdquo; embedded in the U.S. corporate income tax code: CO2-intensive (&amp;ldquo;dirty&amp;rdquo;) firms systematically face lower effective tax rates than clean firms, constituting an implicit subsidy on pollution. The authors — Iovino, Martin, and Sauvagnat — establish this cross-sectional fact, trace it to a specific mechanism, provide causal evidence using the 2017 Tax Cuts and Jobs Act (TCJA), and quantify aggregate emissions implications using a calibrated multi-sector general-equilibrium model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and sample.&lt;/strong&gt; The empirical analysis combines firm-level CO2 emissions from Trucost (scope 1 greenhouse gases) with financial data from Compustat North America for U.S. publicly listed firms, 2003–2021, yielding 11,223 firm-year observations with positive pretax and gross capital income. Effective tax rates are measured as income taxes paid divided by gross capital income (sales minus COGS minus SGA expenses, adding back R&amp;amp;D).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cross-sectional finding.&lt;/strong&gt; A one-standard-deviation increase in CO2 intensity is associated with a decrease in the effective tax rate equal to approximately 9% of its standard deviation (coefficient −0.021 to −0.022, significant at 1%). The negative relationship is entirely explained by the lower taxable fraction of gross capital income for dirty firms — that is, by larger interest expense deductions — rather than by differences in the statutory tax rate applied to pretax income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; The chain of causation runs: CO2-intensive production requires tangible capital (primarily machinery and equipment) → tangible capital serves as collateral → higher collateral supports higher debt → higher debt generates larger interest deductions (the &amp;ldquo;tax shield of debt&amp;rdquo;) → lower effective tax rates. Once PPE-to-capital-income is controlled for, the coefficient on CO2 intensity in leverage, pretax income, and tax regressions becomes small and statistically insignificant. The relationship holds both across and within industries, including within the energy sector, though the dominant variation is cross-industry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Causal evidence: TCJA 2017.&lt;/strong&gt; The paper exploits the federal corporate tax rate cut from 35% to 21% (effective January 2018) in a difference-in-differences design, comparing firms in the top quartile of 2017 CO2 intensity (&amp;ldquo;dirty&amp;rdquo;) to cleaner firms. Dirty firms experienced a relative increase in their federal effective tax rate of 2.4 percentage points post-reform. Correspondingly, dirty firms&amp;rsquo; total assets grew approximately 11% less than clean firms post-reform. This translates to a semi-elasticity of firm total assets to a one-percentage-point increase in the effective tax rate of approximately −4.8. Parallel pre-trends are confirmed visually and via Rambachan-Roth (2023) sensitivity analysis; a placebo using non-federal taxes shows no differential effect. Results survive controls for other TCJA provisions (interest deductibility limits, international tax changes, net operating loss restrictions), exposure to import tariffs and carbon taxes, leave-one-industry-out specifications, and a triple-difference using foreign firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;General-equilibrium model and counterfactuals.&lt;/strong&gt; A 375-sector model with input-output networks (both intermediate and investment networks), financial frictions linking equipment to debt capacity, and endogenous CO2 emissions through fossil fuel usage is calibrated to 2017 BEA and Compustat data. In the Cobb-Douglas benchmark, the 2017 tax cut raises output by 5.9% and emissions by only 4.5% — a less-than-proportional emissions response because clean sectors expand relatively more. A counterfactual eliminating the tax shield of debt while simultaneously cutting the tax rate from 35% to 30% (to hold GDP constant) reduces aggregate emissions by 1.3% with output declining only 0.1%. When equipment and fuel are treated as complements (elasticity of substitution below 1), the emissions reduction under the same policy rises to over 3.7%, implying an absolute reduction of 80–240 million metric tons of CO2 from 2017&amp;rsquo;s total of 6,457 million metric tons. Monetized at the social cost of carbon, this ranges from USD 8–24 billion (conservative, ~USD 100/ton) to USD 112–336 billion (USD 1,400/ton per Bilal and Kanzig 2024).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the central empirical finding of the paper?&lt;/strong&gt;
A: CO2-intensive firms in the U.S. face systematically lower effective corporate income tax rates than clean firms. A one-standard-deviation increase in CO2 intensity is associated with a roughly 9% of a standard deviation decrease in the ratio of taxes paid to gross capital income. This negative relationship is robust to alternative emissions measures (EPA data, scope 2 and 3 emissions), alternative tax scalings (taxes over sales or assets), log CO2 emissions, and leave-one-industry-out specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the mechanism linking CO2 intensity to lower effective tax rates?&lt;/strong&gt;
A: Dirty firms rely on tangible capital — specifically machinery and equipment — to produce. Tangible capital is pledgeable as collateral, enabling higher debt. Higher debt generates larger interest expense deductions under the tax code (the &amp;ldquo;debt tax shield&amp;rdquo;), which reduces taxable income relative to gross capital income. Once PPE-to-capital-income is included as a control, the coefficient on CO2 intensity in regressions of leverage, pretax income, and taxes paid all become small and statistically insignificant, confirming that PPE fully mediates the relationship.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Which component of tangible capital drives the result?&lt;/strong&gt;
A: Machinery and equipment, not buildings, leases, land, natural resources, or construction in progress, explains virtually the entire positive relationship between PPE and CO2 intensity. This finding is based on the Compustat breakdown of PPE components available for roughly 70% of sample firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does the mechanism operate within industries or only across them?&lt;/strong&gt;
A: Both. Decomposing firm CO2 intensity into an implied industry component (sales-weighted from pure-play firms) and a firm residual, both components are significantly associated with higher tangible capital, leverage, lower taxable fraction of capital income, and lower taxes paid at the 1% level. However, the largest share of the total effect stems from cross-industry variation. Within the energy sector specifically, firms with greater fossil fuel production capacity (from EPA/EIA data) also have more tangible capital, higher debt, and lower effective tax rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the 2017 TCJA cut affect clean versus dirty firms differently?&lt;/strong&gt;
A: Because dirty firms already shield a large fraction of their capital income from taxation via interest deductions, a uniform cut in the statutory rate benefits them less in proportional terms. The difference-in-differences estimates show that dirty firms (top quartile of 2017 CO2 intensity) experienced a relative increase in their federal effective tax rate of 2.4 percentage points post-reform compared to clean firms, and their total assets grew approximately 11% less than clean firms post-reform. The semi-elasticity of firm assets to a one-percentage-point increase in effective tax rate is approximately −4.8.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How is the parallel trends assumption supported?&lt;/strong&gt;
A: Event-study graphs show no pre-2018 divergence in federal effective tax rates or asset growth between dirty and clean firms. A placebo test using non-federal income taxes (which should be unaffected by the federal statutory rate change) shows no differential post-reform effect. The Rambachan-Roth (2023) sensitivity analysis confirms that the null of no differential effect can be rejected at the 1% level allowing for pre-trend deviations up to M = 0.5, and at the 10% level up to M = 1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What robustness checks address other provisions of the TCJA and concurrent shocks?&lt;/strong&gt;
A: The authors exclude or control for firms affected by the TCJA&amp;rsquo;s interest deductibility limitation, multinational firms (more than 20% foreign sales), firms with large loss carryforwards, and manufacturing firms — results are unchanged. They also control for firm-level exposure to import tariff changes and carbon taxes (using the World Carbon Pricing Database), with coefficients of interest remaining virtually unchanged. Leave-one-industry-out specifications and a triple-difference using foreign firms (comparing U.S. dirty vs. clean firms pre/post-2018, against foreign equivalents in countries with stable tax rates) yield a semi-elasticity of −5.8, if anything larger than the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the general-equilibrium model add that the difference-in-differences cannot?&lt;/strong&gt;
A: The DiD design identifies relative effects of the tax cut on dirty versus clean firms but cannot recover the absolute effect on aggregate output and emissions. The GE model, calibrated to 2017 data and validated against the untargeted DiD estimates, quantifies aggregate impacts: the 2017 tax cut raises steady-state output by 5.9% while emissions rise by only 4.5% — a less-than-proportional increase due to compositional reallocation toward clean sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the counterfactual removing the debt tax shield find?&lt;/strong&gt;
A: Eliminating the tax shield of debt while simultaneously lowering the corporate tax rate from 35% to 30% (to keep GDP constant) reduces aggregate emissions by 1.3% (Cobb-Douglas benchmark) while total output falls only 0.1% and GDP remains constant by design. The emissions reduction arises because clean sectors, which rely more on less-pledgeable capital, are made relatively cheaper once the tax advantage of debt is removed, redirecting demand away from CO2-intensive sectors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the complementarity assumption between equipment and fuel affect the results?&lt;/strong&gt;
A: When equipment and fuel are modeled as complements (elasticity of substitution below 1) rather than Cobb-Douglas substitutes, both policy counterfactuals yield larger emissions effects. For the tax shield removal policy, the predicted emissions reduction rises from 1.3% to over 3.7% as complementarity strengthens. This is because policies that raise the cost of equipment also induce firms to cut fuel consumption, amplifying the direct compositional effect.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the quantified absolute emissions impact of removing the tax shield?&lt;/strong&gt;
A: Given 2017 U.S. total emissions of 6,457 million metric tons, the model predicts an absolute reduction of 80–240 million metric tons of CO2, depending on the assumed complementarity between equipment and fuel. Monetized at conservative estimates (~USD 100/ton), the policy saves USD 8–24 billion; at USD 1,400/ton (Bilal and Kanzig 2024), the value rises to USD 112–336 billion. The authors note that the physical quantity measure is more reliable than the monetized figure given uncertainty in the social cost of carbon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does this paper relate to the ECB bond purchasing literature?&lt;/strong&gt;
A: Piazzesi et al. (2022) document that the ECB&amp;rsquo;s market-neutral bond purchases implicitly favor dirty firms because those firms issue more bonds due to higher tangible capital holdings. This paper identifies the same underlying mechanism — tangible capital → debt capacity — but on the tax side, showing that the corporate income tax code independently provides an implicit subsidy to dirty firms through the debt tax shield.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the policy implication for the debt tax shield specifically?&lt;/strong&gt;
A: The debt tax shield — the deductibility of interest payments but not dividends — has no clear economic rationale (both are returns to capital) and, per several policy proposals (CBO 1997, IMF 2016), is a candidate for elimination. This paper adds a new dimension: the tax shield indirectly subsidizes CO2 emissions by differentially benefiting capital-intensive, CO2-intensive sectors. A revenue-neutral reform eliminating the shield can reduce emissions without sacrificing GDP.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective tax rate (paper&amp;rsquo;s definition):&lt;/strong&gt; The ratio of corporate income taxes paid to gross capital income, where gross capital income equals sales minus cost of goods sold minus SGA expenses plus R&amp;amp;D spending. This differs from the tax-to-pretax-income ratio because it captures how much of total capital earnings — before any deductions — is remitted as tax.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Debt tax shield (tax advantage of debt):&lt;/strong&gt; The reduction in corporate tax liability arising from the deductibility of interest payments on corporate debt. Because dividends are not deductible, debt-financed capital faces a lower after-tax cost than equity-financed capital. The shield&amp;rsquo;s value is estimated at approximately 10% of firm value in prior literature.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;CO2 intensity:&lt;/strong&gt; Metric tons of CO2 equivalent per USD 1,000 of output (tCO2/k$). The sample average is 0.1 tCO2/k$, with a heavily right-skewed distribution (median 0.02, 99th percentile 1.5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Environmental bias of corporate taxation:&lt;/strong&gt; The paper&amp;rsquo;s central concept — the systematic difference in effective tax rates between dirty and clean firms that arises not from explicit environmental policy but from the interaction of the debt tax shield with the capital structure of CO2-intensive industries. This constitutes an implicit subsidy on pollution embedded in the corporate income tax.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Asset pledgeability (psi):&lt;/strong&gt; The fraction of a firm&amp;rsquo;s assets recoverable by creditors in the event of default. In the model, equipment has higher pledgeability than other capital (estimated b_psi = 0.23 additional pledgeability for equipment, a_psi = 0.35 base). Higher pledgeability allows firms to sustain more debt and thus benefit more from the tax shield.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;User cost of capital:&lt;/strong&gt; The total cost to a firm of using one unit of capital, combining depreciation, tax allowances from accelerated depreciation, and the financing cost advantage of debt over equity. The model formalizes how both the equity-financed component and the debt advantage component respond to tax rate changes, with the debt advantage term being larger for firms with more pledgeable (tangible) capital.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Investment network:&lt;/strong&gt; An input-output structure capturing which sectors&amp;rsquo; outputs are used to produce each type of capital good. The paper extends vom Lehn and Winberry (2021) by constructing separate equipment and non-equipment investment networks across 375 non-fuel BEA sectors, enabling emissions accounting that includes capital production alongside direct production inputs.&lt;/p&gt;</description></item><item><title>The Future in Mind: Aspirations and Long-Term Outcomes in Rural Ethiopia</title><link>https://macropaperwarehouse.com/papers/the-future-in-mind-aspirations-and-long-term-outcomes-in-rural-ethiopia/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-future-in-mind-aspirations-and-long-term-outcomes-in-rural-ethiopia/</guid><description>&lt;p&gt;This paper tests whether a light-touch behavioral intervention targeting aspirations can produce persistent economic effects on a poor rural population. The research question is whether changing how poor people perceive their future opportunities — by raising aspirations — alters their investment decisions in ways that persist over a multi-year horizon. The authors conduct a randomized controlled trial in Doba, a remote mountainous district in rural Ethiopia roughly 380 kilometers from Addis Ababa, selected partly because its extreme isolation meant residents had almost no exposure to television or media, making even a single video screening a memorable event.&lt;/p&gt;
&lt;p&gt;The sample consists of 1,152 households (2,112 individuals) across 64 villages. Households were randomly assigned to one of three conditions: a treatment group shown four 15-minute documentaries featuring real rural individuals from similar communities who escaped poverty through goal-setting and hard work; a placebo group shown an Ethiopian entertainment comedy with no aspirational content; and a within-village control group who were only surveyed. Both the household head and spouse in treatment and placebo groups were invited to attend. Compliance was very high, with only 2 percent of individuals not complying with their assigned condition. Data were collected at baseline (2010), six months after screening (2011), and five years after baseline (2015–2016). Attrition was notably low: 96 percent of households were re-interviewed at the five-year endline, and 94 percent of individual respondents.&lt;/p&gt;
&lt;p&gt;Five years after the screening, treated households show meaningfully larger investment across three domains relative to the control group, with all headline results significant at 5 percent or less and robust to multiple hypothesis testing. First, on agricultural effort and investment: treated household heads and spouses work approximately one extra hour per day on their own farms (roughly 8.6 percent of the control mean per spouse). Treated households are 10 percentage points more likely to have adopted modern crop inputs (improved seeds, inorganic fertilizer) and 10 percentage points more likely to have invested in modern livestock inputs (feed, veterinary supplies). Holdings of productive tools are 20 percent higher than in the control group. Second, on educational investment: treated households spend approximately 36 percent more on children&amp;rsquo;s schooling than the control group. Among children who were of school-going age at the time of the intervention (aged 11–15 then, 16–20 at endline), the number completing full primary school is nearly double the control rate (0.16 per household versus 0.07 in the control). Third, on living standards: treated households experienced 0.33 to 0.38 fewer months of food insecurity in the previous year. Their holdings of consumer durables (furniture, kitchenware, phones) are 29 percent higher than the control group in value. Estimated house values are 27 percent higher. However, there is no statistically significant effect on measured food or frequent non-food consumption expenditure, a finding the authors interpret as consistent with households continuing to divert resources toward future-oriented investments rather than current consumption.&lt;/p&gt;
&lt;p&gt;The intervention&amp;rsquo;s effects appear to operate primarily through aspirations — defined in this paper as desired goals for the future that motivate investment and effort. Treated households report significantly higher aspirations and expectations for income, assets, and children&amp;rsquo;s education five years later. By contrast, the paper finds no persistent changes in time preferences, risk preferences, grit, or beliefs about returns to technology. Locus of control shifted six months after the intervention but did not persist to the five-year endline, and the authors argue that if locus of control were the operative mechanism, investment effects would also have dissipated. The placebo group shows no significant effects relative to the control, ruling out screening exposure or social attention as mechanisms.&lt;/p&gt;
&lt;p&gt;The paper is explicit about scope conditions. The study area was deliberately chosen for its extreme remoteness and media isolation, and the authors caution that this may have amplified the intervention&amp;rsquo;s salience and persistence relative to less isolated populations. External validity beyond comparable settings is uncertain. A back-of-the-envelope cost-effectiveness calculation finds that increases in durable asset holdings alone outweigh intervention costs by a factor of approximately two at reasonable scale.&lt;/p&gt;
&lt;p&gt;Q: What was the intervention and what made it distinct from other role model studies?
A: Treated households were invited to watch four 15-minute documentary films featuring real rural individuals from similar socioeconomic backgrounds who had escaped poverty through goal-setting, perseverance, and hard work. The films were produced in Oromiffa, the local language, and featured two male and two female role models depicting achievable actions such as installing irrigation or starting a small business. Unlike studies that vary exposure to in-person mentors or peers, participants received no ongoing mentorship, financial resources, or support of any kind beyond the single video screening, isolating the aspirations channel from material or informational transfers.&lt;/p&gt;
&lt;p&gt;Q: How were aspirations measured and validated?
A: Aspirations were measured using locally validated survey instruments (Bernard and Taffesse, 2014) that asked respondents what level of annual income, asset wealth, and oldest child&amp;rsquo;s education they would like to achieve in their lifetime. Test-retest reliability over two weeks produced within-respondent correlations of 0.77 to 0.98 across domains, which the authors benchmark against Angrist and Krueger (1999) standards for reliable income and education measures. The measures correlated in expected directions with wealth: mean income aspirations in the upper wealth tercile were 1.5 times those in the lower tercile, and asset aspirations in the upper tercile were 1.9 times those in the lower tercile.&lt;/p&gt;
&lt;p&gt;Q: What were the five-year effects on agricultural effort and investment?
A: Treated household heads and spouses worked approximately half an hour more per day each on their own farms relative to control, implying roughly one extra hour per day across the typical household&amp;rsquo;s adult members — an 8.6 percent increase over the control mean. Treated households were 10 percentage points more likely to have adopted modern crop inputs and 10 percentage points more likely to have invested in modern livestock inputs. Holdings of productive tools were 20 percent higher in value than in the control group. The overall agricultural investment index increased by 0.21 standard deviations relative to the control and 0.18 standard deviations relative to the placebo.&lt;/p&gt;
&lt;p&gt;Q: What were the five-year effects on children&amp;rsquo;s education?
A: Among children aged 16 to 20 at endline (who were 11 to 15, upper primary school age, at the time of the intervention), the number per household completing full primary school nearly doubled: 0.16 in the treatment group versus 0.07 in the control. These children in treated households also spent on average 33 minutes more per day attending school than the control group. Across all children, schooling expenditures in the treatment group were 36 percent higher than in the control and 30 percent higher than in the placebo. The education index increased by 0.25 standard deviations relative to the placebo and 0.21 standard deviations relative to the control.&lt;/p&gt;
&lt;p&gt;Q: Why did consumption expenditure not increase despite improvements in assets and food security?
A: The authors argue that the consumption result is theoretically ambiguous: if treated households continue to divert resources toward future-oriented investments (savings, productive assets, durable goods, housing), intertemporal substitution effects could offset income effects within the five-year observation window. The measured consumption variables — food and frequent non-food spending — do not capture the service flow value of accumulated durables or housing improvements, both of which increased substantially. The authors interpret this as evidence that households were still in an investment phase rather than having converted accumulated wealth into current consumption by endline.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports aspirations as the operative mechanism rather than alternative channels?
A: The treatment group had significantly higher aspirations and expectations for income, assets, and children&amp;rsquo;s education at the five-year endline, while the placebo group did not. Measured time preferences, risk preferences, grit, and beliefs about returns to technology were all statistically unchanged for treated households. Locus of control shifted six months post-intervention but did not persist to five years, and the authors note that if locus of control were the driver, investment effects would also have dissipated alongside it. The null placebo effect rules out screening exposure, social attention, or information salience from outside facilitators as mechanisms.&lt;/p&gt;
&lt;p&gt;Q: How were locus of control and fatalistic beliefs assessed in this population?
A: The sample scored twice as high as Western samples on the classic Levenson (1981) fatalism scale. On the Feagin (1975) scale of perceived causes of poverty, the sample was more likely to attribute poverty to structural or fatalistic explanations than Western samples, and both measures of fatalistic beliefs were higher among poorer households within the sample. The study region&amp;rsquo;s worldview — rooted in traditional Waaqeffannaa religion, local variants of Orthodox Christianity (Fekade Egziabher), and Islam (Qadar) — emphasizes deference to authority, predestination, and resistance to change, providing qualitative grounding for the aspirations deficit being targeted.&lt;/p&gt;
&lt;p&gt;Q: What were the effects on food insecurity and subjective wellbeing?
A: Treated households reported 0.33 fewer months of food insecurity in the previous year relative to the control group (from a base of 2.71 months in the control), and 0.38 fewer months relative to the placebo. Treated participants scored approximately a quarter of a step higher on the Cantril ladder of self-reported wellbeing than the control group. There was no significant difference on the USDA food insecurity questionnaire, which the authors attribute to that scale&amp;rsquo;s unsuitability for households that consume largely from own production.&lt;/p&gt;
&lt;p&gt;Q: What were the effects on durable goods and housing?
A: Treated households reported 29 percent higher value of consumer durables (furniture, kitchenware, phones) than the control group and 32 percent higher than the placebo. Estimated house replacement values were 27 percent higher than the control and 21 percent higher than the placebo. Enumerators directly observed that treated households were more likely to have their own toilet facility, though this result was not significant relative to the placebo. There were no effects on the probability of having a non-organic roof, which the authors note is an especially expensive upgrade.&lt;/p&gt;
&lt;p&gt;Q: How does the paper rule out spillover effects from treated to control households?
A: The authors collected data on a supplementary sample of non-treated villages to serve as a &amp;ldquo;pure control&amp;rdquo; and used this to run a suggestive test for spillovers from treated households to untreated households within the same village. They found little evidence of large spillover effects, although they acknowledge limitations in the power of these tests. The physical design of the screenings — held in rooms with shuttered windows, requiring tickets for entry, conducted separately from placebo screenings — also minimized contamination during the intervention itself.&lt;/p&gt;
&lt;p&gt;Q: What were the early (six-month) results and what do they suggest about the timing of effects?
A: At six months, the shorter follow-up found increases in savings and investment in education, consistent with behavioral change beginning soon after treatment. Aspirations showed positive but noisier effects at immediate post-screening and six-month follow-ups, which the authors interpret as consistent with aspirations increasing gradually as people experiment with alternative futures (Appadurai, 2004) or as demotivating beliefs shift incrementally (Carvalho et al., 2023), rather than changing abruptly. This gradual pattern is consistent with a learn-by-doing dynamic where small initial investments generate returns that further raise aspirations.&lt;/p&gt;
&lt;p&gt;Q: How does this study&amp;rsquo;s attrition and follow-up compare to the literature?
A: The five-year attrition rate was very low: 96 percent of baseline households were re-interviewed and 94 percent of individual respondents. The authors cite Bouguen et al. (2019) as a benchmark, noting this is a high tracking rate relative to recent long-run RCT follow-ups in low- and middle-income countries. The low attrition strengthens confidence that endline estimates are not contaminated by selective dropout.&lt;/p&gt;
&lt;p&gt;Q: What is the cost-effectiveness of the intervention?
A: A back-of-the-envelope calculation indicates that increases in durable asset holdings alone outweigh the costs of the intervention by a factor of approximately two at reasonable implementation scale. The authors present this as a proof-of-concept estimate, not a full social cost-benefit analysis, and caution that cost-effectiveness may differ in settings with higher baseline media exposure or less extreme isolation.&lt;/p&gt;
&lt;p&gt;Q: What are the key scope conditions limiting external validity?
A: The study district (Doba) was chosen specifically for its extreme remoteness: at baseline, only 11 percent of respondents watched TV at least weekly and no household owned a television. The authors argue this isolation likely made the screening event especially salient and memorable, potentially amplifying effects relative to what would be expected in less isolated contexts. They are explicit that the findings represent a proof of concept for the aspirations mechanism and that effect magnitudes should not be assumed to replicate in settings with higher baseline media exposure or different cultural belief systems.&lt;/p&gt;
&lt;p&gt;Aspirations: Defined in this paper as desired goals for the future that motivate investment and effort in order to attain them (following Bandura, 1977; Locke and Latham, 1990). Measured via validated survey instruments asking respondents the level of income, assets, or children&amp;rsquo;s education they would like to achieve in their lifetime — distinct from expectations (what one expects to achieve) and from the village maximum (what one believes the most successful person in the village could achieve).&lt;/p&gt;
&lt;p&gt;Aspirations gap: The difference between an individual&amp;rsquo;s aspired level of income, assets, or education and their current reported level. Median aspirations gaps in the sample are 55 percent of median wealth aspirations and 58 percent of median income aspirations, indicating that aspirations exceed current levels by meaningful but not unrealistic margins.&lt;/p&gt;
&lt;p&gt;Capacity to aspire: Drawn from Appadurai (2004), defined as a navigational capacity — the ability to read and navigate a map of a journey into the future. In contexts of poverty, this capacity is described as more brittle because poorer individuals have narrower social networks, fewer role models, and less material slack for experimentation with alternative futures.&lt;/p&gt;
&lt;p&gt;Role model: A real individual from a similar socioeconomic background whose documented experience of escaping poverty through goal-setting and effort provides vicarious experience that allows audience members to imagine what is possible for people like them. Role models are most effective when their success appears attainable and when the steps to achieve it are visible.&lt;/p&gt;
&lt;p&gt;Zero-sum beliefs: The belief that gains for one individual come at the expense of others in the community, documented in the study area as part of a broader fatalistic, deterministic belief system. These beliefs can suppress effort and future-oriented investment by making individual advancement appear normatively transgressive or materially impossible.&lt;/p&gt;
&lt;p&gt;Source text origin: A classification in the paper&amp;rsquo;s pipeline framework distinguishing whether a summary is based on a full working paper PDF or HTML text versus abstract-only text. Abstract-only summaries are blocked as they miss scope conditions, quantitative results, and the full argument structure.&lt;/p&gt;
&lt;p&gt;Placebo group: Households randomly invited to watch an Ethiopian comedy entertainment program (with no aspirational content) rather than the role model documentaries. Used to separate the effect of the aspirations content from the effects of the screening event itself, exposure to outside facilitators, or social attention accompanying selection for the intervention.&lt;/p&gt;</description></item><item><title>The Geography of job creation and job destruction</title><link>https://macropaperwarehouse.com/papers/the-geography-of-job-creation-and-job-destruction/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-geography-of-job-creation-and-job-destruction/</guid><description>&lt;p&gt;This paper asks why unemployment rates differ so persistently across local labor markets, and what role job creation and job destruction play in generating those differences. The authors document a comprehensive set of spatial labor market facts using administrative and survey microdata from Germany, the United States, and the United Kingdom, then build and calibrate a quantitative theoretical framework that accounts for all documented regularities.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and scope.&lt;/strong&gt; For Germany, the authors use administrative data from the German employment office (universe of vacancies and unemployed, 1999–2020) and the IAB social security sample (SIAB, 2% of all workers, 2000–2017) aggregated to 194 commuting zones. For the U.S., they use BLS Local Area Unemployment Statistics (2000–2019) at commuting zones, CPS worker flows at metropolitan areas, and JOLTS vacancy data for the 18 largest MSAs (covering roughly 40% of the U.S. labor force). For the UK, they use Nomis data and Jobcentre Plus vacancy records (2004–2006) for 378 Local Authority Districts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical findings.&lt;/strong&gt; Spatial unemployment rate differences are large and highly persistent. In Germany, the correlation of local unemployment rates across commuting zones over a 19-year span is 0.84 (West) and 0.77 (East). In the U.S., the correlation between 2000 and 2019 unemployment rates is 0.81; in the UK it is 0.76. In all three countries, local labor markets with lower unemployment are tighter (more vacancies per unemployed worker) and less productive. Firms in low-unemployment markets fill vacancies more slowly — in Germany, vacancy duration ranges from approximately 35 days in high-unemployment locations to approximately 65 days in low-unemployment locations, roughly an 85% difference.&lt;/p&gt;
&lt;p&gt;A formal steady-state decomposition reveals that across all three countries, differences in job-separation rates account for approximately two-thirds of the cross-sectional variation in unemployment rates, while differences in job-finding rates account for roughly one-third. Specifically: Germany 62.4% separations / 33.2% job-finding; U.S. 72.0% / 32.8%; UK 64.3% / 35.8%. This primacy of separation rates in the cross-section stands in stark contrast to business-cycle dynamics, where job-finding rates account for 50–60% of unemployment fluctuations (Fujita and Ramey, 2009).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theory.&lt;/strong&gt; The authors embed a Diamond-Mortensen-Pissarides (DMP) model with endogenous separations — following Den Haan, Ramey, and Watson (2000) — into a Rosen-Roback spatial equilibrium framework. Locations differ in exogenous productivity; workers and firms are freely mobile; cost-of-living differences sustain the spatial equilibrium. The model is calibrated to the U.S. median-unemployment labor market (separation rate 0.0128, job-finding rate 0.2368, vacancy-filling rate 0.7365) plus the productivity differential between the 5th and 95th percentile unemployment locations (4.8% higher and 3.0% lower productivity than median, respectively). The baseline model, imposing the Hosios condition, matches the spatial patterns of separation rates, job-finding rates, tightness, vacancy duration, wages, and cost of living without targeting most of these. The decomposition in the calibrated baseline model attributes 33.5% of spatial unemployment variation to job-finding rates, compared to 32.8% in the data.&lt;/p&gt;
&lt;p&gt;The baseline model generates a counterfactual upward-sloping Beveridge curve and cannot explain why job-finding rates dominate business-cycle fluctuations. Introducing on-the-job search (with 12% of employed workers searching each period, calibrated from Faberman et al., 2017) resolves both problems. In the extended model, job-to-job transition rates are virtually constant across local labor markets (matching the data) but strongly procyclical over the business cycle. This asymmetry amplifies the response of vacancies and job-finding rates to aggregate productivity shocks while muting the cyclical variation in separation rates. The extended model&amp;rsquo;s business-cycle decomposition attributes 54.4% of unemployment volatility to job-finding rates, within the empirical 50–60% range.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy implications.&lt;/strong&gt; Under the Hosios condition, the decentralized equilibrium is efficient — large spatial differences in unemployment, tightness, and wages are efficient outcomes, not signs of mismatch. The relevant policy benchmark is not deviation of tightness from the national average but deviation from the model&amp;rsquo;s location-specific prediction conditional on local productivity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the central empirical puzzle the paper addresses?&lt;/strong&gt;
A: Spatial unemployment differences are large and persistent — in Germany, unemployment rates ranged from 1.9% to 11.9% across commuting zones even after 15 years of decline. These differences are not well understood theoretically, and the crucial missing empirical piece was data on job creation and vacancy filling across locations, which this paper provides for three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How large and persistent are cross-sectional unemployment differences in each country?&lt;/strong&gt;
A: In Germany, commuting-zone unemployment ranged from 3.6% to 24.0% in 2000 and persisted with a 19-year correlation of 0.84 (West) and 0.77 (East). In the U.S., the 2000–2019 correlation is 0.81, with unemployment as low as 1.5% and as high as 16.9% in 2000. In the UK, the 2004–2018 correlation is 0.76, with 2004 unemployment ranging from 1.8% to 13.1%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do the data show about the relationship between unemployment and labor market tightness across locations?&lt;/strong&gt;
A: In all three countries, lower-unemployment labor markets are tighter — they have more vacancies per unemployed worker. This is documented for Germany using the universe of registered vacancies, for the U.S. using JOLTS data for 18 large MSAs, and for the UK using Jobcentre Plus administrative data. The relationship holds after controlling for local labor market composition (age, gender, education, occupation, industry shares).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What do vacancy-filling rates look like across locations, and how large are the differences?&lt;/strong&gt;
A: Vacancy-filling rates are lower in low-unemployment (tight) labor markets. In Germany, the monthly probability of filling a vacancy is approximately 50% higher in high-unemployment markets than in low-unemployment markets. Completed vacancy duration ranges from about 35 days in high-unemployment locations to about 65 days in low-unemployment locations — a difference of approximately 85%. The UK data show a strikingly similar elasticity of vacancy-filling rates with respect to unemployment rates to Germany.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the formal decomposition reveal about the sources of spatial unemployment differences?&lt;/strong&gt;
A: In a steady-state two-state decomposition, separation rates account for 62.4% (Germany), 72.0% (U.S.), and 64.3% (UK) of cross-sectional unemployment variation, while job-finding rates account for 33.2%, 32.8%, and 35.8%, respectively, with small residuals. This consistently assigns primary importance to separation rates across all three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why is the primacy of separation rates in the cross section surprising, and what literature does it contrast with?&lt;/strong&gt;
A: The business-cycle literature (Fujita and Ramey, 2009; Shimer, 2012) finds that job-finding rate variation accounts for 50–60% of unemployment fluctuations over the cycle, roughly twice the contribution of separation rates. The spatial pattern is the mirror image: separations dominate. Any credible theory of spatial unemployment must rationalize both patterns simultaneously — a challenge the paper explicitly takes up.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the baseline DMP model with endogenous separations generate the spatial patterns?&lt;/strong&gt;
A: Higher-productivity locations feature higher match surpluses. Higher surplus induces more vacancy creation and tighter markets, raising job-finding rates and lowering vacancy-filling rates. Crucially, a higher surplus means idiosyncratic shocks must be more negative to make the joint surplus negative, so fewer matches dissolve — separation rates are lower. The calibrated model reproduces the 32.8% job-finding / ~67% separation decomposition without targeting it (model yields 33.5% job-finding).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the calibration targets and key parameter values in the baseline model?&lt;/strong&gt;
A: The model is calibrated monthly to the U.S. economy. Median-unemployment-location targets: separation rate 0.0128, job-finding rate 0.2368, vacancy-filling rate 0.7365. Productivity targets: the 5th-percentile-unemployment location is 4.8% more productive than median, and the 95th-percentile-unemployment location is 3.0% less productive. Key calibrated values include matching elasticity alpha = 0.4711 (equal to worker bargaining power under Hosios), matching efficiency m = 0.4371, vacancy posting cost kappa = 0.3070, and flow nonmarket value z = 0.9072.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the two shortcomings of the baseline model, and how does on-the-job search resolve them?&lt;/strong&gt;
A: The baseline model generates a counterfactual upward-sloping Beveridge curve and cannot generate the asymmetry between cross-sectional and business-cycle drivers of unemployment. Adding on-the-job search (fraction phi = 0.12 of employed workers searching, calibrated from Faberman et al., 2017) resolves both. It corrects the Beveridge curve by allowing the model to match the spatial vacancy-unemployment relationship, and it introduces procyclical job-to-job mobility that amplifies the cyclical response of job-finding rates while dampening cyclical separation rate variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How do job-to-job transition rates differ across space versus over the business cycle, and why does this matter?&lt;/strong&gt;
A: Job-to-job rates are virtually constant across the cross-section of local labor markets (the extended model is calibrated to match this). But they are strongly procyclical — high in booms, low in recessions, about as volatile as job-finding rates over the cycle. In a boom, more employed workers search, spurring vacancy creation, which raises both vacancy-filling probability (making vacancies easier to fill) and job-finding probability for the unemployed, amplifying the cyclical job-finding rate response while muting the cyclical separation rate response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the extended model predict for business-cycle dynamics?&lt;/strong&gt;
A: The model with on-the-job search and aggregate productivity shocks (parameterized following Hagedorn and Manovskii, 2008) generates unemployment and vacancy rates that are an order of magnitude more volatile than productivity — matching the data. Labor market tightness is about twice as volatile as unemployment, as in the data. The Fujita-Ramey decomposition in the model attributes 54.4% of unemployment volatility to job-finding rates, which falls within the empirical range of 50–60%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the paper&amp;rsquo;s efficiency result and its policy implication?&lt;/strong&gt;
A: Under the Hosios condition (imposed in calibration), the decentralized equilibrium is efficient: job creation and destruction are privately efficient in each market, and free mobility of workers and firms ensures efficient spatial allocation. Therefore, large observed differences in unemployment, tightness, and wages across locations are not evidence of inefficiency. The relevant signal for policy is not deviation from the national average but deviation from the model&amp;rsquo;s location-specific prediction conditional on productivity. Locations where data deviate from model predictions are candidates for policy intervention.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Do the spatial patterns survive controls for worker and firm composition?&lt;/strong&gt;
A: Yes. The authors regress labor market tightness and vacancy-filling rates on local unemployment rates and a full set of composition controls (age, gender, education, occupation, and industry shares) derived from the IAB microdata for Germany, along with year fixed effects. The relationship between local unemployment and both tightness and job-filling rates remains highly statistically and economically significant after these controls, for both Germany and the U.S.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the model handle wages and cost of living, and does it match the data?&lt;/strong&gt;
A: Wages are determined by state-contingent generalized Nash bargaining with worker bargaining power eta. Cost-of-living differences are backed out as the values needed to sustain the spatial equilibrium (Rosen-Roback). Neither wages nor costs of living are calibration targets in the cross section, yet the model closely matches the empirically observed wage gradient across local labor markets and the negative correlation between cost of living and local unemployment (using Economic Policy Institute Family Budget Calculator data).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Labor market tightness:&lt;/strong&gt; The ratio of vacancies posted in a local labor market to the number of unemployed workers in that market; the paper documents that tightness is systematically higher (more vacancies per unemployed worker) in lower-unemployment locations across all three countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job-separation rate (EU rate):&lt;/strong&gt; The share of employed workers who transition from employment to unemployment in a period; in the paper&amp;rsquo;s framework, this is endogenously determined by the idiosyncratic match productivity threshold below which the joint match surplus turns negative, and it is the primary driver of spatial unemployment differences (accounting for roughly two-thirds of cross-sectional variation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Job-finding rate (UE rate):&lt;/strong&gt; The share of unemployed workers who transition from unemployment to employment in a period; in the paper&amp;rsquo;s framework, this is higher in tighter (lower-unemployment) markets, but accounts for only roughly one-third of spatial unemployment variation — the opposite of its dominant role in business-cycle fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spatial Beveridge curve:&lt;/strong&gt; The cross-sectional relationship between vacancy rates and unemployment rates across local labor markets; in the data it is downward sloping (low-unemployment locations have both high vacancies and low unemployment), which the baseline model fails to capture but the extended model with on-the-job search reproduces.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous separation threshold:&lt;/strong&gt; The location-specific minimum idiosyncratic match productivity below which the joint match surplus becomes negative and the worker-firm pair dissolves; this threshold is lower (tolerates a wider range of idiosyncratic shocks) in higher-productivity locations because the average surplus is larger, generating lower separation rates in more productive locations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Spatial equilibrium (Rosen-Roback):&lt;/strong&gt; The equilibrium condition in which differences in local costs of living adjust to make workers and firms indifferent across locations, sustaining persistent productivity-driven differences in wages and unemployment as equilibrium outcomes rather than disequilibrium phenomena.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Procyclical on-the-job search:&lt;/strong&gt; The mechanism by which the fraction of employed workers actively searching — and thus the rate of job-to-job transitions — is approximately constant across the cross-section of local labor markets but strongly procyclical over the business cycle. This asymmetry is the key to reconciling why job-finding rates drive business-cycle unemployment variation while separation rates drive spatial unemployment variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Hosios condition:&lt;/strong&gt; The parametric restriction equating the unemployment elasticity of the matching function (alpha) and the workers&amp;rsquo; Nash bargaining weight (eta); when satisfied, job creation is efficient in every local labor market. The paper imposes this condition deliberately to demonstrate that the decentralized equilibrium is efficient despite large spatial differences in outcomes.&lt;/p&gt;</description></item><item><title>The housing wealth effect: Quasi-experimental evidence</title><link>https://macropaperwarehouse.com/papers/the-housing-wealth-effect-quasi-experimental-evidence/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-housing-wealth-effect-quasi-experimental-evidence/</guid><description>&lt;p&gt;This paper estimates a causal housing wealth effect on consumption using a quasi-natural experiment in Stockholm, Sweden. The identification exploits an unanticipated political decision — announced in September 2007 — to renew the operating contract of Bromma Airport through 2038, reversing a long-standing expectation of closure by 2011. Because the decision resulted from opaque political bargaining and was widely characterized as a political coup by opposition parties, the announcement was genuinely unexpected. The negative externality of continued airport operations (primarily aircraft noise exceeding 70 decibels within a mapped contour) capitalized locally into house prices within one quarter of the announcement. Using difference-in-differences on all single-family house transactions in Stockholm Municipality from 2004 to 2012, the authors estimate a house price decline of 19.4 percent for dwellings within 1,000 meters of the noise contour relative to those farther away (t-statistics above 5; robust to control variables and sample period). Co-op apartment prices show no statistically significant response, consistent with greater structural noise insulation in multi-story concrete buildings.&lt;/p&gt;
&lt;p&gt;The consumption outcome is new car purchases, observed at quarterly frequency in a registry-based household panel covering all Stockholm residents, with balance sheet information (loan-to-value ratios, bank deposits, mortgage types) and GIS-located residences. The paper focuses on the intensive margin — the log value of new cars purchased conditional on a purchase — since no effect is found on the extensive margin (probability of buying). A two-sample IV approach yields a short-run elasticity of 0.39: homeowners near the noise contour reduce the value of new cars purchased by 7.7–8.5 log points relative to homeowners farther away. Converting to a marginal propensity for expenditures (MPX): conditional on purchasing a new car, the car MPX is 2.5 cents per dollar of housing wealth lost; scaling by the annual new-car purchase rate of 0.049 per household yields an aggregate new-car MPX of 0.12 cents per dollar per year. Including a symmetry assumption for used cars raises the overall car MPX to 0.38 cents per dollar per year.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis reveals that the collateral channel dominates the pure wealth channel. Homeowners with loan-to-value ratios above 50 percent respond almost twice as strongly as those below (elasticities of 0.526 versus 0.269). Homeowners with below-median bank deposits respond with an elasticity of 0.694, roughly five times larger than those with larger deposits. The financing data show that 47 percent of a new car&amp;rsquo;s value is financed with credit on average, of which 71 percent takes the form of mortgage debt; however, households with high LTV ratios borrow one-third less per dollar of car value, almost entirely through reduced mortgage use.&lt;/p&gt;
&lt;p&gt;A calibrated life-cycle model (quarterly, ages 30–85, Cobb-Douglas preferences over non-durables and cars, long-term fixed-rate mortgage, adjustment costs for cars and mortgages, information friction) replicates the empirical findings. In simulation, a 19.4 percent permanent house-price shock reduces new car values purchased by 6.1 log points on average over the first four quarters, implying an elasticity of 0.31 and a new-car MPX of 0.20 cents per dollar — close to the empirical 0.12 cents and within the 95 percent confidence interval. The model decomposes the response: the collateral effect accounts for 93 percent of the car MPX and 83 percent of the total MPX in the first four quarters; the pure wealth effect accounts for the remainder. The model further shows that full information awareness would roughly double the one-year response, and that smaller shock magnitudes, shorter measurement windows, and crisis-era credit conditions (where more households are already at borrowing limits) each amplify estimated MPXs — helping account for the wide range of estimates (0.12 to 2.3 cents per dollar) in prior literature.&lt;/p&gt;
&lt;p&gt;The identification is validated by dose-response monotonicity with distance to the noise contour, placebo tests showing no response for apartment owners or renters, and absence of income effects or differential moving behavior in the treatment group.&lt;/p&gt;
&lt;p&gt;Q: What is the quasi-experiment and why is it well-suited for identifying housing wealth effects?
A: The Stockholm municipality unexpectedly renewed Bromma Airport&amp;rsquo;s operating contract through 2038 in September 2007, reversing a broadly held expectation that the airport would close by 2011. The decision emerged from closed-door political negotiations and was denounced as a political coup by opposition parties, making it genuinely unanticipated. Because the shock is geographically contained within the airport&amp;rsquo;s noise contour, it is unrelated to macroeconomic conditions and unlikely to generate general equilibrium feedback. The authors also verify that no differential income effects, tax changes, or other policies affected the treatment versus control groups over the study window.&lt;/p&gt;
&lt;p&gt;Q: How large is the estimated house price effect, and how precisely is it measured?
A: Dwellings within 1,000 meters of the noise contour experienced a price decline of 19.4 percent relative to dwellings farther away (baseline estimate, longer sample period). The estimate is highly significant with t-statistics above 5 in all specifications and is robust to the inclusion of rich property-level controls; adding controls changes the pre-crisis estimate only trivially (from -21.4 to -21.3 percent). Co-op apartment prices show no statistically significant response across all specifications, consistent with better structural insulation of multi-story concrete buildings.&lt;/p&gt;
&lt;p&gt;Q: What is the main consumption response finding?
A: Homeowners near the noise contour reduce the log value of new cars purchased by 7.7–8.5 log points relative to homeowners farther away (reduced form, intensive margin). There is no detectable effect on the extensive margin — the probability of purchasing a new car changes by only 0.029 percentage points per quarter against a baseline of approximately 1.2 percent per quarter. Two-sample IV yields an elasticity of 0.39 (statistically significant at 1 percent), meaning a 1 percent decline in house prices leads to a 0.39 percent reduction in new car values among purchasers.&lt;/p&gt;
&lt;p&gt;Q: What does the elasticity of 0.39 imply for the marginal propensity to spend on cars?
A: Conditional on purchasing a new car, the car MPX is 2.5 cents per dollar of housing wealth lost (calculated as 0.393 × 19.4% × SEK 250,000 average car value, divided by SEK 774,060 housing wealth loss). Scaling by the annual new-car purchase frequency of 0.049 per household yields an aggregate new-car MPX of 0.12 cents per dollar per year. Assuming an equal response for used cars, the overall car MPX is 0.38 cents per dollar per year. These estimates are substantially smaller than Mian et al. (2013)&amp;rsquo;s 1.8–2.3 cents per dollar, a discrepancy the model helps explain.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the loan-to-value ratio in shaping the consumption response?
A: Homeowners with LTV ratios above 50 percent respond almost twice as strongly (elasticity 0.526) as those with LTV below 50 percent (elasticity 0.269). The financing data confirm the mechanism: on average 71 percent of car-purchase borrowing takes the form of mortgage debt, but households with high LTV ratios borrow one-third less per dollar of car value, with the difference almost entirely attributable to reduced mortgage use. This pattern is consistent with binding borrowing constraints preventing high-LTV households from extracting home equity for collateral.&lt;/p&gt;
&lt;p&gt;Q: What is the role of liquid savings (bank deposits) in the response?
A: Homeowners with bank deposits below the median respond with an elasticity of 0.694, roughly five times larger than homeowners with larger deposits (elasticity approximately 0.139). This heterogeneity is consistent with deposits serving as a buffer stock that allows wealthier households to smooth consumption without altering borrowing behavior after a wealth shock.&lt;/p&gt;
&lt;p&gt;Q: What does the quantitative model find about the relative importance of the collateral channel versus the pure wealth effect?
A: In the first four quarters following the shock, the collateral effect accounts for 93 percent of the car MPX response and 83 percent of the total expenditure MPX; the pure wealth effect accounts for only 7.5 percent of car MPX and 19 percent of total MPX over the same horizon. Over a longer horizon of 20 quarters, the collateral channel remains dominant at 69 percent of the car baseline, while the wealth effect rises to 32 percent. For non-durable consumption, the short-run collateral effect is 81 percent and the wealth effect is 19 percent.&lt;/p&gt;
&lt;p&gt;Q: How does the model match the empirical estimates?
A: Simulating a permanent 19.4 percent house-price shock for 200,000 household pairs, the model produces a 6.1 log point average reduction in new car values over the first four quarters, corresponding to an elasticity of 0.31 and a new-car MPX of 0.20 cents per dollar. The empirical estimate is 0.12 cents, and the model value falls within the empirical 95 percent confidence interval. The model also replicates the pattern of no extensive-margin response in the short run and a gradual build-up in the non-durable consumption response (maximum elasticity of 0.079 reached only after ten quarters).&lt;/p&gt;
&lt;p&gt;Q: Why is the short-run response concentrated in cars rather than non-durables?
A: The paper establishes an intertemporal smoothing mechanism for durables analogous to McKay and Wieland (2021): households delay or bring forward lumpy durable purchases in response to shocks to borrowing capacity. Although cars represent only 5.5 percent of total consumption in the model (Cobb-Douglas expenditure share), they account for 45–72 percent of the total expenditure response in the first four quarters after the house-price shock. The non-durable consumption response builds slowly and reaches its maximum after about ten quarters.&lt;/p&gt;
&lt;p&gt;Q: What factors does the model identify as explanations for the wide range of MPX estimates across studies?
A: Three factors are identified. First, shock magnitude: larger shocks produce smaller partial-equilibrium MPXs because more households hit borrowing constraints; across shock sizes from -30 to +20 percent, car and total MPXs can range from 1 to 2 cents per dollar. Second, measurement period: short-run (1-year) MPXs exceed long-run (3-year) MPXs, especially for durable goods. Third, the state of the economy: in a crisis-era bust following credit-fueled boom, many more households are constrained when prices fall, amplifying MPXs; Guerrieri and Iacoviello (2017) report car elasticities of 0.24 in the boom phase and 0.49 in the bust phase of the US financial crisis.&lt;/p&gt;
&lt;p&gt;Q: What is the role of the information friction in the model?
A: Because the quasi-experiment occurred in &amp;ldquo;normal times&amp;rdquo; just before the global financial crisis became acute, the authors argue that households were not immediately aware of the house-price shock; they only update their perceived housing wealth when they attempt to adjust their mortgage, trade cars, or receive a random information update. Under full information awareness, the one-year MPX would be approximately twice as large, and the one-year total MPX could be as much as three times as large (with a car MPX of 3 cents per dollar and total MPX well above 6 cents per dollar under full information with small positive shocks). The information friction thus attenuates the estimated MPX relative to a world of full information.&lt;/p&gt;
&lt;p&gt;Q: What placebo and robustness tests support the identification?
A: Co-op apartment owners show no statistically significant price or consumption response, consistent with their structural insulation from aircraft noise. Renters also show no consumption response. The dose-response test confirms a monotone relationship between distance to the noise contour and both house price and car expenditure effects. Income effects are absent (Figure B.2), and there is no differential probability of moving in either the short or long run. Tax reforms benefited both groups equally and had already been announced before the quasi-experiment.&lt;/p&gt;
&lt;p&gt;Q: How does this study&amp;rsquo;s identification strategy compare to instrumental variable approaches using housing supply elasticity?
A: Supply elasticity IV approaches (Mian et al. 2013; Aladangady 2017; Kaplan et al. 2020) rely on regional variation in construction constraints and must assume that consumption demand factors are either observed or uncorrelated with supply elasticity — an assumption critiqued by Davidoff (2016). This paper&amp;rsquo;s identification exploits an exogenous change in a local negative externality, yielding a geographically granular shock unrelated to macroeconomic conditions and free from general equilibrium feedback. The result is interpretable as a partial equilibrium housing wealth effect in the sense of Berger et al. (2018) and Guren et al. (2020).&lt;/p&gt;
&lt;p&gt;Housing wealth effect: The causal effect of a change in housing wealth on household consumption expenditure, decomposed in this paper into a pure wealth channel (change in lifetime resources) and a collateral channel (change in borrowing capacity via home equity).&lt;/p&gt;
&lt;p&gt;Marginal propensity for expenditures (MPX): The change in spending per dollar change in housing wealth; distinct from the marginal propensity to consume (MPC) because spending on durables may be lumpy and differ from the flow of consumption services. The paper distinguishes the car MPX conditional on purchase (2.5 cents per dollar), the aggregate new-car MPX (0.12 cents per dollar per year), and the total expenditure MPX.&lt;/p&gt;
&lt;p&gt;Collateral channel: The mechanism by which a decline in house prices reduces homeowners&amp;rsquo; borrowing capacity — because the house serves as collateral for mortgage debt — thereby tightening credit constraints and reducing spending, independent of any change in permanent income. The model assigns 93 percent of the short-run car MPX to this channel.&lt;/p&gt;
&lt;p&gt;Two-sample instrumental variable (TSIV): The empirical strategy of Angrist and Krueger (1992) used here to estimate the consumption elasticity: the house-price first stage is estimated in one sample (transaction data), and the reduced-form consumption effect is estimated in a second sample (household panel), with the IV elasticity computed as the ratio.&lt;/p&gt;
&lt;p&gt;Information friction: The assumption in the model that households do not immediately observe the spatial divergence in house prices; they update their perceived housing wealth only when they attempt to adjust their mortgage, trade a durable good, or receive a random information shock. This friction attenuates the short-run consumption response and is calibrated to &amp;ldquo;normal times&amp;rdquo; conditions.&lt;/p&gt;
&lt;p&gt;Noise contour: The geographic boundary around Bromma Airport within which properties are regularly exposed to noise levels of at least 70 decibels, as adjudicated by the Swedish Land and Environment Court. Properties within 1,000 meters of this contour define the treatment group.&lt;/p&gt;
&lt;p&gt;Intertemporal smoothing of durables: The pattern, documented in the model and complementary to McKay and Wieland (2021), whereby households adjust lumpy durable purchases (cars) rapidly in response to changes in borrowing capacity, so that durables account for a disproportionately large share of the total expenditure response in the short run (45–72 percent in the first four quarters despite a 5.5 percent Cobb-Douglas expenditure share).&lt;/p&gt;</description></item><item><title>The Illiquidity of Water Markets</title><link>https://macropaperwarehouse.com/papers/the-illiquidity-of-water-markets/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-illiquidity-of-water-markets/</guid><description>&lt;p&gt;Donna and Espín-Sánchez investigate whether a market (sequential English auction) or a non-market institution (fixed quota) more efficiently allocates an intermediate good — irrigation water — when some buyers are liquidity constrained. The setting is Mula, a city in southeastern Spain, where farmers used an unregulated water auction continuously from 1244 until August 1, 1966, when the institution was replaced by a fixed quota system. This 700-year natural experiment, combined with the fact that water demand for a given crop is pinned down by the crop&amp;rsquo;s production function rather than by farmer wealth, allows the authors to separately identify liquidity constraints from unobserved heterogeneity in productivity.&lt;/p&gt;
&lt;p&gt;The empirical context has four features the authors exploit. First, the pre-1966 auction was entirely unregulated, so price differences directly reflect valuations without the confounds of regulatory changes. Second, water is an intermediate good for apricot production; conditional on plot area, tree count, and crop type, demand is determined by the apricot tree&amp;rsquo;s biological water requirements — not by the farmer&amp;rsquo;s wealth — so wealthy and poor farmers growing the same bulida apricot variety share the same underlying demand up to an idiosyncratic productivity shock. Third, farmers are classified as wealthy if they held positive urban real estate (non-agricultural wealth) in 1955 tax records; wealthy farmers&amp;rsquo; average annual urban rental income (5,702 pesetas) far exceeded their average annual water expenditure (500 pesetas, rising to 1,619 in the highest-expenditure year, 1963), supporting the assumption that wealthy farmers were never liquidity constrained. Fourth, the 1966 institutional shift to quotas — under which each farmer received a fixed water allotment (tanda) every three weeks proportional to plot size, paying only a small annual maintenance fee after the critical season — provides the counterfactual.&lt;/p&gt;
&lt;p&gt;The authors build a structural dynamic demand model with three key features: storability (irrigation raises soil moisture, creating intertemporal substitution between periods because water evaporates partially), liquidity constraints (poor farmers cannot always afford water during the critical season when prices peak), and weather seasonality (the critical season, corresponding to apricot fruit growth stages II–III and the Early Post-Harvest period, spans roughly weeks 18–32 and is when trees most need water). Farmers are forward-looking and form expectations about future prices and rainfall. The model&amp;rsquo;s production function, drawn from the agricultural engineering literature (Torrecillas et al., 2000; Allen et al., 2006), transforms soil moisture into apricot output via a transformation rate parameter gamma, a hydric stress coefficient, and a seasonal dummy.&lt;/p&gt;
&lt;p&gt;Demand parameters are estimated using a two-step conditional choice probability (CCP) estimator (Hotz et al., 1994) on wealthy farmers only, then projected onto poor farmers&amp;rsquo; welfare calculations. The sample consists of 24 single-crop apricot farmers observed in weekly auction records from January 1955 to July 1966, embedded in a market with over 500 total participants.&lt;/p&gt;
&lt;p&gt;The main finding is that the institutional change from auction to quota increased total efficiency. Welfare increased by 23.4 real pesetas per farmer per tree, a 6 percent increase in total apricot production relative to the market. This gain arises because: (1) farmers were relatively homogeneous in productivity (small idiosyncratic shocks), so the primary source of misallocation was not productivity heterogeneity but wealth heterogeneity; (2) liquidity constraints prevented poor farmers from purchasing water during the critical season when their valuation was high, causing them instead to buy earlier (at lower prices but with partial evaporation loss) or later (when their trees had already experienced hydric stress); and (3) the apricot production function is concave in water, so uniform quota allocation is more efficient than market allocation when farmers are approximately homogeneous. The paper provides the first empirical demonstration that liquidity constraints can reverse the standard efficiency ranking of markets over quotas.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question?
A: The paper asks whether a free market (water auction) or a non-market institution (fixed quota) more efficiently allocates an intermediate good when some buyers are liquidity constrained. The theoretical ranking is ambiguous when agents are heterogeneous in both productivity and wealth, making this an empirical question. The authors find that quotas dominated the auction in the specific Mula setting.&lt;/p&gt;
&lt;p&gt;Q: What was the historical water market in Mula and when did it end?
A: From 1244 to 1966 — over 700 years — Mula farmers used a sequential ascending-price (English) auction to allocate river water. The auctioneer sold water in discrete units called cuartas (each representing 3 hours of canal flow, or approximately 432,000 liters), holding 40 units per weekly Friday session. Farmers paid in cash on auction day. On August 1, 1966, the farmers&amp;rsquo; union (Sindicato de Regantes) replaced the auction with a fixed quota system, having secured a credit line to purchase water property rights share by share.&lt;/p&gt;
&lt;p&gt;Q: How did the quota system work, and how did it eliminate liquidity constraints?
A: Under the quota, each plot of land received a fixed water allotment (tanda) every three weeks, proportional to plot size. Farmers paid only a small annual maintenance fee to the Sindicato at year-end, after the critical season harvest. Because payment occurred after farmers collected harvest revenue, no farmer was liquidity constrained under the quota. The fee was substantially lower than the per-unit average price under the market.&lt;/p&gt;
&lt;p&gt;Q: How do the authors identify liquidity constraints separately from unobserved heterogeneity in productivity?
A: The key insight is that water is an intermediate good whose demand is determined by the apricot tree&amp;rsquo;s biological production function, not by farmer wealth. Two farmers growing the same bulida apricot variety with the same number of trees should have the same water demand up to an idiosyncratic shock. The authors use wealthy farmers (those with positive urban real estate in 1955 tax records) to estimate preferences, under the assumption that wealthy farmers are never liquidity constrained. They then verify that outside the critical season, wealthy and poor farmers purchase similar amounts of water; the purchasing divergence appears only during the high-price critical season, consistent with a cash constraint rather than a preference difference.&lt;/p&gt;
&lt;p&gt;Q: What empirical evidence shows poor farmers were liquidity constrained rather than simply less interested in water?
A: Poor farmers display a bimodal purchasing pattern inconsistent with the apricot tree&amp;rsquo;s biological water needs: they buy water before the critical season (when prices are low) in anticipation of not being able to afford it during the critical season, and again after the critical season (when prices fall) to prevent their trees from withering from dehydration. Wealthy farmers, by contrast, delay purchases strategically to the critical season when trees most need water (weeks 18–32). Regression analysis confirms that wealthy farmers purchase significantly more water per tree during the critical season than poor farmers growing identical bulida apricots, while the difference outside the critical season is not statistically significant.&lt;/p&gt;
&lt;p&gt;Q: How were wealthy farmers defined and why does their wealth validate the non-constrained assumption?
A: A farmer is defined as wealthy if the value of their urban real estate (from 1955 urban tax records) is positive, and as poor if it is zero. Urban real estate constitutes non-agricultural wealth uncorrelated with the apricot production function. Wealthy farmers&amp;rsquo; average annual urban rental income was 5,702 pesetas, while their average annual water expenditure was only 500 pesetas (rising to 1,619 pesetas in 1963, the highest-expenditure sample year). This large gap supports the assumption that wealthy farmers could always afford water purchases.&lt;/p&gt;
&lt;p&gt;Q: What is the model&amp;rsquo;s treatment of soil moisture dynamics and why does it matter?
A: Soil moisture (M_it) evolves according to an agricultural engineering formula: it increases with rainfall and irrigation purchases (each unit adding 432,000 liters divided by plot area) and decreases via evapotranspiration (ET), subject to a full-capacity ceiling (FC) and a permanent wilting point (PW) lower bound. This storage structure creates intertemporal substitution — water purchased early partially substitutes for future purchases, but at a cost (evaporative loss). The dynamics mean poor farmers who pre-buy water before the critical season lose some of that investment to evaporation, generating a real efficiency loss relative to the quota that delivers water closer to when it is biologically needed.&lt;/p&gt;
&lt;p&gt;Q: What are the two sources of potential inefficiency the authors identify?
A: The first is inefficiency due to heterogeneity: if farmers differ in ex-post productivity (captured by idiosyncratic shocks epsilon_it), allocating water to a less productive farmer at a given moment is wasteful. Markets correct this inefficiency (they direct water to highest-valuation buyers) while quotas do not. The second is inefficiency due to decreasing marginal returns (DMR): because the production function is concave in water, giving water to a farmer with already-high soil moisture is less productive than giving it to a farmer with low moisture. Quotas naturally avoid DMR inefficiency by allocating uniformly; markets with liquidity constraints exacerbate DMR inefficiency by directing scarce critical-season water to wealthy farmers who may have already accumulated moisture from prior purchases.&lt;/p&gt;
&lt;p&gt;Q: What is the main quantitative result of the welfare analysis?
A: Switching from the market auction to the fixed quota system increased welfare by 23.4 real pesetas per farmer per tree, representing a 6 percent increase in total apricot production relative to the market counterfactual. This is computed as the difference in yearly mean welfare per tree per farmer (net of irrigation costs, excluding water expenditures which are transfers) between the quota and market allocations using the estimated structural model.&lt;/p&gt;
&lt;p&gt;Q: Under what conditions is a quota more efficient than a market with liquidity constraints?
A: Quotas dominate markets when three conditions hold simultaneously: (1) farmers are relatively homogeneous in productivity (so the market&amp;rsquo;s advantage of directing water to high-valuation buyers is small), (2) liquidity constraints are significant (so the market misallocates water away from constrained high-valuation farmers), and (3) the production function is concave in water (so uniform allocation is efficient when farmers are homogeneous). The authors find all three conditions hold in Mula. Conversely, markets dominate quotas when heterogeneity in productivity is large relative to heterogeneity in wealth.&lt;/p&gt;
&lt;p&gt;Q: How is the transformation rate parameter gamma estimated and interpreted?
A: The transformation rate gamma measures how soil moisture above the permanent wilting point converts into apricot output (in pesetas) during the critical season, via the production function h() = gamma * (M_it - PW) * KS(M_it) * Z(w_t). It is identified from variation in purchasing patterns across seasons and variation in moisture across farmers within the same season. The preferred specification (column 3 of Table 3) yields gamma_L = 0.05. With average moisture per tree (accounting for the hydric stress coefficient) of 873.93 during the critical season, a farmer earns on average 29.09 pesetas per tree per week during the critical season, or 407.25 pesetas per tree per year.&lt;/p&gt;
&lt;p&gt;Q: How does ignoring liquidity constraints bias demand estimates?
A: If one estimates demand using the full sample (poor and wealthy farmers pooled), a decrease in demand during the critical season when prices rise conflates two effects: (1) the standard price effect (fewer farmers have valuations above the price) and (2) the liquidity constraint effect (some farmers with valuations above the price still cannot buy because they lack cash). Attributing the second effect to price sensitivity overstates the demand elasticity, biasing its absolute value upward.&lt;/p&gt;
&lt;p&gt;Q: What robustness checks do the authors provide against unobserved heterogeneity?
A: The authors provide four pieces of evidence that wealthy and poor farmers do not have systematically different underlying preferences: (1) wealthy and poor farmers are not geographically sorted into different locations (both groups appear in subareas 1, 2, 4, and 7); (2) wealthy and poor farmers grow the same bulida apricot variety; (3) outside the critical season, wealthy and poor farmers purchase statistically similar amounts of water; and (4) the purchasing divergence is significant only during the critical season when prices are high, precisely the pattern predicted by the liquidity constraint mechanism.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications for water allocation in developing countries?
A: The paper implies that before introducing water markets in regions where farmers may be liquidity constrained, policymakers should assess the magnitude of those constraints. If liquidity constraints are significant and farmers are relatively homogeneous in productivity, a quota system or a market supplemented with credit provision may deliver higher efficiency than a pure market. The standard presumption that markets outperform quotas can reverse when poor farmers cannot access credit to purchase water at the times they most need it.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to Che et al. (2013)?
A: Che, Gale, and Kim (2013) assume agents consume at most one unit with linear utility and find that markets always dominate quotas, though some non-market mechanisms with resale outperform markets. Donna and Espín-Sánchez extend this framework by allowing multiple discrete units, a concave utility function, and intertemporal dynamics. Under these extensions, the efficiency ranking between markets and quotas is theoretically indeterminate, and the authors show empirically that quotas can dominate markets. Both papers agree that non-market mechanisms with resale outperform both markets and simple quotas.&lt;/p&gt;
&lt;p&gt;Liquidity constraint (paper&amp;rsquo;s sense): A farmer is liquidity constrained when they lack sufficient cash to purchase water at the prevailing auction price, even if their valuation (marginal productivity of water) exceeds that price. In Mula, poor farmers without urban real estate income faced this constraint during the critical season when prices peaked, because they had already spent their harvest proceeds from the prior year and lacked access to credit markets.&lt;/p&gt;
&lt;p&gt;Soil moisture (M_it): The state variable measuring water accumulated in a farmer&amp;rsquo;s plot, computed using the agricultural engineering evapotranspiration formula. Moisture increases with rainfall and irrigation purchases (each auction unit contributing 432,000 liters divided by plot area) and decreases via evapotranspiration. It is bounded below by the permanent wilting point (PW) — below which trees die — and above by field capacity (FC). Moisture creates intertemporal substitution in demand.&lt;/p&gt;
&lt;p&gt;Critical season: The period corresponding to apricot fruit growth stages II and III and the Early Post-Harvest (EPH) period, spanning approximately weeks 18–32 (early May to early August). This is when the bulida apricot tree transforms water into fruit at the most rapid rate, when water demand peaks biologically, and when auction prices rise to their highest levels. It is the season during which liquidity constraints are binding.&lt;/p&gt;
&lt;p&gt;Transformation rate (gamma): The parameter in the apricot production function that measures the rate at which excess soil moisture (above the permanent wilting point) converts into apricot output (measured in real pesetas) during the critical season. Estimated at gamma_L = 0.05 in the preferred specification (column 3). It is identified from cross-seasonal variation in purchasing patterns and cross-farmer variation in moisture levels.&lt;/p&gt;
&lt;p&gt;Inefficiency due to decreasing marginal returns (DMR): One of two sources of allocation inefficiency identified in the paper. It arises when a farmer with already-high soil moisture receives water, yielding less additional output than if that water had gone to a farmer with lower moisture, given the concavity of the production function. Quotas avoid this inefficiency by allocating uniformly; markets with liquidity constraints exacerbate it by directing critical-season water to wealthy farmers who may have accumulated moisture from earlier purchases.&lt;/p&gt;
&lt;p&gt;Cuarta (quarter): The unit of water sold at Mula auctions, representing the right to use water flowing through the main channel for three hours. At approximately 40 liters per second of flow, each cuarta carried approximately 432,000 liters of water. Water rights and land rights were held independently; farmers who participated in auctions owned only land, while waterlords separately owned canal usage rights.&lt;/p&gt;
&lt;p&gt;Conditional choice probability (CCP) estimator: The two-step estimation procedure used to recover demand parameters from wealthy farmers&amp;rsquo; purchasing choices. In Step 1, transition probability matrices for observable state variables (moisture, week, price, rainfall) are computed and CCP is estimated via multinomial logit. In Step 2, the value function is forward-simulated using these transition matrices and parameters are estimated by GMM, following Hotz et al. (1994).&lt;/p&gt;</description></item><item><title>The Impact of Incarceration on Employment, Earnings, and Tax Filing</title><link>https://macropaperwarehouse.com/papers/the-impact-of-incarceration-on-employment-earnings-and-tax-filing/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-impact-of-incarceration-on-employment-earnings-and-tax-filing/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;This paper estimates the causal effect of incarceration on employment, wage earnings, self-employment, and tax filing behavior using administrative criminal justice data linked to Internal Revenue Service (IRS) records for approximately half a million felony defendants in two U.S. states: North Carolina and Ohio. The study period covers cases filed from the early 2000s through 2014, with outcomes tracked through 2020 using IRS W-2 and 1040 records.&lt;/p&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;The central question is whether incarceration itself — as distinct from arrest, conviction, and other criminal justice interactions that precede or accompany it — causes lasting reductions in defendants&amp;rsquo; labor market outcomes. The paper explicitly holds fixed upstream interactions (conviction, arrest) to isolate the effect of the incarceration sentence.&lt;/p&gt;
&lt;h3 id="data-and-sample"&gt;Data and Sample&lt;/h3&gt;
&lt;p&gt;Criminal justice records from Ohio (Common Pleas courts in Franklin, Cuyahoga, and Hamilton counties, covering Columbus, Cleveland, and Cincinnati) and North Carolina (Administrative Office of the Courts and Department of Public Safety) are linked to de-identified IRS records via name, date of birth, sex, address, and partial Social Security Numbers. Match rates are 92% in Ohio and 95% in North Carolina. The sample is restricted to defendants aged 18–50 at time of offense with cases filed 2002–2014. IRS records include employer-reported W-2 wages (regardless of individual tax filing), self-employment income from Schedule C/SE, non-employee compensation (1099-MISC), and gig-economy earnings from 1099 returns. All dollar figures are adjusted to 2016 dollars using the PCE deflator.&lt;/p&gt;
&lt;h3 id="empirical-strategy"&gt;Empirical Strategy&lt;/h3&gt;
&lt;p&gt;Two independent quasi-experimental research designs are used:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;North Carolina — Sentencing guideline discontinuities&lt;/strong&gt;: North Carolina&amp;rsquo;s structured sentencing guidelines map offense class (E through I, the five least severe felony classes) and prior record points (a numerical criminal history score) into permissible punishment types (incarceration vs. probation) and sentence lengths. Allowable punishment types change discretely at five cell boundaries, generating discontinuities in incarceration sentences for otherwise similar defendants. The paper uses these five boundary discontinuities as excluded instruments in a parameterized regression discontinuity design stacked across offense classes. First-stage F-statistic = 115.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Ohio — Random assignment to judges&lt;/strong&gt;: Cases are randomly assigned by computer to judges at arraignment in the three counties studied. Judge leave-out mean sentence length is used as an instrument for individual sentence length. The design follows Norris et al. (2021) and yields F-statistic = 321. The instrument shifts sentences along both the extensive margin (any vs. no incarceration) and intensive margin (longer vs. shorter sentences).&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Both designs produce complier populations for whom at least 37–45% are shifted on the extensive margin (from no incarceration to some incarceration), based on partial identification bounds using linear programming.&lt;/p&gt;
&lt;h3 id="main-findings"&gt;Main Findings&lt;/h3&gt;
&lt;p&gt;The paper&amp;rsquo;s central finding is that incarceration generates &lt;strong&gt;large short-run reductions&lt;/strong&gt; in labor market activity during the incapacitation period, but &lt;strong&gt;no detectable long-run reductions&lt;/strong&gt; in annual employment or earnings once defendants have been released and the incapacitation effects have dissipated.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;In the first year after case filing, when incarceration rates peak (roughly 75–100 additional days incarcerated for a 12-month sentence), employment falls by approximately &lt;strong&gt;10 percentage points&lt;/strong&gt; and total W-2 earnings contract commensurately.&lt;/li&gt;
&lt;li&gt;Within 3–4 years of filing, employment effects return to near zero and are statistically insignificant in both states.&lt;/li&gt;
&lt;li&gt;Five to nine years after filing, when effects on contemporaneous incarceration have dissipated, the estimated effect of a 12-month sentence on annual earnings is &lt;strong&gt;positive or near zero&lt;/strong&gt; in both states. The combined 95% confidence interval rules out reductions in annual wages greater than &lt;strong&gt;$231&lt;/strong&gt; (approximately 5% of the untreated complier mean) and rules out any adverse employment effects.&lt;/li&gt;
&lt;li&gt;Despite no long-run level effects, losses during incapacitation are never recouped. A one-year sentence reduces &lt;strong&gt;cumulative earnings over five years by approximately $2,914&lt;/strong&gt; — a 13% reduction relative to the complier mean.&lt;/li&gt;
&lt;li&gt;Effects on self-employment, independent contracting, 1040 filing, adjusted gross income, EITC take-up, and interstate migration are similarly null in the long run.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="incapacitation-vs-post-release-scarring"&gt;Incapacitation vs. Post-Release Scarring&lt;/h3&gt;
&lt;p&gt;The paper provides two tests for whether short-run earnings losses reflect incapacitation alone or also post-release scarring (e.g., human capital depreciation, employer discrimination, or discouragement effects):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;A &amp;ldquo;visual IV&amp;rdquo; regression of year-t earnings effects on year-t days-incarcerated effects yields an R² of 0.83–0.85 across states, with the intercept near zero (positive and small), indicating that virtually all dynamic earnings impacts flow through contemporaneous incapacitation and not through a post-release channel.&lt;/li&gt;
&lt;li&gt;Constructed outcomes that impose the null of pure incapacitation (scaling pre-case average earnings or covariate-predicted earnings by the share of the year free from prison) closely track actual earnings effects in both states, further confirming that incapacitation is the dominant mechanism.&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="pre-existing-labor-market-detachment"&gt;Pre-existing Labor Market Detachment&lt;/h3&gt;
&lt;p&gt;A key scope condition is defendants&amp;rsquo; severe labor market disadvantage prior to their case. Fewer than 50–60% of defendants are employed in the year before filing; average pre-case W-2 earnings (including zeros) are below $6,000. Among employed defendants, only 10% earn more than $22,000 per year. Untreated complier means for earnings in the year after case filing are below $4,000, with virtually no earnings or employment growth over the following nine years. The paper concludes that returning to pre-filing earnings levels is sufficient for incarcerated defendants to match their non-incarcerated peers — a low bar that is readily met.&lt;/p&gt;
&lt;h3 id="policy-implications"&gt;Policy Implications&lt;/h3&gt;
&lt;p&gt;Back-of-envelope aggregation implies incapacitation losses of approximately &lt;strong&gt;$6.16 billion per year&lt;/strong&gt; in foregone earnings for the U.S. prison population, concentrated in communities heavily affected by incarceration. However, a marginal reduction in incarceration rates would increase average earnings by only &lt;strong&gt;$51 for white men&lt;/strong&gt; and &lt;strong&gt;$213 for black men&lt;/strong&gt;, suggesting incarceration&amp;rsquo;s direct contribution to labor market inequality is modest relative to the $21,100 black-white earnings gap estimated by Bayer and Charles (2018). The paper concludes that upstream factors — other criminal justice interactions, human capital deficits, and broader socioeconomic disadvantage — are more plausibly responsible for low earnings among the formerly incarcerated.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the exact treatment variable, and what is the counterfactual?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The treatment variable is months of incarceration sentenced in the focal case (a continuous, weakly positive ordered treatment). The counterfactual for non-incarcerated defendants in North Carolina is probation (all defendants are convicted by construction under structured sentencing guidelines). In Ohio, the authors cannot reject that all compliers who do not receive a prison sentence are still convicted, implying the counterfactual is also conviction and probation. All compliers therefore acquire a criminal record regardless of sentence. The treatment effect is thus the effect of incarceration conditional on conviction, holding fixed the criminal record.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How are effects interpreted given multiple instruments and continuous treatment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Under a &amp;ldquo;weakly positive ordered treatment&amp;rdquo; assumption and standard LATE conditions, the 2SLS estimates can be interpreted as Average Causal Responses (ACRs) — weighted averages of the marginal dose effects (12 vs. 11 months, 6 vs. 5 months, 1 vs. 0 months, etc.) for complier subgroups shifted by each instrument. In North Carolina with five parameterized RD instruments, the estimate averages ACRs weighted by first-stage strength. In Ohio with a leave-out mean instrument, the estimate is a convex average of ACRs under the assumption that the linear first-stage model is a good approximation. Dosage weights for both states put mass on a wide range of sentence lengths including both extensive and intensive margins, though Ohio&amp;rsquo;s weights are more skewed toward shorter sentences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How large are the first-stage effects, and how strong is the instrument?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In North Carolina, sentences jump by 50% or more at sentencing guideline cell boundaries where allowable punishment types change to include incarceration. The first-stage F-statistic is 115. In Ohio, defendants assigned to the most severe judge receive incarceration sentences approximately six months longer than those assigned to the least severe judge (roughly 30% of the average non-zero sentence), with a slope of approximately 0.8 in the first-stage regression; F-statistic = 321. At least 37% of compliers in North Carolina and 45% in Ohio are shifted on the extensive margin (from no incarceration to some positive incarceration), with upper bounds as high as 95%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What evidence supports instrument validity (exclusion restriction and independence)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Instrument validity is tested by estimating 2SLS &amp;ldquo;effects&amp;rdquo; on pre-case outcomes measured 2–4 years before the focal case. In both states, the instruments show no relationship with pre-case employment, W-2 wages, total days previously incarcerated, or binary severe prior incarceration. The probability of being matched to IRS records and the quality of the match are also uncorrelated with the instruments. In Ohio, potential exclusion restriction violations from judges affecting conviction (not just sentence) are addressed empirically: nearly 90% of defendants are convicted, the most severe judge is only 0.7 p.p. more likely to convict than the least severe judge (t-stat = 1.53), and the estimated conviction rate among untreated compliers is 0.972 (s.e. 0.018), so one cannot reject that all non-incarcerated compliers are convicted.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the paper test for the incapacitation mechanism against post-release scarring?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Two complementary exercises are conducted. First, a &amp;ldquo;visual IV&amp;rdquo; plot regresses year-t earnings effects on year-t days-incarcerated effects across all post-filing years. If incapacitation is the sole channel, all points should lie on a line through the origin. The R² is 0.83 in North Carolina and 0.85 in Ohio, the estimated intercept is near zero (positive and small) in both states, and the slope (earnings lost per day incarcerated) is approximately $12. This implies cumulative earnings losses of $12 × 268 days = $3,216, very close to the directly estimated $2,914. Second, constructed outcomes that scale pre-case earnings or covariate-predicted earnings by the share of the year not incarcerated closely track actual earnings effects throughout the post-filing period, and both converge to zero as incapacitation effects fade — consistent with pure incapacitation and no net scarring.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the long-run (5–9 years) earnings and employment estimates, and how precisely are null effects ruled out?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Averaged across both states using inverse-variance weights, the estimated effect of a 12-month sentence on annual W-2 earnings five to nine years after filing is positive but statistically indistinguishable from zero. The 95% confidence interval rules out reductions in annual wages greater than $231 (approximately 5% of the untreated complier mean of roughly $4,500–$5,000). The 95% CI also rules out any adverse employment effects. The untreated complier mean for employment 5–9 years post-filing is approximately 40% in North Carolina and slightly above 40% in Ohio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What happens to cumulative earnings over five years despite null long-run level effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Even though long-run annual earnings are unaffected, earnings losses during incapacitation are never made up. A one-year sentence reduces cumulative employment (measured as years with any W-2) and cumulative earnings over five years by approximately $2,914 — a 13% reduction relative to the complier mean. This reflects the mechanical loss of earnings during the period of physical incapacitation, without a subsequent compensating period of higher earnings after release.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Do defendants with stronger pre-case labor market attachment show different long-run patterns?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The sample is split between defendants employed in at least 2 of the 4 years prior to the case (53–57% of the sample across states) and those less attached. Both groups show zero long-run earnings and employment effects. Previously employed defendants experience much larger short-run earnings drops — more than three times larger in the first year post-filing — and their earnings recover more slowly, reaching zero effect approximately six years after filing (vs. three years for the previously unemployed). For a stricter cut (pre-case average earnings above $15,000, representing only 12–15% of the sample), the long-run earnings effect is −$1,426 (8% of the untreated complier mean), significant only at the 10% level, and partly attributable to residual incapacitation (19.6 additional days incarcerated 5–9 years post-filing). For defendants with pre-case earnings below $15,000, incarceration slightly increases long-run employment (2.4 pp, p = 0.01) and earnings ($400, p = 0.03), possibly reflecting rehabilitative benefits (GED or educational programs) for labor-market-detached individuals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Does first-time incarceration (extensive-margin exposure) have larger long-run effects than repeat exposure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper tests this by splitting the sample into defendants with and without prior incarceration history. Among defendants with no prior incarceration, the instruments generate large differences in lifetime exposure: a 12-month sentence increases the probability of ever being incarcerated over the next 5–9 years by 26 p.p. (North Carolina) and 41 p.p. (Ohio). Among those not receiving a sentence, 48% (North Carolina) and 19% (Ohio) are eventually incarcerated anyway, implying treatment causes a 52 and 81 p.p. increase in lifetime incarceration probability for extensive-margin compliers. Despite these large differences in lifetime exposure, long-run earnings and employment effects remain small and statistically insignificant in both subsamples. The difference in long-run effects between previously and never incarcerated defendants is not statistically significant (p = 0.29 for employment, p = 0.82 for earnings).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Are there heterogeneous effects by race, sex, or criminal history?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There is no evidence of long-run scarring for any demographic or criminal history subgroup. Effects for black and non-black defendants are both positive for long-run earnings and employment. Non-black defendants show somewhat larger cumulative losses (consistent with marginally higher counterfactual earnings), but differences are not statistically significant. Estimates for women are imprecise due to small sample size. Among defendants with and without prior felony charges in the four years preceding the case, there are neither economically nor statistically significant long-run earnings or employment effects. Cumulative losses are somewhat larger for defendants without prior felony charges (p = 0.07), reflecting their higher pre-case earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper handle potential migration bias in outcomes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Tax filing and W-2 receipt in the state of sentencing are used to proxy for whether defendants remain in the same state. Among untreated compliers, 88% of those with a tax footprint maintain it in the state of sentencing. No statistically significant effects of incarceration on migration (measured as filing or receiving a W-2 in North Carolina or Ohio) are detected, suggesting prior studies of recidivism measured within-state are unlikely to be severely biased by migration responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What effect does incarceration have on mortality?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Incarceration reduces five-year mortality by approximately 0.8 percentage points (about 20% of the untreated mean). The authors note this is too small to explain the null long-run labor market effects: even if all defendants whose death was averted were employed, removing them from the employment count would reduce the employment effect of a 12-month sentence only to approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How do the paper&amp;rsquo;s findings compare to prior studies, particularly Mueller-Smith (2015)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Mueller-Smith (2015) finds large and persistent negative incarceration effects on labor market outcomes in Texas using a structural decomposition and Lasso-based judge-covariate interactions as instruments. The paper argues methodological differences are the likely explanation: the Lasso-selected interacted instruments can be susceptible to many-weak instruments bias toward OLS. It notes that Mueller-Smith&amp;rsquo;s simpler 2SLS specifications (analogous to those used here) show no statistically significant earnings effects. North Carolina and Ohio are documented to be broadly similar to Texas (and the U.S. average) in rehabilitation program participation, recidivism rates, and incarceration rates, reducing the likelihood that genuine geographic heterogeneity explains the divergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What is the paper&amp;rsquo;s aggregate extrapolation of incapacitation earnings losses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Scaling the estimated $2,914 cumulative loss per 12-month sentence by the ratio of days exposed to total days in a year gives a per-day loss of approximately $12. Applied to the 1,435,500 people incarcerated in U.S. prisons on any given day in 2019 (excluding the more than 700,000 in jail), the implied aggregate yearly earnings loss from incapacitation is approximately $6.16 billion.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Incapacitation effect&lt;/strong&gt;: The mechanical reduction in earnings and employment that occurs while a defendant is physically confined in prison and unable to work, as distinct from any post-release scarring effect. The paper shows this is the dominant — and essentially sole — causal channel through which incarceration affects labor market outcomes in their sample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Post-release scarring&lt;/strong&gt;: Persistent reductions in earnings or employment that persist after a defendant is released from prison, caused by mechanisms such as employer discrimination based on incarceration history, human capital depreciation, loss of job contacts, or psychological discouragement effects. The paper finds no evidence of scarring in either state.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Average Causal Response (ACR)&lt;/strong&gt;: The weighted average of the marginal dose effects of incarceration (e.g., effect of 12 vs. 11 months, 1 vs. 0 months) for groups of defendants whose sentence lengths are shifted by a given instrument. Contrasted with a binary LATE, the ACR averages across the full dosage distribution for compliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Complier&lt;/strong&gt;: An individual whose incarceration sentence is shifted by the instrument — either from zero to some positive sentence (extensive margin) or from a shorter to a longer sentence (intensive margin). Counterfactual outcome means for compliers sentenced to zero months provide the baseline for evaluating effect magnitudes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sentencing guideline discontinuity&lt;/strong&gt;: The discrete jump in permissible punishment types and minimum sentence lengths at specific criminal history score thresholds within North Carolina&amp;rsquo;s structured sentencing grid. Defendants just above a threshold are more likely to be incarcerated than otherwise similar defendants just below, generating quasi-experimental variation exploited as an instrument.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Leave-out mean judge instrument&lt;/strong&gt;: In Ohio, each defendant&amp;rsquo;s assigned judge&amp;rsquo;s average incarceration sentence length computed over all other cases that judge handles (excluding the defendant&amp;rsquo;s own case), residualized on court-by-month fixed effects. Because judges are randomly assigned to cases, this measure is conditionally independent of defendant potential outcomes and serves as an instrument for sentence length.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Control complier mean&lt;/strong&gt;: The estimated mean potential outcome for compliers under the counterfactual of receiving zero months of incarceration. Used as a benchmark to evaluate the magnitude of treatment effects and to characterize how low the earnings baseline is for the population driving the causal estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive vs. intensive margin of incarceration&lt;/strong&gt;: The extensive margin refers to the binary shift from receiving no prison sentence to receiving any prison sentence; the intensive margin refers to increasing sentence length conditional on some incarceration. The paper argues that neither margin appears to produce long-run labor market scarring, and uses linear programming bounds to estimate that at least 37–45% of compliers in each state are shifted on the extensive margin.&lt;/p&gt;</description></item><item><title>The Impact of Unions on Nonunion Wage Setting: Threats and Bargaining</title><link>https://macropaperwarehouse.com/papers/the-impact-of-unions-on-nonunion-wage-setting-threats-and-bargaining/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-impact-of-unions-on-nonunion-wage-setting-threats-and-bargaining/</guid><description>&lt;p&gt;This paper estimates the impact of unions on nonunion wage setting in the United States over the period 1980–2010, distinguishing two channels through which unions affect nonunion wages: (1) a traditional threat channel, in which nonunion firms raise wages to preempt unionization by making workers indifferent between forming a union and remaining nonunion (an &amp;ldquo;emulation wage&amp;rdquo;); and (2) a bargaining channel, in which nonunion workers use the availability of high-paying union jobs as part of their outside option when bargaining individually with their employer, so that a decline in union job prevalence or the union wage premium erodes nonunion bargained wages even at firms that face no direct unionization threat.&lt;/p&gt;
&lt;p&gt;The authors build a search-and-bargaining model grounded in Nash bargaining, with endogenous union formation and, in the most complete version, the possibility of nonunion firm responses to the threat of unionization. Workers in this model can be employed at simple nonunion firms, union firms, or union-emulating firms. The model is embedded in a multi-industry, multi-city framework following Beaudry, Green, and Sand (2012), which formalizes the mechanism by which higher-rent jobs in a city raise outside options and therefore wages for workers in all other jobs throughout that local labor market. This cross-city, within-industry variation is the primary source of identification.&lt;/p&gt;
&lt;p&gt;The empirical implementation uses Current Population Survey Merged Outgoing Rotation Groups (1983–2020) and CPS May extracts (1978–1982), pooling observations around 1980, 1990, 2000, 2010, and 2020 across 43 cities and 51 industries. To address endogeneity of outside option variables — which may be correlated with unobserved local productivity shocks — the authors construct Bartik-style instruments based on start-of-period local industry and union employment composition interacted with national changes in industry growth, industry wage premia, and union job transition probabilities. The threat channel is identified by the interaction of the probability a firm in a given industry-city cell faces a union election (proxied using NLRB data) with the outside option value of union workers. The authors derive a model-based overidentifying restriction, test it, and cannot reject it, providing support for their identification strategy.&lt;/p&gt;
&lt;p&gt;The central quantitative finding is that de-unionization accounts for approximately 38% of the 16% decline in the mean real (composition-constant) wage in a typical US city between 1980 and 2010. One-third of that de-unionization effect arises from a standard shift-share component — workers moving from higher-paying union jobs to lower-paying nonunion jobs — while two-thirds arises from spillover channels affecting nonunion wage setting. The spillover effects are almost entirely attributable to the bargaining channel rather than the traditional threat channel; the threat probability was too low, even in 1980, to generate large emulation effects in the aggregate. The total impact of a one-dollar increase in the outside option value for the mean wage in industry i is estimated at 1.78 dollars once within-industry feedback loops are included.&lt;/p&gt;
&lt;p&gt;The paper finds no evidence of bargaining spillovers in the 1980s specifically, the decade of the sharpest unionization declines. The offsetting forces were declining probabilities of finding union jobs and simultaneously rising union wage premia — with the model explaining the premium increase as a consequence of nonunion firms no longer needing to emulate union wages once the threat of their shop being organized receded substantially. After 1990 the threat stabilized at a low level, the premium declined, and the outside-option effect of declining unionization became the dominant force.&lt;/p&gt;
&lt;p&gt;Heterogeneity results show that spillover effects are larger for women than men, and that de-unionization accounts for 43% of the real wage decline for women versus 27% for men. For workers without post-secondary education, de-unionization accounts for 43% of their real wage decline. The traditional threat effect is statistically insignificant in states with Right-to-Work laws, consistent with the interpretation that identification captures emulation responses to unionization threat.&lt;/p&gt;
&lt;p&gt;Q: What are the two channels through which unions affect nonunion wages in this model?
A: The traditional threat channel operates when nonunion firms raise wages to make workers indifferent between unionizing and remaining nonunion, thereby forestalling a costly union election. The bargaining channel operates because nonunion workers can credibly point to available union jobs when bargaining individually; a decline in union job prevalence or the union wage premium therefore weakens nonunion workers&amp;rsquo; outside options and lowers their bargained wages even at firms that face no direct unionization threat.&lt;/p&gt;
&lt;p&gt;Q: How large is the overall contribution of de-unionization to the US wage decline between 1980 and 2010?
A: The paper estimates that de-unionization accounts for 38% of the approximately 16% decline in the mean composition-constant real wage in a typical US city between 1980 and 2010. One-third of that 38% arises from the direct shift-share effect of workers moving from higher-paying union to lower-paying nonunion employment; the remaining two-thirds arises from spillover effects on nonunion wages.&lt;/p&gt;
&lt;p&gt;Q: Which spillover channel dominates in the decomposition, and why?
A: The bargaining channel dominates almost entirely. The traditional threat channel is statistically significant but quantitatively small because the probability that any given nonunion firm faced a union election was low even in 1980, so the scope for emulation to affect aggregate wages was limited. The bargaining channel, by contrast, operates through the outside options of all nonunion workers searching across many industries and cities, giving it broader aggregate reach.&lt;/p&gt;
&lt;p&gt;Q: Why was there no measurable bargaining spillover in the 1980s despite the decade&amp;rsquo;s large drop in union density?
A: During the 1980s, two forces offset each other: the probability of a nonunion worker finding a union job fell sharply, but the union wage premium rose substantially over the same period, so the expected value of the union outside option changed little. The paper explains the rising premium as a consequence of nonunion firms reducing their emulation wages as the threat of unionization receded, causing nonunion wages to fall faster than union wages and thus mechanically widening the premium. After 1990, when the threat stabilized at a low level, the premium declined and the net outside-option effect of continued de-unionization became the dominant spillover force.&lt;/p&gt;
&lt;p&gt;Q: What is the estimated multiplier effect of an improvement in outside options on nonunion wages?
A: The total impact of a one-dollar increase in the outside option value on the mean wage in a given industry is estimated at 1.78 dollars once within-industry feedback loops — in which an improved outside option raises wages, which in turn improves outside options elsewhere — are accounted for.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address endogeneity of the outside option variables?
A: They construct Bartik-style instruments based on start-of-period local industry and union employment composition interacted with national-level changes in industry growth, industry wage premia, and the probability of transitioning to a union job. This strategy isolates variation in local outside options that is driven by predetermined compositional exposure rather than contemporaneous local shocks. They derive a model-based overidentifying restriction, test it in the data, and cannot reject it, supporting the validity of the instrument.&lt;/p&gt;
&lt;p&gt;Q: How do the authors address selection bias arising from the changing composition of union and nonunion workers as unionization declines?
A: They implement a generalized Heckman two-step approach, including a quartic in the change in the proportion unionized to control for selectivity. After this correction, they cannot reject the null of no selectivity effects, and the main estimated coefficients change very little, indicating that compositional selection is not the primary driver of their results.&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is found across gender groups?
A: Both the bargaining and traditional threat effects are larger for women than for men. Men experienced a decline in mean real wages between 1980 and 2010 more than double that experienced by women, but spillover effects are of identical size, so de-unionization accounts for a larger share of women&amp;rsquo;s wage decline (43%) than men&amp;rsquo;s (27%).&lt;/p&gt;
&lt;p&gt;Q: What heterogeneity is found by education level?
A: For workers with a high school education or less, the traditional threat effect estimate is twice as large as the bargaining effect, while the reverse holds for workers with post-secondary education. Workers without post-secondary education experienced real wage declines nearly triple those of the more educated group, and de-unionization accounts for 43% of the lower-educated group&amp;rsquo;s wage decline.&lt;/p&gt;
&lt;p&gt;Q: How do the authors validate that they are identifying the threat channel rather than some other effect?
A: The traditional threat effect is estimated to be statistically insignificant in states with Right-to-Work (RTW) laws, where the legal environment substantially reduces the ability of workers to organize and therefore reduces the credible threat of unionization that would induce nonunion firms to emulate union wages. This pattern is consistent with the interpretation that the identified effect captures firm emulation responses to a genuine unionization threat.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the distinction between the two channels?
A: The traditional threat effect can only be activated by increasing union power directly, since it depends on a credible risk of a firm&amp;rsquo;s workforce voting to unionize. The bargaining channel, however, is not union-specific: any policy that raises workers&amp;rsquo; outside option values — such as eliminating non-compete agreements or expanding access to higher-paying jobs in a local labor market — can generate similar wage spillovers. Unions are one powerful mechanism for doing this, but not the only one.&lt;/p&gt;
&lt;p&gt;Q: What is the theoretical model structure, and what distinguishes it from Taschereau-Dumouchel (2020)?
A: The model is built on TD&amp;rsquo;s search-and-bargaining framework with endogenous union formation, in which unions can threaten to withdraw the entire workforce from production whereas individual nonunion workers can only threaten to withdraw their own labor. The key modifications are: (1) the hiring-channel mechanism of TD (firms skew toward skilled workers who dislike unions) is replaced with a direct wage-emulation mechanism; (2) the BGS multi-industry, multi-city framework is incorporated to allow outside options to vary with the composition of jobs across industries in a locality; and (3) a single skill level with multiple industries is used, keeping the model tractable for empirical implementation.&lt;/p&gt;
&lt;p&gt;Q: What data sources are used and over what period?
A: The primary dataset is the Current Population Survey Merged Outgoing Rotation Groups for 1983–2020 combined with CPS May extracts for 1978–1982, covering workers aged 25–65 not enrolled in school. The sample is organized into 93 geographic areas (43 cities), 51 industries based on 1980 Census classification, and analyzed at 10-year intervals (1980, 1990, 2000, 2010, 2020) with three-year pooling windows to reduce noise. NLRB case data on union elections proxies for unionization threat probabilities, and County Business Patterns data are used in constructing emulation probabilities.&lt;/p&gt;
&lt;p&gt;Traditional threat effect: The mechanism by which nonunion firms raise wages to an &amp;ldquo;emulation wage&amp;rdquo; — the level that makes workers indifferent between unionizing and remaining nonunion — in order to preempt the costs of a union election, thereby reducing the net benefit of unionization below the threshold required for workers to vote for a union.&lt;/p&gt;
&lt;p&gt;Bargaining channel (bargaining spillover effect): The mechanism by which the availability of union jobs in a local labor market raises the outside option of nonunion workers during individual Nash bargaining, so that declines in union job prevalence or the union wage premium lower nonunion bargained wages even at firms not directly facing a unionization threat.&lt;/p&gt;
&lt;p&gt;Outside option: In the model&amp;rsquo;s Nash bargaining framework, the value a worker (or firm) obtains if negotiations break down — for nonunion workers, this is the expected value of searching across both nonunion and union jobs weighted by transition probabilities and wage rents in each sector.&lt;/p&gt;
&lt;p&gt;Emulation wage: The wage a nonunion firm sets that is just high enough to make workers indifferent between unionizing and remaining nonunion, determined by the firm&amp;rsquo;s calculation of the threshold below which workers would prefer to bear the costs of unionization.&lt;/p&gt;
&lt;p&gt;Union formation (endogenous): In the model, unionization occurs when the surplus workers gain from collective bargaining exceeds the costs of organizing; firms can influence this calculus through wage emulation or direct anti-union actions, making union formation an equilibrium outcome rather than an exogenous event.&lt;/p&gt;
&lt;p&gt;Bartik-style instrument (outside option instrument): An instrument for local outside option values constructed by interacting start-of-period local employment composition across industries with national-level changes in industry growth, industry wage premia, and union job transition probabilities, isolating variation in outside options driven by predetermined exposure to national trends rather than local demand shocks.&lt;/p&gt;
&lt;p&gt;Shift-share (between) component: The portion of the aggregate wage effect of de-unionization attributable to the direct reallocation of workers from higher-paying union jobs to lower-paying nonunion jobs, distinct from spillover effects on nonunion wage setting itself.&lt;/p&gt;</description></item><item><title>The Long-Run Impacts of Public Industrial Investment on Local Development and Economic Mobility: Evidence from World War II</title><link>https://macropaperwarehouse.com/papers/the-long-run-impacts-of-public-industrial-investment-on-local-development-and-economic-mobility-evidence-from-world-war-ii/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-long-run-impacts-of-public-industrial-investment-on-local-development-and-economic-mobility-evidence-from-world-war-ii/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Does government-led construction of large manufacturing plants in previously under-industrialized regions generate long-run improvements in regional economic development and in the lifetime earnings of the incumbent residents who were already living there at the outset? And, if so, through what mechanism — developmental improvements during childhood or expanded adult labor market opportunities?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Identification.&lt;/strong&gt; The paper exploits the United States industrial mobilization for World War II, specifically the construction of 90 large, government-financed, newly-built manufacturing plants (each costing $10 million or more in contemporary dollars, approximately $150 million in 2020 dollars) in dispersed locations outside the major prewar manufacturing hubs. Strategic and security considerations — not economic optimization — drove the military to insist these plants be sited away from congested industrial centers. Because private firms were unwilling to finance construction in isolated locations with uncertain postwar value, the government built them directly as government-owned, contractor-operated (GOCO) facilities through the Defense Plant Corporation. Site selection within the set of sufficiently populated regions was governed by idiosyncratic, short-run factors — the immediate availability of suitable parcels, informal connections to procurement officers, and expedience — rather than systematic economic characteristics of the receiving counties. The paper documents no systematic association between publicly-funded wartime plant construction and prewar county-level economic or demographic characteristics conditional on population size, and finds parallel prewar trends and balanced outcome levels across treatment and comparison counties in all decades leading up to WWII. A placebo test using 1910-to-1940 intergenerational mobility in matched Census records confirms no differential prewar upward mobility in treatment counties.&lt;/p&gt;
&lt;p&gt;The comparison group consists of 1,400 counties outside the 100 largest prewar manufacturing counties that did not receive large public plants. Treatment assignment for individuals is based on birth county, not adult county of residence, enabling the paper to track outcomes regardless of where individuals ultimately live.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data.&lt;/strong&gt; The analysis draws on the 1945 War Production Board data book for plant-level investment; county-level panels from Decennial and Economic Censuses spanning 1900–2000; the SSA NUMIDENT file (birth county and date); IRS Form 1040 individual income tax returns in 1969, 1974, 1979, and 1984 (covering wage earnings and adjusted gross income); the full-count 1940 Census (parent earnings, demographics); the 2000 Census long form (educational attainment); and W-2 earnings histories from the SSA Detailed Earnings Record matched to a CPS-linked subsample, with employer information linked to the Business Register.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Regional Effects.&lt;/strong&gt; By 1970, counties receiving large public wartime plants had approximately 30 percent higher manufacturing employment, 20 percent larger populations, and 7–8 percent higher median family income than comparison counties. Manufacturing employment as a share of total employment rose and remained elevated through the 1970s before converging toward parity with the comparison group by 1990. Treated counties were permanently larger — with population stabilizing at a new, persistently higher equilibrium roughly 20 percent above comparison counties by end of century — even after the manufacturing employment share converged, consistent with path dependence and multiple equilibria. Average production worker pay in manufacturing rose by approximately 10 percent, closely tracking value-added per worker, while average retail wages rose by only one-third as much and were not statistically significant in most years. In the 40 years after the war, treated counties saw median family earnings increase by 5–10 percent, concentrated in higher average wages and employment shares in manufacturing and semi-skilled blue-collar occupations, with limited effects on non-manufacturing, white-collar occupations, or female individual income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Individual Earnings Effects.&lt;/strong&gt; Men born in treatment counties in the 18 years before the war (birth cohorts 1922–1940) earned approximately $1,200–$1,300 more per year (2020 dollars) in average wage earnings reported on 1040 returns in 1969, 1974, 1979, and 1984 — an increase of 2.5–3 percent and roughly a one-percentile rise in the national earnings distribution. Effects were largest for children of parents at the bottom of the 1939 earnings distribution: children of the lowest-income parents saw adult wage earnings rise by approximately $1,800–$2,000 per year (3–4 percent), with effects declining linearly by parent rank and effectively vanishing for children of the highest-earning parents. Black men experienced larger average earnings effects (4–6 percent, or $1,500–$2,500 in 2020 dollars) than White men (2–3 percent, or $1,000–$1,500), with the racial earnings gap estimated to have narrowed by about 2 percent in the treatment group. When examining Form 1040 returns (tax-unit level), effects are comparable for men and women, but W-2 individual earnings data from the SSA-CPS subsample show no positive effect on women&amp;rsquo;s own earnings — the 1040 effects for women are entirely driven by their husbands&amp;rsquo; higher earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mechanism.&lt;/strong&gt; The balance of evidence points to access to higher-wage jobs in adulthood as the primary channel, rather than developmental human capital improvements accumulated during childhood. War plants modestly increased male educational attainment — children from the lowest-earning families completed approximately one-quarter of a year more schooling and were 3 percentage points more likely to graduate high school — but education effects are too small to account for the full earnings increase. Critically, there is no gradient in earnings effects by birth cohort: children who were younger at the start of the war and therefore had longer childhood exposure to improved regions did not benefit more, contradicting a childhood exposure-effect mechanism as in Chetty and Hendren (2018b). Adult earnings effects are entirely accounted for by adult location: conditioning on 1979 county of residence eliminates the treatment effect. Stayers in treatment counties show large earnings differences relative to stayers in comparison counties, while movers show none. Men born in treatment counties are also directly documented to have worked in industries with higher wage premiums as adults, with coarse industry classification alone accounting for approximately one-third of the estimated log wage increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy Scope Conditions.&lt;/strong&gt; The paper argues these effects are specific to the WWII postwar institutional context — high global demand for U.S. manufactured goods, limited international competition, labor-intensive production techniques, and strong union bargaining power — conditions that no longer hold. Reexamination of &amp;ldquo;million-dollar plant&amp;rdquo; openings in the 1980s and 1990s shows manufacturing employment expanded but average manufacturing wages did not increase, suggesting contemporary plant openings do not generate the same high-wage opportunities. The association between manufacturing employment density and upward mobility visible in 1950 has entirely vanished by the end of the twentieth century.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What exactly defines the treatment group, and why were these plants built by the government rather than private firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The treatment group consists of 90 counties outside the 100 largest prewar manufacturing regions that received at least one new, fully publicly-financed manufacturing plant costing $10 million or more (approximately $150 million in 2020 dollars) under the WWII industrial mobilization. Private firms refused to finance construction in dispersed, isolated locations with highly uncertain postwar value; the Air Force historians recorded that &amp;ldquo;industrialists&amp;rsquo; reluctance to invest in dispersed plant facilities was at odds with the government&amp;rsquo;s hope that private capital could finance new inland construction.&amp;rdquo; The government built and owned these facilities as GOCO plants, operated by private firms under contract. The 353 plants meeting the cost threshold (including both large and smaller public plants) account for 70 percent of all spending on new plants during the war.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors establish that plant siting was quasi-random conditional on population size?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Identification rests on three forms of evidence. First, historical documents show procurement decisions were driven by idiosyncratic factors — availability of a suitable parcel, informal connections to procurement officers, short-run expedience — rather than systematic economic characteristics. Members of Congress had little ability to influence siting, and Rhode et al. (2018) find little evidence that federal politics drove the geographic distribution of wartime spending. Second, balance tests (estimating prewar county characteristics as outcomes in Equation 1) show no significant differences between treatment and comparison counties in earnings levels, demographics, manufacturing development, or industrial composition after conditioning on 1940 population, with a joint p-value of 0.30 (0.36 when also conditioning on geography and infrastructure). Third, a placebo test using children in the 1910 Census matched to the 1940 Census finds no differential economic outcomes or upward mobility rates in counties that would eventually receive treatment plants, conditional on basic region size.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the county-level effects on the structure of the labor market in the medium run?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: By the 1960s–1970s, treated counties had higher predicted union coverage rates and a greater share of men in semi-skilled production occupations, driven primarily by movement away from farm work and supplemented by higher male labor force participation. Average wages in craftsperson and operator occupations rose by 8 percent in treated counties — more than double the increase in wages for high-skill professional and managerial occupations. Treated counties had 8 percent higher median male individual incomes by 1979. Effects on female median individual income were minimal, and there were no effects on female labor force participation rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the estimated magnitude of the individual earnings effects, and how do they vary by parent income?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Men born in treatment counties averaged $1,200–$1,300 more per year in real wage earnings (2020 dollars) on 1040 tax returns across the four observation years 1969, 1974, 1979, and 1984, a 2.5–3 percent increase equivalent to roughly one percentile in the national earnings distribution. Heterogeneity by parent rank is pronounced and monotone: children of parents at the very bottom of the 1939 earnings distribution gained approximately $2,000 per year (about 4 percent), while children of the highest-earning parents experienced no significant effect. When county weighting is equalized to eliminate the differential representation of rural (lower-income) counties, effects are roughly constant across the bottom six deciles of the parent earnings distribution and then drop steeply at the top, showing that the earnings gradient was not simply an artifact of plant openings in poorer, smaller counties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How did effects differ by race?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Wartime plant construction increased annual adult earnings of Black men by 4–6 percent ($1,500–$2,500 in 2020 dollars) and of White men by 2–3 percent ($1,000–$1,500 in 2020 dollars). The racial earnings gap in the treatment group is estimated to have narrowed by about 2 percent. However, the pattern of heterogeneity by parent income differs by race: for White men, effects are largest for children of below-median parents and effectively zero for children of above-median parents. For Black men, the largest effects — 7–10 percent ($4,000–$5,000 in 2020 dollars) — accrue to children of parents with earnings above the pooled-race national median, while effects for lower-income Black families range from 3–6.5 percent, suggesting that Black workers from higher-income backgrounds particularly benefited from wartime anti-discrimination policies and the opening of previously restricted manufacturing occupations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why do the 1040 returns show comparable effects for men and women, while W-2 data show no effect on women&amp;rsquo;s individual earnings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Form 1040 returns are filed at the tax-unit level — for married couples, they report the combined wages of both spouses. Because more than 80 percent of women in the sample are married, an increase in a husband&amp;rsquo;s earnings raises the joint 1040 figure for both spouses. The SSA-CPS subsample with individual W-2 records shows that the entire effect on men&amp;rsquo;s Form 1040 wages directly reflects increases in their own W-2 earnings, while women&amp;rsquo;s own W-2 earnings show no positive treatment effect. This finding is consistent with county-level evidence of no impact on female individual income or female labor force participation, and with Rose (2018) finding that women were almost universally excluded from manufacturing jobs after the war&amp;rsquo;s conclusion despite high wartime female manufacturing employment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What evidence tests the developmental-effects mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Three tests argue against childhood developmental effects as the primary driver. First, educational attainment effects — while statistically significant for children of the lowest-income parents (approximately one-quarter of a year more schooling, 3 percentage points more likely to graduate high school) — are too small to account for the earnings increase: a Mincer-equation calculation shows that the education effects can explain less than one-half of the estimated effect on 1979 wages. Second, there is no gradient in earnings effects by birth cohort — children younger at the war&amp;rsquo;s start, who had longer post-treatment childhood exposure, did not benefit more, in direct contrast to the Chetty-Hendren childhood-exposure framework. Third, postwar in-migrants into treatment counties were not drawn from better-educated or higher-income families and did not themselves have more education than in-migrants into comparison regions, ruling out peer effects from selective in-migration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What evidence directly implicates adult labor market access as the operative mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Four pieces of evidence point to contemporaneous adult labor market access. First, individuals born in treatment counties lived as adults in counties with 3–4 percent higher median male earnings and higher wages in semi-skilled blue-collar occupations but not in highly-skilled professional occupations — a pattern quantitatively consistent with the individual earnings effects. Second, the entire earnings effect is concentrated among those who remain in their birth counties: stayers in treatment counties show earnings differences of similar magnitude to county-level manufacturing wage effects, while movers show no difference compared to movers from comparison counties. Third, conditioning on 1979 county of residence eliminates the earnings effect entirely (1979 location fixed effects specification). Fourth, using W-2 data matched to the Business Register in the SSA-CPS sample, men born in treatment counties are directly shown to work in industries with higher wage premiums, with coarse industry classification alone accounting for approximately one-third of the log wage increase.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Is the persistence of regional effects driven by continued Cold War military spending at the plants?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. The paper separates ordnance and ammunition plants — which predominantly became GOCO facilities or Air Force Bases after WWII and received disproportionately more Vietnam War-era defense spending — from general manufacturing plants, which overwhelmingly transitioned to privatized civilian production. Both types of plants show similarly persistent effects on manufacturing employment and comparable impacts on the long-run earnings of local children. Moreover, general manufacturing plants — which did not generate increased postwar military spending — had large permanent effects on overall population growth, while ordnance plants had smaller population effects. The persistence therefore does not appear to reflect continued federal expenditure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What mechanism explains the permanent population effect even after manufacturing employment shares converge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The authors interpret the permanent population differential — treated counties remain roughly 20 percent larger than comparison counties even at the end of the 20th century, after manufacturing employment shares converge — as evidence of path dependence and multiple equilibria. Once a region reaches a new, larger equilibrium, self-sustaining forces (expanded non-tradable employment, public infrastructure investment) maintain it. Treatment counties are more likely to have been connected to the interstate highway system in subsequent decades and show positive effects on local government capital outlays for utilities. The medium-term persistence is attributed partly to the sunk costs of site establishment (surveying, local approvals, infrastructure connections), which make reinvestment at existing sites more attractive than greenfield construction elsewhere.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: Do smaller plant openings generate comparable effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: No. Counties receiving smaller publicly-financed plants costing between $1 and $10 million show no detectable effects on manufacturing employment, population, median family income, or individual adult earnings comparable to those from the large plants. The authors cannot rule out the presence of small effects, but the null results for smaller plants — combined with evidence that the largest effects are in counties with the highest investment intensity per 1940 resident — are consistent with threshold effects (&amp;ldquo;big push&amp;rdquo;) in regional development, though the wide confidence intervals do not allow the authors to conclusively distinguish threshold effects from a linear-in-investment model.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What do modern &amp;ldquo;million-dollar plant&amp;rdquo; openings reveal about the contemporary relevance of these findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Reexamining plant openings from Greenstone et al. (2010) using an event-study design, the authors find that 1980s–1990s million-dollar plant openings expanded manufacturing employment (consistent with Greenstone et al.) but had no impact on average manufacturing wages — in sharp contrast to the WWII findings. Slattery and Zidar (2020) similarly find no impacts on county-level incomes for plant openings since 2000. The correlation between manufacturing employment density and upward mobility rates visible in 1950 had entirely vanished by the end of the 20th century. The authors attribute the divergent results to the changed institutional environment: contemporary production is highly automated, relies on interchangeable labor from staffing agencies, faces intense international competition, and is conducted under much weaker collective bargaining institutions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What is the paper&amp;rsquo;s assessment of aggregate welfare implications?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper is explicit that its local estimates do not allow clean conclusions about aggregate effects. Publicly-financed plant construction in peripheral locations may have crowded out private investment that would otherwise have occurred in major manufacturing hubs. If so, the documented regional gains represent geographic reallocation of manufacturing activity rather than a net increase in the aggregate plant stock. Aggregate gains from reallocation would require that the benefits in the selected dispersed locations exceeded what would have occurred in the counterfactual locations — a plausible conjecture given the paper&amp;rsquo;s evidence that effects are larger in counties with lower prewar manufacturing employment shares and lower initial market access, but one the authors cannot demonstrate decisively.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Government-Owned, Contractor-Operated (GOCO) Plants:&lt;/strong&gt; Manufacturing facilities built and owned by a U.S. government agency (typically the Defense Plant Corporation) during WWII but built and operated by private firms under cost-plus contracts. GOCO status meant the government bore full construction risk and that post-war disposition (sale to private buyers at a fraction of construction cost, or continued GOCO operation for ordnance production) was determined by public agencies, not by the constructing firm&amp;rsquo;s investment calculus.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Place-Based Predistribution:&lt;/strong&gt; The paper&amp;rsquo;s term for the mechanism by which wartime plant construction raised the incomes of existing residents — not through ex-post redistribution of income via taxes and transfers, but by expanding the set of high-wage employment opportunities available to incumbent workers in the region, thereby changing the pre-tax, pre-transfer wage structure facing those workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Adult Labor Market Access (vs. Childhood Developmental Exposure):&lt;/strong&gt; A distinction the paper draws in explaining why children born in treated counties had higher adult earnings. The &amp;ldquo;developmental exposure&amp;rdquo; mechanism (as in Chetty and Hendren 2018b) implies benefits scale with the amount of time spent in an improved childhood environment. The &amp;ldquo;adult labor market access&amp;rdquo; mechanism means children benefit irrespective of years of childhood exposure because they can access improved local labor market conditions when they reach working age as adults — what the paper operationalizes through the finding that earnings effects are entirely accounted for by 1979 county of residence and are concentrated among individuals who remain in their birth counties.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Upward Mobility (Absolute and Relative):&lt;/strong&gt; Following Chetty et al. (2014), the paper uses both concepts: absolute upward mobility means children from low-income backgrounds have higher lifetime earnings than comparable children in counterfactual regions; relative upward mobility means their outcomes converge toward those of children from affluent backgrounds. The paper documents both: large earnings effects for the lowest parent-income deciles, declining linearly to zero for the top deciles.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Conditional Independence (Plant Siting as Quasi-Random):&lt;/strong&gt; The paper&amp;rsquo;s identification assumption — that among counties with observably similar population sizes and basic geographic/infrastructure characteristics, the specific choice of plant siting locations was driven by idiosyncratic, short-run factors uncorrelated with potential postwar outcomes. This is a level-balance assumption (not merely a parallel-trends assumption), required because individual outcomes are only observed in the post-period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Industry Wage Premium:&lt;/strong&gt; The paper uses Krueger and Summers (1988) estimates of inter-industry wage differentials (the portion of a sector&amp;rsquo;s average wage unexplained by worker characteristics) to classify adult employers of treated individuals. Finding that men born in treatment counties work at employers in higher-premium industries — with industry category alone explaining approximately one-third of the log wage increase — provides direct evidence of the adult labor market access mechanism operating through industry sorting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Path Dependence / Multiple Equilibria in Regional Development:&lt;/strong&gt; The paper documents that treated counties remain permanently larger in population than comparison counties even after manufacturing employment shares converge and the original plants begin to close. This self-sustaining population differential, inconsistent with a unique spatial equilibrium, is interpreted as evidence that the temporary wartime shock shifted treated regions into a permanently higher equilibrium, sustained by subsequent infrastructure investment and non-tradable sector expansion proportional to the larger population base.&lt;/p&gt;</description></item><item><title>The Macroeconomic Consequences of Exchange Rate Depreciations</title><link>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-exchange-rate-depreciations/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomic-consequences-of-exchange-rate-depreciations/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;How does an exchange rate depreciation causally affect macroeconomic outcomes? The paper asks whether depreciations are expansionary or contractionary, and through which mechanism. The core identification challenge is endogeneity: exchange rate changes are driven by shocks that simultaneously affect output, making causal inference from unconditional variation misleading.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper studies &amp;ldquo;regime-induced&amp;rdquo; exchange rate depreciations by comparing macroeconomic outcomes for countries that peg their currency to the US dollar versus countries whose currencies float against the US dollar, in response to movements in the US dollar&amp;rsquo;s value. The identifying variation arises from the interaction between a country&amp;rsquo;s pre-existing exchange rate regime (peg vs. float) and changes in the US dollar&amp;rsquo;s nominal effective exchange rate (NEER), as measured by the BIS trade-weighted index against 24 relatively advanced economies (which are excluded from the analysis). This variation — which amounts to roughly 8% of total exchange rate variation in the sample — isolates a component of bilateral exchange rate changes that is orthogonal to idiosyncratic domestic shocks. The empirical specification is a local projection (Jorda, 2005) on annual data from 1973 to 2019 with country fixed effects and region-by-time fixed effects (four regions: Europe, Americas, Africa, Asia/Oceania). The main estimating equation regresses cumulative changes in outcome variables on the interaction term Peg × ΔUSD at horizons h = 0 to 9. Standard errors are two-way clustered by time and country. Exchange rate regime classification follows Ilzetzki, Reinhart, and Rogoff (2019); observations classified in the most ambiguous intermediate categories (coarse category 3) are dropped from the baseline.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Empirical Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Regime-induced depreciations are strongly and persistently expansionary. In response to a 1% depreciation of the US dollar, the trade-weighted nominal effective exchange rate of pegger countries depreciates by 0.74% relative to floater countries on impact, rising to 0.9% before falling back to about 0.6% over years 3–5. The real effective exchange rate depreciates by a similar but slightly less persistent amount. The GDP response builds gradually, peaking after five years at approximately 0.4% per 1% US dollar depreciation. Expressed in terms of local currency depreciation, a 10% regime-induced depreciation results in a 5.5% increase in GDP over five years. Consumption rises by nearly 0.4% of GDP at peak. Investment also rises gradually, peaking after five years.&lt;/p&gt;
&lt;p&gt;Two findings are particularly important for identifying the transmission mechanism. First, net exports fall in response to a regime-induced depreciation. Imports rise more than exports for several years following the depreciation, ruling out an export-led boom driven by expenditure switching as the primary driver. Second, the short-term nominal interest rate rises modestly in pegging countries relative to floaters (by less than 0.1 percentage point per 1% depreciation), and the ex-post real interest rate response fluctuates around zero and is statistically insignificant throughout. This rules out looser monetary policy in pegger countries as the driver of the boom. Together, these two findings rule out a large set of standard open-economy models (including those with expenditure switching, monetary easing, and s = 0 financial frictions).&lt;/p&gt;
&lt;p&gt;The booms are concentrated in the service sector. Manufacturing, agriculture, and mining/construction responses are close to zero, indicating a domestic demand-led boom rather than an export-led one. The GDP response is entirely driven by countries with above-median capital account openness (as measured by the Chinn-Ito index); countries with below-median capital account openness show a similar exchange rate response but no significant output response. Results are similar across the early (1973–1995) and later (1996–2019) sub-periods.&lt;/p&gt;
&lt;p&gt;The Plaza Accord of 1985 provides a concrete illustration: the log real exchange rate of peggers depreciated by 12% (SE 2.7%) relative to floaters in the first year, while log GDP of peggers was 7.4% (SE 3.1%) higher after five years, implying a GDP response to a 10% depreciation of 6.2%, broadly consistent with the baseline estimates.&lt;/p&gt;
&lt;p&gt;Robustness checks controlling for Peg × US GDP growth, Peg × US inflation, Peg × US interest rate, Peg × commodity price changes, and Peg × global financial cycle (Miranda-Agrippino and Rey) leave results virtually unchanged.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;To explain these facts, the paper develops a four-region model (US, Euro Area, pegs to USD, pegs to euro) with imperfect financial openness. The model features (i) UIP deviations between the euro and US dollar driven by financial shocks (ψ_t), and (ii) sticky household portfolio shares, so that households invest a fixed fraction s of savings in foreign bonds and do not fully arbitrage cross-currency return differentials. When s = 0 (no household access to foreign assets), standard theory predicts that expenditure switching and real income channels dominate, yielding rising net exports — directly contradicting the data (Proposition 2). When s &amp;gt; 0, a &amp;ldquo;foreign credit channel&amp;rdquo; operates: following a regime-induced depreciation, expected future appreciation of the pegger currency makes foreign-currency borrowing cheaper, stimulating domestic consumption and investment, causing imports to rise more than exports (Proposition 3), consistent with the data.&lt;/p&gt;
&lt;p&gt;The model also accounts for unconditional exchange rate disconnect and the Mussa facts. Two shocks — UIP shocks (which generate a positive exchange rate–output correlation) and domestic discount factor shocks (which generate a negative correlation, since demand contractions lead to currency depreciations via monetary easing) — together produce a low unconditional correlation between exchange rates and output even though the conditional effect of regime-induced depreciation is large. The same logic explains why switching from fixed to floating exchange rates raises exchange rate volatility dramatically without raising macroeconomic volatility commensurately: pegging eliminates UIP shock exposure (reducing output volatility) but removes the ability to use monetary policy to offset discount factor shocks (raising output volatility), and these two effects roughly offset each other in the quantitative model.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the core identification strategy, and what assumption is required for it to yield causal estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: The strategy compares macroeconomic outcomes in countries pegged to the US dollar versus countries floating against the US dollar when the US dollar&amp;rsquo;s value changes. The identifying assumption is that peggers are not differentially exposed (relative to floaters) to aggregate shocks that are correlated with the US dollar exchange rate. If this holds, the direct effects of shocks driving the US dollar move pegs and floats symmetrically and are absorbed by region-by-time fixed effects, leaving only the regime-induced component. Differential exposure to US dollar-correlated shocks is the main threat to identification, but the paper shows robustness by controlling for interactions of the peg indicator with US GDP growth, US inflation, US interest rate changes, commodity price changes, and the global financial cycle.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How is &amp;ldquo;regime-induced&amp;rdquo; exchange rate variation defined, and how large is it relative to total variation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Regime-induced variation is the component of a country&amp;rsquo;s exchange rate change that arises from its pre-existing regime vis-à-vis the US dollar interacted with the change in the US dollar&amp;rsquo;s nominal effective exchange rate. It is identified via the interaction term Peg_i,t × ΔUSD_t in the local projection. This variation represents roughly 8% of total variation in exchange rates in the sample, so the strategy isolates a small but clean slice of total exchange rate movements.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do nominal and real effective exchange rates respond for peggers versus floaters?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: In response to a 1% depreciation of the US dollar, the trade-weighted nominal effective exchange rate of peggers depreciates by 0.74% relative to floaters on impact, peaks around 0.9%, and then gradually declines to roughly 0.6% over years 3–5. The real effective exchange rate depreciates by a similar but slightly less persistent amount. The less-than-one-for-one response occurs because the classification includes imperfect pegs and imperfect floats; however, this misclassification attenuates both the first stage (exchange rate response) and the reduced form (output response) proportionally, so the ratio — the IV-style estimate — remains unbiased.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the quantitative magnitude of the output effect, and how is it computed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: In response to a 1% US dollar depreciation, GDP of peggers rises by approximately 0.4% relative to floaters, peaking after five years and building gradually. To express this as a response to a 10% local currency depreciation: the average nominal exchange rate response over the first five years is roughly 0.7%, so the implied GDP response per 10% depreciation is 10 × 0.4 ÷ 0.7 ≈ 5.5%. The Plaza Accord case study yields a similar magnitude: a 12% first-year real exchange rate differential is followed by a 7.4% differential in log GDP after five years, implying 6.2% per 10% depreciation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why does the behavior of net exports rule out the expenditure-switching mechanism as the primary driver?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: Standard open-economy models predict that a depreciation improves competitiveness, boosting exports and reducing imports — generating an improvement in net exports as the engine of expansion. The paper finds the opposite: imports rise more than exports for several years following a regime-induced depreciation, so net exports fall. This is inconsistent with an export-led expenditure-switching boom. The finding is also inconsistent with the real income channel (as formalized in Proposition 2): even with s = 0, standard models predict rising net exports, but the data show the reverse.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why does the behavior of interest rates rule out monetary policy easing as the driver?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: If the US dollar depreciated because of loose US monetary policy, countries with currencies pegged to the US dollar would share US monetary policy more strongly, and one would expect a relative decline in nominal interest rates for peggers. The opposite is found: the nominal interest rate of peggers rises slightly relative to floaters (by less than 0.1 percentage point per 1% depreciation), and the real interest rate response is statistically indistinguishable from zero throughout the nine-year horizon. This rules out the interpretation that the boom is driven by an easing of monetary conditions in the pegger countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What are ex-post UIP deviations, and what do they imply about the shock driving the variation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: Ex-post UIP deviations measure the excess return to holding assets denominated in pegger currencies relative to floater currencies. After the initial depreciation of pegger currencies, those currencies subsequently appreciate and their nominal interest rates are (if anything) higher than floater interest rates. This means the ex-post return to holding pegger-currency assets is higher than for floater-currency assets — a positive UIP deviation that builds over several years after the shock. These deviations imply that the shocks driving the US dollar depreciation are financial in nature (UIP shocks), not changes in expected near-term monetary policy fundamentals.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What is the foreign credit channel, and how does it work in the model?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The foreign credit channel (the second term in equation (18) of Proposition 1) operates through the cost of foreign-currency borrowing. When the pegger currency depreciates on impact and then is expected to appreciate subsequently, the exchange-rate-adjusted cost of borrowing in foreign currency falls — that is, expected future appreciation of the domestic currency reduces the real cost of foreign credit. To the extent that households have portfolio shares in foreign bonds (s &amp;gt; 0), this stimulates consumption via intertemporal substitution. The channel is operative only when s &amp;gt; 0; with s = 0 (no household access to foreign assets), net exports must rise rather than fall (Proposition 2), contradicting the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does Proposition 1 establish that real interest rates and real exchange rates are sufficient statistics for the relative responses of all macroeconomic aggregates in this setting?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: Under Assumption 1 (pegs to the US dollar and pegs to the euro face symmetric non-monetary fundamental shocks), the relative responses of consumption, output, exports, and imports of USD-peggers versus euro-peggers are functions only of the relative path of the real interest rate and the real effective exchange rate. This is because the underlying shocks to the US economy and the Euro Area economy are common to both groups of peggers and cancel out in the comparison. The monetary regime of a country is fully summarized by the paths of the real interest rate and the real exchange rate. Since the estimated relative real interest rate response is close to zero, the paper infers that the observed output differential must arise from the real exchange rate path — hence the title.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Why does the output response differ by capital account openness but not by trade openness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: The GDP response to a regime-induced depreciation is entirely driven by countries with above-median capital account openness (Chinn-Ito index). Countries below the median show a similar real exchange rate response but no significant output response. In contrast, splitting by trade openness (exports plus imports as a share of GDP) yields similar output responses in both sub-groups. This pattern is consistent with the model&amp;rsquo;s foreign credit channel, which operates through international capital flows (the parameter s representing financial openness). Countries with restricted capital accounts cannot borrow cheaply from abroad when their currencies become &amp;ldquo;cheap,&amp;rdquo; so the foreign credit channel is shut down. The result is inconsistent with the expenditure-switching channel, which would predict larger effects for more trade-open economies.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the sector composition of the output boom, and what does it imply about the transmission mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: The bulk of the output response is concentrated in the service sector. Manufacturing, agriculture, and the mining/construction/energy sectors show responses close to zero, with only a modest boom in the latter at very long horizons. Services are predominantly non-tradable, so this sectoral pattern is consistent with a domestic demand-led boom (via the foreign credit channel) rather than an export-led boom (via expenditure switching on tradable goods). The foreign credit channel stimulates domestic demand broadly, which disproportionately raises output in the non-tradable sector.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How does the model reconcile large conditional effects of exchange rates with unconditional exchange rate disconnect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The paper introduces two shocks: UIP shocks (ψ_t) and domestic discount factor shocks (β_t). UIP shocks cause the exchange rate to depreciate and output to rise (a positive conditional correlation). Discount factor shocks reduce domestic demand; monetary policy responds by lowering interest rates, which depreciates the exchange rate, but if the monetary response is insufficient to fully offset the shock, output falls — generating a negative conditional correlation between the exchange rate and output. The unconditional correlation between the exchange rate and output is a weighted average of these two conditional correlations. If these effects are of similar magnitude and opposite sign, the unconditional correlation can be close to zero even though each structural shock generates a large conditional response. This is directly analogous to how supply and demand shocks can generate a small unconditional price-quantity correlation in a standard market setting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How does the model provide a new interpretation of the Mussa fact?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A13: The Mussa fact is that the collapse of Bretton Woods dramatically increased the volatility of real exchange rates in countries that switched to floating, without a corresponding increase in macroeconomic volatility. In the model, pegging has two opposing effects on output volatility: it insulates the economy from UIP shocks (reducing output volatility) but prevents the use of monetary policy to offset discount factor shocks (raising output volatility). In the quantitative model (Appendix D), these effects roughly offset each other, so moving from a peg to a float raises exchange rate volatility substantially while leaving macroeconomic volatility roughly unchanged — consistent with the Mussa fact. This contrasts with the Itskhoki-Mukhin interpretation, which attributes Mussa facts to exchange rates (driven by UIP shocks) having little effect on output; in the present paper, the conditional effects are large but cancel in the unconditional moments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What does the paper imply for the tradeoffs associated with adopting a fixed versus flexible exchange rate regime?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A14: Traditional analyses of the monetary trilemma emphasize that pegging to the US dollar forces a country to follow US interest rate policy. The paper argues that a first-order consequence of pegging — one that may outstrip the traditional monetary policy tradeoff in importance — is that the country imports the financial shocks (UIP shocks) that drive the US exchange rate while potentially reducing its exposure to home-grown financial shocks. When the US dollar depreciates due to financial shocks, pegger countries experience a stimulatory foreign credit inflow. Conversely, when the US dollar appreciates due to financial shocks, pegger countries face tighter financial conditions. The importance of this financial shock trade-off, the paper argues, may greatly exceed the importance of the traditional monetary trilemma in environments where financial shocks are a dominant driver of exchange rate fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q15: How does the paper handle the potential concern that the peg classification is imperfect?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A15: The paper notes that misclassification of pegs and floats attenuates both the exchange rate response (first stage) and the output response (reduced form) proportionally. Since the ultimate quantity of interest is the ratio of the output response to the exchange rate response (analogous to an IV estimate), misclassification in both the numerator and denominator does not introduce bias. This is analogous to an instrumental variables regression where the first stage need not have a high R-squared for the IV estimate to be valid. The paper also shows robustness to alternative treatments of the ambiguous intermediate categories (Ilzetzki-Reinhart-Rogoff coarse category 3), including them as pegs or floats, with similar results in both cases.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Regime-induced depreciation&lt;/strong&gt;: A change in a country&amp;rsquo;s bilateral exchange rate that arises specifically because the country has a pre-existing peg (or float) to a reference currency, and that reference currency&amp;rsquo;s value changes in world markets. The variation is defined as the component of a country&amp;rsquo;s exchange rate movement driven by the interaction between its exchange rate regime vis-à-vis the US dollar and changes in the US dollar&amp;rsquo;s nominal effective exchange rate. This is distinguished from all other exchange rate variation, including that driven by domestic idiosyncratic shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Foreign credit channel&lt;/strong&gt;: The mechanism in the paper&amp;rsquo;s model through which a regime-induced depreciation stimulates domestic demand. When the domestic currency depreciates on impact and is expected to appreciate subsequently, the exchange-rate-adjusted cost of borrowing in foreign currency falls. Households with portfolio shares in foreign bonds (s &amp;gt; 0) borrow more cheaply from abroad, stimulating consumption via intertemporal substitution. This channel requires imperfect financial openness (s &amp;gt; 0 but not full UIP arbitrage) and predicts that the output boom is domestic-demand-led with falling net exports — consistent with the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;UIP shock (ψ_t)&lt;/strong&gt;: An exogenous shock to uncovered interest parity between the US dollar and the euro, interpreted as arising from frictions in international financial markets or from exogenous shifts in demand for one currency over another. A positive ψ_t represents an increase in demand for the euro (relative to the US dollar), depreciating the US dollar. These shocks are the paper&amp;rsquo;s preferred interpretation of the financial shocks driving the US dollar exchange rate, consistent with the observed joint behavior of exchange rates and interest rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Imperfect financial openness (parameter s)&lt;/strong&gt;: The share of household savings invested in foreign bonds. At s = 0, households have no access to foreign assets (as in Gabaix-Maggiori and Itskhoki-Mukhin); at full financial integration with UIP holding (ψ_t = 0), there is no foreign credit channel. The paper&amp;rsquo;s model is intermediate: s &amp;gt; 0 but portfolio weights are sticky, so households do not fully arbitrage cross-currency expected return differentials. The foreign credit channel is operative only when s &amp;gt; 0, and the strength of the output boom is increasing in s/σ (the ratio of financial openness to the coefficient of relative risk aversion).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient statistics (real interest rate and real exchange rate)&lt;/strong&gt;: Under Proposition 1, conditional on Assumption 1 (symmetric non-monetary fundamental shocks across pegger groups), the relative responses of all macroeconomic aggregates for peggers to the US dollar versus peggers to the euro are functions only of the relative path of the real effective exchange rate and the relative path of the real interest rate. The full set of underlying shocks — monetary, financial, productivity, or discount factor — does not need to be separately identified; only the paths of these two prices matter for relative macroeconomic outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exchange rate disconnect&lt;/strong&gt;: The empirical finding, documented extensively since Meese and Rogoff (1983), that exchange rates have very low unconditional correlations with macroeconomic aggregates such as output and consumption. In the paper&amp;rsquo;s sample, real exchange rates of floating countries are three to four times more volatile than GDP and consumption, and the unconditional correlation of the real exchange rate with GDP is mildly negative (around −0.05 to −0.07). The paper offers a new explanation: this low unconditional correlation reflects the cancellation of large but opposite-signed conditional correlations from UIP shocks and discount factor shocks, rather than indicating that exchange rates have small effects on the economy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Mussa fact&lt;/strong&gt;: The empirical observation (Mussa, 1986) that when countries switched from fixed to floating exchange rates after the collapse of Bretton Woods, real exchange rate volatility increased dramatically — for floaters roughly 50–60% higher standard deviation in the paper&amp;rsquo;s sample than for peggers — but the volatility of GDP, consumption, and other macroeconomic aggregates did not increase correspondingly. The paper interprets this through its two-shock model as the result of two opposing effects of pegging: insulation from UIP shocks (which reduces macroeconomic volatility) versus inability to use monetary policy to offset discount factor shocks (which raises macroeconomic volatility), with the two effects roughly offsetting in the quantitative model.&lt;/p&gt;</description></item><item><title>The Macroeconomics of Irreversibility</title><link>https://macropaperwarehouse.com/papers/the-macroeconomics-of-irreversibility/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-macroeconomics-of-irreversibility/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research question.&lt;/strong&gt; How does partial capital irreversibility — arising from a wedge between the purchase price and the resale (discounted) price of capital — shape the persistence and amplitude of aggregate capital fluctuations? And what is the quantitative magnitude of the capital price wedge that is needed to simultaneously reconcile micro-level investment behavior with macroeconomic propagation?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; Baley and Blanco build a continuous-time investment model for a continuum of firms facing (i) idiosyncratic productivity shocks (geometric Brownian motion), (ii) fixed capital adjustment costs proportional to productivity, and (iii) a capital price wedge ω, under which firms buy capital at price p and sell at p(1−ω). The key state variable is the log capital-productivity ratio k̂. The optimal policy takes the form of an inaction region with two distinct reset points — one for upsizing (k̂*₋) and one for downsizing (k̂*₊) — instead of the single reset point that arises without the wedge.&lt;/p&gt;
&lt;p&gt;Their central innovation is the Cumulative Impulse Response (CIR): the cumulative deviation of average capital-productivity ratios following a small, permanent, unanticipated aggregate productivity shock. They show the CIR can be expressed analytically through three sufficient statistics derived entirely from the steady-state cross-sectional distribution of k̂ and capital age a: (i) Var[k̂], (ii) Cov[k̂, a], and (iii) an &amp;ldquo;irreversibility term&amp;rdquo; reflecting how idiosyncratic shocks change the anticipated direction of the next adjustment. Because idiosyncratic and aggregate shocks enter the law of motion symmetrically, steady-state moments encode the aggregate propagation.&lt;/p&gt;
&lt;p&gt;To handle the path dependence introduced by the dual reset points, they condition all behavior on the previous reset (upsizing or downsizing) and characterize transitions across reset points via a Markov chain. They then derive explicit mappings from observable microdata — size and direction of investment adjustments, duration of inaction spells, and cross-spell transition probabilities — back to the unobservable capital-productivity distributions and sufficient statistics. These mappings require no revenue or productivity data; investment actions alone suffice.&lt;/p&gt;
&lt;p&gt;They extend the baseline model to a generalized hazard framework (stochastic, asymmetric fixed costs), enabling the model to match the full empirical investment-rate distribution, and apply everything to annual establishment-level manufacturing data from Chile (Encuesta Nacional Industrial Anual, 1980–2011), restricting to plants observed for at least ten years with more than ten workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Price wedge estimate.&lt;/strong&gt; A capital price wedge of ω = 0.12 (12%) is selected as the preferred value because it maximizes joint consistency between the model&amp;rsquo;s predicted CIR decomposition and the data, while also matching the distribution of investment rates. At ω = 0 the model generates a CIR of 0.92 and a negative covariance term, inconsistent with the data. At ω = 0.18 the aggregate CIR level (2.39) is close to data (2.33) but the decomposition diverges. At ω = 0.12, the CIR is 1.93 and the decomposition into sufficient statistics closely mirrors the data structure.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Irreversibility doubles persistence.&lt;/strong&gt; In the analytically tractable case of zero drift and only a price wedge (no fixed costs), the CIR equals exactly twice the ratio Var[k̂]/σ², compared to the single fixed-cost case. This means irreversibility doubles the persistence of aggregate capital fluctuations for a given cross-sectional dispersion. More generally, under the calibrated model, a 1% decrease in aggregate productivity generates a nearly 2% cumulative deviation of average capital-productivity ratios from steady state. Without irreversibility, the CIR collapses to approximately 1.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Decomposition of the CIR.&lt;/strong&gt; At ω = 0.12, the variance term Var[k̂]/σ² accounts for 72% of the CIR; the covariance term ν·Cov[k̂,a]/σ² accounts for 10%; and the irreversibility term accounts for 18%. The positive covariance (Cov[k̂,a] = 0.152 &amp;gt; 0) reflects that firms subject to downward rigidity accumulate older capital stocks above the economy&amp;rsquo;s average, amplifying persistence. This positive covariance arises because the price wedge&amp;rsquo;s downward-rigidity force dominates the drift&amp;rsquo;s negative effect.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Micro-level evidence.&lt;/strong&gt; In the Chilean data, the inaction rate is 40%. More than 96% of adjustments are positive (upsizing), fewer than 4% are negative. The probability of upsizing after a previous upsize is P⁻⁻ = 0.958; the probability of downsizing after a downsize is P⁺⁺ = 0.124. A logistic regression yields an odds ratio of 3.3, meaning a firm is more than three times as likely to purchase capital following a prior purchase than following a prior sale. The average duration of inaction conditional on a prior purchase is E⁻[τ] = 1.72 years; conditional on a prior sale it is E⁺[τ] = 1.98 years. These patterns are qualitatively consistent with the serial correlation in adjustment sign predicted by the model.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Comparison with existing wedge estimates.&lt;/strong&gt; The calibrated ω = 0.12 lies between micro-level studies based on liquidating firms (Ramey and Shapiro, 2001: ω ≈ 0.72; Kermani and Ma, 2023: ω ≈ 0.65) and structural models calibrated to static moments of investment distributions (Cooper and Haltiwanger, 2006; Khan and Thomas, 2013: ω = 0.025–0.07). The lower value relative to liquidation studies is attributed to selection effects (liquidating firms face fire-sale dynamics) and firm-internal capital reallocation that mitigates irreversibility for continuing firms.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope conditions.&lt;/strong&gt; The analysis is a partial equilibrium characterization of transitional dynamics, maintaining constant interest rates and steady-state investment policies throughout the transition (a general equilibrium extension delivering constant prices as an equilibrium outcome is provided in Appendix D). Results apply to small, permanent, unanticipated aggregate productivity shocks; nonlinearities for shocks below 5% are found to be tiny. The empirical application is specific to Chilean manufacturing establishments, 1980–2011.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1. What is the economic mechanism by which capital irreversibility generates persistence in aggregate capital fluctuations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Irreversibility creates two distinct reset points rather than one. When a negative aggregate productivity shock hits, it shifts more firms into the downsizing region. Downsizing firms, because they have been selling capital sequentially, maintain capital-productivity ratios persistently above the economy&amp;rsquo;s average and continue to do so for multiple periods. This increases the share of firms in a persistent &amp;ldquo;downsizing phase,&amp;rdquo; which prolongs the aggregate deviation from steady state. Two channels compound: first, the population tilts toward more downsizing firms; second, their mean deviations become larger and converge more slowly. Both channels increase the CIR. Crucially, without irreversibility, firms become identical after their first adjustment and there is no additional persistence beyond what fixed costs alone generate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2. How are the three sufficient statistics derived, and what does each capture?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The CIR is characterized as a steady-state cross-sectional average of a recursive function m(k̂). Integrating over firms first and then time, and splitting each firm&amp;rsquo;s horizon at its first adjustment, yields three steady-state terms (Proposition 4). The first statistic, Var[k̂]/σ², measures how far firms allow their capital-productivity ratio to drift from the frictionless optimum — the &amp;ldquo;insensitivity of incomplete spells&amp;rdquo; to idiosyncratic productivity shocks. The second statistic, ν·Cov[k̂,a]/σ², is a bias-correction term that removes drift effects from the variance, ensuring only Brownian-shock sensitivity is captured. The third statistic, unique to the irreversibility case, measures how much idiosyncratic shocks alter the anticipated direction of the next adjustment — the &amp;ldquo;insensitivity of complete spells&amp;rdquo; — and equals the difference in expected cumulative deviations between departing and ending points of an inaction spell, scaled by duration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3. Why is the CIR exactly twice as large under pure irreversibility (no fixed costs) as under pure fixed costs, for a given level of dispersion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Proposition 5, case (ii) shows that with zero drift and only a price wedge, the CIR = 2 × Var[k̂]/σ², because the first and third sufficient statistics are identical and the covariance term is zero. In contrast, with only fixed costs (case (i)), the CIR = Var[k̂]/σ². The doubling arises because the price wedge generates history-dependence through the dual reset: after a firm adjusts, whether it upsized or downsized predicts its future adjustment direction. This &amp;ldquo;anticipated terminal condition&amp;rdquo; effect (captured by the third statistic) adds an equal contribution to the CIR as the pure inaction effect (the first statistic), doubling total persistence for the same cross-sectional dispersion.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4. How does the empirical strategy recover the capital price wedge?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The price wedge cannot be identified from the investment rate distribution alone: for any price wedge ω, the generalized hazard framework can find an adjustment hazard function Λ(k̂) such that the product Λ(k̂)·g(k̂) matches the observed investment density h(Δk̂). Instead, the authors use the CIR&amp;rsquo;s sufficient statistics — specifically the covariance term and the irreversibility term — as additional discriminating moments. At ω = 0, the model produces a negative covariance (inconsistent with the positive Cov[k̂,a] = 0.152 in the data) and no irreversibility term. At ω = 0.12, all three sufficient statistics simultaneously align with their data counterparts in relative importance (72%, 10%, 18%), selecting this wedge as preferred. The CIR level at ω = 0.12 is 1.93, somewhat below the data value of approximately 2.54–2.60, but the preferred criterion is mechanistic consistency, not just level matching.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5. What is the role of the Markov chain across reset points in handling path dependence?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Because optimal investment features serial correlation in the sign of adjustment (P⁻⁻ = 0.958 and P⁺⁺ = 0.124 in the data), firms&amp;rsquo; future behavior depends on their most recent reset point. To maintain tractability, the authors condition all densities, durations, and expectations on the previous reset (upsizing g⁻(k̂) or downsizing g⁺(k̂)). The transition matrix P encoding probabilities P⁻⁻, P⁻⁺, P⁺⁻, P⁺⁺ determines the steady-state shares of upsizing and downsizing firms (as the eigenvector of P) and the renewal weights r⁻ and r⁺ that rescale conditional densities to account for observational bias (firms with longer inaction spells contribute more to the cross-section). This Markov structure is sufficient because one adjustment erases all heterogeneity except the direction of adjustment.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6. What do the microdata mappings recover, and how are the reset points identified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Stage I mappings (Propositions 6–9) recover: drift ν = E[Δk̂]/E[τ]; volatility σ² from cross-spell moment E[(k̂τ&amp;rsquo; + ντ&amp;rsquo;)² − (k̂*)²]/E[τ]; conditional means E±[k̂] as midpoints of inaction spells weighted by relative adjustment size; Var[k̂] from differences in cubed stopped values; Cov[k̂,a] from variance, average age, and the dynamic covariance E[(k̂τ&amp;rsquo; − E[k̂])²τ&amp;rsquo;]/E[τ]; and the irreversibility term from differences in expected deviations at departing vs. ending reset points. Stage II (Proposition 10) recovers the two reset points k̂*₋ and k̂*₊ from optimality conditions that equalize the investment price to the expected discounted marginal product of capital during inaction plus the expected value of undepreciated capital, conditioning on the prior reset. The inner inaction region width k̂*₊ − k̂*₋ = 0.813 in the Chilean data, of which 45% is attributed to the exogenous price wedge and 55% to the endogenous response to the wedge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7. How does the sign of Cov[k̂,a] depend on the price wedge vs. the drift?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;With zero price wedge and negative drift ν &amp;lt; 0 (depreciation exceeding productivity growth), firms with older capital have capital-productivity ratios below average, yielding Cov[k̂,a] &amp;lt; 0. The drift makes old capital-productivity ratios negative. Introducing a price wedge creates downward rigidity: unproductive firms delay selling, so old firms accumulate capital-productivity ratios above average, pushing Cov[k̂,a] toward positive values. The covariance turns positive once ω &amp;gt; 0.08 (in the illustrative parametrization in Figure V). In the Chilean calibration at ω = 0.12, Cov[k̂,a] = 0.152 &amp;gt; 0, confirming that the price wedge&amp;rsquo;s effect dominates the drift&amp;rsquo;s negative effect. A positive covariance amplifies the CIR (through the second sufficient statistic with ν &amp;gt; 0).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8. What is the generalized hazard extension and why is it needed?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The baseline model with a single fixed cost θ generates an investment distribution concentrated at two mass points (purchases and sales of fixed size), which does not match the empirical distribution&amp;rsquo;s coexistence of large and small investment rates and its convex shape. The generalized hazard model replaces the deterministic fixed cost with a stochastic, state-dependent adjustment cost, parameterized by a hazard function Λ(k̂) giving the probability of adjusting per unit time at any capital-productivity ratio in the outer inaction region. This function is recovered non-parametrically from the data by fitting a Gamma distribution to the investment density and inverting the Kolmogorov Forward Equation. The generalized hazard model nests the baseline model, random fixed cost models (Thomas 2002, Khan and Thomas 2008), and asymmetric adjustment models, while preserving the sufficient statistics characterization.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9. How does the model handle the &amp;ldquo;problem with reinjection&amp;rdquo; that arises from path dependence after the first adjustment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Without irreversibility, a firm&amp;rsquo;s initial state k̂₀ does not affect behavior after the first adjustment, because there is a unique reset point; subsequent behavior is independent of the aggregate shock magnitude. With irreversibility, firms only partially absorb the aggregate shock at the first adjustment, since the initial state affects the probability of subsequently upsizing or downsizing. In principle, one must track firms through infinitely many adjustments. The paper&amp;rsquo;s resolution (Proposition 2) is to note that the first adjustment erases all heterogeneity except the direction (upsizing vs. downsizing), allowing subsequent behavior to be summarized by just two numbers m(k̂*₋) and m(k̂*₊), combined with the transition probabilities P⁻(k̂₀) and P⁺(k̂₀). This yields a recursive formulation for m(k̂) governed by an HJB equation with two boundary conditions at the reset points, making the problem tractable.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10. What is the role of the stationarity condition in pinning down the CIR?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The HJB for m(k̂) has infinitely many solutions (m(k̂) + a for any constant a). The stationarity condition, requiring that the cross-sectional average of m(k̂) in steady state is zero (no fluctuations without shocks), pins down the unique solution. Economically, it says that average cumulative deviations from complete upsizing spells and complete downsizing spells must exactly balance the deviations from incomplete inaction spells. For upsizing firms, deviations are negative (they hold too little capital relative to average); for downsizing firms, deviations are positive (they hold too much capital). The stationarity condition imposes a linear relationship between m(k̂*₋) and m(k̂*₊) that together with the HJB uniquely determines the solution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11. How are the results extended to assess nonlinearities and robustness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Appendix G studies nonlinearities numerically in the generalized hazard model for different signs and magnitudes of the aggregate productivity shock. The authors find tiny nonlinearities and asymmetries for productivity shocks below ε = 5%, validating the first-order approximation used throughout. Appendix E.7 provides comparative statics on the output-capital elasticity α. The model is estimated with an inaction threshold of ι = 0.01 (investment rates below 1% in absolute value are treated as inaction), consistent with Cooper and Haltiwanger (2006). The investment distribution is truncated at the 2nd and 98th percentiles to remove outliers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12. What broader applicability do the authors claim for the CIR sufficient statistics framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors argue the framework applies wherever path-dependent lumpy adjustments occur, including: inventory management (with two types of ordering decisions), durable goods consumption, and labor markets with sticky wages. The key requirement is the existence of a finite number of reset points and sufficient microdata to discipline the transition probabilities across them. Future extensions noted in the paper include: analysis of other aggregate shocks (profitability, capital prices, interest rates); corporate tax reform; monetary policy interacting with investment frictions; time-varying and endogenous price wedges in secondary markets; and higher-order cross-sectional moment responses (variance, skewness of capital-productivity ratios) by choosing different functions f(k̂) for the generalized CIR.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Capital price wedge (ω).&lt;/strong&gt; The fractional discount between the purchase price of capital p and its resale price p(1−ω). In the model this creates two distinct reset points for investment (one for buying at price p, one for selling at the discounted price) and represents the core source of irreversibility. It reflects asset specificity, adverse selection, intermediary fees, and obsolescence. The preferred calibrated value for Chilean manufacturing is ω = 0.12.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cumulative Impulse Response (CIR).&lt;/strong&gt; The integral over all future dates of the impulse response function of the average capital-productivity ratio following a small, permanent, unanticipated aggregate productivity shock. It summarizes both the impact and persistence of aggregate capital fluctuations in a single scalar. Without investment frictions, the CIR is zero (firms adjust instantaneously); the calibrated CIR at ω = 0.12 is 1.93, meaning a 1% aggregate shock generates a 1.93% cumulative deviation.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Dual reset points (k̂&lt;/em&gt;₋ and k̂&lt;/em&gt;₊).** The two levels to which firms reset their capital-productivity ratio upon adjustment: k̂*₋ after a capital purchase (upsizing) and k̂*₊ after a capital sale (downsizing). With a price wedge, k̂*₊ &amp;gt; k̂*₋, creating an &amp;ldquo;inner inaction region&amp;rdquo; [k̂*₋, k̂*₊] with path-dependent behavior. The inner inaction region width is 0.813 in the Chilean data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient statistics for the CIR.&lt;/strong&gt; Three steady-state cross-sectional moments that together fully characterize the CIR up to first order: (i) Var[k̂]/σ², the scaled cross-sectional variance of capital-productivity ratios (captures insensitivity of incomplete spells to idiosyncratic shocks); (ii) ν·Cov[k̂,a]/σ², the scaled covariance of capital-productivity ratios with capital age (a drift-bias correction); (iii) the &amp;ldquo;irreversibility term&amp;rdquo; measuring how idiosyncratic shocks change the anticipated direction of the next adjustment (unique to the irreversibility case, zero without a price wedge).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Serial correlation in adjustment sign.&lt;/strong&gt; The property, implied by the dual-reset structure, that a firm is more likely to purchase capital following a prior purchase and more likely to sell following a prior sale. In the Chilean data, P⁻⁻ = 0.958 (probability of upsizing after a prior upsize) vs. P⁺⁺ = 0.124 (probability of downsizing after a prior downside), and a logistic regression yields an odds ratio of 3.3.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Generalized hazard function Λ(k̂).&lt;/strong&gt; A state-dependent adjustment probability per unit time, allowing for stochastic and asymmetric fixed costs, that generates the full empirical investment rate distribution. It replaces the single deterministic fixed cost of the baseline model. The hazard function is recovered non-parametrically from microdata by fitting a Gamma distribution to the investment density and inverting the Kolmogorov Forward Equation, conditional on the price wedge.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Renewal weights (r⁻, r⁺).&lt;/strong&gt; Weights used to construct the unconditional density of capital-productivity ratios from the two conditional densities (conditional on prior purchase g⁻(k̂) and prior sale g⁺(k̂)). They rescale adjustment shares by relative average duration, correcting for the observational bias that firms with longer inaction spells are over-represented in the cross-section: r± = (N±/N) × (E±[τ]/E[τ]).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Endogenous irreversibility.&lt;/strong&gt; The component of the inner inaction region width (k̂*₊ − k̂*₋) that arises not from the exogenous price wedge directly but from firms&amp;rsquo; endogenous responses to the wedge — specifically, the differences in expected marginal products and user costs across the two types of inaction spells. At ω = 0.12, 45% of the inner inaction region is attributed to the exogenous wedge and 55% to endogenous amplification.&lt;/p&gt;</description></item><item><title>The Margins of Trade</title><link>https://macropaperwarehouse.com/papers/the-margins-of-trade/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-margins-of-trade/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Eaton and Fieler seek to reconcile two literatures that have advanced in parallel but remained at odds: (i) general equilibrium models of bilateral trade flows (the &amp;ldquo;gravity&amp;rdquo; tradition) and (ii) empirical work on the margins of trade — the decomposition of bilateral trade into the extensive margin (number of products traded), the quantity margin (physical volumes), and the unit-value (price) margin. Standard GE models cannot accommodate two of the most robust empirical regularities: that richer importing countries pay higher unit values for the same product, and that richer exporting countries charge higher unit values. The paper builds a framework that captures all three margins jointly while still delivering the standard gravity equation and the Arkolakis-Costinot-Rodriguez-Clare (ACR) welfare formula.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The analysis uses UN COMTRADE bilateral merchandise trade data for the 50 largest economies by GDP in 2007, the most disaggregated 6-digit HS product classification (HS6). The working sample covers 2,611,700 importer-exporter-HS6 triads representing US $9.62 trillion of trade. Country characteristics (GDP, population) come from the World Development Indicators; geographical variables from CEPII.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Empirical Regularities Addressed&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In a standard gravity decomposition of bilateral trade, the elasticities of total trade value with respect to importer and exporter GDP are both approximately 1.1 and the distance elasticity is approximately −0.81. Decomposing total value into its extensive, quantity, and price margins reveals: (i) the extensive margin of exporters rises strongly with exporter GDP (elasticity 0.76) but the corresponding importer extensive margin is much smaller (0.34), contrary to what the Eaton-Kortum (2002) model predicts; (ii) the unit-value margin rises with both importer GDP per capita (elasticity approximately 0.13 in product-level regressions controlling for exporter-product fixed effects) and exporter GDP per capita (elasticity approximately 0.22 controlling for importer-product fixed effects); (iii) there is no significant interaction between importer and exporter per capita income in bilateral trade values (coefficient 0.002, statistically insignificant), rejecting the Linder-type prediction from one-dimensional quality models that rich countries disproportionately sell to other rich countries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Building on the Ricardian EK framework with a continuum of varieties, CES aggregation, and perfect competition, the paper introduces two dimensions of quality:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Vertical quality&lt;/strong&gt; (q) complements quantity: as spending on a variety increases, both physical quantity and vertical quality rise. This drives the positive relationship between importer per capita income and unit values, because buyers in richer (higher-wage) countries optimally demand higher vertical quality.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Horizontal quality&lt;/strong&gt; (Q) perfectly substitutes for quantity and is determined by the producing country&amp;rsquo;s endowment of intermediates per worker. Because a better-equipped worker produces higher horizontal quality, this dimension rises with the exporter&amp;rsquo;s wage, explaining why richer countries charge higher unit values.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The model uses Fréchet-distributed productivities as in EK. Despite the non-homothetic intricacies introduced by the two quality dimensions, the trade-share equation is identical to EK&amp;rsquo;s homothetic formulation, and the welfare formula takes the standard ACR form with the elasticity of real income with respect to the home trade share equal to −1/(α̃θ).&lt;/p&gt;
&lt;p&gt;To accommodate the extensive margin, the paper introduces stochastic minimum shipment sizes: small-value flows are observed probabilistically, generating zeros in the trade matrix. Products are treated as bundles of varieties, and the number of varieties per product follows a discretized Weibull distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Estimation and Key Parameter Values&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Multilateral resistance terms (Φ) are estimated from bilateral trade flow regressions. Using product-level unit values and wages/Φ estimates, the authors estimate three structural parameters: γ = 0.13 (cost elasticity of vertical quality, governing how spending splits between quantity and price), ν = 0.22 (elasticity of horizontal quality with respect to intermediate use), and θ = 4 (Fréchet shape parameter, calibrated from the literature as it is imprecisely identified from prices). From IV regressions of product-level spending on unit values — instrumenting a given destination&amp;rsquo;s price with the same exporter&amp;rsquo;s average price to other destinations — the implied demand elasticity with respect to price is −2.83, and the corresponding β (governing the distribution of the structural error across varieties) is estimated at 0.65. Three shipment-size parameters (λ₁ = 2.26×10⁻⁷, λ₂ = 0.042, λ₃ = 0.48) are fitted to match the observed bilateral extensive margins (R-squared 0.79).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A simulation of five million varieties, aggregated into approximately 3,807 traded products, reproduces the key margins of trade in the data. The model with only seven parameters (γ, ν, θ, β, λ₁, λ₂, λ₃) captures: (i) the positive relationship between unit values and both importer and exporter per capita income; (ii) the concave relationship between GDP and the extensive margin (leveling off for large countries); (iii) the standard gravity elasticities of bilateral trade on GDP and distance. Two discrepancies remain: the model understates the effects of per capita income on the extensive margin (shifting them toward the quantity margin), and it does not generate the Alchian-Allen distance effect on unit values.&lt;/p&gt;
&lt;p&gt;Disaggregation to the level of 15 HS sections confirms the pooled results: 80% of HS6 products show positive importer-income elasticities and 94% show positive exporter-income elasticities of unit values. Although section-specific γ and ν estimates are formally rejected to be equal (χ²(28) = 1,249 against a critical value of 41), the implied improvement in model fit is only 3.6% of the total sum of squared residuals, vindicating the aggregate approach.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What are the two dimensions of quality in the model, and why does the paper require both?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: Horizontal quality (Q) substitutes perfectly for physical quantity and is valued identically by all users — a worker equipped with more intermediates produces goods of higher horizontal quality. Vertical quality (q) complements quantity: the isobeneft surface requires a CES aggregator with ρ &amp;lt; 0 (elasticity of substitution below one between effective quantity and vertical quality) so that a buyer spending more on a variety optimally raises both the amount and the vertical quality dimension. One dimension of quality is insufficient: with a single dimension, if rich countries both produce and prefer higher-quality goods, market shares of rich exporters should be systematically higher in rich importing destinations than in poor ones. No such interaction is found in the data (the interaction coefficient in bilateral trade regressions is 0.002, statistically insignificant), requiring the two-dimension structure to break the link.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How does the model predict that unit values rise with importer per capita income?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: The optimal spending on vertical quality for any variety is governed by the elasticity γ/(1+γ): as a buyer&amp;rsquo;s wage rises, spending on any variety rises, and the fraction of that spending that goes to higher unit values (price) has elasticity γ/(1+γ) with respect to spending. The structural elasticity of unit values with respect to the importer wage is δ_{w,M} = γ/(1+γ). With γ = 0.13, this equals approximately 0.115, close to the empirically estimated importer per capita income elasticity of 0.12–0.13 from product-level regressions with exporter-product fixed effects. Richer importers also face a higher price index Φ (lower competition), contributing an additional negative elasticity δ_{Φ,M} = −1/[θ(1+γ)] on Φ, reinforcing the unit-value–income gradient.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the model predict that unit values rise with exporter per capita income?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: In the model, horizontal quality Q is produced by equipping workers with intermediates: Q = m^ν where ν &amp;gt; 0 and m is intermediate use per worker. Because m is determined by the optimal factor mix and rises with the wage (w), horizontal quality rises endogenously with the exporter&amp;rsquo;s wage. The structural elasticity of unit values with respect to the exporter wage is δ_{w,X} = ν/(1+γ). With ν = 0.22 and γ = 0.13, this equals approximately 0.195, consistent with the estimated exporter per capita income elasticity of 0.20–0.22. The exporter Φ contributes δ_{Φ,X} = ν/[θ(1+γ)] &amp;gt; 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: Why does the model still deliver a standard gravity equation despite the non-homothetic quality structure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The key result is that the trade-share equation — the fraction of varieties that destination n sources from country i — takes the same Fréchet-based form as in Eaton-Kortum (2002): πni = Ti(dni C̃i)^{−θ} / Φn, where C̃i = Ci/Qi is the horizontal-quality-adjusted unit cost. Although quality is non-homothetic in individual variety demands, the distribution of the maximum effective inverse cost across sources conditional on country i being the cheapest is independent of the source country — the key aggregation property inherited from the Fréchet structure. As a result, country i&amp;rsquo;s share in total absorption by n equals its share in the number of varieties sourced from i, and aggregate bilateral trade flows satisfy a standard log-linear gravity equation. The gains from trade are given by the standard ACR formula Un = constant × (Tn d^{−θ}_{nn} / πnn)^{1/(α̃θ)}.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How does the paper handle the extensive margin empirically, and what does the data show?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The extensive margin is defined as the fraction of HS6 product categories that destination n imports from source i. In panel regressions, the elasticity of the extensive margin with respect to exporter GDP is 0.76 (much less than the total trade value elasticity of 1.16, implying an intensive margin of 0.38), while the importer extensive margin elasticity is 0.34. Both elasticities display a concave relationship with GDP in levels: the range of products both exported and imported expands rapidly for small countries but levels off at high GDP. The standard EK model predicts an importer extensive margin elasticity that is zero or negative (larger importers source more domestically), inconsistent with the positive 0.34 found in the data.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How does the paper model the extensive margin, and what are the parameter estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The extensive margin arises from stochastic minimum shipment sizes. Trade flows for individual varieties exist according to model-implied values, but are only observed in a given year if the flow exceeds the stochastic shipment size drawn from an exponential distribution H(x) = 1 − exp(−λ₁x). Products are treated as bundles of varieties drawn from a discretized Weibull distribution f(M) parameterized by λ₂ and λ₃. The three parameters are estimated by minimizing squared differences between model-predicted and observed bilateral extensive margins across all country pairs. The estimates are λ₁ = 2.26×10⁻⁷ (SE 1.21×10⁻⁷), λ₂ = 0.042 (SE 0.020), λ₃ = 0.48 (SE 0.10), with an R-squared of 0.79. These imply a mean shipment size of $4.42 million (median $3.07 million) and a mean number of varieties per product of 1,597 (median 344).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How is β estimated, and what does it govern?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: β governs how spending is distributed across varieties — specifically, the elasticity of spending on a variety with respect to its effective inverse cost. It also governs the elasticity of physical demand with respect to price: the model implies that log spending on a product equals log value minus (β/(1−β)) times log unit price. To identify β, the authors regress product-level trade values on unit prices, instrumenting a given importer&amp;rsquo;s price for a product with the same exporter&amp;rsquo;s average price of that product to all other destinations. The IV estimate of −β/(1−β) is −1.83 (SE 0.019), compared with the OLS estimate of −0.25, indicating substantial simultaneity bias. The implied price elasticity of demand is −2.83 and the implied β is 0.65.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the price regularities documented at the product level, and how does the two-quality model explain price overlaps across country pairs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: Regressions of unit values at the importer-exporter-HS6 level show that individual exporters charge systematically higher prices to richer importers for the same product (elasticity 0.12 with exporter-product fixed effects), and that buyers pay systematically higher prices for products from richer exporters (elasticity 0.22 with importer-product fixed effects). A one-dimensional quality model would predict no overlap between prices charged by a rich and a poor exporter across destinations: even Japan&amp;rsquo;s lowest-priced sales should exceed Malaysia&amp;rsquo;s highest-priced sales for the same product. Back-of-envelope calculations using the regression coefficients predict a Malaysian product should sell in Norway at 0.3 log points above a Japanese product in Pakistan — systematic overlap. The paper documents this overlap in the raw data using two HS6 examples: motorcycle hubs (HS871493) and washing machines under 10kg (HS845011). The two-quality model resolves this by making horizontal quality an exporter attribute that raises prices proportionally but leaves market share determination to the EK gravity equation, allowing rich and poor country exporters to coexist in all markets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does the model address the absence of a Linder-type income interaction effect in aggregate trade flows?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: In one-dimensional quality models (e.g., Fajgelbaum, Grossman, and Helpman 2011), rich countries produce high-quality goods appealing primarily to high-income households, so rich-to-rich bilateral trade flows should be systematically higher than rich-to-poor flows. A gravity regression of bilateral trade values on importer and exporter fixed effects, distance, and an interaction of log importer GDP per capita × log exporter GDP per capita yields a coefficient of 0.0020 (SE 0.016), which is small and statistically insignificant. The two-quality model is consistent with this: horizontal quality enters as an exporter fixed effect (it affects prices proportionally for all destinations) and the demand system is structured so that all destinations spend the same share of absorption on a given source&amp;rsquo;s varieties, regardless of income level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How robust are the results to disaggregation by industry?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: For each of 4,786 HS6 products with more than 20 country pairs, separate price regressions are estimated. Across all products, 80% have positive importer per capita income elasticities and 94% have positive exporter per capita income elasticities. The 15 broad HS sections account for only 10% of the variance in importer-income elasticities and 13% of the variance in exporter-income elasticities across HS6 products, suggesting high within-industry heterogeneity. A quasi-likelihood ratio test formally rejects equal γ and ν across sections (χ²(28) = 1,249 against a critical value of 41), but the reduction in total sum of squared residuals from allowing section-specific parameters is only 3.6%, and the R-squared increases from 0.353 to 0.376. The authors conclude the aggregate approach is vindicated for the purpose of characterizing common patterns.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the model simulate trade, and how many products does it generate?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: The simulation draws productivities for 5,000,000 varieties across 50 countries using the estimated model parameters (θ = 4, γ = 0.13, ν = 0.22, β = 0.65, and the estimated gravity fixed effects). For each variety, the cheapest source is determined; trade values and unit values are computed using equations (28) and (29); censoring due to stochastic shipment sizes generates zeros. Varieties are aggregated into products by partitioning sequentially using the estimated Weibull distribution. The simulation yields 3,842 total simulated products of which 3,807 are traded between at least one country pair, compared with 4,973 HS6 products in the COMTRADE data. A Monte Carlo exercise confirms that the estimation procedure recovers parameter values close to the true values when applied to simulated data.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Vertical quality (q):&lt;/strong&gt; A dimension of quality that complements physical quantity. In the paper&amp;rsquo;s utility specification, vertical quality and effective quantity enter a CES aggregator with elasticity of substitution below one (ρ &amp;lt; 0). A buyer spending more on a variety raises both quantity and vertical quality simultaneously, in proportions governed by γ. Vertical quality rises endogenously with the importer&amp;rsquo;s wage because higher-income buyers optimally demand it; it is the mechanism behind the positive relationship between importer per capita income and unit values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Horizontal quality (Q):&lt;/strong&gt; A dimension of quality that substitutes perfectly for physical quantity (enters the aggregator multiplicatively with quantity). All buyers value an increase in Q equivalently regardless of income level, so it does not generate Linder-type income-matching in trade flows. Horizontal quality is produced by the exporter: better-equipped workers produce higher horizontal quality (Q = m^ν), so it rises with the exporter&amp;rsquo;s wage. It is the mechanism behind the positive relationship between exporter per capita income and unit values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Extensive margin (E_{ni}):&lt;/strong&gt; In the paper&amp;rsquo;s empirical framework, the fraction of HS6 product categories that destination n imports from source i in a given year. The paper shows this margin rises with both importer and exporter size but in a concave, nonlinear fashion. It is generated in the model by stochastic minimum shipment sizes that probabilistically censor small-value variety flows.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intensive margin:&lt;/strong&gt; Total bilateral trade value divided by the extensive margin. The paper further decomposes the intensive margin into a quantity margin and a unit-value (price) margin. The paper&amp;rsquo;s key contribution is to generate all three margins jointly from one parsimonious framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stochastic minimum shipment size:&lt;/strong&gt; A modeling device, drawn from distribution H(x) (parameterized as exponential with parameter λ₁), that determines whether a given variety&amp;rsquo;s trade flow is observed in any year. If the annual flow x_{ni}(ω) exceeds the drawn minimum size x̄, the shipment is observed with certainty; otherwise, it is observed with probability x_{ni}(ω)/x̄. This mechanism generates the concavity of the extensive margin with respect to GDP without departing from the standard gravity framework.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective inverse cost (v_{ni}):&lt;/strong&gt; Defined as Z_i(ω)/[C̃_i d_{ni}], where Z_i is country i&amp;rsquo;s Fréchet-distributed productivity for variety ω, C̃_i = C_i/Q_i is the horizontal-quality-adjusted unit cost, and d_{ni} is the iceberg trade cost. A buyer in n sources variety ω from the country maximizing v_{ni}. This formulation ensures that horizontal quality differences across exporters are absorbed into the effective cost, preserving the EK aggregation result.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;γ (vertical quality cost elasticity):&lt;/strong&gt; The parameter governing how spending on a variety divides between physical quantity and vertical quality. Spending has elasticity 1/(1+γ) with respect to quantity and elasticity γ/(1+γ) with respect to unit price. The paper estimates γ = 0.13 from product-level unit value regressions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;ν (horizontal quality elasticity):&lt;/strong&gt; The parameter governing how horizontal quality rises with intermediate use per worker: Q = m^ν. Combined with γ, it determines the structural elasticity of unit values with respect to exporter per capita income: δ_{w,X} = ν/(1+γ). The paper estimates ν = 0.22.&lt;/p&gt;</description></item><item><title>The Optimal Taxation of Couples</title><link>https://macropaperwarehouse.com/papers/the-optimal-taxation-of-couples/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-optimal-taxation-of-couples/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; What is the optimal joint nonlinear earnings tax schedule for married couples? How should one spouse&amp;rsquo;s marginal tax rate depend on the other&amp;rsquo;s earnings? When is individual earnings-based (separable) taxation optimal versus family-income-based taxation, and what determines the sign and magnitude of &amp;ldquo;jointness&amp;rdquo; — the dependence of one spouse&amp;rsquo;s marginal tax on the other&amp;rsquo;s earnings?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Model.&lt;/strong&gt; The paper studies a canonical unitary household model in which each couple consists of two spouses who jointly maximize utility subject to a joint budget constraint. Spousal productivities are drawn from a joint distribution F with arbitrary dependence structure. The planner maximizes a weighted sum of couples&amp;rsquo; utilities, with Pareto weights that are decreasing functions of productivities. Utility takes a quasi-linear form in consumption and labor disutility with constant labor supply elasticity parameter γ (implying earnings elasticity γ/(γ-1)). The tax problem is equivalent to a two-dimensional mechanism design problem in which the planner chooses allocations as functions of reported productivity types, subject to incentive compatibility and budget feasibility. Because spousal productivities are two-dimensional, the problem is a multi-dimensional screening problem whose properties are poorly understood in general.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodology.&lt;/strong&gt; The authors proceed in two directions. First, they establish conditions under which the first-order approach (FOA) — restricting attention to local incentive constraints — is valid in this bi-dimensional setting. They show, for the special case of the benchmark economy (symmetric, independent types, separable Pareto weights), that FOA validity is equivalent to convexity of a certain transformation of the value function, and derive necessary and sufficient conditions that are strictly weaker than their unidimensional analogs — so the FOA is more likely to hold in two dimensions than in one. For the general economy, they invoke an Implicit Function Theorem argument in Hölder space to show that the FOA holds for Pareto weights sufficiently close to utilitarian (i.e., when the planner is not &amp;ldquo;too redistributive&amp;rdquo;). Second, assuming FOA validity, they characterize optimal taxes via a second-order nonlinear PDE. Since this PDE cannot be solved analytically in general, they apply the Coarea Formula to derive closed-form expressions for conditional averages of optimal tax distortions over various subsets of the type space, expressed entirely in terms of structural primitives (labor supply elasticities, Pareto weights, and elasticities of the joint distribution of productivities).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Average distortions and assortativeness.&lt;/strong&gt; Average optimal distortions on married individuals are ranked by the degree of positive quadrant dependence (PQD) in spousal productivities: more assortative matching implies higher optimal tax rates. Optimal distortions on married individuals are always weakly lower than on single individuals with the same productivity, same elasticities, and same marginal productivity distribution — strictly so unless matching is perfectly positively assortative. The intuition is that when couples pool resources, intra-family redistribution already occurs, and distortionary taxation crowds this out; more random matching produces more within-family redistribution, reducing the marginal social value of public redistribution through taxation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimality of separable (individual earnings-based) taxation.&lt;/strong&gt; In the benchmark economy with independent types, optimal taxes are exactly separable (individual earnings-based), and optimal distortions on married individuals equal precisely one-half of those on comparable single individuals. With separable Pareto weights and independent types more generally, taxes remain separable. Once types are positively dependent, however, the planner optimally introduces jointness even under separable social weights.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Jointness and tail (in)dependence.&lt;/strong&gt; Optimal jointness — whether one spouse&amp;rsquo;s marginal tax rate increases or decreases in the other&amp;rsquo;s earnings — depends critically on tail dependence of the joint productivity distribution, captured by the copula and survival copula elasticities. For right-tail dependent distributions (so that extremely productive individuals are likely to be matched with extremely productive partners), positive jointness is optimal at the top (raising taxes on high earners whose partners are also high earners) and negative at the bottom. For right-tail independent distributions (such as the Gaussian copula, which is tail-independent for any finite ρ), the distortion-reducing motive dominates: optimal jointness is negative at the top and positive at the bottom, conditional on standard convergence conditions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Primary vs. secondary earners.&lt;/strong&gt; The secondary earner (lower-productivity spouse) faces on average higher optimal distortions than the primary earner when the planner values redistribution to couples with a very unproductive spouse (α(w,0) ≥ 1), because the phasing out of transfers targeted to such couples generates high marginal tax rates on secondary earners. Family earnings-based taxation is optimal only when total family productivity and relative spousal productivity are independent, and when social weights are measurable only with respect to total family output.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Restricted taxation.&lt;/strong&gt; Optimal distortions under any of the three restricted tax regimes (anonymous, separable, family earnings-based) exactly equal the relevant conditional average of unrestricted optimal distortions. This establishes that the welfare difference between the restricted and unrestricted optimum stems solely from the planner&amp;rsquo;s inability to tag taxes to individual productivity types within the restricted class.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Quantitative Findings (calibrated to 2020 CPS data on U.S. married couples, ages 25-65, worked ≥ 20 weeks).&lt;/strong&gt; Spousal productivities are positively but not perfectly dependent, with Kendall&amp;rsquo;s tau = 0.21 and Pearson correlation = 0.25 for productivities (0.21 for earnings). The joint distribution is well approximated by a Gaussian copula (ρ = 0.33) with Pareto-lognormal marginals (a = 2.95, Gini = 0.31). The Gaussian copula is tail-independent, so consistent with analytical results, optimal jointness is positive for low earners and negative for high earners (the latter arising at earnings above approximately $8.5 million in the benchmark specification). The quantitative magnitude of optimal jointness is small — marginal taxes for one spouse change by at most several percentage points as a function of the other spouse&amp;rsquo;s earnings. Individual earnings-based taxation provides a good approximation to the unrestricted optimum. By contrast, family earnings-based (joint) taxation is a poor approximation in all specifications, with marginal taxes on family income varying substantially with the earnings share of the secondary earner, and this conclusion holds even when Pareto weights explicitly favor family earnings-based taxation (k = 0 case). The implied top marginal tax rate converges toward approximately 55 percent (corresponding to limiting distortion of ≈1.35 = 1/γa with γ = 0.25, a = 2.95) but the convergence is slow, so optimal marginal rates remain substantially below this limit even at earnings of $300,000.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the mechanism design formulation, and why is FOA validity a key concern in the bi-dimensional setting?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The planner&amp;rsquo;s problem is cast as a direct mechanism in which couples report their two-dimensional productivity type (w1, w2) and receive allocations (consumption, earnings). Incentive compatibility requires that no couple prefers to misreport. In one-dimensional models (Mirrlees 1971), restricting attention to local incentive constraints (the FOA) yields the standard ODE characterization of optimal taxes and is valid for a broad class of primitives. In two dimensions, solutions to multi-dimensional screening problems generically display &amp;ldquo;bunching&amp;rdquo; (Rochet-Choné 1998, Armstrong 1996), and the FOA may fail. The key difference exploited in this paper is the absence of participation constraints in the public finance setting, which eliminates the main force driving FOA failure in industrial organization models.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What are the necessary and sufficient conditions for FOA validity in the benchmark economy with independent types?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (Proposition 1) In the benchmark economy (symmetric, independent types, separable Pareto weights), FOA validity is equivalent to the condition that x·(1 + λ̃(x^{-γ})/2) is increasing in x, where λ̃(t) = [∫_t^∞ (1-α̃(w))g(w)dw] / (γtg(t)). The unidimensional analog requires x·(1 + λ̃(x^{-γ})) to be increasing. Since the bi-dimensional condition multiplies λ̃ by 1/2 rather than 1, the set of primitives satisfying it is strictly larger: every (G, α̃, γ) for which the unidimensional FOA holds also satisfies the bi-dimensional condition, but not vice versa. Economically, the FOA holds as long as the planner is not &amp;ldquo;too redistributive&amp;rdquo; — i.e., Pareto weights on low types are not so high as to violate these monotonicity conditions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the Coarea Formula result (equation 27) and why is it the central technical tool?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Given that the optimality conditions form a PDE system that cannot generally be solved pointwise, the authors integrate the optimality condition (equation 20) over subsets of the type space defined by level sets of an arbitrary function Q(w1, w2). The Coarea Formula allows them to express the result as: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions (e.g., Q = w_i, Q = max{k_1 w_1, k_2 w_2}, Q = R(w) for total family productivity, Q = I(w) for relative productivity), the formula delivers closed-form expressions for distinct conditional averages of optimal distortions, all expressed in terms of exogenous primitives. This contrasts with variational approaches (Golosov et al. 2014, Spiritus et al. 2022) that express optimal taxes in terms of endogenous moments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How do optimal distortions on married individuals compare to those on single individuals, and what is the exact quantitative relationship in the independent-types benchmark?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (Proposition 4) In the benchmark economy with independent types, the optimal distortion on spouse i with productivity t equals exactly one-half of the optimal distortion λ^{sng,&lt;em&gt;}(t) in the corresponding unidimensional economy: λ&lt;/em&gt;&lt;em&gt;i(t, w&lt;/em&gt;{-i}) = (1/2)λ^{sng,*}(t), and this is independent of the partner&amp;rsquo;s productivity w_{-i}. The intuition: the deadweight cost of taxing any individual depends only on her own characteristics (elasticity, productivity, density), not on whom she is married to. However, the redistributive benefit of taxation depends on matching — when matching is random, every high-productivity individual is married on average to an average person, so the incremental social benefit of extracting tax revenue from her is exactly half of what it would be if she were single (since half the benefit goes to a partner who is already average). More generally (Proposition 5 and Corollary 2), average distortions are weakly lower for married individuals than for singles as long as matching is not perfectly positively assortative.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is average jointness and how is it measured?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Average jointness J_i(t) is defined as the ratio of average distortions on spouse i conditional on the partner having above-t productivity to average distortions conditional on the partner having below-t productivity, minus one. Jointness is positive if the marginal tax rate on spouse i is on average increasing in the partner&amp;rsquo;s productivity, negative if decreasing, and zero for separable (individual earnings-based) taxes. The paper characterizes jointness through auxiliary functions H_i(t) (conditional distortion relative to unconditional average), whose behavior is determined by the copula elasticities η_i and survival copula elasticities η̄_i — the percentage change in the conditional quantile of the partner&amp;rsquo;s productivity when one spouse&amp;rsquo;s productivity quantile increases by 1%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What is the role of tail dependence in determining the sign of optimal jointness?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (Proposition 7, Lemma 4) For right-tail dependent distributions — where the probability that an extremely productive person is married to an extremely productive partner remains bounded away from zero as productivity → ∞ — the redistributive benefit of positive jointness (targeting taxes to the richest couples) dominates its distortionary cost, so optimal average jointness is positive at the top. For right-tail independent distributions (where this probability converges to zero), the distortionary cost of positive jointness dominates, and optimal jointness is negative at the top. Exactly symmetric logic applies at the bottom using the survival copula and left-tail dependence. The bivariate lognormal/Gaussian copula is right-tail independent for any finite correlation ρ, while a distribution with perfect assortative matching in the tails would be right-tail dependent. The speed of convergence to tail independence, measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1), also matters: slower convergence (κ closer to 1) implies smaller optimal jointness under tail independence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: When is individual earnings-based (separable) taxation optimal, and when is family earnings-based taxation optimal?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (Propositions 4, 8, Corollary 1) Individual earnings-based taxation is optimal when Pareto weights are separable and spousal productivities are independent. When types are positively dependent, the planner introduces jointness even with separable social weights, because conditioning taxes on both spouses&amp;rsquo; earnings facilitates redistribution across couple types. Family earnings-based taxation is optimal when: (i) social weights are measurable only with respect to total family productivity r (i.e., the planner cares only about total family output, not the identity or relative productivity of individual spouses), and (ii) total family productivity r and relative spousal productivity ι are statistically independent. When r and ι are not independent, even a planner with an intrinsic preference for family earnings-based taxation will find it optimal to depart from it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What does Proposition 9 (Corollary 7) establish about the relationship between restricted and unrestricted optimal taxes?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: (Corollary 7) For each restricted tax regime (anonymous, individual earnings-based, family earnings-based), the optimal distortions under the restricted tax equal the corresponding conditional average of unrestricted optimal distortions. Specifically: optimal individual earnings-based distortions equal E[λ*_i | w_i = t] (the average unrestricted distortion at productivity t); optimal family earnings-based distortions equal E[weighted average of λ*_i | R(w) = r]. This reveals that the unrestricted and restricted planners solve the same tradeoff between redistribution benefits and distortionary costs, but the restricted planner must apply a single tax rate to groups of couples that cannot be distinguished under the restriction. The welfare loss from restriction comes entirely from this forced bunching, not from a different objective or a different first-order condition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What do the quantitative results say about the goodness of approximation of separable vs. family earnings-based taxation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: In the calibrated benchmark economy (Gaussian copula, ρ = 0.33, Pareto-lognormal marginals, γ = 0.25, m = 0.35), optimal jointness is quantitatively small — the marginal tax rate on one spouse changes by at most several percentage points as a function of the other spouse&amp;rsquo;s earnings over the plotted range. Individual earnings-based (separable) taxation therefore provides a good approximation to the unrestricted optimum across all specifications considered. By contrast, family earnings-based taxation is a poor approximation: the marginal tax rate on family income varies substantially with the earnings share of the secondary earner (the ratio min{y1,y2}/(y1+y2)), and the deviation from the optimal unrestricted tax is large. This finding is robust across different Pareto weight specifications (m ∈ {0.35, 1.5}, k ∈ {0, 1, 2}) and holds even when k = 0, i.e., when the planner&amp;rsquo;s social weights inherently prefer family earnings-based taxation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the calibration results relate to the analytical comparative statics predictions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The calibration validates the analytical predictions quantitatively. The analytical result (Proposition 5) that optimal distortions in the U.S. lie between those under random matching (1/2 of single-individual rates) and perfect assortative matching (same as single-individual rates) is confirmed: optimal tax rates for married individuals in the calibrated economy lie between the independence and perfect-dependence gray-line benchmarks in Figure 6. The analytical prediction (Proposition 7) that the Gaussian copula implies positive jointness at the bottom and negative at the top is confirmed, with the switch to negative jointness occurring above approximately $8.5 million in earnings. The slow convergence of the Gaussian copula to tail independence (κ = (1+ρ)/2 ≈ 0.665) explains the small magnitude of optimal jointness relative to the FGM copula (which has κ = 1/2, faster convergence, and exhibits more pronounced jointness as shown in the appendix). The analytical limiting distortion of E[λ*_i | w_i = t] → 1/(γa) ≈ 1.35 as t → ∞ (corresponding to a top marginal tax rate of approximately 55 percent) is confirmed, though convergence is slow and rates remain substantially below this limit at $300,000 in earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the paper relate to and advance beyond Kleven, Kreiner, and Saez (2007/2009)?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Kleven et al. (2009) studied couples taxation but avoided the multi-dimensional screening complexity by restricting the secondary earner to binary labor supply. The working paper by Kleven et al. (2007) considered the continuous setting but noted the difficulty of the FOA and derived several special-case insights. The current paper extends KKS in several systematic ways: it provides the first formal proof that the FOA conditions are strictly weaker in bi-dimensional than unidimensional settings; generalizes the formula for average distortions to arbitrary joint distributions (not just independent types); characterizes optimal jointness under positive dependence (not just independence); establishes the role of tail (in)dependence in determining the sign of jointness; compares optimal taxes for married vs. single individuals; and derives conditions under which family earnings-based or individual earnings-based taxation is optimal. It also shows that the KKS result on jointness sign (determined by the third derivative of the SWF) applies only under independence and can be reversed even with arbitrarily small positive dependence, as demonstrated with the Gaussian copula example.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;First-Order Approach (FOA) in multi-dimensional taxation.&lt;/strong&gt; The restriction of the mechanism design problem to local incentive constraints only — dropping global (non-local) incentive compatibility conditions and solving a relaxed problem. In the paper&amp;rsquo;s context, FOA validity is equivalent to convexity of a specific transformation vx* of the optimal utility function in the &amp;ldquo;linearized&amp;rdquo; type space X. The paper shows that the condition for FOA validity is strictly weaker (i.e., a strictly larger set of primitives satisfies it) in the bi-dimensional couples setting than in the corresponding unidimensional model, because the absence of participation constraints eliminates the main force driving FOA failure in industrial organization multi-dimensional screening.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;em&gt;Optimal tax distortion λ&lt;/em&gt;_i(w).&lt;/em&gt;* The monotone transformation of the marginal tax rate defined by λ_i(w) = [∇_i T(y(w))] / [1 − ∇_i T(y(w))], where ∇_i T is the partial derivative of the tax function with respect to spouse i&amp;rsquo;s earnings. This transformation maps [−∞, ∞] marginal tax rates to (−1, ∞) distortions. The optimal tax schedule is characterized by the function λ* satisfying a system of PDEs; the paper studies conditional averages of λ* rather than λ* pointwise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Coarea Formula.&lt;/strong&gt; A mathematical result from geometric measure theory that, in this context, converts an integral of the PDE optimality condition over a two-dimensional domain into an integral over the level sets of an arbitrary function Q(w). Applied to equation (20), it yields: E[Σ_i λ*_i γ_i (∂lnQ/∂lnw_i) | Q=t] = [1 − E[α|Q≥t]] / [−∂ln P(Q≥t)/∂ln t]. By choosing different Q functions, the formula delivers conditional averages of optimal distortions over different subsets of the type space, all in terms of exogenous primitives. This is the paper&amp;rsquo;s principal analytical tool for characterizing optimal taxes without solving the PDE explicitly.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Jointness (positive/negative).&lt;/strong&gt; The dependence of the optimal marginal tax rate on one spouse&amp;rsquo;s earnings on the other spouse&amp;rsquo;s earnings. Taxes are positively jointed at w if ∂²T/∂y_1∂y_2 &amp;gt; 0 (so raising one spouse&amp;rsquo;s earnings increases the marginal tax rate on the other); negatively jointed if this cross-partial is negative; disjointed (separable) if it is zero. Average jointness J_i(t) at productivity t is measured as the ratio of conditional average distortions above and below the partner&amp;rsquo;s productivity threshold, minus one. Optimal jointness is the paper&amp;rsquo;s primary policy object for understanding how taxes on one spouse should respond to the other&amp;rsquo;s earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Copula and survival copula elasticities (η_i, η̄_i).&lt;/strong&gt; Defined as η_i(t) = ∂ln C(u)/∂ln u_i and η̄_i(t) = ∂ln C̄(u)/∂ln ū_i, where C is the copula of the joint productivity distribution, C̄ is the survival copula, and u_i = G_i(t_i), ū_i = 1−G_i(t_i) are the corresponding quantiles. These elasticities measure the percentage change in the conditional quantile of the partner&amp;rsquo;s productivity when one spouse&amp;rsquo;s productivity quantile increases by 1%. They quantify the additional distortionary cost introduced by jointness relative to a separable tax schedule: smaller elasticities (stronger dependence) correspond to larger distortionary costs of jointness at the boundaries of probability mass.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tail (in)dependence.&lt;/strong&gt; A joint distribution F is right-tail dependent if lim_{t→∞} P(w_{-i}≥t | w_i≥t) &amp;gt; 0, i.e., extremely productive individuals have a positive probability of being matched with equally extreme partners. It is right-tail independent if this limit is zero. The speed of convergence to tail independence is measured by κ = lim_{u→0} ln(u)/ln(C(u,u)) ∈ [1/2, 1). Tail dependence determines the sign of optimal average jointness in the tails: right-tail dependence favors positive jointness at the top; right-tail independence favors negative jointness at the top. The Gaussian copula is right-tail independent for any finite ρ; a perfectly assortative matching distribution is right-tail dependent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Positive quadrant dependence (PQD) order.&lt;/strong&gt; A partial ordering on joint distributions with the same marginals: F^b ≥_{PQD} F^a if F^b(w) ≥ F^a(w) for all w, equivalently if Cov(φ_1(w_1), φ_2(w_2)) ≥ 0 for any two increasing functions. The paper uses this order to rank economies by the &amp;ldquo;assortativeness&amp;rdquo; of matching, and shows that optimal average distortions are monotone in this order (Proposition 5): more assortative matching implies weakly higher optimal tax distortions on each married individual.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pareto-lognormal (PLN) distribution.&lt;/strong&gt; Used in the calibration to model the marginal distribution of spousal productivities. Defined as G(t) = Φ((ln t − μ)/σ) − a·exp(aμ + a²σ²/2)·Φ((ln t − μ)/σ − aσ), parameterized by location μ, scale σ, and tail parameter a. The PLN family has a lognormal body and a Pareto tail with tail parameter a, making it suitable for capturing the empirical finding of a thin left tail (implying optimal marginal taxes approaching zero as earnings → 0) and a thick right tail (implying a positive limiting marginal tax rate of approximately 1/(1 + 1/(γa)) as earnings → ∞).&lt;/p&gt;</description></item><item><title>The Origins and Control of Forest Fires in the Tropics</title><link>https://macropaperwarehouse.com/papers/the-origins-and-control-of-forest-fires-in-the-tropics/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-origins-and-control-of-forest-fires-in-the-tropics/</guid><description>&lt;p&gt;This paper studies the economics of illegal tropical forest fires in Indonesia, framed as a modern counterpart to Pigou&amp;rsquo;s canonical externality example of sparks from railway engines. The central research question is whether private firms adjust their fire-setting behavior depending on the degree to which the costs of fire spread fall on themselves versus others, and what enforcement architecture shapes that adjustment.&lt;/p&gt;
&lt;p&gt;The empirical setting is Indonesia&amp;rsquo;s national forest estate, where palm oil and wood fiber concession holders use fire as a cheap land-clearance method — burning primary forest costs 44–70% less than mechanical clearance — despite the practice being illegal. The paper assembles a novel dataset of 107,334 fires across Indonesia&amp;rsquo;s major forested islands from October 2000 to January 2016, constructed from NASA MODIS daily satellite hotspot data (1 km resolution, four flyovers per day). Fire ignitions and spread paths are traced by linking contiguous pixels burning on adjacent days. This fire data is merged with geocoded concession boundaries (logging, palm oil, wood fiber), land-use classifications (protected forest, unleased productive forest, areas outside the forest estate), annual deforestation data from Hansen et al. (2013) at 30 m resolution, daily wind speed data from NOAA NCEP-DOE Reanalysis 2 interpolated to each 1 km pixel, and data on firms investigated by the Indonesian government following the 2015 fires. The main analytical sample focuses on the 39,077 fires started inside wood fiber and palm oil concessions.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s identification strategy exploits two intersecting sources of variation: (1) temporal and spatial variation in monthly wind speed, which predicts the probability and extent of fire spread — a one-standard-deviation increase in wind speed (approximately 5 km/hr) increases fire spread area by 287%; and (2) cross-sectional variation in the land-type composition of the area surrounding each ignition pixel, which determines whether spread costs would fall on the fire-setter or on others. The interaction of these two factors identifies whether firms are more cautious about igniting fires on windy days when surrounding land is their own versus when it belongs to others.&lt;/p&gt;
&lt;p&gt;Three main findings emerge. First, fires are systematically human-caused and linked to industrial land clearance. Fires are eight times more likely per hectare in oil palm and wood fiber concessions than in logging concessions. Completely deforesting a 1 km pixel increases the probability of fire ignition in that pixel in the subsequent year by 279%, and this effect reverses in the year after (two years post-deforestation), ruling out natural flammability as the explanation and confirming a deliberate slash-and-burn cycle. Fire use following deforestation falls by approximately 38% in oil palm concessions during district election years, consistent with tighter enforcement when political incentives favor suppression.&lt;/p&gt;
&lt;p&gt;Second, firms partially internalize the externalities from fire-setting. They are significantly less likely to set fires on windy days when surrounding pixels belong to their own concession rather than to others. A buffer zone entirely owned by the same concession holder reduces ignitions by 8–25% at mean wind speed, and by 22–61% at the 95th-percentile wind speed. However, firms treat neighboring concession land and unleased productive forest similarly — suggesting Coasian bargaining between concession holders is not occurring.&lt;/p&gt;
&lt;p&gt;Third, the government&amp;rsquo;s enforcement pattern shapes firm behavior. Using data on firms investigated after the 2015 fires, the paper shows the government disproportionately investigates firms whose fires burned protected areas or high-population-density land, but not those whose fires damaged other private concessions. The relative weights firms place on different land types when deciding whether to ignite fires align closely with this government punishment function, consistent with firms responding to implicit Pigouvian incentives.&lt;/p&gt;
&lt;p&gt;Counterfactual simulations show that broadening enforcement to treat all land types as the government currently treats populated areas would reduce fires by 80%; treating all land like protected forest would reduce fires by 67%. By contrast, fully Coasian property-rights solutions yield only 14% reductions, and tort reform allowing concession holders to recover damages from neighbors yields only 6%.&lt;/p&gt;
&lt;p&gt;Q: What is the core externality problem studied in this paper?
A: Firms use fire as a cheap land-clearance method, but once set, fires risk spreading beyond the igniter&amp;rsquo;s own concession onto land owned by others, creating an uncompensated externality. The decision to use fire rather than mechanical clearance is de facto a decision to impose this spread risk on third parties. The paper asks whether firms adjust this decision depending on the extent to which spread costs fall on themselves versus others, and whether government enforcement shapes that adjustment.&lt;/p&gt;
&lt;p&gt;Q: Why is Indonesia the empirical setting?
A: Indonesia holds a large share of the world&amp;rsquo;s tropical forests and is among the countries most affected by illegal land-clearing fires. The 2015 Indonesian fires alone released approximately 400 megatons of CO2 equivalent, at their peak emitting more daily greenhouse gases than all US economic activity, and caused an estimated 100,000 excess deaths across Indonesia, Malaysia, and Singapore. The palm oil industry in Indonesia and Malaysia, where fire is used extensively, accounted for 4.7% of global CO2 emissions from 1986 to 2016.&lt;/p&gt;
&lt;p&gt;Q: How are fire ignitions and spread identified in the data?
A: The paper starts from NASA MODIS daily hotspot data at 1 km resolution from October 2000 to January 2016. An iterative procedure assigns contiguous pixels burning on adjacent days to the same fire event, with a 1-pixel buffer allowing for spread detection. This yields 176,855 total fires across Indonesia, of which 107,334 remain after restricting to the major forested islands and the forest estate. The procedure may understate single-day spread since pixels burning on the same day are classified as part of the ignition area rather than spread.&lt;/p&gt;
&lt;p&gt;Q: What fraction of fires spread beyond their ignition area, and how much of the spread falls on outsiders?
A: 87% of fires burn for only one day and 89% do not spread beyond their initial ignition area. However, the largest fire in the data spread to cover 466 times its initial area, and the largest single fire burned 764 km2. Across all multi-day fires started inside concessions, 32% of the total land burned outside the initial ignition area is outside the concession where the fire began, quantifying the scale of the local externality.&lt;/p&gt;
&lt;p&gt;Q: How is wind speed used as an identification strategy?
A: Wind speed provides temporal and spatial variation in the probability that a fire will spread. A one-standard-deviation increase in wind speed (approximately 5 km/hr) increases the extent of fire spread by 287%. Because wind varies month to month and across space, while the composition of surrounding land types is fixed in the cross-section, the interaction of wind speed with surrounding land type identifies whether firms are more cautious about igniting fires when spread risk is high and spread costs would fall on their own land versus others&amp;rsquo; land.&lt;/p&gt;
&lt;p&gt;Q: What is the main result on firms&amp;rsquo; internalization of fire spread externalities?
A: Firms are significantly less likely to start fires on windy days when a larger share of the surrounding buffer zone belongs to their own concession. One additional buffer pixel in one&amp;rsquo;s own land decreases ignitions by 0.2–0.7%. A buffer zone entirely owned by the same concession holder reduces ignitions by 8–25% at mean wind speed, and by 22–61% at the 95th-percentile wind speed. This demonstrates that firms take fire spread risk into account when it threatens their own assets, but discount it when spread would damage others&amp;rsquo; land.&lt;/p&gt;
&lt;p&gt;Q: Do firms treat different types of neighboring land differently?
A: Yes. The benchmark category is unleased productive forest, which has the weakest property rights and receives the least de facto government protection. Relative to this benchmark, firms are more cautious about fire spread toward protected forest (national parks and watershed areas) and toward land outside the forest estate (typically villages and smallholders). One additional buffer pixel in protected forest versus unleased productive forest decreases ignitions by 0.9% at mean wind speed and 2.7% at the 95th-percentile wind speed; the deterrent for land outside the forest estate is even stronger at 1.6% and 4.6%, respectively. Firms treat other firms&amp;rsquo; concession land similarly to unleased productive forest, suggesting no effective private enforcement between concession holders.&lt;/p&gt;
&lt;p&gt;Q: What evidence shows fires are tied to intentional land clearance rather than natural ignition?
A: Fires are eight times more likely per hectare in oil palm and wood fiber concessions than in logging concessions, consistent with clear-cutting versus selective logging. Completely deforesting a 1 km pixel increases fire probability in that pixel in the subsequent year by 279%. Crucially, the effect reverses in the second year after deforestation — the pixel becomes less likely to burn than before — which rules out natural flammability as the mechanism and confirms deliberate slash-and-burn timing.&lt;/p&gt;
&lt;p&gt;Q: What does the electoral cycle evidence show about government enforcement?
A: Fires following deforestation fall by approximately 38% in oil palm concessions during district election years relative to the year prior to an election, and bounce back to pre-election levels in the year after. The decline is confined to productive forest zones where conversion is occurring; no electoral cycle appears in protected areas where conversion is already prohibited. This indicates that enforcement is tightened when political incentives are strong, and confirms that these fires are set intentionally and are responsive to government pressure.&lt;/p&gt;
&lt;p&gt;Q: How is the government&amp;rsquo;s de facto punishment function estimated?
A: The paper uses data on firms investigated by the Indonesian Ministry of Forestry following the 2015 fires, matching investigated firms (identified only by initials in the published list) to concession-holder names. A logistic regression of investigation probability on the land-type outcomes of a firm&amp;rsquo;s fires — conditional on total area burned — shows the government is substantially more likely to investigate firms whose fires burned protected areas or high-population-density land, but does not differentially investigate cases where fire damage is largely confined to other private concessions.&lt;/p&gt;
&lt;p&gt;Q: How closely do firm behavior and government enforcement weights align?
A: The relative weights across land types that the government applies in its investigation decisions correspond closely to the relative weights firms apply when deciding whether to ignite fires on windy days. Firms are most deterred by spread risk toward protected forest and populated areas outside the forest estate — the same categories the government prioritizes. Firms are least deterred by spread toward unleased productive forest and other private concessions — the categories the government largely ignores. This alignment is consistent with firms responding to Pigouvian-style implicit incentives generated by the government&amp;rsquo;s enforcement pattern.&lt;/p&gt;
&lt;p&gt;Q: What do the counterfactuals reveal about policy effectiveness?
A: Fully Coasian property-rights reform — where firms treat all surrounding land as their own — would reduce fires by only 14%. Tort reform enabling concession holders to recover damages from neighbors (treating neighboring concessions as own land) would reduce fires by only 6%. By contrast, uniform enforcement raising deterrence to the level currently applied to populated areas would reduce fires by 80%; applying the level currently applied to protected forest would reduce fires by 67%. An enforcement regime that perfectly prevented all fire spread outside the igniting concession would reduce area burned by only 23%; preventing spread into protected and populated areas alone would yield only a 2% reduction.&lt;/p&gt;
&lt;p&gt;Q: What do the benefit-cost ratios for fires look like?
A: The estimated external damages from the 1997/1998 Indonesian fires range from 1,286 to 6,074 USD per hectare burned (2020 USD). The average private benefit from using fire rather than mechanical clearance — accounting for fertilizers and other costs — averages approximately 52 USD per hectare (2020 USD). Benefit-cost ratios of 0.008 to 0.04 lie well below 1, indicating that the social damages from fires vastly exceed the private benefits, even though the government currently deters only the most costly categories of fire.&lt;/p&gt;
&lt;p&gt;Q: Why do Coasian private solutions perform poorly in this setting?
A: Coasian bargaining between concession holders would require them to reach agreements to bring fire use to a locally efficient level without government intervention. The evidence shows firms treat other concession holders&amp;rsquo; land essentially the same as unprotected unleased productive forest, implying that no such bargains are being struck. The counterfactual analysis confirms this: even a fully-Coasian outcome where every surrounding pixel is treated as own land would reduce fires by only 14%, because the bulk of fires occur when ignition costs to the firm&amp;rsquo;s own land are low regardless of wind speed.&lt;/p&gt;
&lt;p&gt;Q: What is the primary policy implication?
A: The most effective lever for reducing fires is not preventing spread after the fact, but rather deterring ignition in the first place by extending the enforcement regime uniformly across all land types. If firms were induced to treat all surrounding land with the same caution they currently apply toward populated areas — through broader and stronger penalties — fires would fall by 80%. This is substantially more effective than property-rights reforms, tort reforms, or targeted spread-prevention measures focused only on protected and populated areas.&lt;/p&gt;
&lt;p&gt;Externality (fire spread): In this paper&amp;rsquo;s usage, the cost imposed on third parties when a fire ignited inside one concession spreads to land owned by others. The externality is quantified as the share of area burned outside the igniting concession (32% of multi-day fire spread in the data) and the ratio of external damages (1,286–6,074 USD/ha) to private benefits (52 USD/ha) from using fire rather than mechanical clearance.&lt;/p&gt;
&lt;p&gt;Slash-and-burn (industrial scale): The two-stage land-clearance practice where valuable timber is first harvested (deforestation) and the remaining vegetation is then burned to prepare land for plantation crops. The paper establishes this cycle empirically: complete deforestation of a 1 km pixel increases fire ignitions by 279% in the following year, with the effect reversing in the second year, ruling out natural flammability.&lt;/p&gt;
&lt;p&gt;Pigouvian enforcement: Government-imposed penalties that alter private incentives to account for externalities. In this paper&amp;rsquo;s usage, the government&amp;rsquo;s de facto punishment function — which heavily weights fires spreading into protected areas and populated land — functions as an implicit Pigouvian tax, shaping which fires firms choose to avoid rather than uniformly deterring all illegal burning.&lt;/p&gt;
&lt;p&gt;Coasian bargaining failure: The absence of private negotiations between concession holders to internalize the externalities they impose on each other. The paper demonstrates this failure empirically by showing firms treat neighboring concession land no differently from unprotected unleased productive forest, indicating no effective private agreements are limiting cross-concession fire spread.&lt;/p&gt;
&lt;p&gt;Wind speed as spread risk shifter: Monthly average wind speed at each 1 km pixel, used as the time-varying component of fire spread risk. A one-standard-deviation increase (approximately 5 km/hr) increases fire spread area by 287%. The paper uses wind speed variation interacted with surrounding land type composition to identify whether firms adjust ignition decisions based on spread risk and who bears the cost.&lt;/p&gt;
&lt;p&gt;Unleased productive forest (benchmark): Land within the national forest estate that is neither in a designated concession nor in a protected zone, leaving ownership rights unclear and de facto unprotected. The paper uses firms&amp;rsquo; behavior toward this category as the baseline against which sensitivity to other land types is measured, because it attracts the least government attention and the weakest property rights.&lt;/p&gt;
&lt;p&gt;Government punishment function: The implicit weights the Indonesian government places on different types of fire damage when deciding whether to investigate a firm, estimated from logistic regression on the 2015 investigation data. The function heavily weights fires burning protected areas and high-population-density land, and places near-zero weight on damage to other private concessions, shaping which fire types firms strategically avoid.&lt;/p&gt;</description></item><item><title>The Power of Proximity to Coworkers</title><link>https://macropaperwarehouse.com/papers/the-power-of-proximity-to-coworkers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-power-of-proximity-to-coworkers/</guid><description>&lt;p&gt;This paper studies how physical proximity to coworkers affects on-the-job training and productivity, using software engineers at a Fortune 500 online retailer observed from 2019 to 2024. The authors exploit two quasi-experimental shocks to proximity: the office closures of 2020, which eliminated proximity differentials that previously existed across team types, and the firm&amp;rsquo;s subsequent return-to-office (RTO) mandates in 2022 and 2023, which restored proximity for co-located teams while leaving geographically-distributed teams apart. The core identification strategy is a difference-in-differences design comparing engineers whose teams were co-located in a single headquarters building to those whose teams were split across two buildings a ten-minute walk apart — a distinction that became immaterial once offices closed.&lt;/p&gt;
&lt;p&gt;The central finding is that sitting near teammates substantially increases the digital feedback engineers receive on their code. Before the office closures, engineers on co-located teams received 23.9% (1.92 comments per program) more code review feedback than engineers on multi-building teams. Once offices closed, this advantage narrowed by 18.3% (1.47 comments per program, p-value = 0.0026). The lost comments were disproportionately those predicted by a machine-learning classifier to be helpful, actionable, well-reasoned, and impactful, with high-quality comments declining by 21–23% — exceeding the overall volume decline. Face-to-face and digital communication are complements, not substitutes: proximate engineers drew on a wider pool of reviewers and asked 48.4% more follow-up questions, a differential that vanished once offices closed.&lt;/p&gt;
&lt;p&gt;Proximity&amp;rsquo;s effects are highly heterogeneous. Gains in feedback are concentrated among less-tenured, younger, and female engineers — those with the most to learn. Junior engineers on co-located teams lost 2.03 more comments per program upon office closure than junior engineers already on distributed teams (p-value = 0.001); young engineers lost 2.47 more comments (p-value = 0.0001). Female engineers lost 38.9% more comments than their distributed female counterparts (p-value &amp;lt; 0.0001), partly because women stop asking as many people for feedback when they cannot do so in person.&lt;/p&gt;
&lt;p&gt;Proximity improves code quality for inexperienced engineers. Around the second RTO (three days per week), engineers on co-located teams became 2.2 percentage points less likely to add files subsequently deleted — a measure of churn — and 1.4 pp less likely to introduce bugs, relative to distributed teams (p-values of 0.041 and 0.022 respectively). These gains were roughly twice as large for less-tenured and younger engineers. The benefits persist: engineers who spent more pre-closure time on co-located teams continued to write higher-quality code during the fully remote period.&lt;/p&gt;
&lt;p&gt;However, mentorship is costly for those who provide it. Senior engineers on co-located teams wrote 0.76 fewer programs per month in the main codebase before closures (p-value = 0.0005), a gap that closed when offices did and widened again during the second RTO. The firm faces a fundamental tradeoff: proximity accelerates junior engineers&amp;rsquo; human capital development while reducing experienced engineers&amp;rsquo; immediate coding output.&lt;/p&gt;
&lt;p&gt;These dynamics shape hiring. The firm shifted toward hiring older, more experienced engineers during closures — buying talent it could no longer build in-house — and back toward younger hires once offices reopened. Nationally, young college graduates in remotable occupations (classified per Dingel and Neiman, 2020) experienced a 0.88 pp increase in unemployment between 2017–2019 and 2022–2024, while older graduates saw a marginal decline of 0.11 pp. A triple-difference estimate finds a 0.65 pp greater increase in young workers&amp;rsquo; unemployment in remotable versus non-remotable occupations (p-value = 0.029), a pattern that predates generative AI diffusion and is robust to controlling for AI exposure. Back-of-the-envelope, remote work accounts for an estimated 64% of the total unemployment increase among young college graduates over this period.&lt;/p&gt;
&lt;p&gt;The paper also documents that proximity is fragile: a ten-minute walk between two buildings reduces feedback as much as being multiple states away, and even a single distant teammate imposes negative externalities on those who remain co-located, reducing their feedback by 1.71 comments per program (p-value = 0.095) via a &amp;ldquo;one Zoom, all Zoom&amp;rdquo; norm.&lt;/p&gt;
&lt;p&gt;Q: What is the main identification strategy for the office-closure analysis, and what is the key parallel-trends evidence?&lt;/p&gt;
&lt;p&gt;A: The authors compare engineers on co-located teams (all members in one headquarters building) to those on multi-building teams (split across two buildings a ten-minute walk apart), before and after the March 2020 office closures. Co-located teams lost more proximity when offices closed, while multi-building teams experienced a smaller shock, enabling a difference-in-differences design. Pre-closure trends in feedback are parallel across the two team types (Figure I), supporting the identifying assumption. Standard errors are clustered by team, the unit of treatment assignment.&lt;/p&gt;
&lt;p&gt;Q: How large is the effect of proximity on total code review feedback, and how is it broken down by feedback source?&lt;/p&gt;
&lt;p&gt;A: Before closure, co-located engineers received 23.9% (1.92 comments per program) more feedback than multi-building engineers. The DiD estimate indicates that losing proximity reduced feedback by 18.3% (1.47 comments per program, p-value = 0.0026, Column 3 of Table II). This decline stems entirely from reduced feedback from teammates; there is no detectable effect on feedback from engineers on other teams — a placebo check that supports the identification strategy and rules out explanations based on differential project complexity.&lt;/p&gt;
&lt;p&gt;Q: How does proximity affect the quality — not just the quantity — of code review comments?&lt;/p&gt;
&lt;p&gt;A: Using a gradient-boosted decision tree trained on 5,377 human-labeled comments, the authors predict comment quality across all 174,014 comments. Losing proximity reduced comments predicted to be helpful, well-reasoned, actionable, and likely to change the code by 21–23% — exceeding the 18.3% overall volume decline. The residual comments were lower quality: 2.9 pp fewer were helpful (p-value = 0.039), 1.7 pp fewer explained their reasoning (p-value = 0.094), and 1.9 pp fewer were likely to change the code (p-value = 0.072).&lt;/p&gt;
&lt;p&gt;Q: What mechanisms drive the complementarity between face-to-face interaction and digital feedback?&lt;/p&gt;
&lt;p&gt;A: Proximity increases feedback on both the extensive and intensive margins. On the extensive margin, co-located engineers draw on a wider pool of reviewers, returning less frequently to the same commenter. On the intensive margin, losing proximity reduces follow-up questions by 48.4% (0.12 questions per program, p-value = 0.0083), accounting for roughly half of the total feedback decline. The other half comes from reduced initial reviewer feedback. References to other communication channels (e.g., Slack) within code reviews also decline when proximity is lost, confirming that face-to-face and digital communication are complements.&lt;/p&gt;
&lt;p&gt;Q: How small a physical barrier is sufficient to reduce feedback substantially?&lt;/p&gt;
&lt;p&gt;A: A ten-minute walk between two buildings on the same headquarters campus reduces feedback by as much as being multiple states away — both groups receive significantly less feedback than engineers whose entire team sits in the same building (Figure Ib). This finding aligns with research on academics showing that different floors or buildings reduce coauthorship, and extends it to daily teammates sharing projects.&lt;/p&gt;
&lt;p&gt;Q: What are the externality effects of a single distant teammate?&lt;/p&gt;
&lt;p&gt;A: Through the firm&amp;rsquo;s implicit &amp;ldquo;one Zoom, all Zoom&amp;rdquo; norm, even one teammate in a different location shifts all team meetings to video calls. Engineers in the same building exchange 14.5% less feedback when even one teammate is in another building versus when all teammates are co-located (p-value = 0.037). When a new hire transforms a co-located team into a multi-building one, feedback between the original co-located teammates drops by 1.71 comments per program (p-value = 0.095); adding a new co-located hire produces no such decline.&lt;/p&gt;
&lt;p&gt;Q: How does the effect of proximity on feedback differ by engineer tenure, age, and gender?&lt;/p&gt;
&lt;p&gt;A: Less-tenured engineers on co-located teams lost 2.03 more comments per program upon closure than less-tenured engineers on distributed teams (p-value = 0.001). Young engineers (under 29) on co-located teams lost 2.47 more comments per program than young distributed engineers (p-value = 0.0001). Female engineers on co-located teams lost 38.9% (3.71) more comments than female engineers on distributed teams (p-value &amp;lt; 0.0001), partly because women draw feedback from 14.7% fewer people when proximity is lost (p-value = 0.0078), compared to a negligible 2.6% decline for men. The extra feedback women receive in person is of higher quality, not rude or condescending.&lt;/p&gt;
&lt;p&gt;Q: How is the effect of proximity on code quality identified using the RTO design, and what are the magnitudes?&lt;/p&gt;
&lt;p&gt;A: The RTO design compares engineers on co-located (same-city) teams to geographically-distributed teams across three periods: full closure, first RTO (two days per week), and second RTO (three days per week). The authors predict γ_closed ≈ 0 (office assignment irrelevant when closed) and γ_2nd_RTO &amp;gt; γ_1st_RTO (more in-office days means more proximity). Both predictions are confirmed. During the second RTO, co-located engineers were 2.2 pp less likely to add files later deleted (p-value = 0.041) and 1.4 pp less likely to introduce bugs (p-value = 0.022), with effects roughly twice as large for less-tenured and younger engineers.&lt;/p&gt;
&lt;p&gt;Q: Does the benefit of co-location on code quality persist after remote work resumes?&lt;/p&gt;
&lt;p&gt;A: Yes. After all engineers returned to remote work, those who had been on co-located teams pre-closure were 2.37 pp less likely to write disposable code (p-value = 0.013) and 3.09 pp less likely to introduce bugs (p-value = 0.0012). Code quality improves monotonically with the number of pre-closure months spent on co-located teams (Figure A.5). These gaps persist when including current team fixed effects, meaning within the same post-closure team, the previously co-located engineer writes higher-quality code.&lt;/p&gt;
&lt;p&gt;Q: What is the cost of mentorship for senior engineers, and how does it manifest in coding output?&lt;/p&gt;
&lt;p&gt;A: Senior engineers on co-located teams wrote 0.76 fewer programs per month in the main codebase when offices were open (p-value = 0.0005). Once offices closed, this gap disappeared, and senior engineers who lost proximity to their teammates saw a relative increase in output of 0.58 programs per month (p-value = 0.0014). During the second RTO, engineers with more than sixteen months of tenure on co-located teams wrote fewer programs, while no significant difference emerged for less-tenured engineers. Overall, the DiD estimate indicates losing proximity to teammates increases immediate output by 0.48 programs per month (p-value = 0.0002).&lt;/p&gt;
&lt;p&gt;Q: How does the firm&amp;rsquo;s hiring age distribution respond to changes in proximity?&lt;/p&gt;
&lt;p&gt;A: When offices were closed, the firm shifted toward hiring older engineers: the share of hires under age 29 fell from over half pre-closure to less than a third during the closure. After the RTOs, the firm shifted back toward younger hires. Geographic variation reinforces this: headquarters-campus hires were 7–10 years younger than those hired into distributed roles when offices were open; this gap narrowed substantially during closures when everyone was far from teammates.&lt;/p&gt;
&lt;p&gt;Q: Does proximity affect which engineers are poached by other firms?&lt;/p&gt;
&lt;p&gt;A: Yes. During the office closures, 1.2% of co-located engineers were poached per month, compared to 0.9% of multi-building engineers of similar tenure, age, and engineering group (p-value = 0.044). By the end of the closure period, nearly a quarter of co-located engineers had been poached versus a sixth of multi-building engineers. There is a dose response: more pre-closure time on co-located teams predicts higher poaching rates. The effect is concentrated among younger and female engineers, consistent with their feedback building more transferable general human capital. Tenure does not moderate the poaching effect, consistent with less-tenured engineers&amp;rsquo; feedback being more firm-specific.&lt;/p&gt;
&lt;p&gt;Q: What does national unemployment data show about the scarring effects of remote work on young workers?&lt;/p&gt;
&lt;p&gt;A: Between 2017–2019 and 2022–2024, young college graduates (under 29) in remotable occupations experienced a 0.88 pp increase in unemployment (p-value &amp;lt; 0.00001), while older graduates in the same occupations saw a marginal decline of 0.11 pp (p-value = 0.053). A triple-difference regression finds a 0.65 pp greater increase in young workers&amp;rsquo; unemployment in remotable versus non-remotable occupations (p-value = 0.029). Back-of-the-envelope, scaling this estimate by the 61% share of young graduates in remotable jobs predicts a 0.4 pp increase in young college graduates&amp;rsquo; overall unemployment — equal to 64% of the realized 0.63 pp increase.&lt;/p&gt;
&lt;p&gt;Q: Is the unemployment increase among young workers in remotable jobs driven by generative AI rather than remote work?&lt;/p&gt;
&lt;p&gt;A: The authors argue against AI as the primary driver on two grounds. First, the uptick in young workers&amp;rsquo; unemployment in remotable occupations predates the rapid diffusion of generative AI. Second, the differential increase is not concentrated among occupations with the highest AI task exposure. The triple-difference estimate is robust to controlling for occupational AI exposure using the Eisfeldt, Schubert and Zhang (2023) index. The authors acknowledge that AI may become more important as it diffuses further.&lt;/p&gt;
&lt;p&gt;Q: How do young workers&amp;rsquo; own office attendance decisions reflect the value of proximity?&lt;/p&gt;
&lt;p&gt;A: At the partner firm, engineers under 29 were 8.8 pp (37.6%) more likely to come into the office during the RTOs than older engineers when on co-located teams (solid line in Figure VIIa). This difference was roughly halved on geographically-distributed teams (p-value of difference = 0.0085), indicating that the draw is specifically proximity to teammates. Co-located managers raised attendance by 2.6 pp, while co-located teammates raised it by 5.1 pp. Nationally, Stack Overflow survey data show nearly half of engineers under 25 are in the office each day, versus a quarter of older engineers (p-value &amp;lt; 0.00001).&lt;/p&gt;
&lt;p&gt;Q: What does the paper imply about why remote work was rare before the pandemic despite workers&amp;rsquo; stated preferences for it?&lt;/p&gt;
&lt;p&gt;A: The paper offers a resolution: firms may have recognized that the value of the office lies in training for tomorrow and improving the quality — not the quantity — of work today. Remote work boosts immediate output, especially for experienced workers, but it reduces mentorship and long-run skill development. The tradeoff between current and future productivity, and between individual and collective returns to human capital, explains why firms historically resisted remote work even when workers preferred it and short-run output was unaffected.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for gender equity in remote work?&lt;/p&gt;
&lt;p&gt;A: The findings suggest remote work has ambiguous gender effects. While remote work may help working mothers remain in the workforce, it appears costly for young women&amp;rsquo;s professional development, which is especially sensitive to physical proximity. Women receive substantially more high-quality feedback when co-located, draw feedback from a wider network in person, and lose disproportionately more feedback when proximity is lost. Young female engineers on co-located teams were also disproportionately poached — suggesting their human capital gains from co-location are more general and transferable.&lt;/p&gt;
&lt;p&gt;Code review feedback: The digital comments engineers exchange when reviewing each other&amp;rsquo;s code before it is merged into the live codebase; the paper&amp;rsquo;s primary measure of on-the-job training and mentorship investment, distinct from mere volume because the authors also classify comments by helpfulness, reasoning, actionability, and expected impact using supervised machine learning.&lt;/p&gt;
&lt;p&gt;Co-located team: A team in which all members are assigned to the same office building; the treatment group in the difference-in-differences designs, distinguished from multi-building teams (split across two headquarters buildings, a ten-minute walk apart) and geographically-distributed teams (members in different cities or permanently remote).&lt;/p&gt;
&lt;p&gt;One Zoom, all Zoom norm: The implicit team practice of holding all meetings virtually if any single teammate cannot be physically present; the mechanism by which one distant colleague generates negative externalities for the remaining co-located teammates, reducing their in-person interaction and feedback.&lt;/p&gt;
&lt;p&gt;Proximity fragility: The finding that even small physical barriers — a ten-minute walk between buildings — reduce feedback as much as being multiple states away, implying that the relationship between physical distance and mentorship is highly nonlinear near zero.&lt;/p&gt;
&lt;p&gt;Churn (disposable code): Files that are added by an engineer but deleted within the subsequent six months, either because the code was poorly structured or because it introduced a feature later abandoned; used as one of two code quality proxies in the RTO analysis (occurring in 15% of programs).&lt;/p&gt;
&lt;p&gt;Bugs (immediate reversions): Programs that are immediately and fully reverted after being merged, typically indicating the engineer&amp;rsquo;s changes precipitated an emergency requiring rollback to an earlier version; used as the more serious of the two code quality proxies (occurring in 3.5% of programs).&lt;/p&gt;
&lt;p&gt;Scarring effects: The persistent adverse impact on young workers&amp;rsquo; human capital and labor market outcomes from reduced mentorship during the remote work period; manifested both as lower code quality at the individual level and higher unemployment rates nationally among young college graduates in remotable occupations.&lt;/p&gt;
&lt;p&gt;Remotable occupation: An occupation classified by Dingel and Neiman (2020) as feasibly performed from home; used to construct the national triple-difference analysis comparing age gaps in unemployment across remotable and non-remotable jobs before and after the pandemic.&lt;/p&gt;</description></item><item><title>The Price of Housing in the United States, 1890–2006</title><link>https://macropaperwarehouse.com/papers/the-price-of-housing-in-the-united-states-18902006/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-price-of-housing-in-the-united-states-18902006/</guid><description>&lt;p&gt;Lyons, Shertzer, Gray, and Agorastos construct the first consistent, annual, quality-adjusted market rent and home sales price series for American cities spanning 1890–2006. The paper addresses a fundamental data gap: no annual city-level series existed for market rents at any point in the 20th century, and no annual city-level sales price series existed prior to 1975. Existing national series—the BLS Rent of Primary Residence (RoPR) for rents and the Shiller index for sales—carry well-documented methodological limitations that the authors argue have produced materially misleading stylized facts about long-run U.S. housing markets.&lt;/p&gt;
&lt;p&gt;The Historical Housing Prices (HHP) dataset draws on just under 2.7 million newspaper real estate listings from 30 U.S. cities across 1890–2006. Listings must contain a price, a size measure (rooms or bedrooms), property type (house or apartment), and a location indicator. The authors construct hedonic price indices using a rolling-windows methodology—baseline three-year rolling windows with annual step size—that controls for size, type, and standardized within-city location, allowing coefficients to vary over time rather than imposing a fixed vector across the full century. City-level indices are aggregated to national indices using population weights from census data interpolated between census years. Listed prices serve as proxies for transaction prices; the authors validate these against census distributions and against post-1975 FHFA and Case-Shiller series.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s findings revise several established stylized facts. First, real market rents did not fall over the 20th century as implied by the RoPR series. Instead, real rental price levels were approximately 20% higher in 2006 than in 1890, fluctuating within a relatively narrow band. The RoPR series, by contrast, implies a near-halving of real rents between 1914 and 2006. Second, the paper documents a substantial interwar housing boom-bust absent from the Shiller index: real sales prices rose approximately 47% between 1920 and 1928, then fell 27% by 1935, with the 1928 peak not recovered in real terms until 1968. Third, contrary to the Shiller index&amp;rsquo;s depiction of minimal housing price growth from 1950 to 1995, the HHP series shows real sales prices rising 21% between 1953 and 1974—a period for which Shiller relies on a truncated sample of government-backed mortgages that excluded higher-valued homes.&lt;/p&gt;
&lt;p&gt;On the return to homeownership, the paper finds average nominal housing returns across 1890–2006 of approximately 11% per year, composed of 3.8% capital gain and 7.2% rental return. Gross market rental yields exceeded 8% annually for much of 1900–1945, fell to 7% by 1960, and to 3% by 2006. Capital gains were largely unimportant before the 1940s and became the dominant return component only from 1970 onward; the post-1980 period with sustained capital gains is characterized as historically anomalous. Returns varied substantially across cities, with some cities outperforming the S&amp;amp;P 500 in the prewar era while most underperformed equities from 1981–2006.&lt;/p&gt;
&lt;p&gt;The paper also examines implications for the CPI. The HHP series implies nominal rents grew at approximately 3.5% per year from 1914 to 2006, versus 2.6% per year for the RoPR component. A back-of-the-envelope alternative CPI using HHP rental data yields overall price growth of 3.3% per year rather than the official 3.1%, suggesting the measured increase in U.S. living standards since World War I may be modestly overstated. Finally, cross-city analysis shows that land constraints and, increasingly, regulatory constraints explain divergence in price growth across cities, with the role of zoning becoming more pronounced after 1980.&lt;/p&gt;
&lt;p&gt;Q: What is the core data source and how are the indices constructed?
A: The HHP dataset comprises just under 2.7 million newspaper real estate listings from 30 U.S. cities, 1890–2006, sampled from real estate sections (typically the last Sunday of each month). Valid listings require price, size, property type, and within-city location. Hedonic indices are estimated using rolling three-year windows with annual steps, controlling for size, type, and standardized location, allowing hedonic coefficients to evolve over time rather than imposing a fixed vector. City indices are aggregated to national indices using population-weighted census data interpolated between census years.&lt;/p&gt;
&lt;p&gt;Q: Why are the HHP series based on listing prices rather than transaction prices, and how is this limitation addressed?
A: Transaction-price records require local archival effort infeasible across 30 cities over 116 years, and rental transaction data are essentially unavailable historically. The authors argue that hedonic mix-adjustment makes listed prices strong predictors of selling prices during normal market conditions, and that a substantial share of houses transact at their exact listing price. Validation against census distributions and against post-1975 FHFA and Case-Shiller series supports the approach; the authors acknowledge listing prices may diverge from transaction prices at cyclical peaks and troughs.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the long-run trajectory of real market rents, and how does this revise existing understanding?
A: The HHP series shows real rental price levels in 2006 were approximately 20% higher than in 1890 or 1914, fluctuating within a relatively narrow band over the century. The BLS RoPR series implies real rents fell by nearly half between 1914 and 2006. The HHP findings align with the most influential proposed corrections to the RoPR by Gordon &amp;amp; van Goethem (2007) for 1915–1939 and broadly with Crone et al. (2010) in terms of overall growth levels for 1940–1995, though the HHP series shows a sharper rental spike after World War II rent controls were lifted that the BLS methodology captures only with deliberate lag.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the interwar housing cycle, and why does the Shiller index miss it?
A: The HHP series documents that real sales prices rose approximately 47% between 1920 and 1928, then fell 27% by 1935, with the 1928 nominal peak not regained until 1946 and the real peak not until 1968. The Shiller index for 1890–1934 is based on a 1934 survey of owner recollections of past transaction prices and assessed values, which the authors argue reflects homeowners&amp;rsquo; lack of awareness of the changing value of their homes over prior decades. The HHP finding is consistent with census data, Nicholas &amp;amp; Scherbina&amp;rsquo;s study of New York City, and Fishback &amp;amp; Kollmann&amp;rsquo;s analysis of New Deal reports.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the 1953–1974 period, and what explains the divergence from the Shiller index?
A: The HHP series shows housing sales prices increased 21% in real terms between 1953 and 1974, while the Shiller index (based on the Home Purchase Component of the CPI) implies a moderate decline of around 10%. The Shiller index for this period uses a truncated sample of government-backed mortgages subject to FHA loan limits; when the authors truncate their own data using the same statutory FHA limits ($30,000 in 1973, $45,000 in 1974, $60,000 in 1977), approximately 50% of their 1971–1979 listings are excluded and their truncated series matches the Shiller index more closely. This supports the Greenlees (1982) critique of downward bias in the Home Purchase CPI component.&lt;/p&gt;
&lt;p&gt;Q: What are the long-run return components to homeownership at the national level?
A: Average nominal housing returns across 1890–2006 were approximately 11% per year: 3.8% capital gain and 7.2% rental return. Before World War II (1890–1945), average nominal rental returns ranged from 7.9% to 8.3% per sub-period while capital gains averaged near zero or negative in real terms. Only in 1981–2006 did capital gains (averaging 5.8%) exceed the rental return (averaging 5.3%). The return to housing has thus been dominated by rental income over the long run, with the post-1980 era of sustained capital gains constituting a historical anomaly.&lt;/p&gt;
&lt;p&gt;Q: How do rental yields evolve over the sample period?
A: Gross market rental yields exceeded 8% annually for much of 1900–1945, with spikes after both World Wars and a dramatic fall from nearly 11% to below 7% during the early 1920s boom, consistent with a bubble dynamic before the Great Depression. Yields fell to approximately 7% by 1960 and to 3% by 2006. City-level heterogeneity was substantial: rental returns exceeded 15% in some cities in the two decades before the Great Depression, and most cities saw returns above 10% nominally during 1930–1945, while even by 1981–2006 cities like Phoenix and St. Louis averaged above 12%.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about housing and the business cycle?
A: Real growth rates in GDP and housing prices moved in the same direction in 72 of 116 years for sales prices and 65 of 116 years for rental prices. The paper identifies three major downturns where falling rents led falling prices which led falling GDP: the Great Depression (rents fell from 1924, prices from 1929, GDP from 1930), the early 1990s recession (rents from 1988, prices from 1990, GDP from 1991), and the end-of-sample period (rents from 2002). Only after World War I (1920–21) and World War II (1945–46) did clear economic contractions occur without equivalent housing price downturns.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about cross-city variation in housing returns, and what does this imply for the volatility puzzle?
A: Capital gains and rental returns vary substantially across cities and time periods; some cities saw returns exceeding the S&amp;amp;P 500 before World War II (including New York and Chicago), while most underperformed equities from 1981–2006. The authors argue that the apparently low volatility of housing returns at the national level documented by Jordà et al. (2019) is partly an aggregation artifact: local housing markets with very different trajectories are combined into a national index, dampening measured variance. The mild positive correlation between city-level capital gains and rental returns has an R² of 0.24.&lt;/p&gt;
&lt;p&gt;Q: What are the implications for CPI measurement?
A: The HHP series implies nominal rents grew at approximately 3.5% per year from 1914 to 2006, compared with 2.6% per year for the BLS RoPR component, with higher growth concentrated in the years after both World Wars and in the 1965–1985 period. A back-of-the-envelope alternative CPI substituting HHP rental data yields overall price growth of 3.3% per year rather than the official 3.1%. If rental price growth before 1985 is understated in the BLS data, then there has been less improvement in the U.S. standard of living since World War I than was previously understood.&lt;/p&gt;
&lt;p&gt;Q: What does the paper find about the role of supply constraints in explaining cross-city price divergence?
A: Natural land constraints are positively linked to price growth throughout the 20th century, with the relationship sharpest during 1930–1945 (before the postwar suburban expansion) and again after 1980. Regulatory constraints—measured at the turn of the millennium—have become an increasingly important driver of cross-city price differences, consistent with zoning functioning as a tax (Gyourko &amp;amp; Krimmel 2021). The paper also finds evidence suggesting land-use regulations are partly driven by expectations of future price growth, consistent with the homeowner-voter hypothesis (Fischel 2015; Trounstine 2018).&lt;/p&gt;
&lt;p&gt;Q: How does the paper validate its series against existing sources?
A: The HHP rental series aligns closely with the Rees and Jacobs (1961) series for 1890–1914. For sales, the HHP series matches the Case-Shiller-Weiss and FHFA repeat-sales indices at both national and city level after 1990 despite methodological differences. The paper finds approximately 25% more price growth than the CSW series over 1975–2006 (117% versus 90% in the 30 HHP cities), attributing some of the divergence to OFHEO appraisal-based valuations before 1992 and the HHP coverage of the broader owned housing market beyond single-family homes.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;Historical Housing Prices (HHP) Project: A dataset of just under 2.7 million newspaper real estate listings from 30 U.S. cities, 1890–2006, used to construct annual, quality-adjusted hedonic price indices for both rented and owned housing segments at the city and national level.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Rolling-windows hedonic methodology: An index construction approach that runs sequential hedonic regressions over two-, three-, or five-year overlapping windows with annual step size, allowing the coefficients on size, type, and location to evolve over time rather than imposing a fixed vector across the full sample period, reducing bias from unobserved quality changes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Market rent vs. contract rent: Market rent (the listing price for a rental unit actively advertised) is conceptually distinct from contract rent (the rent paid by tenants currently in situ), which is what the BLS RoPR series measures. Market rents adjust to vacancy and lease resets faster than contract rents, producing substantially more short-run volatility and a materially different long-run trend.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Gross rental yield (rent-to-price ratio): Annual rental income from a property divided by its market sales price, computed as RI_{c,t} / HPI_{c,t}. Gross yields exceeded 8% annually for much of 1900–1945 and fell to 3% by 2006 nationally, making rental income the dominant component of total housing returns for most of the century.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Total return to housing: The sum of the capital gain (percentage change in sales price) and the rental return (rental income divided by sales price), computed at annual, city, and national frequency for 1890–2006. The average nominal total return was approximately 11% per year, with 3.8% from capital gains and 7.2% from rental income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Rent of Primary Residence (RoPR): The BLS survey-based series measuring changes in contract rents for a rotating panel of rental units, used as the shelter component of the CPI. The HHP series implies this series understates rental price growth by approximately 0.9 percentage points per year (3.5% vs. 2.6% nominal growth), concentrated in post-World War periods and 1965–1985, due to tenant non-response bias and delayed incorporation of new construction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Supply constraints and cross-city divergence: Natural land constraints (geographic barriers to development) and regulatory constraints (zoning and land-use regulation) that limit housing supply, both positively associated with price growth, with regulatory constraints becoming increasingly important after 1980 and consistent with the hypothesis that land-use regulations are partly driven by homeowner expectations of future price appreciation.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>The Productivity of Professions: Evidence from the Emergency Department</title><link>https://macropaperwarehouse.com/papers/the-productivity-of-professions-evidence-from-the-emergency-department/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-productivity-of-professions-evidence-from-the-emergency-department/</guid><description>&lt;p&gt;This paper studies the productivity of nurse practitioners (NPs) versus physicians performing overlapping tasks in Veterans Health Administration (VHA) emergency departments (EDs), exploiting a quasi-experiment created by the VHA&amp;rsquo;s December 2016 grant of full practice authority to NPs. The identification strategy instruments patient assignment to NPs versus physicians using quasi-random variation in the number of NPs on duty on a given ED-day, conditional on ED-by-time-category fixed effects. The sample covers 1.1 million ED visits across 44 VHA EDs from January 2017 to January 2020, seen by 1,348 physicians and 156 NPs. The instrument is validated by demonstrating balance in patient observable characteristics across values of the instrument, stability of IV estimates across 256 combinations of patient covariate controls, and absence of spillover effects from NP presence onto physician performance.&lt;/p&gt;
&lt;p&gt;On average in the ED setting, NPs increase patient length of stay by 11 percent (approximately 18 additional minutes) and raise the cost of the ED visit by 7 percent (approximately $66 per visit). NPs raise the 30-day preventable hospitalization rate by 0.25 percentage points, a 20 percent increase relative to the mean. No statistically significant effect on 30-day mortality is detected (95 percent confidence interval: -0.34 to 0.11 percentage points). OLS estimates carry the opposite sign because NPs are assigned healthier patients in observational data; the IV design corrects for this selection.&lt;/p&gt;
&lt;p&gt;The average NP-physician performance gap varies systematically by case complexity and severity. For the highest-complexity quartile of cases (by Elixhauser comorbidities), NPs increase ED costs by 12 percent and length of stay by 28 percent. For cases at or above the 95th percentile of severity (based on 30-day mortality by diagnosis), NPs increase ED costs by 25 percent, length of stay by 99 percent, and admissions by 26 percentage points (42 percent relative to the mean), while reducing 30-day preventable hospitalization by 3 percentage points — suggesting that NPs&amp;rsquo; higher care intensity partially offsets worse intrinsic skill for the most severe cases. For lower-complexity cases, the cost and length-of-stay gaps are smaller, but NPs still significantly raise preventable hospitalizations.&lt;/p&gt;
&lt;p&gt;NPs exhibit clinical decision-making patterns consistent with lower diagnostic skill: they are more likely to order consults (2.6 percentage points, or 11 percent of the mean), CT scans (1.2 percentage points, or 8.3 percent), and X-rays (2.0 percentage points, or 6.9 percent). NPs lower opioid prescriptions by 1.8 percentage points (20 percent of the mean) and raise antibiotic prescriptions by 4.0 percentage points (6.3 percent of the mean), consistent with threshold adjustment under lower diagnostic skill with asymmetric error costs. Downstream, patients treated by NPs incur similar opioid use disorder rates despite lower opioid prescribing, and higher infection-related return visit rates despite higher antibiotic prescribing.&lt;/p&gt;
&lt;p&gt;Counterfactual analysis finds that allocating one quarter of ED patients to NPs increases net spending by $129 million per year to the VHA after accounting for NPs&amp;rsquo; lower wages (approximately half of physicians&amp;rsquo;). However, deploying NPs exclusively to the least-complex quarter of cases reduces net spending to approximately one-fifth of this amount.&lt;/p&gt;
&lt;p&gt;A distributional analysis deconvolving provider-specific IV estimates reveals that within-profession productivity variation substantially exceeds the average between-profession gap. The interquartile range in annual spending attributable to provider productivity within each profession is approximately $900,000, roughly three times the mean annual spending difference between the average NP and the average physician. A randomly chosen NP outperforms a randomly chosen physician in up to 38 percent of pairs. Within professions, individual provider productivity shows essentially no relationship with wages or case complexity assigned, whereas between professions, case assignment and wages are strongly sorted by professional class.&lt;/p&gt;
&lt;p&gt;Q: What is the core research question?
A: The paper asks whether NPs and physicians, who perform overlapping tasks in the ED but differ sharply in training, selectivity, and pay, differ in productivity, and how that average between-profession difference compares to productivity variation within each profession. It also asks what mechanisms drive any observed gap and how case assignment responds to provider skill differences.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy and why is it credible?
A: The authors instrument patient assignment to NPs with the number of NPs on duty on the ED-day, conditional on ED-by-year, ED-by-month, ED-by-day-of-week, and ED-by-hour fixed effects. Credibility rests on: provider schedules being set months in advance, decoupling NP availability from arriving patient characteristics; patient characteristics being well balanced across values of the instrument conditional on fixed effects; IV estimates being stable across all 256 covariate-control combinations; and on-duty physician and NP characteristics also being balanced across the instrument.&lt;/p&gt;
&lt;p&gt;Q: What are the main average effects of NPs on resource use?
A: IV estimates show NPs increase patient length of stay by 11 percent (approximately 18 minutes) and ED cost by 7 percent (approximately $66 per visit). There is no significant average effect on inpatient admissions in the overall sample, though NPs significantly raise admissions for high-severity cases.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of NPs on patient health outcomes?
A: NPs raise 30-day preventable hospitalizations by 0.25 percentage points, a 20 percent increase relative to the mean. The 95 percent confidence interval for 30-day mortality is -0.34 to 0.11 percentage points, implying no statistically significant mortality effect in the overall sample.&lt;/p&gt;
&lt;p&gt;Q: Why do OLS and IV estimates have opposite signs?
A: In observational data, NPs treat healthier patients than physicians: NP patients are younger (60.7 versus 62.5 years), have fewer Elixhauser comorbidities (3.2 versus 3.7), and have fewer prior inpatient stays (0.4 versus 0.7). This selection causes OLS estimates of NP effects to be negative. The IV corrects for this by exploiting quasi-random variation in NP availability; IV estimates are stable across all combinations of patient controls, consistent with the instrument being orthogonal to unobservable patient health.&lt;/p&gt;
&lt;p&gt;Q: How does the NP-physician performance gap vary with case complexity and severity?
A: For the highest-complexity quartile, NPs increase length of stay by 28 percent and ED costs by 12 percent without a significant preventable hospitalization effect. For cases at or above the 95th severity percentile, NPs increase length of stay by 99 percent, ED costs by 25 percent, and admissions by 26 percentage points (42 percent relative to the mean), while reducing 30-day preventable hospitalization by 3 percentage points. For lower-complexity quartiles, NPs show smaller cost and length-of-stay effects but significantly raise preventable hospitalizations, suggesting the higher care intensity at high severity compensates for lower skill.&lt;/p&gt;
&lt;p&gt;Q: What does the heterogeneity by severity imply for optimal case assignment?
A: The pattern is consistent with skill-task matching: NPs have a comparative and absolute disadvantage in complex cases, so optimal assignment directs less complex cases to NPs and fewer patients to NPs when physicians are more available. Empirically, NPs are indeed assigned healthier patients from the available pool, and are assigned a modestly smaller share when the ED is less busy.&lt;/p&gt;
&lt;p&gt;Q: What mechanisms explain the average NP-physician gap?
A: Three mechanisms are examined. First, experience: a one-standard-deviation increase in specific experience is associated with a 5.8 percent decline in the NP-physician length-of-stay gap, and general experience with a 10 percent decline; however, experience does not significantly narrow the preventable hospitalization gap. Second, information acquisition: NPs order more consults, CT scans, and X-rays, consistent with compensating for lower diagnostic skill. Third, prescription thresholds: NPs reduce opioid prescribing by 20 percent and raise antibiotic prescribing by 6.3 percent, consistent with threshold adjustment under asymmetric error costs, but downstream outcomes are not improved correspondingly.&lt;/p&gt;
&lt;p&gt;Q: What do prescription patterns and downstream outcomes reveal about NP diagnostic skill?
A: NPs prescribe fewer opioids yet patients treated by NPs obtain similar downstream opioid use disorder rates; NPs prescribe more antibiotics yet patients treated by NPs have higher rates of return visits with infections. This pattern is consistent with NPs exhibiting higher rates of both false positives and false negatives, not merely adjusted thresholds, suggesting genuinely lower diagnostic skill rather than threshold differences alone.&lt;/p&gt;
&lt;p&gt;Q: What do counterfactual cost calculations show?
A: Allocating one quarter of ED patients to NPs raises non-wage spending by $197 million per year to the VHA; after accounting for NP wages being half of physician wages (approximately $120,000 versus $240,000 per year), net cost is still $129 million per year. Restricting NP deployment to the least-complex quarter of cases reduces net spending to approximately one-fifth of this amount, illustrating that targeted case assignment substantially improves NP cost-effectiveness.&lt;/p&gt;
&lt;p&gt;Q: How large is within-profession productivity variation relative to between-profession differences?
A: The interquartile range in annual spending attributable to provider productivity within each profession is approximately $900,000, roughly three times the mean annual spending difference between the average NP and the average physician. A randomly chosen NP outperforms a randomly chosen physician in up to 38 percent of random pairs. The authors conclude that, despite stark differences in training and selection between professions, within-profession variation dominates.&lt;/p&gt;
&lt;p&gt;Q: Is individual provider productivity reflected in wages or case assignment within professions?
A: Within each profession, provider productivity shows essentially no relationship with wages or with the complexity of assigned cases. This contrasts sharply with between-profession patterns, where professional class strongly predicts both wages (NPs earn approximately $120,000 per year versus $240,000 for physicians) and assigned case complexity. The authors interpret this as evidence of informational and organizational frictions in recognizing individual productivity within professional classes, and note that professional class is a far stronger predictor of pay and case assignment than is individual productivity.&lt;/p&gt;
&lt;p&gt;Q: How do complier characteristics relate to the broader patient population?
A: Compliers — cases whose provider type is determined by the instrument — are healthier than the average case: younger, with fewer comorbidities, fewer prior inpatient stays, and lower predicted mortality. Never-takers are riskier than the average case. There are no always-takers since patients cannot be assigned to NPs on days when no NPs are on duty.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the literature on NP scope-of-practice laws?
A: The scope-of-practice literature estimates general-equilibrium effects of allowing NPs greater autonomy, including labor reallocation between professions. This paper instead estimates the partial-equilibrium causal effect of assigning a patient to an NP versus a physician, holding the broader labor market fixed. The two literatures are complementary: the heterogeneity findings here suggest that scope-of-practice expansions may be more beneficial in lower-complexity primary care settings where the NP-physician performance gap is smaller.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: Three implications are highlighted. First, the efficiency of using NPs depends critically on case assignment: deploying NPs on the least-complex cases reduces net costs to approximately one-fifth of indiscriminate deployment. Second, the substantial overlap between NP and physician productivity distributions provides support for NP use in less complex settings even within the ED context. Third, within-profession productivity variation far exceeding between-profession differences suggests that individual-level productivity assessment, rather than professional class, may be a more accurate guide to case assignment and compensation.&lt;/p&gt;
&lt;p&gt;Quasi-experimental variation in NP availability: The identification strategy exploits day-to-day variation in the number of NPs scheduled to work in a given VHA ED, conditional on ED-by-time-category fixed effects, as an instrument for whether a patient is assigned to an NP versus a physician. Schedules are set months in advance, rendering the NP count orthogonal to arriving patient characteristics conditional on those fixed effects.&lt;/p&gt;
&lt;p&gt;30-day preventable hospitalization: A standardized quality-of-care outcome defined by the Agency for Healthcare Research and Quality, measuring hospitalizations occurring within 30 days of ED discharge that are classified as preventable given adequate prior outpatient management. Used by the paper as the primary downstream health outcome beyond the ED visit itself.&lt;/p&gt;
&lt;p&gt;Elixhauser comorbidities: A set of 31 binary indicators for chronic conditions (e.g., cancer, diabetes) based on medical histories in the prior 365 days, used in this paper to measure and stratify case complexity into quartiles for heterogeneity analysis.&lt;/p&gt;
&lt;p&gt;Productivity distributions within professions: Provider-specific productivity estimates derived from a just-identified IV model that instruments assignment to individual providers by indicators for on-duty providers, then deconvolved into underlying distributions using the Efron (2016) and Kline-Rose-Walters (2022) method. These distributions characterize the spread of productivity within each professional class, separate from measurement error.&lt;/p&gt;
&lt;p&gt;Prescription threshold adjustment: The mechanism, formalized in Chan, Gentzkow, and Yu (2022), by which providers with lower diagnostic skill optimally adjust treatment thresholds in response to asymmetric costs of false-positive versus false-negative errors. In this paper&amp;rsquo;s application, NPs lower the opioid prescription rate (where false positives carry higher costs: addiction and overdose) and raise the antibiotic prescription rate (where false negatives carry higher costs: untreated infection), but downstream outcomes do not improve correspondingly.&lt;/p&gt;
&lt;p&gt;Skill-task matching: The organizational economics principle (Acemoglu and Autor 2011) that efficiency requires assigning more complex tasks to higher-skilled workers. The paper documents that between professions, case assignment broadly follows this principle (NPs receive less complex patients on average), but within professions, essentially no matching between individual provider productivity and case complexity is observed.&lt;/p&gt;
&lt;p&gt;Full practice authority (VHA, December 2016): The VHA policy that allowed NPs to treat patients independently without physician supervision at VHA facilities, superseding state-level restrictions. This policy change defines the start of the paper&amp;rsquo;s sample period and establishes the institutional context in which the quasi-experiment occurs, as it removed the requirement for physician oversight that previously constrained NP independence.&lt;/p&gt;</description></item><item><title>The Social Tax: Redistributive Pressure and Labor Supply</title><link>https://macropaperwarehouse.com/papers/the-social-tax-redistributive-pressure-and-labor-supply/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-social-tax-redistributive-pressure-and-labor-supply/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks whether informal redistributive pressure — the social obligation to share earned income with kin and social networks — distorts labor supply in low-income communities. The authors conceptualize such pressure as a &amp;ldquo;social tax&amp;rdquo; on earnings and develop the first direct causal test of whether it reduces labor supply, output, and earnings among full-time workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Setting and Sample&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The study works with 474 full-time piece-rate factory workers (464 of whom are women) employed in cashew processing plants run by Olam in Côte d&amp;rsquo;Ivoire. Workers are paid biweekly in cash entirely through piece rates for individual nut-peeling output, creating a direct mapping between labor supply and income. At baseline, workers report transferring 25–35% of their income to individuals outside their household, with 77% having made at least one transfer in the previous 3 months. Workers also strongly believe that earning more triggers more transfer requests: 77% agree that if someone starts earning more by working harder, people will ask that person more often for financial help.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intervention&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors introduce a blocked savings account into which workers can deposit any earnings above a self-chosen threshold (set at least as high as their own baseline average earnings). Earnings above the threshold are automatically deposited by the factory directly into the account with the Banque Populaire de Côte d&amp;rsquo;Ivoire; the cash component of pay is unchanged. Funds cannot be withdrawn until the end of the blocked period (9 months in Phase 1; 3 months in Phase 2). The key design feature is that the account reduces the effective social tax rate only on earnings &lt;em&gt;increases&lt;/em&gt; above baseline, thereby eliminating income effects and generating only a pure substitution effect — an unambiguous positive prediction on labor supply if a social tax exists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Experimental Design&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers are randomized into three conditions: (1) Control (no account); (2) Private account (existence unknown to anyone outside the worker); (3) Non-private account (existence and forthcoming unblock date revealed to network members via promotional text messages). The contrast between Private and Non-private isolates the role of redistributive pressure specifically — holding constant all other features of the blocked account product. The experiment runs in two cross-randomized phases conducted between 2018 and 2019.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Take-up of blocked accounts is dramatically higher when accounts are private: 60% in Phase 2 (Private) versus 14% (Non-private), a 77% decline (p&amp;lt;0.001). Among workers who declined Non-private accounts, 96% cite anticipated increases in transfer requests as an important factor.&lt;/p&gt;
&lt;p&gt;Being offered a Private account sharply raises labor supply. Pooling both phases, the Private arm increases average daily earnings by 175.9 FCFA, or &lt;strong&gt;11.4%&lt;/strong&gt; (p=0.012), relative to Control or Non-private arms. This is accompanied by a &lt;strong&gt;6.2 percentage point (9.7%)&lt;/strong&gt; increase in daily work attendance (p=0.023), with the entire attendance effect driven by reduced absenteeism rather than turnover. Effects in Phase 1 (Private vs. Control: +11.3%, p=0.032) and Phase 2 (Private vs. Non-private: +11.5%, p=0.043) are nearly identical in magnitude, indicating the results are not sensitive to cross-phase design. The treatment effect magnitude is equivalent to each worker working an additional 1.19 days in every two-week paycycle. Because 89% of workers have no income outside the factory, these constitute increases in total earned income.&lt;/p&gt;
&lt;p&gt;Heterogeneity is consistent with the hypothesized mechanism: among workers who report difficulty saving due to redistributive pressure, the Private treatment increases earnings by &lt;strong&gt;15.0%&lt;/strong&gt; (p=0.018); among those not reporting such difficulty, the estimated effect is near zero and insignificant (p=0.95). Among workers who report transfers to acquaintances (the most likely social-tax-motivated transfers), the effect is &lt;strong&gt;17.5%&lt;/strong&gt; (p=0.014). Workers without a partner — for whom intra-household redistribution is irrelevant — experience a &lt;strong&gt;15.8%&lt;/strong&gt; earnings increase (p=0.017), indicating that extra-household pressure drives the results.&lt;/p&gt;
&lt;p&gt;Outgoing transfers do not decline. The design leaves cash-on-hand unchanged by construction, and consistent with this, there is no significant change in the likelihood or amount of transfers from treated workers to their networks. Total outgoing transfers are if anything higher among Private account workers (p=0.049), suggesting no loss in redistribution to the network.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Social Tax Rate Estimation&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Combining the 11.4% treatment effect on output with a labor supply elasticity estimated from an end-of-experiment piece-rate randomization (intensive-margin elasticity of 0.17; total elasticity of approximately 1.11), the authors estimate the social tax rate for the average worker in the sample at &lt;strong&gt;9–14%&lt;/strong&gt;. For the subset who actually take up Private accounts, the implied social tax rate is &lt;strong&gt;19–23%&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to full-time female piece-rate workers in formal cashew processing plants in Côte d&amp;rsquo;Ivoire, with average tenure of 1.7 years. Because the intervention lowers the tax only on earnings &lt;em&gt;above&lt;/em&gt; baseline (not on all earnings), the estimates do not directly capture the total distortion from eliminating all redistributive pressure. Alternative confounds — fairness/morale effects, self-control, privacy concerns, goal-setting — are each tested and ruled out as primary drivers.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the theoretical basis for predicting that Private accounts unambiguously increase labor supply?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors model redistributive pressure as a social tax rate τ₁ on gross earnings. The blocked account reduces this tax to τ₂ &amp;lt; τ₁ only on earnings &lt;em&gt;above&lt;/em&gt; baseline labor supply e₁, creating a kink in the budget constraint. Starting from e₁, the worker faces only a pure substitution effect (no income effect) when τ₂ falls, because her net earnings at e₁ are unchanged. Equation (2) in the paper shows formally that the income effect term drops out, and the derivative of labor supply with respect to τ₂ is unambiguously negative (i.e., reducing τ₂ increases effort). This &amp;ldquo;clean&amp;rdquo; prediction — no income effect, no ambiguity — is the central design advantage relative to simply shielding existing earnings.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do take-up rates differ between Private and Non-private accounts, and what do workers say explains the difference?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In Phase 2, take-up of Private accounts is 60% versus only 14% for Non-private accounts — a 77% reduction (p&amp;lt;0.001). Among workers who declined a Non-private account, 96% cite the anticipation of increased transfer requests from network members knowing about the account as an important factor in their decision. Only 5% cite any other reason. This pattern is strong direct evidence that the fear of redistribution — not other features of the accounts — drives take-up differences.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What are the treatment effects on earnings and attendance, and how consistent are they across phases and subsamples?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pooled across both phases, the Private arm raises daily earnings by 175.9 FCFA (11.4%, p=0.012) and attendance by 6.2 percentage points (9.7%, p=0.023). In Phase 1 alone (Private vs. Control), earnings rise 11.3% (p=0.032). In Phase 2 alone (Private vs. Non-private), earnings rise 11.5% (p=0.043). Restricting to workers not previously treated in Phase 1, the effect is 12.8% (p=0.034); restricting further to workers new to the study in Phase 2 only, the effect is 17.3% (p=0.020). The authors cannot reject that effects across these three Phase 2 subsamples are statistically the same (p=0.427), ruling out sensitivity to the cross-randomized design.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does treatment effect heterogeneity support the redistributive pressure mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers who report difficulty saving because &amp;ldquo;someone else will need it for something urgent&amp;rdquo; see earnings increase by 15.0% (p=0.018) from the Private treatment; those not reporting this difficulty see near-zero, insignificant effects (p=0.95). Workers who make transfers to acquaintances — transfers especially unlikely to reflect altruism — see earnings rise 17.5% (p=0.014). Workers with below-median baseline earnings, potentially those facing the strongest relative disincentive to work, see larger effects. Each of these heterogeneous patterns is in the direction predicted if the social tax is the operative mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Do the treatment effects reflect substitution away from outside earnings or genuine total income gains?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The paper finds no treatment effects on earnings outside the factory. At baseline, 89% of workers report zero outside earnings, and on average 93% of total income comes from factory wages. Consequently, the 11.4% earnings increase represents a near-one-for-one increase in total earned income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Do Private accounts reduce transfers to the network?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;No. The design ensures that cash-on-hand is unchanged by construction — workers receive the same or slightly higher take-home cash pay (the difference is positive but insignificant). Consistent with this, neither the probability of making transfers (p=0.37) nor transfers to family (p=0.35) or non-family (p=0.93) change significantly. Total outgoing transfers in the endline survey are if anything higher in the Private arm (p=0.049, though this may partly reflect redistribution of unblocked savings). The net transfer amount is positive but insignificant (p=0.32). The authors conclude the intervention did not make others in workers&amp;rsquo; networks worse off.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How do the authors rule out morale or fairness effects as an explanation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Treatment assignment was conducted by lottery with ID numbers drawn in front of workers, clearly dissociating it from employer favoritism. More directly, the authors test for morale effects using the 3–4 week &amp;ldquo;announcement period&amp;rdquo; between treatment disclosure and account activation. If disgruntlement among non-Private workers drove results, output should fall during this period — but estimated announcement effects are near zero (0.8% of control mean, p=0.859 in Phase 2). In contrast, effects arise immediately in the first active paycycle: earnings jump 11.4% (p=0.082) even before workers have seen any deposits occur. The fairness story also cannot explain why effects are concentrated precisely among workers who report more redistributive pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the authors test and rule out self-control as the primary mechanism?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Self-control cannot explain why Non-private accounts — which offer the same commitment benefit — have dramatically lower take-up than Private accounts. Separately, the authors test a core prediction of time inconsistency models by surprising workers with an option to opt out of the next deposit, randomly varying whether the offer comes 4 days before payday or on payday itself. Under quasi-hyperbolic preferences, workers should be more likely to opt out on the payday itself. Counter to this prediction, 94% of workers keep their earnings in the account on payday, compared to 86% four days before — and these means are not statistically distinguishable, with the relative magnitudes actually running opposite to time inconsistency predictions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the authors address the concern that Non-private accounts may raise the tax rate above the baseline, inflating treatment effect estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The concern is that Non-private SMS alerts could make network members more aware of available cash than under the status quo, pushing the effective comparison above the Control level. The authors note that (a) paydays are already publicly known in this setting and workers regularly face transfer requests around them; (b) workers must physically withdraw savings from a bank after the unblock date, and can even re-block funds; and (c) the magnitude of effects when comparing Private to Control is nearly identical to the effect when comparing Private to Non-private (11.3% vs. 11.5%), suggesting the Non-private condition does not materially raise the tax above the status quo.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How do the authors rule out privacy concerns (rather than redistributive pressure) as the driver of low Non-private take-up and treatment effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Four arguments are provided. First, Phase 1 effects (Private vs. Control, no Non-private arm) are the same magnitude as Phase 2 effects, yet Phase 1 cannot be confounded by privacy concerns. Second, among workers who refused Non-private accounts, 96% cite transfer request anticipation; none volunteer generic privacy concerns. Third, heterogeneity effects — concentrated among high-redistributive-pressure workers — have no obvious connection to privacy preferences. Fourth, two placebo SMS exercises: 95% of Non-private workers grant permission to send generic bank promotional texts, and 88% of workers who had Phase 1 Private accounts grant permission for messages about their past (already-spent) savings — indicating no inherent aversion to having some financial information shared with networks. Since these workers forgo 11.5% of full-time earnings by refusing Non-private accounts, privacy concerns alone are implausible as a full explanation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How is the social tax rate estimated and what does the range look like?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors combine the 11.4% ITT treatment effect (used as the ratio e₁/e₂) with a compensated labor supply elasticity ζ estimated from an end-of-experiment piece-rate randomization. The piece-rate experiment (varying piece rates over four values from −15% to +30% of baseline over 6 days) yields an intensive-margin elasticity of 0.17. Using the ratio of attendance to intensive-margin effects from Table 3, the implied extensive-margin elasticity is 0.94, giving ζ ≈ 1.11. With this elasticity and assuming τ₂ = 0 (most conservative), the ITT-implied social tax rate is 9%; assuming τ₂ = 5%, it is 14%. For compliers (workers who actually take up Private accounts), the estimated rate is 19–23%. If instead the lower elasticity estimate of 0.32 (comparable to Goldberg 2016) is used, the ITT tax rate would be at least 29%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What are the broader implications discussed by the authors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors propose that if redistributive pressure distorts work incentives, it may also distort other costly income-generating actions: technology adoption, human capital investment, and formal sector participation. They note that 74% of workers believe taking a formal job would increase transfer requests, even though network members could also access such jobs. A speculative but highlighted policy implication is that formal safety nets (health or unemployment insurance) could reduce social tax burdens on non-recipients by absorbing demand for redistribution, potentially generating positive productivity externalities.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Social Tax&lt;/strong&gt;: The paper&amp;rsquo;s central concept. Redistributive pressure from kin and social networks is modeled as a tax rate τ₁ on gross earnings — not altruistic transfers, but transfers made under social pressure that workers would prefer to avoid. The &amp;ldquo;tax&amp;rdquo; analogy captures that the obligation is proportional to visible income and reduces the private return to earning more. The paper explicitly does not take a stance on the underlying microfoundation (risk-sharing, cultural norms, or a mix).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Blocked Savings Account&lt;/strong&gt;: A date-based savings account (implemented with Banque Populaire de Côte d&amp;rsquo;Ivoire) into which any earnings above a worker-chosen threshold are automatically deposited by the factory. Funds are inaccessible until the blocked period ends (3–9 months). Workers cannot withdraw during the period, making deposited earnings unavailable to fulfill transfer requests and therefore effectively reducing the social tax rate on earnings increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Private vs. Non-private Treatment&lt;/strong&gt;: The paper&amp;rsquo;s key experimental contrast. A Private account&amp;rsquo;s existence is unknown to anyone in the worker&amp;rsquo;s network. A Non-private account triggers SMS messages to network members disclosing that the worker is saving and announcing when the unblock date approaches. The contrast isolates whether the shielding of income from social visibility — not the commitment device per se — drives take-up and labor supply.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Substitution Effect without Income Effect&lt;/strong&gt;: The paper&amp;rsquo;s design deliberately places the tax reduction only on earnings &lt;em&gt;above&lt;/em&gt; baseline, creating a kink in the budget constraint. Starting from the existing labor supply level, there is no change in net earnings at the margin — eliminating the income effect of a tax reduction — so any labor supply response is a pure compensated (substitution) effect. This makes any observed increase in labor supply an unambiguous signal that a distortionary social tax exists.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Intent to Treat (ITT) vs. Treatment on the Treated (ToT)&lt;/strong&gt;: The ITT estimate (11.4% earnings increase) reflects the effect of being &lt;em&gt;offered&lt;/em&gt; a Private account on all offered workers, including those who did not take up. The ToT estimate — relevant for workers who actually used the accounts — implies a higher social tax rate (19–23%) because only roughly half of offered workers take up the accounts and only those workers face a materially reduced effective tax rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Compensated (Hicksian) Labor Supply Elasticity (ζ)&lt;/strong&gt;: The ratio used to infer the social tax rate from the observed treatment effect. The paper estimates ζ ≈ 1.11 (extensive margin ζₐ ≈ 0.94, intensive margin ζₑ ≈ 0.17) from an end-of-experiment piece-rate randomization. The social tax rate is recovered as τ₁ = 1 − (1−τ₂)(e₁/e₂)^(1/ζ) from Equation (5).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Piece Rate Setting&lt;/strong&gt;: Workers earn a linear piece rate for every kilogram of cashews peeled, with no fixed pay component. This setting ensures that every unit of additional effort by a worker translates directly into higher earnings, and that any observed earnings changes cleanly reflect labor supply responses rather than hour or schedule effects.&lt;/p&gt;</description></item><item><title>The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely</title><link>https://macropaperwarehouse.com/papers/the-surrogate-index-combining-short-term-proxies-to-estimate-long-term-treatment-effects-more-rapidly-and-precisely/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/the-surrogate-index-combining-short-term-proxies-to-estimate-long-term-treatment-effects-more-rapidly-and-precisely/</guid><description>&lt;p&gt;This paper addresses a fundamental challenge in program evaluation: primary outcomes of interest — such as lifetime earnings or long-term employment — are often observed only with lengthy delays, forcing researchers to rely on short-term outcomes when making timely policy decisions. The authors develop a formal framework for combining multiple short-term proxy outcomes (surrogates) into a single &amp;ldquo;surrogate index&amp;rdquo; that, under stated assumptions, identifies the average treatment effect on the long-run primary outcome.&lt;/p&gt;
&lt;p&gt;The methodological contribution rests on three key assumptions. First, Unconfoundedness: treatment assignment in the experimental sample is ignorable conditional on pre-treatment variables. Second, Surrogacy (Prentice 1989): the long-term primary outcome is independent of the treatment conditional on the surrogates — formally, Wi ⊥⊥ Yi | Si, Xi, Pi=E — meaning the entire causal path from treatment to primary outcome runs through the surrogates. Third, Comparability: the conditional distribution of the primary outcome given surrogates and pre-treatment variables is identical across the experimental and observational samples. This last assumption is novel relative to the prior surrogacy literature, which implicitly relied on it without formal statement.&lt;/p&gt;
&lt;p&gt;The paper operates with two distinct samples. The experimental sample contains treatment assignment and surrogate outcomes but not the long-term primary outcome. The observational sample contains surrogates and primary outcomes but not treatment assignment. The surrogate index is defined as the conditional expectation of the primary outcome given surrogates and pre-treatment variables estimated in the observational sample, µ(s,x,O) = E[Yi|Si=s, Xi=x, Pi=O]. Under all three assumptions, the average treatment effect on this index equals the average treatment effect on the primary outcome. Under a linear specification, the estimator reduces to multiplying the vector of treatment effects on surrogates (from the experimental sample) by the regression coefficients predicting the primary outcome from surrogates (from the observational sample).&lt;/p&gt;
&lt;p&gt;The paper derives semiparametric efficiency bounds, demonstrating that exploiting the surrogacy assumption — by replacing actual outcomes Yi with the predicted surrogate index µ(Si,Xi,O) — yields strictly lower variance than a standard randomized experiment that directly observes the primary outcome. The precision gain equals the variance of the residual Yi − µ(Si,Xi,O).&lt;/p&gt;
&lt;p&gt;The authors also characterize bias when Surrogacy or Comparability fail. Crucially, even without these assumptions, the estimators consistently estimate a well-defined causal quantity — the average treatment effect on the surrogate index — providing a principled aggregation of intermediate outcomes. Formal bounds on the extent of bias are derived; without bounded outcomes, these bounds are uninformative, but with binary outcomes or bounded violations, sharp intervals are available.&lt;/p&gt;
&lt;p&gt;The empirical application uses the Greater Avenues to Independence (GAIN) job training program, a randomized trial in California. The experimental sample is Riverside (NE,T = 4,405 treated, NE,C = 1,040 control), with 36 quarters of post-assignment outcomes. The observational sample pools three other counties (Alameda, Los Angeles, San Diego; NO = 13,725). Long-run benchmarks are a 6.4 percentage point (s.e. 1.2 pp) increase in mean quarterly employment rates and a $249 (s.e. $83) increase in mean quarterly earnings, each averaged over 36 quarters. All three surrogate-based estimators (surrogate index, surrogate score, influence function) fall within two standard errors of these benchmarks when surrogates include as few as 5 quarters of employment, earnings, and aid outcomes. By 6 quarters, the surrogate index estimate for employment is 0.061 (s.e. 0.006) versus the 0.064 benchmark. The &amp;ldquo;naive&amp;rdquo; estimator — which simply uses the treatment effect on short-run outcomes directly — requires more than 25 quarters before falling within two standard errors of the benchmark. The surrogate index achieves a 35% reduction in standard errors relative to directly waiting to observe the 9-year outcome.&lt;/p&gt;
&lt;p&gt;Q: What is the surrogate index, precisely?
A: The surrogate index is the conditional expectation of the primary outcome given surrogate outcomes and pre-treatment variables, estimated in the observational sample: µ(s,x,O) = E[Yi | Si=s, Xi=x, Pi=O]. It aggregates multiple short-term proxy variables into a scalar index through their predicted value for the long-run outcome. Under the Prentice Surrogacy assumption, the average treatment effect on this index equals the treatment effect on the primary outcome.&lt;/p&gt;
&lt;p&gt;Q: What is the Prentice Surrogacy assumption, and why is it demanding?
A: Surrogacy requires Wi ⊥⊥ Yi | Si, Xi, Pi=E — the long-run outcome is independent of the treatment conditional on the surrogates and pre-treatment variables. This means the surrogates must fully capture all causal pathways from treatment to outcome; any direct effect of the treatment on the primary outcome that does not pass through the measured surrogates violates the assumption. The authors note this is not testable in the two-sample setup because Yi and Wi are never jointly observed.&lt;/p&gt;
&lt;p&gt;Q: What is the Comparability assumption, and why is it novel?
A: Comparability requires Pi ⊥⊥ Yi | Si, Xi — the distribution of primary outcomes given surrogates and pre-treatment variables is identical across the experimental and observational samples. It formalizes the implicit condition under which the observational sample can be used to estimate the surrogate-to-outcome relationship that is then applied to the experimental sample. The authors state this assumption was not previously articulated in the surrogacy literature despite being implicitly relied upon.&lt;/p&gt;
&lt;p&gt;Q: How does the paper handle violations of Surrogacy and Comparability?
A: Theorem 4 shows that even without Surrogacy or Comparability (but maintaining Unconfoundedness), the estimators converge to a valid causal quantity: E[µ(Si(1),Xi,O) − µ(Si(0),Xi,O) | Pi=E], the average treatment effect on the surrogate index. The surrogacy-bias equals E[(µ(Si,1,Xi,E) − µ(Si,0,Xi,E)) · ρ(Si,Xi)(1−ρ(Si,Xi)) / (ρ(Xi)(1−ρ(Xi))) | Pi=E], which is small when the treatment explains little variation in Yi conditional on surrogates, or when the surrogate score is near zero or one. The comparability-bias depends on the product of the cross-sample discrepancy in the surrogate index and the deviation of the surrogate score from the propensity score.&lt;/p&gt;
&lt;p&gt;Q: What are the efficiency gains from using surrogates?
A: Theorem 2(ii) shows that in the limit as the observational sample grows large relative to the experimental sample, the efficiency bound using surrogates is strictly smaller than the Hahn (1998) bound for a direct randomized experiment. The gain equals E[(1−Wi)(Yi−µ(Si,Xi,O))²/(1−ρ(Xi))² + Wi(Yi−µ(Si,Xi,O))²/ρ(Xi)² | Pi=E] — the variance of the residual from predicting Yi with the surrogate index. Theorem 3 also characterizes the efficiency gain within a single sample from imposing the Surrogacy assumption itself, which equals E[σ²(Si,Xi,E) · ρ(Si,Xi)(1−ρ(Si,Xi)) / (ρ(Xi)²(1−ρ(Xi))²)].&lt;/p&gt;
&lt;p&gt;Q: Why do multiple surrogates improve on a single surrogate?
A: Multiple surrogates make the Surrogacy assumption more plausible, analogously to how multiple pre-treatment covariates make Unconfoundedness more plausible. If a treatment affects the primary outcome through several distinct causal channels (e.g., math skills, language skills, social skills), any single surrogate capturing only one channel leaves remaining pathways uncontrolled, producing bias. With multiple noisy measures of underlying mediators, even if no single observable fully satisfies Surrogacy, their combination removes more bias than any individual measure. The authors also illustrate via Figure 1.D that multiple surrogates reduce the &amp;ldquo;teaching to the test&amp;rdquo; problem, where improving a single measured surrogate does not translate to improvements in the primary outcome.&lt;/p&gt;
&lt;p&gt;Q: What is the double matching estimator?
A: For a treated unit i with covariates Xi and surrogates Si, the estimator first finds a control match j in the experimental sample based on Xi alone (so Xj ≈ Xi). It then finds, for each of units i and j, the nearest neighbor in the observational sample using both Xi and Si jointly, yielding observed outcomes Yi&amp;rsquo; and Yj&amp;rsquo;. The estimated individual treatment effect is Yi&amp;rsquo;−Yj&amp;rsquo;, and the estimator averages these across the experimental sample. This mirrors standard matching under unconfoundedness but requires two layers of matching — within the experimental sample on pre-treatment variables, and into the observational sample on both pre-treatment variables and surrogates.&lt;/p&gt;
&lt;p&gt;Q: What do the GAIN empirical results show quantitatively?
A: The experimental benchmark for Riverside is a 6.4 pp (s.e. 1.2 pp) increase in mean quarterly employment and a $249 (s.e. $83) increase in mean quarterly earnings, each averaged over 36 quarters. The surrogate index estimator using 6 quarters yields estimates of 0.061 (s.e. 0.006) for employment and $238.8 (s.e. $31.5) for earnings — both within one standard error of the benchmark. All three surrogate-based estimators are within two standard errors of the benchmark at 5 quarters. The naive estimator (direct short-run effect) requires more than 25 quarters to come within two standard errors. The surrogate approach achieves a 35% reduction in standard errors relative to waiting for 9-year outcomes.&lt;/p&gt;
&lt;p&gt;Q: How do the authors validate the Surrogacy and Comparability assumptions empirically?
A: To test Surrogacy, they regress the primary outcome on pre-treatment variables, surrogates up to quarter t, and the treatment indicator in the Riverside experimental sample: a statistically significant treatment coefficient indicates a violation. Point estimates are large and significant for t ≤ 3 quarters; for t ≥ 4 most t-statistics fall below 2, though some remain slightly above 2 with small coefficient magnitudes. To test Comparability, they pool the experimental and observational samples and include an indicator for the experimental sample; significant coefficients on this indicator signal that the surrogate-to-outcome relationship differs across samples. The Comparability violation indicator remains statistically significant even with many surrogate periods, suggesting residual concern.&lt;/p&gt;
&lt;p&gt;Q: How does the paper relate Surrogacy to the mediation and instrumental variables literatures?
A: In mediation, all three variables — treatment, mediator, outcome — are observed in the same sample, and the goal is to decompose the total effect into direct and indirect components; Surrogacy corresponds to the case where the direct effect is zero by assumption. In the IV framework, the surrogate corresponds to the endogenous treatment, but an unobserved confounder between surrogate and outcome violates Surrogacy. The IV exclusion restriction (no direct effect of the instrument on the outcome) is the analog of Surrogacy&amp;rsquo;s requirement of no direct treatment effect on the primary outcome. The paper formalizes these analogies through directed acyclical graphs.&lt;/p&gt;
&lt;p&gt;Q: What is the missing data interpretation of the key assumptions?
A: The joint conditional independence Pi ⊥⊥ Yi ⊥⊥ Wi | Si, Xi implies both Surrogacy and Comparability simultaneously. This is closely related to the Missing at Random (MAR) assumption: the missingness of Yi in the experimental sample and of Wi in the observational sample is determined entirely by the observed surrogates and pre-treatment variables. This &amp;ldquo;data fusion&amp;rdquo; interpretation allows insights from the missing data literature — including semiparametric efficiency results — to apply directly.&lt;/p&gt;
&lt;p&gt;Q: What is the proposed strategy for building credibility across studies?
A: The authors advocate constructing a &amp;ldquo;library&amp;rdquo; of surrogate indices by systematically cataloging, across multiple studies in a given domain, the smallest set of surrogates that reliably matches long-run treatment effects. If six quarters of employment and earnings data are established across multiple job training programs to predict 9-year impacts — as the cross-site GAIN comparisons suggest — then future job training evaluations could credibly report long-run impact estimates after only six quarters. The empirical application is presented as one element of such a library.&lt;/p&gt;
&lt;p&gt;Surrogate Index: The conditional expectation of the primary outcome given surrogate outcomes and pre-treatment variables, estimated in the observational sample — µ(s,x,O) = E[Yi|Si=s, Xi=x, Pi=O]. It aggregates multiple short-term proxy variables into a scalar that, under Surrogacy and Comparability, identifies the average treatment effect on the long-run outcome.&lt;/p&gt;
&lt;p&gt;Prentice Surrogacy Assumption: The condition Wi ⊥⊥ Yi | Si, Xi, Pi=E — the long-run primary outcome is independent of the treatment conditional on the surrogates and pre-treatment variables. Operationally, this requires that all causal pathways from treatment to primary outcome pass through the measured surrogates, with no direct effect remaining.&lt;/p&gt;
&lt;p&gt;Comparability Assumption: Pi ⊥⊥ Yi | Si, Xi — the conditional distribution of the primary outcome given surrogates and pre-treatment variables is identical in the experimental and observational samples. This formalizes the condition under which the observational sample&amp;rsquo;s surrogate-to-outcome relationship can be transported to the experimental sample.&lt;/p&gt;
&lt;p&gt;Surrogate Score: The conditional probability of treatment given surrogates and pre-treatment variables in the experimental sample, ρ(s,x) = Pr(Wi=1|Si=s, Xi=x, Pi=E). Plays an analogous role in the surrogate framework to the propensity score under unconfoundedness: if Surrogacy holds conditional on (Si,Xi), it also holds conditional on the surrogate score alone.&lt;/p&gt;
&lt;p&gt;Sampling Score: The conditional probability of belonging to the experimental sample given surrogates and pre-treatment variables, φ(s,x) = Pr(Pi=E|Si=s, Xi=x). Appears in the surrogate score estimator and influence function to reweight observations from the observational sample toward the experimental sample distribution.&lt;/p&gt;
&lt;p&gt;Double Robustness: The influence function estimator is doubly robust: it remains consistent if either (a) the conditional outcome models µ(s,x,O) and µ(w,x) are correctly specified regardless of the score models, or (b) the propensity score ρ(s,x), propensity score ρ(x), and sampling score φ(s,x) are correctly specified regardless of the outcome models.&lt;/p&gt;
&lt;p&gt;Surrogacy Bias: The bias arising when Surrogacy fails while Comparability holds, equal to E[(µ(Si,1,Xi,E) − µ(Si,0,Xi,E)) · ρ(Si,Xi)(1−ρ(Si,Xi)) / (ρ(Xi)(1−ρ(Xi))) | Pi=E]. It is driven by the product of the direct treatment effect on the outcome (conditional on surrogates) and a measure of how much the surrogates explain treatment assignment.&lt;/p&gt;</description></item><item><title>To Own or to Rent? The Effects of Transaction Taxes on Housing Markets</title><link>https://macropaperwarehouse.com/papers/to-own-or-to-rent-the-effects-of-transaction-taxes-on-housing-markets/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/to-own-or-to-rent-the-effects-of-transaction-taxes-on-housing-markets/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;p&gt;Using sales and leasing transaction records for the Greater Toronto Area (2006–2018), this paper finds three novel effects of a higher property transaction tax: higher buy-to-rent transactions alongside lower buy-to-own transactions despite both being taxed, a lower sales-to-leases ratio, and a lower price-to-rent ratio. The empirical identification exploits the City of Toronto&amp;rsquo;s introduction of a city-level Land Transfer Tax (LTT) in February 2008 — covering only the city and not surrounding GTA municipalities — comparing outcomes on opposite sides of the city border before and after the tax change. A 1.3 percentage-point higher effective LTT rate causes buy-to-rent purchases to rise by 9.3% while owner-occupier purchases fall by 9.6%; the leases-to-sales ratio rises by 26% and the price-to-rent ratio falls by 3.8%. To explain these facts, the paper develops a search model featuring household tenure choice (own vs. rent) subject to heterogeneous credit costs, endogenous homeowner moving decisions, and free entry of buy-to-rent investors; the key mechanism is that the LTT reduces homeowners&amp;rsquo; mobility — because owner-occupiers expect to transact multiple times over their lifetimes and thus bear the tax repeatedly — discouraging entry into ownership and raising demand for rentals, which in turn attracts investor entry even though investors too pay the tax, since investors need not re-transact whenever a tenant vacates. The implied deadweight loss is large at 111% of tax revenue, with more than half of this due to distorting decisions to own or rent; taking the rental market into account accounts for losses equal to 73% of tax revenue, which is two-thirds of the total loss.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the three novel empirical facts documented in this paper?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Using MLS data on both sales and leases in the Greater Toronto Area, the paper documents: (1) a 1.3 pp higher effective LTT rate causes buy-to-rent (BTR) investor purchases to increase by 9.3%, in stark contrast to a 9.6% fall in owner-occupier (buy-to-own) purchases — a divergence that is counterintuitive because both types of buyer are subject to the same tax; (2) the ratio of leases to sales rises by 26%, indicating that rental-market activity increases relative to ownership-market activity; and (3) the price-to-rent ratio falls by 3.8%, meaning house prices decline relative to rents.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the empirical identification strategy and why is it credible?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper uses a geographic regression discontinuity approach comparing communities on opposite sides of the Toronto city border, where the new city-level LTT applies on one side but not the other, in a difference-in-differences framework spanning January 2006–January 2008 (pre-policy) and February 2008–February 2012 (post-policy). The sample is restricted to properties within 3 or 5 km of the boundary. The paper verifies that property characteristics do not differ significantly across the border and that cross-border differences do not change after the LTT, supporting the parallel-trends assumption. The effective LTT rate increase is measured at 1.3 percentage points (assuming 40% first-time buyers, who receive a partial exemption). Buy-to-rent transactions are identified in the MLS data by matching properties that appear in both the sales and leases datasets within an 18-month window following sale.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the intuition for why the LTT raises buy-to-rent investment even though it taxes investors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The mechanism hinges on the asymmetry in expected future transaction costs between owner-occupiers and investors. Owner-occupiers face idiosyncratic match-quality shocks — they periodically want to move to a different property as their circumstances or preferences change — so choosing homeownership means expecting to pay the LTT on each future move. This makes homeownership less attractive relative to renting, reducing household entry into the ownership market and increasing demand for rental properties. Investors (landlords), by contrast, do not need to re-transact in the ownership market simply because a tenant moves out; they retain the property and find a new tenant. Investors therefore face a lower expected frequency of LTT payments per year of property holding than owner-occupiers. As a result, the LTT&amp;rsquo;s negative effect on investor returns is smaller in magnitude than the increase in rental demand it generates. In equilibrium, the price-to-rent ratio falls by enough to attract more BTR investors in spite of the direct cost the tax imposes on them, and investor purchases rise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the LTT affect homeowner mobility (the &amp;ldquo;lock-in&amp;rdquo; effect) and what are its welfare implications within the ownership market?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The LTT makes existing homeowners more tolerant of poor match quality with their current property, since the cost of moving — paying the tax again — has risen. Moving rates therefore decline as households remain in properties for longer on average. To mitigate future tax costs, buyers also become more selective (&amp;ldquo;picky&amp;rdquo;) when initially matching with a property, requiring higher match quality before purchasing. This reduces the frequency of moves but increases the cost and duration of search for new buyers. The welfare consequences within the ownership market are: (a) misallocation of properties among owner-occupiers as average match quality falls because households move less often to renew it; partially offset by (b) higher initial match quality for newly matched buyers, but at the cost of longer search. The LTT-induced distortions within the ownership market account for a loss equal to 38% of tax revenue.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the model&amp;rsquo;s quantitative predictions for the four-year post-reform period, and how do they compare to the empirical estimates?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The model is calibrated to the City of Toronto for 2006–8 (homeownership rate ~54%) and simulated for a 1.3 pp LTT increase, with the mobility hazard rate used as the internal calibration target. For the four-year period following the tax change, the model predicts: owner-occupier transactions fall by 14%; buy-to-rent transactions rise by 35%; the leases-to-sales ratio rises by 15%; the price-to-rent ratio falls by 1.6%; and the homeownership rate falls by 0.23 percentage points. These figures are broadly consistent in magnitude with the estimated LTT effects on the variables not directly targeted in calibration (i.e., the transaction-volume and price-to-rent results from the empirical estimation).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What are the long-run (steady-state) effects and why do they differ from the four-year effects?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Tenure-choice variables are very slow to adjust because annual flows are small relative to housing stocks. In the new steady state, the homeownership rate falls by 2.4 percentage points and the leases-to-sales ratio rises by 23% — both substantially larger than the four-year effects. By contrast, four-year effects on owner-occupier transactions and the price-to-rent ratio are already close to their new steady states. Buy-to-rent transactions overshoot their steady-state level (the four-year rise of 35% compares to a steady-state rise of 5.1%) because of a one-off surge in investor entry as the rental market absorbs the transition; once the stock of rental properties has adjusted, the flow of new buy-to-rent purchases settles lower.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How are the welfare (deadweight) losses decomposed across distortion channels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The new LTT generates a total welfare loss equivalent to 111% of the extra revenue it raises. The decomposition is: distortions to flows between the rental and ownership markets (i.e., the tenure-choice margin) account for a loss equal to 60% of extra revenue; distortions within the rental market account for 13% of tax revenue; distortions within the ownership market (lock-in and match-quality misallocation) account for 38% of tax revenue. The presence of the rental market in the analysis — encompassing both the across-market and within-rental-market channels — accounts for a loss equivalent to 73% of tax revenue, which is two-thirds of the total loss. The paper characterises this as &amp;ldquo;large.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the across-market misallocation mechanism behind the 60% welfare loss from tenure distortions?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: Because owner-occupiers expect to transact more frequently than buy-to-rent investors, the same ad valorem tax falls more heavily on owner-occupiers. In equilibrium, the cost of credit paid by the marginal home-buyer must fall — that is, fewer creditworthy households enter ownership. This displaces some creditworthy households into the rental market, creating a misallocation: properties are allocated away from owner-occupiers (who value them as a place of residence and benefit from match quality) toward rentals intermediated through investors. The welfare loss arises because credit-worthy households who would prefer to own are now renters, and the resource costs of intermediating through investors are incurred unnecessarily.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What policy experiment does the paper consider beyond the baseline LTT analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper studies an alternative tax structure that imposes a higher LTT rate on buy-to-rent investors relative to owner-occupiers, calibrated to nullify the implicit tax advantage investors enjoy under a uniform rate. By raising barriers to investor entry, this differential tax reduces the across-market welfare losses from lower homeownership. However, the paper notes an important caveat: pushing the investor tax rate ever higher to boost homeownership would ultimately produce large welfare costs in the opposite direction, as households who cannot qualify for mortgage credit (uncreditworthy households) would be displaced into the ownership market by a shortage of rental properties. Investors play a socially valuable role in providing housing access to households who cannot or choose not to bear the costs of credit.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What data source is used and why is it unusually well-suited to this analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A: The paper uses Multiple Listing Service (MLS) records from the Toronto Regional Real Estate Board covering the Greater Toronto Area, 2006–2018. The dataset is distinctive in including both sales transactions and lease transactions, allowing the paper to match the two and construct the novel buy-to-rent identifier. MLS data cover approximately 78% of detached-house transactions in the Toronto Land Registry for 2006–2012, and the rental listings capture over 90% of properties listed on alternative platforms. This combination of sales and lease records is what makes it possible to document the three novel empirical facts and to study both the ownership and rental markets jointly.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Buy-to-rent (BTR) transaction:&lt;/strong&gt; In this paper&amp;rsquo;s definition, a sale in the ownership market where the buyer subsequently lists the same property on the rental market within 18 months. BTR buyers are investors/landlords who supply rental housing by purchasing from the ownership market. Distinct from buy-to-own (owner-occupier purchases) and buy-to-sell (flipping) transactions. Identified in the MLS data by matching address and transaction dates across the sales and leases databases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Buy-to-own (BTO) transaction:&lt;/strong&gt; A sale in the ownership market where the buyer occupies the property as a homeowner — the residual category after removing BTR and buy-to-sell transactions from total sales. In the City of Toronto, the fraction of all transactions classified as BTO declined from 89% to 84% between 2006 and 2017.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Effective LTT rate:&lt;/strong&gt; The mean land transfer tax paid as a percentage of the sales price, combining provincial- and city-level taxes, averaged over detached-house transactions in the City of Toronto and adjusted for first-time buyer exemptions. The introduction of the city-level LTT in February 2008 raised the effective LTT rate by 1.3 percentage points (assuming 40% first-time buyers).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Match quality:&lt;/strong&gt; In the paper&amp;rsquo;s search model, the idiosyncratic value a particular household places on a particular property, which evolves stochastically over time. When match quality deteriorates sufficiently, a homeowner wishes to move to a better-matched property. Match quality is the source of the &amp;ldquo;lock-in&amp;rdquo; effect: higher transaction taxes raise the threshold quality decline a household is willing to tolerate before moving, reducing mobility. Because investors are not tied to a specific property in the same way (a tenant moving out does not require the investor to transact), this mechanism falls more heavily on owner-occupiers than on BTR investors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Lock-in effect:&lt;/strong&gt; The reduction in homeowner mobility caused by a higher transaction tax. Homeowners become more tolerant of deteriorating match quality (stay longer in poorly matched properties) and more selective when initially purchasing (require higher match quality to justify the transaction cost). The paper treats this as operating on the intensive margin of homeownership decisions, contrasted with the extensive margin (the own-vs.-rent choice).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Credit cost / credit friction:&lt;/strong&gt; Heterogeneous household-level costs of accessing mortgage finance or credit. In the model, a household must pay a credit cost to enter the ownership market. Households with lower credit costs are more likely to choose homeownership; a higher transaction tax effectively raises the total cost of ownership (since it must be paid on each future move), shifting the margin at which the credit cost equals the net benefit of owning, thereby reducing the equilibrium homeownership rate.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Leases-to-sales ratio:&lt;/strong&gt; The ratio of new lease transactions to sales transactions in the housing market, used as a measure of the relative activity of the rental and ownership markets. A higher ratio indicates more households are being accommodated in the rental market relative to the ownership market. The LTT raises this ratio by 26% in the empirical estimation and 15% in the four-year model simulation, with a steady-state increase of 23%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Price-to-rent ratio:&lt;/strong&gt; The ratio of house prices to rents, used as a summary statistic for the relative cost of owning versus renting. In the paper&amp;rsquo;s model, a fall in the price-to-rent ratio is the price signal that attracts additional buy-to-rent investor entry: as tenure-choice distortions shift more households toward renting, rents rise relative to prices, improving the return to BTR investment until the rental market clears. The LTT lowers the price-to-rent ratio by 3.8% empirically and 1.6% in the four-year model simulation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Deadweight loss as a fraction of tax revenue:&lt;/strong&gt; The welfare cost of the LTT measured in units of tax revenue raised, allowing comparison across tax instruments. The paper finds a deadweight loss of 111% of tax revenue for the Toronto LTT. Prior literature, which focused only on the intensive margin (mobility distortions within the ownership market), missed the across-market and within-rental-market channels that together account for 73 percentage points of this total.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on published open-access version. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item><item><title>Trade with Nominal Rigidities: Understanding the Unemployment and Welfare Effects</title><link>https://macropaperwarehouse.com/papers/trade-with-nominal-rigidities-understanding-the-unemployment-and-welfare-effects/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/trade-with-nominal-rigidities-understanding-the-unemployment-and-welfare-effects/</guid><description>&lt;p&gt;Standard international trade models assume perfectly flexible prices and full employment. This paper introduces nominal rigidities (downward wage rigidity) into a quantitative trade model and asks how this changes the welfare gains from trade liberalization. The central finding is that standard flexible-price estimates overstate the welfare gains by approximately one-third: trade liberalization can generate unemployment in import-competing sectors when wages cannot fall, and the forgone output from these workers is a welfare cost that flexible-price models miss entirely. The paper calibrates the degree of downward wage rigidity to cross-country data on unemployment volatility and shows that the magnitude of the overstatement is robust across a range of calibrations. An analytical decomposition separates the allocative efficiency gains (which standard models capture) from the employment losses (which they miss), clarifying when the overstatement is large versus small.&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary of a forthcoming paper, AI-assisted and human-reviewed. See the linked original for the authoritative claims and full conditions.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;
&lt;hr&gt;
&lt;h2 id="in-depth"&gt;In depth&lt;/h2&gt;
&lt;h3 id="q1-how-does-downward-wage-rigidity-create-unemployment-from-trade-liberalization"&gt;Q1. How does downward wage rigidity create unemployment from trade liberalization?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;When import competition reduces demand for domestic labor in an affected sector, wages cannot fall fast enough to clear the labor market under downward rigidity; workers are priced out of re-employment in the short run, generating cyclical unemployment in the import-competing sector that persists until real wages adjust through inflation erosion.&lt;/strong&gt; The unemployment is involuntary and represents forgone production — a social cost that flexible-price models attribute to zero by assumption.&lt;/p&gt;
&lt;h3 id="q2-why-is-the-overstatement-approximately-one-third"&gt;Q2. Why is the overstatement approximately one-third?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;The one-third figure comes from the ratio of the employment-loss welfare cost to the total flexible-price welfare gain in the paper&amp;rsquo;s benchmark calibration; the rigidity-driven employment loss is large enough relative to the allocative efficiency gain to reduce net welfare gains substantially, but not so large as to eliminate them.&lt;/strong&gt; This ratio is not universal — it depends on the degree of wage rigidity, the sectoral composition of trade exposure, and the speed of labor reallocation — but the paper shows it is robust across plausible parameter ranges.&lt;/p&gt;
&lt;h3 id="q3-does-trade-liberalization-still-generate-net-welfare-gains"&gt;Q3. Does trade liberalization still generate net welfare gains?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Yes, on net the welfare gains from trade remain positive even with downward wage rigidity — the overstatement of one-third means the true gains are positive but smaller than flexible-price models predict, not negative.&lt;/strong&gt; The paper does not argue against trade liberalization but against using flexible-price welfare estimates without adjustment for unemployment costs.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;downward wage rigidity&lt;/strong&gt; : the empirical constraint that nominal wages adjust slowly downward; the key friction this paper adds to the quantitative trade model, generating unemployment in sectors hit by import competition.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;welfare overstatement&lt;/strong&gt; : the gap between the flexible-price welfare gain from trade liberalization (the standard model&amp;rsquo;s prediction) and the true gain once unemployment costs from nominal rigidity are accounted for; approximately one-third in the paper&amp;rsquo;s benchmark calibration.&lt;/p&gt;</description></item><item><title>Traditional Institutions in Modern Times: Dowries as Pensions When Sons Migrate</title><link>https://macropaperwarehouse.com/papers/traditional-institutions-in-modern-times-dowries-as-pensions-when-sons-migrate/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/traditional-institutions-in-modern-times-dowries-as-pensions-when-sons-migrate/</guid><description>&lt;p&gt;This paper asks whether dowry — a transfer from the bride&amp;rsquo;s family to the groom&amp;rsquo;s household upon marriage, prevalent throughout India — enables male migration by providing liquidity that compensates parents for the old-age support they would otherwise lose when sons leave the village. The core friction is that in patrilocal societies, sons traditionally co-reside with parents and share income in old age; migration disrupts this arrangement and introduces income-sharing frictions (limited commitment, information asymmetries, remittance costs). Dowry attenuates this friction by providing a liquid pool of resources at the time of marriage that the son can transfer to parents, lowering the net return to migration needed for a household to find migration optimal.&lt;/p&gt;
&lt;p&gt;The authors develop a collective household model in which parents and sons jointly maximize a Pareto-weighted utility function. The model yields six testable predictions: (1) net marriage transfers can flow in either direction; (2) parents are more likely to take from the dowry when sons migrate; (3) conditional on migration, the probability of parental taking increases in the son&amp;rsquo;s income and in parental bargaining power; (4) aggregate male migration rates are higher in districts with stronger historical dowry traditions; (5) migration responses to a reduction in migration costs are larger in dowry areas, provided migration rates are relatively low; and (6) parents who receive remittances from migrant sons are more likely to have also taken from the dowry.&lt;/p&gt;
&lt;p&gt;To test predictions 1–3 and the remittance auxiliary prediction, the authors collected two original datasets: a Destination Survey of 557 prime-age men in Gurugram (near Delhi) conducted in 2018, of whom 62% were migrants; and an Origin Survey of 2,541 households across 34 districts in six North Indian states conducted in 2020, covering 3,069 sons, 20% of whom were migrants. These are the first quantitative data on property rights over dowry in India. Across the Destination and Origin surveys, 45% and 27% of grooms&amp;rsquo; parents, respectively, took from the dowry on net. Parents of migrants are 27 percentage points (Destination) and 8 percentage points (Origin) more likely to take than parents of non-migrants. For migrant sons, a doubling of the son&amp;rsquo;s occupational score raises the likelihood of parental taking by 19 percentage points; no such relationship exists for non-migrants. When sons report that parents held veto power over the marriage — a proxy for parental Pareto weight — parents of migrant sons are 28 percentage points more likely to be net takers. Parents whose migrant son sends financial remittances are 17 percentage points more likely to have taken from the dowry (coefficient 0.168, SE 0.074).&lt;/p&gt;
&lt;p&gt;To test predictions 4 and 5, the authors use the Ancestral Characteristics data (Giuliano and Nunn 2018) to construct district-level measures of dowry tradition strength, validated against 1999 REDS and IHDS survey data, where a one-unit increase in the historical dowry measure is associated with 81–109% higher gross or net dowry payments. Using the NSS Round 64 migration module (2007–08), they find that the continuous dowry tradition measure is associated with a 2.7–3.7 percentage point increase in migration probability against a mean of 23.8%. For the highway construction identification strategy, the authors exploit the staggered rollout of the Golden Quadrilateral and North-South/East-West corridor (5,846+ km, $71 billion), using modern staggered-entry difference-in-differences estimators (Borusyak et al. 2021; Callaway and Sant&amp;rsquo;Anna 2020). Young men (ages 15–30) in dowry districts exhibit a large, significant increase in out-migration following highway construction with no pre-trends, while the effect for non-dowry males is indistinguishable from zero. Older males (ages 31–45) show no such effect in either group, consistent with the mechanism operating at marriage. The highway effects are concentrated in inter-district, employment-driven migration.&lt;/p&gt;
&lt;p&gt;Scope conditions: the migration-enabling mechanism operates through marriage-age liquidity and patrilocal support norms; results are specific to male migration in India. The model assumes parents and sons act collectively, matching is based on grooms&amp;rsquo; earning potential, and migration frictions cause income-sharing transfers to be infeasible when the son migrates.&lt;/p&gt;
&lt;p&gt;Q: What is the central hypothesis of the paper?
A: The hypothesis is that dowry, by providing a liquid transfer at the time of marriage, allows sons to compensate parents for the old-age support that would otherwise be lost when sons migrate. Because migration introduces frictions that prevent optimal post-migration income sharing between parents and sons, dowry lowers the minimum net return to migration required for the household to find migration optimal, thereby enabling more migration.&lt;/p&gt;
&lt;p&gt;Q: What is the &amp;ldquo;Seeking&amp;rdquo; versus &amp;ldquo;Satisfied&amp;rdquo; distinction in the model, and why does it matter?
A: &amp;ldquo;Satisfied&amp;rdquo; parents are those whose own income plus the maximum feasible marriage transfer (bounded by the bride&amp;rsquo;s endowment dE when dowry is present) is at least as large as their consumption allocation under no migration; migration then Pareto-improves the household for any non-negative return R. &amp;ldquo;Seeking&amp;rdquo; parents have insufficient income plus endowment, so migration reduces their consumption unless the son&amp;rsquo;s return R exceeds a threshold B(d). Because dowry strictly increases the feasible transfer ceiling, B(d=1) ≤ B(d=0), meaning dowry converts some Seeking households into effectively Satisfied ones and lowers the migration threshold for the rest.&lt;/p&gt;
&lt;p&gt;Q: What share of grooms&amp;rsquo; parents actually take from the dowry, and how does migration status affect this?
A: In the Destination Survey (62% migrants), 45% of parents take from the dowry on net; in the Origin Survey (20% migrants), 27% do. Parents of migrants are 27 percentage points more likely to take in the Destination Survey and 8 percentage points more likely in the Origin Survey, consistent with the model prediction that migration increases net taking.&lt;/p&gt;
&lt;p&gt;Q: How does the son&amp;rsquo;s earnings level affect parental taking, and does this pattern hold for non-migrants?
A: For migrant sons, a 100% increase in the son&amp;rsquo;s occupational score increases the likelihood of parents taking by 19 percentage points. For non-migrant sons, the son&amp;rsquo;s occupational score has no meaningful association with taking. This asymmetry is consistent with prediction 3: when migration occurs and the alpha income-sharing channel is shut down, parents with higher-income migrant sons have a higher relative marginal return to consumption and thus take more of the dowry.&lt;/p&gt;
&lt;p&gt;Q: What is the remittance auxiliary prediction, and is it borne out in the data?
A: The model predicts that parents who receive remittances from migrant sons should also be more likely to have taken from the dowry, because households first exhaust the costless dowry transfer before making costly or risky remittances — so remittance-receiving parents are precisely those Seeking households where dowry was already taken. The data confirm this: parents whose migrant son sends financial remittances are 17 percentage points more likely to have taken from the dowry (coefficient 0.168, SE 0.074, significant at 5%) compared to parents of migrants who do not remit.&lt;/p&gt;
&lt;p&gt;Q: How is the district-level dowry tradition measure constructed and validated?
A: The measure merges the Giuliano and Nunn (2018) Ancestral Characteristics data — which uses ethnographic sources to estimate the share of each district&amp;rsquo;s current population belonging to historically dowry-practicing groups — with district-level demographic data. Validation against the 1999 REDS shows that a one-unit increase in the historical dowry measure is associated with 81% higher gross dowry payments and 109% higher net dowry payments without region fixed effects, with a still-significant 79% for net dowry including region fixed effects. Additional validation in the IHDS confirms the historical measure predicts gold payments at marriage (coefficient 0.152 without state fixed effects, 0.185 with state fixed effects).&lt;/p&gt;
&lt;p&gt;Q: What is the association between historical dowry traditions and migration in nationally representative data?
A: Using the NSS Round 64 migration module (2007–08) for males aged 15–45, against a mean migration rate of 23.8%, the continuous dowry measure is associated with a 2.66 percentage point increase in migration probability with no controls (significant at 1%), and 3.67 percentage points with full controls including state fixed effects, year-of-birth fixed effects, caste fixed effects, distance controls, and education controls (significant at 5%).&lt;/p&gt;
&lt;p&gt;Q: What is the highway construction identification strategy, and what does it show?
A: The authors exploit the staggered construction timing of the Golden Quadrilateral and NS-EW highway corridors (beginning 1999, 5,846+ km, $71 billion investment) across Indian districts, assembling new data on district-level construction timing from a complete capital projects database. Using staggered-entry event study estimators robust to heterogeneous treatment effects, they separately estimate highway effects in districts with and without strong dowry traditions. For young men aged 15–30, dowry districts show a large, significant increase in out-migration after highway construction with no pre-trends; non-dowry districts show an effect indistinguishable from zero. Older men (31–45) show no significant effect in either group.&lt;/p&gt;
&lt;p&gt;Q: Why is the age heterogeneity (15–30 vs. 31–45) in the highway results important for the mechanism?
A: The model predicts that dowry&amp;rsquo;s migration-enabling role operates at the time of marriage, when the liquid transfer is made. Men aged 31–45 at the time of highway construction would largely have already been married before the roads were built, so they cannot retroactively benefit from the new liquidity channel. Young men (15–30) are near or below marriage age and can time their marriages and migration decisions in response to reduced migration costs. The null result for older men and the strong result for younger men together confirm the marriage-time liquidity channel.&lt;/p&gt;
&lt;p&gt;Q: Why is the highway effect concentrated in inter-district rather than intra-district migration?
A: The Golden Quadrilateral connects districts to other districts, and the model&amp;rsquo;s mechanism relies on migration creating income-sharing frictions that are more severe at longer distances. Intra-district moves are shorter, less likely to disrupt co-residence and informal support arrangements, and less likely to require the dowry&amp;rsquo;s compensatory role. The concentration of effects in inter-district migration is directly consistent with the proposed channel.&lt;/p&gt;
&lt;p&gt;Q: How does the paper address concerns about pre-trends and robustness in the highway analysis?
A: The event study plots show no pre-trends in migration for either dowry or non-dowry districts prior to highway construction. Robustness checks include additional geographic controls, caste-by-year fixed effects, time-varying cultural controls, the alternative Callaway-Sant&amp;rsquo;Anna estimator, adjusted age distributions, and varying dowry tradition cutoffs at 1%, 10%, and 25% thresholds. Results are stable across these specifications.&lt;/p&gt;
&lt;p&gt;Q: What do the theory and evidence imply about the modern transformation of dowry&amp;rsquo;s function?
A: While dowry historically served as a pre-mortem bequest to the bride adapted to patrilocal society, the modern practice has evolved so that grooms&amp;rsquo; parents frequently capture the transfer. The evidence is consistent with this reallocation of property rights serving a new function: providing parents with a pension substitute when sons migrate and traditional co-residential support breaks down. The authors speculate this functional evolution may partly explain why dowry prevalence has grown despite legal bans, as declining patrilocality creates rising demand for this type of intergenerational transfer mechanism.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings?
A: The paper suggests that policies discouraging dowry — which has many well-documented negative consequences including intimate partner violence, female infant mortality, and adverse resource allocation — may be more effective if paired with expansions of formal pension programs or other mechanisms for old-age support. Without such alternatives, eliminating dowry could inadvertently reduce male migration and associated economic development benefits because the migration-enabling liquidity function of dowry would go unfilled.&lt;/p&gt;
&lt;p&gt;Q: Does the mechanism apply equally to households with both sons and daughters?
A: The theoretical appendix shows that in a household with a son and a daughter, the daughter&amp;rsquo;s dowry outflow partially offsets the son&amp;rsquo;s inflow, reducing but not eliminating the migration-enabling effect. However, the net aggregate effect on male migration remains positive because more sons live in households where sons outnumber daughters, so the dowry inflow for the son exceeds the outflow on average across the population.&lt;/p&gt;
&lt;p&gt;Dowry (in the paper&amp;rsquo;s sense): A transfer from the bride&amp;rsquo;s family accompanying marriage that in the modern Indian context is liquid at the time of the wedding and over which grooms&amp;rsquo; parents frequently exercise property rights — distinct from the traditional anthropological conception of dowry as a pre-mortem bequest to the bride.&lt;/p&gt;
&lt;p&gt;Net Taker: A groom&amp;rsquo;s parent who receives a positive net transfer from the son&amp;rsquo;s dowry (tau &amp;gt; 0 in the model), meaning the flow of dowry resources is from the son/bride&amp;rsquo;s side to the groom&amp;rsquo;s parents.&lt;/p&gt;
&lt;p&gt;Seeking vs. Satisfied parents: Model categories distinguishing parents whose consumption needs can be met from own income plus the maximum feasible marriage transfer (Satisfied, no migration distortion) from those whose needs cannot (Seeking, requiring a minimum migration return threshold B(d) &amp;gt; 0 for migration to be household-optimal).&lt;/p&gt;
&lt;p&gt;Migration friction (alpha = 0 under migration): The modeling assumption that income-sharing transfers between migrant sons and parents are infeasible or prohibitively costly due to limited commitment, information asymmetries, and remittance costs — the friction that dowry&amp;rsquo;s lump-sum transfer at marriage is designed to circumvent.&lt;/p&gt;
&lt;p&gt;Ancestral Characteristics dowry measure: The district-level variable from Giuliano and Nunn (2018) measuring the share of the current population belonging to historically dowry-practicing ethnic groups, used as a proxy for the strength of local dowry traditions.&lt;/p&gt;
&lt;p&gt;Patrilocality: The residential norm in which sons remain with or near their parents after marriage and provide old-age support — the norm whose breakdown via migration creates the income-sharing friction that dowry helps resolve.&lt;/p&gt;
&lt;p&gt;Pareto weight (theta): The weight assigned to parents&amp;rsquo; utility in the collective household problem, capturing parental bargaining power; empirically proxied by whether sons report that parents held veto power over the marriage choice.&lt;/p&gt;</description></item><item><title>Trust and Innovation Within the Firm</title><link>https://macropaperwarehouse.com/papers/trust-and-innovation-within-the-firm/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/trust-and-innovation-within-the-firm/</guid><description>&lt;p&gt;This paper investigates whether and how a CEO&amp;rsquo;s inherited generalized trust enhances innovation within firms, offering a micro-foundation for the well-documented macro-level relationship between societal trust and economic growth. The author argues that trust — by inducing tolerance of failure — encourages researchers to undertake high-risk, explorative R&amp;amp;D rather than safe exploitation of known approaches.&lt;/p&gt;
&lt;p&gt;The empirical foundation is a matched CEO-firm-patent dataset covering 5,753 CEOs at 3,598 US public firms during 2000–2011, encompassing 700,000 patents and over one million inventors. CEO trust is measured as an inherited trait: each CEO&amp;rsquo;s ethnic origin is inferred probabilistically from their last name using de-anonymized US censuses from 1910–1940, and ethnic-origin-specific trust levels are drawn from the US General Social Survey (GSS), restricted to respondents in highly prestigious occupations. The resulting trust measure is the weighted average of ethnic-specific trust scores across a CEO&amp;rsquo;s likely ethnic composition.&lt;/p&gt;
&lt;p&gt;The main empirical strategy exploits within-firm variation across CEO transitions, using firm and year fixed effects to compare patenting before and after a CEO change. The identifying assumption — that the timing of CEO transitions and the new CEO&amp;rsquo;s trust level are not predicted by prior firm patenting trends — is supported by event-study tests showing flat pre-trends. A one-standard-deviation increase in CEO inherited generalized trust (equivalent to the difference between Greek and English averages) is associated with a 6.2–6.3% increase in patent filings, statistically significant at the 1% level. For the average firm, this equals approximately 1.1 additional patents annually, worth roughly $6.8 million. The effect is larger among exogenous transitions (CEO retirement or death): 8.5% in the restricted sample, and an IV estimate of 8.2%. The back-of-envelope calculation suggests this trust-innovation channel could account for approximately 37% (range: 16–58%) of the effect of trust on GDP per capita growth.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central mechanism — risk taking — is tested by examining the distribution of patent quality rather than the mean. Under the risk-taking mechanism, trust should increase the variance of R&amp;amp;D project quality, raising high-quality patents without necessarily increasing low-quality ones. Consistent with this, CEO trust raises only above-median quality patents (measured by forward citation decile), with effects increasing monotonically toward the top decile and no statistically significant effect on below-median patents. Average patent quality as measured by citation-weighted counts or patent value rises by 4–6%. Trust also disproportionately raises the share of explorative patents (those with at least 90% of backward citations outside the firm&amp;rsquo;s existing knowledge stock) by 1 percentage point over a base of 17%.&lt;/p&gt;
&lt;p&gt;The transmission channel is examined using BERT-based classification of nearly one million Glassdoor employee reviews. Under more trusting CEOs, firms exhibit stronger top-down trust sentiment (managers trusting workers), particularly among R&amp;amp;D workers and scientists. The effect materializes within the first two years of a CEO term. Director selection provides an additional transmission mechanism: under more trusting CEOs, newly appointed directors are more trusting and departing directors are less trusting.&lt;/p&gt;
&lt;p&gt;A within-CEO design using bilateral trust (toward researchers in specific countries) with CEO fixed effects addresses omitted CEO characteristics. A one-standard-deviation increase in CEO bilateral trust toward a country is associated with a 5% increase in patents by inventors in that country&amp;rsquo;s R&amp;amp;D lab, controlling for firm-by-year, CEO, and inventor-country fixed effects.&lt;/p&gt;
&lt;p&gt;The effect is strongest when CEO trust is matched to a high-quality researcher pool; in firms with mostly low-quality researchers, high trust may be counterproductive. Trust is also a substitute for R&amp;amp;D knowledge: the effect disappears when the CEO holds a non-MBA graduate degree or has prior R&amp;amp;D experience.&lt;/p&gt;
&lt;p&gt;Q: What is the main research question?
A: The paper asks whether a CEO&amp;rsquo;s generalized trust causes more and higher-quality innovation within the firm, and through what mechanism. It also asks how trust transmits from the CEO to researchers who rarely interact with the CEO directly.&lt;/p&gt;
&lt;p&gt;Q: How is CEO trust measured?
A: CEO trust is measured as an inherited trait using a two-step procedure. First, each CEO&amp;rsquo;s last name is probabilistically mapped to one or more ethnic origins using four de-anonymized US censuses (1910–1940). Second, ethnic-origin-specific trust is computed from GSS respondents in highly prestigious occupations. The CEO&amp;rsquo;s trust measure is the weighted average across ethnic compositions. This measure is shown to be more precise than an individual-level survey measure and approximately 80% as precise as a game-based measure, without introducing attenuation bias.&lt;/p&gt;
&lt;p&gt;Q: What is the baseline patent effect and how large is it economically?
A: A one-standard-deviation increase in CEO inherited trust is associated with a 6.2–6.3% increase in patent filings (statistically significant at 1%). For the average baseline firm, this is approximately 1.1 additional patents per year, valued at roughly $6.8 million. When patent quality is accounted for, the effect rises to 9.9% using citation-weighted patent count and 11.5% using patent value based on excess stock returns on grant dates.&lt;/p&gt;
&lt;p&gt;Q: Is the effect causal? What identification strategy is used?
A: The main strategy uses firm and year fixed effects, identifying the effect from within-firm variation around CEO transitions. Pre-trend tests confirm that neither the timing of CEO changes nor the new CEO&amp;rsquo;s trust level predicts prior firm patenting. Among exogenous transitions (CEO retirements and deaths), the effect is 8.5%, and an IV estimate using the predecessor&amp;rsquo;s trust as instrument yields 8.2% (significant at 10%), both comparable to the baseline.&lt;/p&gt;
&lt;p&gt;Q: What is the macroeconomic significance of the trust-innovation channel?
A: Combining the paper&amp;rsquo;s trust-to-patents estimate (0.042–0.062) with Akcigit et al.&amp;rsquo;s (2017) patents-to-GDP-growth estimate (0.026–0.066) and the cross-country trust-to-growth coefficient (0.007), the trust-innovation channel could explain approximately 37% of the effect of trust on growth, with a plausible range of 16–58%.&lt;/p&gt;
&lt;p&gt;Q: What is the mechanism linking CEO trust to innovation?
A: The conceptual mechanism is that a more trusting manager interprets researcher failure as bad luck rather than bad type, making her more likely to tolerate failure and continue employing the researcher. This increases the researcher&amp;rsquo;s incentive to pursue explorative, high-risk R&amp;amp;D over safe exploitation of known approaches. The mechanism implies a variance-increasing effect on the R&amp;amp;D quality distribution, rather than a mean shift.&lt;/p&gt;
&lt;p&gt;Q: How is the risk-taking mechanism tested against alternative mechanisms?
A: The paper examines the distribution of patent quality by citation decile. Under mean-shifting alternatives (delegation, cooperation, relational contracting), trust should raise all quality brackets. Under risk-taking, trust raises only high-quality patents. The results show CEO trust has monotonically increasing effects from low to high quality deciles, with no statistically significant effect on below-median patents, consistent only with the variance-increasing (risk-taking) mechanism.&lt;/p&gt;
&lt;p&gt;Q: What patent quality measures are used and what do they show?
A: Beyond forward citation deciles, the paper uses explorativeness (patents with at least 90% of backward citations outside the firm&amp;rsquo;s existing knowledge stock), disruptiveness (Funk and Owen-Smith, 2017), patent importance (Kelly et al., 2021), backward citations to scientific literature, and patent scope. Trust increases all these measures with statistically significant positive coefficients. The share of explorative patents rises by 1 percentage point over a base of 17%. Average citation count and patent value increase by 4–6%.&lt;/p&gt;
&lt;p&gt;Q: Does CEO trust raise R&amp;amp;D expenditure?
A: No. The coefficients from regressing R&amp;amp;D expenditure on CEO trust are neither statistically significant nor large enough to explain the innovation effect. The patent effect is also robust to controlling for R&amp;amp;D inputs, suggesting that trust affects the type of projects chosen (consistent with risk-taking) or their realized outcomes, rather than the scale of R&amp;amp;D.&lt;/p&gt;
&lt;p&gt;Q: How does CEO trust transmit to corporate culture?
A: Using BERT-based classification of nearly one million Glassdoor reviews covering 266 firms and 397 CEO terms between 2008 and 2017, the paper finds that CEO trust is associated with stronger top-down trust sentiment (managers trusting workers). The normalized effect of a one-standard-deviation increase in CEO trust on overall trust sentiment is 0.257, on top-down trust 0.531, and on bottom-up trust only 0.141 (statistically insignificant). The effect is strongest among reviewers who identify as scientists, researchers, or engineers, and materializes within the first two years of the CEO term.&lt;/p&gt;
&lt;p&gt;Q: What evidence exists for transmission via director selection?
A: Under more trusting CEOs, newly appointed directors — especially those who remain until the end of the CEO term — are more trusting, and departing directors are less trusting. The average director trust improves during the CEO&amp;rsquo;s term. Because 54% of director hirings and 46% of turnovers occur within the first two years, this change also materializes quickly, consistent with the dynamic pattern of trust culture change.&lt;/p&gt;
&lt;p&gt;Q: What is the within-CEO bilateral trust result and what does it add?
A: Using within-CEO variation in bilateral trust toward researchers from different countries (from Eurobarometer surveys), and controlling for CEO, inventor-country, and firm-by-year fixed effects, a one-standard-deviation increase in CEO bilateral trust toward a country is associated with a 5% increase in patents by inventors in that country&amp;rsquo;s R&amp;amp;D lab. This design allows CEO fixed effects, ruling out unobserved CEO-level confounders such as management style or R&amp;amp;D ability.&lt;/p&gt;
&lt;p&gt;Q: When is CEO trust counterproductive?
A: CEO trust is beneficial only when matched to a high-quality researcher environment. Using residual patent output (controlling for observable firm and CEO characteristics) as a proxy for researcher quality, the effect of CEO trust on patents, patent output per R&amp;amp;D dollar, and future sales/employment/TFP is significant only among firms in the top two quintiles of researcher quality. In firms with mostly low-quality researchers, high CEO trust may be counterproductive by failing to screen out bad researchers.&lt;/p&gt;
&lt;p&gt;Q: How does the trust effect vary by industry and CEO background?
A: The effect is ubiquitous across industries but especially pronounced in pharmaceutical and ICT firms. The timing varies: it manifests quickly in ICT (short R&amp;amp;D lag) and more slowly in pharma (long R&amp;amp;D horizon). The effect vanishes when the CEO holds a non-MBA graduate degree or has prior R&amp;amp;D experience, suggesting trust is a substitute for direct knowledge of R&amp;amp;D processes.&lt;/p&gt;
&lt;p&gt;Q: Are the results robust?
A: Yes. The paper reports 14 categories of robustness checks including alternative patent transformations, alternative trust measures (LASSO, World Value Survey, Global Preference Survey, alternative GSS questions), alternative standard error clustering, Poisson count models, restriction to granted patents, exogenous transition subsamples, modern difference-in-differences estimators (de Chaisemartin et al., 2024; Sun and Abraham, 2021; Callaway and Sant&amp;rsquo;Anna, 2021; Borusyak et al., 2024), and leave-one-ethnicity-out. The baseline result is stable across all these checks.&lt;/p&gt;
&lt;p&gt;Inherited generalized trust: The paper&amp;rsquo;s measure of a CEO&amp;rsquo;s trust disposition, defined as the probability-weighted average of ethnic-origin-specific trust levels (from the GSS) based on the CEO&amp;rsquo;s likely ethnic composition inferred from their last name and historical census records. It captures the culturally transmitted component of trust, distinct from individual-level noise.&lt;/p&gt;
&lt;p&gt;Explorative R&amp;amp;D: In the paper&amp;rsquo;s framework (building on March, 1991), research activities that involve testing untested paths, carrying high risk of failure but high potential for innovation, as opposed to exploitation of well-known approaches with low failure risk. The paper argues CEO trust encourages researchers to shift toward exploration.&lt;/p&gt;
&lt;p&gt;Tolerance of failure: A manager&amp;rsquo;s propensity to attribute a researcher&amp;rsquo;s failure to bad luck rather than bad type. Under the paper&amp;rsquo;s mechanism, a more trusting manager gives greater weight to bad luck, making her more likely to retain the researcher after failure, thereby incentivizing risk taking.&lt;/p&gt;
&lt;p&gt;Top-down trust: In the paper&amp;rsquo;s BERT-based classification of Glassdoor reviews, the direction of trust from managers toward workers (as opposed to bottom-up trust from workers toward managers). The paper finds CEO trust primarily raises top-down trust sentiment, especially among R&amp;amp;D workers.&lt;/p&gt;
&lt;p&gt;Patent explorativeness: A patent quality measure defined as the share of its backward citations that fall outside the firm&amp;rsquo;s existing knowledge stock; patents are classified as explorative if at least 90% of backward citations are outside that stock. The paper uses this as a direct measure of explorative R&amp;amp;D output.&lt;/p&gt;
&lt;p&gt;Bilateral trust: CEO d&amp;rsquo;s directed trust toward individuals from country c, computed analogously to inherited generalized trust but using Eurobarometer survey data on country-pair trust attitudes among European-origin populations. Used in the within-CEO design to control for CEO fixed effects.&lt;/p&gt;
&lt;p&gt;Variance-increasing mechanism: The paper&amp;rsquo;s characterization of the risk-taking channel, in which CEO trust raises the variance (not the mean) of the R&amp;amp;D project quality distribution by encouraging researchers to pursue high-risk, high-reward exploration. Empirically identified by the pattern that trust raises only above-median quality patents with monotonically increasing effects toward the top decile.&lt;/p&gt;</description></item><item><title>Unconventional monetary policy spillovers and the (in)convenience of Treasuries</title><link>https://macropaperwarehouse.com/papers/unconventional-monetary-policy-spillovers-and-the-inconvenience-of-treasuries/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/unconventional-monetary-policy-spillovers-and-the-inconvenience-of-treasuries/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper asks why unconventional monetary policy (UMP) spillovers from the European Central Bank (ECB) to the U.S. Treasury yield curve vary so substantially over time, and whether the time-varying &amp;ldquo;convenience&amp;rdquo; of Treasuries — their non-pecuniary premium as the world&amp;rsquo;s preeminent safe asset — can explain that variation. The core claim is that a declining convenience yield on Treasuries makes them more substitutable with other safe sovereign bonds, thereby amplifying the portfolio-balance channel through which foreign large-scale asset purchases (LSAPs) depress U.S. term premia.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use high-frequency identification of ECB monetary policy surprises following Altavilla et al. (2019), defined as the first principal component of intraday changes in 1-, 3-, 6-, 12-, and 24-month euro OIS rates plus 5- and 10-year German and French bond yields, measured in the 10-20 minute window bracketing each ECB decision press conference. Surprises are normalized so that one unit raises the 24-month euro OIS by 10 basis points. The sample runs from March 2001 to December 2023, covering approximately 265-268 ECB announcement dates. U.S. zero-coupon Treasury yields come from Gürkaynak et al. (2007); the yield is decomposed into an expected short-rate path and a term premium using the shadow-rate term structure model (SRTSM) of Wu and Xia (2016). The convenience yield on Treasuries is proxied by the spread between the 10-year Treasury yield and the maturity-matched overnight index swap (OIS) rate, so that a positive (and rising) spread indicates declining convenience. Structural breaks in the convenience yield are identified via the Bai-Perron test.&lt;/p&gt;
&lt;p&gt;The empirical strategy has three main components: (i) 700-business-day rolling regressions of Treasury yields and their decomposition on ECB surprises to document time variation; (ii) interaction regressions (following equation 5/9) that condition the ECB shock effect on lagged convenience-yield proxies, net Treasury supply, intermediary balance-sheet constraints (proxied by G10 covered-interest-parity deviations), and inflation-anchoring indicators; and (iii) a policy decomposition following Swanson (2021) that decomposes ECB surprises into &amp;ldquo;target,&amp;rdquo; &amp;ldquo;forward guidance,&amp;rdquo; and &amp;ldquo;LSAP&amp;rdquo; components. These empirical findings are rationalized in a two-country preferred-habitat model, extending Gourinchas, Ray, and Vayanos (in press) (GRV) by allowing the demand-slope parameter governing investor price elasticity to vary with the convenience yield. Functional derivatives and Malliavin calculus are used to characterize dynamic impulse responses to elasticity shifts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Rising spillovers post-GFC, concentrated at long maturities.&lt;/strong&gt; Rolling regressions show that ECB-to-U.S. spillovers were statistically indistinguishable from zero during the conventional-policy era but grew significantly after 2010, well before the ECB&amp;rsquo;s Expanded Asset Purchase Programme (EAPP) launched in 2015 and before &amp;ldquo;whatever it takes&amp;rdquo; (summer 2012). Spillovers began to dissipate not when ECB purchases ended (March 2022) but when the Fed announced tapering in November 2021 — consistent with the convenience channel rather than mere co-movement in LSAP volumes. A Bai-Perron test detects five structural breaks in the relationship between ECB surprises and 10-year Treasury yields.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Term-premium dominance, amplified by inconvenient Treasuries.&lt;/strong&gt; At average convenience-yield levels, a one-standard-deviation ECB loosening shock (lowering the 24-month euro OIS by 10 basis points) reduces the 10-year Treasury yield by approximately &lt;strong&gt;4.4 basis points&lt;/strong&gt; (column 5, Table 2). When the Treasury convenience yield is one standard deviation below its historical average (i.e., Treasuries are less convenient), the spillover increases by &lt;strong&gt;1.64 basis points&lt;/strong&gt;, making the total effect approximately &lt;strong&gt;6.1 basis points&lt;/strong&gt; — a shift from the bottom 20th to below the 12th percentile of the unconditional distribution of daily Treasury yield changes. This amplification operates entirely through the term premium; the expected path of short rates shows no statistically significant sensitivity to the convenience yield interacted with ECB shocks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Net Treasury supply amplification.&lt;/strong&gt; Conditional on the net publicly available U.S. debt stock (Treasury debt less Fed holdings, as a percent of GDP), a one-standard-deviation ECB shock at average supply reduces the 10-year yield by approximately &lt;strong&gt;3.9 basis points&lt;/strong&gt;; when net supply is one standard deviation above its historical average (approximately 7.6 percentage points of GDP), the same shock generates a &lt;strong&gt;5.35 basis-point&lt;/strong&gt; decline — a 50-percent amplification (Table 5, column 5).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Intermediary constraints amplification.&lt;/strong&gt; Conditioning on the first principal component of G10 CIP deviations against the dollar (a proxy for intermediary balance-sheet tightness), a CIP deviation one standard deviation above average amplifies the ECB spillover from approximately &lt;strong&gt;3.9 basis points to 6.2 basis points&lt;/strong&gt; (Table 7).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Inflation anchoring.&lt;/strong&gt; Periods when inflation expectations lie outside the interquartile range of the historical distribution are associated with larger spillovers to 10-year Treasury yields, an effect that is statistically significant both above the 75th and below the 25th percentile of expectations, with point estimates of the interaction coefficient reaching approximately &lt;strong&gt;5.0-5.3 basis points&lt;/strong&gt; (Table 6).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Policy asynchronicity.&lt;/strong&gt; Spillovers are especially pronounced when the Federal Reserve is tightening while the ECB is easing. The rolling regressions show term-premium spillovers become dominant (relative to expected-path spillovers) post-2014, coinciding with U.S. normalization. The calibrated model shows that, during policy asynchronicity combined with lower convenience, the home short-rate tightening is partially offset by capital inflows induced by foreign QE, with the attenuation especially pronounced at intermediate and long maturities and persistent across multiple periods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Alternative channels ruled out.&lt;/strong&gt; Horse-race regressions against the VIX, MOVE index, Economic Policy Uncertainty (EPU) index, Monetary Policy Uncertainty (MPU) index, and 30-day EUR/USD spot variance show none of these candidates displaces the convenience channel. Short-rate-risk decompositions (Bundick et al. 2017) and equity-orthogonal risk premium shocks (Leombroni et al. 2021) cannot explain the post-Taper Tantrum timing pattern of rising term-premium spillovers.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;All empirical results apply to ECB-to-U.S. spillovers; the paper explicitly leaves Bank of England-to-U.K. Gilt spillovers for future work.&lt;/li&gt;
&lt;li&gt;The portfolio-balance amplification through convenience is specific to unconventional monetary policy (LSAP shocks); target and forward-guidance components drive spillovers through different channels (expected short-rate path) and do not exhibit the same convenience-contingent amplification.&lt;/li&gt;
&lt;li&gt;The mechanism operates through preferred-habitat investors demanding sovereign-grade credit; the Bund convenience yield does not amplify U.S. spillovers, consistent with Bunds being an imperfect representation of the full portfolio requiring substitution under ECB capital-key-based purchases.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: How do the authors measure ECB monetary policy surprises, and why do they prefer this measure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: Surprises are the first principal component of intraday changes in 1-, 3-, 6-, 12-, and 24-month euro OIS rates plus 5- and 10-year German and French bond yields, measured from 10-20 minutes pre-announcement to 10-20 minutes post-press conference. This cross-section of yields is preferred because it summarizes shocks to the overall stance of policy both at and away from the effective lower bound, including effects on different parts of the yield curve. The composite measure therefore subsumes both conventional rate actions and unconventional (LSAP, forward guidance) dimensions. Surprises are normalized so one unit raises the 24-month euro OIS by 10 basis points.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: What is the key empirical fact about the timing of spillover emergence and dissipation?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: Rolling regressions show ECB spillovers to U.S. Treasury yields became statistically significant when the rolling window began integrating observations starting in approximately 2010 — substantially before the ECB&amp;rsquo;s EAPP (2015) and even before &amp;ldquo;whatever it takes&amp;rdquo; (summer 2012). Moreover, spillovers began to dissipate not when the ECB&amp;rsquo;s Pandemic Emergency Purchase Programme ended (March 2022) but when the Fed announced tapering in November 2021. This timing pattern is inconsistent with a simple &amp;ldquo;both central banks doing QE simultaneously&amp;rdquo; explanation and instead points to the importance of Federal Reserve balance sheet behavior for the convenience of Treasuries.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do the authors decompose the Treasury yield, and what does the decomposition reveal about the channel of transmission?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: Following standard term-structure decomposition, the n-year yield equals the expected path of short-term rates over the maturity plus a maturity-specific term premium. Rolling regressions on this decomposition show that term-premium spillovers dominate expected-path spillovers, especially post-2014 when the Federal Reserve is out of sync with other advanced economies. Early ECB UMP spillovers showed a more even mix of expected-path and term-premium effects; later spillovers loaded much more heavily on the term premium. This is consistent with the portfolio balance channel — LSAPs remove duration risk and compress term premia, and this effect transmits internationally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How is the convenience yield proxied, and why does the paper use this proxy in particular?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: The authors use the spread between the sovereign bond yield and the maturity-matched overnight index swap rate (Y − OIS), expressed so that a larger spread (sovereign yield higher than OIS) reflects less convenience. Prior to the GFC, Treasury yields ran below swap rates (negative spread, high convenience); post-GFC, the spread reversed and turned positive, reflecting deterioration in Treasury specialness. This proxy is preferred because it captures the relative convenience as priced by the marginal investors the model focuses on — those with sovereign credit quality preferences and arbitrageurs — rather than broader measures such as the Treasury-to-corporate spread.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the quantitative impact of convenience yield variation on the size of ECB spillovers to U.S. yields?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: In the most conservative specification (Table 2, column 5), an ECB loosening shock that lowers 24-month euro OIS by 10 basis points reduces the 10-year Treasury yield by 4.4 basis points when the convenience yield is at its historical average. When the convenience yield falls one standard deviation below average (Treasuries are less convenient), the spillover increases by 1.64 basis points to approximately 6.1 basis points. A one-standard-deviation change in 10-year Treasury yields in the sample is 5.86 basis points; the 4.4 bp response falls in the bottom 20th percentile of unconditional daily yield changes, while the 6.1 bp response falls below the 12th percentile.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Does the amplification of spillovers from ECB shocks by Treasury inconvenience operate through the term premium or the expected short-rate path?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The amplification operates entirely through the term premium. In Table 2, columns 7 and 8, the interaction coefficient between the ECB shock and the convenience yield proxy is positive and statistically significant for the 10-year term premium but is not statistically different from zero for the expected path of short rates. The authors interpret this as confirming the portfolio balance channel: displaced Bund investors substitute into Treasuries, raising Treasury prices and compressing term premia, with no mechanical connection to market participants&amp;rsquo; updating of expected future Federal Reserve policy rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does net Treasury supply interact with the size of ECB spillovers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: Net U.S. Treasury supply (debt outstanding as a percent of GDP, less Fed holdings) is strongly positively correlated with the swap spread, confirming the link between supply and convenience. Interaction regressions (Table 5) show that a one-standard-deviation ECB shock at average net supply reduces 10-year yields by 3.9 basis points. When net supply is one standard deviation above average (approximately 7.6 percentage points of GDP), the same shock generates a 5.35 basis-point decline — roughly a 50 percent amplification. The point estimates suggest this operates primarily through term premia, though those interaction coefficients are statistically insignificant in the term premium specification.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do intermediary balance-sheet constraints relate to Treasury convenience and ECB spillover amplification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: The authors follow Du, Hébert, and Huber (2023) in using deviations from covered interest parity (CIP) among G10 currencies against the dollar as a proxy for the shadow cost of intermediary balance-sheet constraints. When CIP deviations are at historical average, the ECB spillover to 10-year Treasury yields is approximately 3.9 basis points; when CIP deviations are one standard deviation above average, the spillover rises to approximately 6.2 basis points. The authors also use the plausibly exogenous variation from quarter-end &amp;ldquo;window dressing&amp;rdquo; (per Correa, Du, and Liao 2020): LSAP-type ECB surprises landing near quarter-end generate larger spillovers to the term premium, and the further into the quarter an announcement occurs, the larger the LSAP shock&amp;rsquo;s effect on the term premium — consistent with balance-sheet constraints amplifying the portfolio balance channel.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: What is the theoretical model, and what is the key innovation relative to the baseline GRV framework?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: The paper extends the two-country preferred-habitat model of Gourinchas, Ray, and Vayanos (in press), in which segmented investors demand bonds of specific maturities and currencies while capital-constrained global arbitrageurs partially bridge the segmentation. The key innovation is allowing the demand-slope parameter α_j(τ) — which in GRV is fixed and governs how inelastic investors are with respect to price — to vary over time as a function of the convenience yield. When Treasuries are special (high convenience), α_H(τ) is large, demand is inelastic, and foreign shocks have limited pass-through. When convenience falls, α_H(τ) shrinks, demand becomes more elastic, investors reallocate more aggressively in response to yield differentials, and U.S. term premia respond more strongly to ECB purchases. Functional derivatives and Malliavin calculus are used to characterize both instantaneous and dynamic amplification effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What does the calibrated model predict about the maturity structure of spillover amplification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: In the calibration exercise (Figure 4), the elasticity perturbation is modeled as a smooth function (transformed Cauchy distribution) centered at the 10-year maturity, and the ECB QE shock is a purchase concentrated at the 5-year maturity amounting to 10 percent of euro-area GDP. The marginal change in the home yield impulse response (the quantity ∂²_{α_H,b} log P^τ_{Hs}) is positive across nearly all maturities and horizons, but is most pronounced around the 5-year maturity and during the first few periods after the shock — where the ECB purchase profile and the demand perturbation are most closely aligned in tenor. Amplification effects are persistent across horizons due to the dynamic multiplier in Theorem 3.1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How does the model rationalize the 2019 yield curve inversion?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: In August 2019, the 10-year Treasury yield fell below short-term rates despite a robust domestic labor market, while the Fed was raising rates and the ECB remained accommodative. The model&amp;rsquo;s asynchronicity exercise (Section 3.3) shows that combining a home short-rate increase with ongoing foreign QE and a contemporaneous decline in Treasury convenience produces attenuated or even reversed yield curve responses. More elastic investors facing a flatter demand curve shift into longer-term Treasuries — whose relative yields remain attractive globally — resulting in a yield-curve inversion driven not by recession expectations but by asymmetric monetary policy and a time-varying convenience premium.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: Do alternative explanations — risk sentiment, policy uncertainty, exchange rate volatility — explain the time variation in ECB spillovers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: No. Horse-race regressions in Table 9 condition the ECB shock on lagged VIX, MOVE index, Economic Policy Uncertainty (Baker et al. 2016), Monetary Policy Uncertainty (Husted et al. 2020), and 30-day EUR/USD spot variance. None of these measures displaces the baseline convenience-yield interaction, which remains statistically significant across all specifications. Elevated EPU is associated with smaller spillovers (consistent with uncertainty impairing substitution), but this does not reduce the magnitude or significance of the convenience-yield interaction. Exchange-rate variance does not alter spillover size. A rolling regression decomposing the term premium into a short-rate-uncertainty component (Bundick et al. 2017) and a residual shows the empirical pattern is more consistent with the residual — not the short-rate-volatility channel. An equity-orthogonal risk premium shock (Leombroni et al. 2021) explains some term premium effects in the early GFC period (2008-2012) but cannot rationalize the post-Taper Tantrum pattern of growing term-premium spillovers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: How does the Swanson (2021) decomposition confirm the portfolio balance channel?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A13: Following Swanson (2021), the authors decompose ECB surprises into a &amp;ldquo;target surprise&amp;rdquo; (change in 3-month OIS futures), a &amp;ldquo;forward guidance surprise&amp;rdquo; (residual from projecting 24-month futures onto the target surprise), and an &amp;ldquo;LSAP surprise&amp;rdquo; (residual from projecting French and German 10-year bond yields onto target and forward guidance). In the full sample (Table 3), LSAP shocks drive spillovers to U.S. yields exclusively at higher maturities and exclusively through the term premium; they have no statistically significant impact on the expected path of short rates. Conditioning LSAP shocks on the convenience yield (Table 4, panel c) shows that it is specifically LSAP-type announcements combined with Treasury inconvenience that generate larger medium- and long-term term-premium spillovers, confirming the portfolio balance mechanism.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: What are the implications for fiscal and monetary policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A14: The paper argues that the persistently low long-term rates and yield curve inversions observed between the GFC and the COVID-19 pandemic were driven partly by ECB LSAPs amplified by U.S. quantitative tightening, which increased net Treasury supply, reduced Fed absorption, constrained dealer balance sheets, and lowered Treasury convenience. Simultaneously, U.S. monetary tightening raised short-term rates while ongoing ECB easing depressed long rates, reshaping the yield curve in a manner consistent with the model. More broadly, the effectiveness of conventional domestic monetary policy tightening is attenuated when the convenience yield is compressed and foreign QE is ongoing — not because the short rate fails to move, but because more elastic investors reallocate around it. This suggests policy asynchronicity, combined with declining convenience, creates a constraint on monetary independence that may require more forceful or coordinated policy action.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Convenience yield (Treasury convenience premium)&lt;/strong&gt;
The non-pecuniary value that investors derive from holding U.S. Treasury securities over and above cash flows and credit risk — arising from their deep and liquid markets, broad regulatory compatibility, high-quality collateral function, and reserve-currency status. Operationalized in this paper as the spread between the n-year Treasury yield and the maturity-matched overnight index swap (OIS) rate; a positive and rising spread indicates declining convenience, not increasing yield risk.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Portfolio balance channel (of unconventional monetary policy transmission)&lt;/strong&gt;
The mechanism by which large-scale asset purchases by one central bank displace investors from their target allocations, inducing them to substitute into other assets — including foreign sovereign bonds — thereby compressing yields and term premia in those markets. Distinguished from the signaling/expected-path channel in that it operates through changes in duration risk (term premia) rather than revisions to expected future short rates, and is unique to UMP because it targets long-duration assets.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Preferred habitat investors&lt;/strong&gt;
Investors with persistent, institutionally determined demand for bonds of specific maturities and issuers (e.g., insurance companies, pension funds), arising from regulatory constraints, risk management practices, or balance sheet matching. Their demand is modeled as relatively price-inelastic when assets command a convenience premium, and more elastic when that premium erodes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Demand-slope parameter α_j(τ)&lt;/strong&gt;
In the extended GRV preferred-habitat model, the parameter governing the price elasticity of preferred-habitat investor demand for country-j bonds of maturity τ. Large values imply inelastic demand (strong habitat preferences), small values imply elastic demand and greater cross-border substitutability. The paper&amp;rsquo;s key innovation is treating this parameter as time-varying — specifically, as a function of the observed Treasury convenience yield rather than a fixed structural constant.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy asynchronicity&lt;/strong&gt;
The condition in which the Federal Reserve is tightening monetary policy (raising rates or conducting quantitative tightening) while other advanced-economy central banks (specifically the ECB) are simultaneously easing through LSAPs. The paper argues that asynchronicity interacts with a declining convenience yield to amplify ECB spillovers to U.S. term premia and attenuate the effectiveness of Federal Reserve tightening at the long end of the yield curve.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Swap spread (as inconvenience proxy)&lt;/strong&gt;
The spread of the sovereign bond yield over the maturity-matched OIS rate (Y − OIS). Expressed so that a larger positive value indicates greater Treasury inconvenience. Prior to the GFC, 10-year Treasury yields ran below swap rates (negative spread); post-GFC, this relationship reversed, with the spread turning persistently positive and exhibiting structural breaks consistent with Bai-Perron tests.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exorbitant privilege&lt;/strong&gt;
The benefit the United States accrues from the global dominance of its sovereign debt and currency, which structurally insulates U.S. financial markets from foreign monetary policy shocks through inelastic global demand for Treasuries. The paper argues this insulation is not structural but endogenous and state-dependent: erosion of exorbitant privilege — operationalized as a declining convenience yield — substantially increases U.S. vulnerability to foreign monetary shocks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gâteaux/Malliavin functional derivative (as used in the model)&lt;/strong&gt;
Mathematical tools used to characterize how the impulse response function of the yield curve to policy shocks changes when the demand-slope parameter α_k(τ) is perturbed. The mixed Gâteaux differential ∂²_{α_k,b} log P^(τ)_{js} captures both the instantaneous amplification (direct pass-through increase) and the intertemporal propagation (dynamic multiplier) of a foreign policy shock under lower convenience, enabling a tractable decomposition of state-contingent spillover magnitudes across maturities and horizons.&lt;/p&gt;</description></item><item><title>Vanguard: Black Veterans and Civil Rights After World War I</title><link>https://macropaperwarehouse.com/papers/vanguard-black-veterans-and-civil-rights-after-world-war-i/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/vanguard-black-veterans-and-civil-rights-after-world-war-i/</guid><description>&lt;p&gt;This paper provides the first causal evidence on how military service shaped Black civil rights activism in the aftermath of World War I. The research question is whether random induction into the segregated National Army caused Black men to join the nascent NAACP and become prominent community leaders during the New Negro era. The authors leverage the WWI draft lottery — in which each registrant&amp;rsquo;s unique serial number was drawn from a bowl to determine induction order — as an instrument for military service, a source of exogenous variation not previously exploited in the literature.&lt;/p&gt;
&lt;p&gt;To support this analysis, Ang and Chinoy construct an unusually rich dataset by digitizing nearly one million Black draft registration cards from the first registration (June 17, 1917), linking them through the 1930 full-count census to 233,517 NAACP member observations across 227 branches from 1912 to 1940, and supplementing with Veterans Administration records, Army Transport Service passenger lists, and biographical dictionaries of prominent African Americans. The instrument — serial number percentile within draft board and race (SNP%) — is validated against all observed pre-draft registrant characteristics and yields a first-stage F-statistic of 1,051 in the preferred specification.&lt;/p&gt;
&lt;p&gt;The main finding is that Black men randomly induced to serve in the military were nearly three times more likely to join the NAACP than observably similar registrants from the same draft board (TSLS coefficient 0.0219, se = 0.0049, against a sample mean NAACP participation rate of 0.8%). The authors estimate that the draft induced more than 10,000 Black men to join the NAACP in total. Military service also raised the probability of appearing in biographical dictionaries of historically prominent African Americans by a factor of roughly 1.6 (TSLS coefficient 0.0027, se = 0.0012, sample mean 0.17%). These results are robust to alternative instruments, flexible polynomial specifications of SNP%, state-year fixed effects, and alternative veteran-status measures from VAMI and ATS records. They are also not explained by differential residential mobility: adding controls for interstate and North-South migration leaves the main coefficient essentially unchanged (0.0217-0.0218).&lt;/p&gt;
&lt;p&gt;In contrast, TSLS estimates for all socioeconomic outcomes — literacy, home ownership, employment, census-predicted income, actual 1940 income, and educational attainment — are small and insignificant, ruling out human capital acquisition as a mechanism. Club involvement measured in the census is likewise unaffected, indicating that NAACP membership reflects specifically civil rights activism rather than generically greater social participation.&lt;/p&gt;
&lt;p&gt;The mechanism the paper identifies is experienced discrimination. Effects on NAACP participation increase monotonically with the racial gap in induction rates across draft boards (significant at p = 0.01). Effects are large and significant for men assigned to camps that restricted Black soldiers&amp;rsquo; access to military training (coefficient 0.0351, se = 0.0104) and to officer promotion (coefficient 0.0360, se = 0.0111), and are large for men in both restriction types simultaneously (coefficient 0.0367, se = 0.0114). In contrast, men attending less discriminatory camps show small and insignificant effects. Among the two all-Black combat divisions, NAACP participation is highest for veterans of the 92nd Division — subjected to constant racial abuse under U.S. command — and lower for the 93rd Division, which served under more hospitable French command. Previously unstudied veteran surveys from Virginia and Connecticut corroborate this narrative: respondents from camps with training and promotion restrictions were more than twice as likely to mention racial injustice, and mentions of injustice were more predictive of postwar civic engagement than any other survey theme.&lt;/p&gt;
&lt;p&gt;The scope of the paper is Black male registrants in the first WWI draft registration (men aged 21-30 as of June 17, 1917), linked to a sample of approximately 300,000 in the 1930 census. Effects are attenuated for men from counties with greater racial hostility — proxied by Confederate state status, Confederate monument density, and county lynching rates — consistent with the interpretation that activism was more feasible in less repressive environments.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification strategy and why was it not feasible to use it before this paper?
A: The paper uses each Black registrant&amp;rsquo;s serial number percentile within his draft board and racial group (SNP%) as an instrument for WWI military service. Unlike the WWII and Vietnam drafts, which used birthday-based lotteries, the WWI lottery assigned induction order by drawing unique serial numbers from a bowl, making serial number rank the source of quasi-random variation. This source had never been exploited in the literature, partly because the serial numbers had to be hand-captured from digitized draft card images.&lt;/p&gt;
&lt;p&gt;Q: How strong is the first stage, and was the lottery truly random?
A: The first-stage F-statistic is 1,051, and a ten-percentile decrease in SNP% is associated with a 34.5 percentage point increase in the probability of serving. Bivariate serial numbers show some non-random patterns — nine of 13 pre-draft characteristics correlate with raw SN% — likely because some Southern boards inflated numbers for white registrants. Conditioning on board fixed effects and using SNP% within board-race cells eliminates these correlations; Panel B of Appendix Table A1 shows the largest standardized coefficient falls to 0.006.&lt;/p&gt;
&lt;p&gt;Q: What is the magnitude of the effect on NAACP membership and how does the causal estimate compare to a naive OLS?
A: The TSLS coefficient is 0.0219 (se = 0.0049) against a sample mean of 0.8%, implying roughly a threefold increase in NAACP membership. The OLS estimate of 0.0116 understates the causal effect, consistent with the marginal man induced by the lottery being observationally weaker than infra-marginal volunteers.&lt;/p&gt;
&lt;p&gt;Q: Does the effect reflect simply that veterans moved to Northern cities where NAACP branches were more accessible?
A: No. Adding indicators for interstate migration and North-South migration leaves the TSLS coefficient essentially unchanged at 0.0218 and 0.0217, respectively. The Great Migration channel is thus not the operative mechanism.&lt;/p&gt;
&lt;p&gt;Q: Did military service improve Black veterans&amp;rsquo; economic outcomes?
A: TSLS estimates for literacy, home ownership, employment, census-predicted income, actual 1940 income, and educational attainment are all small and statistically insignificant. This contrasts sharply with evidence on Black veterans of WWII and Korea (Greenberg et al., 2022) and is consistent with the documented absence of meaningful postwar benefits or training for Black WWI soldiers.&lt;/p&gt;
&lt;p&gt;Q: If it was not human capital or migration, what mechanism does the paper establish?
A: The primary mechanism is exposure to institutional discrimination during military service. Three distinct empirical patterns converge: (1) effects increase monotonically with draft board racial disparities in induction rates; (2) effects are large and significant for men at camps that denied training and promotion, and near zero for men at less discriminatory camps; (3) veteran survey mentions of racial injustice are more common among men from discriminatory camps and are more predictive of postwar NAACP membership than any other survey theme.&lt;/p&gt;
&lt;p&gt;Q: How do the two all-Black combat divisions differ in their postwar NAACP participation, and what does this reveal?
A: Veterans of the 92nd Division, who fought under U.S. command amid constant racial abuse, show the highest NAACP participation rates. Veterans of the 93rd Division, who fought under French command and were received with relative hospitality, show lower (though not statistically significantly lower) participation. Since both divisions received similar formal training and neither group shows socioeconomic gains, the differential reflects discrimination exposure rather than skill acquisition.&lt;/p&gt;
&lt;p&gt;Q: What is the quantitative scale of the effect for the most discriminatory camps?
A: For men assigned to camps with restrictions on both training and promotion, the TSLS coefficient on NAACP membership is 0.0367 (se = 0.0114) — more than 1.5 times the average estimate of 0.0219. Men at camps without restrictions show coefficients that are small and statistically insignificant.&lt;/p&gt;
&lt;p&gt;Q: How does county-level racial hostility moderate the effect?
A: The effects of military service on NAACP membership are larger — more positive — for men from counties with fewer Confederate monuments, lower lynching rates, and non-Confederate state status. This is interpreted as evidence that activism in response to discriminatory military experiences was more feasible in less racially hostile local environments, rather than as evidence that discrimination exposure was lower.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s aggregate policy implication regarding the scale of the draft&amp;rsquo;s effect on the civil rights movement?
A: The authors estimate that the WWI draft induced more than 10,000 Black men to join the NAACP. Veterans accounted for nearly 15% of all male NAACP members, against roughly 8% of Black male adults in the population, and were significantly more likely to appear in biographical dictionaries of prominent African Americans. The draft thus constituted a sizable and measurable contribution to the organizational vanguard of the early civil rights movement.&lt;/p&gt;
&lt;p&gt;Q: How does the paper contribute to the economics of discrimination beyond documenting discriminatory behavior by majority actors?
A: Most economics research on discrimination studies the conduct of white decision-makers (e.g., racial bias in hiring, lending, or bail). This paper examines how experiences of discrimination reshape the political behavior and aspirations of the minority group itself. The results show that institutional betrayal — systematic exclusion, degradation, and denial of training — generated deep discontent that translated into aggressive political mobilization, a dynamic the authors trace through subsequent episodes including the WWII Double V campaign and responses to police killings.&lt;/p&gt;
&lt;p&gt;Serial number percentile within draft board and race (SNP%): The instrument constructed by the authors. Each WWI registrant received a serial number from 1 to the size of his draft board; those numbers were drawn in random order to determine induction priority. SNP% measures where a registrant fell in that draw relative to others in his board and racial group, and serves as the source of quasi-random variation in veteran status.&lt;/p&gt;
&lt;p&gt;New Negro era: The period of invigorated Black political and cultural assertiveness following WWI, characterized by renewed racial pride, economic independence, and progressive politics. The movement spanned the Harlem Renaissance, the Universal Negro Improvement Association, the American Negro Press, and the Brotherhood of Sleeping Car Porters, and represented a rejection of the &amp;ldquo;conservatism, parochialism, and political accommodationism&amp;rdquo; of older Black leaders.&lt;/p&gt;
&lt;p&gt;Draft board racial gap: The authors&amp;rsquo; measure of draft board discrimination, defined as the difference in induction rates between Black and white registrants within a given draft board. The interquartile range spans roughly 0 to 20 percentage points, with a notable fraction of boards exhibiting gaps exceeding 30 percentage points.&lt;/p&gt;
&lt;p&gt;Camp discrimination: The denial of military training and officer promotion opportunities to Black soldiers, documented in War Department reports by military intelligence officers tasked with monitoring the treatment of Black soldiers. The paper classifies each camp as restricted or unrestricted on each dimension and uses this classification to estimate heterogeneous treatment effects.&lt;/p&gt;
&lt;p&gt;Institutional betrayal: The paper&amp;rsquo;s characterization of the U.S. government&amp;rsquo;s treatment of Black WWI soldiers — drafting them at higher rates than whites, denying them training and promotion, and assigning them to menial labor — as generating a profound sense of injustice that motivated postwar political activism rather than loyalty or accommodation.&lt;/p&gt;
&lt;p&gt;NAACP membership as civil rights activism proxy: The paper uses dues-paying membership in local NAACP branches as its primary quantitative measure of civil rights participation. Membership involved active financial cost (annual fees of $1 to $10 at a time when median Black family income was below $500), exposure to harassment and violence in the South, and participation in local protest and legal advocacy, distinguishing it from passive civic engagement.&lt;/p&gt;</description></item><item><title>Voluntary Minimum Wages: The Local Labor Market Effects of National Retailer Policies</title><link>https://macropaperwarehouse.com/papers/voluntary-minimum-wages-the-local-labor-market-effects-of-national-retailer-policies/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/voluntary-minimum-wages-the-local-labor-market-effects-of-national-retailer-policies/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper studies the labor market effects of voluntary minimum wages (VMWs) — company-wide, publicly announced wage floors set by large private employers — in the U.S. low-wage retail and service sector from 2014 to 2023. The central questions are: (1) How do VMWs affect wages and employment at the adopting large retailers? (2) Do VMWs generate wage spillovers to other employers in shared local labor markets?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Setting&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors use anonymized payroll data obtained from a large U.S. credit bureau, covering the wage distributions and employment of over 4,000 firms and approximately 18 million hourly workers (roughly 22–24% of the U.S. hourly workforce) from January 2013 to August 2023. The database is skewed toward retail and service sectors: over a third of covered workers are in retail, and over half in retail and services combined. Critically, the data also include worker flow information — records of individual workers moving between firms — enabling the authors to define shared labor markets via actual employment transitions rather than broad geographic or industry proxies.&lt;/p&gt;
&lt;p&gt;The sample of VMW events consists of &lt;strong&gt;20 voluntary minimum wage policies across 5 large retailers&lt;/strong&gt; (each with over 150,000 employees nationally), restricted to events with no other major wage policy within six months before or after the focal event. Voluntary minimum wage announcements were identified from an inventory maintained by the National Employment Law Project and independently verified through media sources, then matched to anonymized companies using employer size, industry, and observed shifts in the wage distribution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Identification Strategy&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors adapt the &lt;strong&gt;gap design&lt;/strong&gt; from the national minimum wage literature. For each company-by-commuting-zone (CZ) cell, the &amp;ldquo;gap&amp;rdquo; measures the percent increase in average hourly wages that would be required to bring all workers in the area up to the company&amp;rsquo;s new voluntary minimum. The gap is averaged over months −6 to −3 before the event (months −3 to −1 serve as a built-in placebo-in-time check). This variation in bite across CZs — arising because the same nominal VMW level implies different wage increases depending on local wage distributions — is combined with a stacked event study across 20 VMW events. Spillover effects are estimated by regressing log average wages at non-policy establishments on the large retailer&amp;rsquo;s CZ-level gap measure, progressively narrowing the definition of &amp;ldquo;labor market&amp;rdquo; from: (i) all non-policy establishments in the same CZ, to (ii) establishments in industries connected to the large retailer by worker flows (15 three-digit NAICS industries), to (iii) specific establishments with documented pre-event worker flows to or from the large retailer (&amp;ldquo;connected establishments&amp;rdquo;).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Own effects:&lt;/em&gt; For $15 VMW events, moving from a CZ gap of 0 to a gap of 1 is associated with an approximately 88 log point increase in average hourly wages in the six months after adoption. Given that the average establishment-level gap for $15 VMWs is 0.11, the implied average wage increase is approximately 10.45% (the authors&amp;rsquo; estimate is 9–10%, consistent with small wage increases even in zero-gap comparison areas). Employment of workers earning under $30 per hour rose by 4.62% after $15 VMW events, 2.01% after major events (affecting ≥30% of workforce), and 1.25% across all 20 events. These employment increases are &lt;strong&gt;entirely attributable to reduced separations&lt;/strong&gt; rather than new hiring: separation rates fell by 0.42, 0.57, and 1.09 percentage points after all, major, and $15 VMW events respectively — equivalent to reductions of 6.57%, 8.73%, and 15.33% relative to pre-period means. Separations specifically to other database companies fell by 0.07–0.19 percentage points (5.63–13.48% relative to base rates). If anything, new hiring fell modestly after VMW adoption. Total monthly base pay and gross compensation both rose after VMWs, indicating increased total take-home pay without compensatory reductions in hours or bonuses. The total employment elasticity with respect to wages ranges from approximately 0.35 to 0.45, while the quit elasticity is 2.20–2.38 (consistent with dynamic monopsony models in which the labor supply elasticity is twice the quit elasticity).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Spillover effects:&lt;/em&gt; Across all three definitions of the labor market, the paper estimates &lt;strong&gt;precise, economically negligible cross-employer wage spillovers&lt;/strong&gt; in the six months following VMW events. Cross-employer wage elasticities are statistically indistinguishable from zero across all specifications. Among the most narrowly defined sample — establishments with documented pre-event worker flows to or from the large retailer — the upper bound of the confidence interval rules out spillovers greater than 0.2% of wages. No wage spillovers are detected for new hires at non-policy establishments either. These null results are confirmed over a 12-month post-event horizon for the subsample of events with no other major policy nearby.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Mechanism:&lt;/em&gt; The reason for negligible spillovers is that VMWs reduced labor market churn rather than expanding the large retailer&amp;rsquo;s total employment. Hiring away from large retailers by connected non-policy firms falls after VMW adoption — consistent with fewer separations to recruit from — but &lt;strong&gt;overall hiring by non-policy firms does not decline&lt;/strong&gt;, as these firms substitute toward other hiring sources. This substitutability across new hire sources in a thick market is the proximate explanation for the absence of wage pressure on competitor firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Results pertain to large national retailers (&amp;gt;150,000 employees) operating in U.S. commuting zones during 2014–2023. The database covers only employers large enough to participate in credit bureau income verification; smaller employers (representing over 75% of U.S. hourly workers by the BLS comparison) are not observed, and the authors caution that spillover effects on smaller firms cannot be assessed. The authors also explicitly note that their null local spillover results do not rule out national-level strategic wage-setting dynamics — the rapid sequential adoption of VMWs across major retailers may reflect national-level competition rather than local market competition.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What exactly are &amp;ldquo;voluntary minimum wages&amp;rdquo; and how do they differ from statutory minimum wages?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Voluntary minimum wages (VMWs) are company-wide, publicly announced wage floors set unilaterally by private employers, typically well above the applicable statutory (federal, state, or local) minimum. Unlike statutory minimums, which bind all employers in a jurisdiction, VMWs apply only to the announcing company across all of its geographic operations in the U.S. The paper studies VMWs adopted by retailers with over 150,000 workers, which include wage floors at levels such as $9, $10, $12, and $15 per hour. $15 VMWs were adopted at a time when few states or localities had yet reached that threshold, meaning the policy bit into the company wage distribution far more deeply than prevailing statutory floors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How were VMW events identified and matched to anonymized firms in the payroll database?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;VMW events were identified from a database maintained by the National Employment Law Project and verified through an independent review of business news articles. These publicly reported announcements were then matched to the anonymized companies in the credit bureau payroll database using employer size, industry, and the timing of observed shifts in the firms&amp;rsquo; wage distributions. An additional three events were identified directly from data: months where the share of workers earning below a given wage level dropped by at least 15 percentage points (for non-$15 events) or 10 percentage points (for $15 events) while the share at exactly that wage bin jumped by at least 10–20 percentage points. The final sample of 20 events was restricted to those with no other major wage policy in the six months before or after.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How does the gap design work and why does it improve on the fraction-affected approach?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The gap for a given company, commuting zone, and time period is defined as the total wage increase needed to bring all sub-$30 workers up to the company minimum, divided by total wage costs — formally a labor-share-weighted average shortfall from the new minimum across wage bins. The gap leverages more cross-sectional variation in treatment intensity than the simple fraction of workers below the minimum: for a $15 VMW, an area where all workers earn $10 has a gap of 0.50 while an area where all earn $12 has a gap of 0.25. The gap is averaged over months −6 to −3 before the event. The period months −3 to −1 then serve as a placebo window: genuine VMW effects should appear only after the policy&amp;rsquo;s adoption month, not during the period immediately after the gap is measured. If instead the regression picks up mean reversion in noisy wage data, spurious effects would appear in months −3 to −1 rather than at event time 0.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What is the magnitude of the wage effect on the large retailers themselves?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;For $15 VMW events, the stacked event study estimates that moving from a gap of 0 to a gap of 1 is associated with an approximately 88 log point increase in average hourly wages beginning exactly in the month of policy adoption. Given the average establishment-level gap of 0.11 for $15 VMWs, this implies the average establishment raised wages by approximately 9–10% (the authors compute 10.45% from the average gap, consistent with a slight dampening because zero-gap CZs experienced marginally higher wages too). Wage increases are confirmed persistent at 12 months in robustness checks. For all 20 VMW events pooled, effects are somewhat smaller commensurate with the lower average bite.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: How did VMWs affect total employment and its components at the large retailers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;After $15 VMW events, log total employment of sub-$30 workers rose by 4.62%; after major VMW events (≥30% bite), 2.01%; after all 20 events, 1.25%. The increases are entirely driven by retention gains. Separation rates fell by 1.09 percentage points after $15 VMWs, 0.57 p.p. after major events, and 0.42 p.p. after all events — translating to reductions of 15.33%, 8.73%, and 6.57% relative to pre-period means. Separations to other database companies specifically fell by 0.07–0.19 percentage points (5.63–13.48% relative to the base mean). New hiring — measured as year-on-year log change in hires to control for seasonality — fell after VMW adoption, consistent with a reduced need to replace departing workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What do the labor supply elasticities implied by the VMW results look like?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The total employment elasticity with respect to wages ranges from approximately 0.35 to 0.45 across the three event groupings. Under standard dynamic monopsony models, the labor supply elasticity facing the firm equals twice the quit elasticity in steady state (Manning, 2003). The quit elasticity — derived by dividing the proportional reduction in separations by the log wage increase — ranges from 2.20 to 2.38, consistent with the earlier monopsony-based case study of Ford&amp;rsquo;s $5 workday (Raff and Summers, 1987) and implying substantial firm-level wage-setting power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Did VMWs increase total take-home pay or were wage gains offset by reductions in hours or bonuses?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper examines log average monthly base pay and log average gross compensation (which includes bonuses and overtime) as additional outcomes. Both measures rose after $15 VMW events, indicating that the wage floor increase translated into genuine improvements in total take-home pay without compensatory reductions in hours or other non-wage compensation. The monthly gross pay series is an average over calendar year-to-date months, so increases appear gradually rather than as a sharp jump at the adoption month; nevertheless the upward trend is evident and consistent.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: What are the estimated spillover effects on wages at non-policy employers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Across all three definitions of the labor market — all non-policy establishments in the same CZ, establishments in the 15 connected industries in the same CZ, and establishments with documented pre-event worker flows — the estimated cross-employer wage effects are precise zeros. The stacked event study in the post-period shows coefficients centered on zero with small confidence intervals. The difference-in-differences cross-employer wage elasticity (instrumenting the large retailer&amp;rsquo;s wage change with the gap) is also indistinguishable from zero. Among the most exposed connected establishments, the point estimate is slightly positive but economically negligible; the upper confidence interval bound rules out spillovers greater than 0.2%. Results are confirmed over a 12-month horizon for the clean-event subsample.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Could the null spillover result reflect mean reversion bias rather than a true zero?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors address this concern explicitly. For the policy-company gap design, they build in a placebo-in-time check by measuring the gap over months −6 to −3 and checking that no wage effects appear in months −3 to −1. For the non-policy spillover analysis, they also examine an alternative treatment variable — the gap between non-policy establishments&amp;rsquo; wages and the large retailer&amp;rsquo;s new VMW — and find evidence of mean reversion: wages begin rising in the pre-period in the direction of this gap measure. They correct for this by detrending post-period estimates using a linear extrapolation of the pre-period trend. After detrending, spillover effects remain indistinguishable from zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Why are spillover effects so limited if the large retailer is drawing fewer workers away from competitors?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s mechanism analysis shows that while the probability of a non-policy firm hiring a worker from the large retailer falls after a VMW event (consistent with fewer separations to recruit from the large retailer), the &lt;strong&gt;overall rate of hiring by non-policy firms does not decline&lt;/strong&gt;. Non-policy firms substitute toward other hiring sources — primarily other non-policy companies — rather than hiring fewer workers overall. This substitutability across recruiting sources in a thick labor market mutes the competitive pressure on competitor wages: since non-policy firms can replace the reduced flow from VMW companies with workers from other sources without changing total employment, they face no pressure to raise wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How do the results differ when focusing on CZs where the large retailer accounts for a larger employment share?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors test whether larger local market presence amplifies spillovers by splitting the sample at the median employment share of the large retailer in the CZ. They find no evidence of positive wage spillovers even in CZs where the large retailer&amp;rsquo;s employment share is above the median, confirming that neither local market size nor market concentration is a mechanism for spillover transmission in this setting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How do these VMW spillover results compare to prior evidence on employer wage-setting spillovers?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The main prior U.S. evidence (Staiger et al., 2010) studied a federally mandated wage increase at Veterans Affairs hospitals and found a cross-establishment wage elasticity of approximately 0.19 for registered nurses at neighboring hospitals. The authors note two key differences: first, the VA policy increased both wages and employment at treated facilities, whereas VMWs primarily reduced separations without increasing hiring, so the supply of workers to competitor firms was not squeezed. Second, the market for low-wage retail and service workers is likely thicker (more potential hires available) than the market for registered nurses, allowing competitors to substitute hiring sources without bidding up wages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What do the null local spillover results imply about national-level wage dynamics?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors explicitly caution against reading the null local spillover result as implying VMWs have no broader effect on the low-wage labor market. The rapid and successive adoption of VMWs across major retailers during 2021–2022 could reflect national-level strategic wage-setting competition — firms mimicking each other&amp;rsquo;s announcements in an arms-race dynamic during tight labor markets — rather than local competitive transmission. The paper does not test for national-level strategic interactions and calls for further research on this dimension.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Voluntary Minimum Wage (VMW):&lt;/strong&gt; A company-wide, publicly announced wage floor set unilaterally by a private employer, applying across all of the firm&amp;rsquo;s geographic operations in the U.S., typically well above applicable statutory minimums. Distinct from legally mandated minimum wages in that they bind only the announcing firm and arise from the firm&amp;rsquo;s own strategic or reputational motivations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Gap Measure:&lt;/strong&gt; Borrowed from the national minimum wage literature (Card, 1992; Draca et al., 2011), this is the percent increase in a firm&amp;rsquo;s average hourly wage that would be required to bring all workers in a given commuting zone up to the company&amp;rsquo;s new voluntary minimum. Formally the labor-share-weighted average shortfall from the VMW across sub-$30 wage bins. A gap of 0 means no workers fall below the new minimum; a gap of 1 means all workers would need to be raised to the minimum, doubling the average wage. Used as a continuous treatment variable capturing the local bite of the policy.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stacked Event Study:&lt;/strong&gt; An empirical design in which a separate 12-month panel (6 months pre- and post-event) is constructed for each of the 20 VMW events, these datasets are stacked, and the effect of the continuous gap treatment is estimated jointly across all events, with event-specific indicators interacting all regressors to allow each event to have its own intercept.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Placebo-in-Time Check:&lt;/strong&gt; A robustness test built into the gap design by computing the gap over months −6 to −3 and verifying that wage effects do not appear in months −3 to −1 (the period between gap measurement and VMW adoption). Genuine policy effects should materialize at the adoption month; spurious effects driven by mean reversion in noisy wage data would appear in months −3 to −1 because the gap would mechanically predict wage reversion toward the mean in the period immediately following its measurement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Connected Establishments / Poaching and Feeder Establishments:&lt;/strong&gt; Specific firm-by-CZ cells identified as sharing a labor market with the large retailer via actual worker flows. &amp;ldquo;Poaching establishments&amp;rdquo; hired at least one worker from the large retailer in the 12 months before the VMW event. &amp;ldquo;Feeder establishments&amp;rdquo; had at least one worker subsequently hired by the large retailer in the same pre-period. These are the most narrowly defined and most economically relevant labor market competitors for testing spillover effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quit Elasticity / Labor Supply Elasticity (Firm-Level):&lt;/strong&gt; The quit elasticity is the percent change in the separation rate divided by the percent change in wages induced by the VMW. Under standard dynamic monopsony models (Manning, 2003), in steady state the recruit elasticity equals the quit elasticity, and the firm-level labor supply elasticity equals twice the quit elasticity. The authors estimate quit elasticities of 2.20–2.38, implying labor supply elasticities of 4.40–4.76 to the firm — consistent with meaningful but not extreme monopsony power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cross-Employer Wage Elasticity:&lt;/strong&gt; The percent change in wages at a non-policy employer&amp;rsquo;s establishment associated with a 1% change in wages at the large retailer in the same commuting zone, instrumented using the large retailer&amp;rsquo;s gap interacted with the post-event indicator. Estimated to be a precise zero across all market definitions and event groupings in this paper.&lt;/p&gt;</description></item><item><title>Wage growth and labor market tightness</title><link>https://macropaperwarehouse.com/papers/wage-growth-and-labor-market-tightness/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/wage-growth-and-labor-market-tightness/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; Which measures of labor market tightness best predict nominal wage inflation, and do standard measures such as the unemployment rate and the vacancy-to-unemployment ratio capture the relevant slack? The paper also asks whether transitory productivity shocks affect wage growth, and whether the wage Phillips curve is nonlinear.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Motivation and Model.&lt;/strong&gt; Standard measures of labor market tightness have had mixed performance since the COVID-19 pandemic: unemployment quickly returned to pre-pandemic levels while wage growth remained persistently elevated, motivating a search for superior indicators. The paper builds on the theoretical framework of Bloesch, Lee, and Weber (2024), a tractable New Keynesian DSGE model in which firms set wages and workers search on the job. In this model, labor market tightness is well-summarized by either (a) the quits rate or (b) vacancies per effective searcher (V/ES), where effective searchers include both employed and unemployed job seekers. Unemployment enters the model&amp;rsquo;s wage Phillips curve but with a coefficient close to zero, because changes in the unemployment share do not substantially shift the composition of searchers in a way that alters firms&amp;rsquo; wage incentives. Transitory TFP shocks have theoretically ambiguous effects on nominal wage growth because the outcome depends on the central bank&amp;rsquo;s policy response.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methods.&lt;/strong&gt; The main analysis uses quarterly U.S. data from 1990:Q2 to 2024:Q2. Wage growth is measured as the 3-month log change in the Employment Cost Index (ECI) for wages and salaries of private industry workers. Quits and vacancies are drawn from JOLTS (2001:Q1 forward) and extended back to 1990:Q2 using the Davis-Faberman-Haltiwanger series and Barnichon&amp;rsquo;s composite Help Wanted Index, respectively. The authors run a &amp;ldquo;horse race&amp;rdquo; of OLS univariate regressions of wage growth on thirteen separately normalized tightness indicators. They then run bivariate regressions pairing the quits rate with each other indicator to test whether any alternative provides independent predictive power. Robustness is assessed using 12-month ECI changes. An industry-level panel with time and industry fixed effects covering 11 broad sectors from JOLTS for 2001:Q1–2024:Q2 tests whether the same ranking holds within industries. Forecasting exercises use 1-, 2-, and 4-quarter-ahead in-sample regressions plus rolling out-of-sample one-quarter-ahead predictions beginning in 2004:Q1. Nonlinearity is evaluated via threshold regressions at the 25th percentile (unemployment) or 75th percentile (other measures) and via quadratic specifications.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings with Quantitative Magnitudes.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Horse race (aggregate, contemporaneous):&lt;/em&gt; The quits rate explains 55 percent of variation in 3-month ECI wage growth (R² = 0.55), and V/ES explains 52 percent (R² = 0.52), the two highest among all indicators tested. A one standard deviation increase in either quits (0.39 percentage points) or V/ES (0.08) is associated with 0.20 percentage points higher 3-month wage growth. By contrast, the vacancy-to-unemployment ratio (V/U) explains only 41 percent of wage growth and the unemployment rate only 34 percent. Together, quits and V/ES explain nearly two-thirds of wage growth since 1994 and 78 percent since 2020:Q2.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Bivariate regressions:&lt;/em&gt; Conditional on the quits rate, the coefficient on every other tightness indicator drops to near zero, with the sole exception of V/ES, which retains a coefficient of 0.08 (significant) while the quits coefficient remains at 0.14. This result is consistent with the model&amp;rsquo;s prediction that quits and V/ES are close to sufficient statistics for labor market tightness.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;12-month ECI results:&lt;/em&gt; The ranking is preserved at longer horizons; quits and V/ES each explain approximately two-thirds of 12-month wage growth.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Productivity:&lt;/em&gt; Regressions of 3-month ECI wage growth on 3-month changes in labor productivity, TFP, and utilization-adjusted TFP all yield small, negative, and statistically indistinguishable from zero coefficients, consistent with the model&amp;rsquo;s prediction of an ambiguous effect of transitory productivity shocks on nominal wages.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Industry-level panel:&lt;/em&gt; Quits and V/ES remain the strongest predictors of within-industry wage growth after absorbing industry and time fixed effects. A one standard deviation increase in the industry quits rate (0.93 percentage points) is associated with 0.23 percentage points higher quarterly wage growth; a one standard deviation increase in industry V/ES (0.11) is associated with 0.13 percentage points higher wage growth.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;HPW Composite Index:&lt;/em&gt; The Heise-Pearce-Weber (HPW) Index, constructed as an OLS-weighted average of quits and V/ES, achieves a correlation of 0.9 with standardized 3-month ECI wage growth. In-sample forecasting R² for the HPW Index at 1, 2, and 4 quarters ahead is 0.62, 0.74, and 0.77, respectively — the highest of all indicators at each horizon.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Out-of-sample forecasting:&lt;/em&gt; Only the quits rate and the HPW Index consistently outperform a simple AR(1) benchmark throughout the out-of-sample period from 2004:Q1 to 2024:Q1. The forecasting performance of vacancy-based measures (V/U and V/ES) deteriorated steadily after 2015, consistent with evidence of structural shifts in vacancy measurement documented by Mongey and Horwich (2023).&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Nonlinearity:&lt;/em&gt; Threshold regressions and quadratic specifications provide little evidence of meaningful nonlinearity in the wage-tightness relationship for quits, V/ES, or the HPW Index over 1990–2024. The fit improvement from adding threshold terms is marginal, and slope coefficients are broadly stable across the full range of tightness, including the extreme tightness observed after COVID.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What theoretical mechanism links quits and V/ES to nominal wage growth, in contrast to unemployment?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the Bloesch-Lee-Weber (2024) model incorporated in the paper, firms use both wages and vacancies to attract and retain workers from unemployment and from other firms, conditional on the overall mass of effective searchers. Labor market tightness is defined as V/S (vacancies over total searchers), not V/U, because employed workers also search on the job. When tightness is high, workers are harder to recruit and more likely to be poached, pressuring firms to raise wages. Quits are the endogenous component of separations and rise mechanically with tightness, making them a near-equivalent sufficient statistic for V/ES. Unemployment enters the wage Phillips curve in principle because the composition of searchers (employed vs. unemployed) matters for firms&amp;rsquo; wage-setting incentives, but the coefficient on unemployment is calibrated and estimated to be approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors extend the quits and vacancies data back to 1990 to cover the full sample period?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;JOLTS data on quits and job openings begin in 2001:Q1. The authors extend the quits rate backward to 1990:Q2 using the Davis, Faberman, and Haltiwanger (2012) series, taking a simple average of the two in overlapping quarters (2001:Q1–2010:Q2). Vacancies are extended back to 1990:Q2 using the composite Help Wanted Index constructed by Barnichon (2010), with a similar overlapping average for 2000:Q4–2021:Q3. The effective-searcher measure (V/ES) is available only from 1994:Q1 because the CPS marginally attached worker series begins then.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How is the V/ES measure constructed, and why does it differ from the standard V/U ratio?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Effective searchers are constructed as ES = U_s + 0.48·U_l + 0.40·Z_want + 0.09·Z_do-not-want + 0.07·N, where U_s is short-term unemployed (less than 27 weeks), U_l is long-term unemployed (27+ weeks), Z_want is marginally attached workers not in the labor force, Z_do-not-want is non-participants not marginally attached, and N is employment. The weights reflect relative search intensities estimated by Abraham, Haltiwanger, and Rendell (2020) and translated to publicly available CPS data by Sahin (2020). Because employed workers constitute a far larger share of the population than the unemployed, including them — even at the low weight of 0.07 — substantially increases the total effective searcher count relative to V/U. This matters because the model predicts that firms&amp;rsquo; wage decisions depend on the full pool of potential recruits and retention risk, not just the unemployed.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What are the results of the bivariate &amp;ldquo;horse race&amp;rdquo; pairing quits with each other tightness measure?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In bivariate OLS regressions of 3-month ECI wage growth on the quits rate plus one other indicator, the coefficient on quits remains approximately 0.14–0.22 percentage points per standard deviation regardless of which other variable is included, while all competing indicators&amp;rsquo; coefficients fall to near zero. The sole partial exception is V/ES, which retains a coefficient of 0.08 (significant at 5%) alongside a quits coefficient of 0.14; the combined fit is 0.60. For all other measures — including V/U (coefficient drops to 0.04), unemployment (0.00), jobs-workers gap (0.02), Conference Board availability (−0.01), and NFIB difficulty hiring (0.01) — the incremental contribution beyond quits is negligible. This result is consistent with the model&amp;rsquo;s prediction that quits and V/ES are jointly near-sufficient statistics for wage growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Do the industry-level panel regressions replicate the aggregate ranking, and why is this an important test?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Yes. In panel regressions with industry and time fixed effects covering 11 JOLTS sectors from 2001:Q1 to 2024:Q2, the quits rate has the highest within-industry R² (0.019) and V/ES the second highest (0.010); all other indicators rank below. This within-industry test is important because it removes the possibility that the aggregate correlations are driven by unobserved macro variables that happen to co-move with quits and V/ES. The bivariate industry panel confirms that, conditional on quits, only V/ES adds substantially to the within-industry fit; all other indicators add negligible explanatory power.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: Why might industry-level TFP shocks have a modest positive effect on wages even though aggregate TFP shocks do not?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;At the industry level, the central bank does not respond to industry-specific TFP shocks. When a particular industry&amp;rsquo;s productivity rises and firms lower prices, consumer demand for that industry&amp;rsquo;s output rises. If demand rises by enough, firms must hire more workers to meet demand despite higher productivity per worker, leading them to post more vacancies and raise wages. At the aggregate level, the central bank does respond to the disinflation associated with positive TFP shocks (following a Taylor rule), which can raise overall consumption enough to require more aggregate hiring and generate a positive TFP-wage correlation — but the direction depends on monetary policy responsiveness, making the aggregate relationship ambiguous and empirically insignificant. The industry regressions find that a 1 percent increase in annual labor productivity is associated with 0.15 percent higher industry annual wage growth, significant at the 10 percent level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How is the HPW Index constructed, and what is its in-sample fit with wage growth?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The HPW Index is constructed as a weighted average of the standardized quits rate and V/ES, where the weights are the OLS coefficients from a bivariate regression of 3-month ECI wage growth on both variables simultaneously (estimated over 1994:Q1–2024:Q2). The index is then normalized to have mean zero and standard deviation of one. The HPW Index achieves a correlation of 0.9 with standardized 3-month ECI wage growth. At the peak of post-pandemic inflation, the index predicted wage growth of approximately 2.6 standard deviations above the mean, corresponding to a quarterly wage growth rate of about 1.3 percent, close to realized values.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: How do the out-of-sample forecasting results compare across indicators, and what accounts for the deterioration of vacancy-based measures?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Rolling out-of-sample one-quarter-ahead predictions from 2004:Q1 to 2024:Q1 show that only the quits rate and the HPW Index consistently outperform an AR(1) benchmark across the full period. V/U performed relatively well until 2015 but then deteriorated steadily, and V/ES similarly weakened after 2015, consistent with the finding by Mongey and Horwich (2023) that the relationship between job vacancies and other labor market indicators has persistently shifted since approximately 2010. The forecasting performance of the unemployment rate and several other standard measures deteriorated sharply in the post-COVID period when wage inflation surged, but quits and HPW maintained their performance throughout.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Is there evidence of nonlinearity in the wage Phillips curve, particularly in the extreme tightness of the post-COVID period?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The paper finds little evidence of meaningful nonlinearity. Threshold regressions at the 25th percentile for unemployment and 75th percentile for other measures yield marginal fit improvements: the R² for unemployment rises from 0.34 to 0.36 (a level shift rather than a slope change), and fit improvements for HPW, quits, and V/ES are essentially zero. Quadratic specifications confirm this: the coefficient on the squared term is insignificant in all specifications. The authors conclude that the relationship between labor market tightness (as measured by quits or the HPW Index) and nominal wage growth is approximately linear, including during the extreme tightness of the COVID aftermath.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: Why does the paper argue that the slope of the wage Phillips curve can be estimated more cleanly than the price Phillips curve?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In the model&amp;rsquo;s price Phillips curve, monetary policy endogenously responds to TFP shocks, creating an omitted variable problem that biases the estimated slope toward zero. In the wage Phillips curve, TFP and monetary policy shocks affect wages only through their general equilibrium effects on labor market tightness — they do not appear directly on the right-hand side. Consequently, the tightness variable is a sufficient statistic for wage inflation in the model, and the slope coefficient can be estimated consistently from reduced-form regressions without the identification problems that plague the price Phillips curve.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Vacancies per Effective Searcher (V/ES).&lt;/strong&gt; The paper&amp;rsquo;s preferred tightness measure, defined as job openings divided by effective searchers, where effective searchers are ES = U_s + 0.48·U_l + 0.40·Z_want + 0.09·Z_do-not-want + 0.07·N. This differs from the standard V/U ratio by including employed workers (at a weight of 0.07 reflecting their search intensity) and distinguishing between short-term and long-term unemployed and non-participants. It is the theoretically correct tightness measure in the on-the-job-search model, where the full pool of potential recruits — not only the unemployed — determines wage pressure.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;On-the-Job Search.&lt;/strong&gt; The mechanism by which employed workers actively search for and receive job offers from other firms. In the Bloesch-Lee-Weber (2024) model underpinning the paper, on-the-job search implies that firms must set wages not only to attract unemployed workers but also to retain employed workers who may be poached. This changes the relevant measure of tightness from V/U to V/S and makes quits — which are the endogenous separations triggered when workers accept outside offers — a near-sufficient statistic for wage growth.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Quits Rate.&lt;/strong&gt; The ratio of voluntary separations (quits) to total employment in private sector, sourced from JOLTS (extended to 1990 using Davis et al. 2012). In the model, quits are the endogenous component of the separation rate and are tightly linked to vacancies per effective searcher because workers quit more frequently when labor market tightness is high and outside offers are plentiful. The paper establishes quits as the single best individual predictor of 3-month ECI wage growth (R² = 0.55) and the best out-of-sample forecaster along with HPW.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;HPW Tightness Index (Heise-Pearce-Weber Index).&lt;/strong&gt; A composite indicator of labor market tightness constructed as the OLS-coefficient-weighted average of the quits rate and V/ES, estimated by regressing 3-month ECI wage growth on both variables simultaneously. The index is normalized to mean zero and standard deviation of one. The HPW Index achieves the highest in-sample forecasting fit at 1, 2, and 4 quarters ahead (R² of 0.62, 0.74, and 0.77, respectively) and consistently outperforms the AR(1) benchmark out of sample, unlike most other indicators.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Wage Phillips Curve.&lt;/strong&gt; The reduced-form relationship between nominal wage inflation and labor market tightness, derived in the paper from first-order conditions of the firm&amp;rsquo;s optimization problem. In the model&amp;rsquo;s representation (equation 3), wage inflation is a function of deviations of V/ES and unemployment from steady state plus expected future wage inflation. The paper argues this relationship can be estimated more cleanly than the price Phillips curve because TFP and monetary policy shocks affect wages only through the tightness term, avoiding the omitted-variable bias that flattens price Phillips curve estimates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Sufficient Statistic for Wage Inflation.&lt;/strong&gt; As used in the paper&amp;rsquo;s model, a variable (or pair of variables) such that once it is included in the wage Phillips curve, no other labor market indicator provides additional explanatory power for wage growth. The model predicts, and the empirical horse race confirms, that quits or V/ES are individually near-sufficient statistics: conditional on the quits rate, the coefficients on all other tightness measures (including unemployment, V/U, jobs-workers gap, and survey measures) fall to approximately zero.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Transitory TFP Shocks and Wage Growth.&lt;/strong&gt; The paper defines these as short-lived, positive shocks to total factor or labor productivity, as measured by 3-month changes in Fernald et al. (2012) series. The theoretical prediction is that their effect on nominal wage growth is ambiguous: if the central bank&amp;rsquo;s policy response lowers real rates enough, aggregate demand rises sufficiently to require more hiring, generating positive wage effects; if the policy response is limited, lower marginal costs reduce vacancies and wages. In the data, the sign is negative across all three productivity measures but statistically indistinguishable from zero in all specifications.&lt;/p&gt;</description></item><item><title>What Do Policies Value?</title><link>https://macropaperwarehouse.com/papers/what-do-policies-value/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/what-do-policies-value/</guid><description>&lt;p&gt;This paper asks a fundamental question about policy design: when a program prioritizes one group over another, is that because the group benefits more from the intervention, or because the policy assigns them higher intrinsic welfare weight? Björkegren, Blumenstock, and Knight develop a two-stage method to decompose observed allocation decisions into their underlying components: (i) welfare weights assigned to different types of people, (ii) heterogeneous treatment effects of the intervention, and (iii) relative weights on different outcomes. The key insight is that the same allocation rule can be consistent with very different value systems depending on how much each group actually benefits.&lt;/p&gt;
&lt;p&gt;The method works as follows. In a first stage, the analyst estimates heterogeneous treatment effects — how much each individual benefits on each outcome dimension — using OLS or machine learning methods (e.g., causal forests). In a second stage, the analyst reconciles the observed ranking of beneficiaries with an implicit welfare function using an exploded logit likelihood, recovering welfare weights (who is valued), impact weights (how different outcomes are valued), and a base value for treatment independent of measured outcomes. Identification requires an exclusion restriction: the covariates used to estimate treatment effect heterogeneity must include variables excluded from the welfare weight specification, allowing the analyst to compare households with similar welfare weights but differential treatment effects. Variants of the method that impose known welfare weights or known impact weights can be used without the exclusion restriction.&lt;/p&gt;
&lt;p&gt;The paper demonstrates the method using PROGRESA, Mexico&amp;rsquo;s large conditional cash transfer program launched in 1997. PROGRESA ranked households by a proxy means test poverty score and transferred approximately 197 pesos per month (roughly $20 USD) to eligible poor households, conditional on school attendance and doctor visits. The analysis uses endline survey data on 7,767 households and focuses on three outcomes emphasized in program documents: log per-capita consumption, child sick days (ages 0-5), and school days missed (ages 6-16).&lt;/p&gt;
&lt;p&gt;The program&amp;rsquo;s average treatment effects were: a 0.149 log point increase in monthly consumption (SE=0.015), a 0.165 reduction in sick days per child (SE=0.051), and a near-zero effect on school days missed (-0.0053, SE=0.028). These effects were heterogeneous: indigenous households, for instance, benefited substantially more from the program.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central empirical finding inverts the naive interpretation of PROGRESA&amp;rsquo;s targeting. Indigenous households were ranked 60.6 log points higher in the program&amp;rsquo;s priority order. A simple regression suggests the program favored them. But after accounting for the fact that indigenous households benefit substantially more from treatment, the method finds that the program&amp;rsquo;s implied welfare weight on indigenous households is, if anything, lower by 17.4% relative to non-indigenous households — not higher. The program&amp;rsquo;s prioritization of indigenous households is thus explained by efficiency, not by preferential welfare weighting.&lt;/p&gt;
&lt;p&gt;Because PROGRESA cash transfers relax household budget constraints and outcomes like consumption reflect household choices, the impact weights capture the difference between how the policy values outcomes and how households value them. The estimates strongly reject non-paternalism: the policy implicitly values consumption and potentially health differently from household decision-makers. Of the total welfare impact, approximately 55% is attributed to the base value of the transfer itself (independent of measured outcomes), approximately 45% to consumption impacts, and less than 1% to health and schooling impacts combined. The implied value of providing the transfer independent of outcomes corresponds to 0.16 log points of consumption, or about 23.1 pesos per person per month — slightly below the average transfer of 33.9 pesos per person per month.&lt;/p&gt;
&lt;p&gt;The paper also runs counterfactual exercises showing how alternative preference structures would have changed the allocation. A policy maximizing only educational impacts would have prioritized richer, smaller households; one maximizing only consumption impacts would have further prioritized indigenous households. These counterfactuals are mapped onto a Pareto frontier across the three outcomes. The estimated welfare weights from the implemented policy align closely with preferences elicited in a 2023 survey of 429 Mexican residents, though residents placed higher value on child health relative to what the policy implied.&lt;/p&gt;
&lt;p&gt;Q: What is the core identification challenge the paper addresses?
A: When a policy prioritizes a group, it could be because the group benefits more (efficiency) or because the policy assigns them intrinsically higher value (preference). These two explanations are observationally equivalent from the allocation alone. The paper separates them by first estimating heterogeneous treatment effects and then inverting the allocation to recover residual welfare weights.&lt;/p&gt;
&lt;p&gt;Q: What is the exclusion restriction required for full identification?
A: The covariates used to estimate treatment effect heterogeneity (x-tilde) must include at least some variables excluded from the welfare weight specification (x). This allows the analyst to compare households with similar welfare weights but different predicted treatment effects, pinning down how much of the ranking reflects efficiency versus preference. Without this restriction, one can still recover conditional preferences by imposing known values for either welfare weights or impact weights.&lt;/p&gt;
&lt;p&gt;Q: How does the exploded logit likelihood work in this setting?
A: The analyst observes a single full ranking of all households, rather than partial orderings from multiple decision-makers. The welfare impact of treating household i is modeled as a linear function of predicted treatment effects scaled by welfare and impact weights, plus an extreme-value-distributed shock. The likelihood of observing household i ranked above household i-prime is the ratio of their exponentiated welfare scores, summed over all households ranked below i. Maximum likelihood recovers the welfare weights, impact weights, and base value simultaneously.&lt;/p&gt;
&lt;p&gt;Q: What were PROGRESA&amp;rsquo;s average treatment effects on the three focal outcomes?
A: Average treatment increased log monthly consumption by 0.149 (SE=0.015), reduced child sick days by 0.165 (SE=0.051), and had a near-zero effect on school days missed (-0.0053, SE=0.028). The consumption and health effects are statistically significant; the schooling effect is not distinguishable from zero.&lt;/p&gt;
&lt;p&gt;Q: What does the analysis find about the welfare weight assigned to indigenous households?
A: In the raw ranking regression, indigenous households are ranked 60.6 log points higher, suggesting the program favored them. After accounting for the fact that indigenous households benefit substantially more from treatment, the method finds the implied welfare weight on indigenous households is lower, not higher — specifically, about 17.4% lower than non-indigenous households. The program&amp;rsquo;s higher ranking of indigenous households is explained entirely by their larger treatment effects, not by preferential weighting.&lt;/p&gt;
&lt;p&gt;Q: How are the impact weights on consumption, health, and schooling interpreted given that outcomes reflect household choices?
A: Because PROGRESA relaxes household budget constraints and outcomes like consumption result from household optimization, the estimated impact weights capture the difference between how the policy values outcomes relative to how households value them (internalities), rather than the absolute policy valuation. A nonzero weight implies the policy disagrees with household preferences — paternalism. The positive coefficient on log consumption implies the policy values this outcome more than households do.&lt;/p&gt;
&lt;p&gt;Q: How much of PROGRESA&amp;rsquo;s welfare impact comes from the base transfer value versus measured outcomes?
A: The base value of the transfer (independent of measured impacts on consumption, health, and schooling) accounts for approximately 55% of total implied welfare impact. The impact on consumption accounts for approximately 45%. Impacts on health and schooling together account for less than 1%. The implied value of the base transfer corresponds to 0.16 log points of consumption per capita, or about 23.1 pesos per person per month — somewhat below the average transfer amount of 33.9 pesos per person per month.&lt;/p&gt;
&lt;p&gt;Q: Does the analysis reject egalitarian welfare weights and non-paternalism?
A: Yes, using Wald tests with bootstrapped covariance matrices. The hypothesis of egalitarian weights (all gamma equal to one) is rejected. Non-paternalism (all beta equal to zero) is strongly rejected. The joint hypothesis of egalitarianism and non-paternalism is also rejected across all specifications tested.&lt;/p&gt;
&lt;p&gt;Q: How do the estimated welfare weights compare to stated preferences of Mexican residents?
A: The 2023 survey of 429 Mexican residents elicited preferences using multiple price lists over how to prioritize different household types. The welfare weights implied by the implemented policy are broadly similar to resident preferences, but the policy places relatively higher welfare weight on indigenous households than the median survey respondent does. Survey respondents value child health impacts more than household decision-makers and more than the implemented policy does, consistent with support for paternalism.&lt;/p&gt;
&lt;p&gt;Q: What do counterfactual allocations reveal about the relationship between policy goals and targeting priorities?
A: A policy maximizing only consumption impacts would further prioritize indigenous households with lower income. A policy maximizing only educational impacts would instead prioritize richer, smaller households. A policy maximizing only health impacts would largely preserve indigenous household prioritization while placing less emphasis on lower-education households. These three extreme policies map to the corners of a Pareto frontier, and the implemented PROGRESA policy lies close to the allocation consistent with surveyed resident preferences.&lt;/p&gt;
&lt;p&gt;Q: What changed when Mexico reformed PROGRESA&amp;rsquo;s poverty score in 2003?
A: The 2003 reform increased the priority of older and smaller households. Applying the method to the new poverty score reveals that it implicitly switched to assigning a positive welfare weight to indigenous households (compared to the negative implied weight under the original score), and placed less welfare weight on lower-income and younger households relative to the original design.&lt;/p&gt;
&lt;p&gt;Q: What are the main limitations and scope conditions of the method?
A: Full identification requires an exclusion restriction (some treatment effect heterogeneity predictors excluded from welfare weights) and sufficient variation in treatment effects across household types. If treatment effects are homogeneous, welfare weights and impact weights cannot be separately identified. If correlated unobservables drive the ranking but are not modeled, the method recovers preferences consistent with included variables only, analogous to omitted variable bias in OLS. The method also requires a way to estimate treatment effect heterogeneity, which is most credible with a randomized pilot, though non-experimental methods are in principle applicable.&lt;/p&gt;
&lt;p&gt;Q: How does this paper relate to the inverse optimum public finance literature?
A: The inverse optimum literature (Bourguignon and Spadaro 2012; Saez and Stantcheva 2016; Hendren 2020) recovers the redistribution preferences consistent with income tax schedules, conditioning on a single covariate (pre-tax income) affecting a single outcome (net-of-tax consumption). This paper generalizes that framework to arbitrary allocation policies conditioning on a vector of covariates and affecting a vector of outcomes, and extends it to settings beyond income taxation where heterogeneous treatment effects can be estimated.&lt;/p&gt;
&lt;p&gt;Q: Can the method be applied when only a binary allocation is observed rather than a full ranking?
A: Yes. A binary allocation corresponds to a ranking with only two levels, and the same exploded logit procedure applies, though with reduced statistical power. The paper provides an empirical illustration of this setting in Section 5.2.1.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Welfare weights (w(x_i)):&lt;/strong&gt; The policy&amp;rsquo;s differential valuation of one household&amp;rsquo;s utility relative to another, expressed as a multiplicative function of household characteristics. Distinct from how much a household benefits — two households may be ranked identically despite different benefits if their welfare weights differ proportionally.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Impact weights (beta_j):&lt;/strong&gt; The policy&amp;rsquo;s relative valuation of different outcome components (consumption, health, schooling). For outcomes that are household choices, impact weights capture the difference between how the policy values the outcome and how the household values it — an internality or paternalistic preference.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Base value (alpha):&lt;/strong&gt; The value a policy assigns to providing a treatment independent of its measured impact on any specific outcome. Captures either a direct utility benefit of treatment or the value of relaxing household budget constraints when outcomes are choices.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exclusion restriction:&lt;/strong&gt; The requirement that the set of covariates used to estimate treatment effect heterogeneity includes at least some variables excluded from the welfare weight specification. Enables separate identification of efficiency-based and preference-based components of a ranking by comparing households similar in welfare weight but different in predicted treatment effects.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Exploded logit likelihood:&lt;/strong&gt; The econometric procedure used in the second stage, adapted for a single complete ranking of all alternatives rather than partial orderings. Treats the observed ranking of household i as a choice from the set of all households ranked below it, with likelihood given by the softmax of welfare scores.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Value audit:&lt;/strong&gt; A retrospective application of the method that reads the implicit values encoded in an implemented policy&amp;rsquo;s allocation decisions, enabling comparison against stated policy objectives, constituent preferences, or normative benchmarks.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Paternalism (in this paper&amp;rsquo;s sense):&lt;/strong&gt; A policy is paternalistic if it assigns nonzero impact weight (beta_j ≠ 0) to outcomes that are household choices — meaning the policy values those outcomes differently from the households making the choices. The envelope theorem implies a non-paternalistic policy would place zero weight on choice outcomes beyond the general constraint relaxation.&lt;/p&gt;</description></item><item><title>What Jobs Come to Mind? Stereotypes About Fields of Study</title><link>https://macropaperwarehouse.com/papers/what-jobs-come-to-mind-stereotypes-about-fields-of-study/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/what-jobs-come-to-mind-stereotypes-about-fields-of-study/</guid><description>&lt;p&gt;Conlon and Patel test whether students stereotype the link between college majors and occupations — that is, whether they exaggerate the likelihood that majors lead to their &amp;ldquo;representative&amp;rdquo; careers (those most overrepresented among a major&amp;rsquo;s graduates relative to other majors, as measured by a likelihood ratio in US census data). The representative career for each major is intuitive: doctors for biology/chemistry, lawyers for political science, counselors for psychology, journalists for communications, artists for art, and so forth.&lt;/p&gt;
&lt;p&gt;The authors draw on three bodies of evidence. First, surveys of first-year undecided undergraduates in Ohio State University&amp;rsquo;s Exploration program (primarily Fall 2020 and Fall 2021 cohorts, ~80% response rate), asking students their beliefs about the share of US graduates in various careers conditional on major, as well as their beliefs about their own likely career. Beliefs are benchmarked against true career shares computed from the 2017–2019 American Community Survey restricted to college graduates aged 30–50. Second, 40+ years (1975–2018) of the CIRP Freshman Survey from UCLA, covering more than nine million nationally representative US college freshmen, which records intended major and intended career. Third, a field experiment embedded in the 2021 OSU survey with an RD design, in which treated students were shown the true share of their top major&amp;rsquo;s representative career before reporting beliefs, intentions, and — via administrative records — actual course enrollments and major declarations up to three years later.&lt;/p&gt;
&lt;p&gt;The main finding is large, systematic overestimation of representative careers. In the OSU survey, students believe 53% of art majors work as artists (true: 17%), 47% of journalism majors work as journalists (true: 4%), 38% of political science majors work as lawyers (true: 16%), and 43% of psychology majors work as counselors (true: 21%). OLS regressions of beliefs on true career frequency and a representative-career indicator yield a stereotyping coefficient θ of 0.32 p.p. (p &amp;lt; 0.01) without career fixed effects and 0.28 p.p. (p &amp;lt; 0.01) with them, meaning students believe representative careers are roughly 28–32 percentage points more common than equally prevalent non-representative careers. These patterns are similar across gender, ethnicity, and first-generation status, replicate in an MTurk sample (θ = 0.30, p &amp;lt; 0.01) and a nationally representative US adult sample (θ = 0.33, p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;In the CIRP data, 63% of biology freshmen expect to become doctors (true: 23%), 62% of psychology freshmen expect to be counselors (true: 21%), 65% of art freshmen expect to be artists (true: 17%), and 42% of communications/journalism freshmen expect to be writers or journalists (true: 4%). The average gap between expected and actual representative-career attainment is 36 p.p., and this gap has been roughly stable since at least the 1970s.&lt;/p&gt;
&lt;p&gt;An implicit association test (IAT) administered to 434 OSU students shows that implicit associations between representative major–career pairs are 0.30–0.36 standard deviations stronger than for non-representative pairs (p &amp;lt; 0.01), and remain 0.24–0.28 SDs stronger (p &amp;lt; 0.01) after controlling for true career frequency. A one-SD increase in individual IAT scores predicts 2.8–4.1 p.p. greater stereotyped beliefs (p &amp;lt; 0.01). Knowing someone with a non-representative major–career combination predicts beliefs 16 p.p. lower for the representative career (p &amp;lt; 0.01) — more than half the stereotyping effect — and also predicts lower IAT scores, suggesting associations arise from personal experience.&lt;/p&gt;
&lt;p&gt;An equilibrium model shows that stereotyping causes students to infer that representative careers have unusually favorable unobservable attributes, and that this inflates enrollment in the representative major among marginal students who are poorly suited to it. Correlational evidence from the NSCG, SIPP, and SHED confirms that majors subject to greater stereotyping are associated with more job dissatisfaction (+6.0% per SD, p &amp;lt; 0.01), greater job-skill mismatch (+3.1%, p &amp;lt; 0.05), more major-career mismatch (+5.4%, p &amp;lt; 0.05), and more regret about field of study (+4.8%, p &amp;lt; 0.05).&lt;/p&gt;
&lt;p&gt;The field experiment shows that correcting beliefs reduces stereotyping and shifts major choices. A 10 p.p. reduction in beliefs about the top major&amp;rsquo;s representative career lowers intentions toward that major by 3.5 p.p. (p &amp;lt; 0.01), reduces enrollment in that major&amp;rsquo;s courses by 0.22 credits in the next semester (p &amp;lt; 0.05), and reduces the probability of declaring that major within one year by 6.1 p.p. (p = 0.23). The same information boosts intentions toward students&amp;rsquo; second-ranked major by 2.1 p.p. (p = 0.17), increases second-major course enrollment by 0.20 credits (p &amp;lt; 0.10), and raises the probability of declaring the second major within a year by 9.9 p.p. (p &amp;lt; 0.01). Treated students also spend on average 0.21 more semesters undecided before declaring a major (p &amp;lt; 0.05). Effects are concentrated in the first year and partially fade over the two-to-three-year follow-up window.&lt;/p&gt;
&lt;p&gt;Q: How do the authors define a major&amp;rsquo;s &amp;ldquo;representative career&amp;rdquo;?
A: The representative career of major M is the career c that maximizes the likelihood ratio R(c, M) = p_{c|M} / p_{c|not-M}, where p_{c|M} is the true share of major-M graduates working in career c and p_{c|not-M} is the share of graduates from all other majors working in c. This ratio captures how much more common a career is among one major&amp;rsquo;s graduates relative to all other graduates. For example, the representative career of communications/journalism is &amp;ldquo;writers and journalists,&amp;rdquo; whose graduates are between 155% and 1,751% more likely to hold their major&amp;rsquo;s representative career than graduates of other majors, even though the absolute frequency of such careers is often modest (ranging from 2% to 60% across fields).&lt;/p&gt;
&lt;p&gt;Q: What is the core model of stereotyped belief formation?
A: The model draws from Bordalo et al. (2016). Let p_{c|M} be the true career share and π_{c|M} the student&amp;rsquo;s belief. The model specifies π_{c|M} = (1 − θ) p_{c|M} + θ · 1[c = c*(M)], where c*(M) is the representative career and θ ∈ [0,1] measures the extent of stereotyping. When θ = 0 the student holds rational beliefs; when θ = 1 beliefs assign all probability mass to the representative career. This formulation implies that students overweight representative careers because those careers come to mind more easily, grounded in a representativeness heuristic based on likelihood ratios.&lt;/p&gt;
&lt;p&gt;Q: What does the regression test for stereotyping find in the OSU survey?
A: The authors regress individual beliefs π_{c|M} on the true frequency p_{c|M} and an indicator for c being the representative career of M, clustering standard errors at the individual and career-by-major level. The estimated θ is 0.32 (p &amp;lt; 0.01) without career fixed effects (Column 1 of Table 1) and 0.28 (p &amp;lt; 0.01) with career fixed effects (Column 2). For self-beliefs about students&amp;rsquo; top-ranked major, the estimates are 0.36–0.43 p.p. (p &amp;lt; 0.01 both with and without career fixed effects). These estimates imply that students regard a major&amp;rsquo;s representative career as 28–43 percentage points more common than an equally prevalent non-representative career for the same major.&lt;/p&gt;
&lt;p&gt;Q: Do the OSU results replicate in other samples?
A: Yes. An MTurk convenience sample of 430 current college students yields a stereotyping coefficient of 0.30 (p &amp;lt; 0.01). A nationally representative sample of US adults yields a coefficient of 0.33 (p &amp;lt; 0.01); this pattern holds separately for college-educated and non-college-educated respondents and for both younger respondents (aged 18–29) and older respondents (aged 30+). The authors also ran a pre-registered 2021 replication survey in a new OSU Exploration cohort and found similar results.&lt;/p&gt;
&lt;p&gt;Q: What does the CIRP Freshman Survey data show about the persistence and scale of stereotyping?
A: Pooling more than nine million US college freshmen surveyed from 1975 to 2018, the CIRP data show that students systematically intend to enter their major&amp;rsquo;s representative career far more often than graduates actually do. Among students who have decided on a major, 63% intend to have their major&amp;rsquo;s representative career while only 27% of college graduates actually attain it — a gap of 36 p.p. (p &amp;lt; 0.01). The specific examples include: 63% of biology freshmen intend to become doctors (true: 23%), 62% of psychology freshmen expect to be counselors (true: 21%), 65% of art freshmen expect to be artists (true: 17%), and 42% of communications/journalism freshmen expect to be writers or journalists (true: 4%). The gap has been stable over the full 40+ year window, with no sign of convergence, and amounts to 40,000–200,000 students per year expecting careers in representative fields that they will not attain.&lt;/p&gt;
&lt;p&gt;Q: Can alternative mechanisms such as overconfidence or motivated reasoning explain the results?
A: The authors argue no, for two reasons. First, students overestimate the prevalence of representative careers not only for majors they plan to pursue (where overconfidence or motivated reasoning might apply) but also for majors they do not plan to pursue — the pattern holds for the gray (population belief) bars across all ten majors in Figure 1. Second, a Shapley-Sharrocks decomposition reported in Table A.V shows that the stereotyping mechanism accounts for a larger share of variance in beliefs than any other mechanism tested. A pre-registered survey also rules out unawareness of non-representative occupations as a driver: students are aware of the overwhelming majority of the 100 most common non-representative occupations, and such unawareness as exists is uncorrelated with stereotyped beliefs.&lt;/p&gt;
&lt;p&gt;Q: What does the IAT reveal about the mechanism behind stereotyping?
A: The IAT was run on 434 OSU Exploration students in Fall 2021, measuring implicit associations between five major–career pairs (Humanities-Writers and Journalists, Sciences-Healthcare, STEM-Business, Social Science-Law, Social Science-Counseling/Education). Participants sorted stimuli faster in &amp;ldquo;matched&amp;rdquo; blocks (where the representative career shares a response key with its major) than in &amp;ldquo;unmatched&amp;rdquo; blocks, yielding DID-IAT effects of 0.30–0.36 SDs (p &amp;lt; 0.01) for all five pairs. After controlling for true career frequency with career and major fixed effects, the effect shrinks only slightly to 0.24–0.28 SDs (p &amp;lt; 0.01), confirming that associations are driven by representativeness beyond base rates. At the individual level, a one-SD increase in DID-IAT scores predicts 4.1 p.p. greater stereotyped beliefs (p &amp;lt; 0.01) without career-by-major fixed effects and 2.8 p.p. (p &amp;lt; 0.01) with them.&lt;/p&gt;
&lt;p&gt;Q: What does the role-model heterogeneity analysis show?
A: Students were asked which major–career combinations they knew personally. Controlling for career-by-major fixed effects, knowing someone with a non-representative major–career combination (i.e., a non-default path) predicts beliefs about the representative career that are 16 p.p. lower (p &amp;lt; 0.01). This is more than half the size of the baseline stereotyping effect (28–32 p.p.). Knowing such a person also predicts lower IAT scores (p &amp;lt; 0.01), implying that personal exposure can reduce both implicit associations and explicit stereotyped beliefs.&lt;/p&gt;
&lt;p&gt;Q: What does the equilibrium model predict about misallocation?
A: The model embeds stereotyped beliefs in a two-stage choice framework: students choose a major first, then choose a career after graduation. It shows two main results (Propositions 1 and 2 in Online Appendix A.1). First, students who perceive the representative career as more common than it is will infer — through a rational expectations mechanism — that the unobservable amenities of that career are particularly favorable, so they will be surprised upon graduation. Second, stereotyping raises misallocation because it draws in marginal students whose career preferences make them poorly matched to the major&amp;rsquo;s representative career, while the inframarginal students who would have chosen the major anyway are better matched. The misallocation effect increases in the extent of stereotyping.&lt;/p&gt;
&lt;p&gt;Q: What correlational evidence links stereotyping to post-graduation mismatch outcomes?
A: Using major-level stereotyping estimates from the OSU data merged with three nationally representative surveys (NSCG, SIPP, SHED), the authors find: a one-SD increase in major-level stereotyping is associated with 6.0% more job dissatisfaction (p &amp;lt; 0.01, NSCG), 3.1% more reports that the job does not fit the worker&amp;rsquo;s skills and experience (p &amp;lt; 0.05, NSCG), 5.4% more reports that the job is unrelated to the field of study (p &amp;lt; 0.05, SIPP), and 4.8% more regret about field of study choice (p &amp;lt; 0.05, SHED). The authors note these are correlational and cannot rule out confounders such as underlying complexity of the career mapping.&lt;/p&gt;
&lt;p&gt;Q: How does the field experiment work and what is its identifying strategy?
A: The experiment was embedded in the second 2021 OSU survey, with students in the treatment group shown the true share of their top major&amp;rsquo;s representative career before reporting beliefs and intentions; control students answered the same questions without receiving this information. The main regression relates outcomes to (True Share − Prior Belief), set to zero for controls. Because students with less accurate prior beliefs may be more likely to choose the relevant major, OLS is potentially inconsistent; the authors use an RD design where the running variable is the information shock (True Share − Prior Belief), with the threshold at zero. Students just above (who overestimated) receive negative news; students just below (who underestimated) receive positive news. The RD estimates are combined with a first-stage estimate of belief updating to produce IV estimates of the effect of a 10 p.p. change in beliefs. Balance tests on predetermined demographics confirm no discontinuities at the threshold.&lt;/p&gt;
&lt;p&gt;Q: What are the first-stage belief-updating results?
A: Students update their posterior beliefs in response to the treatment: in response to information that the representative career is 1 p.p. less likely, students update their posterior beliefs down by 0.37 p.p. (p &amp;lt; 0.01). This under-reaction is consistent with Bayesian updating when priors are informative (Mobius et al. 2022). Students also update beliefs about non-representative careers: a 1 p.p. reduction in the representative career&amp;rsquo;s stated likelihood increases the expected probability of other careers by 0.27 p.p. (p &amp;lt; 0.01).&lt;/p&gt;
&lt;p&gt;Q: What are the effects of the information intervention on major intentions?
A: A 10 p.p. reduction in beliefs about the top major&amp;rsquo;s representative career reduces intentions (stated probability of graduating with that major) by 3.5 p.p. (p &amp;lt; 0.01). This effect is similar across subgroups (Columns 2–4 of Table 2). For students&amp;rsquo; second-ranked major, a 10 p.p. reduction in stereotyping boosts intentions by 2.1 p.p. (p = 0.17), which is imprecisely estimated but consistent in sign with all other outcomes.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on actual course enrollments?
A: In the semester immediately following the experiment, learning that the representative career of the first major is 10 p.p. less likely causes students to enroll in 0.22 fewer credits in that major&amp;rsquo;s field (95% CI: [−0.41, −0.02], p &amp;lt; 0.05), relative to a mean of 0.85 credits. Learning that the representative career of the second major is 10 p.p. less likely causes students to enroll in 0.20 more credits in the second major&amp;rsquo;s field (95% CI: [0.004, 0.40], p &amp;lt; 0.10), relative to a mean of 0.36 credits.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on official major declarations?
A: Within one year of the experiment, students who learned the representative career of their top major is 10 p.p. less likely are 6.1 p.p. less likely to have declared that major (95% CI: [−16.0, 3.8], p = 0.23) and 9.9 p.p. more likely to have declared their second major (95% CI: [2.5, 17.4], p &amp;lt; 0.01); the difference between these two effects is 16.0 p.p. (p &amp;lt; 0.01). By two years out, the effects are more attenuated. Treated students also spend on average 0.21 more semesters undecided before declaring a major (95% CI: [0.02, 0.40], p &amp;lt; 0.05). Effects do not appear to be driven by dropout: treated students are if anything slightly more likely to still be taking classes two to three years later.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Representativeness (likelihood ratio):&lt;/strong&gt; The representativeness R(c, M) of career c for major M is defined as the ratio p_{c|M} / p_{c|not-M} — how much more common career c is among major-M graduates than among graduates of all other majors. This is a relative, not absolute, frequency measure. The representative career (or exemplar) of a major is the career that maximizes this ratio.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stereotyping (as exaggeration of a kernel of truth):&lt;/strong&gt; In this paper&amp;rsquo;s framework, stereotyping means overweighting the representative career when forming beliefs about a major&amp;rsquo;s career distribution. The belief model is π_{c|M} = (1 − θ) p_{c|M} + θ · 1[c = c*(M)], where θ &amp;gt; 0 implies beliefs exaggerate how common the representative career is relative to equally prevalent non-representative careers. This is distinct from overconfidence, motivated reasoning, or simple noise.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DID-IAT score (difference-in-differences implicit association test):&lt;/strong&gt; The paper&amp;rsquo;s adaptation of the standard IAT to measure relative implicit associations between major and career groups. For a focal major–career pair, the DID-IAT score is the difference in the matched-vs-unmatched IAT D-score for the focal major (relative to a comparison major). A positive score indicates the focal major is more strongly associated with the focal career than the comparison major is. This measures implicit memory-based associations rather than deliberate beliefs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Misallocation (as used in the model):&lt;/strong&gt; The welfare loss arising because stereotyped beliefs draw marginal students — those on the margin between choosing the representative major and not — who have career preferences close to the average rather than being the students best suited to that major. These marginal students end up choosing careers other than the representative career after graduation at higher rates, producing major-career mismatch. Misallocation is shown (Proposition 2) to increase in the extent of stereotyping θ.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Information shock:&lt;/strong&gt; In the field experiment, the information shock for a given student and major is the difference between the true share of the major&amp;rsquo;s representative career and the student&amp;rsquo;s prior belief about that share. Positive shocks correspond to students who overestimated (and thus receive bad news); negative shocks correspond to students who underestimated (and receive good news). The RD design uses the threshold at shock = 0 to generate quasi-experimental variation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Source text origin (implicit in the paper&amp;rsquo;s design):&lt;/strong&gt; The paper measures beliefs about career distributions benchmarked against American Community Survey data on actual career outcomes of college graduates aged 30–50, restricting to respondents born 1958–1997. This defines the objective ground truth against which stereotyping is measured throughout the paper.&lt;/p&gt;</description></item><item><title>What Works and for Whom? Effectiveness and Efficiency of School Capital Investments Across the U.S.</title><link>https://macropaperwarehouse.com/papers/what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-u.s./</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-u.s./</guid><description>&lt;h2 id="what-works-and-for-whom-effectiveness-and-efficiency-of-school-capital-investments-across-the-us"&gt;What Works and for Whom? Effectiveness and Efficiency of School Capital Investments Across the U.S.&lt;/h2&gt;
&lt;h3 id="research-question"&gt;Research Question&lt;/h3&gt;
&lt;p&gt;This paper investigates which types of school facility investments benefit students (as measured by test scores) and are valued by homeowners (as measured by house prices), and for which student populations these investments are most effective. Prior state-level studies had reached conflicting conclusions about the returns to school capital spending, and no nationwide evidence had distinguished impacts across spending categories or student backgrounds.&lt;/p&gt;
&lt;h3 id="data-and-methodology"&gt;Data and Methodology&lt;/h3&gt;
&lt;p&gt;The authors assemble a novel panel dataset covering approximately 14,000 school bond referenda in 29 U.S. states and 10,146 districts enrolling 71% of all U.S. students, for the period 1990–2017. The dataset combines: (1) ballot-level bond election records including vote shares, proposed amounts, and ballot text; (2) district-level test scores from the Stanford Education Data Archive (SEDA) extended backward to 2003 for all states and as early as 1995 for some, normalized to a national scale via NAEP; (3) a Census-tract-level house price index (Contat and Larson, 2022) aggregated to school districts; and (4) NCES district finance and demographic data.&lt;/p&gt;
&lt;p&gt;Bond ballot texts are classified into eight spending categories using text-analysis: classroom construction/renovation; HVAC; other infrastructure (plumbing, roofs, furnaces); safety and health (pollutant removal, building safety); STEM equipment and labs; athletic facilities; land purchases; and transportation vehicles.&lt;/p&gt;
&lt;p&gt;The identification strategy exploits quasi-random variation from close bond elections, building on the dynamic regression discontinuity (DRD) framework of Cellini et al. (2010). A key methodological contribution is a stacked DRD design that addresses heterogeneous treatment effects correlated with timing: each treatment cohort (districts that narrowly authorize a bond in year c) is matched against &amp;ldquo;clean controls&amp;rdquo; — districts that also proposed a bond in the same cohort but narrowly failed to authorize it and did not authorize any bond in the following ten years. Cohorts are stacked, and a dynamic RD model is estimated controlling for cohort fixed effects and a district&amp;rsquo;s bond proposal history.&lt;/p&gt;
&lt;h3 id="main-findings-with-quantitative-magnitudes"&gt;Main Findings with Quantitative Magnitudes&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Average effects.&lt;/strong&gt; Bond authorization raises capital spending by approximately $1,650 per pupil cumulatively over five years. Test scores increase gradually, reaching 0.079 standard deviations (sd) higher five to eight years after authorization, and 0.073 sd higher nine to twelve years after. 2SLS estimates, amortizing spending over a 30-year project life at a 9% depreciation rate, imply that a $1,000 increase in the flow value of capital spending raises test scores by 0.048 sd. House prices rise by approximately 9% eight to nine years after authorization. When house price effects are estimated against only locally-financed capital spending (not state aid), the 2SLS estimate is 0.8% per $1,000 — roughly consistent with efficiency — suggesting that the larger reduced-form house price response is driven primarily by state aid that supplements local funds rather than by an inefficiently low ex ante spending level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by spending category.&lt;/strong&gt; Category-specific estimates reveal that only certain project types raise test scores: HVAC (+0.20 sd, largest effect), safety and health (+0.15 sd), other infrastructure/plumbing/roofs (+0.15 sd), STEM equipment (+0.15 sd implied), and classroom space (+0.10 sd), all measured three to six years post-election. By contrast, bonds for athletic facilities, land purchases, and transportation produce no detectable effects on test scores. The pattern for house prices is the inverse: athletic facilities generate a 17% house price increase; classroom space generates 14%; STEM generates 11% — while HVAC and safety/health bonds produce no significant effect on house prices. The correlation between category-level test score and house price estimates is −0.07, indicating these are largely orthogonal outcomes.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by student socioeconomic status.&lt;/strong&gt; Effects are concentrated in districts serving socioeconomically disadvantaged students (top tercile of the share of students eligible for free or reduced-price meals, denoted low-SES). In low-SES districts, bond authorization raises test scores by 0.13 sd after seven years and house prices by 15%; in high-SES districts, neither outcome shows a significant effect. 2SLS estimates confirm that a $1,000 increase in cumulative spending raises test scores by 0.08 sd in low-SES districts but produces no detectable change in high-SES districts. The SES gradient persists after conditioning on spending amounts, spending categories, and baseline capital stock, indicating that students in disadvantaged districts have higher marginal returns to capital improvements independent of these channels. High-minority districts (top tercile of Black and Hispanic share) similarly see a 0.12 sd test score gain and 15% house price gain after seven years, versus 0.04 sd and 3% in low-minority districts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Role of baseline capital stock.&lt;/strong&gt; Among districts with below-median capital stock, test score effects are 0.20 sd in low-SES districts seven years post-election. Even among above-median-stock districts, low-SES districts see house price effects exceeding 10% while high-SES districts see no effect. Differences by SES persist after conditioning on capital stock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Policy simulation.&lt;/strong&gt; Closing the spending gap between high- and low-SES districts (approximately $1,000 over 10 years) without changing the composition of spending would raise low-SES test scores by roughly 0.08 sd, closing about 8% of the roughly 1 sd achievement gap. Targeting that same additional spending toward HVAC and safety/health (the highest-impact categories) would generate test score increases approximately three times as large, potentially closing up to 25% of the observed achievement gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reconciling prior literature.&lt;/strong&gt; Replicating state-level estimates, the authors show that Ohio&amp;rsquo;s positive effects are explained by a high share of bonds in low-SES districts funding infrastructure, while Texas&amp;rsquo;s near-zero effects reflect a high share of bonds in higher-SES districts funding classrooms and athletic facilities.&lt;/p&gt;
&lt;h2 id="qa-analytical-steps-mechanisms-and-robustness"&gt;Q&amp;amp;A: Analytical Steps, Mechanisms, and Robustness&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: What is the first-stage effect of bond authorization on capital spending, and does it contaminate other spending categories?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A1: Bond authorization raises per-pupil capital spending by approximately $700 per year at two years post-election and $590 at three years, with cumulative spending $1,650 higher over five years in treated districts relative to districts that narrowly failed to authorize a bond. Bond revenues are legally restricted to capital uses, and the paper confirms that non-capital (current) spending and instructional spending are not affected following authorization. This establishes a clean first stage: bond authorization raises only capital outlays.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: Why does the standard DRD estimator of Cellini et al. (2010) require refinement, and what problem does the stacked DRD design solve?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A2: The original CFR estimator assumes treatment effects are uncorrelated with the timing of treatment — an assumption potentially violated when, for example, bonds financing HVAC (high-impact) versus athletic facilities (amenity-focused) have different propensities to be proposed at different points in time. The stacked DRD design avoids &amp;ldquo;forbidden comparisons&amp;rdquo; by comparing each treatment cohort only against clean controls that propose but fail to authorize a bond in the same year and do not authorize any bond in the subsequent ten years. This ensures consistency even when treatment effects are heterogeneous across cohorts and correlated with timing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do the authors validate the quasi-random assignment assumption of the regression discontinuity design?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A3: Three tests are performed. First, a McCrary (2008) density test on the vote margin distribution shows no discontinuity at the cutoff in the pooled or stacked data (p-values of 0.59 and 0.24, respectively), though discontinuities are found in Arkansas, Missouri, and Oklahoma — those three states are excluded. Second, pre-election district covariates (income, education, SES shares, enrollment, revenues, expenditures) are smooth around the cutoff in both datasets. Third, pre-election trends in test scores and house prices are flat and parallel between marginally approved and marginally rejected districts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How are the eight spending categories constructed, and how many bonds are successfully classified?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A4: Categories are drawn from the SchoolBondFinder.com classification produced by The Amos Group, then refined by splitting capital improvements into HVAC versus other infrastructure, splitting construction/renovation into classroom versus athletic facility projects, and adding land purchases as a separate category. Keyword-based text analysis of ballot language successfully assigns 75% of the approximately 14,000 bonds to at least one of the eight categories. More than two-thirds of classified bonds receive multiple category designations, with a mean of 2.9 categories per proposed bond and 3.2 per authorized bond.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: Why do HVAC bonds raise test scores but not house prices, while athletic facility bonds raise house prices but not test scores?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A5: The authors interpret this divergence as reflecting what different types of improvements offer to different stakeholders. HVAC improvements reduce excessive heat and air pollution exposure in classrooms, directly improving students&amp;rsquo; learning experiences — consistent with Park et al. (2020) on heat and Gilraine and Zheng (2022) on air pollution. These improvements are not visibly salient to homeowners without school-age children and carry no amenity value for the broader community. Athletic facilities, by contrast, are highly visible and provide a community amenity valued in the housing market regardless of their impact on academic instruction. The near-zero correlation (−0.07) between category-level test score and house price estimates confirms that the two outcomes respond to largely distinct features of capital investments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: What are the three candidate explanations for the larger effects of bond authorization in low-SES districts, and which explanations survive empirical scrutiny?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A6: The three candidates are: (1) larger spending increases after authorization in low-SES districts; (2) a different composition of spending categories (more toward high-impact HVAC and safety); and (3) higher marginal returns per dollar for disadvantaged students, holding spending size and composition fixed. The data confirm all three operate, but the third is the residual: 2SLS estimates show a $1,000 increase raises test scores by 0.08 sd in low-SES districts versus a statistically zero effect in high-SES districts, and within-category estimates show HVAC bonds raise scores by 0.27 sd in low-SES districts but have no detectable effect in high-SES districts. Differences by SES also persist after conditioning on the estimated baseline capital stock, though low capital stock accounts for part of the gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: How does the role of state aid alter the interpretation of the house price effect for spending efficiency?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A7: A 9% house price increase after bond authorization, if taken at face value under Brueckner&amp;rsquo;s (1979) efficiency test, would suggest the ex ante level of school capital spending was inefficiently low. However, state grants that partly match local bond revenues raise actual spending without raising local property taxes proportionally. When the 2SLS house price effect is estimated against only locally financed capital spending (using proposed bond size as the relevant measure), the implied house price increase is just 0.8% per $1,000 — consistent with rough efficiency on average across the full sample. The authors conclude that the large reduced-form house price response is driven primarily by the capitalization of state aid, not by an undersupply of capital investments at the aggregate level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Does household sorting account for the observed test score and house price gains following bond authorization?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A8: Bond authorization produces small but detectable compositional changes: the share of high-SES students is approximately 3 percentage points higher seven years after an election (a roughly 4% increase relative to an average share of 0.73), while enrollment and the share of white students are largely unaffected. However, controlling for district-by-year shares of each sociodemographic group only slightly attenuates the test score and house price estimates, indicating that sorting accounts for a small share of the observed gains.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: Are the findings robust to alternative research designs?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A9: The results are robust to five alternative estimation approaches: (1) the original one-step TOT estimator of Cellini et al. (2010); (2) a version of the stacked DRD where clean controls are districts that do not approve any bonds in the full [c−5, c+10] window; (3) a version that matches treated and control districts in each cohort based on bond history; (4) a version not controlling for future bond history; and (5) the extended two-way fixed effects (ETWFE) estimator of Wooldridge (2021). Results are also robust to linear polynomials with different slopes and quadratic polynomials of the vote margin.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: How does the capital stock measure illuminate mechanism, and what are its limitations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A10: The authors construct a district-level capital stock as the 30-year depreciated sum of capital spending from Census of Governments data (1967–2017) at a 5% depreciation rate. This stock is negatively correlated with the share of low-SES students, confirming that more disadvantaged students attend schools in worse structural condition. Conditioning on this proxy, the SES gradient in bond impacts is reduced but remains. Among districts with below-median capital stock, low-SES districts see test score gains of 0.20 sd after seven years, while among above-median-stock districts the gap narrows to approximately 0.10 vs. 0.05 sd. A key limitation is that detailed school-condition data are unavailable nationally, so the capital stock is a proxy only.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What is the quantitative policy implication of the targeting exercise?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A11: On average, low-SES districts receive about $97 per pupil per year less in capital spending than high-SES districts, so closing this gap over ten years implies approximately $970 in additional cumulative spending. Without changing spending composition, this would raise test scores by roughly 0.08 sd in low-SES districts, closing about 8% of the approximately 1 sd achievement gap between high- and low-SES districts. Redirecting that same additional spending toward the highest-impact categories (HVAC and safety/health) would generate test score gains roughly three times larger, potentially closing up to 25% of the observed achievement gap.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: How do the cross-state differences documented in prior literature map onto the paper&amp;rsquo;s heterogeneity findings?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;A12: The authors replicate earlier state-level estimates and show that Ohio&amp;rsquo;s relatively large positive effects — found by Conlin and Thompson (2017) — are explained by a high concentration of bonds in low-SES districts funding infrastructure, while Texas&amp;rsquo;s near-zero effects — found by Martorell et al. (2016) — reflect a high share of bonds in higher-SES districts funding classrooms and athletic facilities. Wisconsin and Michigan, which showed null effects in earlier studies, similarly have bond compositions and student demographics that predict small impacts under the paper&amp;rsquo;s heterogeneity framework.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Stacked Dynamic Regression Discontinuity (Stacked DRD).&lt;/strong&gt; The paper&amp;rsquo;s primary estimation strategy, which combines the dynamic RD framework of Cellini et al. (2010) with a stacked-cohort design adapted from the staggered difference-in-differences literature. For each treatment cohort (year in which a bond barely passes), &amp;ldquo;clean controls&amp;rdquo; are defined as districts that also proposed a bond in the same year but narrowly failed to authorize it and did not authorize any subsequent bond within ten years. Cohort-specific datasets are stacked and estimated jointly with cohort fixed effects, ensuring that estimates are robust to treatment effect heterogeneity correlated with timing.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Clean Controls.&lt;/strong&gt; Districts used as the counterfactual for treated districts in a given cohort: those that propose a bond in the same year as the treated cohort, barely fail to authorize it, and remain untreated for ten subsequent years. Their &amp;ldquo;clean&amp;rdquo; status is quasi-random because their future non-authorization results from narrow electoral loss rather than any endogenous district choice.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bond Spending Categories.&lt;/strong&gt; Eight mutually-non-exclusive classifications of bond spending derived from ballot text using keyword analysis: classroom space; HVAC; other infrastructure (plumbing, roofs, furnaces); safety and health (pollutant removal, compliance upgrades); STEM equipment and labs; athletic facilities; land purchases; and transportation. These categories are defined in the paper not by administrative accounting codes but by the stated intended use of funds in ballot language.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Treatment-on-the-Treated (TOT) Estimator.&lt;/strong&gt; The CFR estimator that captures the effect of bond authorization against the counterfactual of never authorizing a bond in the foreseeable future, achieved by including leads and lags of a district&amp;rsquo;s bond proposal history as controls. This addresses the problem that multiple elections over time make simple treated-vs-control comparisons confounded by past and future bond activity.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Capital Stock (District-Level Proxy).&lt;/strong&gt; A measure of each district&amp;rsquo;s accumulated school facility capital at a given point in time, constructed as the depreciated 30-year running sum of capital expenditures from the Census of Governments, using a 5% annual depreciation rate. Used as a proxy for facility conditions in the absence of nationally available building-quality data, and confirmed to be negatively correlated with district share of low-SES students.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Brueckner Efficiency Test.&lt;/strong&gt; An application of the theoretical framework linking public good provision levels to house price responses. If a spending increase raises house prices, the initial spending level was below the efficient level; if it lowers house prices, spending was too high. In this paper, the test is refined to use only locally-financed capital spending as the explanatory variable, to strip out the capitalization of state aid and isolate the efficiency assessment for locally-determined spending.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Socio-Economic Status (SES) Terciles.&lt;/strong&gt; Districts are ranked by the share of students eligible for free or reduced-price school meals as of 1995. &amp;ldquo;Low-SES districts&amp;rdquo; refers to those in the top tercile of this share (most disadvantaged); &amp;ldquo;high-SES districts&amp;rdquo; refers to those in the bottom tercile (least disadvantaged). Effects are estimated separately for these subsamples throughout.&lt;/p&gt;</description></item><item><title>What's My Employee Worth? The Effects of Salary Benchmarking</title><link>https://macropaperwarehouse.com/papers/whats-my-employee-worth-the-effects-of-salary-benchmarking/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/whats-my-employee-worth-the-effects-of-salary-benchmarking/</guid><description>&lt;p&gt;This paper studies how salary benchmarking tools — products that reveal aggregate market pay statistics for specific job titles — affect employee compensation. The research question is whether firms&amp;rsquo; access to such tools causally changes how they set salaries, and what this implies about information frictions in labor markets and the policy debate over benchmarking regulation.&lt;/p&gt;
&lt;p&gt;The authors collaborated with the largest U.S. payroll processing company (serving 650,000 firms and 20 million workers), exploiting the staggered roll-out of a proprietary Compensation Benchmark Tool. The tool aggregates payroll data into salary benchmarks by standardized job title, with the median base salary as its most prominent statistic. The study draws on three linked administrative datasets: payroll records (January 2017 to July 2021), tool usage logs (September 2019 to August 2021), and historical benchmark snapshots. The main analytical sample covers new hires at 586 treatment firms that gained tool access and 1,419 matched control firms that did not, within a 10-quarter window around each firm&amp;rsquo;s onboarding date.&lt;/p&gt;
&lt;p&gt;The identification strategy is difference-in-differences, exploiting three sources of variation: which firms gain access; the staggered timing of access (driven by the arbitrary order in which sales representatives introduced the tool); and within treatment firms, whether a specific position was actually searched in the tool. New hires are classified into Searched positions (5,266 hires at treatment firms for positions eventually looked up), Non-Searched positions (39,686 hires at treatment firms for positions not looked up), and Non-Searchable positions (156,865 hires at control firms). Event-study analyses confirm flat pre-trends across all groups, supporting causal interpretation.&lt;/p&gt;
&lt;p&gt;The primary finding is that benchmark access reduces salary dispersion around the median market benchmark by 25%. Before onboarding, the average absolute deviation of offered salaries from the median benchmark in Searched positions was 19.8 percentage points (pp). After onboarding, this fell to 14.9 pp — a drop of 5.0 pp using Non-Searched positions as control (p-value &amp;lt; 0.001) and 6.2 pp using Non-Searchable positions as control (p-value &amp;lt; 0.001). Compression runs in both directions: firms previously paying above the benchmark reduce salaries toward the median, and firms previously paying below raise salaries toward the median. The probability of setting a salary within 2.5% of the median benchmark nearly doubled, from 11.6% to 22.1% after onboarding.&lt;/p&gt;
&lt;p&gt;Effects are heterogeneous by skill level. For low-skill positions (approximately 42% of the sample, e.g., bank teller, receptionist), dispersion falls from 14.5 pp to 8.7 pp — a 40% reduction. For high-skill positions (e.g., software developer), dispersion falls from 24.0 pp to 20.5 pp — a 14.6% reduction. For low-skill positions, compression from below dominates, producing a net average salary increase of +5.0% to +6.7% (p-values 0.014 and 0.001 depending on control group). For high-skill positions, the average salary effect is small and statistically insignificant overall. Twelve-month retention rates for low-skill workers increase by 6.6 to 6.8 pp after benchmarking, and the implied retention elasticity is consistent with prior literature estimates.&lt;/p&gt;
&lt;p&gt;The authors propose a theoretical model to rationalize these findings. Firms are assumed uncertain about the wage distribution (aggregate uncertainty), with private information about their own value of filling a position and affiliated valuations across firms. In equilibrium, firms with higher values make higher offers — generating wage dispersion among identical workers without monopsony power, efficiency wages, or amenity differences. When a firm gains benchmark access, it adjusts its offer toward the threshold wage needed to hire, compressing offers from both sides. In the full-information equilibrium where benchmarks are common knowledge, the mean salary is weakly higher than without benchmarks, because the marginal firm had previously underestimated labor market tightness and offered too little, capturing extraordinary profits. Benchmarking eliminates these informational rents, intensifying competition and raising average pay.&lt;/p&gt;
&lt;p&gt;The scope of the empirical findings is restricted to new hires at firms in the top quartile of U.S. firm size by employment, across all industries and U.S. states, over 2017–2020. The estimated effect is the incremental causal impact of one additional high-quality benchmarking source, since most firms already had access to some pay information through other channels.&lt;/p&gt;
&lt;p&gt;Q: What is the main causal finding of the paper?
A: Access to the salary benchmarking tool reduces the absolute deviation of new-hire salaries from the median market benchmark by approximately 25%. Specifically, average dispersion in Searched positions falls from 19.8 pp before onboarding to 14.9 pp after, a drop of 5.0 pp (using Non-Searched controls, p-value &amp;lt; 0.001) or 6.2 pp (using Non-Searchable controls, p-value &amp;lt; 0.001). The two estimates are statistically indistinguishable from each other, and both are robust to a wide range of specification checks.&lt;/p&gt;
&lt;p&gt;Q: How does compression operate — does it raise or lower salaries?
A: Compression operates in both directions. Firms that would otherwise have paid above the median benchmark reduce salaries toward the median (&amp;ldquo;compression from above&amp;rdquo;), and firms that would otherwise have paid below the median benchmark raise salaries toward the median (&amp;ldquo;compression from below&amp;rdquo;). The probability of offering a salary within 2.5% of the median benchmark nearly doubled, from 11.6% before onboarding to 22.1% after.&lt;/p&gt;
&lt;p&gt;Q: What is the identification strategy, and why is the treatment considered as good as random?
A: The authors use a difference-in-differences design with three sources of variation: which firms gain tool access, the staggered timing of access, and whether specific positions were actually searched within a treatment firm. The payroll company introduced the tool through sales representatives contacting clients in an arbitrary order, not in response to firm characteristics or outcomes. This is corroborated by empirical tests: event-study pre-trends for Searched versus Non-Searched (and Non-Searchable) positions are flat and statistically indistinguishable from zero (pre-treatment coefficients of -0.346 and -0.310, p-values 0.749 and 0.604, respectively).&lt;/p&gt;
&lt;p&gt;Q: How large are the effects for low-skill versus high-skill positions?
A: For low-skill positions (approximately 42% of the sample, e.g., bank teller, receptionist), dispersion drops from 14.5 pp to 8.7 pp — a 40% decline (p-value &amp;lt; 0.001). For high-skill positions (e.g., software developer), dispersion drops from 24.0 pp to 20.5 pp — a 14.6% decline (p-value = 0.021). The larger effect for low-skill positions is consistent with anecdotal accounts from compensation managers, who report treating low-skill candidates as interchangeable and therefore wanting to offer exactly the market rate.&lt;/p&gt;
&lt;p&gt;Q: Does benchmarking raise or lower average salaries?
A: On average across all skill levels, the effect on mean salary is small and statistically insignificant: -0.2% (p-value = 0.756) using Non-Searched controls and +1.7% (p-value = 0.308) using Non-Searchable controls. For low-skill positions specifically, average salaries increase by +5.0% (p-value = 0.014) using Non-Searched controls and +6.7% (p-value = 0.001) using Non-Searchable controls. This net increase for low-skill workers reflects compression from below dominating compression from above in that subset.&lt;/p&gt;
&lt;p&gt;Q: What are the effects on employee retention?
A: For low-skill workers, benchmarking increases the probability of remaining employed at the hiring firm 12 months after the hire date by +6.6 pp (p-value = 0.101) using Non-Searched controls and +6.8 pp (p-value = 0.029) using Non-Searchable controls. The implied retention elasticity from the ratio of salary and retention effects is consistent with average estimates in the prior literature (Sokolova and Sorensen, 2021). No retention effects are reported for high-skill positions.&lt;/p&gt;
&lt;p&gt;Q: What is the theoretical mechanism through which aggregate uncertainty generates wage dispersion?
A: The model assumes a unit mass of firms simultaneously making wage offers to a mass Q &amp;lt; 1 of workers, with only the top Q offers accepted. Firms have private information about their value of filling the position, and values are affiliated (correlated in the sense of Milgrom and Weber, 1982). Because each firm is uncertain about what other firms will offer, higher-value firms rationally form higher beliefs about the prevailing wage distribution and make higher offers. This generates equilibrium wage dispersion among identical workers without monopsony power, efficiency wages, or amenity differences.&lt;/p&gt;
&lt;p&gt;Q: What does the model predict about the equilibrium effects of benchmarking when all firms have access?
A: When the benchmark is common knowledge, all firms make offers with full information about the wage distribution. The firms with the highest values win workers at a uniform wage that makes the marginal firm indifferent between hiring and not hiring. The model proves that the mean salary is higher in expectation under the benchmark equilibrium than in the no-benchmark equilibrium. The intuition is that without benchmarks, the marginal firm underestimates labor market tightness, offers less than the full-information competitive wage, and thereby captures extraordinary profits; benchmarking eliminates those rents and intensifies competition.&lt;/p&gt;
&lt;p&gt;Q: What are the policy implications of the findings regarding antitrust concerns?
A: In 2023, the DOJ and FTC rescinded a long-standing antitrust &amp;ldquo;safety zone&amp;rdquo; for salary benchmarks due to concerns that they could facilitate wage collusion. A 2021 executive order had mandated that agencies consider procompetitive effects as well. The authors&amp;rsquo; model addresses the collusion concern directly: in equilibrium, benchmarking raises (not lowers) average salaries. The empirical evidence is consistent with this — low-skill workers see average salary increases of 5-7% after benchmarking — suggesting a procompetitive justification for the tools.&lt;/p&gt;
&lt;p&gt;Q: How robust are the main results?
A: The main estimates are robust across a wide range of specification checks, including alternative winsorization levels, log-difference and binary (&amp;gt;10% deviation) dependent variables, heteroskedasticity-robust standard errors, exclusion of controls, inclusion of firm fixed effects, exclusion of tipping positions, restriction to Searched positions only, dropping SOC reweighting, and age restrictions. Two additional pieces of evidence corroborate the quasi-experimental findings: a survey experiment with SHRM HR managers shows that hypothetical benchmarks compress stated salary offers from both above and below; and quasi-random benchmark shocks (when large firms abruptly raise a position&amp;rsquo;s base salary by 10% or more) cause firms with tool access to converge to the new benchmark faster than firms without access.&lt;/p&gt;
&lt;p&gt;Q: What does the survey of HR managers reveal about how firms use benchmarks?
A: In a survey of 2,696 HR professionals conducted through SHRM&amp;rsquo;s research panel, 87.6% of those involved in salary-setting report using salary benchmarks. The vast majority (97.4%) use benchmarks to set pay for new hires. The most popular sources are industry surveys (68.0%) and free online data (58.1%), with payroll data services used by 23.2%. The median salary is ranked the most important benchmark statistic by 56.73% of respondents. Most respondents apply filters by state (84.15%) and industry (87.33%) when using the tool.&lt;/p&gt;
&lt;p&gt;Q: What are the main sources of potential attenuation or amplification bias in the estimated effects?
A: Attenuation bias may arise because (1) the benchmark tool studied is among the most advanced available, so firms already had some wage information from other sources, meaning the estimates capture only the incremental effect of one additional high-quality source; and (2) not all positions at treatment firms were searched, so the sample is restricted to positions where firms actually engaged with the benchmark. Potential upward bias could arise if firms adopting the tool were also undergoing broader HR system changes, but the flat event-study pre-trends argue against this explanation.&lt;/p&gt;
&lt;p&gt;Salary Benchmarking: The practice of using aggregated market pay data — provided by third parties such as payroll processors, consulting firms, or online platforms — to identify typical salaries for specific job titles and set internal pay accordingly. In the paper&amp;rsquo;s context, this refers specifically to an online tool that allows employers to look up the median and distributional statistics of base salaries for standardized position titles, filtered by industry and state.&lt;/p&gt;
&lt;p&gt;Aggregate Uncertainty: The paper&amp;rsquo;s label for a distinct source of information friction in which firms are uncertain about the distribution of wages offered by other firms in the market — as opposed to uncertainty about individual worker characteristics. This uncertainty is assumed to be the primitive that generates equilibrium wage dispersion in the model, and its resolution through benchmarking is the mechanism driving the empirical results.&lt;/p&gt;
&lt;p&gt;Salary Dispersion (around the benchmark): Measured empirically as the average absolute percentage difference between a new hire&amp;rsquo;s starting base salary and the median market benchmark for that position, expressed in percentage points. This is the paper&amp;rsquo;s primary outcome variable. Dispersion reflects firms&amp;rsquo; deviation from the market rate in either direction.&lt;/p&gt;
&lt;p&gt;Compression from Above / Compression from Below: Compression from above refers to the reduction in salaries at firms that would otherwise have paid more than the median benchmark after gaining benchmark access. Compression from below refers to the increase in salaries at firms that would otherwise have paid less than the median benchmark. Both directions of adjustment are documented empirically and are predicted by the model.&lt;/p&gt;
&lt;p&gt;Searched / Non-Searched / Non-Searchable Positions: The paper&amp;rsquo;s classification of new hires into three groups for identification purposes. Searched positions are those at treatment firms for which the firm actually looked up the benchmark. Non-Searched positions are at treatment firms but were not looked up, serving as a within-firm control. Non-Searchable positions are at control firms with no tool access, serving as a cross-firm control.&lt;/p&gt;
&lt;p&gt;Affiliation (across firm values): A technical condition borrowed from auction theory (Milgrom and Weber, 1982) used in the paper&amp;rsquo;s model to characterize the correlation structure of firms&amp;rsquo; private valuations of filling a position. Affiliation implies that when one firm has a high value, others are also more likely to have high values, and hence to offer high wages — generating the model&amp;rsquo;s equilibrium wage dispersion.&lt;/p&gt;
&lt;p&gt;Procompetitive Effect of Benchmarking: The paper&amp;rsquo;s term for the welfare-improving property of salary benchmarks identified in the model: by resolving aggregate uncertainty, benchmarks cause the marginal firm to offer closer to the full-information competitive wage, reducing extraordinary profits that arise from informational rents and raising the mean salary in equilibrium. This is the key concept in the paper&amp;rsquo;s contribution to the antitrust policy debate.&lt;/p&gt;</description></item><item><title>When Did Growth Begin? New Estimates of Productivity Growth in England from 1250 to 1870</title><link>https://macropaperwarehouse.com/papers/when-did-growth-begin-new-estimates-of-productivity-growth-in-england-from-1250-to-1870/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/when-did-growth-begin-new-estimates-of-productivity-growth-in-england-from-1250-to-1870/</guid><description>&lt;h2 id="overview"&gt;Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question.&lt;/strong&gt; When did sustained productivity growth begin in England? This paper constructs new estimates of the evolution of productivity in England from 1250 to 1870, with the goal of both dating the onset of growth and using that dating to discriminate between competing theories of why growth began.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Methodological Innovation.&lt;/strong&gt; The core challenge is that real wages over this period were heavily distorted by Malthusian population dynamics. Plague-induced population collapses (most dramatically the Black Death of 1348, which killed roughly 25% of England&amp;rsquo;s population) drove enormous swings in real wages that reflect movements along a stable labor demand curve, not changes in productivity. A naive regression of wages on labor supply is therefore inconsistent, because in a Malthusian world productivity growth induces population growth, making labor supply endogenous to productivity.&lt;/p&gt;
&lt;p&gt;The authors address this by writing down and structurally estimating a full Malthusian model of the economy. Output is produced with fixed land and variable labor (and, in an extended model, capital) via a Cobb-Douglas production function. The labor demand curve equates the real wage to the marginal product of labor. Population growth is increasing in real per-capita income (the Malthus law of motion), capturing both preventive and positive checks. Productivity follows a random walk with drift, and the paper allows for two structural breaks in the average drift rate mu. Exogenous population shocks, modeled as infrequent, sizable plague draws from a beta distribution plus a Gaussian noise term, provide identification: plague shocks and productivity shocks generate observationally distinct dynamics &amp;ndash; plague shocks cause an immediate population drop that gradually reverts, while productivity shocks cause an immediate wage jump followed by a slow population rise to a new steady state. The model is estimated via Bayesian Hamiltonian Monte Carlo (Stan), and structural break dates for mu are chosen by maximizing the Bayes factor (marginal likelihood) over the observed data on real wages, population, and days worked per worker.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Key Data.&lt;/strong&gt; Real wages are from Clark (2010) unskilled building workers series. Post-1540 population is from Wrigley et al. (1997); pre-1540 population trends are from Clark (2007b) manorial records. Days worked per worker are from Humphries and Weisdorf (2019). All series are used as decadal averages.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings.&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Onset of growth: 1600.&lt;/strong&gt; Productivity growth was zero before 1600. The Bayes factor strongly favors a first structural break in mu at 1600; break dates before 1590 and after 1640 are clearly rejected.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Two-phase post-1600 growth.&lt;/strong&gt; Between 1600 and 1810, average productivity growth was 4% per decade (posterior mean; 95% credible interval approximately 2%-6%). After 1810, productivity growth accelerated sharply to 18% per decade (95% CI approximately 12%-23%). The second break date is estimated to 1810 (the only alternative not clearly rejected is 1800).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Magnitude of productivity change.&lt;/strong&gt; By the authors&amp;rsquo; estimates, productivity in England was approximately 540% higher in 1850 than in 1500. This contrasts sharply with Clark&amp;rsquo;s (2010) dual-approach TFP series, which implies essentially no change over this period. The authors attribute the discrepancy to mismeasurement in Clark&amp;rsquo;s land rent series.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Productivity growth preceded the Glorious Revolution.&lt;/strong&gt; Productivity rose by an estimated 48% between 1600 and 1680, well before the Glorious Revolution of 1688 and the English Civil War (1642-1651). This supports the view that economic change contributed to causing the bourgeois institutional reforms of the 17th century, consistent with the Marxist tradition (Hill, 1940, 1961), rather than that institutional change preceded and caused growth.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Weakness of Malthusian population force.&lt;/strong&gt; The elasticity of population growth with respect to real income (gamma) is estimated at 0.09. Combined with a slope of the labor demand curve (alpha) of 0.53, this implies a half-life of plague-induced population dynamics of approximately 150 years. A doubling of real per-capita income stimulated population growth by only 6 percentage points per decade &amp;ndash; indicating Malthusian forces were sufficiently weak to be overwhelmed by post-1800 productivity growth. The model implies that the post-1810 productivity growth rate would have produced a 28-fold long-run increase in steady-state real wages even without the Demographic Transition.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Capital extension.&lt;/strong&gt; When capital is explicitly incorporated, using rates of return on agricultural land and rent charges to infer the capital stock, results are broadly similar: productivity growth from 1600-1810 is 3% per decade and post-1810 is 14% per decade. Capital&amp;rsquo;s production function exponent is estimated at 0.18, confirming that capital accumulation explains only a modest share of growth.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions.&lt;/strong&gt; All estimates are for England specifically. The model assumes competitive factor markets, a Cobb-Douglas (or CES) production function, and a log-linear Malthusian population law of motion. Results are robust to alternative wage series (farm laborers, craftsmen, Allen&amp;rsquo;s series), alternative population sources (Broadberry et al., 2015), constant-days-worked assumption, and alternative prior distributions.&lt;/p&gt;
&lt;h2 id="qa"&gt;Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why can&amp;rsquo;t standard OLS regression of wages on labor supply recover productivity in this setting?&lt;/strong&gt;
In a Malthusian world, productivity growth causes population growth, which in turn raises labor supply. This means labor supply and productivity are positively correlated, biasing OLS estimates. The authors demonstrate this concretely: from 1300 to 1450 (plague era), wages and labor supply moved in opposite directions along a stable labor demand curve, while after 1630 the same data points begin shifting off that curve &amp;ndash; a pattern that OLS would confound with changes in the slope rather than shifts in the intercept.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors distinguish empirically between a plague shock and a productivity shock?&lt;/strong&gt;
The two shocks generate fundamentally different dynamics. A plague shock causes an immediate, large drop in population and a corresponding spike in wages; over time, high wages induce population growth and both wages and population gradually return to their pre-plague levels. A permanent productivity shock, by contrast, causes an immediate rise in wages with no contemporaneous population change; population then slowly rises and wages partially revert until a new, higher steady-state population is reached. The model exploits these different impulse-response signatures in the joint data on wages and population to identify the two shocks separately.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: What is the Bayes factor evidence for the 1600 break date?&lt;/strong&gt;
Figure 8 in the paper shows the Bayes factor for models with different first break dates (all holding the second break at 1810). The Bayes factor rises sharply from 1580 to 1600 and falls more gradually from 1600 to 1650. Break dates before 1590 and after 1640 are clearly rejected using the standard rule of thumb that a Bayes factor of 10 constitutes strong evidence. The 1600-1810 pair of break dates yields the highest marginal likelihood of any combination considered.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: How does the paper&amp;rsquo;s productivity estimate compare to Clark&amp;rsquo;s (2010) dual-approach TFP series?&lt;/strong&gt;
Clark&amp;rsquo;s series implies productivity in England was essentially unchanged between the 15th and mid-19th centuries &amp;ndash; a result the paper argues is implausible and inconsistent with Allen&amp;rsquo;s (2005) agricultural TFP estimates (which show a 162% increase in agricultural TFP between 1500 and 1850). The authors&amp;rsquo; baseline estimate implies productivity was approximately 540% higher in 1850 than in 1500. The authors conjecture that a key driver of the difference is mismeasurement in Clark&amp;rsquo;s land rent series, which appears essentially flat from 1250 to 1600 despite enormous plague-induced swings in the land-labor ratio over this period.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What does the Malthusian model imply about &amp;ldquo;Engel&amp;rsquo;s Pause&amp;rdquo; &amp;ndash; the apparent stagnation of real wages during early industrialization?&lt;/strong&gt;
Between 1730 and 1800, real wages fell slightly despite what the model estimates to be substantial productivity growth. The conventional explanation is that the gains from early industrialization accrued to capitalists rather than workers. The authors offer an alternative Malthusian explanation: England&amp;rsquo;s population grew rapidly over this period, and in the Malthusian model this population growth depressed wages relative to productivity. The authors do not reject the distributional explanation but show that Malthusian forces alone are sufficient to explain the wage-productivity divergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How quantitatively important are days worked (the Industrious Revolution) for the productivity estimates?&lt;/strong&gt;
The authors find that their productivity estimates are largely insensitive to whether the Humphries-Weisdorf (2019) days-worked series or a constant-days assumption is used. The qualitative pattern &amp;ndash; zero growth before 1600, modest growth 1600-1810, rapid acceleration post-1810 &amp;ndash; and the quantitative magnitudes remain similar. What does change is the estimated slope of the labor demand curve alpha: assuming constant days makes the labor demand curve steeper. This robustness is reassuring given that the Industrious Revolution is a contested empirical phenomenon.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: What does the model imply about the speed of Malthusian population dynamics, and how does this compare to prior estimates?&lt;/strong&gt;
The estimated elasticity of population growth to real income gamma = 0.09, combined with alpha = 0.53, implies a half-life of population dynamics of approximately 150 years. This is consistent with but lies between prior structural estimates: Lee and Anderson (2002) find a half-life of 107 years, and Crafts and Mills (2009) find 431 years. All estimates agree that Malthusian dynamics in England were slow relative to the conceptual ideal of rapid subsistence convergence.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Can the model explain the post-1750 population explosion without invoking the Demographic Transition?&lt;/strong&gt;
Yes. The authors simulate predicted population paths from 1740 to 1860 taking real wages and days worked as given and using their estimated gamma and alpha. Despite the weak Malthusian population force, the model can explain the vast majority of the observed population growth from 6 million in 1740 to nearly 20 million in 1860 (10.4% per decade). The key mechanism is that days worked increased substantially over this period, raising per-capita income well above what real wages alone would suggest.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How does incorporating capital change the productivity estimates?&lt;/strong&gt;
In the capital-augmented model, the capital stock is inferred from rates of return on agricultural land and rent charges (Clark 2002, 2010). The capital exponent beta is estimated at 0.18, indicating a modest role for capital in pre-industrial England. Average productivity growth from 1600-1810 falls from 4% to 3% per decade, and post-1810 growth falls from 18% to 14% per decade. The authors conclude that the vast majority of growth from 1600 to 1870 cannot be attributed to capital accumulation. From 1600 to 1860, the estimated capital stock grew by a factor of five (8% per decade).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What theories of the onset of growth are consistent vs. inconsistent with the authors&amp;rsquo; timing evidence?&lt;/strong&gt;
Inconsistent: The North-Weingast (1989) view that the Glorious Revolution of 1688 was the key institutional trigger, since productivity had already risen 48% between 1600 and 1680. Also inconsistent: gradual-growth theories (Kremer 1993, Galor-Weil 2000) in which there is no discrete acceleration. Consistent: Marxist accounts (Hill 1940, 1961) that economic change drove 17th-century institutional change; Acemoglu-Johnson-Robinson (2005) accounts linking Atlantic trade enrichment to the demand for secure property rights (timing broadly consistent, though growth rates do not visibly accelerate after the Civil War or Glorious Revolution); cultural-change accounts (Mokyr, McCloskey) tracing the onset of growth to the spread of literacy and scientific rationalism around 1600; Allen&amp;rsquo;s (2009a) directed-technical-change theory linking 17th-century wage growth to the later profitability of labor-saving innovation in the Industrial Revolution.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: What does the model imply about the long-run real wage consequences of post-1810 productivity growth, even counterfactually assuming Malthusian forces persisted?&lt;/strong&gt;
The steady-state real wage in the Malthusian model is w-bar = mu/(alpha*gamma) minus subsistence-related terms. For mu = 0.018 (the post-1810 estimate), this formula implies a long-run real wage 28 times higher than the steady state under zero productivity growth. In other words, even if the Demographic Transition had not occurred and birth and death rates had remained sensitive to income, post-1810 productivity growth was fast enough relative to the weak Malthusian force to generate substantial sustained rises in living standards.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Labor demand curve (in the paper&amp;rsquo;s sense).&lt;/strong&gt; The equilibrium relationship between real wages and labor supply derived from competitive profit maximization by landowners facing a fixed land endowment: w_t = phi - alpha*l_t + a_t. Productivity is identified as shifts in this curve across time periods. The slope alpha is not simply the land share under a CES production function but equals one minus the labor share divided by the elasticity of substitution between labor and land.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Malthusian population force.&lt;/strong&gt; The feedback mechanism by which higher real wages induce faster population growth, expanding labor supply and pushing wages back toward a steady state. Its speed is governed jointly by gamma (elasticity of population growth with respect to income) and alpha (slope of the labor demand curve); the half-life of wage/population dynamics after a shock equals log(0.5)/log(1 - alpha*gamma). In the paper&amp;rsquo;s estimates, this force was sufficiently weak (half-life approximately 150 years) that post-1800 productivity growth overwhelmed it.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Plague shock (xi_1t).&lt;/strong&gt; An infrequent, large, exogenous negative population shock modeled as a draw from a beta distribution occurring with probability pi. Plagues are the primary source of identifying variation for the pre-1600 period: they generate movements along a stable labor demand curve and allow the slope alpha and the (lack of) productivity trend to be separately identified from labor demand shifts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Structural break in average productivity growth (mu).&lt;/strong&gt; The drift parameter in the random-walk model for the permanent component of productivity. The paper allows two breaks in mu, with break dates chosen to maximize the marginal likelihood (Bayes factor). The best-fitting breaks are at 1600 (zero to 4% per decade) and 1810 (4% to 18% per decade).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Permanent vs. transitory productivity component.&lt;/strong&gt; Productivity is decomposed into a permanent component a-tilde_t (random walk with drift, sigma_epsilon1) and a transitory component epsilon_2t (iid noise, sigma_epsilon2). The paper reports and interprets the permanent component as the meaningful measure of underlying technological change; transitory shocks are treated as measurement error and short-run fluctuations.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Industrious Revolution.&lt;/strong&gt; The hypothesized long-run increase in days worked per worker in England, associated with de Vries (1994, 2008). The paper uses Humphries-Weisdorf (2019) estimates showing a sharp drop after the Black Death followed by a sustained rise from 1350 onward. A key robustness result is that the paper&amp;rsquo;s productivity estimates are insensitive to whether this Industrious Revolution is assumed to have occurred.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Bayes factor (model selection).&lt;/strong&gt; The ratio of marginal likelihoods p(y|M_t)/p(y|M_t&amp;rsquo;) for two competing models, used here to select structural break dates for mu. A factor of 10 is treated as strong evidence. The bridge sampling method of Gronau, Singmann, and Wagenmakers (2020) is used to compute marginal likelihoods.&lt;/p&gt;</description></item><item><title>When is TSLS Actually LATE?</title><link>https://macropaperwarehouse.com/papers/when-is-tsls-actually-late/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/when-is-tsls-actually-late/</guid><description>&lt;p&gt;This paper asks: when does two-stage least squares (TSLS) with covariates actually estimate a local average treatment effect (LATE) — a non-negatively weighted average of causal effects for compliers only? The authors show that the answer is: almost never in practice.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central theoretical result (Theorem 1) is that a linear IV estimand is weakly causal — meaning it cannot have the wrong sign relative to all underlying treatment effects — if and only if the IV specification has &amp;ldquo;rich covariates,&amp;rdquo; defined as the condition that the linear projection of the instrument onto the covariates, L[Z|X], equals the true conditional mean E[Z|X] at every covariate value. Saturated specifications (nonparametric covariate control) always satisfy rich covariates. Outside of two special cases — saturated covariates or an instrument that is mean-independent of covariates — rich covariates is an implicit parametric assumption that can fail.&lt;/p&gt;
&lt;p&gt;When rich covariates fails, the TSLS estimand is &amp;ldquo;level dependent&amp;rdquo;: it depends not only on treatment effects for compliers but also on the levels of potential outcomes for always-takers and never-takers, some of which receive negative weight. The problem arises mechanically because the numerator of the IV estimand, E[Y Z̃], contains a term E[E[Y|X] E[Z̃|X]] that reflects untreated-outcome levels rather than causal contrasts. This term vanishes only when E[Z̃|X] = E[Z|X] − L[Z|X] = 0, i.e., rich covariates.&lt;/p&gt;
&lt;p&gt;To document how common this failure is in practice, the authors surveyed 122 empirical IV papers published in five top economics journals (JPE, AER, QJE, ReStud, Econometrica) between January 2000 and October 2018. Of the 99 papers using TSLS with covariates, only 5 used a saturated specification at any point and only 1 (Chamberlain and Imbens 2004) used saturated specifications exclusively. Nearly a third of TSLS-with-covariates papers explicitly invoked the LATE interpretation; none reported a test of rich covariates.&lt;/p&gt;
&lt;p&gt;The paper applies these findings to thirteen empirical studies. In Card (1995), the original IV estimate of returns to education is 0.132; the Ramsey RESET test overwhelmingly rejects rich covariates, and a DDML estimate of the weakly causal quantity β_rich is modestly smaller, with a relative specification bias of roughly 8% and the gap between β_iv and β_rich representing about 21% of the OLS–IV gap. In Nunn and Wantchekon (2011), the IV estimate of the slave trade&amp;rsquo;s effect on trust is nearly four times as large as the DDML estimate; after reestimation, the null of no effect would not be rejected at conventional significance levels. In Dube and Harish (2020), the DDML estimate of β_rich is about 20% smaller than the original IV estimate (roughly 40% of the OLS–IV gap) and is no longer significantly different from zero at conventional levels.&lt;/p&gt;
&lt;p&gt;The paper also shows that Abadie&amp;rsquo;s (2003) kappa-weighting approach fails under the same necessary condition: it is weakly causal if and only if rich covariates holds, at which point it is numerically identical to standard IV — leaving no reason to use it. Monte Carlo simulations calibrated to Card (1995) show that saturated specifications can exhibit substantial finite-sample bias when the covariate support is large relative to the sample, while DDML partially linear IV (PLIV) converges to β_rich with decreasing bias as sample size grows.&lt;/p&gt;
&lt;p&gt;The authors conclude that two conditions are jointly necessary for TSLS to be interpretable as a non-negatively weighted average of LATEs: (i) rich covariates, and (ii) a first-stage flexible enough to capture any covariate-varying direction of monotonicity. Both conditions fail routinely in published work. The recommended alternatives are: DDML PLIV for estimating β_rich (a weakly causal weighted average of conditional LATEs), or instrument propensity score weighting / Abadie kappa with correctly estimated E[Z|X] for estimating the unconditional ACR/LATE. The Ramsey RESET test is offered as a practical diagnostic for rich covariates violations, and it detected sizable discrepancies in each of the thirteen applications examined.&lt;/p&gt;
&lt;p&gt;Q: What is the paper&amp;rsquo;s central theoretical result?
A: Theorem 1 establishes that, given conditional exogeneity and monotonicity, the linear IV estimand β_iv is weakly causal — i.e., cannot systematically misrepresent the sign of treatment effects — if and only if the IV specification has rich covariates (L[Z|X] = E[Z|X] for every covariate value x). Rich covariates is therefore simultaneously sufficient and necessary; the sufficient direction was a special case of Kolesar (2013), while the necessary direction is novel to this paper.&lt;/p&gt;
&lt;p&gt;Q: What does &amp;ldquo;rich covariates&amp;rdquo; mean and when is it satisfied?
A: Rich covariates means that the linear projection of the instrument onto the included covariates exactly reproduces the instrument&amp;rsquo;s true conditional mean at every point in the covariate support. It is automatically satisfied in two cases: when covariates are specified saturatedly (with an indicator for each covariate cell), or when the instrument is mean-independent of all covariates so E[Z|X] is a constant. Outside these cases, rich covariates is an implicit parametric functional form assumption.&lt;/p&gt;
&lt;p&gt;Q: What goes wrong when rich covariates fails?
A: When L[Z|X] ≠ E[Z|X], the IV estimand becomes &amp;ldquo;level dependent&amp;rdquo;: it depends not only on treatment effects (causal contrasts) but also on the levels of potential outcomes for always-takers and never-takers. Because always-takers always receive Y(1) and never-takers always receive Y(0), the estimand picks up these levels through the term E[E[Y|X] E[Z̃|X]], which is nonzero whenever E[Z̃|X] = E[Z|X] − L[Z|X] ≠ 0. This can cause β_iv to be negative even when all complier and always-taker treatment effects are positive.&lt;/p&gt;
&lt;p&gt;Q: How is the paper&amp;rsquo;s critique different from the two-way fixed effects (TWFE) literature?
A: The TWFE literature (Goodman-Bacon 2021; Sun and Abraham 2021) identifies negative-weight problems arising from heterogeneous treatment effects due to cohort timing, but those estimands are not level dependent. By contrast, the TSLS problems identified here involve level dependence and persist even under constant, homogeneous treatment effects (Proposition 5), making the critique more fundamental and harder to dismiss by assuming effect homogeneity.&lt;/p&gt;
&lt;p&gt;Q: Does the problem disappear if treatment effects are constant?
A: No. Proposition 5 shows that rich covariates remains necessary for β_iv to be weakly causal even under Assumption CLE (constant, linear treatment effects). Level dependence occurs whenever E[Z̃|X] ≠ 0, regardless of effect heterogeneity. The only additional assumption that can substitute is Assumption LIN (linear potential outcome means), which together with constant effects implies β_iv = Δ exactly (Proposition 6), but this combination is a strong parametric restriction.&lt;/p&gt;
&lt;p&gt;Q: What does the survey of empirical papers find?
A: Of 122 IV papers in top journals from 2000–2018, 112 used TSLS, and 99 of those included covariates. Of the 99, only 5 (about 5%) used any saturated specification, and only 1 used saturated specifications exclusively. About a third of TSLS-with-covariates papers explicitly invoked the LATE interpretation. No papers reported a test of rich covariates such as the Ramsey RESET test.&lt;/p&gt;
&lt;p&gt;Q: What happens in the Card (1995) returns-to-education application?
A: The original linear IV estimate of the return to education is 0.132. The RESET test overwhelmingly rejects the null of rich covariates. The DDML estimate of β_rich is modestly smaller, with a relative specification bias of about 0.076 (roughly 8%). The gap between β_iv and β_rich represents about 21% of the OLS–IV gap, which the authors characterize as a sizable fraction of the &amp;ldquo;selection bias&amp;rdquo; corrected by IV. The DDML estimate of the unconditional ACR/LATE (β_acr) is roughly half the size of β_rich.&lt;/p&gt;
&lt;p&gt;Q: What happens in Nunn and Wantchekon (2011)?
A: The RESET test overwhelmingly rejects rich covariates. The IV estimate of the slave trade effect on trust is nearly four times as large as the DDML estimate of β_rich. After reestimation, the null hypothesis that the slave trade had no impact on trust levels would not be rejected at conventional significance levels, reversing the paper&amp;rsquo;s central finding.&lt;/p&gt;
&lt;p&gt;Q: What happens in Dube and Harish (2020)?
A: The RESET test overwhelmingly rejects rich covariates. The DDML estimate of β_rich is about 20% smaller than the original IV estimate, representing roughly 40% of the OLS–IV gap. While estimated with similar precision, the DDML estimate is no longer significantly different from zero at conventional significance levels.&lt;/p&gt;
&lt;p&gt;Q: Does Abadie&amp;rsquo;s (2003) kappa-weighting approach solve the problem?
A: No. Proposition 7 shows that the kappa-weighted estimand β_abadie is weakly causal if and only if rich covariates holds. Moreover, when rich covariates holds, β_abadie is numerically identical to β_iv, so kappa weighting provides no additional benefit. When rich covariates fails, kappa weighting is not weakly causal for the same reason as standard IV.&lt;/p&gt;
&lt;p&gt;Q: What does the Monte Carlo simulation show about practical alternatives?
A: The simulation, calibrated to Card (1995) data with covariates (experience, region indicators), shows that: a linear IV specification without rich covariates converges to β_iv = 0.660, decomposed as +0.391 from positively-weighted compliers, +0.614 from positively-weighted always-takers, and −0.345 from negatively-weighted always-takers — when the true weakly causal quantity β_rich = 0.430. Saturated specifications converge to β_rich but exhibit substantial bias at small sample sizes relative to covariate support. DDML PLIV converges to β_rich with bias decreasing in sample size, making it the recommended practical estimator.&lt;/p&gt;
&lt;p&gt;Q: What is the relationship between this paper and Sloczynski (2020, 2024)?
A: Sloczynski (2020, 2024) maintains rich covariates as an assumption and shows that TSLS can still fail to be weakly causal if monotonicity direction varies with covariates and the first stage omits instrument-covariate interactions. This paper focuses on the necessity of rich covariates itself, under strong (unconditional) monotonicity. Taken together, the two papers establish that both rich covariates and a sufficiently flexible first stage are jointly necessary for TSLS to be interpretable as a non-negatively weighted average of LATEs.&lt;/p&gt;
&lt;p&gt;Q: What practical recommendations do the authors offer?
A: The authors recommend: (1) always running the Ramsey RESET test to check rich covariates, implementable in Stata or R; (2) if using a binary instrument, checking that fitted values L[Z|X] lie in [0,1], necessary for rich covariates; (3) using DDML PLIV to estimate the weakly causal β_rich nonparametrically; and (4) for binary instrument/treatment, using instrument propensity score weighting (e.g., Sloczynski et al. 2024) or Abadie kappa with correctly estimated E[Z|X] to target the unconditional ACR/LATE. All recommended methods are available in mature Stata or R packages.&lt;/p&gt;
&lt;p&gt;Rich covariates: The condition that the linear projection of the instrument Z onto the included covariates X, denoted L[Z|X], exactly equals the true nonparametric conditional mean E[Z|X] at every point in the covariate support. This is both necessary and sufficient for the linear IV estimand to be weakly causal under exogeneity and monotonicity. It is automatically satisfied by saturated covariate specifications or when the instrument is mean-independent of covariates; otherwise it is an implicit parametric assumption.&lt;/p&gt;
&lt;p&gt;Weakly causal estimand: An estimand β is weakly causal if, whenever all subgroup- and covariate-specific treatment effects have the same sign, β has that sign too. This is an intentionally minimal requirement — it merely asks that the estimand not be systematically misleading about the direction of causal effects. An estimand can be weakly causal and still be difficult to interpret as a specific population parameter.&lt;/p&gt;
&lt;p&gt;Level dependence: The phenomenon in which a linear IV estimand depends not only on treatment effects (causal contrasts μ_j(g,x) − μ_{j−1}(g,x)) but also on the levels of potential outcomes (the baseline μ_0(g,x) terms). Level dependence arises when E[Z̃|X] = E[Z|X] − L[Z|X] ≠ 0, causing the always-taker and never-taker potential outcome levels to enter the estimand and potentially reverse its sign.&lt;/p&gt;
&lt;p&gt;Local average treatment effect (LATE): The average treatment effect for the subpopulation of compliers — those whose treatment status is changed by the instrument. In the binary treatment, binary instrument case, LATE = E[Y(1) − Y(0) | T(1) &amp;gt; T(0)]. LATE has a concrete counterfactual interpretation and is non-negatively weighted by construction; the paper asks under what conditions TSLS actually estimates a weighted average of LATEs.&lt;/p&gt;
&lt;p&gt;Partially linear IV (PLIV) / DDML: A modification of classical linear IV in which the linear function of covariates is replaced by an unknown nonparametric function, estimated using machine learning methods (random forests, gradient boosted trees, neural networks) with cross-fitting, as in Chernozhukov et al. (2018). The coefficient on treatment in the PLIV model equals β_rich, the weakly causal IV estimand that would result if rich covariates were exactly satisfied.&lt;/p&gt;
&lt;p&gt;Unconditional average causal response (ACR): When the instrument is binary, ACR = E[Y(T(1)) − Y(T(0)) | T(1) &amp;gt; T(0)], which reduces to the unconditional LATE when treatment is also binary. ACR differs from β_rich because β_rich places extra weight on covariate values with more instrument variation, while ACR weights compliers equally regardless of covariate-specific instrument variance. The paper documents that DDML estimates of β_acr can be roughly half the size of β_rich.&lt;/p&gt;
&lt;p&gt;Saturate and weight (SW) specification: The TSLS specification proposed by Angrist and Pischke (2009, Theorem 4.5.1), in which both covariates and instrument-covariate interactions are fully saturated as excluded variables in the first stage. SW is guaranteed to satisfy rich covariates and, under weak monotonicity allowing direction to vary with covariates, produces a non-negatively weighted average of covariate-specific LATEs. It was used by only one paper (Chamberlain and Imbens 2004) in the authors&amp;rsquo; survey of 99 empirical IV papers.&lt;/p&gt;</description></item><item><title>Who's Afraid of the Minimum Wage? Measuring the Impacts on Independent Businesses Using Matched U.S. Tax Returns</title><link>https://macropaperwarehouse.com/papers/whos-afraid-of-the-minimum-wage-measuring-the-impacts-on-independent-businesses-using-matched-u.s.-tax-returns/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/whos-afraid-of-the-minimum-wage-measuring-the-impacts-on-independent-businesses-using-matched-u.s.-tax-returns/</guid><description>&lt;h2 id="layer-1--overview"&gt;Layer 1 — Overview&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Research Question&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This paper asks how independent (pass-through) businesses in the United States accommodate minimum wage increases — specifically whether they reduce employment, compress profits, pass costs through to customers, or exit — and what happens to the low-earning workers and business owners affected by these adjustments.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Data and Methodology&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors construct a novel linked firm-worker-owner panel dataset from the universe of U.S. tax returns, covering approximately 235,000 pass-through firms (S-corporations, partnerships, and LLCs) per year in highly exposed industries over 2010–2019. &amp;ldquo;Highly exposed&amp;rdquo; industries are defined as those where at least 15% of workers earned below the full-time equivalent of the federal minimum wage ($15,080 per year) in 2013. The dataset links annual business income tax returns to the individual income tax returns and W-2 information reports of all workers and owners.&lt;/p&gt;
&lt;p&gt;The causal identification strategy exploits the six state minimum wage increases that took effect in 2014 (California, Connecticut, Delaware, Michigan, Minnesota, and New Jersey) relative to 24 states that did not change their wage floors at any point from 2012–2018. The empirical workhorse is a panel difference-in-differences event study (Equation 1), augmented by DFL re-weighting (DiNardo et al., 1996) to improve comparability of treatment and control firms on observables. The analysis covers cumulative effects through 2018, by which point the average minimum wage across treatment states had risen 30.6%.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Main Findings&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Employment:&lt;/strong&gt; The average exposed independent firm does not meaningfully reduce employment. The authors estimate an own-wage elasticity of -0.209 (s.e. = 0.0112). Employment adjustments manifest as moderately lower hiring rather than layoffs of existing workers. Reduced hiring is wholly concentrated among teenagers and very part-time jobs paying less than $3,900 annually (with 67% earning less than $1,000 per year).&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Worker earnings:&lt;/strong&gt; Despite the hiring reduction, low-earning workers employed at exposed independent firms experience average earnings gains of approximately $2,000 per year by 2018, relative to comparable workers in untreated states. Young individuals aged 20–26 without a 2013 job earn roughly $4,000 more per year by 2018; teenagers without a 2013 job gain approximately $1,000 per year. Workers in these groups are no less likely — and in some cases slightly more likely — to be employed five years after the minimum wage increase.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Wage bills:&lt;/strong&gt; Average wage bills among surviving treated firms rose 7.03% (s.e. = 0.0153) by 2018. Earnings gains are concentrated among workers earning $15,600–$35,000 annually, with no evidence of reduced earnings for higher-paid workers. The 7% average wage bill increase amounts to only 1.4% of 2013 firm revenues, easing pass-through.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Revenue and profits:&lt;/strong&gt; Revenues of surviving treated firms grew approximately 2.1% more than control firms by 2018. On average, this revenue increase fully offsets the higher wage bill, yielding a small net profit increase of roughly $3,360 (s.e. = $1,123) per owner by 2018, or about 2.7% of mean 2013 owner income.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Firm exit:&lt;/strong&gt; On average across all highly exposed industries, minimum wages increased the five-year exit probability by 0.9 percentage points (s.e. = 0.0029), relative to a baseline raw exit rate of approximately 29%. Exit effects are driven entirely by restaurants: by 2018, restaurants in treated states were 1.85 percentage points (s.e. = 0.0039) more likely to have exited, while the exit response for non-restaurant exposed firms is a precisely estimated zero.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Heterogeneity by productivity within restaurants:&lt;/strong&gt; Exit is concentrated entirely in the bottom productivity quartile (coefficient = 0.0254, s.e. = 0.0079), with no significant effect in the upper three quartiles. Profits among surviving small restaurants rise by $5,941 (s.e. = $1,546) by 2018 relative to 2013. Among small restaurants, the profit gains are larger for firms in the higher productivity quartiles (Q3: +$7,915; Q4: +$9,161). Surviving restaurants also increase non-labor input spending by 2.53% (s.e. = 0.0101), consistent with expanded output following competitor exits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Entrant characteristics:&lt;/strong&gt; Post-reform restaurant entrants in treatment states have higher wage bills (13.8% higher in logs), higher revenues (4.0% higher), higher value-added (8.4% higher), and higher productivity (net income/revenue ratio 2.24 percentage points higher) than entrants in control states, indicating the minimum wage raises the productivity floor for new entrants.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Owner outcomes after exit:&lt;/strong&gt; Owners of small restaurants forced out by the minimum wage are significantly less likely to own an independent business five years later, but earn no less on average in wages plus business income. Policy-induced exiters are significantly less likely to report negative incomes, suggesting substitution away from risky or marginally profitable business ownership.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;&lt;strong&gt;Theoretical Framework&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The authors present a Cournot competition model with heterogeneous firm productivity and fixed production costs. A minimum wage cost shock raises marginal costs, narrowing margins for all firms. Firms whose cost increases exceed the market price increase cannot cover fixed costs and exit. Remaining firms gain higher markups and larger market shares as demand is reallocated from exiting firms. Selection on ex-ante productivity (the least productive firms exit) limits the distortion to market quantity and amplifies profit gains among productive survivors. The model predicts profit increases only in markets with firm exit, which matches the data: profits rise among restaurants (where exit occurs) but not among retailers (where exit is a precisely estimated zero).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Scope Conditions&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Findings pertain to the short-to-medium run (up to five years post-legislation) of phased-in minimum wage increases averaging 30.6% in six U.S. states. The sample covers pass-through (independent) businesses in highly exposed industries. Longer-run effects may differ if entrants adopt production technologies that rely less on low-wage labor or incumbents reconfigure inputs. Border-county retailers appear to be less able to pass through costs than interior firms, suggesting product market competition is a key moderating factor.&lt;/p&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q1: Why do the authors focus on pass-through businesses rather than publicly traded corporations?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pass-throughs (S-corporations, partnerships, and LLCs) comprise 78% of non-sole-proprietorship businesses and 79% of firms with fewer than 20 employees. They represent the majority organizational form for independent businesses in virtually all two-digit NAICS industry groups except utilities and enterprise management. Because minimum wage concerns are disproportionately raised on behalf of small independent businesses, and because most minimum wage workers in restaurants are employed at pass-throughs, studying pass-throughs directly addresses the policy debate. Additionally, pass-through tax returns link business income directly to the individual tax returns of each owner, enabling the authors to separately identify employee versus owner responses.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q2: How do the authors define &amp;ldquo;highly exposed&amp;rdquo; industries and why does this matter for identification?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Highly exposed industries are defined as four-digit NAICS industries where at least 15% of workers earned below the full-time federal minimum wage equivalent ($15,080 per year) in 2013, using tax data to construct a proxy for minimum wage workers. The analysis focuses on these industries because minimum wage workers are extremely concentrated — the vast majority are in Leisure/Hospitality and Retail. Restricting to highly exposed industries allows the authors to estimate average effects within affected markets and conduct heterogeneity analysis across firm characteristics within those markets, including comparing firms with different baseline shares of low-earning workers that nonetheless all face the market-level cost shock.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q3: How do the employment effects decompose into hiring versus retention?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The average firm subject to a higher wage floor does not lay off existing workers (the retention line is flat in event study estimates). By 2018, firms in treated states hire roughly one fewer worker on average than similar firms in control states, entirely through reduced hiring. This reduced hiring is wholly concentrated among teenagers in very part-time jobs: the missing hires consist entirely of workers who would have earned less than $3,900 annually, with 67% earning less than $1,000 per year. Simultaneously, workers already employed at exposed firms are 2 to 4 percentage points more likely to remain with their 2013 employer by 2016, with prime-age low-earning workers exhibiting the largest retention increases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q4: What happens to low-earning workers and young people in individual-level panels?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Low-earners (those earning below $25,000 in each year from 2012–2014) at exposed independent firms experience average earnings gains of approximately $2,000 per year by 2018 relative to similar workers in untreated states, including teenage low-earners. Young individuals aged 20–26 with no job in 2013 experience a relative earnings increase of approximately $4,000 per year by 2018; teenagers without jobs in 2013 gain approximately $1,000 per year. These workers are no less likely — and often slightly more likely — to be employed relative to their counterparts in control states, so the earnings gains are not offset by employment losses at the individual level.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q5: What is the magnitude of the cost shock for firms and how does it compare to revenues?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;By 2018, the average wage bill among surviving firms in treated states was 7.03% (s.e. = 0.0153) higher than comparable firms in control states. This is consistent with a back-of-envelope calculation: low-earning workers account for about 21% of wage bills at these firms, and states raised minimum wages by 30.6% on average (0.21 × 0.306 = 0.064). However, the 7% wage bill increase amounts to only approximately 1.4% of 2013 firm revenues, making cost pass-through relatively modest. Higher minimum wages have no discernible impact on pension contributions but slightly reduce deductions for other benefits including health insurance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q6: How do surviving firms finance the increased wage bill, and what happens to profits?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Surviving firms finance the wage increase primarily through higher revenues. By 2018, revenues of firms in treated states grew approximately 2.1% more than revenues of firms in control states. On average, this revenue increase outpaces the higher wage bill, resulting in a net profit increase of approximately $3,360 (s.e. = $1,123) per owner by 2018, representing about 2.7% of mean 2013 owner income. There is no evidence of redistribution from middle- or high-income workers within firms; wage bill increases are concentrated among workers earning $15,600–$35,000 annually, consistent with minimum wage spillovers to workers slightly above the statutory floor.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q7: Why do restaurants experience exit effects but retailers do not?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The asymmetry stems from the intensity of low-wage labor in production. While low-earning workers account for a similar share of labor costs at restaurants (41.8%) and retailers (38.5%), labor costs overall are more than twice as large at restaurants relative to retailers. Wage bills account for 39% of variable costs and 27% of revenues at restaurants, but only 16% of variable costs and 13% of revenues at retailers. As a result, raising the minimum wage raises variable costs by 5.76% at restaurants. Non-restaurant exposed firms are able to fully pass through their smaller cost shock, yielding flat profits and neither employment nor exit impacts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q8: Why is firm exit concentrated in the lowest productivity quartile of restaurants rather than among the most exposed firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Cournot framework predicts exits among firms with the lowest ex-ante productivity (highest marginal costs), the largest cost shock (highest share of low-wage labor per unit of output), or a combination. Empirically, productivity is the primary determinant: restaurants across all productivity quartiles use similar shares of low-earning workers (40–44% of wage bills for Q1 through Q4). Exit rises significantly only among restaurants in the bottom productivity quartile (coefficient = 0.0254, s.e. = 0.0079), with no significant effects in Q2–Q4. Among the lowest-productivity restaurants, those most dependent on low-earning labor face the largest exit rates.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q9: How do the model&amp;rsquo;s predictions about profit heterogeneity match the data?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The Cournot model predicts profits should rise only in markets with firm exit (via increased margins and market share reallocation to survivors). This is exactly what the data show. Among restaurants, where exit is concentrated in the bottom productivity quartile, profits among surviving small restaurants rise by $5,941 (s.e. = $1,546) by 2018. Among small restaurants specifically, profit gains increase with productivity: Q3 restaurants gain $7,915 (s.e. = $3,326) and Q4 restaurants gain $9,161 (s.e. = $2,127), while Q1 and Q2 gains are statistically indistinguishable from zero. In non-restaurant exposed industries where the exit effect is a precise zero, profits are also flat — exactly as the model predicts.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q10: What happens to the characteristics of new restaurant entrants after the minimum wage increase?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Post-reform restaurant entrants in treatment states are systematically more productive than entrants in control states. They have wage bills 13.8% higher (in logs), revenues 4.0% higher, value-added 8.4% higher, and productivity ratios (net income/revenue) 2.24 percentage points higher than new entrants in control markets. This implies the minimum wage raises the minimum viable productivity threshold for entrant restaurants, consistent with Sorkin (2015)&amp;rsquo;s insight that minimum wages shape the capital and technology choices of entering firms. The restaurant industry thus becomes more productive on average through both the exit of the least productive incumbents and the entry of more productive new firms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q11: How do worker transition patterns reflect the reallocation of output to surviving firms?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Workers at large independent businesses (top revenue quartile) are 3.52 percentage points more likely to remain with their 2013 employer in 2018 and 2.36 percentage points less likely to switch to another large firm. The large firms that retain more of their existing workforce also reduce their hiring of very part-time teenagers the most — in the top revenue quartile, firms shed roughly 4.5 employment relationships on average, comprising higher retention of 4.15 existing workers offset by reduced hiring of 8.67 very part-time teenage workers. Workers originally at smaller exposed firms are more likely to be found working at larger firms five years out, consistent with demand reallocation from exiting and shrinking small firms toward larger, more productive survivors.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q12: What happens to owners of restaurants that exit due to the minimum wage?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Policy-induced exiters of small restaurants are significantly less likely to own an independent business five years later and less likely to receive all earnings from business ownership, relative to owners of restaurants that exited for other reasons in control states. However, their average incomes (wage income plus ordinary business income) are no lower. This income stability is partly explained by the fact that policy-induced exiters are significantly less likely to report negative incomes five years out, suggesting they substitute away from potentially risky or marginally profitable business ownership toward wage employment or other activities. The utility implications are ambiguous: these former owners may have preferred business ownership even if it did not yield higher income.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q13: What is the role of product market competition in mediating pass-through, as evidenced by border-county analysis?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The border county robustness analysis reveals that product market competition is central to pass-through success. Retailers near state borders, where consumers can cross-state-border shop, face more elastic demand and are less able to finance the wage cost shock with new revenues, exhibiting reduced profits and higher exit rates (though estimates are imprecise). Further from the border, where the cost shock is more commonly felt by all potential substitutes (making market demand elasticity rather than firm demand elasticity the relevant parameter), results are very similar to the full-sample aggregate findings. This confirms that the common nature of the minimum wage cost shock — shared by all competing firms in the market — is a key reason firms can pass through costs to consumers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q14: How do the findings address the divide among independent business owners on minimum wage policy?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The heterogeneous outcomes rationalize why surveys consistently find business owners divided. Among restaurants, some owners (those operating the least productive small restaurants) face exit and loss of business ownership, while surviving productive restaurateurs see higher profits of $5,941–$9,161 per year. Among non-restaurant exposed businesses, owners are broadly unaffected in terms of profits and viability. Uncertainty about whether a given firm&amp;rsquo;s demand is elastic enough to bear cost pass-through — given that owners may be more familiar with the elasticity of firm-level demand from prior unilateral price changes, rather than the relevant market-level demand elasticity applying to a common cost shock — may broaden opposition to include even owners who would ultimately benefit.&lt;/p&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Pass-through businesses (independent businesses):&lt;/strong&gt; Privately owned firms organized as S-corporations, partnerships, or LLCs, taxed by passing income through to the individual returns of owners rather than at the entity level. In 2015, these comprised 78% of non-sole-proprietorship U.S. businesses and 46% of employment. The paper uses &amp;ldquo;pass-through&amp;rdquo; and &amp;ldquo;independent business&amp;rdquo; interchangeably as the unit of analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Highly exposed industries:&lt;/strong&gt; Four-digit NAICS industries where at least 15% of workers earned below the annual full-time equivalent of the federal minimum wage ($15,080) in 2013, as measured in the authors&amp;rsquo; administrative tax data. This threshold proxies the concentration of minimum-wage workers across industries and drives the sample selection for firm-level analysis.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Own-wage elasticity of employment:&lt;/strong&gt; The estimated percentage change in employment at a firm associated with a given percentage change in the firm&amp;rsquo;s minimum wage. The authors estimate this as -0.209 (s.e. = 0.0112), reflecting the average effect across all exposed independent businesses, conditional on the firm&amp;rsquo;s industry, size, and local market characteristics.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DFL re-weighting (DiNardo-Fortin-Lemieux):&lt;/strong&gt; A non-parametric reweighting procedure that adjusts the distribution of control-group firms to match the distribution of treatment-group firms on observables (specifically, two-year lagged value-added within three-digit NAICS industries). Used to improve pre-reform comparability of treatment and control firm samples without parametric functional form assumptions.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm productivity (in this paper&amp;rsquo;s sense):&lt;/strong&gt; Measured as the ratio of net profits to revenues (net income/revenue) at the firm level in the base year 2013, used to assign firms to productivity quartiles for heterogeneity analysis. This is a firm-level profitability measure constructed from pass-through tax returns, not a total factor productivity estimate requiring production function estimation.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Firm exit:&lt;/strong&gt; An indicator for a firm that filed a tax return in 2013 but did not file a return in a subsequent year t. The average one-year exit rate for highly exposed independent businesses is 5.2%; the cumulative five-year raw exit rate is approximately 29% across treatment and control states.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Cournot competition with heterogeneous productivity and fixed costs:&lt;/strong&gt; The paper&amp;rsquo;s conceptual framework, in which N firms compete in quantities with asymmetric marginal costs (reflecting heterogeneous productivity), a common output price, and a fixed cost of production. Under this framework, a minimum wage cost shock narrows margins unevenly, induces exit among firms that cannot cover fixed costs, and generates both demand reallocation and market share gains for productive survivors — rationalizing simultaneous exit and profit increases in the same industry.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Common cost shock:&lt;/strong&gt; The property that a minimum wage increase raises production costs for all firms employing low-wage workers in the same market simultaneously. Because all competing firms face higher costs, the relevant pass-through parameter is the elasticity of market demand rather than the (higher) elasticity of individual firm demand, facilitating cost pass-through to consumers and distinguishing minimum wages from unilateral price changes by a single firm.&lt;/p&gt;</description></item><item><title>Why Doesn't the United States Have National Health Insurance?</title><link>https://macropaperwarehouse.com/papers/why-doesnt-the-united-states-have-national-health-insurance/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/why-doesnt-the-united-states-have-national-health-insurance/</guid><description>&lt;p&gt;This paper investigates a critical juncture in the development of national health insurance (NHI) in the United States: the post-World War II period when most peer nations moved to establish comprehensive public coverage while the U.S. did not. The authors examine the causal role of the American Medical Association (AMA), which in 1949 hired Whitaker &amp;amp; Baxter&amp;rsquo;s Campaigns, Inc. — the country&amp;rsquo;s first political public relations firm — to direct a nationwide campaign opposing NHI and promoting private (voluntary) health insurance (PHI).&lt;/p&gt;
&lt;p&gt;The Campaign had two main components. First, a physician outreach component in which AMA members distributed pamphlets to patients warning against &amp;ldquo;socialized medicine&amp;rdquo; and encouraging enrollment in private plans, and acted as liaisons to local civic organizations to solicit resolutions against NHI sent to elected officials (nearly 50 million pieces of material were sent to physicians). Second, a mass newspaper advertising component, in which a standard ad was placed across newspapers nationwide, with an additional $19 million (approximately $240 million in current dollars) in coordinated tie-in advertising from roughly 23,000 corporations and industry associations. The messaging framed NHI as &amp;ldquo;un-American&amp;rdquo; and associated private insurance with &amp;ldquo;freedom&amp;rdquo; and &amp;ldquo;the American way,&amp;rdquo; providing little substantive information about insurance products.&lt;/p&gt;
&lt;p&gt;The authors construct novel measures of Campaign exposure by combining (a) per capita pamphlets distributed by AMA physicians and (b) per capita advertising circulation scaled by local newspaper readership, using archival data from the Whitaker &amp;amp; Baxter Archives (Sacramento), the National Archives (Washington D.C.), digitized AMA Medical Directories, the N.W. Ayer &amp;amp; Son&amp;rsquo;s Newspaper Directory, and newly discovered Blue Shield enrollment data from AMA Council on Medical Service annual reports covering 1946–1954.&lt;/p&gt;
&lt;p&gt;The primary estimation strategy exploits spatial variation in Campaign intensity combined with its timing, using event studies with state and year fixed effects and design controls for income per capita and unionization. The identifying assumption — that Campaign intensity was conditionally as-good-as-randomly assigned — is supported by balance tests showing no pre-Campaign correlation between exposure and enrollment or sociodemographic characteristics (with the exception of Black population share), and by the historical record that the Campaign was organized hastily following Truman&amp;rsquo;s unexpected 1948 electoral victory.&lt;/p&gt;
&lt;p&gt;Main findings: A one standard deviation increase in Campaign exposure explains approximately 20% of the post-Campaign increase in PHI enrollment, corresponding to roughly 14 million additional enrollees — an effect comparable in magnitude to increasing average per capita income by approximately $100 (about 7 percent). On public opinion, a one standard deviation increase in Campaign exposure led to a six percentage point decline in popular support for NHI per Gallup survey wave, a reversal occurring against a backdrop of 69% pre-Campaign approval that was trending upward. For context, this six-point magnitude approximates the entire gap in NHI support between union and non-union households, or one-third the racial gap in support. Campaign intensity also predicts civic organizations passing resolutions favoring PHI, Republican legislators adopting speech semantically similar to Campaign propaganda, and — by 1952 — AMA members being five times more likely to donate to the Eisenhower-Nixon ticket than non-AMA physicians, with donation rates increasing in Campaign intensity.&lt;/p&gt;
&lt;p&gt;Scope conditions: The analysis covers 48 U.S. states from 1946 to 1954, ending at the 1954 IRS tax code change that expanded commercial insurers&amp;rsquo; market share. The enrollment data capture Blue Shield (physician-run) plans specifically; the paper explicitly notes that commercial insurer granular data are unavailable for the main Campaign period. The authors argue that multiple subsequent factors — middle-class acquisition of private coverage reducing demand for a public option, incumbent interests defending the status quo, and the persistent ideological linkage of private insurance with freedom — help explain why NHI was not adopted in subsequent decades, though these persistence mechanisms are outside the paper&amp;rsquo;s direct empirical scope.&lt;/p&gt;
&lt;p&gt;Q: What was the AMA&amp;rsquo;s Campaign, and what prompted it?
A: In response to Harry Truman&amp;rsquo;s unexpected 1948 presidential victory alongside a Democratic Congress — and with a majority of informed voters favoring NHI — the AMA hired Whitaker &amp;amp; Baxter&amp;rsquo;s Campaigns, Inc. to run the National Education Campaign (NEC). The Campaign had two components: physician outreach (pamphlet distribution to patients, liaison to civic organizations) and mass newspaper advertising. The AMA paid Whitaker &amp;amp; Baxter approximately $1.2 million per year in current terms, and coordinated an additional $19 million in 1950 dollars (roughly $240 million today) in tie-in advertising from allied corporations and trade groups.&lt;/p&gt;
&lt;p&gt;Q: How is Campaign exposure measured, and how is it validated as conditionally exogenous?
A: Campaign exposure combines two standardized components: per capita pamphlets distributed by AMA physicians (pamphlet quantity from W&amp;amp;B archives scaled by state AMA membership share) and per capita advertising circulation scaled by local newspaper readership (share of adults with more than five years of schooling). The two components are summed and standardized. Exogeneity is supported by balance tables showing no pre-Campaign correlation between exposure and enrollment or Gallup opinion, by the absence of discontinuous changes in income or unionization at Campaign onset, and by the historical fact that Campaign logistics relied on pre-existing networks assembled hastily in response to Truman&amp;rsquo;s unanticipated victory.&lt;/p&gt;
&lt;p&gt;Q: What is the main effect of the Campaign on private health insurance enrollment?
A: A one standard deviation increase in Campaign exposure is associated with a two percentage point increase in the share enrolled in PHI in the preferred specification (Column 4 of Table 1, which includes income, unionization, state fixed effects, and year fixed effects; coefficient 0.020, se 0.007, significant at 1%). This accounts for approximately 20% of the overall post-Campaign increase in PHI enrollment, corresponding to roughly 14 million new enrollees. The pre-Campaign coefficient is not statistically significant (coefficient 0.004, se 0.005), and the F-test p-value for pre-trends is 0.958.&lt;/p&gt;
&lt;p&gt;Q: What is the effect of the Campaign on public opinion toward NHI?
A: Using Gallup survey data, a one standard deviation increase in Campaign exposure led to an approximately six percentage point decline in individual support for NHI legislation per survey wave, against a pre-Campaign approval level of 69% that was trending upward. The F-test p-value for pre-trends in the Gallup event study is 0.179. This six-point effect is approximately equal to the gap in NHI support between union and non-union households, and approximately one-third the racial gap in support.&lt;/p&gt;
&lt;p&gt;Q: What evidence links the Campaign to civic organizations and the legislative process?
A: The Campaign&amp;rsquo;s archives document all civic organizations &amp;ldquo;on record against compulsory health insurance,&amp;rdquo; meaning they had passed resolutions in favor of PHI. The authors find a positive relationship between Campaign intensity and civic organizations passing such resolutions at the county level. Resolutions sent to elected officials were traced to the Congressional Record and to physical folders in the National Archives; their semantic similarity to AMA-WB propaganda is confirmed. Republican legislators&amp;rsquo; speech in the 81st Congress shows increased similarity to Campaign language in proportion to Campaign intensity in their district or state, while Democrat legislators do not show this pattern. NHI and the AMA experienced spikes in mention frequency in the Congressional Record during this period.&lt;/p&gt;
&lt;p&gt;Q: Did the Campaign affect physician political behavior beyond the clinic?
A: By 1952, when the Republican platform had fully adopted the AMA&amp;rsquo;s position, AMA members were approximately five times more likely to donate to the Eisenhower-Nixon ticket than non-AMA physicians, with donation probability increasing in Campaign intensity. The authors digitized the donor list from the National Professional Committee for Eisenhower (NPCE) — a separate lobbying entity created because the AMA legally could not endorse candidates — and linked approximately 80% of physician donors to the AMA Medical Directory.&lt;/p&gt;
&lt;p&gt;Q: What alternative explanations for PHI growth does the paper address, and how?
A: The standard literature attributes PHI growth to the 1942 Stabilization Act wage freeze (which left benefits unconstrained), collective bargaining rights clarified in the late 1940s, and the 1954 IRS tax exemption for employer-paid premiums. The authors include income per capita and unionization as core design controls and show that their Campaign exposure coefficient is stable across specifications with and without these controls (coefficients of 0.025 and 0.020 in Table 1 Columns 1–2 vs. 3–4, respectively). The analysis stops in 1954 before the tax change, and the authors note that by 1952 roughly 63% of households already had some form of medical expense insurance.&lt;/p&gt;
&lt;p&gt;Q: What is the conceptual mechanism through which the Campaign operated?
A: The authors adapt Sobbrio (2011)&amp;rsquo;s indirect lobbying model. Voters hold uniform priors over whether NHI enactment yields net positive or negative social surplus. The private-sector advocate (AMA-WB) sends messages that shift voters&amp;rsquo; posterior beliefs toward the negative-surplus state and, simultaneously, encourage PHI enrollment, which reduces voters&amp;rsquo; private valuation of a public option. Because citizens were likely unaware of the coordinated tie-in advertising across industries and the financial motivation behind physician messaging, the framing operated through naive belief updating. The public-sector advocate (Truman administration, Committee for the Nation&amp;rsquo;s Health) was vastly outresourced — the CNH raised only $104,000 in 1949 — and faced legal constraints on executive lobbying.&lt;/p&gt;
&lt;p&gt;Q: What advertising tactics specifically characterized the Campaign, and what do they imply about mechanisms?
A: Campaign pamphlets and ads provided little or no substantive information about insurance products (coverage, eligibility, cost) and instead tied health insurance to ideological symbols: &amp;ldquo;freedom,&amp;rdquo; &amp;ldquo;the American way,&amp;rdquo; &amp;ldquo;the voluntary way,&amp;rdquo; and warnings about &amp;ldquo;socialized medicine.&amp;rdquo; Word clouds from Campaign materials confirm &amp;ldquo;America&amp;rdquo; and &amp;ldquo;freedom&amp;rdquo; as dominant terms. The authors connect this to behavioral models of advertising (Mullainathan, Schwartzstein and Shleifer 2008) whereby advertisers create or exploit associations to influence product beliefs. The absence of informational content is consistent with effects operating through ideology and identity rather than rational product evaluation.&lt;/p&gt;
&lt;p&gt;Q: What explains why the U.S. did not adopt NHI in subsequent decades after the immediate Campaign period?
A: The authors offer three mechanisms (discussed outside their main empirical scope): First, as middle-class Americans obtained PHI through employers, demand for a public option diminished — the model formalizes this as reduced private valuation of NHI. Second, incumbents who benefit from the private status quo — Blue Cross Blue Shield, AMA, American Hospital Association, and pharmaceutical companies, which today comprise four of the top ten direct federal lobbyists — actively work to maintain it (Acemoglu, Egorov and Sonin 2021). Third, the Campaign&amp;rsquo;s ideological framing proved durable: ideologically similar rhetoric opposing &amp;ldquo;socialized medicine&amp;rdquo; appeared in campaigns against both Clinton-era and Obama-era reform efforts, and has been linked to increased adverse selection and preventable deaths (Bursztyn et al. 2022; Galvani et al. 2022).&lt;/p&gt;
&lt;p&gt;Q: What are the paper&amp;rsquo;s main contributions to the literature?
A: The paper provides the first causal evidence on the AMA&amp;rsquo;s political role in blocking NHI at the post-WWII juncture, contributing to the economic history of U.S. social insurance development. It contributes to the advertising literature by providing credible estimates of a sustained national campaign combining trusted field agents (physicians) with mass media, and to the lobbying literature by documenting indirect lobbying — persuasion of ordinary citizens — as a distinct and effective tool alongside direct lobbying. It also documents physician behavior outside the clinical setting, showing how rents from supply-side constraints were deployed to shape the market for medical services.&lt;/p&gt;
&lt;p&gt;Indirect lobbying: In the paper&amp;rsquo;s usage, persuasion of ordinary citizens via campaigns — as distinct from direct lobbying of policymakers — used to shift median voter beliefs and behavior to achieve legislative goals. Whitaker &amp;amp; Baxter are credited with creating this field through their work at Campaigns, Inc.&lt;/p&gt;
&lt;p&gt;Campaign exposure: The paper&amp;rsquo;s composite treatment variable, constructed as the sum of two standardized components: per capita pamphlets distributed by AMA physicians (physician outreach) and per capita advertising circulation scaled by local newspaper readership (mass communications), then re-standardized to mean 0, standard deviation 1.&lt;/p&gt;
&lt;p&gt;Tie-in advertising: Coordinated newspaper advertisements by third-party corporations and trade associations placed simultaneously with the main AMA-WB Campaign ad, sharing the &amp;ldquo;Voluntary Way is the American Way&amp;rdquo; slogan. Approximately 60% of newspapers with a main Campaign ad also had tie-in ads, averaging three per issue; third-party spending totaled approximately $19 million in 1950 dollars (~$240 million current).&lt;/p&gt;
&lt;p&gt;Voluntary (private) health insurance: In the paper&amp;rsquo;s framing, the AMA-promoted alternative to NHI — prepaid medical service plans run by state medical societies (Blue Shield) or nonprofit hospitals (Blue Cross) — deliberately labeled &amp;ldquo;voluntary&amp;rdquo; to contrast with &amp;ldquo;compulsory&amp;rdquo; NHI, embedding the product within an ideological frame of free choice.&lt;/p&gt;
&lt;p&gt;National Education Campaign (NEC): The AMA&amp;rsquo;s official name for the anti-NHI campaign directed by Whitaker &amp;amp; Baxter starting in 1949, characterized as &amp;ldquo;educational&amp;rdquo; to provide legal cover; the name itself illustrates the indirect lobbying strategy of framing political advocacy as public information.&lt;/p&gt;
&lt;p&gt;Source text origin / abstract-only block: Not a paper-defined concept; excluded.&lt;/p&gt;
&lt;p&gt;Naive voter updating: The paper&amp;rsquo;s modeling assumption (drawn from Sobbrio 2011) that voters held uniform priors on health insurance policy outcomes and updated beliefs via Bayesian message receipt, without awareness of coordination across industries or the financial motivation of physician messengers — making the ideological framing effective.&lt;/p&gt;
&lt;p&gt;Physician field agents: In the Campaign&amp;rsquo;s design, AMA member physicians served as credible, trusted intermediaries who distributed pamphlets to patients and solicited civic organization resolutions, leveraging their social status to amplify the Campaign&amp;rsquo;s reach into communities where mass advertising alone would be insufficient.&lt;/p&gt;</description></item><item><title>Why Is Intermediating Houses So Difficult? Evidence from iBuyers</title><link>https://macropaperwarehouse.com/papers/why-is-intermediating-houses-so-difficult-evidence-from-ibuyers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/why-is-intermediating-houses-so-difficult-evidence-from-ibuyers/</guid><description>&lt;p&gt;This paper examines frictions in dealer intermediation in durable consumer goods markets, using iBuyers — technology-driven real estate companies such as Opendoor and Offerpad — as a lens. The central research question is why dealer intermediation, which provides immediate liquidity by purchasing assets onto a balance sheet and reselling, is so limited in the U.S. housing market (valued at $50 trillion and representing roughly 70% of the median household&amp;rsquo;s net worth) relative to other durable goods markets such as automobiles.&lt;/p&gt;
&lt;p&gt;The authors use CoreLogic deed transaction data and MLS listing data from five markets with substantial iBuyer presence (Phoenix, Las Vegas, Dallas, Orlando, and Gwinnett County, Georgia) over 2013–2018, covering arm&amp;rsquo;s-length, non-foreclosure single-family home and condominium transactions. They supplement this with Redfin ZIP-level data on listing speed and American Community Survey demographics. iBuyers are identified as Opendoor, Offerpad, Knock, Zillow, and Redfin.&lt;/p&gt;
&lt;p&gt;The empirical analysis documents that iBuyers grew from roughly 1% market share in Phoenix in 2015 to about 6% by 2018, acting as balance-sheet intermediaries who hold properties for a median of 105 days. iBuyers purchase homes at a 3.1 percentage point (pp) discount relative to comparable homes sold in the same ZIP-quarter, and sell at a 2.2 pp premium relative to other institutional sellers, for a combined gross spread of approximately 5.3 pp (reported in the abstract and body as ~5%). Sellers to iBuyers show a 6.8 pp higher rate of market exit post-sale and a 4.0 pp higher probability of purchasing before selling, consistent with demand for immediacy from impatient, relocating households.&lt;/p&gt;
&lt;p&gt;Two key frictions constrain intermediation. First, adverse selection: iBuyers rely on algorithmic valuation models (AVMs) that explain over 80% of price variation in iBuyer transactions versus only 68% in non-iBuyer transactions, leaving a residual of soft information (odor, neighbor quality) that sellers know but algorithms cannot capture. iBuyer presence is over three times greater in the lowest pricing-uncertainty tercile versus the highest, and a one standard deviation increase in pricing uncertainty reduces iBuyer presence by 1.23 pp within a ZIP and reduces gross spread per transaction by 1.5 pp. Second, underlying illiquidity: iBuyers are almost entirely absent in market segments where the probability of sale within three months (PSALE) falls below 50%, despite strong seller demand.&lt;/p&gt;
&lt;p&gt;To quantify these frictions, the authors build and calibrate a continuous-time directed search equilibrium model with a dealer intermediary subject to adverse selection. Six parameters are calibrated to match empirical moments: iBuyer market share (5%), purchase discount (3.1 pp), sale premium (2.2 pp), iBuyer concentration in the most versus least liquid PSALE quartiles, impatient seller fraction, and median iBuyer holding time. The calibrated adverse selection parameter (α = 0.35) means the intermediary correctly identifies 35% of low-quality homes as such; the impatient seller share (μ = 0.18) means 18% of unmatched sellers are highly impatient; and the vacancy depreciation rate (d = 0.02) means 2% per period for unoccupied homes. External validation via a difference-in-differences comparison of Phoenix against other markets yields model-consistent predictions of a 0.5 pp reduction in time on market and a 0.8 pp increase in house prices.&lt;/p&gt;
&lt;p&gt;Counterfactual experiments reveal that introducing a 30-day acquisition delay (rather than near-instantaneous) reduces iBuyer market share from 5% to below 2%; eliminating the signal entirely (α = 0) drops market share to just above 1%; and enabling iBuyers to rent vacant properties during the holding period could raise market share above 7.5 pp. A 50% reduction in PSALE reduces iBuyer market share roughly proportionally.&lt;/p&gt;
&lt;p&gt;The calibrated model is then applied to other durable goods markets by varying informational asymmetry, liquidity, and depreciation parameters. Cars — more homogeneous (year/make/model/mileage fully characterizes value), mobile (transportable across markets), and depreciating primarily through use — are predicted to support dealer intermediary market shares of 40–55%, consistent with observed U.S. car dealer market share of ~50%. Reducing the depreciation rate from the housing level (d = 0.02) to a car-like level (d = 0.005) alone increases intermediary market share by about 5 pp. Houses — heterogeneous, immobile, and depreciating through time rather than use — are predicted to support near-zero intermediation under pre-iBuyer technology. The authors also explain COVID-19 iBuyer suspensions (reduced market liquidity made resale untenable) and Zillow&amp;rsquo;s November 2021 exit (very liquid markets eroded the iBuyer speed premium, worsening adverse selection while rapid price appreciation degraded AVM accuracy).&lt;/p&gt;
&lt;p&gt;Q: What discount do iBuyers pay when purchasing homes, and what premium do they earn when selling?
A: iBuyers purchase homes at a 3.1 pp discount relative to comparable homes sold in the same ZIP code and quarter, with a t-statistic of 8.55. They sell at a 2.2 pp premium relative to other institutional sellers. The combined gross spread is approximately 5.3 pp (referred to throughout the paper as roughly 5%).&lt;/p&gt;
&lt;p&gt;Q: How large is the iBuyer market share, and in which markets did they operate?
A: iBuyer market share grew from approximately 1% in Phoenix in 2015 to roughly 6% by 2018. In Gwinnett County, Las Vegas, and Dallas/Orlando, shares reached approximately 4%, 4%, and 2% respectively by 2018. The analysis covers five markets: Phoenix, Las Vegas, Dallas, Orlando, and Gwinnett County (suburban Atlanta).&lt;/p&gt;
&lt;p&gt;Q: What is the evidence that iBuyer sellers are impatient rather than simply lower-quality-house owners?
A: Sellers to iBuyers exhibit a 6.8 pp higher rate of market exit (defined as purchasing a home outside the county or making no subsequent real estate purchase within 12 months), consistent with relocation-driven impatience. They also have a 4.0 pp higher probability of purchasing a new home before completing the sale of their current home, which is enabled by the iBuyer transaction&amp;rsquo;s speed facilitating mortgage approval conditional on the existing property&amp;rsquo;s sale.&lt;/p&gt;
&lt;p&gt;Q: How do the authors measure adverse selection risk and what is its relationship to iBuyer presence?
A: Adverse selection is proxied by the squared residual from a hedonic pricing regression — the variation in transaction prices unexplained by observable characteristics — computed at the ZIP-year level for non-iBuyer transactions. iBuyer presence is over three times greater in the lowest pricing-uncertainty tercile than in the highest. A one standard deviation increase in pricing uncertainty reduces iBuyer presence by 1.23 pp within a ZIP (controlling for ZIP fixed effects, local prices, house age, and square footage), and reduces gross spread per transaction by 1.5 pp.&lt;/p&gt;
&lt;p&gt;Q: What role does underlying asset liquidity play in constraining iBuyer intermediation?
A: iBuyers concentrate almost entirely in market segments where the ex ante probability of selling within three months (PSALE) exceeds 50%, and are essentially absent where PSALE falls below 50%. This holds even though sellers in low-PSALE segments have strong demand for immediacy, implying that illiquidity raises intermediation costs above the demand-side willingness to pay a discount.&lt;/p&gt;
&lt;p&gt;Q: What does the model&amp;rsquo;s calibration reveal about the share of impatient sellers and the accuracy of iBuyer signals?
A: The calibrated adverse selection parameter α = 0.35 means the intermediary correctly identifies 35% of low-quality homes as low quality (the signal is moderately but imperfectly informative). The calibrated impatient seller share μ = 0.18 means approximately 18% of unmatched sellers are highly impatient and willing to accept a significant price discount for immediacy. The vacancy depreciation rate d = 0.02 implies a 2% per period cost for unoccupied properties.&lt;/p&gt;
&lt;p&gt;Q: How important is transaction speed to the iBuyer model?
A: Introducing a 30-day acquisition delay (rather than near-instantaneous purchase) reduces iBuyer market share from 5% to below 2% — a reduction of more than 60%. The model mechanism is that the primary iBuyer customers are highly impatient sellers who place extreme value on immediate transactions; even a moderate delay substantially reduces their willingness to accept a price discount.&lt;/p&gt;
&lt;p&gt;Q: What happens if iBuyers lose their ability to distinguish between high- and low-quality homes?
A: Setting the signal accuracy to zero (α = 0, the &amp;ldquo;naive intermediary&amp;rdquo; case) causes iBuyer market share to fall from 5% to just above 1%. Without any quality signal, severe adverse selection forces the intermediary to offer substantially lower prices to break even, which in turn reduces the number of sellers willing to transact.&lt;/p&gt;
&lt;p&gt;Q: How much would enabling iBuyers to rent vacant properties during the holding period affect market share?
A: The rental-enabled iBuyer counterfactual shows that market share could increase above 7.5 pp from the baseline 5%, because rental income would allow iBuyers to offer higher purchase prices while offsetting carrying costs. This suggests that rental infrastructure or policy changes permitting temporary rentals would substantially expand the scope of dealer intermediation in housing.&lt;/p&gt;
&lt;p&gt;Q: How does the model validate itself externally?
A: The authors use a difference-in-differences design comparing Phoenix (earlier and larger iBuyer entry) to the other four markets. The model predicts iBuyer entry should reduce average time on market and increase house prices; the DiD results show a 0.5 pp reduction in time on market and a 0.8 pp increase in house prices in Phoenix relative to comparison markets post-entry, consistent with model predictions.&lt;/p&gt;
&lt;p&gt;Q: Why did iBuyers suspend operations during the COVID-19 pandemic despite having a contactless technological advantage?
A: The model explains the suspension through the liquidity channel: iBuyers&amp;rsquo; value proposition depends on quickly reselling acquired properties, not merely on contactless buying. When market liquidity collapsed during lockdowns (transaction volumes fell sharply), iBuyers could not resell properties quickly, making intermediation unprofitable regardless of their purchasing-side technological advantage. As liquidity recovered, iBuyers resumed operations.&lt;/p&gt;
&lt;p&gt;Q: What does the model say about Zillow&amp;rsquo;s exit from iBuying in November 2021?
A: In very liquid markets, the iBuyer speed advantage shrinks because homeowners can sell quickly in the traditional market anyway, reducing the discount sellers accept when selling to an iBuyer. With a smaller discount, adverse selection worsens because only sellers with unfavorable private information (knowing their house has problems the algorithm overvalued) choose the iBuyer route. The pandemic-era housing market also featured rapid price appreciation that degraded AVM accuracy trained on historical data, compounding adverse selection. Zillow reported having significantly overpaid for homes, consistent with this mechanism.&lt;/p&gt;
&lt;p&gt;Q: Why is dealer intermediation approximately 50% in car markets but near-zero historically in housing?
A: The model, applied to car-market parameters, predicts 40–55% dealer intermediation, consistent with observed U.S. car market shares. Three structural differences explain the gap: (i) cars are more homogeneous (year/make/model/mileage sufficiently characterizes value), reducing adverse selection; (ii) cars are mobile and can be transported across markets, increasing effective liquidity; and (iii) cars depreciate primarily through use, so holding a car on a dealer lot incurs lower value loss than leaving a house vacant. Reducing the depreciation rate from the housing calibration (d = 0.02) to a car-like level (d = 0.005) alone raises predicted intermediary market share by about 5 pp.&lt;/p&gt;
&lt;p&gt;Q: Does subjective value dispersion (heterogeneity in buyer preferences) play a large role in limiting intermediation?
A: While subjective value dispersion plays a significant role in shaping search market equilibrium (affecting match quality and the gains from household-to-household search), the model finds its effect on the overall level of intermediation is comparatively less pronounced than informational asymmetry, market liquidity, or the opportunity cost of vacancy.&lt;/p&gt;
&lt;p&gt;Q: What evidence supports the claim that iBuyers use algorithmic pricing?
A: Observable property characteristics and ZIP-quarter fixed effects explain over 80% of price variation in iBuyer transactions, compared to only 68% in non-iBuyer transactions. The higher R-squared for iBuyer transactions is consistent with iBuyers relying on measurable, formalizable characteristics rather than soft information (such as odors or neighbor property conditions) that traditional buyers gather through physical visits.&lt;/p&gt;
&lt;p&gt;Q: What are the structural limits on iBuyer expansion even with improved technology?
A: Even with enhanced pricing technology (lower α), the scope for dealer intermediation remains narrow because strong incentives persist for iBuyers to avoid markets where algorithmic valuation is difficult, such as older and less homogeneous housing stock. The fundamental barriers — heterogeneity, immobility, and high vacancy opportunity cost — cannot be overcome by technology alone, meaning iBuyers are unlikely to reach the ~50% market share seen in automobile dealer markets.&lt;/p&gt;
&lt;p&gt;iBuyers: Technology-driven real estate companies (principally Opendoor and Offerpad) that use automated valuation models and online platforms to make near-instantaneous cash offers on homes, functioning as dealer intermediaries who purchase properties onto their balance sheet and resell after a short holding period, thereby providing immediate liquidity to sellers who would otherwise wait 90+ days in the traditional listing process.&lt;/p&gt;
&lt;p&gt;Dealer (Balance Sheet) Intermediation: A form of market-making in which an intermediary purchases an asset outright and holds it on its own balance sheet while finding a subsequent buyer, as distinct from matchmaking intermediaries (brokers) who connect buyers and sellers without taking ownership. The intermediary earns a gross spread between purchase and sale prices.&lt;/p&gt;
&lt;p&gt;Adverse Selection (in iBuyer context): The problem arising because sellers possess soft private information about their property (odors, hidden defects, neighbor quality) that algorithmic valuation models cannot capture, while traditional buyers can acquire this information through physical visits. Because iBuyers price quickly without visits, they disproportionately attract sellers of unobservably lower-quality homes, as measured in the paper by the calibrated parameter α = 0.35 (the fraction of low-quality homes the intermediary correctly identifies).&lt;/p&gt;
&lt;p&gt;Algorithmic Valuation Model (AVM): The pricing technology used by iBuyers to value homes near-instantaneously using observable property characteristics. The paper measures AVM performance by the R-squared of a hedonic regression: over 80% for iBuyer transactions versus 68% for non-iBuyer transactions, with the residual representing information the algorithm misses and traditional buyers discover through visits.&lt;/p&gt;
&lt;p&gt;PSALE (Probability of Sale within 3 Months): An ex ante measure of a property&amp;rsquo;s underlying liquidity, estimated from a probit model on non-iBuyer listings, capturing the probability that a given home sells within three months of listing. The paper uses PSALE as the key liquidity variable; iBuyers are almost entirely absent where PSALE falls below 50%.&lt;/p&gt;
&lt;p&gt;Occupancy Cost: The value loss incurred when a house is held vacant on an intermediary&amp;rsquo;s balance sheet — encompassing both foregone housing service flows (which continue to benefit occupants under traditional listing but are lost under iBuyer ownership) and ongoing maintenance and depreciation costs (calibrated at d = 0.02 per period). This cost distinguishes housing from goods like cars that depreciate primarily through use rather than time.&lt;/p&gt;
&lt;p&gt;Gross Spread: The difference between the price at which an iBuyer sells a property and the price at which it purchased that property, expressed as a percentage of the acquisition price. The paper documents a gross spread of approximately 5% (combining the 3.1 pp purchase discount and the 2.2 pp sale premium), which is persistently positive over the sample period.&lt;/p&gt;</description></item><item><title>Why Is Workplace Sexual Harassment Underreported? The Value of Outside Options amid the Threat of Retaliation</title><link>https://macropaperwarehouse.com/papers/why-is-workplace-sexual-harassment-underreported-the-value-of-outside-options-amid-the-threat-of-retaliation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://macropaperwarehouse.com/papers/why-is-workplace-sexual-harassment-underreported-the-value-of-outside-options-amid-the-threat-of-retaliation/</guid><description>&lt;h2 id="layer-1--summary"&gt;Layer 1 — Summary&lt;/h2&gt;
&lt;h3 id="research-question-and-argument"&gt;Research question and argument&lt;/h3&gt;
&lt;p&gt;Dahl and Knepper address the long-standing puzzle of why workplace sexual harassment is chronically underreported despite high estimated prevalence. Survey evidence indicates that no fewer than 1 in 28 U.S. workers report annual victimization, yet only 1 in 11,000 workers files a formal charge with the Equal Employment Opportunity Commission (EEOC). Even following the #MeToo movement, formal charges rose only about 10%, leaving an enormous gap unexplained.&lt;/p&gt;
&lt;p&gt;The paper&amp;rsquo;s central hypothesis is that employers coerce victims into silence through the credible threat of retaliatory firing. The key mechanism: because reporting constitutes a &amp;ldquo;protected activity&amp;rdquo; triggering employer notification, workers who fear losing their jobs will suppress claims. This threat is most binding when a worker&amp;rsquo;s outside options are weak — when it is hard to find a new job or when unemployment insurance (UI) benefits are thin. The paper tests this hypothesis by asking whether external shocks that reduce the value of outside options increase the threshold of harassment severity above which workers are willing to report.&lt;/p&gt;
&lt;h3 id="measurement-strategy"&gt;Measurement strategy&lt;/h3&gt;
&lt;p&gt;Measuring underreporting directly is impossible by definition. The authors&amp;rsquo; key methodological insight is to use the &lt;em&gt;selectivity&lt;/em&gt; of filed charges as an indirect proxy. Under mild assumptions, if workers become more selective about which incidents they report, the average quality of filed charges must rise. The authors measure quality using the EEOC&amp;rsquo;s own merit determination: a charge is deemed meritorious if the employer settles, the claimant withdraws upon receipt of benefits, or the EEOC finds reasonable cause after investigation. The merit rate thus serves as an observable proxy for the (unobservable) degree of underreporting.&lt;/p&gt;
&lt;p&gt;Baseline descriptive evidence supports the mechanism&amp;rsquo;s relevance: across 2000–2015, sexual harassment charges were nearly 50% more likely to be meritorious than non-harassment charges (27.0% vs. 18.6%), and more than twice as likely to involve employer retaliation (63.4% vs. 30.7%). The proportion of EEOC sexual harassment cases involving retaliation rose from 52% in 2000 to 72% in 2015, a period over which the annual volume of filed charges fell by 37%.&lt;/p&gt;
&lt;h3 id="analysis-1--labor-market-conditions-20002015"&gt;Analysis 1 — Labor market conditions (2000–2015)&lt;/h3&gt;
&lt;p&gt;The first empirical design exploits monthly variation in state-industry unemployment rates over 2000–2015 using EEOC microdata on individual charges. The regression controls for industry, state, and time fixed effects, isolating within-state-industry variation in unemployment. The identifying assumption is that a worker&amp;rsquo;s willingness to file depends only on her outside options and the severity of harassment she experiences, conditional on fixed effects.&lt;/p&gt;
&lt;p&gt;The results indicate that each one percentage point increase in a state-industry&amp;rsquo;s monthly unemployment rate is associated with a 0.5–0.7% increase in the probability that a filed charge is deemed meritorious by the EEOC. This is consistent with the hypothesis that workers become more reluctant to report as outside labor market options weaken.&lt;/p&gt;
&lt;p&gt;Heterogeneity analysis using linked EEO-1 establishment data strengthens the interpretation. The effect is amplified in industries that employ a larger fraction of men and in establishments where male managers account for a higher share of supervisory roles. Oﬀending establishments in the sample have, on average, 2.8 percentage points more male employees and 5 percentage points more male managers than non-offending establishments. The selectivity-unemployment gradient is larger in these male-dominated environments, consistent with a role for gendered power disparities in enabling employer retaliation.&lt;/p&gt;
&lt;h3 id="analysis-2--north-carolina-ui-reform-quasi-experiment"&gt;Analysis 2 — North Carolina UI reform (quasi-experiment)&lt;/h3&gt;
&lt;p&gt;The second design exploits North Carolina&amp;rsquo;s 2013 UI reform as a plausibly exogenous reduction in the value of outside options. In response to the near-insolvency of its UI trust fund following the Great Recession, North Carolina simultaneously reduced maximum weekly benefits by approximately 35% (from $535 to $350 per week) and cut maximum benefit duration from 26 to 20 weeks. Together, these changes reduced the maximum total regular UI benefit available to North Carolinians by approximately 50%, from roughly $14,000 to $7,000. These reforms also violated the Congressional non-reduction rule, making individuals ineligible for an additional 47 weeks of federal Emergency Unemployment Compensation benefits, further amplifying the effective cut. North Carolina was the only state to simultaneously reduce both the level and duration of benefits.&lt;/p&gt;
&lt;p&gt;The authors implement a difference-in-differences design comparing North Carolina to other Southern states that did not change their UI programs, controlling for state and month-year fixed effects. Pre-reform parallel trends are documented via event study. Administrative UI recipiency data show that the short-term UI recipiency rate in North Carolina fell from 33% to 10% — a 59% decline relative to control states — within roughly two years of the reform.&lt;/p&gt;
&lt;p&gt;The main finding is that the selectivity of sexual harassment charges filed in North Carolina increased by approximately 7 percentage points following the reform, representing more than a 30% increase relative to control states. This is consistent with the hypothesis that reduced UI generosity raises the cost of a retaliatory firing, causing workers to suppress all but the most severe harassment incidents.&lt;/p&gt;
&lt;p&gt;The authors note that North Carolina also reduced corporate and personal income taxes shortly after the UI reform. Because tax cuts should increase both labor demand and labor supply (insofar as substitution effects dominate income effects), this would tend to reduce the reporting threshold, leading them to interpret the 30%+ estimate as a lower bound on the causal effect of the UI reform on selectivity.&lt;/p&gt;
&lt;h3 id="formal-model"&gt;Formal model&lt;/h3&gt;
&lt;p&gt;The paper presents a threshold model of reporting behavior adapted from Boone and Van Ours (2006). Workers choose a reporting threshold: the minimum harassment severity above which they will file a charge. The threshold rises when the value of outside options falls, either because the probability of finding a new job declines (recession) or because unemployment benefits shrink. The model predicts that the merit rate of filed charges will rise as outside options weaken. The model explicitly does not predict the volume of charges, because firm behavior — which may adjust endogenously to higher reporting thresholds — is not modeled.&lt;/p&gt;
&lt;h3 id="scope-conditions"&gt;Scope conditions&lt;/h3&gt;
&lt;p&gt;All findings concern formal EEOC charges filed in the United States between 2000 and 2015 (analysis 1) and through the post-2013 reform period (analysis 2). The EEOC definition of illegal harassment requires severity sufficient to create a &amp;ldquo;hostile or offensive work environment&amp;rdquo; or an adverse employment action. The paper&amp;rsquo;s merit measure captures harassment that exceeded this legal threshold; non-meritorious charges may still involve some level of misconduct. The sample for establishment-level heterogeneity analyses covers private firms with 100 or more employees (EEO-1 filers), approximately 40% of U.S. employees. The mechanism specifically concerns retaliation-driven suppression of &lt;em&gt;formal&lt;/em&gt; reporting; effects on informal or anonymous reporting cannot be assessed.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="layer-2--qa"&gt;Layer 2 — Q&amp;amp;A&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the core mechanism the paper proposes to explain underreporting?&lt;/strong&gt;
A: Employers threaten workers with retaliatory firing for engaging in protected activity (filing an EEOC charge). Because the EEOC notifies the named employer within 10 days of receiving a charge, worker anonymity is rarely preserved. When a worker&amp;rsquo;s outside options are weak — because unemployment is high or UI benefits are thin — the expected cost of a retaliatory firing is higher, raising the severity threshold above which a victim is willing to report. Workers therefore &amp;ldquo;tough it out&amp;rdquo; rather than risk their current job.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: How does the paper measure something that is, by definition, not reported?&lt;/strong&gt;
A: By using the quality of filed charges as a proxy for the degree of underreporting. Under the threshold model, if workers only report when harassment exceeds a higher bar, the average quality of what does get filed must rise. The EEOC&amp;rsquo;s own merit determination (settlement, withdrawal with benefits, or reasonable-cause ruling) provides an objective, externally-assessed quality measure. An increase in the merit rate signals that the population of filed charges has become more selected — that is, that the unreported fraction has grown.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the 0.5–0.7% figure mean, and what is its interpretation?&lt;/strong&gt;
A: Each one percentage point increase in a state-industry&amp;rsquo;s monthly unemployment rate is associated with a 0.5–0.7 percentage point increase in the probability that a filed sexual harassment charge receives a merit designation from the EEOC. This is interpreted as evidence that workers become more selective — filing only more severe cases — as outside options weaken, consistent with higher underreporting at lower harassment thresholds.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why did the number of EEOC sexual harassment charges fall by 37% between 2000 and 2015, even as retaliation rates rose?&lt;/strong&gt;
A: The paper offers the interpretation that firms have become more effective at credibly threatening retaliation to suppress reporting. The 37% volume decline does not imply harassment has diminished; it may reflect a rising fraction of victims staying silent. The authors note the model does not make a prediction about volume because firm behavior is not modeled — volume depends on both worker reporting thresholds and employer conduct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Why is North Carolina&amp;rsquo;s UI reform particularly well-suited as a natural experiment?&lt;/strong&gt;
A: Four features make it attractive. First, the reform was motivated by trust fund insolvency rather than local labor market conditions, making it more plausibly exogenous to harassment reporting trends. Second, it was implemented during a period of historically high unemployment, when the social safety net was unusually relevant to workers considering risky actions. Third, the cuts affected both the intensive margin (benefit level, down ~35%) and the extensive margin (duration, from 26 to 20 weeks; added eligibility restrictions), with total maximum benefits cut by approximately 50%. Extensive-margin cuts are likely particularly salient for workers worried about a retaliatory firing. Fourth, the cuts to regular UI were permanent and primary, rather than affecting supplemental federal programs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What role does industry and establishment gender composition play?&lt;/strong&gt;
A: The underreporting effect — proxied by the merit-unemployment gradient — is amplified in industries with a larger fraction of male coworkers and in establishments with a higher fraction of male managers. Establishments named in sexual harassment charges have, on average, 2.8 percentage points more male employees and 5 percentage points more male managers than non-respondent establishments. The male-manager underreporting gradient is further amplified by higher unemployment, suggesting gendered power disparities interact with labor market conditions to suppress reporting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: Does the paper make predictions about the volume of charges, not just their quality?&lt;/strong&gt;
A: No. The threshold model explicitly does not model firm behavior and makes no prediction about charge volume. Whether volume rises or falls following a labor demand shock is theoretically ambiguous: firms may respond to higher reporting thresholds by escalating harassment (increasing both incidence and severity), or may not respond at all. The identifying assumption requires only that a worker&amp;rsquo;s willingness to file depends on her outside options and the severity of harassment she experiences — not on firm behavior.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the &amp;ldquo;value of a statistical harassment&amp;rdquo; (VSH) figure, and how does it relate to the paper&amp;rsquo;s motivation?&lt;/strong&gt;
A: Hersch (2018) estimates the VSH for serious cases at approximately $7.6 million, roughly comparable to the value of a statistical life (VSL). Dahl and Knepper cite this figure to underscore the magnitude of the underreporting problem: with an estimated 5 million workers victimized annually, the social costs of suppressed reporting are substantial. The comparison to VSL motivates why closing the reporting gap matters for welfare, not just legal compliance.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What is the ex-ante moral hazard interpretation of the UI results?&lt;/strong&gt;
A: Most UI research focuses on ex-post effects — how benefit generosity affects job search behavior for workers who have already lost their jobs. Dahl and Knepper document an ex-ante moral hazard effect: UI generosity affects the behavior of currently employed workers by changing the expected cost of actions (reporting harassment) that might trigger job loss. Lower UI generosity raises the effective cost of a retaliatory firing, discouraging reporting. This is analogous to, but in the opposite direction from, Lusher et al. (2020), who find that UI expansions reduced productivity among currently employed workers.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Q: What does the parallel-trends evidence show for the NC difference-in-differences?&lt;/strong&gt;
A: The paper presents an event study documenting parallel pre-reform trends in the merit rate between North Carolina and control states. The control group is other Southern states that did not change their UI programs during the sample period, excluding AR, FL, GA, and SC (which made changes) and the West South Central division (which exhibited differential pre-trends). The UI recipiency rate tracks closely between NC and control states prior to July 2013, then diverges sharply thereafter, dropping from 33% to 10% in North Carolina within two years — a 59% decline relative to controls.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="key-concepts"&gt;Key Concepts&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Merit determination (EEOC):&lt;/strong&gt; The EEOC assigns a merit designation to a sexual harassment charge if the named employer settles with the employee, the claimant withdraws the charge upon receipt of benefits, or the EEOC itself determines after investigation that there is &amp;ldquo;reasonable cause&amp;rdquo; to believe harassment occurred. As used in this paper, merit designations capture cases where harassment exceeded the legal threshold of a &amp;ldquo;hostile or offensive work environment&amp;rdquo; or produced an adverse employment decision — not all cases involving some level of misconduct.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Selectivity of charges:&lt;/strong&gt; The fraction of filed EEOC sexual harassment charges that receive a merit designation. In the paper&amp;rsquo;s framework, higher selectivity (a higher merit rate) signals that workers are filing only more severe cases — i.e., that underreporting of less severe cases has increased. Selectivity is used as an observable proxy for the (unobservable) degree of underreporting.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Reporting threshold (ᾱ):&lt;/strong&gt; In the paper&amp;rsquo;s threshold model, the minimum level of harassment severity above which a worker will file an EEOC charge. The threshold is determined by the equality between the expected gains from reporting (probability of success times compensation plus elimination of harassment) and the expected costs (probability of retaliation times the gap between current wage and unemployment value). The threshold rises when outside options weaken — either through lower job-finding probabilities or reduced UI benefits.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Outside options:&lt;/strong&gt; In this paper, the expected value to a worker of becoming unemployed: a weighted average of the wage at a new job (weighted by job-finding probability) and unemployment benefits (weighted by the probability of not finding a job). Outside options determine the cost a worker bears if retaliatory firing follows an EEOC charge. The paper&amp;rsquo;s two empirical analyses correspond to two separate shocks to outside options: aggregate labor demand (unemployment rate) and institutional safety net generosity (UI benefits).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Retaliation:&lt;/strong&gt; Defined by the EEOC as punishment for engaging in a protected activity, such as filing a charge. Retaliation arose in 63.4% of all EEOC sexual harassment charges filed between 2000 and 2015 — more than double the rate for non-harassment charges — and rose from 52% of harassment cases in 2000 to 72% in 2015. In the paper&amp;rsquo;s model, the probability of a retaliatory firing is denoted θ, and is treated as fixed (not a function of harassment severity for tractability).&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Ex-ante moral hazard (UI):&lt;/strong&gt; The effect of UI benefit generosity on the behavior of currently employed workers, rather than on those already unemployed. In this paper&amp;rsquo;s context, higher UI generosity reduces the cost of a potential retaliatory firing for currently employed workers, making them more willing to report harassment. The North Carolina UI reform provides evidence of this ex-ante channel: when benefits were cut, the selectivity of harassment charges rose, consistent with workers becoming less willing to risk their jobs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;EEO-1 data:&lt;/strong&gt; A mandatory annual survey of private establishments in the United States with 100 or more employees, covering approximately 40% of all U.S. employees. Collected by the EEOC, these data report the gender, race, and occupational distribution of workers within each establishment. In this paper, the EEO-1 files are linked to EEOC charge microdata to analyze how the gender composition of co-workers and managers moderates both the incidence of reported harassment and the degree of underreporting.&lt;/p&gt;
&lt;hr&gt;
&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Summary based on IZA Discussion Paper 14740. AI-assisted, human review pending.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;</description></item></channel></rss>